If you like this, please donate:
Hybrid HDD + SSD RAID1
What is hybrid RAID1?In general, hybrid RAID1 is RAID1 that mirrors data on two different storage technologies. Here we are talking about a HDD and an SSD. (Or more if you want more than 2-way RAID1. Why would you want, e.g. 3-way RAID1? Simple: If one disk fails, you still have redundancy, same reason as for using RAID6. And that one disk will fail sooner or later.)
Why do it?HDDs and SSDs have different characteristics. With a hybrid slution, you get some of the advantages of both. Let me brifly state the man characteristics of HDDs and SSDs. (+) is something positive, (o) is neutral and (-) is a disadvantage.
How to do it with Linux Software RAIDThe trick is to create the RAID1 array and set the HDD(s) during creation as "write-mostly". This will cause the kernel to only do (slow) reads from the HDD if they are really needed. All other reads will go to the SSD. This option was originally added when mirroring over a slow network interface, but performs equally well to concentrate reads on an SSD. Here is how to do it. Let us assume you want to RAID1 a HDD partition sdb6 and an SSD partition sdc6 as md1. (Substitute full disk dev if needed. You can mix partitions and full disks.) The respective call to mdadm would be as follows:
mdadm --create -n 2 -l 1 /dev/md1 /dev/sdc6 -W /dev/sdb6A subsequent check in /proc/mdstat should show a "(W)" after the HDD components. Here is an example from my set-up, with sdb6 and sdc6 HDD partitions and sdd1 an SSD partition (Yes, that is a triple-RAID1.):
cat /proc/mdstat Personalities : [linear] [raid1] [raid6] [raid5] [raid4] ... md6 : active raid1 sdc6(W) sdb6(W) sdd1 62508800 blocks [3/3] [UUU] ...Observed read speeds are the same as for an SSD alone. The write speeds are comparable to a HDD-only RAID1. Since writes can often been buffered, overall a hybrid array is a lot faster than one with only HDDs.
How to do it for an existing RAID1You can enable "write-mostly" for a RAID component in the following way:
echo writemostly > /sys/block/md6/md/dev-sdc6/stateand disable it this way:
echo -writemostly > /sys/block/md6/md/dev-sdc6/state
If for some reason you cannot set a component of a RAID1 to "write-mostly", you can kick it from the array and re-add it with the write-mostly flag active. This will temporarily lower your redundancy level. Backup before doing this is recomended.
To set /dev/sdc6 from the last example to "write-mostly" would work as follows:
To kick, first set it to "faulty":
mdadm --fail /dev/md6 /dev/sdc6Then kick it:
mdadm --remove /dev/md6 /dev/sdc6Then add it again:
mdadm --add /dev/md6 --write-mostly /dev/sdc6Wait for the RAID1 resync to complete, and /dev/sdc6 will now only be read when needed.
MaintenanceThere are two aspects to storage maintenance with RAID: RAID maintenance and storage device maintenance. Both have to goal to detect problems early, when there is still a chance to correct them and to notify you in time when it looks like manual intervention is needed. Still, keep in mind that RAID is not backup. It only covers some of the areas a backup covers, but not all. For example, user error and malware problems are not covered by RAID. Your computer being hit by lightening is coverd also not covered. You do need the backup in addition. What RAID gives you is that the probability of needing that backup is lower, hence the process for restoring from backup can take higher effort, which makes it cheaper. Or you just have the hassel far less often.
RAID consistency checksI recommend running a RAID consistency check every 7 - 15 days. The way to run it is a bit obscure. Basically, you read "/sys/block/mdx/md/mismatch_cnt" (substitute your md device for "mdx") before to make sure it is zero. Then you put the string "check" into "/sys/block/mdx/md/sync_action" (replace "mdx" as before) and wait for it to not give "check" anymore. Then you read the mismatch count again and make sure it is zero. Here is a Python script I wrote that does this md_check.py, just adjust the configured device at the start and use if from cron like this:
# check array with SDD 33 6 * * * /root/sys_tools/mdadm/md_check.pyThis script can be used for other RAID arrays as well, not just for RAID1 or hybrid arrays. I run it two times a month from cron for each of my RAID arrays (I currently have 8), whith a maximum of one check per day. Make sure cron can send email to you or you will not be notified in case of errors. That would make the check basically worthless. It is also possible that your distribution already does this check automatically. Debian does so, but only once a month and with a pretty convoluted script that only works sometimes. It also seems to be missing any meaningful reporting, which makes the check worthless. From my experience, the only reporting that works are checks that send email to an address that is read regularly. For faster alerting, use a mailbox system that notifies you via text message or send the email to your mobile phone in the first place. Forget about anything else, it just it just does not work. Email is the base mechanism to use. And it has the added advantage that you can either send it directly or put a message to stdout when called from cron and cron will send the email for you. That is also one reason any real sysadmin makes reliable email sending a top priority.
SMART selftestsThe seond thing that should be done regularly is a full device read test. I do it every 14 days. For HDDs, you can run a long SMART selftest, e.g. with smartd or manually from cron as well. Make sure you have smartd configured and working to catch errors! smartd also needs to be able to send email to you, otherwise all monitoring is basically worthless. For SSDs, the problem is that not all support SMART or long SMART selftests. If yours does, do the same as for the HDDs. Otherwise hope that read errors will show up during the RAID consistency checks. The RAID consistency check will read all component devices in full, but will not notice if sectors are slow to read or extensive error correction was needed. SMART attributes will show that and smartd will notice and notify you.