RAID 4 / Level 4 RAID / RAID Level 4
We said that the two primary disadvantages of the RAID 3 architecture were the large (and/or inconsistent) transfer block sizes and the inability to perform multiple simultaneous I/O transactions. Both of these problems result from the fact that a single transfer block of data is interleaved across all of the data drives.
In RAID 4, we place the entire first transfer block on the first data drive, the second transfer block on the second data drive, and so forth. Diagram below shows this scheme for 4 drives. There is still only one ECC / parity drive and it is computed as the exclusive-or (XOR) of all of the data drives:
In the event of unreadable data, the lost sector/drive is reconstituted by computing the exclusive-or of the remaining drives.
RAID 4 Read Operations
A read transaction involves only a single data drive and the timing is identical to a single drive configuration. Also, since only one drive is involved, a multitasking operating system could issue an independent read transaction against each data drive. This means that in a saturated I/O environment, with ‘n’ data drives, ‘n’ times as many seeks will complete during a given interval. So a RAID Level 4 array with 6 data drives, each with a mean access time of 12ms will have a virtual access time that approaches 2ms as system loading increases. This effect has been seen for years in benchmarking of operating systems such as Unix, where parallel operations on multiple drives are supported. However, achieving it in practice has required two conditions. First, you have to be able to segregate the data into distinct subsets for inclusion on different drives, and secondly, the system administrator has to have a good enough understanding of how and when the data is used so that an effective segregation is achieved. A RAID 4 makes the allocation to distinct drives automatic, since each and every file is automatically distributed.
RAID 4 Write Operations
Contrary to initial inclinations, a write transaction does not involve all of the drives. It requires reads and writes of the data drive involved and the parity disk. Because the existing ECC block already contains information about the other data drives, the following mechanism can be used:
a. Read the existing data and parity;
b. Compute the new parity block as: new parity = existing parity x existing data x new data;
c. Write the new data and parity.
As a result, write operations have slightly longer seek times for both the read and write portions than a single drive. However, more important is the fact that the ECC drive is involved in every write transaction and thus only one write transaction can be performed at a time. The parallelism possible in the read transactions is impossible in write transactions.
Transfer rates for a single read/write transaction are the same as for a single drive. In a multitasking transaction processing system, the virtual transfer rate will approach the number of data drives times the single drive transfer rate as the ratio of reads to writes increases.
RAID 4 Benefits and Drawbacks
The primary advantage of RAID 4 is the ability to process multiple simultaneous reads. This
makes it extremely efficient for transaction or multitasking systems where the ratio of reads to
writes is very high. The disadvantage is that it can only process one write transaction at a time. Single parity drive becomes a bottleneck in RAID Level 4.