Line 81: |
Line 81: |
| Sometimes a partition will be taken offline automatically. Admin will receive an email '''DegradedArray event on /dev/md2'''. | | Sometimes a partition will be taken offline automatically. Admin will receive an email '''DegradedArray event on /dev/md2'''. |
| | | |
− | This will happen if, for example, a read or write error is detected in a disk in the RAID set, or a disk does not respond fast enough, causing a timeout. When this happens, the details of the raid can be seen by inspecting the mdstat file. | + | This will happen if, for example, a read or write error is detected in a disk in the RAID set, or a disk does not respond fast enough, causing a timeout. As a precaution, verify the health of your disks as documented in: http://wiki.contribs.org/Monitor_Disk_Health and specifically with the command: |
| + | smartctl -a /dev/hda |
| + | Where hda is the device to be checked; check all of them. |
| + | |
| + | When this happens, the details of the raid can be seen by inspecting the mdstat file. |
| | | |
| [root@sme]# cat /proc/mdstat | | [root@sme]# cat /proc/mdstat |
Line 133: |
Line 137: |
| 255936 blocks [2/2] [UU] | | 255936 blocks [2/2] [UU] |
| | | |
− | If this action is required regularly, you should test your disks for SMART errors and physical errors, check your disk cables, and make sure no two hard drives share the same IDE port. Also check your driver cards, since a faulty card can destroy the data on a full RAID set as easily as it can a single disk. | + | If this action is required regularly, you should test your disks for SMART errors and physical errors, check your disk cables, and make sure no two hard drives share the same IDE port. See also: http://wiki.contribs.org/Monitor_Disk_Health |
| + | |
| + | Also check your driver cards, since a faulty card can destroy the data on a full RAID set as easily as it can a single disk. |
| | | |
| <noinclude>[[Category:Howto]]</noinclude> | | <noinclude>[[Category:Howto]]</noinclude> |