NeilBrown 4274215d24 md: avoid endless recovery loop when waiting for failed device to complete.
If a device fails in a way that causes pending requests to take a while
to complete, md will not be able to immediately remove it from the
array in remove_and_add_spares().
It will then incorrectly look like a spare device, and md will try to
recover it even though it has failed.
This leads to a recovery process starting and instantly aborting, over
and over again.

We should check if the device is faulty before considering it to be a
spare.  This will avoid trying to start a recovery that cannot
proceed.
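
The check described above can be illustrated with a small user-space
sketch. This is not the kernel code: the struct, its fields and both
function names are hypothetical stand-ins, and the real md code keeps
this state in flag bits rather than plain booleans. It only assumes
that a device counts as a recoverable spare when it holds a slot in
the array, is not yet in sync, and is not faulty.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified stand-in for md's per-device state. */
struct sketch_rdev {
    int  raid_disk;  /* slot in the array, or -1 if unassigned */
    bool in_sync;    /* device holds up-to-date data */
    bool faulty;     /* device has failed */
};

/*
 * A device is worth recovering only if it occupies a slot, is not
 * yet in sync, and has not failed. Without the faulty test, a failed
 * device that is still waiting for pending requests to drain would
 * be mistaken for a spare.
 */
static bool sketch_is_recoverable_spare(const struct sketch_rdev *rdev)
{
    return rdev->raid_disk >= 0 && !rdev->in_sync && !rdev->faulty;
}

/* Count the devices for which a recovery could actually proceed. */
static size_t sketch_count_spares(const struct sketch_rdev *devs, size_t n)
{
    size_t spares = 0;

    for (size_t i = 0; i < n; i++)
        if (sketch_is_recoverable_spare(&devs[i]))
            spares++;
    return spares;
}
```

With this check, a failed device that cannot yet be removed adds
nothing to the spare count, so under these assumptions no recovery is
started on its behalf.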

This bug was introduced in 2.6.26, so this patch is suitable for any
kernel since then.

Cc: stable@kernel.org
Reported-by: Jim Paradis <james.paradis@stratus.com>
Signed-off-by: NeilBrown <neilb@suse.de>
2011-06-28 16:59:42 +10:00