Bart Van Assche 3a0a529971 block, scsi: Make SCSI quiesce and resume work reliably
The contexts from which a SCSI device can be quiesced or resumed are:
* Writing into /sys/class/scsi_device/*/device/state.
* SCSI parallel (SPI) domain validation.
* The SCSI device power management methods. See also scsi_bus_pm_ops.

It is essential during suspend and resume that neither the filesystem
state nor the filesystem metadata in RAM changes. This is why while
the hibernation image is being written or restored that SCSI devices
are quiesced. The SCSI core quiesces devices through scsi_device_quiesce()
and scsi_device_resume(). In the SDEV_QUIESCE state execution of
non-preempt requests is deferred. This is realized by returning
BLKPREP_DEFER from inside scsi_prep_state_check() for quiesced SCSI
devices. Avoid that a full queue prevents power management requests
to be submitted by deferring allocation of non-preempt requests for
devices in the quiesced state. This patch has been tested by running
the following commands and by verifying that after each resume the
fio job was still running:

for ((i=0; i<10; i++)); do
  (
    cd /sys/block/md0/md &&
    while true; do
      [ "$(<sync_action)" = "idle" ] && echo check > sync_action
      sleep 1
    done
  ) &
  pids=($!)
  for d in /sys/class/block/sd*[a-z]; do
    bdev=${d#/sys/class/block/}
    hcil=$(readlink "$d/device")
    hcil=${hcil#../../../}
    echo 4 > "$d/queue/nr_requests"
    echo 1 > "/sys/class/scsi_device/$hcil/device/queue_depth"
    fio --name="$bdev" --filename="/dev/$bdev" --buffered=0 --bs=512 \
      --rw=randread --ioengine=libaio --numjobs=4 --iodepth=16       \
      --iodepth_batch=1 --thread --loops=$((2**31)) &
    pids+=($!)
  done
  sleep 1
  echo "$(date) Hibernating ..." >>hibernate-test-log.txt
  systemctl hibernate
  sleep 10
  kill "${pids[@]}"
  echo idle > /sys/block/md0/md/sync_action
  wait
  echo "$(date) Done." >>hibernate-test-log.txt
done

Reported-by: Oleksandr Natalenko <oleksandr@natalenko.name>
References: "I/O hangs after resuming from suspend-to-ram" (https://marc.info/?l=linux-block&m=150340235201348).
Signed-off-by: Bart Van Assche <bart.vanassche@wdc.com>
Reviewed-by: Hannes Reinecke <hare@suse.com>
Tested-by: Martin Steigerwald <martin@lichtvoll.de>
Tested-by: Oleksandr Natalenko <oleksandr@natalenko.name>
Cc: Martin K. Petersen <martin.petersen@oracle.com>
Cc: Ming Lei <ming.lei@redhat.com>
Cc: Christoph Hellwig <hch@lst.de>
Cc: Johannes Thumshirn <jthumshirn@suse.de>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
2017-11-10 19:53:25 -07:00
..
2017-09-07 12:12:43 -07:00
2017-09-14 21:02:03 -04:00
2017-09-07 12:12:43 -07:00
2017-09-07 12:12:43 -07:00
2017-09-07 12:12:43 -07:00
2017-08-25 17:29:29 -04:00
2017-09-07 12:12:43 -07:00
2017-09-07 12:12:43 -07:00
2017-09-07 12:12:43 -07:00
2017-08-25 17:21:11 -04:00
2015-04-17 10:13:56 -07:00
2015-11-25 22:08:55 -05:00
2017-08-24 22:29:06 -04:00
2014-07-17 22:07:37 +02:00
2016-05-10 22:01:07 -04:00
2015-11-09 16:32:14 -08:00
2017-09-05 08:18:42 -04:00
2017-08-29 21:51:44 -04:00
2017-08-25 17:21:10 -04:00
2017-08-08 11:49:51 -04:00
2016-01-27 20:36:10 +08:00
2017-11-02 08:26:55 -06:00
2016-01-27 20:36:10 +08:00
2017-08-25 17:08:08 -04:00
2017-04-26 15:09:04 -06:00
2014-07-17 22:07:35 +02:00
2017-08-25 17:21:11 -04:00
2016-11-28 15:51:31 -05:00
2017-04-13 13:58:20 -04:00
2017-09-07 12:12:43 -07:00
2017-05-16 08:44:15 -03:00
2017-09-07 12:12:43 -07:00
2016-02-23 21:27:02 -05:00
2014-07-17 22:07:39 +02:00
2014-11-12 11:15:57 +01:00
2017-08-08 11:49:51 -04:00
2015-11-18 11:59:09 -05:00
2017-04-26 18:32:29 -04:00