for-linus-20181205 - pub/scm/linux/kernel/git/axboe/linux-block

tag	2f19ab7c40c92372601d59b8bc8be36e7913cbe3
tagger	Jens Axboe <axboe@kernel.dk>	Wed Dec 05 13:22:01 2018 -0700
object	ffe81d45322cc3cb140f0db080a4727ea284661e

for-linus-20181205

commit	ffe81d45322cc3cb140f0db080a4727ea284661e
author	Jens Axboe <axboe@kernel.dk>	Tue Dec 04 20:06:48 2018 -0700
committer	Jens Axboe <axboe@kernel.dk>	Tue Dec 04 20:06:48 2018 -0700
tree	fe3625f3578ff929ad395d2b41e8a1fce937c6ed
parent	fd6f32f78645db32b6b95a42e45da2ddd6de0e67

blk-mq: fix corruption with direct issue

If we attempt a direct issue to a SCSI device, and it returns BUSY, then
we queue the request up normally. However, the SCSI layer may have
already set up SG tables etc. for this particular command. If we later
merge with this request, then the old tables are no longer valid. Once
we issue the IO, we only read/write the original part of the request,
not the new state of it.

This causes data corruption, and is most often noticed with the file
system complaining about the just-read data being invalid:

[  235.934465] EXT4-fs error (device sda1): ext4_iget:4831: inode #7142: comm dpkg-query: bad extra_isize 24937 (inode size 256)

because most of it is garbage...

This doesn't happen from the normal issue path, as we will simply defer
the request to the hardware queue dispatch list if we fail. Once it's on
the dispatch list, we never merge with it.

Fix this from the direct issue path by flagging the request as
REQ_NOMERGE so we don't change the size of it before issue.
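
The failure mode and the fix can be illustrated with a small userspace model.
This is only a sketch under stated assumptions, not the actual change to
block/blk-mq.c: struct sketch_request, SKETCH_NOMERGE and the sketch_*
functions are hypothetical names invented for illustration, and the
"prepared" byte count merely stands in for the SG tables the SCSI layer
sets up.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the kernel's REQ_NOMERGE flag. */
#define SKETCH_NOMERGE (1u << 0)

/* Toy model of a request; this is not the kernel's struct request. */
struct sketch_request {
	unsigned int flags;
	size_t nr_bytes;       /* current size of the request */
	size_t prepared_bytes; /* size the driver prepared for; 0 = nothing prepared */
};

/*
 * Model a direct issue attempt. The driver prepares its mapping for the
 * current request size before it finds out whether the device is busy.
 * Returns true if the request was issued, false if it must be queued.
 */
static bool sketch_direct_issue(struct sketch_request *rq, bool device_busy,
				bool apply_fix)
{
	rq->prepared_bytes = rq->nr_bytes;
	if (!device_busy)
		return true;
	/* The fix described above: forbid merges once the driver has seen it. */
	if (apply_fix)
		rq->flags |= SKETCH_NOMERGE;
	return false;
}

/* Model a later merge attempt; it grows the request unless merging is barred. */
static bool sketch_try_merge(struct sketch_request *rq, size_t extra_bytes)
{
	assert(rq != NULL);
	if (rq->flags & SKETCH_NOMERGE)
		return false;
	rq->nr_bytes += extra_bytes;
	return true;
}

/* True when the prepared mapping no longer matches the request size. */
static bool sketch_mapping_is_stale(const struct sketch_request *rq)
{
	return rq->prepared_bytes != 0 && rq->prepared_bytes != rq->nr_bytes;
}
```

In this model, a busy direct issue without the flag followed by a merge
leaves a stale mapping, which corresponds to the corruption case. With the
flag set, the merge is refused and the prepared size still matches when the
request is reissued.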

See also:
  https://bugzilla.kernel.org/show_bug.cgi?id=201685

Tested-by: Guenter Roeck <linux@roeck-us.net>
Fixes: 6ce3dd6eec1 ("blk-mq: issue directly if hw queue isn't busy in case of 'none'")
Cc: stable@vger.kernel.org
Signed-off-by: Jens Axboe <axboe@kernel.dk>

block/blk-mq.c

1 file changed
