linux

iv/linux

Dave Chinner e53946dbd3 xfs: xfs_iflush_abort() can be called twice on cluster writeback failure

When a corrupt inode is detected during xfs_iflush_cluster, we can
get a shutdown ASSERT failure like this:

XFS (pmem1): Metadata corruption detected at xfs_symlink_shortform_verify+0x5c/0xa0, inode 0x86627 data fork
XFS (pmem1): Unmount and run xfs_repair
XFS (pmem1): xfs_do_force_shutdown(0x8) called from line 3372 of file fs/xfs/xfs_inode.c.  Return address = ffffffff814f4116
XFS (pmem1): Corruption of in-memory data detected.  Shutting down filesystem
XFS (pmem1): xfs_do_force_shutdown(0x1) called from line 222 of file fs/xfs/libxfs/xfs_defer.c.  Return address = ffffffff814a8a88
XFS (pmem1): xfs_do_force_shutdown(0x1) called from line 222 of file fs/xfs/libxfs/xfs_defer.c.  Return address = ffffffff814a8ef9
XFS (pmem1): Please umount the filesystem and rectify the problem(s)
XFS: Assertion failed: xfs_isiflocked(ip), file: fs/xfs/xfs_inode.h, line: 258
.....
Call Trace:
 xfs_iflush_abort+0x10a/0x110
 xfs_iflush+0xf3/0x390
 xfs_inode_item_push+0x126/0x1e0
 xfsaild+0x2c5/0x890
 kthread+0x11c/0x140
 ret_from_fork+0x24/0x30

Essentially, xfs_iflush_abort() has been called twice on the
original inode that was flushed. This happens because the
inode has been flushed to the buffer successfully via
xfs_iflush_int(), and so when another inode is detected as corrupt
in xfs_iflush_cluster, the buffer is marked stale and EIO, and
iodone callbacks are run on it.

Running the iodone callbacks walks across the original inode and
calls xfs_iflush_abort() on it. When xfs_iflush_cluster() returns
to xfs_iflush(), it runs the error path for that function, and that
calls xfs_iflush_abort() on the inode a second time, leading to the
above assert failure as the inode is not flush locked anymore.
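
The sequence above can be sketched as a small userspace model. This is an illustration only, not the kernel code: every `toy_*` name is hypothetical, the inode is reduced to a single flush-lock flag, and a counter stands in for the ASSERT so the example can run to completion.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for an inode: only the flush lock is modelled. */
struct toy_inode {
	bool flush_locked;  /* models the state checked by xfs_isiflocked(ip) */
	int  bad_aborts;    /* aborts attempted while not flush locked */
};

/* Models xfs_iflush_abort(): expects the flush lock, then drops it. */
static void toy_iflush_abort(struct toy_inode *ip)
{
	assert(ip != NULL);
	if (!ip->flush_locked)
		ip->bad_aborts++;  /* where the real ASSERT would fire */
	ip->flush_locked = false;
}

/*
 * Models a cluster flush that finds another inode corrupt: the buffer is
 * failed and its iodone callbacks abort the already-flushed inode.
 */
static int toy_iflush_cluster_fail(struct toy_inode *ip)
{
	toy_iflush_abort(ip);  /* first abort, via the iodone callbacks */
	return -EIO;
}

/* Models the pre-fix caller: its error path aborts unconditionally. */
static int toy_iflush(struct toy_inode *ip)
{
	int error;

	ip->flush_locked = true;  /* inode flushed to the buffer */
	error = toy_iflush_cluster_fail(ip);
	if (error)
		toy_iflush_abort(ip);  /* second abort: lock already dropped */
	return error;
}
```

Calling `toy_iflush()` on a fresh `struct toy_inode` returns `-EIO` and leaves `bad_aborts` at 1, which corresponds to the single assertion failure in the trace.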

This bug has been there a long time.

The simple fix would be to just avoid calling xfs_iflush_abort() in
xfs_iflush() if we've got a failure from xfs_iflush_cluster().
However, xfs_iflush_cluster() has magic delwri buffer handling that
means it may or may not have run IO completion on the buffer, and
hence sometimes we have to call xfs_iflush_abort() from
xfs_iflush(), and sometimes we shouldn't.

After reading through all the error paths and the delwri buffer
code, it's clear that the error handling in xfs_iflush_cluster() is
unnecessary. If the buffer is delwri, it leaves it on the delwri
list so that when the delwri list is submitted it sees a shutdown
filesystem in xfs_buf_submit() and that marks the buffer stale, EIO
and runs IO completion. i.e. exactly what xfs_iflush_cluster() does
when it's not a delwri buffer. Further, marking a buffer stale
clears the _XBF_DELWRI_Q flag on the buffer, which means when
submission of the buffer occurs, it just skips over it and releases
it.

IOWs, the error handling in xfs_iflush_cluster doesn't need to care
if the buffer is already on the delwri queue or not - it just
needs to mark the buffer stale, EIO and run completions. That means
we can just use the easy fix for xfs_iflush() to avoid the double
abort.
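
A minimal sketch of the resulting behaviour, under the same caveat that this is a toy userspace model and not the actual patch: the `toy_*` functions and `TOY_*` flags are hypothetical stand-ins (`TOY_DELWRI_Q` plays the role of `_XBF_DELWRI_Q`), and IO completion is reduced to aborting one attached inode.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

#define TOY_STALE    0x1u  /* buffer has been marked stale */
#define TOY_DELWRI_Q 0x2u  /* buffer is queued on a delwri list */

struct toy_inode { bool flush_locked; int aborts; };
struct toy_buf   { unsigned int flags; int error; struct toy_inode *ip; };

static void toy_iflush_abort(struct toy_inode *ip)
{
	ip->aborts++;
	ip->flush_locked = false;
}

/* Cluster error handling: identical whether or not the buffer is queued. */
static void toy_cluster_error(struct toy_buf *bp)
{
	assert(bp != NULL && bp->ip != NULL);
	bp->flags |= TOY_STALE;
	bp->flags &= ~TOY_DELWRI_Q;  /* marking stale clears the queue flag */
	bp->error = -EIO;
	toy_iflush_abort(bp->ip);    /* IO completion aborts the inode once */
}

/* Caller after the fix: a cluster failure is returned without a second abort. */
static int toy_iflush(struct toy_buf *bp, bool cluster_fails)
{
	bp->ip->flush_locked = true;
	if (cluster_fails) {
		toy_cluster_error(bp);
		return bp->error;
	}
	return 0;
}

/* Delwri submission: buffers without the queue flag are skipped. */
static int toy_delwri_submit(struct toy_buf **list, size_t n)
{
	int written = 0;

	for (size_t i = 0; i < n; i++) {
		if (!(list[i]->flags & TOY_DELWRI_Q))
			continue;            /* stale buffer: skip, just release */
		list[i]->flags &= ~TOY_DELWRI_Q;
		written++;
	}
	return written;
}
```

With a buffer that starts out flagged `TOY_DELWRI_Q`, a failing `toy_iflush()` aborts the inode exactly once, and a later `toy_delwri_submit()` over that buffer writes nothing.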

Signed-off-by: Dave Chinner <dchinner@redhat.com>
Reviewed-by: Brian Foster <bfoster@redhat.com>
Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com>
Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>

2018-06-21 23:31:38 -07:00

arch

Solve a series of broken links for files under Documentation:

2018-06-17 05:25:18 +09:00

block

for-linus-20180616

2018-06-17 05:37:55 +09:00

certs

docs: Fix some broken references

2018-06-15 18:10:01 -03:00

crypto

docs: Fix some broken references

2018-06-15 18:10:01 -03:00

Documentation

for-linus-20180616

2018-06-17 05:37:55 +09:00

drivers

for-linus-20180616

2018-06-17 05:37:55 +09:00

firmware

kbuild: remove all dummy assignments to obj-

2017-11-18 11:46:06 +09:00

xfs: xfs_iflush_abort() can be called twice on cluster writeback failure

2018-06-21 23:31:38 -07:00

include

for-linus-20180616

2018-06-17 05:37:55 +09:00

init

Kbuild updates for v4.18 (2nd)

2018-06-13 08:40:34 -07:00

ipc

ipc: use new return type vm_fault_t

2018-06-15 07:55:25 +09:00

kernel

Solve a series of broken links for files under Documentation:

2018-06-17 05:25:18 +09:00

lib

docs: Fix some broken references

2018-06-15 18:10:01 -03:00

LICENSES

LICENSES: Add Linux-OpenIB license text

2018-04-27 16:41:53 -06:00

mm: fix oom_kill event handling

2018-06-15 07:55:25 +09:00

net

Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net

2018-06-16 07:39:34 +09:00

samples

VFIO updates for v4.18

2018-06-12 13:11:26 -07:00

scripts

scripts/documentation-file-ref-check: check tools/*/Documentation

2018-06-15 18:10:01 -03:00

security

docs: Fix some broken references

2018-06-15 18:10:01 -03:00

sound

docs: Fix some broken references

2018-06-15 18:10:01 -03:00

tools

Solve a series of broken links for files under Documentation:

2018-06-17 05:25:18 +09:00

usr

kbuild: rename built-in.o to built-in.a

2018-03-26 02:01:19 +09:00

virt

- Error path bug fix for overflow tests (Dan)

2018-06-12 18:28:00 -07:00

.clang-format

clang-format: add configuration file

2018-04-11 10:28:35 -07:00

.cocciconfig

scripts: add Linux .cocciconfig for coccinelle

2016-07-22 12:13:39 +02:00

.get_maintainer.ignore

…

.gitattributes

.gitattributes: set git diff driver for C source code files

2016-10-07 18:46:30 -07:00

.gitignore

Kbuild updates for v4.17 (2nd)

2018-04-15 17:21:30 -07:00

.mailmap

Merge branch 'asoc-4.17' into asoc-4.18 for compress dependencies

2018-04-26 12:24:28 +01:00

COPYING

COPYING: use the new text with points to the license files

2018-03-23 12:41:45 -06:00

CREDITS

MAINTAINERS/CREDITS: Drop METAG ARCHITECTURE

2018-03-05 16:34:24 +00:00

Kbuild

Kbuild updates for v4.15

2017-11-17 17:45:29 -08:00

Kconfig

kconfig: add basic helper macros to scripts/Kconfig.include

2018-05-29 03:31:19 +09:00

MAINTAINERS

Solve a series of broken links for files under Documentation:

2018-06-17 05:25:18 +09:00

Makefile

Linux 4.18-rc1

2018-06-17 08:04:49 +09:00

README

Docs: Added a pointer to the formatted docs to README

2018-03-21 09:02:53 -06:00

README

Linux kernel
============

There are several guides for kernel developers and users. These guides can
be rendered in a number of formats, like HTML and PDF. Please read
Documentation/admin-guide/README.rst first.

In order to build the documentation, use ``make htmldocs`` or
``make pdfdocs``.  The formatted documentation can also be read online at:

    https://www.kernel.org/doc/html/latest/

There are various text files in the Documentation/ subdirectory,
several of them using the Restructured Text markup notation.
See Documentation/00-INDEX for a list of what is contained in each file.

Please read the Documentation/process/changes.rst file, as it contains the
requirements for building and running the kernel, and information about
the problems which may result from upgrading your kernel.

Languages

C 97.6%

Assembly 1%

Shell 0.5%

Python 0.3%

Makefile 0.3%