linux

iv/linux

History

Filipe Manana b38ef71cb1 Btrfs: ensure ordered extent errors aren't missed on fsync

When doing a fsync with a fast path we have a time window where we can miss
the fact that writeback of some file data failed, and therefore we endup
returning success (0) from fsync when we should return an error.
The steps that lead to this are the following:

1) We start all ordered extents by calling filemap_fdatawrite_range();

2) We do some other work like locking the inode's i_mutex, start a transaction,
   start a log transaction, etc;

3) We enter btrfs_log_inode(), acquire the inode's log_mutex and collect all the
   ordered extents from inode's ordered tree into a list;

4) But by the time we do ordered extent collection, some ordered extents we started
   at step 1) might have already completed with an error, and therefore we didn't
   found them in the ordered tree and had no idea they finished with an error. This
   makes our fsync return success (0) to userspace, but has no bad effects on the log
   like for example insertion of file extent items into the log that point to unwritten
   extents, because the invalid extent maps were removed before the ordered extent
   completed (in inode.c:btrfs_finish_ordered_io).

So after collecting the ordered extents just check if the inode's i_mapping has any
error flags set (AS_EIO or AS_ENOSPC) and leave with an error if it does. Whenever
writeback fails for a page of an ordered extent, we call mapping_set_error (done in
extent_io.c:end_extent_writepage, called by extent_io.c:end_bio_extent_writepage)
that sets one of those error flags in the inode's i_mapping flags.

This change also has the side effect of fixing the issue where for fast fsyncs we
never checked/cleared the error flags from the inode's i_mapping flags, which means
that a full fsync performed after a fast fsync could get such errors that belonged
to the fast fsync - because the full fsync calls btrfs_wait_ordered_range() which
calls filemap_fdatawait_range(), and the later checks for and clears those flags,
while for fast fsyncs we never call filemap_fdatawait_range() or anything else
that checks for and clears the error flags from the inode's i_mapping.

Signed-off-by: Filipe Manana <fdmanana@suse.com>
Signed-off-by: Chris Mason <clm@fb.com>

2014-11-21 11:59:57 -08:00

tests

Btrfs: remove empty block groups automatically

2014-09-22 17:13:21 -07:00

acl.c

btrfs: remove useless ACL check

2014-06-09 17:20:42 -07:00

async-thread.c

btrfs: remove unlikely from NULL checks

2014-10-02 16:06:19 +02:00

async-thread.h

Btrfs: implement repair function when direct read fails

2014-09-17 13:39:01 -07:00

backref.c

btrfs: remove parameter blocksize from read_tree_block

2014-10-02 17:14:50 +02:00

backref.h

Btrfs: make fiemap not blow when you have lots of snapshots

2014-09-17 13:38:24 -07:00

btrfs_inode.h

Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs

2014-10-11 08:03:52 -04:00

check-integrity.c

Btrfs: check-int: don't complain about balanced blocks

2014-11-20 17:14:30 -08:00

check-integrity.h

block: submit_bio_wait() conversions

2013-11-24 16:33:41 -07:00

compression.c

Btrfs: don't ignore compressed bio write errors

2014-11-20 17:14:26 -08:00

compression.h

btrfs: make static code static & remove dead code

2013-05-06 15:55:23 -04:00

ctree.c

Btrfs: make xattr replace operations atomic

2014-11-20 17:20:07 -08:00

ctree.h

Btrfs: ensure ordered extent errors aren't missed on fsync

2014-11-21 11:59:57 -08:00

delayed-inode.c

btrfs: kill the key type accessor helpers

2014-09-17 13:37:12 -07:00

delayed-inode.h

Btrfs: introduce the delayed inode ref deletion for the single link inode

2014-01-28 13:20:09 -08:00

delayed-ref.c

Btrfs: rework qgroup accounting

2014-06-09 17:20:48 -07:00

delayed-ref.h

Btrfs: rework qgroup accounting

2014-06-09 17:20:48 -07:00

dev-replace.c

Btrfs: return failure if btrfs_dev_replace_finishing() failed

2014-11-20 17:14:28 -08:00

dev-replace.h

Btrfs: add new sources for device replace code

2012-12-12 17:15:41 -05:00

dir-item.c

Btrfs: make xattr replace operations atomic

2014-11-20 17:20:07 -08:00

disk-io.c

Btrfs: make sure logged extents complete in the current transaction V3

2014-11-21 11:58:32 -08:00

disk-io.h

Merge branch 'cleanup/blocksize-diet-part1' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux into for-linus

2014-10-04 09:57:14 -07:00

export.c

btrfs: kill the key type accessor helpers

2014-09-17 13:37:12 -07:00

export.h

…

extent_io.c

Btrfs: avoid premature -ENOMEM in clear_extent_bit()

2014-11-20 17:20:06 -08:00

extent_io.h

Btrfs: set page and mapping error on compressed write failure

2014-11-20 17:14:25 -08:00

extent_map.c

Btrfs: do not move em to modified list when unpinning

2014-11-21 11:59:54 -08:00

extent_map.h

Btrfs: fix NULL pointer crash when running balance and scrub concurrently

2014-06-19 14:20:55 -07:00

extent-tree.c

Btrfs: move read only block groups onto their own list V2

2014-11-20 17:20:04 -08:00

file-item.c

Btrfs: fix kfree on list_head in btrfs_lookup_csums_range error cleanup

2014-11-04 06:59:04 -08:00

file.c

Btrfs: add helper btrfs_fdatawrite_range

2014-11-20 17:14:28 -08:00

free-space-cache.c

Btrfs: improve free space cache management and space allocation

2014-09-17 13:38:13 -07:00

free-space-cache.h

Btrfs: remove path arg from btrfs_truncate_free_space_cache

2013-11-11 21:51:33 -05:00

hash.c

btrfs: LLVMLinux: Remove VLAIS

2014-10-14 10:51:22 +02:00

hash.h

Btrfs: fix btrfs boot when compiled as built-in

2014-01-28 13:20:31 -08:00

inode-item.c

btrfs: kill the key type accessor helpers

2014-09-17 13:37:12 -07:00

inode-map.c

btrfs: cleanup ino cache members of btrfs_root

2014-09-17 13:37:09 -07:00

inode-map.h

…

inode.c

Btrfs: ensure ordered extent errors aren't missed on fsync

2014-11-21 11:59:57 -08:00

ioctl.c

vfs: export check_sticky()

2014-10-24 00:14:36 +02:00

Kconfig

Btrfs: fix btrfs boot when compiled as built-in

2014-01-28 13:20:31 -08:00

locking.c

Btrfs: fix deadlocks with trylock on tree nodes

2014-06-19 14:19:55 -07:00

locking.h

Btrfs: remove btrfs_try_spin_lock

2013-03-14 14:57:10 -04:00

lzo.c

btrfs: use DIV_ROUND_UP instead of open-coded variants

2014-09-17 13:37:17 -07:00

Makefile

Btrfs: add sanity tests for new qgroup accounting code

2014-06-09 17:20:49 -07:00

math.h

Btrfs: cleanup duplicated division functions

2012-12-11 13:31:30 -05:00

ordered-data.c

Btrfs: collect only the necessary ordered extents on ranged fsync

2014-11-21 11:59:56 -08:00

ordered-data.h

Btrfs: collect only the necessary ordered extents on ranged fsync

2014-11-21 11:59:56 -08:00

orphan.c

btrfs: kill the key type accessor helpers

2014-09-17 13:37:12 -07:00

print-tree.c

btrfs: remove parameter blocksize from read_tree_block

2014-10-02 17:14:50 +02:00

print-tree.h

btrfs: make static code static & remove dead code

2013-05-06 15:55:23 -04:00

props.c

Btrfs: add support for inode properties

2014-01-28 13:20:24 -08:00

props.h

Btrfs: add support for inode properties

2014-01-28 13:20:24 -08:00

qgroup.c

btrfs: move checks for DUMMY_ROOT into a helper

2014-10-02 17:30:33 +02:00

qgroup.h

btrfs: qgroup: account shared subtrees during snapshot delete

2014-08-15 07:43:14 -07:00

raid56.c

btrfs: use DIV_ROUND_UP instead of open-coded variants

2014-09-17 13:37:17 -07:00

raid56.h

Btrfs: RAID5 and RAID6

2013-02-01 14:24:23 -05:00

rcu-string.h

…

reada.c

btrfs: use nodesize everywhere, kill leafsize

2014-09-17 13:37:14 -07:00

relocation.c

Merge branch 'cleanup/blocksize-diet-part1' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux into for-linus

2014-10-04 09:57:14 -07:00

root-tree.c

Btrfs: use bitfield instead of integer data type for the some variants in btrfs_root

2014-06-09 17:20:40 -07:00

scrub.c

btrfs: fix dead lock while running replace and defrag concurrently

2014-11-20 17:20:08 -08:00

send.c

Merge branch 'cleanup/misc-for-3.18' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux into for-linus

2014-10-04 09:56:45 -07:00

send.h

btrfs: make static code static & remove dead code

2013-05-06 15:55:23 -04:00

struct-funcs.c

…

super.c

btrfs: fix wrong accounting of raid1 data profile in statfs

2014-11-20 17:20:09 -08:00

sysfs.c

btrfs: sysfs label interface should check for read only FS

2014-09-17 13:38:01 -07:00

sysfs.h

btrfs: code optimize: BTRFS_ATTR_RW could set the mode

2014-09-17 13:37:59 -07:00

transaction.c

Btrfs: make sure logged extents complete in the current transaction V3

2014-11-21 11:58:32 -08:00

transaction.h

Btrfs: make sure logged extents complete in the current transaction V3

2014-11-21 11:58:32 -08:00

tree-defrag.c

Btrfs: use bitfield instead of integer data type for the some variants in btrfs_root

2014-06-09 17:20:40 -07:00

tree-log.c

Btrfs: ensure ordered extent errors aren't missed on fsync

2014-11-21 11:59:57 -08:00

tree-log.h

Btrfs: fix data corruption after fast fsync and writeback error

2014-09-19 06:57:51 -07:00

ulist.c

Btrfs: do not export ulist functions

2014-01-29 07:06:27 -08:00

ulist.h

Btrfs: Fix memory corruption by ulist_add_merge() on 32bit arch

2014-08-15 07:43:19 -07:00

uuid-tree.c

Btrfs: make btrfs_search_forward return with nodes unlocked

2014-09-17 13:38:02 -07:00

volumes.c

Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs

2014-10-11 08:03:52 -04:00

volumes.h

Btrfs: remove empty block groups automatically

2014-09-22 17:13:21 -07:00

xattr.c

Btrfs: make xattr replace operations atomic

2014-11-20 17:20:07 -08:00

xattr.h

btrfs: use generic posix ACL infrastructure

2014-01-25 23:58:18 -05:00

zlib.c

btrfs compression: merge inflate and deflate z_streams

2014-09-17 13:37:33 -07:00