linux/fs/bcachefs
Kent Overstreet 5dfd3746b6 bcachefs: Fix needs_whiteout BUG_ON() in bkey_sort()
Btree nodes are log structured; thus, we need to emit whiteouts when
we're deleting a key that's been written out to disk.

k->needs_whiteout tracks whether a key will need a whiteout when it's
deleted, and this requires some careful handling; e.g. the key we're
deleting may not have been written out to disk, but it may have
overwritten a key that was - thus we need to carry this flag around on
overwrites.

Invariants:
There may be multiple key for the same position in a given node (because
of overwrites), but only one of them will be a live (non deleted) key,
and only one key for a given position will have the needs_whiteout flag
set.

Additionally, we don't want to carry around whiteouts that need to be
written in the main searchable part of a btree node - btree_iter_peek()
will have to skip past them, and this can lead to an O(n^2) issues when
doing sequential deletions (e.g. inode rm/truncate). So there's a
separate region in the btree node buffer for unwritten whiteouts; these
are merge sorted with the rest of the keys we're writing in the btree
node write path.

The unwritten whiteouts was a later optimization that bch2_sort_keys()
didn't take into account; the unwritten whiteouts area means that we
never have deleted keys with needs_whiteout set in the main searchable
part of a btree node.

That means we can simplify and optimize some sort paths, and eliminate
an assertion that syzbot found:

- Unless we're in the btree node write path, it's always ok to drop
  whiteouts when sorting
- When sorting for a btree node write, we drop the whiteout if it's not
  from the unwritten whiteouts area, or if it's overwritten by a real
  key at the same position.

This completely eliminates some tricky logic for propagating the
needs_whiteout flag: syzbot was able to hit the assertion that checked
that there shouldn't be more than one key at the same pos with
needs_whiteout set, likely due to a combination of flipping on
needs_whiteout on all written keys (they need whiteouts if overwritten),
combined with not always dropping unneeded whiteouts, and the tricky
logic in the sort path for preserving needs_whiteout that wasn't really
needed.

Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
2024-05-08 14:56:09 -04:00
..
2024-01-21 06:01:45 -05:00
2024-03-13 21:22:26 -04:00
2024-03-13 21:22:24 -04:00
2024-01-21 06:01:45 -05:00
2024-03-13 18:39:12 -04:00
2024-01-01 11:46:52 -05:00
2024-01-21 13:27:10 -05:00
2024-01-05 23:24:21 -05:00
2024-01-21 13:27:11 -05:00
2024-01-01 11:47:44 -05:00
2024-01-21 13:27:11 -05:00
2024-04-08 22:56:37 -04:00
2024-03-13 18:39:12 -04:00
2024-05-06 10:58:17 -04:00
2024-04-03 14:46:51 -04:00
2024-01-21 13:27:10 -05:00
2024-04-28 21:34:29 -04:00
2024-04-06 13:50:26 -04:00
2024-04-06 13:50:26 -04:00
2024-03-10 15:34:08 -04:00
2024-03-13 21:22:26 -04:00
2024-03-10 15:34:09 -04:00
2024-02-13 21:59:27 -05:00
2024-01-01 11:47:07 -05:00
2024-01-21 13:27:10 -05:00
2024-05-06 10:58:17 -04:00
2024-01-21 13:27:11 -05:00
2024-01-21 06:01:45 -05:00
2024-04-03 14:44:18 -04:00
2024-01-01 11:47:07 -05:00
2024-01-01 11:47:40 -05:00
2024-01-21 13:27:10 -05:00
2024-01-05 23:24:21 -05:00
2024-04-05 16:21:18 -04:00
2024-04-03 14:44:18 -04:00
2024-04-06 13:50:26 -04:00
2024-01-21 13:27:10 -05:00