linux

iv/linux

Go to file

Kent Overstreet 33bd5d0686 bcachefs: Deadlock cycle detector We've outgrown our own deadlock avoidance strategy. The btree iterator API provides an interface where the user doesn't need to concern themselves with lock ordering - different btree iterators can be traversed in any order. Without special care, this will lead to deadlocks. Our previous strategy was to define a lock ordering internally, and whenever we attempt to take a lock and trylock() fails, we'd check if the current btree transaction is holding any locks that cause a lock ordering violation. If so, we'd issue a transaction restart, and then bch2_trans_begin() would re-traverse all previously used iterators, but in the correct order. That approach had some issues, though. - Sometimes we'd issue transaction restarts unnecessarily, when no deadlock would have actually occured. Lock ordering restarts have become our primary cause of transaction restarts, on some workloads totally 20% of actual transaction commits. - To avoid deadlock or livelock, we'd often have to take intent locks when we only wanted a read lock: with the lock ordering approach, it is actually illegal to hold _any_ read lock while blocking on an intent lock, and this has been causing us unnecessary lock contention. - It was getting fragile - the various lock ordering rules are not trivial, and we'd been seeing occasional livelock issues related to this machinery. So, since bcachefs is already a relational database masquerading as a filesystem, we're stealing the next traditional database technique and switching to a cycle detector for avoiding deadlocks. When we block taking a btree lock, after adding ourself to the waitlist but before sleeping, we do a DFS of btree transactions waiting on other btree transactions, starting with the current transaction and walking our held locks, and transactions blocking on our held locks. If we find a cycle, we emit a transaction restart. Occasionally (e.g. the btree split path) we can not allow the lock() operation to fail, so if necessary we'll tell another transaction that it has to fail. Result: trans_restart_would_deadlock events are reduced by a factor of 10 to 100, and we'll be able to delete a whole bunch of grotty, fragile code. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>		2023-10-22 17:09:41 -04:00
arch	Fix preemption delays in the SGX code, remove unnecessarily UAPI-exported code,	2023-09-10 10:39:31 -07:00
block	block: fix pin count management when merging same-page segments	2023-09-06 07:32:27 -06:00
certs	certs: Reference revocation list for all keyrings	2023-08-17 20:12:41 +00:00
crypto	This update includes the following changes:	2023-08-29 11:23:29 -07:00
Documentation	drm ci for 6.6-rc1	2023-09-10 11:55:26 -07:00
drivers	bcache: move closures to lib/	2023-10-19 14:47:33 -04:00
fs	bcachefs: Deadlock cycle detector	2023-10-22 17:09:41 -04:00
include	bcachefs: Update export_operations for snapshots	2023-10-22 17:09:17 -04:00
init	sched: Add task_struct->faults_disabled_mapping	2023-09-11 23:59:46 -04:00
io_uring	Revert "io_uring: fix IO hang in io_wq_put_and_exit from do_exit()"	2023-09-07 09:41:49 -06:00
ipc	Add x86 shadow stack support	2023-08-31 12:20:12 -07:00
kernel	locking: export contention tracepoints for bcachefs six locks	2023-10-19 14:47:33 -04:00
lib	lib/generic-radix-tree.c: Add peek_prev()	2023-10-19 14:47:33 -04:00
LICENSES	LICENSES: Add the copyleft-next-0.3.1 license	2022-11-08 15:44:01 +01:00
mm	LoongArch changes for v6.6	2023-09-08 12:16:52 -07:00
net	Including fixes from netfilter and bpf.	2023-09-07 18:33:07 -07:00
rust	Documentation work keeps chugging along; stuff for 6.6 includes:	2023-08-30 20:05:42 -07:00
samples	VFIO updates for v6.6-rc1	2023-08-30 20:36:01 -07:00
scripts	Fix preemption delays in the SGX code, remove unnecessarily UAPI-exported code,	2023-09-10 10:39:31 -07:00
security	Landlock updates for v6.6-rc1	2023-09-08 12:06:51 -07:00
sound	sound fixes for 6.6-rc1	2023-09-08 13:07:50 -07:00
tools	objtool: Add bcachefs noreturns	2023-10-19 14:58:29 -04:00
usr	initramfs: Encode dependency on KBUILD_BUILD_TIMESTAMP	2023-06-06 17:54:49 +09:00
virt	ARM:	2023-09-07 13:52:20 -07:00
.clang-format	iommu: Add for_each_group_device()	2023-05-23 08:15:51 +02:00
.cocciconfig
.get_maintainer.ignore
.gitattributes	.gitattributes: set diff driver for Rust source code files	2023-05-31 17:48:25 +02:00
.gitignore	kbuild: rpm-pkg: rename binkernel.spec to kernel.spec	2023-07-25 00:59:33 +09:00
.mailmap	for-linus-2023083101	2023-09-01 12:31:44 -07:00
.rustfmt.toml
COPYING
CREDITS	USB: Remove Wireless USB and UWB documentation	2023-08-09 14:17:32 +02:00
Kbuild
Kconfig
MAINTAINERS	MAINTAINERS: Add entry for bcachefs	2023-10-22 17:08:07 -04:00
Makefile	Linux 6.6-rc1	2023-09-10 16:28:41 -07:00
README

README

Linux kernel
============

There are several guides for kernel developers and users. These guides can
be rendered in a number of formats, like HTML and PDF. Please read
Documentation/admin-guide/README.rst first.

In order to build the documentation, use ``make htmldocs`` or
``make pdfdocs``.  The formatted documentation can also be read online at:

    https://www.kernel.org/doc/html/latest/

There are various text files in the Documentation/ subdirectory,
several of them using the Restructured Text markup notation.

Please read the Documentation/process/changes.rst file, as it contains the
requirements for building and running the kernel, and information about
the problems which may result by upgrading your kernel.