linux

iv/linux

History

Radoslaw Burny 5ec27ec735 fs/proc/proc_sysctl.c: fix the default values of i_uid/i_gid on /proc/sys inodes. Normally, the inode's i_uid/i_gid are translated relative to s_user_ns, but this is not a correct behavior for proc. Since sysctl permission check in test_perm is done against GLOBAL_ROOT_[UG]ID, it makes more sense to use these values in u_[ug]id of proc inodes. In other words: although uid/gid in the inode is not read during test_perm, the inode logically belongs to the root of the namespace. I have confirmed this with Eric Biederman at LPC and in this thread: https://lore.kernel.org/lkml/87k1kzjdff.fsf@xmission.com Consequences ============ Since the i_[ug]id values of proc nodes are not used for permissions checks, this change usually makes no functional difference. However, it causes an issue in a setup where: * a namespace container is created without root user in container - hence the i_[ug]id of proc nodes are set to INVALID_[UG]ID * container creator tries to configure it by writing /proc/sys files, e.g. writing /proc/sys/kernel/shmmax to configure shared memory limit Kernel does not allow to open an inode for writing if its i_[ug]id are invalid, making it impossible to write shmmax and thus - configure the container. Using a container with no root mapping is apparently rare, but we do use this configuration at Google. Also, we use a generic tool to configure the container limits, and the inability to write any of them causes a failure. History ======= The invalid uids/gids in inodes first appeared due to `8175435777` (fs: Update i_[ug]id_(read\|write) to translate relative to s_user_ns). However, AFAIK, this did not immediately cause any issues. The inability to write to these "invalid" inodes was only caused by a later commit `0bd23d09b8` (vfs: Don't modify inodes with a uid or gid unknown to the vfs). Tested: Used a repro program that creates a user namespace without any mapping and stat'ed /proc/$PID/root/proc/sys/kernel/shmmax from outside. Before the change, it shows the overflow uid, with the change it's 0. The overflow uid indicates that the uid in the inode is not correct and thus it is not possible to open the file for writing. Link: http://lkml.kernel.org/r/20190708115130.250149-1-rburny@google.com Fixes: `0bd23d09b8` ("vfs: Don't modify inodes with a uid or gid unknown to the vfs") Signed-off-by: Radoslaw Burny <rburny@google.com> Acked-by: Luis Chamberlain <mcgrof@kernel.org> Cc: Kees Cook <keescook@chromium.org> Cc: "Eric W . Biederman" <ebiederm@xmission.com> Cc: Seth Forshee <seth.forshee@canonical.com> Cc: John Sperbeck <jsperbeck@google.com> Cc: Alexey Dobriyan <adobriyan@gmail.com> Cc: <stable@vger.kernel.org> [4.8+] Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>		2019-07-16 19:23:21 -07:00
..
array.c	Merge branch 'sched-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2019-07-08 16:39:53 -07:00
base.c	oom: decouple mems_allowed from oom_unkillable_task	2019-07-12 11:05:47 -07:00
cmdline.c	proc: introduce proc_create_single{,_data}	2018-05-16 07:23:35 +02:00
consoles.c	treewide: Replace GPLv2 boilerplate/reference with SPDX - rule 191	2019-05-30 11:29:21 -07:00
cpuinfo.c	x86 / CPU: Always show current CPU frequency in /proc/cpuinfo	2017-11-15 19:46:50 +01:00
devices.c	proc: introduce proc_create_seq{,_data}	2018-05-16 07:23:35 +02:00
fd.c	proc: use "unsigned int" in proc_fill_cache()	2018-06-07 17:34:38 -07:00
fd.h	License cleanup: add SPDX GPL-2.0 license identifier to files with no license	2017-11-02 11:10:55 +01:00
generic.c	treewide: Add SPDX license identifier for missed files	2019-05-21 10:50:45 +02:00
inode.c	fs/proc/inode.c: use typeof_member() macro	2019-07-16 19:23:21 -07:00
internal.h	treewide: Replace GPLv2 boilerplate/reference with SPDX - rule 152	2019-05-30 11:26:32 -07:00
interrupts.c	proc: introduce proc_create_seq{,_data}	2018-05-16 07:23:35 +02:00
Kconfig	vmcore: add a kernel parameter novmcoredd	2019-07-16 19:23:21 -07:00
kcore.c	fs/proc/kcore.c: make kcore_modules static	2019-03-29 10:01:37 -07:00
kmsg.c	vfs: do bulk POLL* -> EPOLL* replacement	2018-02-11 14:34:03 -08:00
loadavg.c	sched: loadavg: consolidate LOAD_INT, LOAD_FRAC, CALC_LOAD	2018-10-26 16:26:32 -07:00
Makefile	proc: : uninline name_to_int()	2017-11-17 16:10:00 -08:00
meminfo.c	mm: vmalloc: show number of vmalloc pages in /proc/meminfo	2019-07-12 11:05:47 -07:00
namespaces.c	procfs: switch instantiate_t to d_splice_alias()	2018-05-26 14:20:50 -04:00
nommu.c	treewide: Replace GPLv2 boilerplate/reference with SPDX - rule 152	2019-05-30 11:26:32 -07:00
page.c	mm: convert PG_balloon to PG_offline	2019-03-05 21:07:14 -08:00
proc_net.c	treewide: Add SPDX license identifier for missed files	2019-05-21 10:50:45 +02:00
proc_sysctl.c	fs/proc/proc_sysctl.c: fix the default values of i_uid/i_gid on /proc/sys inodes.	2019-07-16 19:23:21 -07:00
proc_tty.c	tty: replace ->proc_fops with ->proc_show	2018-05-16 07:24:30 +02:00
root.c	fanotify: Disallow permission events for proc filesystem	2019-05-28 18:10:07 +02:00
self.c	fs/proc/self.c: code cleanup for proc_setup_self()	2019-03-05 21:07:21 -08:00
softirqs.c	proc: introduce proc_create_single{,_data}	2018-05-16 07:23:35 +02:00
stat.c	Merge branch 'akpm' (patches from Andrew)	2019-03-06 10:31:36 -08:00
task_mmu.c	HMM patches for 5.3	2019-07-14 19:42:11 -07:00
task_nommu.c	proc: use down_read_killable mmap_sem for /proc/pid/maps	2019-07-12 11:05:46 -07:00
thread_self.c	fs/proc/thread_self.c: code cleanup for proc_setup_thread_self()	2019-03-05 21:07:21 -08:00
uptime.c	fs/proc/uptime.c: use ktime_get_boottime_ts64	2018-08-22 10:52:45 -07:00
util.c	fs/proc/util.c: include fs/proc/internal.h for name_to_int()	2019-01-04 13:13:45 -08:00
version.c	proc: introduce proc_create_single{,_data}	2018-05-16 07:23:35 +02:00
vmcore.c	vmcore: add a kernel parameter novmcoredd	2019-07-16 19:23:21 -07:00