2019-05-27 09:55:01 +03:00
// SPDX-License-Identifier: GPL-2.0-or-later
2005-04-17 02:20:36 +04:00
/*
* Security plug functions
*
* Copyright ( C ) 2001 WireX Communications , Inc < chris @ wirex . com >
* Copyright ( C ) 2001 - 2002 Greg Kroah - Hartman < greg @ kroah . com >
* Copyright ( C ) 2001 Networks Associates Technology , Inc < ssmalley @ nai . com >
IB/core: Enforce PKey security on QPs
Add new LSM hooks to allocate and free security contexts and check for
permission to access a PKey.
Allocate and free a security context when creating and destroying a QP.
This context is used for controlling access to PKeys.
When a request is made to modify a QP that changes the port, PKey index,
or alternate path, check that the QP has permission for the PKey in the
PKey table index on the subnet prefix of the port. If the QP is shared
make sure all handles to the QP also have access.
Store which port and PKey index a QP is using. After the reset to init
transition the user can modify the port, PKey index and alternate path
independently. So port and PKey settings changes can be a merge of the
previous settings and the new ones.
In order to maintain access control if there are PKey table or subnet
prefix change keep a list of all QPs are using each PKey index on
each port. If a change occurs all QPs using that device and port must
have access enforced for the new cache settings.
These changes add a transaction to the QP modify process. Association
with the old port and PKey index must be maintained if the modify fails,
and must be removed if it succeeds. Association with the new port and
PKey index must be established prior to the modify and removed if the
modify fails.
1. When a QP is modified to a particular Port, PKey index or alternate
path insert that QP into the appropriate lists.
2. Check permission to access the new settings.
3. If step 2 grants access attempt to modify the QP.
4a. If steps 2 and 3 succeed remove any prior associations.
4b. If ether fails remove the new setting associations.
If a PKey table or subnet prefix changes walk the list of QPs and
check that they have permission. If not send the QP to the error state
and raise a fatal error event. If it's a shared QP make sure all the
QPs that share the real_qp have permission as well. If the QP that
owns a security structure is denied access the security structure is
marked as such and the QP is added to an error_list. Once the moving
the QP to error is complete the security structure mark is cleared.
Maintaining the lists correctly turns QP destroy into a transaction.
The hardware driver for the device frees the ib_qp structure, so while
the destroy is in progress the ib_qp pointer in the ib_qp_security
struct is undefined. When the destroy process begins the ib_qp_security
structure is marked as destroying. This prevents any action from being
taken on the QP pointer. After the QP is destroyed successfully it
could still listed on an error_list wait for it to be processed by that
flow before cleaning up the structure.
If the destroy fails the QPs port and PKey settings are reinserted into
the appropriate lists, the destroying flag is cleared, and access control
is enforced, in case there were any cache changes during the destroy
flow.
To keep the security changes isolated a new file is used to hold security
related functionality.
Signed-off-by: Daniel Jurgens <danielj@mellanox.com>
Acked-by: Doug Ledford <dledford@redhat.com>
[PM: merge fixup in ib_verbs.h and uverbs_cmd.c]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2017-05-19 15:48:52 +03:00
* Copyright ( C ) 2016 Mellanox Technologies
2005-04-17 02:20:36 +04:00
*/
2018-10-11 03:18:25 +03:00
# define pr_fmt(fmt) "LSM: " fmt
2017-10-18 23:00:24 +03:00
# include <linux/bpf.h>
2006-01-11 23:17:46 +03:00
# include <linux/capability.h>
2013-05-22 20:50:34 +04:00
# include <linux/dcache.h>
2018-12-09 23:36:29 +03:00
# include <linux/export.h>
2005-04-17 02:20:36 +04:00
# include <linux/init.h>
# include <linux/kernel.h>
2020-10-02 20:38:15 +03:00
# include <linux/kernel_read_file.h>
2015-05-03 01:10:46 +03:00
# include <linux/lsm_hooks.h>
2011-03-09 22:13:22 +03:00
# include <linux/integrity.h>
2009-10-23 01:30:13 +04:00
# include <linux/ima.h>
2011-03-09 22:38:26 +03:00
# include <linux/evm.h>
2012-02-13 07:58:52 +04:00
# include <linux/fsnotify.h>
2012-05-31 01:11:23 +04:00
# include <linux/mman.h>
# include <linux/mount.h>
# include <linux/personality.h>
2012-07-02 09:34:11 +04:00
# include <linux/backing-dev.h>
LSM: Enable multiple calls to security_add_hooks() for the same LSM
The commit d69dece5f5b6 ("LSM: Add /sys/kernel/security/lsm") extend
security_add_hooks() with a new parameter to register the LSM name,
which may be useful to make the list of currently loaded LSM available
to userspace. However, there is no clean way for an LSM to split its
hook declarations into multiple files, which may reduce the mess with
all the included files (needed for LSM hook argument types) and make the
source code easier to review and maintain.
This change allows an LSM to register multiple times its hook while
keeping a consistent list of LSM names as described in
Documentation/security/LSM.txt . The list reflects the order in which
checks are made. This patch only check for the last registered LSM. If
an LSM register multiple times its hooks, interleaved with other LSM
registrations (which should not happen), its name will still appear in
the same order that the hooks are called, hence multiple times.
To sum up, "capability,selinux,foo,foo" will be replaced with
"capability,selinux,foo", however "capability,foo,selinux,foo" will
remain as is.
Signed-off-by: Mickaël Salaün <mic@digikod.net>
Acked-by: Kees Cook <keescook@chromium.org>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Signed-off-by: James Morris <james.l.morris@oracle.com>
2017-05-10 23:48:48 +03:00
# include <linux/string.h>
2018-11-20 22:55:02 +03:00
# include <linux/msg.h>
2012-02-13 07:58:52 +04:00
# include <net/flow.h>
2005-04-17 02:20:36 +04:00
2011-06-16 05:19:10 +04:00
# define MAX_LSM_EVM_XATTR 2
2005-04-17 02:20:36 +04:00
2018-09-20 02:58:31 +03:00
/* How many LSMs were built into the kernel? */
# define LSM_COUNT (__end_lsm_info - __start_lsm_info)
security,lockdown,selinux: implement SELinux lockdown
Implement a SELinux hook for lockdown. If the lockdown module is also
enabled, then a denial by the lockdown module will take precedence over
SELinux, so SELinux can only further restrict lockdown decisions.
The SELinux hook only distinguishes at the granularity of integrity
versus confidentiality similar to the lockdown module, but includes the
full lockdown reason as part of the audit record as a hint in diagnosing
what triggered the denial. To support this auditing, move the
lockdown_reasons[] string array from being private to the lockdown
module to the security framework so that it can be used by the lsm audit
code and so that it is always available even when the lockdown module
is disabled.
Note that the SELinux implementation allows the integrity and
confidentiality reasons to be controlled independently from one another.
Thus, in an SELinux policy, one could allow operations that specify
an integrity reason while blocking operations that specify a
confidentiality reason. The SELinux hook implementation is
stricter than the lockdown module in validating the provided reason value.
Sample AVC audit output from denials:
avc: denied { integrity } for pid=3402 comm="fwupd"
lockdown_reason="/dev/mem,kmem,port" scontext=system_u:system_r:fwupd_t:s0
tcontext=system_u:system_r:fwupd_t:s0 tclass=lockdown permissive=0
avc: denied { confidentiality } for pid=4628 comm="cp"
lockdown_reason="/proc/kcore access"
scontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tcontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tclass=lockdown permissive=0
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Reviewed-by: James Morris <jamorris@linux.microsoft.com>
[PM: some merge fuzz do the the perf hooks]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2019-11-27 20:04:36 +03:00
/*
* These are descriptions of the reasons that can be passed to the
* security_locked_down ( ) LSM hook . Placing this array here allows
* all security modules to use the same descriptions for auditing
* purposes .
*/
const char * const lockdown_reasons [ LOCKDOWN_CONFIDENTIALITY_MAX + 1 ] = {
[ LOCKDOWN_NONE ] = " none " ,
[ LOCKDOWN_MODULE_SIGNATURE ] = " unsigned module loading " ,
[ LOCKDOWN_DEV_MEM ] = " /dev/mem,kmem,port " ,
[ LOCKDOWN_EFI_TEST ] = " /dev/efi_test access " ,
[ LOCKDOWN_KEXEC ] = " kexec of unsigned images " ,
[ LOCKDOWN_HIBERNATION ] = " hibernation " ,
[ LOCKDOWN_PCI_ACCESS ] = " direct PCI access " ,
[ LOCKDOWN_IOPORT ] = " raw io port access " ,
[ LOCKDOWN_MSR ] = " raw MSR access " ,
[ LOCKDOWN_ACPI_TABLES ] = " modifying ACPI tables " ,
2022-09-26 16:16:42 +03:00
[ LOCKDOWN_DEVICE_TREE ] = " modifying device tree contents " ,
security,lockdown,selinux: implement SELinux lockdown
Implement a SELinux hook for lockdown. If the lockdown module is also
enabled, then a denial by the lockdown module will take precedence over
SELinux, so SELinux can only further restrict lockdown decisions.
The SELinux hook only distinguishes at the granularity of integrity
versus confidentiality similar to the lockdown module, but includes the
full lockdown reason as part of the audit record as a hint in diagnosing
what triggered the denial. To support this auditing, move the
lockdown_reasons[] string array from being private to the lockdown
module to the security framework so that it can be used by the lsm audit
code and so that it is always available even when the lockdown module
is disabled.
Note that the SELinux implementation allows the integrity and
confidentiality reasons to be controlled independently from one another.
Thus, in an SELinux policy, one could allow operations that specify
an integrity reason while blocking operations that specify a
confidentiality reason. The SELinux hook implementation is
stricter than the lockdown module in validating the provided reason value.
Sample AVC audit output from denials:
avc: denied { integrity } for pid=3402 comm="fwupd"
lockdown_reason="/dev/mem,kmem,port" scontext=system_u:system_r:fwupd_t:s0
tcontext=system_u:system_r:fwupd_t:s0 tclass=lockdown permissive=0
avc: denied { confidentiality } for pid=4628 comm="cp"
lockdown_reason="/proc/kcore access"
scontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tcontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tclass=lockdown permissive=0
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Reviewed-by: James Morris <jamorris@linux.microsoft.com>
[PM: some merge fuzz do the the perf hooks]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2019-11-27 20:04:36 +03:00
[ LOCKDOWN_PCMCIA_CIS ] = " direct PCMCIA CIS storage " ,
[ LOCKDOWN_TIOCSSERIAL ] = " reconfiguration of serial port IO " ,
[ LOCKDOWN_MODULE_PARAMETERS ] = " unsafe module parameters " ,
[ LOCKDOWN_MMIOTRACE ] = " unsafe mmio " ,
[ LOCKDOWN_DEBUGFS ] = " debugfs access " ,
[ LOCKDOWN_XMON_WR ] = " xmon write access " ,
bpf: Add lockdown check for probe_write_user helper
Back then, commit 96ae52279594 ("bpf: Add bpf_probe_write_user BPF helper
to be called in tracers") added the bpf_probe_write_user() helper in order
to allow to override user space memory. Its original goal was to have a
facility to "debug, divert, and manipulate execution of semi-cooperative
processes" under CAP_SYS_ADMIN. Write to kernel was explicitly disallowed
since it would otherwise tamper with its integrity.
One use case was shown in cf9b1199de27 ("samples/bpf: Add test/example of
using bpf_probe_write_user bpf helper") where the program DNATs traffic
at the time of connect(2) syscall, meaning, it rewrites the arguments to
a syscall while they're still in userspace, and before the syscall has a
chance to copy the argument into kernel space. These days we have better
mechanisms in BPF for achieving the same (e.g. for load-balancers), but
without having to write to userspace memory.
Of course the bpf_probe_write_user() helper can also be used to abuse
many other things for both good or bad purpose. Outside of BPF, there is
a similar mechanism for ptrace(2) such as PTRACE_PEEK{TEXT,DATA} and
PTRACE_POKE{TEXT,DATA}, but would likely require some more effort.
Commit 96ae52279594 explicitly dedicated the helper for experimentation
purpose only. Thus, move the helper's availability behind a newly added
LOCKDOWN_BPF_WRITE_USER lockdown knob so that the helper is disabled under
the "integrity" mode. More fine-grained control can be implemented also
from LSM side with this change.
Fixes: 96ae52279594 ("bpf: Add bpf_probe_write_user BPF helper to be called in tracers")
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Andrii Nakryiko <andrii@kernel.org>
2021-08-09 13:43:17 +03:00
[ LOCKDOWN_BPF_WRITE_USER ] = " use of bpf to write user RAM " ,
2022-05-23 21:11:02 +03:00
[ LOCKDOWN_DBG_WRITE_KERNEL ] = " use of kgdb/kdb to write kernel RAM " ,
2022-09-26 16:16:43 +03:00
[ LOCKDOWN_RTAS_ERROR_INJECTION ] = " RTAS error injection " ,
security,lockdown,selinux: implement SELinux lockdown
Implement a SELinux hook for lockdown. If the lockdown module is also
enabled, then a denial by the lockdown module will take precedence over
SELinux, so SELinux can only further restrict lockdown decisions.
The SELinux hook only distinguishes at the granularity of integrity
versus confidentiality similar to the lockdown module, but includes the
full lockdown reason as part of the audit record as a hint in diagnosing
what triggered the denial. To support this auditing, move the
lockdown_reasons[] string array from being private to the lockdown
module to the security framework so that it can be used by the lsm audit
code and so that it is always available even when the lockdown module
is disabled.
Note that the SELinux implementation allows the integrity and
confidentiality reasons to be controlled independently from one another.
Thus, in an SELinux policy, one could allow operations that specify
an integrity reason while blocking operations that specify a
confidentiality reason. The SELinux hook implementation is
stricter than the lockdown module in validating the provided reason value.
Sample AVC audit output from denials:
avc: denied { integrity } for pid=3402 comm="fwupd"
lockdown_reason="/dev/mem,kmem,port" scontext=system_u:system_r:fwupd_t:s0
tcontext=system_u:system_r:fwupd_t:s0 tclass=lockdown permissive=0
avc: denied { confidentiality } for pid=4628 comm="cp"
lockdown_reason="/proc/kcore access"
scontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tcontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tclass=lockdown permissive=0
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Reviewed-by: James Morris <jamorris@linux.microsoft.com>
[PM: some merge fuzz do the the perf hooks]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2019-11-27 20:04:36 +03:00
[ LOCKDOWN_INTEGRITY_MAX ] = " integrity " ,
[ LOCKDOWN_KCORE ] = " /proc/kcore access " ,
[ LOCKDOWN_KPROBES ] = " use of kprobes " ,
2021-08-09 22:45:32 +03:00
[ LOCKDOWN_BPF_READ_KERNEL ] = " use of bpf to read kernel RAM " ,
2022-05-23 21:11:02 +03:00
[ LOCKDOWN_DBG_READ_KERNEL ] = " use of kgdb/kdb to read kernel RAM " ,
security,lockdown,selinux: implement SELinux lockdown
Implement a SELinux hook for lockdown. If the lockdown module is also
enabled, then a denial by the lockdown module will take precedence over
SELinux, so SELinux can only further restrict lockdown decisions.
The SELinux hook only distinguishes at the granularity of integrity
versus confidentiality similar to the lockdown module, but includes the
full lockdown reason as part of the audit record as a hint in diagnosing
what triggered the denial. To support this auditing, move the
lockdown_reasons[] string array from being private to the lockdown
module to the security framework so that it can be used by the lsm audit
code and so that it is always available even when the lockdown module
is disabled.
Note that the SELinux implementation allows the integrity and
confidentiality reasons to be controlled independently from one another.
Thus, in an SELinux policy, one could allow operations that specify
an integrity reason while blocking operations that specify a
confidentiality reason. The SELinux hook implementation is
stricter than the lockdown module in validating the provided reason value.
Sample AVC audit output from denials:
avc: denied { integrity } for pid=3402 comm="fwupd"
lockdown_reason="/dev/mem,kmem,port" scontext=system_u:system_r:fwupd_t:s0
tcontext=system_u:system_r:fwupd_t:s0 tclass=lockdown permissive=0
avc: denied { confidentiality } for pid=4628 comm="cp"
lockdown_reason="/proc/kcore access"
scontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tcontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tclass=lockdown permissive=0
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Reviewed-by: James Morris <jamorris@linux.microsoft.com>
[PM: some merge fuzz do the the perf hooks]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2019-11-27 20:04:36 +03:00
[ LOCKDOWN_PERF ] = " unsafe use of perf " ,
[ LOCKDOWN_TRACEFS ] = " use of tracefs " ,
[ LOCKDOWN_XMON_RW ] = " xmon read and write access " ,
2020-11-17 19:47:23 +03:00
[ LOCKDOWN_XFRM_SECRET ] = " xfrm SA secret " ,
security,lockdown,selinux: implement SELinux lockdown
Implement a SELinux hook for lockdown. If the lockdown module is also
enabled, then a denial by the lockdown module will take precedence over
SELinux, so SELinux can only further restrict lockdown decisions.
The SELinux hook only distinguishes at the granularity of integrity
versus confidentiality similar to the lockdown module, but includes the
full lockdown reason as part of the audit record as a hint in diagnosing
what triggered the denial. To support this auditing, move the
lockdown_reasons[] string array from being private to the lockdown
module to the security framework so that it can be used by the lsm audit
code and so that it is always available even when the lockdown module
is disabled.
Note that the SELinux implementation allows the integrity and
confidentiality reasons to be controlled independently from one another.
Thus, in an SELinux policy, one could allow operations that specify
an integrity reason while blocking operations that specify a
confidentiality reason. The SELinux hook implementation is
stricter than the lockdown module in validating the provided reason value.
Sample AVC audit output from denials:
avc: denied { integrity } for pid=3402 comm="fwupd"
lockdown_reason="/dev/mem,kmem,port" scontext=system_u:system_r:fwupd_t:s0
tcontext=system_u:system_r:fwupd_t:s0 tclass=lockdown permissive=0
avc: denied { confidentiality } for pid=4628 comm="cp"
lockdown_reason="/proc/kcore access"
scontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tcontext=unconfined_u:unconfined_r:test_lockdown_integrity_t:s0-s0:c0.c1023
tclass=lockdown permissive=0
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Reviewed-by: James Morris <jamorris@linux.microsoft.com>
[PM: some merge fuzz do the the perf hooks]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2019-11-27 20:04:36 +03:00
[ LOCKDOWN_CONFIDENTIALITY_MAX ] = " confidentiality " ,
} ;
2017-03-22 13:46:19 +03:00
struct security_hook_heads security_hook_heads __lsm_ro_after_init ;
2019-06-14 15:20:14 +03:00
static BLOCKING_NOTIFIER_HEAD ( blocking_lsm_notifier_chain ) ;
2017-05-19 15:48:53 +03:00
2018-11-12 23:02:49 +03:00
static struct kmem_cache * lsm_file_cache ;
2018-09-22 03:19:29 +03:00
static struct kmem_cache * lsm_inode_cache ;
2018-11-12 23:02:49 +03:00
2017-01-19 04:09:05 +03:00
char * lsm_names ;
2018-11-12 20:30:56 +03:00
static struct lsm_blob_sizes blob_sizes __lsm_ro_after_init ;
2008-03-06 19:09:10 +03:00
/* Boot-time LSM user choice */
2018-09-20 03:30:09 +03:00
static __initdata const char * chosen_lsm_order ;
2018-09-19 23:11:41 +03:00
static __initdata const char * chosen_major_lsm ;
2005-04-17 02:20:36 +04:00
2018-10-10 00:27:46 +03:00
static __initconst const char * const builtin_lsm_order = CONFIG_LSM ;
2018-09-20 02:58:31 +03:00
/* Ordered list of LSMs to initialize. */
static __initdata struct lsm_info * * ordered_lsms ;
2018-09-20 05:57:06 +03:00
static __initdata struct lsm_info * exclusive ;
2018-09-20 02:58:31 +03:00
2018-10-11 03:18:25 +03:00
static __initdata bool debug ;
# define init_debug(...) \
do { \
if ( debug ) \
pr_info ( __VA_ARGS__ ) ; \
} while ( 0 )
2018-09-14 09:17:50 +03:00
static bool __init is_enabled ( struct lsm_info * lsm )
{
2018-10-10 00:42:57 +03:00
if ( ! lsm - > enabled )
return false ;
2018-09-14 09:17:50 +03:00
2018-10-10 00:42:57 +03:00
return * lsm - > enabled ;
2018-09-14 09:17:50 +03:00
}
/* Mark an LSM's enabled flag. */
static int lsm_enabled_true __initdata = 1 ;
static int lsm_enabled_false __initdata = 0 ;
static void __init set_enabled ( struct lsm_info * lsm , bool enabled )
{
/*
* When an LSM hasn ' t configured an enable variable , we can use
* a hard - coded location for storing the default enabled state .
*/
if ( ! lsm - > enabled ) {
if ( enabled )
lsm - > enabled = & lsm_enabled_true ;
else
lsm - > enabled = & lsm_enabled_false ;
} else if ( lsm - > enabled = = & lsm_enabled_true ) {
if ( ! enabled )
lsm - > enabled = & lsm_enabled_false ;
} else if ( lsm - > enabled = = & lsm_enabled_false ) {
if ( enabled )
lsm - > enabled = & lsm_enabled_true ;
} else {
* lsm - > enabled = enabled ;
}
}
2018-09-20 02:58:31 +03:00
/* Is an LSM already listed in the ordered LSMs list? */
static bool __init exists_ordered_lsm ( struct lsm_info * lsm )
{
struct lsm_info * * check ;
for ( check = ordered_lsms ; * check ; check + + )
if ( * check = = lsm )
return true ;
return false ;
}
/* Append an LSM to the list of ordered LSMs to initialize. */
static int last_lsm __initdata ;
static void __init append_ordered_lsm ( struct lsm_info * lsm , const char * from )
{
/* Ignore duplicate selections. */
if ( exists_ordered_lsm ( lsm ) )
return ;
if ( WARN ( last_lsm = = LSM_COUNT , " %s: out of LSM slots!? \n " , from ) )
return ;
2018-10-10 00:42:57 +03:00
/* Enable this LSM, if it is not already set. */
if ( ! lsm - > enabled )
lsm - > enabled = & lsm_enabled_true ;
2018-09-20 02:58:31 +03:00
ordered_lsms [ last_lsm + + ] = lsm ;
2018-10-10 00:42:57 +03:00
2018-09-20 02:58:31 +03:00
init_debug ( " %s ordering: %s (%sabled) \n " , from , lsm - > name ,
is_enabled ( lsm ) ? " en " : " dis " ) ;
}
2018-09-14 09:17:50 +03:00
/* Is an LSM allowed to be initialized? */
static bool __init lsm_allowed ( struct lsm_info * lsm )
{
/* Skip if the LSM is disabled. */
if ( ! is_enabled ( lsm ) )
return false ;
2018-09-20 05:57:06 +03:00
/* Not allowed if another exclusive LSM already initialized. */
if ( ( lsm - > flags & LSM_FLAG_EXCLUSIVE ) & & exclusive ) {
init_debug ( " exclusive disabled: %s \n " , lsm - > name ) ;
return false ;
}
2018-09-14 09:17:50 +03:00
return true ;
}
2018-11-12 20:30:56 +03:00
static void __init lsm_set_blob_size ( int * need , int * lbs )
{
int offset ;
if ( * need > 0 ) {
offset = * lbs ;
* lbs + = * need ;
* need = offset ;
}
}
static void __init lsm_set_blob_sizes ( struct lsm_blob_sizes * needed )
{
if ( ! needed )
return ;
lsm_set_blob_size ( & needed - > lbs_cred , & blob_sizes . lbs_cred ) ;
2018-11-12 23:02:49 +03:00
lsm_set_blob_size ( & needed - > lbs_file , & blob_sizes . lbs_file ) ;
2018-09-22 03:19:29 +03:00
/*
* The inode blob gets an rcu_head in addition to
* what the modules might need .
*/
if ( needed - > lbs_inode & & blob_sizes . lbs_inode = = 0 )
blob_sizes . lbs_inode = sizeof ( struct rcu_head ) ;
lsm_set_blob_size ( & needed - > lbs_inode , & blob_sizes . lbs_inode ) ;
2018-11-20 22:55:02 +03:00
lsm_set_blob_size ( & needed - > lbs_ipc , & blob_sizes . lbs_ipc ) ;
lsm_set_blob_size ( & needed - > lbs_msg_msg , & blob_sizes . lbs_msg_msg ) ;
2021-04-22 18:41:15 +03:00
lsm_set_blob_size ( & needed - > lbs_superblock , & blob_sizes . lbs_superblock ) ;
2018-09-22 03:19:37 +03:00
lsm_set_blob_size ( & needed - > lbs_task , & blob_sizes . lbs_task ) ;
2018-11-12 20:30:56 +03:00
}
2018-10-11 01:45:22 +03:00
/* Prepare LSM for initialization. */
static void __init prepare_lsm ( struct lsm_info * lsm )
2018-09-14 09:17:50 +03:00
{
int enabled = lsm_allowed ( lsm ) ;
/* Record enablement (to handle any following exclusive LSMs). */
set_enabled ( lsm , enabled ) ;
2018-10-11 01:45:22 +03:00
/* If enabled, do pre-initialization work. */
2018-09-14 09:17:50 +03:00
if ( enabled ) {
2018-09-20 05:57:06 +03:00
if ( ( lsm - > flags & LSM_FLAG_EXCLUSIVE ) & & ! exclusive ) {
exclusive = lsm ;
init_debug ( " exclusive chosen: %s \n " , lsm - > name ) ;
}
2018-11-12 20:30:56 +03:00
lsm_set_blob_sizes ( lsm - > blobs ) ;
2018-10-11 01:45:22 +03:00
}
}
/* Initialize a given LSM, if it is enabled. */
static void __init initialize_lsm ( struct lsm_info * lsm )
{
if ( is_enabled ( lsm ) ) {
int ret ;
2018-09-20 05:57:06 +03:00
2018-09-14 09:17:50 +03:00
init_debug ( " initializing %s \n " , lsm - > name ) ;
ret = lsm - > init ( ) ;
WARN ( ret , " %s failed to initialize: %d \n " , lsm - > name , ret ) ;
}
}
2018-10-10 00:27:46 +03:00
/* Populate ordered LSMs list from comma-separated LSM name list. */
2018-09-20 02:58:31 +03:00
static void __init ordered_lsm_parse ( const char * order , const char * origin )
2018-09-20 02:16:55 +03:00
{
struct lsm_info * lsm ;
2018-10-10 00:27:46 +03:00
char * sep , * name , * next ;
2018-09-20 03:48:21 +03:00
/* LSM_ORDER_FIRST is always first. */
for ( lsm = __start_lsm_info ; lsm < __end_lsm_info ; lsm + + ) {
if ( lsm - > order = = LSM_ORDER_FIRST )
append_ordered_lsm ( lsm , " first " ) ;
}
2018-09-19 23:32:15 +03:00
/* Process "security=", if given. */
if ( chosen_major_lsm ) {
struct lsm_info * major ;
/*
* To match the original " security= " behavior , this
* explicitly does NOT fallback to another Legacy Major
* if the selected one was separately disabled : disable
* all non - matching Legacy Major LSMs .
*/
for ( major = __start_lsm_info ; major < __end_lsm_info ;
major + + ) {
if ( ( major - > flags & LSM_FLAG_LEGACY_MAJOR ) & &
strcmp ( major - > name , chosen_major_lsm ) ! = 0 ) {
set_enabled ( major , false ) ;
init_debug ( " security=%s disabled: %s \n " ,
chosen_major_lsm , major - > name ) ;
}
}
}
2018-09-19 23:11:41 +03:00
2018-10-10 00:27:46 +03:00
sep = kstrdup ( order , GFP_KERNEL ) ;
next = sep ;
/* Walk the list, looking for matching LSMs. */
while ( ( name = strsep ( & next , " , " ) ) ! = NULL ) {
bool found = false ;
for ( lsm = __start_lsm_info ; lsm < __end_lsm_info ; lsm + + ) {
2018-09-20 03:48:21 +03:00
if ( lsm - > order = = LSM_ORDER_MUTABLE & &
strcmp ( lsm - > name , name ) = = 0 ) {
2018-10-10 00:27:46 +03:00
append_ordered_lsm ( lsm , origin ) ;
found = true ;
}
}
if ( ! found )
init_debug ( " %s ignored: %s \n " , origin , name ) ;
2018-09-20 02:16:55 +03:00
}
2018-11-20 05:04:32 +03:00
/* Process "security=", if given. */
if ( chosen_major_lsm ) {
for ( lsm = __start_lsm_info ; lsm < __end_lsm_info ; lsm + + ) {
if ( exists_ordered_lsm ( lsm ) )
continue ;
if ( strcmp ( lsm - > name , chosen_major_lsm ) = = 0 )
append_ordered_lsm ( lsm , " security= " ) ;
}
}
/* Disable all LSMs not in the ordered list. */
for ( lsm = __start_lsm_info ; lsm < __end_lsm_info ; lsm + + ) {
if ( exists_ordered_lsm ( lsm ) )
continue ;
set_enabled ( lsm , false ) ;
init_debug ( " %s disabled: %s \n " , origin , lsm - > name ) ;
}
2018-10-10 00:27:46 +03:00
kfree ( sep ) ;
2018-09-20 02:16:55 +03:00
}
2019-01-18 13:15:59 +03:00
static void __init lsm_early_cred ( struct cred * cred ) ;
static void __init lsm_early_task ( struct task_struct * task ) ;
2019-08-20 03:17:37 +03:00
static int lsm_append ( const char * new , char * * result ) ;
2018-09-20 02:58:31 +03:00
static void __init ordered_lsm_init ( void )
{
struct lsm_info * * lsm ;
ordered_lsms = kcalloc ( LSM_COUNT + 1 , sizeof ( * ordered_lsms ) ,
GFP_KERNEL ) ;
2019-02-12 21:23:18 +03:00
if ( chosen_lsm_order ) {
if ( chosen_major_lsm ) {
pr_info ( " security= is ignored because it is superseded by lsm= \n " ) ;
chosen_major_lsm = NULL ;
}
2018-09-20 03:30:09 +03:00
ordered_lsm_parse ( chosen_lsm_order , " cmdline " ) ;
2019-02-12 21:23:18 +03:00
} else
2018-09-20 03:30:09 +03:00
ordered_lsm_parse ( builtin_lsm_order , " builtin " ) ;
2018-09-20 02:58:31 +03:00
for ( lsm = ordered_lsms ; * lsm ; lsm + + )
2018-10-11 01:45:22 +03:00
prepare_lsm ( * lsm ) ;
2021-04-22 18:41:15 +03:00
init_debug ( " cred blob size = %d \n " , blob_sizes . lbs_cred ) ;
init_debug ( " file blob size = %d \n " , blob_sizes . lbs_file ) ;
init_debug ( " inode blob size = %d \n " , blob_sizes . lbs_inode ) ;
init_debug ( " ipc blob size = %d \n " , blob_sizes . lbs_ipc ) ;
init_debug ( " msg_msg blob size = %d \n " , blob_sizes . lbs_msg_msg ) ;
init_debug ( " superblock blob size = %d \n " , blob_sizes . lbs_superblock ) ;
init_debug ( " task blob size = %d \n " , blob_sizes . lbs_task ) ;
2018-11-12 23:02:49 +03:00
/*
* Create any kmem_caches needed for blobs
*/
if ( blob_sizes . lbs_file )
lsm_file_cache = kmem_cache_create ( " lsm_file_cache " ,
blob_sizes . lbs_file , 0 ,
SLAB_PANIC , NULL ) ;
2018-09-22 03:19:29 +03:00
if ( blob_sizes . lbs_inode )
lsm_inode_cache = kmem_cache_create ( " lsm_inode_cache " ,
blob_sizes . lbs_inode , 0 ,
SLAB_PANIC , NULL ) ;
2018-11-12 20:30:56 +03:00
2019-01-18 13:15:59 +03:00
lsm_early_cred ( ( struct cred * ) current - > cred ) ;
lsm_early_task ( current ) ;
2018-10-11 01:45:22 +03:00
for ( lsm = ordered_lsms ; * lsm ; lsm + + )
initialize_lsm ( * lsm ) ;
2018-09-20 02:58:31 +03:00
kfree ( ordered_lsms ) ;
}
2019-08-20 03:17:37 +03:00
int __init early_security_init ( void )
{
struct lsm_info * lsm ;
2022-04-07 20:59:30 +03:00
# define LSM_HOOK(RET, DEFAULT, NAME, ...) \
INIT_HLIST_HEAD ( & security_hook_heads . NAME ) ;
# include "linux/lsm_hook_defs.h"
# undef LSM_HOOK
2019-08-20 03:17:37 +03:00
for ( lsm = __start_early_lsm_info ; lsm < __end_early_lsm_info ; lsm + + ) {
if ( ! lsm - > enabled )
lsm - > enabled = & lsm_enabled_true ;
prepare_lsm ( lsm ) ;
initialize_lsm ( lsm ) ;
}
return 0 ;
}
2005-04-17 02:20:36 +04:00
/**
* security_init - initializes the security framework
*
* This should be called early in the kernel initialization sequence .
*/
int __init security_init ( void )
{
2019-08-20 03:17:37 +03:00
struct lsm_info * lsm ;
2017-03-22 13:46:19 +03:00
2018-10-11 03:18:17 +03:00
pr_info ( " Security Framework initializing \n " ) ;
2019-08-20 03:17:37 +03:00
/*
* Append the names of the early LSM modules now that kmalloc ( ) is
* available
*/
for ( lsm = __start_early_lsm_info ; lsm < __end_early_lsm_info ; lsm + + ) {
if ( lsm - > enabled )
lsm_append ( lsm - > name , & lsm_names ) ;
}
2005-04-17 02:20:36 +04:00
2018-09-20 02:16:55 +03:00
/* Load LSMs in specified order. */
ordered_lsm_init ( ) ;
2005-04-17 02:20:36 +04:00
return 0 ;
}
2008-03-06 19:09:10 +03:00
/* Save user chosen LSM */
2018-09-19 23:11:41 +03:00
static int __init choose_major_lsm ( char * str )
2008-03-06 19:09:10 +03:00
{
2018-09-19 23:11:41 +03:00
chosen_major_lsm = str ;
2008-03-06 19:09:10 +03:00
return 1 ;
}
2018-09-19 23:11:41 +03:00
__setup ( " security= " , choose_major_lsm ) ;
2008-03-06 19:09:10 +03:00
2018-09-20 03:30:09 +03:00
/* Explicitly choose LSM initialization order. */
static int __init choose_lsm_order ( char * str )
{
chosen_lsm_order = str ;
return 1 ;
}
__setup ( " lsm= " , choose_lsm_order ) ;
2018-10-11 03:18:25 +03:00
/* Enable LSM order debugging. */
static int __init enable_debug ( char * str )
{
debug = true ;
return 1 ;
}
__setup ( " lsm.debug " , enable_debug ) ;
LSM: Enable multiple calls to security_add_hooks() for the same LSM
The commit d69dece5f5b6 ("LSM: Add /sys/kernel/security/lsm") extend
security_add_hooks() with a new parameter to register the LSM name,
which may be useful to make the list of currently loaded LSM available
to userspace. However, there is no clean way for an LSM to split its
hook declarations into multiple files, which may reduce the mess with
all the included files (needed for LSM hook argument types) and make the
source code easier to review and maintain.
This change allows an LSM to register multiple times its hook while
keeping a consistent list of LSM names as described in
Documentation/security/LSM.txt . The list reflects the order in which
checks are made. This patch only check for the last registered LSM. If
an LSM register multiple times its hooks, interleaved with other LSM
registrations (which should not happen), its name will still appear in
the same order that the hooks are called, hence multiple times.
To sum up, "capability,selinux,foo,foo" will be replaced with
"capability,selinux,foo", however "capability,foo,selinux,foo" will
remain as is.
Signed-off-by: Mickaël Salaün <mic@digikod.net>
Acked-by: Kees Cook <keescook@chromium.org>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Signed-off-by: James Morris <james.l.morris@oracle.com>
2017-05-10 23:48:48 +03:00
static bool match_last_lsm ( const char * list , const char * lsm )
{
const char * last ;
if ( WARN_ON ( ! list | | ! lsm ) )
return false ;
last = strrchr ( list , ' , ' ) ;
if ( last )
/* Pass the comma, strcmp() will check for '\0' */
last + + ;
else
last = list ;
return ! strcmp ( last , lsm ) ;
}
2019-08-20 03:17:37 +03:00
static int lsm_append ( const char * new , char * * result )
2017-01-19 04:09:05 +03:00
{
char * cp ;
if ( * result = = NULL ) {
* result = kstrdup ( new , GFP_KERNEL ) ;
2018-07-17 20:36:04 +03:00
if ( * result = = NULL )
return - ENOMEM ;
2017-01-19 04:09:05 +03:00
} else {
LSM: Enable multiple calls to security_add_hooks() for the same LSM
The commit d69dece5f5b6 ("LSM: Add /sys/kernel/security/lsm") extend
security_add_hooks() with a new parameter to register the LSM name,
which may be useful to make the list of currently loaded LSM available
to userspace. However, there is no clean way for an LSM to split its
hook declarations into multiple files, which may reduce the mess with
all the included files (needed for LSM hook argument types) and make the
source code easier to review and maintain.
This change allows an LSM to register multiple times its hook while
keeping a consistent list of LSM names as described in
Documentation/security/LSM.txt . The list reflects the order in which
checks are made. This patch only check for the last registered LSM. If
an LSM register multiple times its hooks, interleaved with other LSM
registrations (which should not happen), its name will still appear in
the same order that the hooks are called, hence multiple times.
To sum up, "capability,selinux,foo,foo" will be replaced with
"capability,selinux,foo", however "capability,foo,selinux,foo" will
remain as is.
Signed-off-by: Mickaël Salaün <mic@digikod.net>
Acked-by: Kees Cook <keescook@chromium.org>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Signed-off-by: James Morris <james.l.morris@oracle.com>
2017-05-10 23:48:48 +03:00
/* Check if it is the last registered name */
if ( match_last_lsm ( * result , new ) )
return 0 ;
2017-01-19 04:09:05 +03:00
cp = kasprintf ( GFP_KERNEL , " %s,%s " , * result , new ) ;
if ( cp = = NULL )
return - ENOMEM ;
kfree ( * result ) ;
* result = cp ;
}
return 0 ;
}
/**
* security_add_hooks - Add a modules hooks to the hook lists .
* @ hooks : the hooks to add
* @ count : the number of hooks to add
* @ lsm : the name of the security module
*
* Each LSM has to register its hooks with the infrastructure .
*/
void __init security_add_hooks ( struct security_hook_list * hooks , int count ,
2022-02-17 17:18:57 +03:00
const char * lsm )
2017-01-19 04:09:05 +03:00
{
int i ;
for ( i = 0 ; i < count ; i + + ) {
hooks [ i ] . lsm = lsm ;
2018-03-29 04:28:23 +03:00
hlist_add_tail_rcu ( & hooks [ i ] . list , hooks [ i ] . head ) ;
2017-01-19 04:09:05 +03:00
}
2019-08-20 03:17:37 +03:00
/*
* Don ' t try to append during early_security_init ( ) , we ' ll come back
* and fix this up afterwards .
*/
if ( slab_is_available ( ) ) {
if ( lsm_append ( lsm , & lsm_names ) < 0 )
panic ( " %s - Cannot get early memory. \n " , __func__ ) ;
}
2017-01-19 04:09:05 +03:00
}
2019-06-14 15:20:14 +03:00
int call_blocking_lsm_notifier ( enum lsm_event event , void * data )
2017-05-19 15:48:53 +03:00
{
2019-06-14 15:20:14 +03:00
return blocking_notifier_call_chain ( & blocking_lsm_notifier_chain ,
event , data ) ;
2017-05-19 15:48:53 +03:00
}
2019-06-14 15:20:14 +03:00
EXPORT_SYMBOL ( call_blocking_lsm_notifier ) ;
2017-05-19 15:48:53 +03:00
2019-06-14 15:20:14 +03:00
int register_blocking_lsm_notifier ( struct notifier_block * nb )
2017-05-19 15:48:53 +03:00
{
2019-06-14 15:20:14 +03:00
return blocking_notifier_chain_register ( & blocking_lsm_notifier_chain ,
nb ) ;
2017-05-19 15:48:53 +03:00
}
2019-06-14 15:20:14 +03:00
EXPORT_SYMBOL ( register_blocking_lsm_notifier ) ;
2017-05-19 15:48:53 +03:00
2019-06-14 15:20:14 +03:00
int unregister_blocking_lsm_notifier ( struct notifier_block * nb )
2017-05-19 15:48:53 +03:00
{
2019-06-14 15:20:14 +03:00
return blocking_notifier_chain_unregister ( & blocking_lsm_notifier_chain ,
nb ) ;
2017-05-19 15:48:53 +03:00
}
2019-06-14 15:20:14 +03:00
EXPORT_SYMBOL ( unregister_blocking_lsm_notifier ) ;
2017-05-19 15:48:53 +03:00
2018-11-12 20:30:56 +03:00
/**
* lsm_cred_alloc - allocate a composite cred blob
* @ cred : the cred that needs a blob
* @ gfp : allocation type
*
* Allocate the cred blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
static int lsm_cred_alloc ( struct cred * cred , gfp_t gfp )
{
if ( blob_sizes . lbs_cred = = 0 ) {
cred - > security = NULL ;
return 0 ;
}
cred - > security = kzalloc ( blob_sizes . lbs_cred , gfp ) ;
if ( cred - > security = = NULL )
return - ENOMEM ;
return 0 ;
}
/**
* lsm_early_cred - during initialization allocate a composite cred blob
* @ cred : the cred that needs a blob
*
2019-01-18 13:15:59 +03:00
* Allocate the cred blob for all the modules
2018-11-12 20:30:56 +03:00
*/
2019-01-18 13:15:59 +03:00
static void __init lsm_early_cred ( struct cred * cred )
2018-11-12 20:30:56 +03:00
{
2019-01-18 13:15:59 +03:00
int rc = lsm_cred_alloc ( cred , GFP_KERNEL ) ;
2018-11-12 20:30:56 +03:00
if ( rc )
panic ( " %s: Early cred alloc failed. \n " , __func__ ) ;
}
2018-11-12 23:02:49 +03:00
/**
* lsm_file_alloc - allocate a composite file blob
* @ file : the file that needs a blob
*
* Allocate the file blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
static int lsm_file_alloc ( struct file * file )
{
if ( ! lsm_file_cache ) {
file - > f_security = NULL ;
return 0 ;
}
file - > f_security = kmem_cache_zalloc ( lsm_file_cache , GFP_KERNEL ) ;
if ( file - > f_security = = NULL )
return - ENOMEM ;
return 0 ;
}
2018-09-22 03:19:29 +03:00
/**
* lsm_inode_alloc - allocate a composite inode blob
* @ inode : the inode that needs a blob
*
* Allocate the inode blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
int lsm_inode_alloc ( struct inode * inode )
{
if ( ! lsm_inode_cache ) {
inode - > i_security = NULL ;
return 0 ;
}
inode - > i_security = kmem_cache_zalloc ( lsm_inode_cache , GFP_NOFS ) ;
if ( inode - > i_security = = NULL )
return - ENOMEM ;
return 0 ;
}
2018-09-22 03:19:37 +03:00
/**
* lsm_task_alloc - allocate a composite task blob
* @ task : the task that needs a blob
*
* Allocate the task blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
2019-01-16 08:44:32 +03:00
static int lsm_task_alloc ( struct task_struct * task )
2018-09-22 03:19:37 +03:00
{
if ( blob_sizes . lbs_task = = 0 ) {
task - > security = NULL ;
return 0 ;
}
task - > security = kzalloc ( blob_sizes . lbs_task , GFP_KERNEL ) ;
if ( task - > security = = NULL )
return - ENOMEM ;
return 0 ;
}
2018-11-20 22:55:02 +03:00
/**
* lsm_ipc_alloc - allocate a composite ipc blob
* @ kip : the ipc that needs a blob
*
* Allocate the ipc blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
2019-01-16 08:44:32 +03:00
static int lsm_ipc_alloc ( struct kern_ipc_perm * kip )
2018-11-20 22:55:02 +03:00
{
if ( blob_sizes . lbs_ipc = = 0 ) {
kip - > security = NULL ;
return 0 ;
}
kip - > security = kzalloc ( blob_sizes . lbs_ipc , GFP_KERNEL ) ;
if ( kip - > security = = NULL )
return - ENOMEM ;
return 0 ;
}
/**
* lsm_msg_msg_alloc - allocate a composite msg_msg blob
* @ mp : the msg_msg that needs a blob
*
* Allocate the ipc blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
2019-01-16 08:44:32 +03:00
static int lsm_msg_msg_alloc ( struct msg_msg * mp )
2018-11-20 22:55:02 +03:00
{
if ( blob_sizes . lbs_msg_msg = = 0 ) {
mp - > security = NULL ;
return 0 ;
}
mp - > security = kzalloc ( blob_sizes . lbs_msg_msg , GFP_KERNEL ) ;
if ( mp - > security = = NULL )
return - ENOMEM ;
return 0 ;
}
2018-09-22 03:19:37 +03:00
/**
* lsm_early_task - during initialization allocate a composite task blob
* @ task : the task that needs a blob
*
2019-01-18 13:15:59 +03:00
* Allocate the task blob for all the modules
2018-09-22 03:19:37 +03:00
*/
2019-01-18 13:15:59 +03:00
static void __init lsm_early_task ( struct task_struct * task )
2018-09-22 03:19:37 +03:00
{
2019-01-18 13:15:59 +03:00
int rc = lsm_task_alloc ( task ) ;
2018-09-22 03:19:37 +03:00
if ( rc )
panic ( " %s: Early task alloc failed. \n " , __func__ ) ;
}
2021-04-22 18:41:15 +03:00
/**
* lsm_superblock_alloc - allocate a composite superblock blob
* @ sb : the superblock that needs a blob
*
* Allocate the superblock blob for all the modules
*
* Returns 0 , or - ENOMEM if memory can ' t be allocated .
*/
static int lsm_superblock_alloc ( struct super_block * sb )
{
if ( blob_sizes . lbs_superblock = = 0 ) {
sb - > s_security = NULL ;
return 0 ;
}
sb - > s_security = kzalloc ( blob_sizes . lbs_superblock , GFP_KERNEL ) ;
if ( sb - > s_security = = NULL )
return - ENOMEM ;
return 0 ;
}
2020-03-29 03:43:50 +03:00
/*
* The default value of the LSM hook is defined in linux / lsm_hook_defs . h and
* can be accessed with :
*
* LSM_RET_DEFAULT ( < hook_name > )
*
* The macros below define static constants for the default value of each
* LSM hook .
*/
# define LSM_RET_DEFAULT(NAME) (NAME##_default)
# define DECLARE_LSM_RET_DEFAULT_void(DEFAULT, NAME)
# define DECLARE_LSM_RET_DEFAULT_int(DEFAULT, NAME) \
LSM: Avoid warnings about potentially unused hook variables
Building with W=1 shows many unused const variable warnings. These can
be silenced, as we're well aware of their being potentially unused:
./include/linux/lsm_hook_defs.h:36:18: error: 'ptrace_access_check_default' defined but not used [-Werror=unused-const-variable=]
36 | LSM_HOOK(int, 0, ptrace_access_check, struct task_struct *child,
| ^~~~~~~~~~~~~~~~~~~
security/security.c:706:32: note: in definition of macro 'LSM_RET_DEFAULT'
706 | #define LSM_RET_DEFAULT(NAME) (NAME##_default)
| ^~~~
security/security.c:711:9: note: in expansion of macro 'DECLARE_LSM_RET_DEFAULT_int'
711 | DECLARE_LSM_RET_DEFAULT_##RET(DEFAULT, NAME)
| ^~~~~~~~~~~~~~~~~~~~~~~~
./include/linux/lsm_hook_defs.h:36:1: note: in expansion of macro 'LSM_HOOK'
36 | LSM_HOOK(int, 0, ptrace_access_check, struct task_struct *child,
| ^~~~~~~~
Cc: James Morris <jmorris@namei.org>
Cc: "Serge E. Hallyn" <serge@hallyn.com>
Cc: Paul Moore <paul@paul-moore.com>
Cc: Casey Schaufler <casey@schaufler-ca.com>
Cc: KP Singh <kpsingh@chromium.org>
Cc: linux-security-module@vger.kernel.org
Reported-by: kernel test robot <lkp@intel.com>
Link: https://lore.kernel.org/linux-mm/202110131608.zms53FPR-lkp@intel.com/
Fixes: 98e828a0650f ("security: Refactor declaration of LSM hooks")
Signed-off-by: Kees Cook <keescook@chromium.org>
Acked-by: James Morris <jamorris@linux.microsoft.com>
Signed-off-by: Paul Moore <paul@paul-moore.com>
2021-10-13 20:28:48 +03:00
static const int __maybe_unused LSM_RET_DEFAULT ( NAME ) = ( DEFAULT ) ;
2020-03-29 03:43:50 +03:00
# define LSM_HOOK(RET, DEFAULT, NAME, ...) \
DECLARE_LSM_RET_DEFAULT_ # # RET ( DEFAULT , NAME )
# include <linux/lsm_hook_defs.h>
# undef LSM_HOOK
2015-05-03 01:11:29 +03:00
/*
2015-05-03 01:11:42 +03:00
* Hook list operation macros .
2005-04-17 02:20:36 +04:00
*
2015-05-03 01:11:29 +03:00
* call_void_hook :
* This is a hook that does not return a value .
2005-04-17 02:20:36 +04:00
*
2015-05-03 01:11:29 +03:00
* call_int_hook :
* This is a hook that returns a value .
2005-04-17 02:20:36 +04:00
*/
2015-05-03 01:11:42 +03:00
# define call_void_hook(FUNC, ...) \
do { \
struct security_hook_list * P ; \
\
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( P , & security_hook_heads . FUNC , list ) \
2015-05-03 01:11:42 +03:00
P - > hook . FUNC ( __VA_ARGS__ ) ; \
} while ( 0 )
# define call_int_hook(FUNC, IRC, ...) ({ \
int RC = IRC ; \
do { \
struct security_hook_list * P ; \
\
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( P , & security_hook_heads . FUNC , list ) { \
2015-05-03 01:11:42 +03:00
RC = P - > hook . FUNC ( __VA_ARGS__ ) ; \
if ( RC ! = 0 ) \
break ; \
} \
} while ( 0 ) ; \
RC ; \
} )
2005-04-17 02:20:36 +04:00
2007-10-17 10:31:32 +04:00
/* Security operations */
2021-10-12 19:56:13 +03:00
int security_binder_set_context_mgr ( const struct cred * mgr )
2015-01-21 18:54:10 +03:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( binder_set_context_mgr , 0 , mgr ) ;
2015-01-21 18:54:10 +03:00
}
2021-10-12 19:56:13 +03:00
int security_binder_transaction ( const struct cred * from ,
const struct cred * to )
2015-01-21 18:54:10 +03:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( binder_transaction , 0 , from , to ) ;
2015-01-21 18:54:10 +03:00
}
2021-10-12 19:56:13 +03:00
int security_binder_transfer_binder ( const struct cred * from ,
const struct cred * to )
2015-01-21 18:54:10 +03:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( binder_transfer_binder , 0 , from , to ) ;
2015-01-21 18:54:10 +03:00
}
2021-10-12 19:56:13 +03:00
int security_binder_transfer_file ( const struct cred * from ,
const struct cred * to , struct file * file )
2015-01-21 18:54:10 +03:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( binder_transfer_file , 0 , from , to , file ) ;
2015-01-21 18:54:10 +03:00
}
2009-05-07 13:26:19 +04:00
int security_ptrace_access_check ( struct task_struct * child , unsigned int mode )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( ptrace_access_check , 0 , child , mode ) ;
security: Fix setting of PF_SUPERPRIV by __capable()
Fix the setting of PF_SUPERPRIV by __capable() as it could corrupt the flags
the target process if that is not the current process and it is trying to
change its own flags in a different way at the same time.
__capable() is using neither atomic ops nor locking to protect t->flags. This
patch removes __capable() and introduces has_capability() that doesn't set
PF_SUPERPRIV on the process being queried.
This patch further splits security_ptrace() in two:
(1) security_ptrace_may_access(). This passes judgement on whether one
process may access another only (PTRACE_MODE_ATTACH for ptrace() and
PTRACE_MODE_READ for /proc), and takes a pointer to the child process.
current is the parent.
(2) security_ptrace_traceme(). This passes judgement on PTRACE_TRACEME only,
and takes only a pointer to the parent process. current is the child.
In Smack and commoncap, this uses has_capability() to determine whether
the parent will be permitted to use PTRACE_ATTACH if normal checks fail.
This does not set PF_SUPERPRIV.
Two of the instances of __capable() actually only act on current, and so have
been changed to calls to capable().
Of the places that were using __capable():
(1) The OOM killer calls __capable() thrice when weighing the killability of a
process. All of these now use has_capability().
(2) cap_ptrace() and smack_ptrace() were using __capable() to check to see
whether the parent was allowed to trace any process. As mentioned above,
these have been split. For PTRACE_ATTACH and /proc, capable() is now
used, and for PTRACE_TRACEME, has_capability() is used.
(3) cap_safe_nice() only ever saw current, so now uses capable().
(4) smack_setprocattr() rejected accesses to tasks other than current just
after calling __capable(), so the order of these two tests have been
switched and capable() is used instead.
(5) In smack_file_send_sigiotask(), we need to allow privileged processes to
receive SIGIO on files they're manipulating.
(6) In smack_task_wait(), we let a process wait for a privileged process,
whether or not the process doing the waiting is privileged.
I've tested this with the LTP SELinux and syscalls testscripts.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: Serge Hallyn <serue@us.ibm.com>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Acked-by: Andrew G. Morgan <morgan@kernel.org>
Acked-by: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: James Morris <jmorris@namei.org>
2008-08-14 14:37:28 +04:00
}
int security_ptrace_traceme ( struct task_struct * parent )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( ptrace_traceme , 0 , parent ) ;
2007-10-17 10:31:32 +04:00
}
int security_capget ( struct task_struct * target ,
kernel_cap_t * effective ,
kernel_cap_t * inheritable ,
kernel_cap_t * permitted )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( capget , 0 , target ,
effective , inheritable , permitted ) ;
2007-10-17 10:31:32 +04:00
}
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
int security_capset ( struct cred * new , const struct cred * old ,
const kernel_cap_t * effective ,
const kernel_cap_t * inheritable ,
const kernel_cap_t * permitted )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( capset , 0 , new , old ,
effective , inheritable , permitted ) ;
2007-10-17 10:31:32 +04:00
}
2019-01-08 03:10:53 +03:00
int security_capable ( const struct cred * cred ,
struct user_namespace * ns ,
int cap ,
unsigned int opts )
2007-10-17 10:31:32 +04:00
{
2019-01-08 03:10:53 +03:00
return call_int_hook ( capable , 0 , cred , ns , cap , opts ) ;
2007-10-17 10:31:32 +04:00
}
int security_quotactl ( int cmds , int type , int id , struct super_block * sb )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( quotactl , 0 , cmds , type , id , sb ) ;
2007-10-17 10:31:32 +04:00
}
int security_quota_on ( struct dentry * dentry )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( quota_on , 0 , dentry ) ;
2007-10-17 10:31:32 +04:00
}
2010-11-16 02:36:29 +03:00
int security_syslog ( int type )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( syslog , 0 , type ) ;
2007-10-17 10:31:32 +04:00
}
2016-04-08 09:02:11 +03:00
int security_settime64 ( const struct timespec64 * ts , const struct timezone * tz )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( settime , 0 , ts , tz ) ;
2007-10-17 10:31:32 +04:00
}
int security_vm_enough_memory_mm ( struct mm_struct * mm , long pages )
{
2015-05-03 01:11:42 +03:00
struct security_hook_list * hp ;
int cap_sys_admin = 1 ;
int rc ;
/*
* The module will respond with a positive value if
* it thinks the __vm_enough_memory ( ) call should be
* made with the cap_sys_admin set . If all of the modules
* agree that it should be set it will . If any module
* thinks it should not be set it won ' t .
*/
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( hp , & security_hook_heads . vm_enough_memory , list ) {
2015-05-03 01:11:42 +03:00
rc = hp - > hook . vm_enough_memory ( mm , pages ) ;
if ( rc < = 0 ) {
cap_sys_admin = 0 ;
break ;
}
}
return __vm_enough_memory ( mm , pages , cap_sys_admin ) ;
2007-10-17 10:31:32 +04:00
}
2020-03-22 23:46:24 +03:00
int security_bprm_creds_for_exec ( struct linux_binprm * bprm )
2007-10-17 10:31:32 +04:00
{
2020-03-22 23:46:24 +03:00
return call_int_hook ( bprm_creds_for_exec , 0 , bprm ) ;
}
2020-05-30 06:00:54 +03:00
int security_bprm_creds_from_file ( struct linux_binprm * bprm , struct file * file )
2007-10-17 10:31:32 +04:00
{
2020-05-30 06:00:54 +03:00
return call_int_hook ( bprm_creds_from_file , 0 , bprm , file ) ;
2007-10-17 10:31:32 +04:00
}
CRED: Make execve() take advantage of copy-on-write credentials
Make execve() take advantage of copy-on-write credentials, allowing it to set
up the credentials in advance, and then commit the whole lot after the point
of no return.
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
The credential bits from struct linux_binprm are, for the most part,
replaced with a single credentials pointer (bprm->cred). This means that
all the creds can be calculated in advance and then applied at the point
of no return with no possibility of failure.
I would like to replace bprm->cap_effective with:
cap_isclear(bprm->cap_effective)
but this seems impossible due to special behaviour for processes of pid 1
(they always retain their parent's capability masks where normally they'd
be changed - see cap_bprm_set_creds()).
The following sequence of events now happens:
(a) At the start of do_execve, the current task's cred_exec_mutex is
locked to prevent PTRACE_ATTACH from obsoleting the calculation of
creds that we make.
(a) prepare_exec_creds() is then called to make a copy of the current
task's credentials and prepare it. This copy is then assigned to
bprm->cred.
This renders security_bprm_alloc() and security_bprm_free()
unnecessary, and so they've been removed.
(b) The determination of unsafe execution is now performed immediately
after (a) rather than later on in the code. The result is stored in
bprm->unsafe for future reference.
(c) prepare_binprm() is called, possibly multiple times.
(i) This applies the result of set[ug]id binaries to the new creds
attached to bprm->cred. Personality bit clearance is recorded,
but now deferred on the basis that the exec procedure may yet
fail.
(ii) This then calls the new security_bprm_set_creds(). This should
calculate the new LSM and capability credentials into *bprm->cred.
This folds together security_bprm_set() and parts of
security_bprm_apply_creds() (these two have been removed).
Anything that might fail must be done at this point.
(iii) bprm->cred_prepared is set to 1.
bprm->cred_prepared is 0 on the first pass of the security
calculations, and 1 on all subsequent passes. This allows SELinux
in (ii) to base its calculations only on the initial script and
not on the interpreter.
(d) flush_old_exec() is called to commit the task to execution. This
performs the following steps with regard to credentials:
(i) Clear pdeath_signal and set dumpable on certain circumstances that
may not be covered by commit_creds().
(ii) Clear any bits in current->personality that were deferred from
(c.i).
(e) install_exec_creds() [compute_creds() as was] is called to install the
new credentials. This performs the following steps with regard to
credentials:
(i) Calls security_bprm_committing_creds() to apply any security
requirements, such as flushing unauthorised files in SELinux, that
must be done before the credentials are changed.
This is made up of bits of security_bprm_apply_creds() and
security_bprm_post_apply_creds(), both of which have been removed.
This function is not allowed to fail; anything that might fail
must have been done in (c.ii).
(ii) Calls commit_creds() to apply the new credentials in a single
assignment (more or less). Possibly pdeath_signal and dumpable
should be part of struct creds.
(iii) Unlocks the task's cred_replace_mutex, thus allowing
PTRACE_ATTACH to take place.
(iv) Clears The bprm->cred pointer as the credentials it was holding
are now immutable.
(v) Calls security_bprm_committed_creds() to apply any security
alterations that must be done after the creds have been changed.
SELinux uses this to flush signals and signal handlers.
(f) If an error occurs before (d.i), bprm_free() will call abort_creds()
to destroy the proposed new credentials and will then unlock
cred_replace_mutex. No changes to the credentials will have been
made.
(2) LSM interface.
A number of functions have been changed, added or removed:
(*) security_bprm_alloc(), ->bprm_alloc_security()
(*) security_bprm_free(), ->bprm_free_security()
Removed in favour of preparing new credentials and modifying those.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
(*) security_bprm_post_apply_creds(), ->bprm_post_apply_creds()
Removed; split between security_bprm_set_creds(),
security_bprm_committing_creds() and security_bprm_committed_creds().
(*) security_bprm_set(), ->bprm_set_security()
Removed; folded into security_bprm_set_creds().
(*) security_bprm_set_creds(), ->bprm_set_creds()
New. The new credentials in bprm->creds should be checked and set up
as appropriate. bprm->cred_prepared is 0 on the first call, 1 on the
second and subsequent calls.
(*) security_bprm_committing_creds(), ->bprm_committing_creds()
(*) security_bprm_committed_creds(), ->bprm_committed_creds()
New. Apply the security effects of the new credentials. This
includes closing unauthorised files in SELinux. This function may not
fail. When the former is called, the creds haven't yet been applied
to the process; when the latter is called, they have.
The former may access bprm->cred, the latter may not.
(3) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) The bprm_security_struct struct has been removed in favour of using
the credentials-under-construction approach.
(c) flush_unauthorized_files() now takes a cred pointer and passes it on
to inode_has_perm(), file_has_perm() and dentry_open().
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Acked-by: Serge Hallyn <serue@us.ibm.com>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:24 +03:00
int security_bprm_check ( struct linux_binprm * bprm )
2007-10-17 10:31:32 +04:00
{
2009-10-23 01:30:13 +04:00
int ret ;
2015-05-03 01:11:29 +03:00
ret = call_int_hook ( bprm_check_security , 0 , bprm ) ;
2009-10-23 01:30:13 +04:00
if ( ret )
return ret ;
return ima_bprm_check ( bprm ) ;
2007-10-17 10:31:32 +04:00
}
CRED: Make execve() take advantage of copy-on-write credentials
Make execve() take advantage of copy-on-write credentials, allowing it to set
up the credentials in advance, and then commit the whole lot after the point
of no return.
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
The credential bits from struct linux_binprm are, for the most part,
replaced with a single credentials pointer (bprm->cred). This means that
all the creds can be calculated in advance and then applied at the point
of no return with no possibility of failure.
I would like to replace bprm->cap_effective with:
cap_isclear(bprm->cap_effective)
but this seems impossible due to special behaviour for processes of pid 1
(they always retain their parent's capability masks where normally they'd
be changed - see cap_bprm_set_creds()).
The following sequence of events now happens:
(a) At the start of do_execve, the current task's cred_exec_mutex is
locked to prevent PTRACE_ATTACH from obsoleting the calculation of
creds that we make.
(a) prepare_exec_creds() is then called to make a copy of the current
task's credentials and prepare it. This copy is then assigned to
bprm->cred.
This renders security_bprm_alloc() and security_bprm_free()
unnecessary, and so they've been removed.
(b) The determination of unsafe execution is now performed immediately
after (a) rather than later on in the code. The result is stored in
bprm->unsafe for future reference.
(c) prepare_binprm() is called, possibly multiple times.
(i) This applies the result of set[ug]id binaries to the new creds
attached to bprm->cred. Personality bit clearance is recorded,
but now deferred on the basis that the exec procedure may yet
fail.
(ii) This then calls the new security_bprm_set_creds(). This should
calculate the new LSM and capability credentials into *bprm->cred.
This folds together security_bprm_set() and parts of
security_bprm_apply_creds() (these two have been removed).
Anything that might fail must be done at this point.
(iii) bprm->cred_prepared is set to 1.
bprm->cred_prepared is 0 on the first pass of the security
calculations, and 1 on all subsequent passes. This allows SELinux
in (ii) to base its calculations only on the initial script and
not on the interpreter.
(d) flush_old_exec() is called to commit the task to execution. This
performs the following steps with regard to credentials:
(i) Clear pdeath_signal and set dumpable on certain circumstances that
may not be covered by commit_creds().
(ii) Clear any bits in current->personality that were deferred from
(c.i).
(e) install_exec_creds() [compute_creds() as was] is called to install the
new credentials. This performs the following steps with regard to
credentials:
(i) Calls security_bprm_committing_creds() to apply any security
requirements, such as flushing unauthorised files in SELinux, that
must be done before the credentials are changed.
This is made up of bits of security_bprm_apply_creds() and
security_bprm_post_apply_creds(), both of which have been removed.
This function is not allowed to fail; anything that might fail
must have been done in (c.ii).
(ii) Calls commit_creds() to apply the new credentials in a single
assignment (more or less). Possibly pdeath_signal and dumpable
should be part of struct creds.
(iii) Unlocks the task's cred_replace_mutex, thus allowing
PTRACE_ATTACH to take place.
(iv) Clears The bprm->cred pointer as the credentials it was holding
are now immutable.
(v) Calls security_bprm_committed_creds() to apply any security
alterations that must be done after the creds have been changed.
SELinux uses this to flush signals and signal handlers.
(f) If an error occurs before (d.i), bprm_free() will call abort_creds()
to destroy the proposed new credentials and will then unlock
cred_replace_mutex. No changes to the credentials will have been
made.
(2) LSM interface.
A number of functions have been changed, added or removed:
(*) security_bprm_alloc(), ->bprm_alloc_security()
(*) security_bprm_free(), ->bprm_free_security()
Removed in favour of preparing new credentials and modifying those.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
(*) security_bprm_post_apply_creds(), ->bprm_post_apply_creds()
Removed; split between security_bprm_set_creds(),
security_bprm_committing_creds() and security_bprm_committed_creds().
(*) security_bprm_set(), ->bprm_set_security()
Removed; folded into security_bprm_set_creds().
(*) security_bprm_set_creds(), ->bprm_set_creds()
New. The new credentials in bprm->creds should be checked and set up
as appropriate. bprm->cred_prepared is 0 on the first call, 1 on the
second and subsequent calls.
(*) security_bprm_committing_creds(), ->bprm_committing_creds()
(*) security_bprm_committed_creds(), ->bprm_committed_creds()
New. Apply the security effects of the new credentials. This
includes closing unauthorised files in SELinux. This function may not
fail. When the former is called, the creds haven't yet been applied
to the process; when the latter is called, they have.
The former may access bprm->cred, the latter may not.
(3) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) The bprm_security_struct struct has been removed in favour of using
the credentials-under-construction approach.
(c) flush_unauthorized_files() now takes a cred pointer and passes it on
to inode_has_perm(), file_has_perm() and dentry_open().
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Acked-by: Serge Hallyn <serue@us.ibm.com>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:24 +03:00
void security_bprm_committing_creds ( struct linux_binprm * bprm )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( bprm_committing_creds , bprm ) ;
2007-10-17 10:31:32 +04:00
}
CRED: Make execve() take advantage of copy-on-write credentials
Make execve() take advantage of copy-on-write credentials, allowing it to set
up the credentials in advance, and then commit the whole lot after the point
of no return.
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
The credential bits from struct linux_binprm are, for the most part,
replaced with a single credentials pointer (bprm->cred). This means that
all the creds can be calculated in advance and then applied at the point
of no return with no possibility of failure.
I would like to replace bprm->cap_effective with:
cap_isclear(bprm->cap_effective)
but this seems impossible due to special behaviour for processes of pid 1
(they always retain their parent's capability masks where normally they'd
be changed - see cap_bprm_set_creds()).
The following sequence of events now happens:
(a) At the start of do_execve, the current task's cred_exec_mutex is
locked to prevent PTRACE_ATTACH from obsoleting the calculation of
creds that we make.
(a) prepare_exec_creds() is then called to make a copy of the current
task's credentials and prepare it. This copy is then assigned to
bprm->cred.
This renders security_bprm_alloc() and security_bprm_free()
unnecessary, and so they've been removed.
(b) The determination of unsafe execution is now performed immediately
after (a) rather than later on in the code. The result is stored in
bprm->unsafe for future reference.
(c) prepare_binprm() is called, possibly multiple times.
(i) This applies the result of set[ug]id binaries to the new creds
attached to bprm->cred. Personality bit clearance is recorded,
but now deferred on the basis that the exec procedure may yet
fail.
(ii) This then calls the new security_bprm_set_creds(). This should
calculate the new LSM and capability credentials into *bprm->cred.
This folds together security_bprm_set() and parts of
security_bprm_apply_creds() (these two have been removed).
Anything that might fail must be done at this point.
(iii) bprm->cred_prepared is set to 1.
bprm->cred_prepared is 0 on the first pass of the security
calculations, and 1 on all subsequent passes. This allows SELinux
in (ii) to base its calculations only on the initial script and
not on the interpreter.
(d) flush_old_exec() is called to commit the task to execution. This
performs the following steps with regard to credentials:
(i) Clear pdeath_signal and set dumpable on certain circumstances that
may not be covered by commit_creds().
(ii) Clear any bits in current->personality that were deferred from
(c.i).
(e) install_exec_creds() [compute_creds() as was] is called to install the
new credentials. This performs the following steps with regard to
credentials:
(i) Calls security_bprm_committing_creds() to apply any security
requirements, such as flushing unauthorised files in SELinux, that
must be done before the credentials are changed.
This is made up of bits of security_bprm_apply_creds() and
security_bprm_post_apply_creds(), both of which have been removed.
This function is not allowed to fail; anything that might fail
must have been done in (c.ii).
(ii) Calls commit_creds() to apply the new credentials in a single
assignment (more or less). Possibly pdeath_signal and dumpable
should be part of struct creds.
(iii) Unlocks the task's cred_replace_mutex, thus allowing
PTRACE_ATTACH to take place.
(iv) Clears The bprm->cred pointer as the credentials it was holding
are now immutable.
(v) Calls security_bprm_committed_creds() to apply any security
alterations that must be done after the creds have been changed.
SELinux uses this to flush signals and signal handlers.
(f) If an error occurs before (d.i), bprm_free() will call abort_creds()
to destroy the proposed new credentials and will then unlock
cred_replace_mutex. No changes to the credentials will have been
made.
(2) LSM interface.
A number of functions have been changed, added or removed:
(*) security_bprm_alloc(), ->bprm_alloc_security()
(*) security_bprm_free(), ->bprm_free_security()
Removed in favour of preparing new credentials and modifying those.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
(*) security_bprm_post_apply_creds(), ->bprm_post_apply_creds()
Removed; split between security_bprm_set_creds(),
security_bprm_committing_creds() and security_bprm_committed_creds().
(*) security_bprm_set(), ->bprm_set_security()
Removed; folded into security_bprm_set_creds().
(*) security_bprm_set_creds(), ->bprm_set_creds()
New. The new credentials in bprm->creds should be checked and set up
as appropriate. bprm->cred_prepared is 0 on the first call, 1 on the
second and subsequent calls.
(*) security_bprm_committing_creds(), ->bprm_committing_creds()
(*) security_bprm_committed_creds(), ->bprm_committed_creds()
New. Apply the security effects of the new credentials. This
includes closing unauthorised files in SELinux. This function may not
fail. When the former is called, the creds haven't yet been applied
to the process; when the latter is called, they have.
The former may access bprm->cred, the latter may not.
(3) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) The bprm_security_struct struct has been removed in favour of using
the credentials-under-construction approach.
(c) flush_unauthorized_files() now takes a cred pointer and passes it on
to inode_has_perm(), file_has_perm() and dentry_open().
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Acked-by: Serge Hallyn <serue@us.ibm.com>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:24 +03:00
void security_bprm_committed_creds ( struct linux_binprm * bprm )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( bprm_committed_creds , bprm ) ;
2007-10-17 10:31:32 +04:00
}
2018-12-24 00:02:47 +03:00
int security_fs_context_dup ( struct fs_context * fc , struct fs_context * src_fc )
{
return call_int_hook ( fs_context_dup , 0 , fc , src_fc ) ;
}
2022-01-27 07:51:00 +03:00
int security_fs_context_parse_param ( struct fs_context * fc ,
struct fs_parameter * param )
2018-11-02 02:07:24 +03:00
{
2022-01-27 07:51:00 +03:00
struct security_hook_list * hp ;
int trc ;
int rc = - ENOPARAM ;
hlist_for_each_entry ( hp , & security_hook_heads . fs_context_parse_param ,
list ) {
trc = hp - > hook . fs_context_parse_param ( fc , param ) ;
if ( trc = = 0 )
rc = 0 ;
else if ( trc ! = - ENOPARAM )
return trc ;
}
return rc ;
2018-11-02 02:07:24 +03:00
}
2007-10-17 10:31:32 +04:00
int security_sb_alloc ( struct super_block * sb )
{
2021-04-22 18:41:15 +03:00
int rc = lsm_superblock_alloc ( sb ) ;
if ( unlikely ( rc ) )
return rc ;
rc = call_int_hook ( sb_alloc_security , 0 , sb ) ;
if ( unlikely ( rc ) )
security_sb_free ( sb ) ;
return rc ;
2007-10-17 10:31:32 +04:00
}
2021-04-22 18:41:16 +03:00
void security_sb_delete ( struct super_block * sb )
{
call_void_hook ( sb_delete , sb ) ;
2007-10-17 10:31:32 +04:00
}
void security_sb_free ( struct super_block * sb )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( sb_free_security , sb ) ;
2021-04-22 18:41:15 +03:00
kfree ( sb - > s_security ) ;
sb - > s_security = NULL ;
2007-10-17 10:31:32 +04:00
}
2018-12-13 21:41:47 +03:00
void security_free_mnt_opts ( void * * mnt_opts )
2007-10-17 10:31:32 +04:00
{
2018-12-13 21:41:47 +03:00
if ( ! * mnt_opts )
return ;
call_void_hook ( sb_free_mnt_opts , * mnt_opts ) ;
* mnt_opts = NULL ;
2007-10-17 10:31:32 +04:00
}
2018-12-13 21:41:47 +03:00
EXPORT_SYMBOL ( security_free_mnt_opts ) ;
2007-10-17 10:31:32 +04:00
2018-12-13 21:41:47 +03:00
int security_sb_eat_lsm_opts ( char * options , void * * mnt_opts )
2011-03-04 00:09:14 +03:00
{
2018-12-13 21:41:47 +03:00
return call_int_hook ( sb_eat_lsm_opts , 0 , options , mnt_opts ) ;
2011-03-04 00:09:14 +03:00
}
2018-11-17 20:09:18 +03:00
EXPORT_SYMBOL ( security_sb_eat_lsm_opts ) ;
2011-03-04 00:09:14 +03:00
2021-02-27 06:37:55 +03:00
int security_sb_mnt_opts_compat ( struct super_block * sb ,
void * mnt_opts )
{
return call_int_hook ( sb_mnt_opts_compat , 0 , sb , mnt_opts ) ;
}
EXPORT_SYMBOL ( security_sb_mnt_opts_compat ) ;
2018-12-02 07:06:57 +03:00
int security_sb_remount ( struct super_block * sb ,
2018-12-13 21:41:47 +03:00
void * mnt_opts )
2007-10-17 10:31:32 +04:00
{
2018-12-13 21:41:47 +03:00
return call_int_hook ( sb_remount , 0 , sb , mnt_opts ) ;
2011-03-04 00:09:14 +03:00
}
2018-12-11 01:19:21 +03:00
EXPORT_SYMBOL ( security_sb_remount ) ;
2011-03-04 00:09:14 +03:00
2018-12-05 19:58:35 +03:00
int security_sb_kern_mount ( struct super_block * sb )
2007-10-17 10:31:32 +04:00
{
2018-12-05 19:58:35 +03:00
return call_int_hook ( sb_kern_mount , 0 , sb ) ;
2007-10-17 10:31:32 +04:00
}
2008-07-04 03:47:13 +04:00
int security_sb_show_options ( struct seq_file * m , struct super_block * sb )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sb_show_options , 0 , m , sb ) ;
2008-07-04 03:47:13 +04:00
}
2007-10-17 10:31:32 +04:00
int security_sb_statfs ( struct dentry * dentry )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sb_statfs , 0 , dentry ) ;
2007-10-17 10:31:32 +04:00
}
2016-03-25 21:52:53 +03:00
int security_sb_mount ( const char * dev_name , const struct path * path ,
2012-10-11 19:42:01 +04:00
const char * type , unsigned long flags , void * data )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sb_mount , 0 , dev_name , path , type , flags , data ) ;
2007-10-17 10:31:32 +04:00
}
int security_sb_umount ( struct vfsmount * mnt , int flags )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sb_umount , 0 , mnt , flags ) ;
2007-10-17 10:31:32 +04:00
}
2016-03-25 22:31:19 +03:00
int security_sb_pivotroot ( const struct path * old_path , const struct path * new_path )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sb_pivotroot , 0 , old_path , new_path ) ;
2007-10-17 10:31:32 +04:00
}
2007-11-30 21:00:35 +03:00
int security_sb_set_mnt_opts ( struct super_block * sb ,
2018-12-13 21:41:47 +03:00
void * mnt_opts ,
2013-05-22 20:50:36 +04:00
unsigned long kern_flags ,
unsigned long * set_kern_flags )
2007-11-30 21:00:35 +03:00
{
2015-05-03 01:11:42 +03:00
return call_int_hook ( sb_set_mnt_opts ,
2018-12-13 21:41:47 +03:00
mnt_opts ? - EOPNOTSUPP : 0 , sb ,
mnt_opts , kern_flags , set_kern_flags ) ;
2007-11-30 21:00:35 +03:00
}
2008-03-05 18:31:54 +03:00
EXPORT_SYMBOL ( security_sb_set_mnt_opts ) ;
2007-11-30 21:00:35 +03:00
selinux: make security_sb_clone_mnt_opts return an error on context mismatch
I had the following problem reported a while back. If you mount the
same filesystem twice using NFSv4 with different contexts, then the
second context= option is ignored. For instance:
# mount server:/export /mnt/test1
# mount server:/export /mnt/test2 -o context=system_u:object_r:tmp_t:s0
# ls -dZ /mnt/test1
drwxrwxrwt. root root system_u:object_r:nfs_t:s0 /mnt/test1
# ls -dZ /mnt/test2
drwxrwxrwt. root root system_u:object_r:nfs_t:s0 /mnt/test2
When we call into SELinux to set the context of a "cloned" superblock,
it will currently just bail out when it notices that we're reusing an
existing superblock. Since the existing superblock is already set up and
presumably in use, we can't go overwriting its context with the one from
the "original" sb. Because of this, the second context= option in this
case cannot take effect.
This patch fixes this by turning security_sb_clone_mnt_opts into an int
return operation. When it finds that the "new" superblock that it has
been handed is already set up, it checks to see whether the contexts on
the old superblock match it. If it does, then it will just return
success, otherwise it'll return -EBUSY and emit a printk to tell the
admin why the second mount failed.
Note that this patch may cause casualties. The NFSv4 code relies on
being able to walk down to an export from the pseudoroot. If you mount
filesystems that are nested within one another with different contexts,
then this patch will make those mounts fail in new and "exciting" ways.
For instance, suppose that /export is a separate filesystem on the
server:
# mount server:/ /mnt/test1
# mount salusa:/export /mnt/test2 -o context=system_u:object_r:tmp_t:s0
mount.nfs: an incorrect mount option was specified
...with the printk in the ring buffer. Because we *might* eventually
walk down to /mnt/test1/export, the mount is denied due to this patch.
The second mount needs the pseudoroot superblock, but that's already
present with the wrong context.
OTOH, if we mount these in the reverse order, then both mounts work,
because the pseudoroot superblock created when mounting /export is
discarded once that mount is done. If we then however try to walk into
that directory, the automount fails for the similar reasons:
# cd /mnt/test1/scratch/
-bash: cd: /mnt/test1/scratch: Device or resource busy
The story I've gotten from the SELinux folks that I've talked to is that
this is desirable behavior. In SELinux-land, mounting the same data
under different contexts is wrong -- there can be only one.
Cc: Steve Dickson <steved@redhat.com>
Cc: Stephen Smalley <sds@tycho.nsa.gov>
Signed-off-by: Jeff Layton <jlayton@redhat.com>
Acked-by: Eric Paris <eparis@redhat.com>
Signed-off-by: James Morris <james.l.morris@oracle.com>
2013-04-01 16:14:24 +04:00
int security_sb_clone_mnt_opts ( const struct super_block * oldsb ,
2017-06-05 18:45:04 +03:00
struct super_block * newsb ,
unsigned long kern_flags ,
unsigned long * set_kern_flags )
2007-11-30 21:00:35 +03:00
{
2017-06-05 18:45:04 +03:00
return call_int_hook ( sb_clone_mnt_opts , 0 , oldsb , newsb ,
kern_flags , set_kern_flags ) ;
2007-11-30 21:00:35 +03:00
}
2008-03-05 18:31:54 +03:00
EXPORT_SYMBOL ( security_sb_clone_mnt_opts ) ;
2018-11-05 20:40:30 +03:00
int security_move_mount ( const struct path * from_path , const struct path * to_path )
{
return call_int_hook ( move_mount , 0 , from_path , to_path ) ;
}
fanotify, inotify, dnotify, security: add security hook for fs notifications
As of now, setting watches on filesystem objects has, at most, applied a
check for read access to the inode, and in the case of fanotify, requires
CAP_SYS_ADMIN. No specific security hook or permission check has been
provided to control the setting of watches. Using any of inotify, dnotify,
or fanotify, it is possible to observe, not only write-like operations, but
even read access to a file. Modeling the watch as being merely a read from
the file is insufficient for the needs of SELinux. This is due to the fact
that read access should not necessarily imply access to information about
when another process reads from a file. Furthermore, fanotify watches grant
more power to an application in the form of permission events. While
notification events are solely, unidirectional (i.e. they only pass
information to the receiving application), permission events are blocking.
Permission events make a request to the receiving application which will
then reply with a decision as to whether or not that action may be
completed. This causes the issue of the watching application having the
ability to exercise control over the triggering process. Without drawing a
distinction within the permission check, the ability to read would imply
the greater ability to control an application. Additionally, mount and
superblock watches apply to all files within the same mount or superblock.
Read access to one file should not necessarily imply the ability to watch
all files accessed within a given mount or superblock.
In order to solve these issues, a new LSM hook is implemented and has been
placed within the system calls for marking filesystem objects with inotify,
fanotify, and dnotify watches. These calls to the hook are placed at the
point at which the target path has been resolved and are provided with the
path struct, the mask of requested notification events, and the type of
object on which the mark is being set (inode, superblock, or mount). The
mask and obj_type have already been translated into common FS_* values
shared by the entirety of the fs notification infrastructure. The path
struct is passed rather than just the inode so that the mount is available,
particularly for mount watches. This also allows for use of the hook by
pathname-based security modules. However, since the hook is intended for
use even by inode based security modules, it is not placed under the
CONFIG_SECURITY_PATH conditional. Otherwise, the inode-based security
modules would need to enable all of the path hooks, even though they do not
use any of them.
This only provides a hook at the point of setting a watch, and presumes
that permission to set a particular watch implies the ability to receive
all notification about that object which match the mask. This is all that
is required for SELinux. If other security modules require additional hooks
or infrastructure to control delivery of notification, these can be added
by them. It does not make sense for us to propose hooks for which we have
no implementation. The understanding that all notifications received by the
requesting application are all strictly of a type for which the application
has been granted permission shows that this implementation is sufficient in
its coverage.
Security modules wishing to provide complete control over fanotify must
also implement a security_file_open hook that validates that the access
requested by the watching application is authorized. Fanotify has the issue
that it returns a file descriptor with the file mode specified during
fanotify_init() to the watching process on event. This is already covered
by the LSM security_file_open hook if the security module implements
checking of the requested file mode there. Otherwise, a watching process
can obtain escalated access to a file for which it has not been authorized.
The selinux_path_notify hook implementation works by adding five new file
permissions: watch, watch_mount, watch_sb, watch_reads, and watch_with_perm
(descriptions about which will follow), and one new filesystem permission:
watch (which is applied to superblock checks). The hook then decides which
subset of these permissions must be held by the requesting application
based on the contents of the provided mask and the obj_type. The
selinux_file_open hook already checks the requested file mode and therefore
ensures that a watching process cannot escalate its access through
fanotify.
The watch, watch_mount, and watch_sb permissions are the baseline
permissions for setting a watch on an object and each are a requirement for
any watch to be set on a file, mount, or superblock respectively. It should
be noted that having either of the other two permissions (watch_reads and
watch_with_perm) does not imply the watch, watch_mount, or watch_sb
permission. Superblock watches further require the filesystem watch
permission to the superblock. As there is no labeled object in view for
mounts, there is no specific check for mount watches beyond watch_mount to
the inode. Such a check could be added in the future, if a suitable labeled
object existed representing the mount.
The watch_reads permission is required to receive notifications from
read-exclusive events on filesystem objects. These events include accessing
a file for the purpose of reading and closing a file which has been opened
read-only. This distinction has been drawn in order to provide a direct
indication in the policy for this otherwise not obvious capability. Read
access to a file should not necessarily imply the ability to observe read
events on a file.
Finally, watch_with_perm only applies to fanotify masks since it is the
only way to set a mask which allows for the blocking, permission event.
This permission is needed for any watch which is of this type. Though
fanotify requires CAP_SYS_ADMIN, this is insufficient as it gives implicit
trust to root, which we do not do, and does not support least privilege.
Signed-off-by: Aaron Goidel <acgoide@tycho.nsa.gov>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Acked-by: Jan Kara <jack@suse.cz>
Signed-off-by: Paul Moore <paul@paul-moore.com>
2019-08-12 18:20:00 +03:00
int security_path_notify ( const struct path * path , u64 mask ,
unsigned int obj_type )
{
return call_int_hook ( path_notify , 0 , path , mask , obj_type ) ;
}
2007-10-17 10:31:32 +04:00
int security_inode_alloc ( struct inode * inode )
{
2018-09-22 03:19:29 +03:00
int rc = lsm_inode_alloc ( inode ) ;
if ( unlikely ( rc ) )
return rc ;
rc = call_int_hook ( inode_alloc_security , 0 , inode ) ;
if ( unlikely ( rc ) )
security_inode_free ( inode ) ;
return rc ;
}
static void inode_free_by_rcu ( struct rcu_head * head )
{
/*
* The rcu head is at the start of the inode blob
*/
kmem_cache_free ( lsm_inode_cache , head ) ;
2007-10-17 10:31:32 +04:00
}
void security_inode_free ( struct inode * inode )
{
2011-03-09 22:13:22 +03:00
integrity_inode_free ( inode ) ;
2015-05-03 01:11:29 +03:00
call_void_hook ( inode_free_security , inode ) ;
2018-09-22 03:19:29 +03:00
/*
* The inode may still be referenced in a path walk and
* a call to security_inode_permission ( ) can be made
* after inode_free_security ( ) is called . Ideally , the VFS
* wouldn ' t do this , but fixing that is a much harder
* job . For now , simply free the i_security via RCU , and
* leave the current inode - > i_security pointer intact .
* The inode will be freed after the RCU grace period too .
*/
if ( inode - > i_security )
call_rcu ( ( struct rcu_head * ) inode - > i_security ,
inode_free_by_rcu ) ;
2007-10-17 10:31:32 +04:00
}
2013-05-22 20:50:34 +04:00
int security_dentry_init_security ( struct dentry * dentry , int mode ,
2021-10-12 16:23:07 +03:00
const struct qstr * name ,
const char * * xattr_name , void * * ctx ,
u32 * ctxlen )
2013-05-22 20:50:34 +04:00
{
security, lsm: dentry_init_security() Handle multi LSM registration
A ceph user has reported that ceph is crashing with kernel NULL pointer
dereference. Following is the backtrace.
/proc/version: Linux version 5.16.2-arch1-1 (linux@archlinux) (gcc (GCC)
11.1.0, GNU ld (GNU Binutils) 2.36.1) #1 SMP PREEMPT Thu, 20 Jan 2022
16:18:29 +0000
distro / arch: Arch Linux / x86_64
SELinux is not enabled
ceph cluster version: 16.2.7 (dd0603118f56ab514f133c8d2e3adfc983942503)
relevant dmesg output:
[ 30.947129] BUG: kernel NULL pointer dereference, address:
0000000000000000
[ 30.947206] #PF: supervisor read access in kernel mode
[ 30.947258] #PF: error_code(0x0000) - not-present page
[ 30.947310] PGD 0 P4D 0
[ 30.947342] Oops: 0000 [#1] PREEMPT SMP PTI
[ 30.947388] CPU: 5 PID: 778 Comm: touch Not tainted 5.16.2-arch1-1 #1
86fbf2c313cc37a553d65deb81d98e9dcc2a3659
[ 30.947486] Hardware name: Gigabyte Technology Co., Ltd. B365M
DS3H/B365M DS3H, BIOS F5 08/13/2019
[ 30.947569] RIP: 0010:strlen+0x0/0x20
[ 30.947616] Code: b6 07 38 d0 74 16 48 83 c7 01 84 c0 74 05 48 39 f7 75
ec 31 c0 31 d2 89 d6 89 d7 c3 48 89 f8 31 d2 89 d6 89 d7 c3 0
f 1f 40 00 <80> 3f 00 74 12 48 89 f8 48 83 c0 01 80 38 00 75 f7 48 29 f8 31
ff
[ 30.947782] RSP: 0018:ffffa4ed80ffbbb8 EFLAGS: 00010246
[ 30.947836] RAX: 0000000000000000 RBX: ffffa4ed80ffbc60 RCX:
0000000000000000
[ 30.947904] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
0000000000000000
[ 30.947971] RBP: ffff94b0d15c0ae0 R08: 0000000000000000 R09:
0000000000000000
[ 30.948040] R10: 0000000000000000 R11: 0000000000000000 R12:
0000000000000000
[ 30.948106] R13: 0000000000000001 R14: ffffa4ed80ffbc60 R15:
0000000000000000
[ 30.948174] FS: 00007fc7520f0740(0000) GS:ffff94b7ced40000(0000)
knlGS:0000000000000000
[ 30.948252] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 30.948308] CR2: 0000000000000000 CR3: 0000000104a40001 CR4:
00000000003706e0
[ 30.948376] Call Trace:
[ 30.948404] <TASK>
[ 30.948431] ceph_security_init_secctx+0x7b/0x240 [ceph
49f9c4b9bf5be8760f19f1747e26da33920bce4b]
[ 30.948582] ceph_atomic_open+0x51e/0x8a0 [ceph
49f9c4b9bf5be8760f19f1747e26da33920bce4b]
[ 30.948708] ? get_cached_acl+0x4d/0xa0
[ 30.948759] path_openat+0x60d/0x1030
[ 30.948809] do_filp_open+0xa5/0x150
[ 30.948859] do_sys_openat2+0xc4/0x190
[ 30.948904] __x64_sys_openat+0x53/0xa0
[ 30.948948] do_syscall_64+0x5c/0x90
[ 30.948989] ? exc_page_fault+0x72/0x180
[ 30.949034] entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 30.949091] RIP: 0033:0x7fc7521e25bb
[ 30.950849] Code: 25 00 00 41 00 3d 00 00 41 00 74 4b 64 8b 04 25 18 00
00 00 85 c0 75 67 44 89 e2 48 89 ee bf 9c ff ff ff b8 01 01 0
0 00 0f 05 <48> 3d 00 f0 ff ff 0f 87 91 00 00 00 48 8b 54 24 28 64 48 2b 14
25
Core of the problem is that ceph checks for return code from
security_dentry_init_security() and if return code is 0, it assumes
everything is fine and continues to call strlen(name), which crashes.
Typically SELinux LSM returns 0 and sets name to "security.selinux" and
it is not a problem. Or if selinux is not compiled in or disabled, it
returns -EOPNOTSUP and ceph deals with it.
But somehow in this configuration, 0 is being returned and "name" is
not being initialized and that's creating the problem.
Our suspicion is that BPF LSM is registering a hook for
dentry_init_security() and returns hook default of 0.
LSM_HOOK(int, 0, dentry_init_security, struct dentry *dentry,...)
I have not been able to reproduce it just by doing CONFIG_BPF_LSM=y.
Stephen has tested the patch though and confirms it solves the problem
for him.
dentry_init_security() is written in such a way that it expects only one
LSM to register the hook. Atleast that's the expectation with current code.
If another LSM returns a hook and returns default, it will simply return
0 as of now and that will break ceph.
Hence, suggestion is that change semantics of this hook a bit. If there
are no LSMs or no LSM is taking ownership and initializing security context,
then return -EOPNOTSUP. Also allow at max one LSM to initialize security
context. This hook can't deal with multiple LSMs trying to init security
context. This patch implements this new behavior.
Reported-by: Stephen Muth <smuth4@gmail.com>
Tested-by: Stephen Muth <smuth4@gmail.com>
Suggested-by: Casey Schaufler <casey@schaufler-ca.com>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Reviewed-by: Serge Hallyn <serge@hallyn.com>
Cc: Jeff Layton <jlayton@kernel.org>
Cc: Christian Brauner <brauner@kernel.org>
Cc: Paul Moore <paul@paul-moore.com>
Cc: <stable@vger.kernel.org> # 5.16.0
Signed-off-by: Vivek Goyal <vgoyal@redhat.com>
Reviewed-by: Jeff Layton <jlayton@kernel.org>
Acked-by: Paul Moore <paul@paul-moore.com>
Acked-by: Christian Brauner <brauner@kernel.org>
Signed-off-by: James Morris <jmorris@namei.org>
2022-01-26 23:35:14 +03:00
struct security_hook_list * hp ;
int rc ;
/*
* Only one module will provide a security context .
*/
hlist_for_each_entry ( hp , & security_hook_heads . dentry_init_security , list ) {
rc = hp - > hook . dentry_init_security ( dentry , mode , name ,
xattr_name , ctx , ctxlen ) ;
if ( rc ! = LSM_RET_DEFAULT ( dentry_init_security ) )
return rc ;
}
return LSM_RET_DEFAULT ( dentry_init_security ) ;
2013-05-22 20:50:34 +04:00
}
EXPORT_SYMBOL ( security_dentry_init_security ) ;
2016-07-13 17:44:52 +03:00
int security_dentry_create_files_as ( struct dentry * dentry , int mode ,
struct qstr * name ,
const struct cred * old , struct cred * new )
{
return call_int_hook ( dentry_create_files_as , 0 , dentry , mode ,
name , old , new ) ;
}
EXPORT_SYMBOL ( security_dentry_create_files_as ) ;
2007-10-17 10:31:32 +04:00
int security_inode_init_security ( struct inode * inode , struct inode * dir ,
2011-06-06 23:29:25 +04:00
const struct qstr * qstr ,
const initxattrs initxattrs , void * fs_data )
2007-10-17 10:31:32 +04:00
{
2011-06-16 05:19:10 +04:00
struct xattr new_xattrs [ MAX_LSM_EVM_XATTR + 1 ] ;
struct xattr * lsm_xattr , * evm_xattr , * xattr ;
2011-06-06 23:29:25 +04:00
int ret ;
2007-10-17 10:31:32 +04:00
if ( unlikely ( IS_PRIVATE ( inode ) ) )
2011-08-15 18:13:18 +04:00
return 0 ;
2011-06-06 23:29:25 +04:00
if ( ! initxattrs )
2015-08-24 15:22:25 +03:00
return call_int_hook ( inode_init_security , - EOPNOTSUPP , inode ,
dir , qstr , NULL , NULL , NULL ) ;
2013-07-25 00:44:02 +04:00
memset ( new_xattrs , 0 , sizeof ( new_xattrs ) ) ;
2011-06-06 23:29:25 +04:00
lsm_xattr = new_xattrs ;
2015-05-03 01:11:42 +03:00
ret = call_int_hook ( inode_init_security , - EOPNOTSUPP , inode , dir , qstr ,
2011-06-06 23:29:25 +04:00
& lsm_xattr - > name ,
& lsm_xattr - > value ,
& lsm_xattr - > value_len ) ;
if ( ret )
goto out ;
2011-06-16 05:19:10 +04:00
evm_xattr = lsm_xattr + 1 ;
ret = evm_inode_init_security ( inode , lsm_xattr , evm_xattr ) ;
if ( ret )
goto out ;
2011-06-06 23:29:25 +04:00
ret = initxattrs ( inode , new_xattrs , fs_data ) ;
out :
2013-07-25 00:44:02 +04:00
for ( xattr = new_xattrs ; xattr - > value ! = NULL ; xattr + + )
2011-06-16 05:19:10 +04:00
kfree ( xattr - > value ) ;
2011-06-06 23:29:25 +04:00
return ( ret = = - EOPNOTSUPP ) ? 0 : ret ;
}
EXPORT_SYMBOL ( security_inode_init_security ) ;
2021-01-09 01:22:20 +03:00
int security_inode_init_security_anon ( struct inode * inode ,
const struct qstr * name ,
const struct inode * context_inode )
{
return call_int_hook ( inode_init_security_anon , 0 , inode , name ,
context_inode ) ;
}
2011-06-06 23:29:25 +04:00
int security_old_inode_init_security ( struct inode * inode , struct inode * dir ,
2013-07-25 00:44:02 +04:00
const struct qstr * qstr , const char * * name ,
2011-06-06 23:29:25 +04:00
void * * value , size_t * len )
2007-10-17 10:31:32 +04:00
{
if ( unlikely ( IS_PRIVATE ( inode ) ) )
2012-01-03 16:14:29 +04:00
return - EOPNOTSUPP ;
2015-08-24 15:22:25 +03:00
return call_int_hook ( inode_init_security , - EOPNOTSUPP , inode , dir ,
qstr , name , value , len ) ;
2007-10-17 10:31:32 +04:00
}
2011-06-06 23:29:25 +04:00
EXPORT_SYMBOL ( security_old_inode_init_security ) ;
2007-10-17 10:31:32 +04:00
2008-12-17 07:24:15 +03:00
# ifdef CONFIG_SECURITY_PATH
2016-03-25 22:21:09 +03:00
int security_path_mknod ( const struct path * dir , struct dentry * dentry , umode_t mode ,
2008-12-17 07:24:15 +03:00
unsigned int dev )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dir - > dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_mknod , 0 , dir , dentry , mode , dev ) ;
2008-12-17 07:24:15 +03:00
}
EXPORT_SYMBOL ( security_path_mknod ) ;
2016-03-25 22:21:09 +03:00
int security_path_mkdir ( const struct path * dir , struct dentry * dentry , umode_t mode )
2008-12-17 07:24:15 +03:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dir - > dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_mkdir , 0 , dir , dentry , mode ) ;
2008-12-17 07:24:15 +03:00
}
2010-12-24 17:48:35 +03:00
EXPORT_SYMBOL ( security_path_mkdir ) ;
2008-12-17 07:24:15 +03:00
2016-03-25 22:13:39 +03:00
int security_path_rmdir ( const struct path * dir , struct dentry * dentry )
2008-12-17 07:24:15 +03:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dir - > dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_rmdir , 0 , dir , dentry ) ;
2008-12-17 07:24:15 +03:00
}
2016-03-25 22:13:39 +03:00
int security_path_unlink ( const struct path * dir , struct dentry * dentry )
2008-12-17 07:24:15 +03:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dir - > dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_unlink , 0 , dir , dentry ) ;
2008-12-17 07:24:15 +03:00
}
2010-12-24 17:48:35 +03:00
EXPORT_SYMBOL ( security_path_unlink ) ;
2008-12-17 07:24:15 +03:00
2016-03-25 22:21:09 +03:00
int security_path_symlink ( const struct path * dir , struct dentry * dentry ,
2008-12-17 07:24:15 +03:00
const char * old_name )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dir - > dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_symlink , 0 , dir , dentry , old_name ) ;
2008-12-17 07:24:15 +03:00
}
2016-03-25 22:27:45 +03:00
int security_path_link ( struct dentry * old_dentry , const struct path * new_dir ,
2008-12-17 07:24:15 +03:00
struct dentry * new_dentry )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( old_dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_link , 0 , old_dentry , new_dir , new_dentry ) ;
2008-12-17 07:24:15 +03:00
}
2016-03-25 22:27:45 +03:00
int security_path_rename ( const struct path * old_dir , struct dentry * old_dentry ,
const struct path * new_dir , struct dentry * new_dentry ,
2014-04-01 19:08:43 +04:00
unsigned int flags )
2008-12-17 07:24:15 +03:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( old_dentry ) ) | |
( d_is_positive ( new_dentry ) & & IS_PRIVATE ( d_backing_inode ( new_dentry ) ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2014-04-01 19:08:43 +04:00
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_rename , 0 , old_dir , old_dentry , new_dir ,
2022-05-06 19:10:56 +03:00
new_dentry , flags ) ;
2008-12-17 07:24:15 +03:00
}
2010-12-24 17:48:35 +03:00
EXPORT_SYMBOL ( security_path_rename ) ;
2008-12-17 07:24:15 +03:00
2016-03-25 21:22:01 +03:00
int security_path_truncate ( const struct path * path )
2008-12-17 07:24:15 +03:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( path - > dentry ) ) ) )
2008-12-17 07:24:15 +03:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_truncate , 0 , path ) ;
2008-12-17 07:24:15 +03:00
}
2009-10-04 16:49:47 +04:00
2016-03-25 21:56:23 +03:00
int security_path_chmod ( const struct path * path , umode_t mode )
2009-10-04 16:49:47 +04:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( path - > dentry ) ) ) )
2009-10-04 16:49:47 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_chmod , 0 , path , mode ) ;
2009-10-04 16:49:47 +04:00
}
2016-03-25 21:44:41 +03:00
int security_path_chown ( const struct path * path , kuid_t uid , kgid_t gid )
2009-10-04 16:49:47 +04:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( path - > dentry ) ) ) )
2009-10-04 16:49:47 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_chown , 0 , path , uid , gid ) ;
2009-10-04 16:49:47 +04:00
}
2009-10-04 16:49:48 +04:00
2016-03-25 22:28:43 +03:00
int security_path_chroot ( const struct path * path )
2009-10-04 16:49:48 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( path_chroot , 0 , path ) ;
2009-10-04 16:49:48 +04:00
}
2008-12-17 07:24:15 +03:00
# endif
2011-07-26 09:42:34 +04:00
int security_inode_create ( struct inode * dir , struct dentry * dentry , umode_t mode )
2007-10-17 10:31:32 +04:00
{
if ( unlikely ( IS_PRIVATE ( dir ) ) )
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_create , 0 , dir , dentry , mode ) ;
2007-10-17 10:31:32 +04:00
}
2009-04-03 19:42:40 +04:00
EXPORT_SYMBOL_GPL ( security_inode_create ) ;
2007-10-17 10:31:32 +04:00
int security_inode_link ( struct dentry * old_dentry , struct inode * dir ,
struct dentry * new_dentry )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( old_dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_link , 0 , old_dentry , dir , new_dentry ) ;
2007-10-17 10:31:32 +04:00
}
int security_inode_unlink ( struct inode * dir , struct dentry * dentry )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_unlink , 0 , dir , dentry ) ;
2007-10-17 10:31:32 +04:00
}
int security_inode_symlink ( struct inode * dir , struct dentry * dentry ,
const char * old_name )
{
if ( unlikely ( IS_PRIVATE ( dir ) ) )
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_symlink , 0 , dir , dentry , old_name ) ;
2007-10-17 10:31:32 +04:00
}
2011-07-26 09:41:39 +04:00
int security_inode_mkdir ( struct inode * dir , struct dentry * dentry , umode_t mode )
2007-10-17 10:31:32 +04:00
{
if ( unlikely ( IS_PRIVATE ( dir ) ) )
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_mkdir , 0 , dir , dentry , mode ) ;
2007-10-17 10:31:32 +04:00
}
2009-04-03 19:42:40 +04:00
EXPORT_SYMBOL_GPL ( security_inode_mkdir ) ;
2007-10-17 10:31:32 +04:00
int security_inode_rmdir ( struct inode * dir , struct dentry * dentry )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_rmdir , 0 , dir , dentry ) ;
2007-10-17 10:31:32 +04:00
}
2011-07-26 09:52:52 +04:00
int security_inode_mknod ( struct inode * dir , struct dentry * dentry , umode_t mode , dev_t dev )
2007-10-17 10:31:32 +04:00
{
if ( unlikely ( IS_PRIVATE ( dir ) ) )
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_mknod , 0 , dir , dentry , mode , dev ) ;
2007-10-17 10:31:32 +04:00
}
int security_inode_rename ( struct inode * old_dir , struct dentry * old_dentry ,
2014-04-01 19:08:43 +04:00
struct inode * new_dir , struct dentry * new_dentry ,
unsigned int flags )
2007-10-17 10:31:32 +04:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( old_dentry ) ) | |
( d_is_positive ( new_dentry ) & & IS_PRIVATE ( d_backing_inode ( new_dentry ) ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2014-04-01 19:08:43 +04:00
if ( flags & RENAME_EXCHANGE ) {
2015-05-03 01:11:29 +03:00
int err = call_int_hook ( inode_rename , 0 , new_dir , new_dentry ,
2014-04-01 19:08:43 +04:00
old_dir , old_dentry ) ;
if ( err )
return err ;
}
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_rename , 0 , old_dir , old_dentry ,
2007-10-17 10:31:32 +04:00
new_dir , new_dentry ) ;
}
int security_inode_readlink ( struct dentry * dentry )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_readlink , 0 , dentry ) ;
2007-10-17 10:31:32 +04:00
}
2015-03-23 05:37:39 +03:00
int security_inode_follow_link ( struct dentry * dentry , struct inode * inode ,
bool rcu )
2007-10-17 10:31:32 +04:00
{
2015-03-23 05:37:39 +03:00
if ( unlikely ( IS_PRIVATE ( inode ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-06-27 23:26:03 +03:00
return call_int_hook ( inode_follow_link , 0 , dentry , inode , rcu ) ;
2007-10-17 10:31:32 +04:00
}
2008-07-17 17:37:02 +04:00
int security_inode_permission ( struct inode * inode , int mask )
2007-10-17 10:31:32 +04:00
{
if ( unlikely ( IS_PRIVATE ( inode ) ) )
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_permission , 0 , inode , mask ) ;
2007-10-17 10:31:32 +04:00
}
2022-06-21 17:14:53 +03:00
int security_inode_setattr ( struct user_namespace * mnt_userns ,
struct dentry * dentry , struct iattr * attr )
2007-10-17 10:31:32 +04:00
{
2011-05-13 20:53:38 +04:00
int ret ;
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
ret = call_int_hook ( inode_setattr , 0 , dentry , attr ) ;
2011-05-13 20:53:38 +04:00
if ( ret )
return ret ;
2022-06-21 17:14:53 +03:00
return evm_inode_setattr ( mnt_userns , dentry , attr ) ;
2007-10-17 10:31:32 +04:00
}
2008-07-01 17:01:28 +04:00
EXPORT_SYMBOL_GPL ( security_inode_setattr ) ;
2007-10-17 10:31:32 +04:00
2015-03-09 02:28:30 +03:00
int security_inode_getattr ( const struct path * path )
2007-10-17 10:31:32 +04:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( path - > dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_getattr , 0 , path ) ;
2007-10-17 10:31:32 +04:00
}
2021-01-21 16:19:29 +03:00
int security_inode_setxattr ( struct user_namespace * mnt_userns ,
struct dentry * dentry , const char * name ,
2008-04-29 11:59:41 +04:00
const void * value , size_t size , int flags )
2007-10-17 10:31:32 +04:00
{
2011-03-09 22:38:26 +03:00
int ret ;
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:42 +03:00
/*
* SELinux and Smack integrate the cap call ,
* so assume that all LSMs supplying this call do so .
*/
2021-01-21 16:19:29 +03:00
ret = call_int_hook ( inode_setxattr , 1 , mnt_userns , dentry , name , value ,
size , flags ) ;
2015-05-03 01:11:42 +03:00
if ( ret = = 1 )
ret = cap_inode_setxattr ( dentry , name , value , size , flags ) ;
2011-03-11 02:54:15 +03:00
if ( ret )
return ret ;
ret = ima_inode_setxattr ( dentry , name , value , size ) ;
2011-03-09 22:38:26 +03:00
if ( ret )
return ret ;
2021-05-14 18:27:48 +03:00
return evm_inode_setxattr ( mnt_userns , dentry , name , value , size ) ;
2007-10-17 10:31:32 +04:00
}
2008-04-29 11:59:41 +04:00
void security_inode_post_setxattr ( struct dentry * dentry , const char * name ,
const void * value , size_t size , int flags )
2007-10-17 10:31:32 +04:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return ;
2015-05-03 01:11:29 +03:00
call_void_hook ( inode_post_setxattr , dentry , name , value , size , flags ) ;
2011-03-09 22:38:26 +03:00
evm_inode_post_setxattr ( dentry , name , value , size ) ;
2007-10-17 10:31:32 +04:00
}
2008-04-29 11:59:41 +04:00
int security_inode_getxattr ( struct dentry * dentry , const char * name )
2007-10-17 10:31:32 +04:00
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_getxattr , 0 , dentry , name ) ;
2007-10-17 10:31:32 +04:00
}
int security_inode_listxattr ( struct dentry * dentry )
{
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_listxattr , 0 , dentry ) ;
2007-10-17 10:31:32 +04:00
}
2021-01-21 16:19:29 +03:00
int security_inode_removexattr ( struct user_namespace * mnt_userns ,
struct dentry * dentry , const char * name )
2007-10-17 10:31:32 +04:00
{
2011-03-09 22:38:26 +03:00
int ret ;
2015-03-18 01:26:22 +03:00
if ( unlikely ( IS_PRIVATE ( d_backing_inode ( dentry ) ) ) )
2007-10-17 10:31:32 +04:00
return 0 ;
2015-05-03 01:11:42 +03:00
/*
* SELinux and Smack integrate the cap call ,
* so assume that all LSMs supplying this call do so .
*/
2021-01-21 16:19:29 +03:00
ret = call_int_hook ( inode_removexattr , 1 , mnt_userns , dentry , name ) ;
2015-05-03 01:11:42 +03:00
if ( ret = = 1 )
2021-01-21 16:19:29 +03:00
ret = cap_inode_removexattr ( mnt_userns , dentry , name ) ;
2011-03-11 02:54:15 +03:00
if ( ret )
return ret ;
ret = ima_inode_removexattr ( dentry , name ) ;
2011-03-09 22:38:26 +03:00
if ( ret )
return ret ;
2021-05-14 18:27:48 +03:00
return evm_inode_removexattr ( mnt_userns , dentry , name ) ;
2007-10-17 10:31:32 +04:00
}
Implement file posix capabilities
Implement file posix capabilities. This allows programs to be given a
subset of root's powers regardless of who runs them, without having to use
setuid and giving the binary all of root's powers.
This version works with Kaigai Kohei's userspace tools, found at
http://www.kaigai.gr.jp/index.php. For more information on how to use this
patch, Chris Friedhoff has posted a nice page at
http://www.friedhoff.org/fscaps.html.
Changelog:
Nov 27:
Incorporate fixes from Andrew Morton
(security-introduce-file-caps-tweaks and
security-introduce-file-caps-warning-fix)
Fix Kconfig dependency.
Fix change signaling behavior when file caps are not compiled in.
Nov 13:
Integrate comments from Alexey: Remove CONFIG_ ifdef from
capability.h, and use %zd for printing a size_t.
Nov 13:
Fix endianness warnings by sparse as suggested by Alexey
Dobriyan.
Nov 09:
Address warnings of unused variables at cap_bprm_set_security
when file capabilities are disabled, and simultaneously clean
up the code a little, by pulling the new code into a helper
function.
Nov 08:
For pointers to required userspace tools and how to use
them, see http://www.friedhoff.org/fscaps.html.
Nov 07:
Fix the calculation of the highest bit checked in
check_cap_sanity().
Nov 07:
Allow file caps to be enabled without CONFIG_SECURITY, since
capabilities are the default.
Hook cap_task_setscheduler when !CONFIG_SECURITY.
Move capable(TASK_KILL) to end of cap_task_kill to reduce
audit messages.
Nov 05:
Add secondary calls in selinux/hooks.c to task_setioprio and
task_setscheduler so that selinux and capabilities with file
cap support can be stacked.
Sep 05:
As Seth Arnold points out, uid checks are out of place
for capability code.
Sep 01:
Define task_setscheduler, task_setioprio, cap_task_kill, and
task_setnice to make sure a user cannot affect a process in which
they called a program with some fscaps.
One remaining question is the note under task_setscheduler: are we
ok with CAP_SYS_NICE being sufficient to confine a process to a
cpuset?
It is a semantic change, as without fsccaps, attach_task doesn't
allow CAP_SYS_NICE to override the uid equivalence check. But since
it uses security_task_setscheduler, which elsewhere is used where
CAP_SYS_NICE can be used to override the uid equivalence check,
fixing it might be tough.
task_setscheduler
note: this also controls cpuset:attach_task. Are we ok with
CAP_SYS_NICE being used to confine to a cpuset?
task_setioprio
task_setnice
sys_setpriority uses this (through set_one_prio) for another
process. Need same checks as setrlimit
Aug 21:
Updated secureexec implementation to reflect the fact that
euid and uid might be the same and nonzero, but the process
might still have elevated caps.
Aug 15:
Handle endianness of xattrs.
Enforce capability version match between kernel and disk.
Enforce that no bits beyond the known max capability are
set, else return -EPERM.
With this extra processing, it may be worth reconsidering
doing all the work at bprm_set_security rather than
d_instantiate.
Aug 10:
Always call getxattr at bprm_set_security, rather than
caching it at d_instantiate.
[morgan@kernel.org: file-caps clean up for linux/capability.h]
[bunk@kernel.org: unexport cap_inode_killpriv]
Signed-off-by: Serge E. Hallyn <serue@us.ibm.com>
Cc: Stephen Smalley <sds@tycho.nsa.gov>
Cc: James Morris <jmorris@namei.org>
Cc: Chris Wright <chrisw@sous-sol.org>
Cc: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Adrian Bunk <bunk@kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2007-10-17 10:31:36 +04:00
int security_inode_need_killpriv ( struct dentry * dentry )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_need_killpriv , 0 , dentry ) ;
Implement file posix capabilities
Implement file posix capabilities. This allows programs to be given a
subset of root's powers regardless of who runs them, without having to use
setuid and giving the binary all of root's powers.
This version works with Kaigai Kohei's userspace tools, found at
http://www.kaigai.gr.jp/index.php. For more information on how to use this
patch, Chris Friedhoff has posted a nice page at
http://www.friedhoff.org/fscaps.html.
Changelog:
Nov 27:
Incorporate fixes from Andrew Morton
(security-introduce-file-caps-tweaks and
security-introduce-file-caps-warning-fix)
Fix Kconfig dependency.
Fix change signaling behavior when file caps are not compiled in.
Nov 13:
Integrate comments from Alexey: Remove CONFIG_ ifdef from
capability.h, and use %zd for printing a size_t.
Nov 13:
Fix endianness warnings by sparse as suggested by Alexey
Dobriyan.
Nov 09:
Address warnings of unused variables at cap_bprm_set_security
when file capabilities are disabled, and simultaneously clean
up the code a little, by pulling the new code into a helper
function.
Nov 08:
For pointers to required userspace tools and how to use
them, see http://www.friedhoff.org/fscaps.html.
Nov 07:
Fix the calculation of the highest bit checked in
check_cap_sanity().
Nov 07:
Allow file caps to be enabled without CONFIG_SECURITY, since
capabilities are the default.
Hook cap_task_setscheduler when !CONFIG_SECURITY.
Move capable(TASK_KILL) to end of cap_task_kill to reduce
audit messages.
Nov 05:
Add secondary calls in selinux/hooks.c to task_setioprio and
task_setscheduler so that selinux and capabilities with file
cap support can be stacked.
Sep 05:
As Seth Arnold points out, uid checks are out of place
for capability code.
Sep 01:
Define task_setscheduler, task_setioprio, cap_task_kill, and
task_setnice to make sure a user cannot affect a process in which
they called a program with some fscaps.
One remaining question is the note under task_setscheduler: are we
ok with CAP_SYS_NICE being sufficient to confine a process to a
cpuset?
It is a semantic change, as without fsccaps, attach_task doesn't
allow CAP_SYS_NICE to override the uid equivalence check. But since
it uses security_task_setscheduler, which elsewhere is used where
CAP_SYS_NICE can be used to override the uid equivalence check,
fixing it might be tough.
task_setscheduler
note: this also controls cpuset:attach_task. Are we ok with
CAP_SYS_NICE being used to confine to a cpuset?
task_setioprio
task_setnice
sys_setpriority uses this (through set_one_prio) for another
process. Need same checks as setrlimit
Aug 21:
Updated secureexec implementation to reflect the fact that
euid and uid might be the same and nonzero, but the process
might still have elevated caps.
Aug 15:
Handle endianness of xattrs.
Enforce capability version match between kernel and disk.
Enforce that no bits beyond the known max capability are
set, else return -EPERM.
With this extra processing, it may be worth reconsidering
doing all the work at bprm_set_security rather than
d_instantiate.
Aug 10:
Always call getxattr at bprm_set_security, rather than
caching it at d_instantiate.
[morgan@kernel.org: file-caps clean up for linux/capability.h]
[bunk@kernel.org: unexport cap_inode_killpriv]
Signed-off-by: Serge E. Hallyn <serue@us.ibm.com>
Cc: Stephen Smalley <sds@tycho.nsa.gov>
Cc: James Morris <jmorris@namei.org>
Cc: Chris Wright <chrisw@sous-sol.org>
Cc: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Adrian Bunk <bunk@kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2007-10-17 10:31:36 +04:00
}
2021-01-21 16:19:29 +03:00
int security_inode_killpriv ( struct user_namespace * mnt_userns ,
struct dentry * dentry )
Implement file posix capabilities
Implement file posix capabilities. This allows programs to be given a
subset of root's powers regardless of who runs them, without having to use
setuid and giving the binary all of root's powers.
This version works with Kaigai Kohei's userspace tools, found at
http://www.kaigai.gr.jp/index.php. For more information on how to use this
patch, Chris Friedhoff has posted a nice page at
http://www.friedhoff.org/fscaps.html.
Changelog:
Nov 27:
Incorporate fixes from Andrew Morton
(security-introduce-file-caps-tweaks and
security-introduce-file-caps-warning-fix)
Fix Kconfig dependency.
Fix change signaling behavior when file caps are not compiled in.
Nov 13:
Integrate comments from Alexey: Remove CONFIG_ ifdef from
capability.h, and use %zd for printing a size_t.
Nov 13:
Fix endianness warnings by sparse as suggested by Alexey
Dobriyan.
Nov 09:
Address warnings of unused variables at cap_bprm_set_security
when file capabilities are disabled, and simultaneously clean
up the code a little, by pulling the new code into a helper
function.
Nov 08:
For pointers to required userspace tools and how to use
them, see http://www.friedhoff.org/fscaps.html.
Nov 07:
Fix the calculation of the highest bit checked in
check_cap_sanity().
Nov 07:
Allow file caps to be enabled without CONFIG_SECURITY, since
capabilities are the default.
Hook cap_task_setscheduler when !CONFIG_SECURITY.
Move capable(TASK_KILL) to end of cap_task_kill to reduce
audit messages.
Nov 05:
Add secondary calls in selinux/hooks.c to task_setioprio and
task_setscheduler so that selinux and capabilities with file
cap support can be stacked.
Sep 05:
As Seth Arnold points out, uid checks are out of place
for capability code.
Sep 01:
Define task_setscheduler, task_setioprio, cap_task_kill, and
task_setnice to make sure a user cannot affect a process in which
they called a program with some fscaps.
One remaining question is the note under task_setscheduler: are we
ok with CAP_SYS_NICE being sufficient to confine a process to a
cpuset?
It is a semantic change, as without fsccaps, attach_task doesn't
allow CAP_SYS_NICE to override the uid equivalence check. But since
it uses security_task_setscheduler, which elsewhere is used where
CAP_SYS_NICE can be used to override the uid equivalence check,
fixing it might be tough.
task_setscheduler
note: this also controls cpuset:attach_task. Are we ok with
CAP_SYS_NICE being used to confine to a cpuset?
task_setioprio
task_setnice
sys_setpriority uses this (through set_one_prio) for another
process. Need same checks as setrlimit
Aug 21:
Updated secureexec implementation to reflect the fact that
euid and uid might be the same and nonzero, but the process
might still have elevated caps.
Aug 15:
Handle endianness of xattrs.
Enforce capability version match between kernel and disk.
Enforce that no bits beyond the known max capability are
set, else return -EPERM.
With this extra processing, it may be worth reconsidering
doing all the work at bprm_set_security rather than
d_instantiate.
Aug 10:
Always call getxattr at bprm_set_security, rather than
caching it at d_instantiate.
[morgan@kernel.org: file-caps clean up for linux/capability.h]
[bunk@kernel.org: unexport cap_inode_killpriv]
Signed-off-by: Serge E. Hallyn <serue@us.ibm.com>
Cc: Stephen Smalley <sds@tycho.nsa.gov>
Cc: James Morris <jmorris@namei.org>
Cc: Chris Wright <chrisw@sous-sol.org>
Cc: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Adrian Bunk <bunk@kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2007-10-17 10:31:36 +04:00
{
2021-01-21 16:19:29 +03:00
return call_int_hook ( inode_killpriv , 0 , mnt_userns , dentry ) ;
Implement file posix capabilities
Implement file posix capabilities. This allows programs to be given a
subset of root's powers regardless of who runs them, without having to use
setuid and giving the binary all of root's powers.
This version works with Kaigai Kohei's userspace tools, found at
http://www.kaigai.gr.jp/index.php. For more information on how to use this
patch, Chris Friedhoff has posted a nice page at
http://www.friedhoff.org/fscaps.html.
Changelog:
Nov 27:
Incorporate fixes from Andrew Morton
(security-introduce-file-caps-tweaks and
security-introduce-file-caps-warning-fix)
Fix Kconfig dependency.
Fix change signaling behavior when file caps are not compiled in.
Nov 13:
Integrate comments from Alexey: Remove CONFIG_ ifdef from
capability.h, and use %zd for printing a size_t.
Nov 13:
Fix endianness warnings by sparse as suggested by Alexey
Dobriyan.
Nov 09:
Address warnings of unused variables at cap_bprm_set_security
when file capabilities are disabled, and simultaneously clean
up the code a little, by pulling the new code into a helper
function.
Nov 08:
For pointers to required userspace tools and how to use
them, see http://www.friedhoff.org/fscaps.html.
Nov 07:
Fix the calculation of the highest bit checked in
check_cap_sanity().
Nov 07:
Allow file caps to be enabled without CONFIG_SECURITY, since
capabilities are the default.
Hook cap_task_setscheduler when !CONFIG_SECURITY.
Move capable(TASK_KILL) to end of cap_task_kill to reduce
audit messages.
Nov 05:
Add secondary calls in selinux/hooks.c to task_setioprio and
task_setscheduler so that selinux and capabilities with file
cap support can be stacked.
Sep 05:
As Seth Arnold points out, uid checks are out of place
for capability code.
Sep 01:
Define task_setscheduler, task_setioprio, cap_task_kill, and
task_setnice to make sure a user cannot affect a process in which
they called a program with some fscaps.
One remaining question is the note under task_setscheduler: are we
ok with CAP_SYS_NICE being sufficient to confine a process to a
cpuset?
It is a semantic change, as without fsccaps, attach_task doesn't
allow CAP_SYS_NICE to override the uid equivalence check. But since
it uses security_task_setscheduler, which elsewhere is used where
CAP_SYS_NICE can be used to override the uid equivalence check,
fixing it might be tough.
task_setscheduler
note: this also controls cpuset:attach_task. Are we ok with
CAP_SYS_NICE being used to confine to a cpuset?
task_setioprio
task_setnice
sys_setpriority uses this (through set_one_prio) for another
process. Need same checks as setrlimit
Aug 21:
Updated secureexec implementation to reflect the fact that
euid and uid might be the same and nonzero, but the process
might still have elevated caps.
Aug 15:
Handle endianness of xattrs.
Enforce capability version match between kernel and disk.
Enforce that no bits beyond the known max capability are
set, else return -EPERM.
With this extra processing, it may be worth reconsidering
doing all the work at bprm_set_security rather than
d_instantiate.
Aug 10:
Always call getxattr at bprm_set_security, rather than
caching it at d_instantiate.
[morgan@kernel.org: file-caps clean up for linux/capability.h]
[bunk@kernel.org: unexport cap_inode_killpriv]
Signed-off-by: Serge E. Hallyn <serue@us.ibm.com>
Cc: Stephen Smalley <sds@tycho.nsa.gov>
Cc: James Morris <jmorris@namei.org>
Cc: Chris Wright <chrisw@sous-sol.org>
Cc: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Andrew Morgan <morgan@kernel.org>
Signed-off-by: Adrian Bunk <bunk@kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2007-10-17 10:31:36 +04:00
}
2021-01-21 16:19:29 +03:00
int security_inode_getsecurity ( struct user_namespace * mnt_userns ,
struct inode * inode , const char * name ,
void * * buffer , bool alloc )
2007-10-17 10:31:32 +04:00
{
2016-06-01 03:24:15 +03:00
struct security_hook_list * hp ;
int rc ;
2007-10-17 10:31:32 +04:00
if ( unlikely ( IS_PRIVATE ( inode ) ) )
2020-03-29 03:43:50 +03:00
return LSM_RET_DEFAULT ( inode_getsecurity ) ;
2016-06-01 03:24:15 +03:00
/*
* Only one module will provide an attribute with a given name .
*/
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( hp , & security_hook_heads . inode_getsecurity , list ) {
2021-01-21 16:19:29 +03:00
rc = hp - > hook . inode_getsecurity ( mnt_userns , inode , name , buffer , alloc ) ;
2020-03-29 03:43:50 +03:00
if ( rc ! = LSM_RET_DEFAULT ( inode_getsecurity ) )
2016-06-01 03:24:15 +03:00
return rc ;
}
2020-03-29 03:43:50 +03:00
return LSM_RET_DEFAULT ( inode_getsecurity ) ;
2007-10-17 10:31:32 +04:00
}
int security_inode_setsecurity ( struct inode * inode , const char * name , const void * value , size_t size , int flags )
{
2016-06-01 03:24:15 +03:00
struct security_hook_list * hp ;
int rc ;
2007-10-17 10:31:32 +04:00
if ( unlikely ( IS_PRIVATE ( inode ) ) )
2020-03-29 03:43:50 +03:00
return LSM_RET_DEFAULT ( inode_setsecurity ) ;
2016-06-01 03:24:15 +03:00
/*
* Only one module will provide an attribute with a given name .
*/
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( hp , & security_hook_heads . inode_setsecurity , list ) {
2016-06-01 03:24:15 +03:00
rc = hp - > hook . inode_setsecurity ( inode , name , value , size ,
flags ) ;
2020-03-29 03:43:50 +03:00
if ( rc ! = LSM_RET_DEFAULT ( inode_setsecurity ) )
2016-06-01 03:24:15 +03:00
return rc ;
}
2020-03-29 03:43:50 +03:00
return LSM_RET_DEFAULT ( inode_setsecurity ) ;
2007-10-17 10:31:32 +04:00
}
int security_inode_listsecurity ( struct inode * inode , char * buffer , size_t buffer_size )
{
if ( unlikely ( IS_PRIVATE ( inode ) ) )
return 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_listsecurity , 0 , inode , buffer , buffer_size ) ;
2007-10-17 10:31:32 +04:00
}
2013-05-22 20:50:45 +04:00
EXPORT_SYMBOL ( security_inode_listsecurity ) ;
2007-10-17 10:31:32 +04:00
2015-12-24 19:09:39 +03:00
void security_inode_getsecid ( struct inode * inode , u32 * secid )
2008-03-01 22:51:09 +03:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( inode_getsecid , inode , secid ) ;
2008-03-01 22:51:09 +03:00
}
2016-07-13 18:13:56 +03:00
int security_inode_copy_up ( struct dentry * src , struct cred * * new )
{
return call_int_hook ( inode_copy_up , 0 , src , new ) ;
}
EXPORT_SYMBOL ( security_inode_copy_up ) ;
2016-07-13 17:44:49 +03:00
int security_inode_copy_up_xattr ( const char * name )
{
2020-06-22 01:21:35 +03:00
struct security_hook_list * hp ;
int rc ;
/*
* The implementation can return 0 ( accept the xattr ) , 1 ( discard the
* xattr ) , - EOPNOTSUPP if it does not know anything about the xattr or
* any other error code incase of an error .
*/
hlist_for_each_entry ( hp ,
& security_hook_heads . inode_copy_up_xattr , list ) {
rc = hp - > hook . inode_copy_up_xattr ( name ) ;
if ( rc ! = LSM_RET_DEFAULT ( inode_copy_up_xattr ) )
return rc ;
}
return LSM_RET_DEFAULT ( inode_copy_up_xattr ) ;
2016-07-13 17:44:49 +03:00
}
EXPORT_SYMBOL ( security_inode_copy_up_xattr ) ;
2019-02-22 17:57:16 +03:00
int security_kernfs_init_security ( struct kernfs_node * kn_dir ,
struct kernfs_node * kn )
{
return call_int_hook ( kernfs_init_security , 0 , kn_dir , kn ) ;
}
2007-10-17 10:31:32 +04:00
int security_file_permission ( struct file * file , int mask )
{
2009-12-18 05:24:34 +03:00
int ret ;
2015-05-03 01:11:29 +03:00
ret = call_int_hook ( file_permission , 0 , file , mask ) ;
2009-12-18 05:24:34 +03:00
if ( ret )
return ret ;
return fsnotify_perm ( file , mask ) ;
2007-10-17 10:31:32 +04:00
}
int security_file_alloc ( struct file * file )
{
2018-11-12 23:02:49 +03:00
int rc = lsm_file_alloc ( file ) ;
if ( rc )
return rc ;
rc = call_int_hook ( file_alloc_security , 0 , file ) ;
if ( unlikely ( rc ) )
security_file_free ( file ) ;
return rc ;
2007-10-17 10:31:32 +04:00
}
void security_file_free ( struct file * file )
{
2018-11-12 23:02:49 +03:00
void * blob ;
2015-05-03 01:11:29 +03:00
call_void_hook ( file_free_security , file ) ;
2018-11-12 23:02:49 +03:00
blob = file - > f_security ;
if ( blob ) {
file - > f_security = NULL ;
kmem_cache_free ( lsm_file_cache , blob ) ;
}
2007-10-17 10:31:32 +04:00
}
int security_file_ioctl ( struct file * file , unsigned int cmd , unsigned long arg )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( file_ioctl , 0 , file , cmd , arg ) ;
2007-10-17 10:31:32 +04:00
}
2020-06-02 23:20:26 +03:00
EXPORT_SYMBOL_GPL ( security_file_ioctl ) ;
2007-10-17 10:31:32 +04:00
2012-05-31 03:58:30 +04:00
static inline unsigned long mmap_prot ( struct file * file , unsigned long prot )
2007-10-17 10:31:32 +04:00
{
2012-05-31 01:11:23 +04:00
/*
2012-05-31 03:58:30 +04:00
* Does we have PROT_READ and does the application expect
* it to imply PROT_EXEC ? If not , nothing to talk about . . .
2012-05-31 01:11:23 +04:00
*/
2012-05-31 03:58:30 +04:00
if ( ( prot & ( PROT_READ | PROT_EXEC ) ) ! = PROT_READ )
return prot ;
2012-05-31 01:11:23 +04:00
if ( ! ( current - > personality & READ_IMPLIES_EXEC ) )
2012-05-31 03:58:30 +04:00
return prot ;
/*
* if that ' s an anonymous mapping , let it .
*/
if ( ! file )
return prot | PROT_EXEC ;
/*
* ditto if it ' s not on noexec mount , except that on ! MMU we need
2015-01-14 12:42:32 +03:00
* NOMMU_MAP_EXEC ( = = VM_MAYEXEC ) in this case
2012-05-31 03:58:30 +04:00
*/
2015-06-29 22:42:03 +03:00
if ( ! path_noexec ( & file - > f_path ) ) {
2012-05-31 01:11:23 +04:00
# ifndef CONFIG_MMU
2015-01-14 12:42:32 +03:00
if ( file - > f_op - > mmap_capabilities ) {
unsigned caps = file - > f_op - > mmap_capabilities ( file ) ;
if ( ! ( caps & NOMMU_MAP_EXEC ) )
return prot ;
}
2012-05-31 01:11:23 +04:00
# endif
2012-05-31 03:58:30 +04:00
return prot | PROT_EXEC ;
2012-05-31 01:11:23 +04:00
}
2012-05-31 03:58:30 +04:00
/* anything on noexec mount won't get PROT_EXEC */
return prot ;
}
int security_mmap_file ( struct file * file , unsigned long prot ,
unsigned long flags )
{
int ret ;
2015-05-03 01:11:29 +03:00
ret = call_int_hook ( mmap_file , 0 , file , prot ,
2012-05-31 03:58:30 +04:00
mmap_prot ( file , prot ) , flags ) ;
2009-10-23 01:30:13 +04:00
if ( ret )
return ret ;
return ima_file_mmap ( file , prot ) ;
2007-10-17 10:31:32 +04:00
}
2012-05-30 21:30:51 +04:00
int security_mmap_addr ( unsigned long addr )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( mmap_addr , 0 , addr ) ;
2012-05-30 21:30:51 +04:00
}
2007-10-17 10:31:32 +04:00
int security_file_mprotect ( struct vm_area_struct * vma , unsigned long reqprot ,
unsigned long prot )
{
2020-05-03 08:00:02 +03:00
int ret ;
ret = call_int_hook ( file_mprotect , 0 , vma , reqprot , prot ) ;
if ( ret )
return ret ;
return ima_file_mprotect ( vma , prot ) ;
2007-10-17 10:31:32 +04:00
}
int security_file_lock ( struct file * file , unsigned int cmd )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( file_lock , 0 , file , cmd ) ;
2007-10-17 10:31:32 +04:00
}
int security_file_fcntl ( struct file * file , unsigned int cmd , unsigned long arg )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( file_fcntl , 0 , file , cmd , arg ) ;
2007-10-17 10:31:32 +04:00
}
2014-08-22 19:27:32 +04:00
void security_file_set_fowner ( struct file * file )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( file_set_fowner , file ) ;
2007-10-17 10:31:32 +04:00
}
int security_file_send_sigiotask ( struct task_struct * tsk ,
struct fown_struct * fown , int sig )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( file_send_sigiotask , 0 , tsk , fown , sig ) ;
2007-10-17 10:31:32 +04:00
}
int security_file_receive ( struct file * file )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( file_receive , 0 , file ) ;
2007-10-17 10:31:32 +04:00
}
2018-07-10 20:25:29 +03:00
int security_file_open ( struct file * file )
2007-10-17 10:31:32 +04:00
{
2009-12-18 05:24:34 +03:00
int ret ;
2018-07-10 21:13:18 +03:00
ret = call_int_hook ( file_open , 0 , file ) ;
2009-12-18 05:24:34 +03:00
if ( ret )
return ret ;
return fsnotify_perm ( file , MAY_OPEN ) ;
2007-10-17 10:31:32 +04:00
}
2017-03-24 14:46:33 +03:00
int security_task_alloc ( struct task_struct * task , unsigned long clone_flags )
{
2018-09-22 03:19:37 +03:00
int rc = lsm_task_alloc ( task ) ;
if ( rc )
return rc ;
rc = call_int_hook ( task_alloc , 0 , task , clone_flags ) ;
if ( unlikely ( rc ) )
security_task_free ( task ) ;
return rc ;
2017-03-24 14:46:33 +03:00
}
2011-12-22 00:17:03 +04:00
void security_task_free ( struct task_struct * task )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( task_free , task ) ;
2018-09-22 03:19:37 +03:00
kfree ( task - > security ) ;
task - > security = NULL ;
2011-12-22 00:17:03 +04:00
}
KEYS: Add a keyctl to install a process's session keyring on its parent [try #6]
Add a keyctl to install a process's session keyring onto its parent. This
replaces the parent's session keyring. Because the COW credential code does
not permit one process to change another process's credentials directly, the
change is deferred until userspace next starts executing again. Normally this
will be after a wait*() syscall.
To support this, three new security hooks have been provided:
cred_alloc_blank() to allocate unset security creds, cred_transfer() to fill in
the blank security creds and key_session_to_parent() - which asks the LSM if
the process may replace its parent's session keyring.
The replacement may only happen if the process has the same ownership details
as its parent, and the process has LINK permission on the session keyring, and
the session keyring is owned by the process, and the LSM permits it.
Note that this requires alteration to each architecture's notify_resume path.
This has been done for all arches barring blackfin, m68k* and xtensa, all of
which need assembly alteration to support TIF_NOTIFY_RESUME. This allows the
replacement to be performed at the point the parent process resumes userspace
execution.
This allows the userspace AFS pioctl emulation to fully emulate newpag() and
the VIOCSETTOK and VIOCSETTOK2 pioctls, all of which require the ability to
alter the parent process's PAG membership. However, since kAFS doesn't use
PAGs per se, but rather dumps the keys into the session keyring, the session
keyring of the parent must be replaced if, for example, VIOCSETTOK is passed
the newpag flag.
This can be tested with the following program:
#include <stdio.h>
#include <stdlib.h>
#include <keyutils.h>
#define KEYCTL_SESSION_TO_PARENT 18
#define OSERROR(X, S) do { if ((long)(X) == -1) { perror(S); exit(1); } } while(0)
int main(int argc, char **argv)
{
key_serial_t keyring, key;
long ret;
keyring = keyctl_join_session_keyring(argv[1]);
OSERROR(keyring, "keyctl_join_session_keyring");
key = add_key("user", "a", "b", 1, keyring);
OSERROR(key, "add_key");
ret = keyctl(KEYCTL_SESSION_TO_PARENT);
OSERROR(ret, "KEYCTL_SESSION_TO_PARENT");
return 0;
}
Compiled and linked with -lkeyutils, you should see something like:
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
355907932 --alswrv 4043 -1 \_ keyring: _uid.4043
[dhowells@andromeda ~]$ /tmp/newpag
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
1055658746 --alswrv 4043 4043 \_ user: a
[dhowells@andromeda ~]$ /tmp/newpag hello
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: hello
340417692 --alswrv 4043 4043 \_ user: a
Where the test program creates a new session keyring, sticks a user key named
'a' into it and then installs it on its parent.
Signed-off-by: David Howells <dhowells@redhat.com>
Signed-off-by: James Morris <jmorris@namei.org>
2009-09-02 12:14:21 +04:00
int security_cred_alloc_blank ( struct cred * cred , gfp_t gfp )
{
2018-11-12 20:30:56 +03:00
int rc = lsm_cred_alloc ( cred , gfp ) ;
if ( rc )
return rc ;
rc = call_int_hook ( cred_alloc_blank , 0 , cred , gfp ) ;
2018-11-12 23:02:49 +03:00
if ( unlikely ( rc ) )
2018-11-12 20:30:56 +03:00
security_cred_free ( cred ) ;
return rc ;
KEYS: Add a keyctl to install a process's session keyring on its parent [try #6]
Add a keyctl to install a process's session keyring onto its parent. This
replaces the parent's session keyring. Because the COW credential code does
not permit one process to change another process's credentials directly, the
change is deferred until userspace next starts executing again. Normally this
will be after a wait*() syscall.
To support this, three new security hooks have been provided:
cred_alloc_blank() to allocate unset security creds, cred_transfer() to fill in
the blank security creds and key_session_to_parent() - which asks the LSM if
the process may replace its parent's session keyring.
The replacement may only happen if the process has the same ownership details
as its parent, and the process has LINK permission on the session keyring, and
the session keyring is owned by the process, and the LSM permits it.
Note that this requires alteration to each architecture's notify_resume path.
This has been done for all arches barring blackfin, m68k* and xtensa, all of
which need assembly alteration to support TIF_NOTIFY_RESUME. This allows the
replacement to be performed at the point the parent process resumes userspace
execution.
This allows the userspace AFS pioctl emulation to fully emulate newpag() and
the VIOCSETTOK and VIOCSETTOK2 pioctls, all of which require the ability to
alter the parent process's PAG membership. However, since kAFS doesn't use
PAGs per se, but rather dumps the keys into the session keyring, the session
keyring of the parent must be replaced if, for example, VIOCSETTOK is passed
the newpag flag.
This can be tested with the following program:
#include <stdio.h>
#include <stdlib.h>
#include <keyutils.h>
#define KEYCTL_SESSION_TO_PARENT 18
#define OSERROR(X, S) do { if ((long)(X) == -1) { perror(S); exit(1); } } while(0)
int main(int argc, char **argv)
{
key_serial_t keyring, key;
long ret;
keyring = keyctl_join_session_keyring(argv[1]);
OSERROR(keyring, "keyctl_join_session_keyring");
key = add_key("user", "a", "b", 1, keyring);
OSERROR(key, "add_key");
ret = keyctl(KEYCTL_SESSION_TO_PARENT);
OSERROR(ret, "KEYCTL_SESSION_TO_PARENT");
return 0;
}
Compiled and linked with -lkeyutils, you should see something like:
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
355907932 --alswrv 4043 -1 \_ keyring: _uid.4043
[dhowells@andromeda ~]$ /tmp/newpag
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
1055658746 --alswrv 4043 4043 \_ user: a
[dhowells@andromeda ~]$ /tmp/newpag hello
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: hello
340417692 --alswrv 4043 4043 \_ user: a
Where the test program creates a new session keyring, sticks a user key named
'a' into it and then installs it on its parent.
Signed-off-by: David Howells <dhowells@redhat.com>
Signed-off-by: James Morris <jmorris@namei.org>
2009-09-02 12:14:21 +04:00
}
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
void security_cred_free ( struct cred * cred )
2007-10-17 10:31:32 +04:00
{
2019-01-17 02:41:11 +03:00
/*
* There is a failure case in prepare_creds ( ) that
* may result in a call here with - > security being NULL .
*/
if ( unlikely ( cred - > security = = NULL ) )
return ;
2015-05-03 01:11:29 +03:00
call_void_hook ( cred_free , cred ) ;
2018-11-12 20:30:56 +03:00
kfree ( cred - > security ) ;
cred - > security = NULL ;
2007-10-17 10:31:32 +04:00
}
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
int security_prepare_creds ( struct cred * new , const struct cred * old , gfp_t gfp )
2007-10-17 10:31:32 +04:00
{
2018-11-12 20:30:56 +03:00
int rc = lsm_cred_alloc ( new , gfp ) ;
if ( rc )
return rc ;
rc = call_int_hook ( cred_prepare , 0 , new , old , gfp ) ;
2018-11-12 23:02:49 +03:00
if ( unlikely ( rc ) )
2018-11-12 20:30:56 +03:00
security_cred_free ( new ) ;
return rc ;
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
}
KEYS: Add a keyctl to install a process's session keyring on its parent [try #6]
Add a keyctl to install a process's session keyring onto its parent. This
replaces the parent's session keyring. Because the COW credential code does
not permit one process to change another process's credentials directly, the
change is deferred until userspace next starts executing again. Normally this
will be after a wait*() syscall.
To support this, three new security hooks have been provided:
cred_alloc_blank() to allocate unset security creds, cred_transfer() to fill in
the blank security creds and key_session_to_parent() - which asks the LSM if
the process may replace its parent's session keyring.
The replacement may only happen if the process has the same ownership details
as its parent, and the process has LINK permission on the session keyring, and
the session keyring is owned by the process, and the LSM permits it.
Note that this requires alteration to each architecture's notify_resume path.
This has been done for all arches barring blackfin, m68k* and xtensa, all of
which need assembly alteration to support TIF_NOTIFY_RESUME. This allows the
replacement to be performed at the point the parent process resumes userspace
execution.
This allows the userspace AFS pioctl emulation to fully emulate newpag() and
the VIOCSETTOK and VIOCSETTOK2 pioctls, all of which require the ability to
alter the parent process's PAG membership. However, since kAFS doesn't use
PAGs per se, but rather dumps the keys into the session keyring, the session
keyring of the parent must be replaced if, for example, VIOCSETTOK is passed
the newpag flag.
This can be tested with the following program:
#include <stdio.h>
#include <stdlib.h>
#include <keyutils.h>
#define KEYCTL_SESSION_TO_PARENT 18
#define OSERROR(X, S) do { if ((long)(X) == -1) { perror(S); exit(1); } } while(0)
int main(int argc, char **argv)
{
key_serial_t keyring, key;
long ret;
keyring = keyctl_join_session_keyring(argv[1]);
OSERROR(keyring, "keyctl_join_session_keyring");
key = add_key("user", "a", "b", 1, keyring);
OSERROR(key, "add_key");
ret = keyctl(KEYCTL_SESSION_TO_PARENT);
OSERROR(ret, "KEYCTL_SESSION_TO_PARENT");
return 0;
}
Compiled and linked with -lkeyutils, you should see something like:
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
355907932 --alswrv 4043 -1 \_ keyring: _uid.4043
[dhowells@andromeda ~]$ /tmp/newpag
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
1055658746 --alswrv 4043 4043 \_ user: a
[dhowells@andromeda ~]$ /tmp/newpag hello
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: hello
340417692 --alswrv 4043 4043 \_ user: a
Where the test program creates a new session keyring, sticks a user key named
'a' into it and then installs it on its parent.
Signed-off-by: David Howells <dhowells@redhat.com>
Signed-off-by: James Morris <jmorris@namei.org>
2009-09-02 12:14:21 +04:00
void security_transfer_creds ( struct cred * new , const struct cred * old )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( cred_transfer , new , old ) ;
KEYS: Add a keyctl to install a process's session keyring on its parent [try #6]
Add a keyctl to install a process's session keyring onto its parent. This
replaces the parent's session keyring. Because the COW credential code does
not permit one process to change another process's credentials directly, the
change is deferred until userspace next starts executing again. Normally this
will be after a wait*() syscall.
To support this, three new security hooks have been provided:
cred_alloc_blank() to allocate unset security creds, cred_transfer() to fill in
the blank security creds and key_session_to_parent() - which asks the LSM if
the process may replace its parent's session keyring.
The replacement may only happen if the process has the same ownership details
as its parent, and the process has LINK permission on the session keyring, and
the session keyring is owned by the process, and the LSM permits it.
Note that this requires alteration to each architecture's notify_resume path.
This has been done for all arches barring blackfin, m68k* and xtensa, all of
which need assembly alteration to support TIF_NOTIFY_RESUME. This allows the
replacement to be performed at the point the parent process resumes userspace
execution.
This allows the userspace AFS pioctl emulation to fully emulate newpag() and
the VIOCSETTOK and VIOCSETTOK2 pioctls, all of which require the ability to
alter the parent process's PAG membership. However, since kAFS doesn't use
PAGs per se, but rather dumps the keys into the session keyring, the session
keyring of the parent must be replaced if, for example, VIOCSETTOK is passed
the newpag flag.
This can be tested with the following program:
#include <stdio.h>
#include <stdlib.h>
#include <keyutils.h>
#define KEYCTL_SESSION_TO_PARENT 18
#define OSERROR(X, S) do { if ((long)(X) == -1) { perror(S); exit(1); } } while(0)
int main(int argc, char **argv)
{
key_serial_t keyring, key;
long ret;
keyring = keyctl_join_session_keyring(argv[1]);
OSERROR(keyring, "keyctl_join_session_keyring");
key = add_key("user", "a", "b", 1, keyring);
OSERROR(key, "add_key");
ret = keyctl(KEYCTL_SESSION_TO_PARENT);
OSERROR(ret, "KEYCTL_SESSION_TO_PARENT");
return 0;
}
Compiled and linked with -lkeyutils, you should see something like:
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
355907932 --alswrv 4043 -1 \_ keyring: _uid.4043
[dhowells@andromeda ~]$ /tmp/newpag
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: _ses
1055658746 --alswrv 4043 4043 \_ user: a
[dhowells@andromeda ~]$ /tmp/newpag hello
[dhowells@andromeda ~]$ keyctl show
Session Keyring
-3 --alswrv 4043 4043 keyring: hello
340417692 --alswrv 4043 4043 \_ user: a
Where the test program creates a new session keyring, sticks a user key named
'a' into it and then installs it on its parent.
Signed-off-by: David Howells <dhowells@redhat.com>
Signed-off-by: James Morris <jmorris@namei.org>
2009-09-02 12:14:21 +04:00
}
2018-01-09 00:36:19 +03:00
void security_cred_getsecid ( const struct cred * c , u32 * secid )
{
* secid = 0 ;
call_void_hook ( cred_getsecid , c , secid ) ;
}
EXPORT_SYMBOL ( security_cred_getsecid ) ;
2008-11-14 02:39:28 +03:00
int security_kernel_act_as ( struct cred * new , u32 secid )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( kernel_act_as , 0 , new , secid ) ;
2008-11-14 02:39:28 +03:00
}
int security_kernel_create_files_as ( struct cred * new , struct inode * inode )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( kernel_create_files_as , 0 , new , inode ) ;
2008-11-14 02:39:28 +03:00
}
2009-11-03 08:35:32 +03:00
int security_kernel_module_request ( char * kmod_name )
2009-08-13 17:44:57 +04:00
{
integrity: prevent deadlock during digsig verification.
This patch aimed to prevent deadlock during digsig verification.The point
of issue - user space utility modprobe and/or it's dependencies (ld-*.so,
libz.so.*, libc-*.so and /lib/modules/ files) that could be used for
kernel modules load during digsig verification and could be signed by
digsig in the same time.
First at all, look at crypto_alloc_tfm() work algorithm:
crypto_alloc_tfm() will first attempt to locate an already loaded
algorithm. If that fails and the kernel supports dynamically loadable
modules, it will then attempt to load a module of the same name or alias.
If that fails it will send a query to any loaded crypto manager to
construct an algorithm on the fly.
We have situation, when public_key_verify_signature() in case of RSA
algorithm use alg_name to store internal information in order to construct
an algorithm on the fly, but crypto_larval_lookup() will try to use
alg_name in order to load kernel module with same name.
1) we can't do anything with crypto module work, since it designed to work
exactly in this way;
2) we can't globally filter module requests for modprobe, since it
designed to work with any requests.
In this patch, I propose add an exception for "crypto-pkcs1pad(rsa,*)"
module requests only in case of enabled integrity asymmetric keys support.
Since we don't have any real "crypto-pkcs1pad(rsa,*)" kernel modules for
sure, we are safe to fail such module request from crypto_larval_lookup().
In this way we prevent modprobe execution during digsig verification and
avoid possible deadlock if modprobe and/or it's dependencies also signed
with digsig.
Requested "crypto-pkcs1pad(rsa,*)" kernel module name formed by:
1) "pkcs1pad(rsa,%s)" in public_key_verify_signature();
2) "crypto-%s" / "crypto-%s-all" in crypto_larval_lookup().
"crypto-pkcs1pad(rsa," part of request is a constant and unique and could
be used as filter.
Signed-off-by: Mikhail Kurinnoi <viewizard@viewizard.com>
Signed-off-by: Mimi Zohar <zohar@linux.vnet.ibm.com>
include/linux/integrity.h | 13 +++++++++++++
security/integrity/digsig_asymmetric.c | 23 +++++++++++++++++++++++
security/security.c | 7 ++++++-
3 files changed, 42 insertions(+), 1 deletion(-)
2018-06-27 16:33:42 +03:00
int ret ;
ret = call_int_hook ( kernel_module_request , 0 , kmod_name ) ;
if ( ret )
return ret ;
return integrity_kernel_module_request ( kmod_name ) ;
2009-08-13 17:44:57 +04:00
}
2020-10-02 20:38:23 +03:00
int security_kernel_read_file ( struct file * file , enum kernel_read_file_id id ,
bool contents )
2016-01-31 06:23:26 +03:00
{
int ret ;
2020-10-02 20:38:23 +03:00
ret = call_int_hook ( kernel_read_file , 0 , file , id , contents ) ;
2016-01-31 06:23:26 +03:00
if ( ret )
return ret ;
2020-10-02 20:38:23 +03:00
return ima_read_file ( file , id , contents ) ;
2016-01-31 06:23:26 +03:00
}
EXPORT_SYMBOL_GPL ( security_kernel_read_file ) ;
2016-01-24 18:07:32 +03:00
int security_kernel_post_read_file ( struct file * file , char * buf , loff_t size ,
enum kernel_read_file_id id )
2015-12-29 00:02:29 +03:00
{
2016-01-15 01:57:47 +03:00
int ret ;
ret = call_int_hook ( kernel_post_read_file , 0 , file , buf , size , id ) ;
if ( ret )
return ret ;
return ima_post_read_file ( file , buf , size , id ) ;
2015-12-29 00:02:29 +03:00
}
EXPORT_SYMBOL_GPL ( security_kernel_post_read_file ) ;
2020-10-02 20:38:20 +03:00
int security_kernel_load_data ( enum kernel_load_data_id id , bool contents )
2018-07-13 21:05:56 +03:00
{
2018-07-13 21:05:58 +03:00
int ret ;
2020-10-02 20:38:20 +03:00
ret = call_int_hook ( kernel_load_data , 0 , id , contents ) ;
2018-07-13 21:05:58 +03:00
if ( ret )
return ret ;
2020-10-02 20:38:20 +03:00
return ima_load_data ( id , contents ) ;
2018-07-13 21:05:56 +03:00
}
2018-07-17 23:23:37 +03:00
EXPORT_SYMBOL_GPL ( security_kernel_load_data ) ;
2018-07-13 21:05:56 +03:00
2020-10-02 20:38:20 +03:00
int security_kernel_post_load_data ( char * buf , loff_t size ,
enum kernel_load_data_id id ,
char * description )
{
int ret ;
ret = call_int_hook ( kernel_post_load_data , 0 , buf , size , id ,
description ) ;
if ( ret )
return ret ;
return ima_post_load_data ( buf , size , id , description ) ;
}
EXPORT_SYMBOL_GPL ( security_kernel_post_load_data ) ;
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
int security_task_fix_setuid ( struct cred * new , const struct cred * old ,
int flags )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_fix_setuid , 0 , new , old , flags ) ;
2007-10-17 10:31:32 +04:00
}
2020-06-09 20:22:13 +03:00
int security_task_fix_setgid ( struct cred * new , const struct cred * old ,
int flags )
{
return call_int_hook ( task_fix_setgid , 0 , new , old , flags ) ;
}
2022-06-08 23:57:11 +03:00
int security_task_fix_setgroups ( struct cred * new , const struct cred * old )
{
return call_int_hook ( task_fix_setgroups , 0 , new , old ) ;
}
2007-10-17 10:31:32 +04:00
int security_task_setpgid ( struct task_struct * p , pid_t pgid )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_setpgid , 0 , p , pgid ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_getpgid ( struct task_struct * p )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_getpgid , 0 , p ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_getsid ( struct task_struct * p )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_getsid , 0 , p ) ;
2007-10-17 10:31:32 +04:00
}
2021-09-29 18:01:21 +03:00
void security_current_getsecid_subj ( u32 * secid )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:42 +03:00
* secid = 0 ;
2021-09-29 18:01:21 +03:00
call_void_hook ( current_getsecid_subj , secid ) ;
2007-10-17 10:31:32 +04:00
}
2021-09-29 18:01:21 +03:00
EXPORT_SYMBOL ( security_current_getsecid_subj ) ;
2021-02-19 22:26:21 +03:00
void security_task_getsecid_obj ( struct task_struct * p , u32 * secid )
{
* secid = 0 ;
call_void_hook ( task_getsecid_obj , p , secid ) ;
}
EXPORT_SYMBOL ( security_task_getsecid_obj ) ;
2007-10-17 10:31:32 +04:00
int security_task_setnice ( struct task_struct * p , int nice )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_setnice , 0 , p , nice ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_setioprio ( struct task_struct * p , int ioprio )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_setioprio , 0 , p , ioprio ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_getioprio ( struct task_struct * p )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_getioprio , 0 , p ) ;
2007-10-17 10:31:32 +04:00
}
prlimit,security,selinux: add a security hook for prlimit
When SELinux was first added to the kernel, a process could only get
and set its own resource limits via getrlimit(2) and setrlimit(2), so no
MAC checks were required for those operations, and thus no security hooks
were defined for them. Later, SELinux introduced a hook for setlimit(2)
with a check if the hard limit was being changed in order to be able to
rely on the hard limit value as a safe reset point upon context
transitions.
Later on, when prlimit(2) was added to the kernel with the ability to get
or set resource limits (hard or soft) of another process, LSM/SELinux was
not updated other than to pass the target process to the setrlimit hook.
This resulted in incomplete control over both getting and setting the
resource limits of another process.
Add a new security_task_prlimit() hook to the check_prlimit_permission()
function to provide complete mediation. The hook is only called when
acting on another task, and only if the existing DAC/capability checks
would allow access. Pass flags down to the hook to indicate whether the
prlimit(2) call will read, write, or both read and write the resource
limits of the target process.
The existing security_task_setrlimit() hook is left alone; it continues
to serve a purpose in supporting the ability to make decisions based on
the old and/or new resource limit values when setting limits. This
is consistent with the DAC/capability logic, where
check_prlimit_permission() performs generic DAC/capability checks for
acting on another task, while do_prlimit() performs a capability check
based on a comparison of the old and new resource limits. Fix the
inline documentation for the hook to match the code.
Implement the new hook for SELinux. For setting resource limits, we
reuse the existing setrlimit permission. Note that this does overload
the setrlimit permission to mean the ability to set the resource limit
(soft or hard) of another process or the ability to change one's own
hard limit. For getting resource limits, a new getrlimit permission
is defined. This was not originally defined since getrlimit(2) could
only be used to obtain a process' own limits.
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Signed-off-by: James Morris <james.l.morris@oracle.com>
2017-02-17 15:57:00 +03:00
int security_task_prlimit ( const struct cred * cred , const struct cred * tcred ,
unsigned int flags )
{
return call_int_hook ( task_prlimit , 0 , cred , tcred , flags ) ;
}
2009-08-26 20:41:16 +04:00
int security_task_setrlimit ( struct task_struct * p , unsigned int resource ,
struct rlimit * new_rlim )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_setrlimit , 0 , p , resource , new_rlim ) ;
2007-10-17 10:31:32 +04:00
}
2010-10-14 23:21:18 +04:00
int security_task_setscheduler ( struct task_struct * p )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_setscheduler , 0 , p ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_getscheduler ( struct task_struct * p )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_getscheduler , 0 , p ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_movememory ( struct task_struct * p )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( task_movememory , 0 , p ) ;
2007-10-17 10:31:32 +04:00
}
2018-09-25 12:27:20 +03:00
int security_task_kill ( struct task_struct * p , struct kernel_siginfo * info ,
usb, signal, security: only pass the cred, not the secid, to kill_pid_info_as_cred and security_task_kill
commit d178bc3a708f39cbfefc3fab37032d3f2511b4ec ("user namespace: usb:
make usb urbs user namespace aware (v2)") changed kill_pid_info_as_uid
to kill_pid_info_as_cred, saving and passing a cred structure instead of
uids. Since the secid can be obtained from the cred, drop the secid fields
from the usb_dev_state and async structures, and drop the secid argument to
kill_pid_info_as_cred. Replace the secid argument to security_task_kill
with the cred. Update SELinux, Smack, and AppArmor to use the cred, which
avoids the need for Smack and AppArmor to use a secid at all in this hook.
Further changes to Smack might still be required to take full advantage of
this change, since it should now be possible to perform capability
checking based on the supplied cred. The changes to Smack and AppArmor
have only been compile-tested.
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Acked-by: Paul Moore <paul@paul-moore.com>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Acked-by: John Johansen <john.johansen@canonical.com>
Signed-off-by: James Morris <james.morris@microsoft.com>
2017-09-08 19:40:01 +03:00
int sig , const struct cred * cred )
2007-10-17 10:31:32 +04:00
{
usb, signal, security: only pass the cred, not the secid, to kill_pid_info_as_cred and security_task_kill
commit d178bc3a708f39cbfefc3fab37032d3f2511b4ec ("user namespace: usb:
make usb urbs user namespace aware (v2)") changed kill_pid_info_as_uid
to kill_pid_info_as_cred, saving and passing a cred structure instead of
uids. Since the secid can be obtained from the cred, drop the secid fields
from the usb_dev_state and async structures, and drop the secid argument to
kill_pid_info_as_cred. Replace the secid argument to security_task_kill
with the cred. Update SELinux, Smack, and AppArmor to use the cred, which
avoids the need for Smack and AppArmor to use a secid at all in this hook.
Further changes to Smack might still be required to take full advantage of
this change, since it should now be possible to perform capability
checking based on the supplied cred. The changes to Smack and AppArmor
have only been compile-tested.
Signed-off-by: Stephen Smalley <sds@tycho.nsa.gov>
Acked-by: Paul Moore <paul@paul-moore.com>
Acked-by: Casey Schaufler <casey@schaufler-ca.com>
Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Acked-by: John Johansen <john.johansen@canonical.com>
Signed-off-by: James Morris <james.morris@microsoft.com>
2017-09-08 19:40:01 +03:00
return call_int_hook ( task_kill , 0 , p , info , sig , cred ) ;
2007-10-17 10:31:32 +04:00
}
int security_task_prctl ( int option , unsigned long arg2 , unsigned long arg3 ,
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
unsigned long arg4 , unsigned long arg5 )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:42 +03:00
int thisrc ;
2020-03-29 03:43:50 +03:00
int rc = LSM_RET_DEFAULT ( task_prctl ) ;
2015-05-03 01:11:42 +03:00
struct security_hook_list * hp ;
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( hp , & security_hook_heads . task_prctl , list ) {
2015-05-03 01:11:42 +03:00
thisrc = hp - > hook . task_prctl ( option , arg2 , arg3 , arg4 , arg5 ) ;
2020-03-29 03:43:50 +03:00
if ( thisrc ! = LSM_RET_DEFAULT ( task_prctl ) ) {
2015-05-03 01:11:42 +03:00
rc = thisrc ;
if ( thisrc ! = 0 )
break ;
}
}
return rc ;
2007-10-17 10:31:32 +04:00
}
void security_task_to_inode ( struct task_struct * p , struct inode * inode )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( task_to_inode , p , inode ) ;
2007-10-17 10:31:32 +04:00
}
security, lsm: Introduce security_create_user_ns()
User namespaces are an effective tool to allow programs to run with
permission without requiring the need for a program to run as root. User
namespaces may also be used as a sandboxing technique. However, attackers
sometimes leverage user namespaces as an initial attack vector to perform
some exploit. [1,2,3]
While it is not the unprivileged user namespace functionality, which
causes the kernel to be exploitable, users/administrators might want to
more granularly limit or at least monitor how various processes use this
functionality, while vulnerable kernel subsystems are being patched.
Preventing user namespace already creation comes in a few of forms in
order of granularity:
1. /proc/sys/user/max_user_namespaces sysctl
2. Distro specific patch(es)
3. CONFIG_USER_NS
To block a task based on its attributes, the LSM hook cred_prepare is a
decent candidate for use because it provides more granular control, and
it is called before create_user_ns():
cred = prepare_creds()
security_prepare_creds()
call_int_hook(cred_prepare, ...
if (cred)
create_user_ns(cred)
Since security_prepare_creds() is meant for LSMs to copy and prepare
credentials, access control is an unintended use of the hook. [4]
Further, security_prepare_creds() will always return a ENOMEM if the
hook returns any non-zero error code.
This hook also does not handle the clone3 case which requires us to
access a user space pointer to know if we're in the CLONE_NEW_USER
call path which may be subject to a TOCTTOU attack.
Lastly, cred_prepare is called in many call paths, and a targeted hook
further limits the frequency of calls which is a beneficial outcome.
Therefore introduce a new function security_create_user_ns() with an
accompanying userns_create LSM hook.
With the new userns_create hook, users will have more control over the
observability and access control over user namespace creation. Users
should expect that normal operation of user namespaces will behave as
usual, and only be impacted when controls are implemented by users or
administrators.
This hook takes the prepared creds for LSM authors to write policy
against. On success, the new namespace is applied to credentials,
otherwise an error is returned.
Links:
1. https://nvd.nist.gov/vuln/detail/CVE-2022-0492
2. https://nvd.nist.gov/vuln/detail/CVE-2022-25636
3. https://nvd.nist.gov/vuln/detail/CVE-2022-34918
4. https://lore.kernel.org/all/1c4b1c0d-12f6-6e9e-a6a3-cdce7418110c@schaufler-ca.com/
Reviewed-by: Christian Brauner (Microsoft) <brauner@kernel.org>
Reviewed-by: KP Singh <kpsingh@kernel.org>
Signed-off-by: Frederick Lawler <fred@cloudflare.com>
Signed-off-by: Paul Moore <paul@paul-moore.com>
2022-08-15 19:20:25 +03:00
int security_create_user_ns ( const struct cred * cred )
{
return call_int_hook ( userns_create , 0 , cred ) ;
}
2007-10-17 10:31:32 +04:00
int security_ipc_permission ( struct kern_ipc_perm * ipcp , short flag )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( ipc_permission , 0 , ipcp , flag ) ;
2007-10-17 10:31:32 +04:00
}
2008-03-01 22:51:09 +03:00
void security_ipc_getsecid ( struct kern_ipc_perm * ipcp , u32 * secid )
{
2015-05-03 01:11:42 +03:00
* secid = 0 ;
2015-05-03 01:11:29 +03:00
call_void_hook ( ipc_getsecid , ipcp , secid ) ;
2008-03-01 22:51:09 +03:00
}
2007-10-17 10:31:32 +04:00
int security_msg_msg_alloc ( struct msg_msg * msg )
{
2018-11-20 22:55:02 +03:00
int rc = lsm_msg_msg_alloc ( msg ) ;
if ( unlikely ( rc ) )
return rc ;
rc = call_int_hook ( msg_msg_alloc_security , 0 , msg ) ;
if ( unlikely ( rc ) )
security_msg_msg_free ( msg ) ;
return rc ;
2007-10-17 10:31:32 +04:00
}
void security_msg_msg_free ( struct msg_msg * msg )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( msg_msg_free_security , msg ) ;
2018-11-20 22:55:02 +03:00
kfree ( msg - > security ) ;
msg - > security = NULL ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:22:26 +03:00
int security_msg_queue_alloc ( struct kern_ipc_perm * msq )
2007-10-17 10:31:32 +04:00
{
2018-11-20 22:55:02 +03:00
int rc = lsm_ipc_alloc ( msq ) ;
if ( unlikely ( rc ) )
return rc ;
rc = call_int_hook ( msg_queue_alloc_security , 0 , msq ) ;
if ( unlikely ( rc ) )
security_msg_queue_free ( msq ) ;
return rc ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:22:26 +03:00
void security_msg_queue_free ( struct kern_ipc_perm * msq )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( msg_queue_free_security , msq ) ;
2018-11-20 22:55:02 +03:00
kfree ( msq - > security ) ;
msq - > security = NULL ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:22:26 +03:00
int security_msg_queue_associate ( struct kern_ipc_perm * msq , int msqflg )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( msg_queue_associate , 0 , msq , msqflg ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:22:26 +03:00
int security_msg_queue_msgctl ( struct kern_ipc_perm * msq , int cmd )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( msg_queue_msgctl , 0 , msq , cmd ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:22:26 +03:00
int security_msg_queue_msgsnd ( struct kern_ipc_perm * msq ,
2007-10-17 10:31:32 +04:00
struct msg_msg * msg , int msqflg )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( msg_queue_msgsnd , 0 , msq , msg , msqflg ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:22:26 +03:00
int security_msg_queue_msgrcv ( struct kern_ipc_perm * msq , struct msg_msg * msg ,
2007-10-17 10:31:32 +04:00
struct task_struct * target , long type , int mode )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( msg_queue_msgrcv , 0 , msq , msg , target , type , mode ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:08:27 +03:00
int security_shm_alloc ( struct kern_ipc_perm * shp )
2007-10-17 10:31:32 +04:00
{
2018-11-20 22:55:02 +03:00
int rc = lsm_ipc_alloc ( shp ) ;
if ( unlikely ( rc ) )
return rc ;
rc = call_int_hook ( shm_alloc_security , 0 , shp ) ;
if ( unlikely ( rc ) )
security_shm_free ( shp ) ;
return rc ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:08:27 +03:00
void security_shm_free ( struct kern_ipc_perm * shp )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( shm_free_security , shp ) ;
2018-11-20 22:55:02 +03:00
kfree ( shp - > security ) ;
shp - > security = NULL ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:08:27 +03:00
int security_shm_associate ( struct kern_ipc_perm * shp , int shmflg )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( shm_associate , 0 , shp , shmflg ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:08:27 +03:00
int security_shm_shmctl ( struct kern_ipc_perm * shp , int cmd )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( shm_shmctl , 0 , shp , cmd ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 05:08:27 +03:00
int security_shm_shmat ( struct kern_ipc_perm * shp , char __user * shmaddr , int shmflg )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( shm_shmat , 0 , shp , shmaddr , shmflg ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 04:52:43 +03:00
int security_sem_alloc ( struct kern_ipc_perm * sma )
2007-10-17 10:31:32 +04:00
{
2018-11-20 22:55:02 +03:00
int rc = lsm_ipc_alloc ( sma ) ;
if ( unlikely ( rc ) )
return rc ;
rc = call_int_hook ( sem_alloc_security , 0 , sma ) ;
if ( unlikely ( rc ) )
security_sem_free ( sma ) ;
return rc ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 04:52:43 +03:00
void security_sem_free ( struct kern_ipc_perm * sma )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( sem_free_security , sma ) ;
2018-11-20 22:55:02 +03:00
kfree ( sma - > security ) ;
sma - > security = NULL ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 04:52:43 +03:00
int security_sem_associate ( struct kern_ipc_perm * sma , int semflg )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sem_associate , 0 , sma , semflg ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 04:52:43 +03:00
int security_sem_semctl ( struct kern_ipc_perm * sma , int cmd )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sem_semctl , 0 , sma , cmd ) ;
2007-10-17 10:31:32 +04:00
}
2018-03-23 04:52:43 +03:00
int security_sem_semop ( struct kern_ipc_perm * sma , struct sembuf * sops ,
2007-10-17 10:31:32 +04:00
unsigned nsops , int alter )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sem_semop , 0 , sma , sops , nsops , alter ) ;
2007-10-17 10:31:32 +04:00
}
void security_d_instantiate ( struct dentry * dentry , struct inode * inode )
{
if ( unlikely ( inode & & IS_PRIVATE ( inode ) ) )
return ;
2015-05-03 01:11:29 +03:00
call_void_hook ( d_instantiate , dentry , inode ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_d_instantiate ) ;
2022-01-31 03:57:52 +03:00
int security_getprocattr ( struct task_struct * p , const char * lsm ,
const char * name , char * * value )
2007-10-17 10:31:32 +04:00
{
2018-09-22 03:16:59 +03:00
struct security_hook_list * hp ;
hlist_for_each_entry ( hp , & security_hook_heads . getprocattr , list ) {
if ( lsm ! = NULL & & strcmp ( lsm , hp - > lsm ) )
continue ;
return hp - > hook . getprocattr ( p , name , value ) ;
}
2020-03-29 03:43:50 +03:00
return LSM_RET_DEFAULT ( getprocattr ) ;
2007-10-17 10:31:32 +04:00
}
2018-09-22 03:16:59 +03:00
int security_setprocattr ( const char * lsm , const char * name , void * value ,
size_t size )
2007-10-17 10:31:32 +04:00
{
2018-09-22 03:16:59 +03:00
struct security_hook_list * hp ;
hlist_for_each_entry ( hp , & security_hook_heads . setprocattr , list ) {
if ( lsm ! = NULL & & strcmp ( lsm , hp - > lsm ) )
continue ;
return hp - > hook . setprocattr ( name , value , size ) ;
}
2020-03-29 03:43:50 +03:00
return LSM_RET_DEFAULT ( setprocattr ) ;
2007-10-17 10:31:32 +04:00
}
int security_netlink_send ( struct sock * sk , struct sk_buff * skb )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( netlink_send , 0 , sk , skb ) ;
2007-10-17 10:31:32 +04:00
}
2013-05-22 20:50:35 +04:00
int security_ismaclabel ( const char * name )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( ismaclabel , 0 , name ) ;
2013-05-22 20:50:35 +04:00
}
EXPORT_SYMBOL ( security_ismaclabel ) ;
2007-10-17 10:31:32 +04:00
int security_secid_to_secctx ( u32 secid , char * * secdata , u32 * seclen )
{
2020-05-20 15:56:16 +03:00
struct security_hook_list * hp ;
int rc ;
/*
* Currently , only one LSM can implement secid_to_secctx ( i . e this
* LSM hook is not " stackable " ) .
*/
hlist_for_each_entry ( hp , & security_hook_heads . secid_to_secctx , list ) {
rc = hp - > hook . secid_to_secctx ( secid , secdata , seclen ) ;
if ( rc ! = LSM_RET_DEFAULT ( secid_to_secctx ) )
return rc ;
}
return LSM_RET_DEFAULT ( secid_to_secctx ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_secid_to_secctx ) ;
2008-04-29 23:52:51 +04:00
int security_secctx_to_secid ( const char * secdata , u32 seclen , u32 * secid )
2008-01-16 02:47:35 +03:00
{
2015-05-03 01:11:42 +03:00
* secid = 0 ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( secctx_to_secid , 0 , secdata , seclen , secid ) ;
2008-01-16 02:47:35 +03:00
}
EXPORT_SYMBOL ( security_secctx_to_secid ) ;
2007-10-17 10:31:32 +04:00
void security_release_secctx ( char * secdata , u32 seclen )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( release_secctx , secdata , seclen ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_release_secctx ) ;
2015-12-24 19:09:40 +03:00
void security_inode_invalidate_secctx ( struct inode * inode )
{
call_void_hook ( inode_invalidate_secctx , inode ) ;
}
EXPORT_SYMBOL ( security_inode_invalidate_secctx ) ;
2009-09-03 22:25:57 +04:00
int security_inode_notifysecctx ( struct inode * inode , void * ctx , u32 ctxlen )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_notifysecctx , 0 , inode , ctx , ctxlen ) ;
2009-09-03 22:25:57 +04:00
}
EXPORT_SYMBOL ( security_inode_notifysecctx ) ;
int security_inode_setsecctx ( struct dentry * dentry , void * ctx , u32 ctxlen )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( inode_setsecctx , 0 , dentry , ctx , ctxlen ) ;
2009-09-03 22:25:57 +04:00
}
EXPORT_SYMBOL ( security_inode_setsecctx ) ;
int security_inode_getsecctx ( struct inode * inode , void * * ctx , u32 * ctxlen )
{
2015-05-03 01:11:42 +03:00
return call_int_hook ( inode_getsecctx , - EOPNOTSUPP , inode , ctx , ctxlen ) ;
2009-09-03 22:25:57 +04:00
}
EXPORT_SYMBOL ( security_inode_getsecctx ) ;
2020-02-12 16:58:35 +03:00
# ifdef CONFIG_WATCH_QUEUE
int security_post_notification ( const struct cred * w_cred ,
const struct cred * cred ,
struct watch_notification * n )
{
return call_int_hook ( post_notification , 0 , w_cred , cred , n ) ;
}
# endif /* CONFIG_WATCH_QUEUE */
2020-02-12 16:58:35 +03:00
# ifdef CONFIG_KEY_NOTIFICATIONS
int security_watch_key ( struct key * key )
{
return call_int_hook ( watch_key , 0 , key ) ;
}
# endif
2007-10-17 10:31:32 +04:00
# ifdef CONFIG_SECURITY_NETWORK
2011-01-06 02:38:53 +03:00
int security_unix_stream_connect ( struct sock * sock , struct sock * other , struct sock * newsk )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( unix_stream_connect , 0 , sock , other , newsk ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_unix_stream_connect ) ;
int security_unix_may_send ( struct socket * sock , struct socket * other )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( unix_may_send , 0 , sock , other ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_unix_may_send ) ;
int security_socket_create ( int family , int type , int protocol , int kern )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_create , 0 , family , type , protocol , kern ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_post_create ( struct socket * sock , int family ,
int type , int protocol , int kern )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_post_create , 0 , sock , family , type ,
2007-10-17 10:31:32 +04:00
protocol , kern ) ;
}
2018-05-04 17:28:19 +03:00
int security_socket_socketpair ( struct socket * socka , struct socket * sockb )
{
return call_int_hook ( socket_socketpair , 0 , socka , sockb ) ;
}
EXPORT_SYMBOL ( security_socket_socketpair ) ;
2007-10-17 10:31:32 +04:00
int security_socket_bind ( struct socket * sock , struct sockaddr * address , int addrlen )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_bind , 0 , sock , address , addrlen ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_connect ( struct socket * sock , struct sockaddr * address , int addrlen )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_connect , 0 , sock , address , addrlen ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_listen ( struct socket * sock , int backlog )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_listen , 0 , sock , backlog ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_accept ( struct socket * sock , struct socket * newsock )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_accept , 0 , sock , newsock ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_sendmsg ( struct socket * sock , struct msghdr * msg , int size )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_sendmsg , 0 , sock , msg , size ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_recvmsg ( struct socket * sock , struct msghdr * msg ,
int size , int flags )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_recvmsg , 0 , sock , msg , size , flags ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_getsockname ( struct socket * sock )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_getsockname , 0 , sock ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_getpeername ( struct socket * sock )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_getpeername , 0 , sock ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_getsockopt ( struct socket * sock , int level , int optname )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_getsockopt , 0 , sock , level , optname ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_setsockopt ( struct socket * sock , int level , int optname )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_setsockopt , 0 , sock , level , optname ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_shutdown ( struct socket * sock , int how )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_shutdown , 0 , sock , how ) ;
2007-10-17 10:31:32 +04:00
}
int security_sock_rcv_skb ( struct sock * sk , struct sk_buff * skb )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( socket_sock_rcv_skb , 0 , sk , skb ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_sock_rcv_skb ) ;
2022-10-10 19:31:21 +03:00
int security_socket_getpeersec_stream ( struct socket * sock , sockptr_t optval ,
sockptr_t optlen , unsigned int len )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:42 +03:00
return call_int_hook ( socket_getpeersec_stream , - ENOPROTOOPT , sock ,
2022-10-10 19:31:21 +03:00
optval , optlen , len ) ;
2007-10-17 10:31:32 +04:00
}
int security_socket_getpeersec_dgram ( struct socket * sock , struct sk_buff * skb , u32 * secid )
{
2015-08-24 15:22:25 +03:00
return call_int_hook ( socket_getpeersec_dgram , - ENOPROTOOPT , sock ,
skb , secid ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_socket_getpeersec_dgram ) ;
int security_sk_alloc ( struct sock * sk , int family , gfp_t priority )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( sk_alloc_security , 0 , sk , family , priority ) ;
2007-10-17 10:31:32 +04:00
}
void security_sk_free ( struct sock * sk )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( sk_free_security , sk ) ;
2007-10-17 10:31:32 +04:00
}
void security_sk_clone ( const struct sock * sk , struct sock * newsk )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( sk_clone_security , sk , newsk ) ;
2007-10-17 10:31:32 +04:00
}
2011-10-07 13:40:59 +04:00
EXPORT_SYMBOL ( security_sk_clone ) ;
2007-10-17 10:31:32 +04:00
2020-09-28 05:38:26 +03:00
void security_sk_classify_flow ( struct sock * sk , struct flowi_common * flic )
2007-10-17 10:31:32 +04:00
{
2020-09-28 05:38:26 +03:00
call_void_hook ( sk_getsecid , sk , & flic - > flowic_secid ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_sk_classify_flow ) ;
2020-09-28 05:38:26 +03:00
void security_req_classify_flow ( const struct request_sock * req ,
struct flowi_common * flic )
2007-10-17 10:31:32 +04:00
{
2020-09-28 05:38:26 +03:00
call_void_hook ( req_classify_flow , req , flic ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_req_classify_flow ) ;
void security_sock_graft ( struct sock * sk , struct socket * parent )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( sock_graft , sk , parent ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_sock_graft ) ;
2020-11-30 18:36:29 +03:00
int security_inet_conn_request ( const struct sock * sk ,
2007-10-17 10:31:32 +04:00
struct sk_buff * skb , struct request_sock * req )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( inet_conn_request , 0 , sk , skb , req ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_inet_conn_request ) ;
void security_inet_csk_clone ( struct sock * newsk ,
const struct request_sock * req )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( inet_csk_clone , newsk , req ) ;
2007-10-17 10:31:32 +04:00
}
void security_inet_conn_established ( struct sock * sk ,
struct sk_buff * skb )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( inet_conn_established , sk , skb ) ;
2007-10-17 10:31:32 +04:00
}
2018-02-13 23:53:21 +03:00
EXPORT_SYMBOL ( security_inet_conn_established ) ;
2007-10-17 10:31:32 +04:00
2010-10-14 00:24:41 +04:00
int security_secmark_relabel_packet ( u32 secid )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( secmark_relabel_packet , 0 , secid ) ;
2010-10-14 00:24:41 +04:00
}
EXPORT_SYMBOL ( security_secmark_relabel_packet ) ;
void security_secmark_refcount_inc ( void )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( secmark_refcount_inc ) ;
2010-10-14 00:24:41 +04:00
}
EXPORT_SYMBOL ( security_secmark_refcount_inc ) ;
void security_secmark_refcount_dec ( void )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( secmark_refcount_dec ) ;
2010-10-14 00:24:41 +04:00
}
EXPORT_SYMBOL ( security_secmark_refcount_dec ) ;
2013-01-14 11:12:19 +04:00
int security_tun_dev_alloc_security ( void * * security )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( tun_dev_alloc_security , 0 , security ) ;
2013-01-14 11:12:19 +04:00
}
EXPORT_SYMBOL ( security_tun_dev_alloc_security ) ;
void security_tun_dev_free_security ( void * security )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( tun_dev_free_security , security ) ;
2013-01-14 11:12:19 +04:00
}
EXPORT_SYMBOL ( security_tun_dev_free_security ) ;
2009-08-29 02:12:43 +04:00
int security_tun_dev_create ( void )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( tun_dev_create , 0 ) ;
2009-08-29 02:12:43 +04:00
}
EXPORT_SYMBOL ( security_tun_dev_create ) ;
2013-01-14 11:12:19 +04:00
int security_tun_dev_attach_queue ( void * security )
2009-08-29 02:12:43 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( tun_dev_attach_queue , 0 , security ) ;
2009-08-29 02:12:43 +04:00
}
2013-01-14 11:12:19 +04:00
EXPORT_SYMBOL ( security_tun_dev_attach_queue ) ;
2009-08-29 02:12:43 +04:00
2013-01-14 11:12:19 +04:00
int security_tun_dev_attach ( struct sock * sk , void * security )
2009-08-29 02:12:43 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( tun_dev_attach , 0 , sk , security ) ;
2009-08-29 02:12:43 +04:00
}
EXPORT_SYMBOL ( security_tun_dev_attach ) ;
2013-01-14 11:12:19 +04:00
int security_tun_dev_open ( void * security )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( tun_dev_open , 0 , security ) ;
2013-01-14 11:12:19 +04:00
}
EXPORT_SYMBOL ( security_tun_dev_open ) ;
2021-11-02 15:02:47 +03:00
int security_sctp_assoc_request ( struct sctp_association * asoc , struct sk_buff * skb )
2018-02-13 23:53:21 +03:00
{
2021-11-02 15:02:47 +03:00
return call_int_hook ( sctp_assoc_request , 0 , asoc , skb ) ;
2018-02-13 23:53:21 +03:00
}
EXPORT_SYMBOL ( security_sctp_assoc_request ) ;
int security_sctp_bind_connect ( struct sock * sk , int optname ,
struct sockaddr * address , int addrlen )
{
return call_int_hook ( sctp_bind_connect , 0 , sk , optname ,
address , addrlen ) ;
}
EXPORT_SYMBOL ( security_sctp_bind_connect ) ;
2021-11-02 15:02:47 +03:00
void security_sctp_sk_clone ( struct sctp_association * asoc , struct sock * sk ,
2018-02-13 23:53:21 +03:00
struct sock * newsk )
{
2021-11-02 15:02:47 +03:00
call_void_hook ( sctp_sk_clone , asoc , sk , newsk ) ;
2018-02-13 23:53:21 +03:00
}
EXPORT_SYMBOL ( security_sctp_sk_clone ) ;
2022-02-12 20:59:21 +03:00
int security_sctp_assoc_established ( struct sctp_association * asoc ,
struct sk_buff * skb )
{
return call_int_hook ( sctp_assoc_established , 0 , asoc , skb ) ;
}
EXPORT_SYMBOL ( security_sctp_assoc_established ) ;
2007-10-17 10:31:32 +04:00
# endif /* CONFIG_SECURITY_NETWORK */
IB/core: Enforce PKey security on QPs
Add new LSM hooks to allocate and free security contexts and check for
permission to access a PKey.
Allocate and free a security context when creating and destroying a QP.
This context is used for controlling access to PKeys.
When a request is made to modify a QP that changes the port, PKey index,
or alternate path, check that the QP has permission for the PKey in the
PKey table index on the subnet prefix of the port. If the QP is shared
make sure all handles to the QP also have access.
Store which port and PKey index a QP is using. After the reset to init
transition the user can modify the port, PKey index and alternate path
independently. So port and PKey settings changes can be a merge of the
previous settings and the new ones.
In order to maintain access control if there are PKey table or subnet
prefix change keep a list of all QPs are using each PKey index on
each port. If a change occurs all QPs using that device and port must
have access enforced for the new cache settings.
These changes add a transaction to the QP modify process. Association
with the old port and PKey index must be maintained if the modify fails,
and must be removed if it succeeds. Association with the new port and
PKey index must be established prior to the modify and removed if the
modify fails.
1. When a QP is modified to a particular Port, PKey index or alternate
path insert that QP into the appropriate lists.
2. Check permission to access the new settings.
3. If step 2 grants access attempt to modify the QP.
4a. If steps 2 and 3 succeed remove any prior associations.
4b. If ether fails remove the new setting associations.
If a PKey table or subnet prefix changes walk the list of QPs and
check that they have permission. If not send the QP to the error state
and raise a fatal error event. If it's a shared QP make sure all the
QPs that share the real_qp have permission as well. If the QP that
owns a security structure is denied access the security structure is
marked as such and the QP is added to an error_list. Once the moving
the QP to error is complete the security structure mark is cleared.
Maintaining the lists correctly turns QP destroy into a transaction.
The hardware driver for the device frees the ib_qp structure, so while
the destroy is in progress the ib_qp pointer in the ib_qp_security
struct is undefined. When the destroy process begins the ib_qp_security
structure is marked as destroying. This prevents any action from being
taken on the QP pointer. After the QP is destroyed successfully it
could still listed on an error_list wait for it to be processed by that
flow before cleaning up the structure.
If the destroy fails the QPs port and PKey settings are reinserted into
the appropriate lists, the destroying flag is cleared, and access control
is enforced, in case there were any cache changes during the destroy
flow.
To keep the security changes isolated a new file is used to hold security
related functionality.
Signed-off-by: Daniel Jurgens <danielj@mellanox.com>
Acked-by: Doug Ledford <dledford@redhat.com>
[PM: merge fixup in ib_verbs.h and uverbs_cmd.c]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2017-05-19 15:48:52 +03:00
# ifdef CONFIG_SECURITY_INFINIBAND
int security_ib_pkey_access ( void * sec , u64 subnet_prefix , u16 pkey )
{
return call_int_hook ( ib_pkey_access , 0 , sec , subnet_prefix , pkey ) ;
}
EXPORT_SYMBOL ( security_ib_pkey_access ) ;
2017-05-19 15:48:54 +03:00
int security_ib_endport_manage_subnet ( void * sec , const char * dev_name , u8 port_num )
{
return call_int_hook ( ib_endport_manage_subnet , 0 , sec , dev_name , port_num ) ;
}
EXPORT_SYMBOL ( security_ib_endport_manage_subnet ) ;
IB/core: Enforce PKey security on QPs
Add new LSM hooks to allocate and free security contexts and check for
permission to access a PKey.
Allocate and free a security context when creating and destroying a QP.
This context is used for controlling access to PKeys.
When a request is made to modify a QP that changes the port, PKey index,
or alternate path, check that the QP has permission for the PKey in the
PKey table index on the subnet prefix of the port. If the QP is shared
make sure all handles to the QP also have access.
Store which port and PKey index a QP is using. After the reset to init
transition the user can modify the port, PKey index and alternate path
independently. So port and PKey settings changes can be a merge of the
previous settings and the new ones.
In order to maintain access control if there are PKey table or subnet
prefix change keep a list of all QPs are using each PKey index on
each port. If a change occurs all QPs using that device and port must
have access enforced for the new cache settings.
These changes add a transaction to the QP modify process. Association
with the old port and PKey index must be maintained if the modify fails,
and must be removed if it succeeds. Association with the new port and
PKey index must be established prior to the modify and removed if the
modify fails.
1. When a QP is modified to a particular Port, PKey index or alternate
path insert that QP into the appropriate lists.
2. Check permission to access the new settings.
3. If step 2 grants access attempt to modify the QP.
4a. If steps 2 and 3 succeed remove any prior associations.
4b. If ether fails remove the new setting associations.
If a PKey table or subnet prefix changes walk the list of QPs and
check that they have permission. If not send the QP to the error state
and raise a fatal error event. If it's a shared QP make sure all the
QPs that share the real_qp have permission as well. If the QP that
owns a security structure is denied access the security structure is
marked as such and the QP is added to an error_list. Once the moving
the QP to error is complete the security structure mark is cleared.
Maintaining the lists correctly turns QP destroy into a transaction.
The hardware driver for the device frees the ib_qp structure, so while
the destroy is in progress the ib_qp pointer in the ib_qp_security
struct is undefined. When the destroy process begins the ib_qp_security
structure is marked as destroying. This prevents any action from being
taken on the QP pointer. After the QP is destroyed successfully it
could still listed on an error_list wait for it to be processed by that
flow before cleaning up the structure.
If the destroy fails the QPs port and PKey settings are reinserted into
the appropriate lists, the destroying flag is cleared, and access control
is enforced, in case there were any cache changes during the destroy
flow.
To keep the security changes isolated a new file is used to hold security
related functionality.
Signed-off-by: Daniel Jurgens <danielj@mellanox.com>
Acked-by: Doug Ledford <dledford@redhat.com>
[PM: merge fixup in ib_verbs.h and uverbs_cmd.c]
Signed-off-by: Paul Moore <paul@paul-moore.com>
2017-05-19 15:48:52 +03:00
int security_ib_alloc_security ( void * * sec )
{
return call_int_hook ( ib_alloc_security , 0 , sec ) ;
}
EXPORT_SYMBOL ( security_ib_alloc_security ) ;
void security_ib_free_security ( void * sec )
{
call_void_hook ( ib_free_security , sec ) ;
}
EXPORT_SYMBOL ( security_ib_free_security ) ;
# endif /* CONFIG_SECURITY_INFINIBAND */
2007-10-17 10:31:32 +04:00
# ifdef CONFIG_SECURITY_NETWORK_XFRM
2014-03-07 15:44:19 +04:00
int security_xfrm_policy_alloc ( struct xfrm_sec_ctx * * ctxp ,
struct xfrm_user_sec_ctx * sec_ctx ,
gfp_t gfp )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_policy_alloc_security , 0 , ctxp , sec_ctx , gfp ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_xfrm_policy_alloc ) ;
2008-04-13 06:07:52 +04:00
int security_xfrm_policy_clone ( struct xfrm_sec_ctx * old_ctx ,
struct xfrm_sec_ctx * * new_ctxp )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_policy_clone_security , 0 , old_ctx , new_ctxp ) ;
2007-10-17 10:31:32 +04:00
}
2008-04-13 06:07:52 +04:00
void security_xfrm_policy_free ( struct xfrm_sec_ctx * ctx )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
call_void_hook ( xfrm_policy_free_security , ctx ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_xfrm_policy_free ) ;
2008-04-13 06:07:52 +04:00
int security_xfrm_policy_delete ( struct xfrm_sec_ctx * ctx )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_policy_delete_security , 0 , ctx ) ;
2007-10-17 10:31:32 +04:00
}
2013-07-24 01:38:38 +04:00
int security_xfrm_state_alloc ( struct xfrm_state * x ,
struct xfrm_user_sec_ctx * sec_ctx )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_state_alloc , 0 , x , sec_ctx ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_xfrm_state_alloc ) ;
int security_xfrm_state_alloc_acquire ( struct xfrm_state * x ,
struct xfrm_sec_ctx * polsec , u32 secid )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_state_alloc_acquire , 0 , x , polsec , secid ) ;
2007-10-17 10:31:32 +04:00
}
int security_xfrm_state_delete ( struct xfrm_state * x )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_state_delete_security , 0 , x ) ;
2007-10-17 10:31:32 +04:00
}
EXPORT_SYMBOL ( security_xfrm_state_delete ) ;
void security_xfrm_state_free ( struct xfrm_state * x )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( xfrm_state_free_security , x ) ;
2007-10-17 10:31:32 +04:00
}
2021-04-09 08:48:41 +03:00
int security_xfrm_policy_lookup ( struct xfrm_sec_ctx * ctx , u32 fl_secid )
2007-10-17 10:31:32 +04:00
{
2021-04-09 08:48:41 +03:00
return call_int_hook ( xfrm_policy_lookup , 0 , ctx , fl_secid ) ;
2007-10-17 10:31:32 +04:00
}
int security_xfrm_state_pol_flow_match ( struct xfrm_state * x ,
2011-02-23 05:13:15 +03:00
struct xfrm_policy * xp ,
2020-09-28 05:38:26 +03:00
const struct flowi_common * flic )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:42 +03:00
struct security_hook_list * hp ;
2020-03-29 03:43:50 +03:00
int rc = LSM_RET_DEFAULT ( xfrm_state_pol_flow_match ) ;
2015-05-03 01:11:42 +03:00
/*
* Since this function is expected to return 0 or 1 , the judgment
* becomes difficult if multiple LSMs supply this call . Fortunately ,
* we can use the first LSM ' s judgment because currently only SELinux
* supplies this call .
*
* For speed optimization , we explicitly break the loop rather than
* using the macro
*/
2018-03-29 04:28:23 +03:00
hlist_for_each_entry ( hp , & security_hook_heads . xfrm_state_pol_flow_match ,
2015-05-03 01:11:42 +03:00
list ) {
2020-09-28 05:38:26 +03:00
rc = hp - > hook . xfrm_state_pol_flow_match ( x , xp , flic ) ;
2015-05-03 01:11:42 +03:00
break ;
}
return rc ;
2007-10-17 10:31:32 +04:00
}
int security_xfrm_decode_session ( struct sk_buff * skb , u32 * secid )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( xfrm_decode_session , 0 , skb , secid , 1 ) ;
2007-10-17 10:31:32 +04:00
}
2020-09-28 05:38:26 +03:00
void security_skb_classify_flow ( struct sk_buff * skb , struct flowi_common * flic )
2007-10-17 10:31:32 +04:00
{
2020-09-28 05:38:26 +03:00
int rc = call_int_hook ( xfrm_decode_session , 0 , skb , & flic - > flowic_secid ,
2015-05-03 01:11:29 +03:00
0 ) ;
2007-10-17 10:31:32 +04:00
BUG_ON ( rc ) ;
}
EXPORT_SYMBOL ( security_skb_classify_flow ) ;
# endif /* CONFIG_SECURITY_NETWORK_XFRM */
# ifdef CONFIG_KEYS
CRED: Inaugurate COW credentials
Inaugurate copy-on-write credentials management. This uses RCU to manage the
credentials pointer in the task_struct with respect to accesses by other tasks.
A process may only modify its own credentials, and so does not need locking to
access or modify its own credentials.
A mutex (cred_replace_mutex) is added to the task_struct to control the effect
of PTRACE_ATTACHED on credential calculations, particularly with respect to
execve().
With this patch, the contents of an active credentials struct may not be
changed directly; rather a new set of credentials must be prepared, modified
and committed using something like the following sequence of events:
struct cred *new = prepare_creds();
int ret = blah(new);
if (ret < 0) {
abort_creds(new);
return ret;
}
return commit_creds(new);
There are some exceptions to this rule: the keyrings pointed to by the active
credentials may be instantiated - keyrings violate the COW rule as managing
COW keyrings is tricky, given that it is possible for a task to directly alter
the keys in a keyring in use by another task.
To help enforce this, various pointers to sets of credentials, such as those in
the task_struct, are declared const. The purpose of this is compile-time
discouragement of altering credentials through those pointers. Once a set of
credentials has been made public through one of these pointers, it may not be
modified, except under special circumstances:
(1) Its reference count may incremented and decremented.
(2) The keyrings to which it points may be modified, but not replaced.
The only safe way to modify anything else is to create a replacement and commit
using the functions described in Documentation/credentials.txt (which will be
added by a later patch).
This patch and the preceding patches have been tested with the LTP SELinux
testsuite.
This patch makes several logical sets of alteration:
(1) execve().
This now prepares and commits credentials in various places in the
security code rather than altering the current creds directly.
(2) Temporary credential overrides.
do_coredump() and sys_faccessat() now prepare their own credentials and
temporarily override the ones currently on the acting thread, whilst
preventing interference from other threads by holding cred_replace_mutex
on the thread being dumped.
This will be replaced in a future patch by something that hands down the
credentials directly to the functions being called, rather than altering
the task's objective credentials.
(3) LSM interface.
A number of functions have been changed, added or removed:
(*) security_capset_check(), ->capset_check()
(*) security_capset_set(), ->capset_set()
Removed in favour of security_capset().
(*) security_capset(), ->capset()
New. This is passed a pointer to the new creds, a pointer to the old
creds and the proposed capability sets. It should fill in the new
creds or return an error. All pointers, barring the pointer to the
new creds, are now const.
(*) security_bprm_apply_creds(), ->bprm_apply_creds()
Changed; now returns a value, which will cause the process to be
killed if it's an error.
(*) security_task_alloc(), ->task_alloc_security()
Removed in favour of security_prepare_creds().
(*) security_cred_free(), ->cred_free()
New. Free security data attached to cred->security.
(*) security_prepare_creds(), ->cred_prepare()
New. Duplicate any security data attached to cred->security.
(*) security_commit_creds(), ->cred_commit()
New. Apply any security effects for the upcoming installation of new
security by commit_creds().
(*) security_task_post_setuid(), ->task_post_setuid()
Removed in favour of security_task_fix_setuid().
(*) security_task_fix_setuid(), ->task_fix_setuid()
Fix up the proposed new credentials for setuid(). This is used by
cap_set_fix_setuid() to implicitly adjust capabilities in line with
setuid() changes. Changes are made to the new credentials, rather
than the task itself as in security_task_post_setuid().
(*) security_task_reparent_to_init(), ->task_reparent_to_init()
Removed. Instead the task being reparented to init is referred
directly to init's credentials.
NOTE! This results in the loss of some state: SELinux's osid no
longer records the sid of the thread that forked it.
(*) security_key_alloc(), ->key_alloc()
(*) security_key_permission(), ->key_permission()
Changed. These now take cred pointers rather than task pointers to
refer to the security context.
(4) sys_capset().
This has been simplified and uses less locking. The LSM functions it
calls have been merged.
(5) reparent_to_kthreadd().
This gives the current thread the same credentials as init by simply using
commit_thread() to point that way.
(6) __sigqueue_alloc() and switch_uid()
__sigqueue_alloc() can't stop the target task from changing its creds
beneath it, so this function gets a reference to the currently applicable
user_struct which it then passes into the sigqueue struct it returns if
successful.
switch_uid() is now called from commit_creds(), and possibly should be
folded into that. commit_creds() should take care of protecting
__sigqueue_alloc().
(7) [sg]et[ug]id() and co and [sg]et_current_groups.
The set functions now all use prepare_creds(), commit_creds() and
abort_creds() to build and check a new set of credentials before applying
it.
security_task_set[ug]id() is called inside the prepared section. This
guarantees that nothing else will affect the creds until we've finished.
The calling of set_dumpable() has been moved into commit_creds().
Much of the functionality of set_user() has been moved into
commit_creds().
The get functions all simply access the data directly.
(8) security_task_prctl() and cap_task_prctl().
security_task_prctl() has been modified to return -ENOSYS if it doesn't
want to handle a function, or otherwise return the return value directly
rather than through an argument.
Additionally, cap_task_prctl() now prepares a new set of credentials, even
if it doesn't end up using it.
(9) Keyrings.
A number of changes have been made to the keyrings code:
(a) switch_uid_keyring(), copy_keys(), exit_keys() and suid_keys() have
all been dropped and built in to the credentials functions directly.
They may want separating out again later.
(b) key_alloc() and search_process_keyrings() now take a cred pointer
rather than a task pointer to specify the security context.
(c) copy_creds() gives a new thread within the same thread group a new
thread keyring if its parent had one, otherwise it discards the thread
keyring.
(d) The authorisation key now points directly to the credentials to extend
the search into rather pointing to the task that carries them.
(e) Installing thread, process or session keyrings causes a new set of
credentials to be created, even though it's not strictly necessary for
process or session keyrings (they're shared).
(10) Usermode helper.
The usermode helper code now carries a cred struct pointer in its
subprocess_info struct instead of a new session keyring pointer. This set
of credentials is derived from init_cred and installed on the new process
after it has been cloned.
call_usermodehelper_setup() allocates the new credentials and
call_usermodehelper_freeinfo() discards them if they haven't been used. A
special cred function (prepare_usermodeinfo_creds()) is provided
specifically for call_usermodehelper_setup() to call.
call_usermodehelper_setkeys() adjusts the credentials to sport the
supplied keyring as the new session keyring.
(11) SELinux.
SELinux has a number of changes, in addition to those to support the LSM
interface changes mentioned above:
(a) selinux_setprocattr() no longer does its check for whether the
current ptracer can access processes with the new SID inside the lock
that covers getting the ptracer's SID. Whilst this lock ensures that
the check is done with the ptracer pinned, the result is only valid
until the lock is released, so there's no point doing it inside the
lock.
(12) is_single_threaded().
This function has been extracted from selinux_setprocattr() and put into
a file of its own in the lib/ directory as join_session_keyring() now
wants to use it too.
The code in SELinux just checked to see whether a task shared mm_structs
with other tasks (CLONE_VM), but that isn't good enough. We really want
to know if they're part of the same thread group (CLONE_THREAD).
(13) nfsd.
The NFS server daemon now has to use the COW credentials to set the
credentials it is going to use. It really needs to pass the credentials
down to the functions it calls, but it can't do that until other patches
in this series have been applied.
Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: James Morris <jmorris@namei.org>
Signed-off-by: James Morris <jmorris@namei.org>
2008-11-14 02:39:23 +03:00
int security_key_alloc ( struct key * key , const struct cred * cred ,
unsigned long flags )
2007-10-17 10:31:32 +04:00
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( key_alloc , 0 , key , cred , flags ) ;
2007-10-17 10:31:32 +04:00
}
void security_key_free ( struct key * key )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( key_free , key ) ;
2007-10-17 10:31:32 +04:00
}
2020-05-12 17:16:29 +03:00
int security_key_permission ( key_ref_t key_ref , const struct cred * cred ,
enum key_need_perm need_perm )
2007-10-17 10:31:32 +04:00
{
2020-05-12 17:16:29 +03:00
return call_int_hook ( key_permission , 0 , key_ref , cred , need_perm ) ;
2007-10-17 10:31:32 +04:00
}
2008-04-29 12:01:26 +04:00
int security_key_getsecurity ( struct key * key , char * * _buffer )
{
2015-05-03 01:11:42 +03:00
* _buffer = NULL ;
2015-05-03 01:11:29 +03:00
return call_int_hook ( key_getsecurity , 0 , key , _buffer ) ;
2008-04-29 12:01:26 +04:00
}
2007-10-17 10:31:32 +04:00
# endif /* CONFIG_KEYS */
2008-03-01 23:00:05 +03:00
# ifdef CONFIG_AUDIT
int security_audit_rule_init ( u32 field , u32 op , char * rulestr , void * * lsmrule )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( audit_rule_init , 0 , field , op , rulestr , lsmrule ) ;
2008-03-01 23:00:05 +03:00
}
int security_audit_rule_known ( struct audit_krule * krule )
{
2015-05-03 01:11:29 +03:00
return call_int_hook ( audit_rule_known , 0 , krule ) ;
2008-03-01 23:00:05 +03:00
}
void security_audit_rule_free ( void * lsmrule )
{
2015-05-03 01:11:29 +03:00
call_void_hook ( audit_rule_free , lsmrule ) ;
2008-03-01 23:00:05 +03:00
}
2019-01-31 19:52:11 +03:00
int security_audit_rule_match ( u32 secid , u32 field , u32 op , void * lsmrule )
2008-03-01 23:00:05 +03:00
{
2019-01-31 19:52:11 +03:00
return call_int_hook ( audit_rule_match , 0 , secid , field , op , lsmrule ) ;
2008-03-01 23:00:05 +03:00
}
2015-05-03 01:11:42 +03:00
# endif /* CONFIG_AUDIT */
2017-10-18 23:00:24 +03:00
# ifdef CONFIG_BPF_SYSCALL
int security_bpf ( int cmd , union bpf_attr * attr , unsigned int size )
{
return call_int_hook ( bpf , 0 , cmd , attr , size ) ;
}
int security_bpf_map ( struct bpf_map * map , fmode_t fmode )
{
return call_int_hook ( bpf_map , 0 , map , fmode ) ;
}
int security_bpf_prog ( struct bpf_prog * prog )
{
return call_int_hook ( bpf_prog , 0 , prog ) ;
}
int security_bpf_map_alloc ( struct bpf_map * map )
{
return call_int_hook ( bpf_map_alloc_security , 0 , map ) ;
}
int security_bpf_prog_alloc ( struct bpf_prog_aux * aux )
{
return call_int_hook ( bpf_prog_alloc_security , 0 , aux ) ;
}
void security_bpf_map_free ( struct bpf_map * map )
{
call_void_hook ( bpf_map_free_security , map ) ;
}
void security_bpf_prog_free ( struct bpf_prog_aux * aux )
{
call_void_hook ( bpf_prog_free_security , aux ) ;
}
# endif /* CONFIG_BPF_SYSCALL */
2019-08-20 03:17:38 +03:00
int security_locked_down ( enum lockdown_reason what )
{
return call_int_hook ( locked_down , 0 , what ) ;
}
EXPORT_SYMBOL ( security_locked_down ) ;
perf_event: Add support for LSM and SELinux checks
In current mainline, the degree of access to perf_event_open(2) system
call depends on the perf_event_paranoid sysctl. This has a number of
limitations:
1. The sysctl is only a single value. Many types of accesses are controlled
based on the single value thus making the control very limited and
coarse grained.
2. The sysctl is global, so if the sysctl is changed, then that means
all processes get access to perf_event_open(2) opening the door to
security issues.
This patch adds LSM and SELinux access checking which will be used in
Android to access perf_event_open(2) for the purposes of attaching BPF
programs to tracepoints, perf profiling and other operations from
userspace. These operations are intended for production systems.
5 new LSM hooks are added:
1. perf_event_open: This controls access during the perf_event_open(2)
syscall itself. The hook is called from all the places that the
perf_event_paranoid sysctl is checked to keep it consistent with the
systctl. The hook gets passed a 'type' argument which controls CPU,
kernel and tracepoint accesses (in this context, CPU, kernel and
tracepoint have the same semantics as the perf_event_paranoid sysctl).
Additionally, I added an 'open' type which is similar to
perf_event_paranoid sysctl == 3 patch carried in Android and several other
distros but was rejected in mainline [1] in 2016.
2. perf_event_alloc: This allocates a new security object for the event
which stores the current SID within the event. It will be useful when
the perf event's FD is passed through IPC to another process which may
try to read the FD. Appropriate security checks will limit access.
3. perf_event_free: Called when the event is closed.
4. perf_event_read: Called from the read(2) and mmap(2) syscalls for the event.
5. perf_event_write: Called from the ioctl(2) syscalls for the event.
[1] https://lwn.net/Articles/696240/
Since Peter had suggest LSM hooks in 2016 [1], I am adding his
Suggested-by tag below.
To use this patch, we set the perf_event_paranoid sysctl to -1 and then
apply selinux checking as appropriate (default deny everything, and then
add policy rules to give access to domains that need it). In the future
we can remove the perf_event_paranoid sysctl altogether.
Suggested-by: Peter Zijlstra <peterz@infradead.org>
Co-developed-by: Peter Zijlstra <peterz@infradead.org>
Signed-off-by: Joel Fernandes (Google) <joel@joelfernandes.org>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Acked-by: James Morris <jmorris@namei.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
Cc: rostedt@goodmis.org
Cc: Yonghong Song <yhs@fb.com>
Cc: Kees Cook <keescook@chromium.org>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Alexei Starovoitov <ast@kernel.org>
Cc: jeffv@google.com
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: primiano@google.com
Cc: Song Liu <songliubraving@fb.com>
Cc: rsavitski@google.com
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Matthew Garrett <matthewgarrett@google.com>
Link: https://lkml.kernel.org/r/20191014170308.70668-1-joel@joelfernandes.org
2019-10-14 20:03:08 +03:00
# ifdef CONFIG_PERF_EVENTS
int security_perf_event_open ( struct perf_event_attr * attr , int type )
{
return call_int_hook ( perf_event_open , 0 , attr , type ) ;
}
int security_perf_event_alloc ( struct perf_event * event )
{
return call_int_hook ( perf_event_alloc , 0 , event ) ;
}
void security_perf_event_free ( struct perf_event * event )
{
call_void_hook ( perf_event_free , event ) ;
}
int security_perf_event_read ( struct perf_event * event )
{
return call_int_hook ( perf_event_read , 0 , event ) ;
}
int security_perf_event_write ( struct perf_event * event )
{
return call_int_hook ( perf_event_write , 0 , event ) ;
}
# endif /* CONFIG_PERF_EVENTS */
lsm,io_uring: add LSM hooks to io_uring
A full expalantion of io_uring is beyond the scope of this commit
description, but in summary it is an asynchronous I/O mechanism
which allows for I/O requests and the resulting data to be queued
in memory mapped "rings" which are shared between the kernel and
userspace. Optionally, io_uring offers the ability for applications
to spawn kernel threads to dequeue I/O requests from the ring and
submit the requests in the kernel, helping to minimize the syscall
overhead. Rings are accessed in userspace by memory mapping a file
descriptor provided by the io_uring_setup(2), and can be shared
between applications as one might do with any open file descriptor.
Finally, process credentials can be registered with a given ring
and any process with access to that ring can submit I/O requests
using any of the registered credentials.
While the io_uring functionality is widely recognized as offering a
vastly improved, and high performing asynchronous I/O mechanism, its
ability to allow processes to submit I/O requests with credentials
other than its own presents a challenge to LSMs. When a process
creates a new io_uring ring the ring's credentials are inhertied
from the calling process; if this ring is shared with another
process operating with different credentials there is the potential
to bypass the LSMs security policy. Similarly, registering
credentials with a given ring allows any process with access to that
ring to submit I/O requests with those credentials.
In an effort to allow LSMs to apply security policy to io_uring I/O
operations, this patch adds two new LSM hooks. These hooks, in
conjunction with the LSM anonymous inode support previously
submitted, allow an LSM to apply access control policy to the
sharing of io_uring rings as well as any io_uring credential changes
requested by a process.
The new LSM hooks are described below:
* int security_uring_override_creds(cred)
Controls if the current task, executing an io_uring operation,
is allowed to override it's credentials with @cred. In cases
where the current task is a user application, the current
credentials will be those of the user application. In cases
where the current task is a kernel thread servicing io_uring
requests the current credentials will be those of the io_uring
ring (inherited from the process that created the ring).
* int security_uring_sqpoll(void)
Controls if the current task is allowed to create an io_uring
polling thread (IORING_SETUP_SQPOLL). Without a SQPOLL thread
in the kernel processes must submit I/O requests via
io_uring_enter(2) which allows us to compare any requested
credential changes against the application making the request.
With a SQPOLL thread, we can no longer compare requested
credential changes against the application making the request,
the comparison is made against the ring's credentials.
Signed-off-by: Paul Moore <paul@paul-moore.com>
2021-02-02 03:56:49 +03:00
# ifdef CONFIG_IO_URING
int security_uring_override_creds ( const struct cred * new )
{
return call_int_hook ( uring_override_creds , 0 , new ) ;
}
int security_uring_sqpoll ( void )
{
return call_int_hook ( uring_sqpoll , 0 ) ;
}
2022-07-15 22:16:22 +03:00
int security_uring_cmd ( struct io_uring_cmd * ioucmd )
{
return call_int_hook ( uring_cmd , 0 , ioucmd ) ;
}
lsm,io_uring: add LSM hooks to io_uring
A full expalantion of io_uring is beyond the scope of this commit
description, but in summary it is an asynchronous I/O mechanism
which allows for I/O requests and the resulting data to be queued
in memory mapped "rings" which are shared between the kernel and
userspace. Optionally, io_uring offers the ability for applications
to spawn kernel threads to dequeue I/O requests from the ring and
submit the requests in the kernel, helping to minimize the syscall
overhead. Rings are accessed in userspace by memory mapping a file
descriptor provided by the io_uring_setup(2), and can be shared
between applications as one might do with any open file descriptor.
Finally, process credentials can be registered with a given ring
and any process with access to that ring can submit I/O requests
using any of the registered credentials.
While the io_uring functionality is widely recognized as offering a
vastly improved, and high performing asynchronous I/O mechanism, its
ability to allow processes to submit I/O requests with credentials
other than its own presents a challenge to LSMs. When a process
creates a new io_uring ring the ring's credentials are inhertied
from the calling process; if this ring is shared with another
process operating with different credentials there is the potential
to bypass the LSMs security policy. Similarly, registering
credentials with a given ring allows any process with access to that
ring to submit I/O requests with those credentials.
In an effort to allow LSMs to apply security policy to io_uring I/O
operations, this patch adds two new LSM hooks. These hooks, in
conjunction with the LSM anonymous inode support previously
submitted, allow an LSM to apply access control policy to the
sharing of io_uring rings as well as any io_uring credential changes
requested by a process.
The new LSM hooks are described below:
* int security_uring_override_creds(cred)
Controls if the current task, executing an io_uring operation,
is allowed to override it's credentials with @cred. In cases
where the current task is a user application, the current
credentials will be those of the user application. In cases
where the current task is a kernel thread servicing io_uring
requests the current credentials will be those of the io_uring
ring (inherited from the process that created the ring).
* int security_uring_sqpoll(void)
Controls if the current task is allowed to create an io_uring
polling thread (IORING_SETUP_SQPOLL). Without a SQPOLL thread
in the kernel processes must submit I/O requests via
io_uring_enter(2) which allows us to compare any requested
credential changes against the application making the request.
With a SQPOLL thread, we can no longer compare requested
credential changes against the application making the request,
the comparison is made against the ring's credentials.
Signed-off-by: Paul Moore <paul@paul-moore.com>
2021-02-02 03:56:49 +03:00
# endif /* CONFIG_IO_URING */