systemd

mirror of https://github.com/systemd/systemd.git synced 2024-11-01 00:51:24 +03:00

Author	SHA1	Message	Date
Yu Watanabe	3536f49e8f	core: add {State,Cache,Log,Configuration}Directory= (#6384 ) This introduces {State,Cache,Log,Configuration}Directory= those are similar to RuntimeDirectory=. They create the directories under /var/lib, /var/cache/, /var/log, or /etc, respectively, with the mode specified in {State,Cache,Log,Configuration}DirectoryMode=. This also fixes #6391.	2017-07-18 14:34:52 +02:00
Lennart Poettering	7398320f9a	Merge pull request #6328 from yuwata/runtime-preserve core: Allow preserving contents of RuntimeDirectory over process restart	2017-07-17 10:02:19 +02:00
Yu Watanabe	23a7448efa	core: support subdirectories in RuntimeDirectory= option	2017-07-17 16:30:53 +09:00
Yu Watanabe	53f47dfc7b	core: allow preserving contents of RuntimeDirectory= over process restart This introduces RuntimeDirectoryPreserve= option which takes a boolean argument or 'restart'. Closes #6087.	2017-07-17 16:22:25 +09:00
Lucas Werkmeister	ceabfb889d	Fix spelling (#6378 )	2017-07-15 12:29:09 -04:00
Lennart Poettering	6297d07b82	Merge pull request #6300 from keszybz/refuse-to-load-some-units Refuse to load some units	2017-07-12 09:28:20 +02:00
Zbigniew Jędrzejewski-Szmek	b023856884	man: add warnings that Private*= settings are not always applied	2017-07-11 13:38:13 -04:00
Lennart Poettering	565dab8ef4	man: briefly document permitted user/group name syntax for User=/Group= and syusers.d (#6321 ) As discussed here: https://lists.freedesktop.org/archives/systemd-devel/2017-July/039237.html	2017-07-10 13:44:06 -04:00
Zbigniew Jędrzejewski-Szmek	189cd8c2ab	man: describe RuntimeDirectoryMode= Fixes #5509.	2017-06-17 15:23:02 -04:00
Zbigniew Jędrzejewski-Szmek	03c3c52040	man: update MemoryDenyWriteExecute description for executable stacks Without going into details, mention that libraries are also covered by the filters, and that executable stacks are a no no. Closes #5970.	2017-05-30 16:44:48 -04:00
Zbigniew Jędrzejewski-Szmek	98e9d71022	man: fix links to external man pages linkchecker ftw!	2017-05-07 11:29:40 -04:00
James Cowgill	a3645cc6dd	seccomp: add clone syscall definitions for mips (#5880 ) Also updates the documentation and adds a mention of ppc64 support which was enabled by #5325. Tested on Debian mipsel and mips64el. The other 4 mips architectures should have an identical user <-> kernel ABI to one of the 2 tested systems.	2017-05-03 18:35:45 +02:00
Mark Stosberg	b8e485faf1	man: document how to include an equals sign in a value provided to Environment= (#5710 ) It wasn't clear before how an equals sign in an "Environment=" value might be handled. Ref: http://stackoverflow.com/questions/43278883/how-to-write-systemd-environment-variables-value-which-contains/43280157	2017-04-11 23:19:06 +02:00
Torstein Husebø	6cf5a96489	man: fix typo (#5556 )	2017-03-08 07:54:22 -05:00
Lennart Poettering	525872bfab	man: document that ProtectKernelTunables= and ProtectControlGroups= implies MountAPIVFS= See: #5384	2017-02-21 21:55:43 +01:00
AsciiWolf	28a0ad81ee	man: use https:// in URLs	2017-02-21 16:28:04 +01:00
Lennart Poettering	0b8fab97cf	man: improve documentation on seccomp regarding alternative ABIs Let's clarify that RestrictAddressFamilies= and MemoryDenyWriteExecute= are only fully effective if non-native system call architectures are disabled, since they otherwise may be used to circumvent the filters, as the filters aren't equally effective on all ABIs. Fixes: #5277	2017-02-09 18:42:17 +01:00
Lennart Poettering	23deef88b9	Revert "core/execute: set HOME, USER also for root users" This reverts commit `8b89628a10`. This broke #5246	2017-02-09 11:43:44 +01:00
Zbigniew Jędrzejewski-Szmek	fc6149a6ce	Merge pull request #4962 from poettering/root-directory-2 Add new MountAPIVFS= boolean unit file setting + RootImage=	2017-02-08 23:05:05 -05:00
Zbigniew Jędrzejewski-Szmek	ef3116b5d4	man: add more commas for clarify and reword a few sentences	2017-02-08 22:53:16 -05:00
Lennart Poettering	ae9d60ce4e	seccomp: on s390 the clone() parameters are reversed Add a bit of code that tries to get the right parameter order in place for some of the better known architectures, and skips restrict_namespaces for other archs. This also bypasses the test on archs where we don't know the right order. In this case I didn't bother with testing the case where no filter is applied, since that is hopefully just an issue for now, as there's nothing stopping us from supporting more archs, we just need to know which order is right. Fixes: #5241	2017-02-08 22:21:27 +01:00
Lennart Poettering	8a50cf6957	seccomp: MemoryDenyWriteExecute= should affect both mmap() and mmap2() (#5254 ) On i386 we block the old mmap() call entirely, since we cannot properly filter it. Thankfully it hasn't been used by glibc since quite some time. Fixes: #5240	2017-02-08 15:14:02 +01:00
Lennart Poettering	915e6d1676	core: add RootImage= setting for using a specific image file as root directory for a service This is similar to RootDirectory= but mounts the root file system from a block device or loopback file instead of another directory. This reuses the image dissector code now used by nspawn and gpt-auto-discovery.	2017-02-07 12:19:42 +01:00
Lennart Poettering	5d997827e2	core: add a per-unit setting MountAPIVFS= for mounting /dev, /proc, /sys in conjunction with RootDirectory= This adds a boolean unit file setting MountAPIVFS=. If set, the three main API VFS mounts will be mounted for the service. This only has an effect on RootDirectory=, which it makes a ton times more useful. (This is basically the /dev + /proc + /sys mounting code posted in the original #4727, but rebased on current git, and with the automatic logic replaced by explicit logic controlled by a unit file setting)	2017-02-07 11:22:05 +01:00
Lennart Poettering	142bd808a1	man: Document that RestrictAddressFamilies= doesn't work on s390/s390x/... We already say that it doesn't work on i386, but there are more archs like that apparently.	2017-02-06 14:17:12 +01:00
Zbigniew Jędrzejewski-Szmek	8b89628a10	core/execute: set HOME, USER also for root users This changes the environment for services running as root from: LANG=C.utf8 PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin INVOCATION_ID=ffbdec203c69499a9b83199333e31555 JOURNAL_STREAM=8:1614518 to LANG=C.utf8 PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin HOME=/root LOGNAME=root USER=root SHELL=/bin/sh INVOCATION_ID=15a077963d7b4ca0b82c91dc6519f87c JOURNAL_STREAM=8:1616718 Making the environment special for the root user complicates things unnecessarily. This change simplifies both our logic (by making the setting of the variables unconditional), and should also simplify the logic in services (particularly scripts). Fixes #5124.	2017-02-03 11:49:22 -05:00
Brandon Philips	9806301614	man: fix spelling error parth -> path	2017-02-02 00:54:42 +01:00
Jakub Wilk	301a21a880	man: fix typos (#5109 )	2017-01-19 16:54:22 +01:00
Zbigniew Jędrzejewski-Szmek	5b3637b44a	Merge pull request #4991 from poettering/seccomp-fix	2017-01-17 23:10:46 -05:00
Zbigniew Jędrzejewski-Szmek	374e692252	Merge pull request #5009 from ian-kelling/ian-mnt-namespace-doc	2017-01-11 15:23:00 -05:00
Ian Kelling	fa2a396620	doc: MountFlags= don't reference container which may not exist (#5011 )	2017-01-03 21:32:31 +01:00
Ian Kelling	7141028d30	doc: correct "or" to "and" in MountFlags= description (#5010 )	2017-01-03 21:31:20 +01:00
Ian Kelling	4b957756b8	man: document mount deletion between commands	2017-01-03 02:17:50 -08:00
Martin Pitt	56a9366d7d	Merge pull request #4994 from poettering/private-tmp-tmpfiles automatically clean up PrivateTmp= left-overs in /var/tmp on next boot	2016-12-29 11:18:38 +01:00
Lennart Poettering	9eb484fa40	man: add brief documentation for the (sd-pam) processes created due to PAMName= (#4967 ) A follow-up for #4942, adding a brief but more correct explanation of the processes.	2016-12-29 10:55:27 +01:00
Lennart Poettering	d71f050599	core: implicitly order units with PrivateTmp= after systemd-tmpfiles-setup.service Preparation for fixing #4401.	2016-12-27 23:25:24 +01:00
Lennart Poettering	bd2ab3f4f6	seccomp: add two new filter sets: @reboot and @swap These groupe reboot()/kexec() and swapon()/swapoff() respectively	2016-12-27 18:09:37 +01:00
Lennart Poettering	d2d6c096f6	core: add ability to define arbitrary bind mounts for services This adds two new settings BindPaths= and BindReadOnlyPaths=. They allow defining arbitrary bind mounts specific to particular services. This is particularly useful for services with RootDirectory= set as this permits making specific bits of the host directory available to chrooted services. The two new settings follow the concepts nspawn already possess in --bind= and --bind-ro=, as well as the .nspawn settings Bind= and BindReadOnly= (and these latter options should probably be renamed to BindPaths= and BindReadOnlyPaths= too). Fixes: #3439	2016-12-14 00:54:10 +01:00
Jouke Witteveen	a4e26faf33	man: fix $SERVICE_RESULT/$EXIT_CODE/$EXIT_STATUS documentation Note that any exit code is available through $EXIT_STATUS and not through $EXIT_CODE. This mimics siginfo.	2016-12-06 13:37:14 +01:00
Jouke Witteveen	7ed0a4c537	bus-util: add protocol error type explanation	2016-11-29 23:19:52 +01:00
Jouke Witteveen	e0c7d5f7be	man: document protocol error type for service failures (#4724 )	2016-11-23 22:51:33 +01:00
Lennart Poettering	1a1b13c957	seccomp: add @filesystem syscall group (#4537 ) @filesystem groups various file system operations, such as opening files and directories for read/write and stat()ing them, plus renaming, deleting, symlinking, hardlinking.	2016-11-21 19:29:12 -05:00
Lennart Poettering	5327c910d2	namespace: simplify, optimize and extend handling of mounts for namespace This changes a couple of things in the namespace handling: It merges the BindMount and TargetMount structures. They are mostly the same, hence let's just use the same structue, and rely on C's implicit zero initialization of partially initialized structures for the unneeded fields. This reworks memory management of each entry a bit. It now contains one "const" and one "malloc" path. We use the former whenever we can, but use the latter when we have to, which is the case when we have to chase symlinks or prefix a root directory. This means in the common case we don't actually need to allocate any dynamic memory. To make this easy to use we add an accessor function bind_mount_path() which retrieves the right path string from a BindMount structure. While we are at it, also permit "+" as prefix for dirs configured with ReadOnlyPaths= and friends: if specified the root directory of the unit is implicited prefixed. This also drops set_bind_mount() and uses C99 structure initialization instead, which I think is more readable and clarifies what is being done. This drops append_protect_kernel_tunables() and append_protect_kernel_modules() as append_static_mounts() is now simple enough to be called directly. Prefixing with the root dir is now done in an explicit step in prefix_where_needed(). It will prepend the root directory on each entry that doesn't have it prefixed yet. The latter is determined depending on an extra bit in the BindMount structure.	2016-11-17 18:08:32 +01:00
Djalal Harouni	8526555680	doc: move ProtectKernelModules= documentation near ProtectKernelTunalbes=	2016-11-15 15:04:41 +01:00
Djalal Harouni	a7db8614f3	doc: note when no new privileges is implied	2016-11-15 15:04:35 +01:00
Lennart Poettering	add005357d	core: add new RestrictNamespaces= unit file setting This new setting permits restricting whether namespaces may be created and managed by processes started by a unit. It installs a seccomp filter blocking certain invocations of unshare(), clone() and setns(). RestrictNamespaces=no is the default, and does not restrict namespaces in any way. RestrictNamespaces=yes takes away the ability to create or manage any kind of namspace. "RestrictNamespaces=mnt ipc" restricts the creation of namespaces so that only mount and IPC namespaces may be created/managed, but no other kind of namespaces. This setting should be improve security quite a bit as in particular user namespacing was a major source of CVEs in the kernel in the past, and is accessible to unprivileged processes. With this setting the entire attack surface may be removed for system services that do not make use of namespaces.	2016-11-04 07:40:13 -06:00
Zbigniew Jędrzejewski-Szmek	cf88547034	Merge pull request #4548 from keszybz/seccomp-help systemd-analyze syscall-filter	2016-11-03 20:27:45 -04:00
Kees Cook	d974f949f1	doc: clarify NoNewPrivileges (#4562 ) Setting no_new_privs does not stop UID changes, but rather blocks gaining privileges through execve(). Also fixes a small typo.	2016-11-03 20:26:59 -04:00
Zbigniew Jędrzejewski-Szmek	d5efc18b60	seccomp-util, analyze: export comments as a help string Just to make the whole thing easier for users.	2016-11-03 09:35:36 -04:00
Zbigniew Jędrzejewski-Szmek	869feb3388	analyze: add syscall-filter verb This should make it easier for users to understand what each filter means as the list of syscalls is updated in subsequent systemd versions.	2016-11-03 09:35:35 -04:00
Lennart Poettering	2ca8dc15f9	man: document that too strict system call filters may affect the service manager If execve() or socket() is filtered the service manager might get into trouble executing the service binary, or handling any failures when this fails. Mention this in the documentation. The other option would be to implicitly whitelist all system calls that are required for these codepaths. However, that appears less than desirable as this would mean socket() and many related calls have to be whitelisted unconditionally. As writing system call filters requires a certain level of expertise anyway it sounds like the better option to simply document these issues and suggest that the user disables system call filters in the service temporarily in order to debug any such failures. See: #3993.	2016-11-02 08:55:24 -06:00
Lennart Poettering	133ddbbeae	seccomp: add two new syscall groups @resources contains various syscalls that alter resource limits and memory and scheduling parameters of processes. As such they are good candidates to block for most services. @basic-io contains a number of basic syscalls for I/O, similar to the list seccomp v1 permitted but slightly more complete. It should be useful for building basic whitelisting for minimal sandboxes	2016-11-02 08:50:00 -06:00
Lennart Poettering	aa6b9cec88	man: two minor fixes	2016-11-02 08:50:00 -06:00
Lennart Poettering	cd5bfd7e60	seccomp: include pipes and memfd in @ipc These system calls clearly fall in the @ipc category, hence should be listed there, simply to avoid confusion and surprise by the user.	2016-11-02 08:50:00 -06:00
Lennart Poettering	a8c157ff30	seccomp: drop execve() from @process list The system call is already part in @default hence implicitly allowed anyway. Also, if it is actually blocked then systemd couldn't execute the service in question anymore, since the application of seccomp is immediately followed by it.	2016-11-02 08:49:59 -06:00
Lennart Poettering	c79aff9a82	seccomp: add clock query and sleeping syscalls to "@default" group Timing and sleep are so basic operations, it makes very little sense to ever block them, hence don't.	2016-11-02 08:49:59 -06:00
Zbigniew Jędrzejewski-Szmek	aa34055ffb	seccomp: allow specifying arm64, mips, ppc (#4491 ) "Secondary arch" table for mips is entirely speculative…	2016-11-01 09:33:18 -06:00
Jakub Wilk	b17649ee5e	man: fix typos (#4527 )	2016-10-31 08:08:08 -04:00
Djalal Harouni	fa1f250d6f	Merge pull request #4495 from topimiettinen/block-shmat-exec seccomp: also block shmat(..., SHM_EXEC) for MemoryDenyWriteExecute	2016-10-28 15:41:07 +02:00
Topi Miettinen	d2ffa389b8	seccomp: also block shmat(..., SHM_EXEC) for MemoryDenyWriteExecute shmat(..., SHM_EXEC) can be used to create writable and executable memory, so let's block it when MemoryDenyWriteExecute is set.	2016-10-26 18:59:14 +03:00
Zbigniew Jędrzejewski-Szmek	74388c2d11	man: document the default value of NoNewPrivileges= Fixes #4329.	2016-10-24 23:45:57 -04:00
Lennart Poettering	47da760efd	man: document default for User= Replaces: #4375	2016-10-20 13:21:25 +02:00
Luca Bruno	52c239d770	core/exec: add a named-descriptor option ("fd") for streams (#4179 ) This commit adds a `fd` option to `StandardInput=`, `StandardOutput=` and `StandardError=` properties in order to connect standard streams to externally named descriptors provided by some socket units. This option looks for a file descriptor named as the corresponding stream. Custom names can be specified, separated by a colon. If multiple name-matches exist, the first matching fd will be used.	2016-10-17 20:05:49 -04:00
Lennart Poettering	c7458f9399	man: avoid abbreviated "cgroups" terminology (#4396 ) Let's avoid the overly abbreviated "cgroups" terminology. Let's instead write: "Linux Control Groups (cgroups)" is the long form wherever the term is introduced in prose. Use "control groups" in the short form wherever the term is used within brief explanations. Follow-up to: #4381	2016-10-17 09:50:26 -04:00
Zbigniew Jędrzejewski-Szmek	74b47bbd5d	man: add crosslink between systemd.resource-control(5) and systemd.exec(5) Fixes #4379.	2016-10-15 18:38:20 -04:00
Lennart Poettering	8bfdf29b24	Merge pull request #4243 from endocode/djalal/sandbox-first-protection-kernelmodules-v1 core:sandbox: Add ProtectKernelModules= and some fixes	2016-10-13 18:36:29 +02:00
Thomas Hindoe Paaboel Andersen	2dd678171e	man: typo fixes A mix of fixes for typos and UK english	2016-10-12 23:02:44 +02:00
Djalal Harouni	c575770b75	core:sandbox: lets make /lib/modules/ inaccessible on ProtectKernelModules= Lets go further and make /lib/modules/ inaccessible for services that do not have business with modules, this is a minor improvment but it may help on setups with custom modules and they are limited... in regard of kernel auto-load feature. This change introduce NameSpaceInfo struct which we may embed later inside ExecContext but for now lets just reduce the argument number to setup_namespace() and merge ProtectKernelModules feature.	2016-10-12 14:11:16 +02:00
Djalal Harouni	ac246d9868	doc: minor hint about InaccessiblePaths= in regard of ProtectKernelTunables=	2016-10-12 13:52:40 +02:00
Djalal Harouni	2cd0a73547	core:sandbox: remove CAP_SYS_RAWIO on PrivateDevices=yes The rawio system calls were filtered, but CAP_SYS_RAWIO allows to access raw data through /proc, ioctl and some other exotic system calls...	2016-10-12 13:39:49 +02:00
Djalal Harouni	502d704e5e	core:sandbox: Add ProtectKernelModules= option This is useful to turn off explicit module load and unload operations on modular kernels. This option removes CAP_SYS_MODULE from the capability bounding set for the unit, and installs a system call filter to block module system calls. This option will not prevent the kernel from loading modules using the module auto-load feature which is a system wide operation.	2016-10-12 13:31:21 +02:00
Zbigniew Jędrzejewski-Szmek	56b4c80b42	Merge pull request #4348 from poettering/docfixes Various smaller documentation fixes.	2016-10-11 13:49:15 -04:00
Lennart Poettering	f4c9356d13	man: beef up documentation on per-unit resource limits a bit Let's clarify that for user services some OS-defined limits bound the settings in the unit files. Fixes: #4232	2016-10-11 18:42:22 +02:00
Lennart Poettering	4b58153dd2	core: add "invocation ID" concept to service manager This adds a new invocation ID concept to the service manager. The invocation ID identifies each runtime cycle of a unit uniquely. A new randomized 128bit ID is generated each time a unit moves from and inactive to an activating or active state. The primary usecase for this concept is to connect the runtime data PID 1 maintains about a service with the offline data the journal stores about it. Previously we'd use the unit name plus start/stop times, which however is highly racy since the journal will generally process log data after the service already ended. The "invocation ID" kinda matches the "boot ID" concept of the Linux kernel, except that it applies to an individual unit instead of the whole system. The invocation ID is passed to the activated processes as environment variable. It is additionally stored as extended attribute on the cgroup of the unit. The latter is used by journald to automatically retrieve it for each log logged message and attach it to the log entry. The environment variable is very easily accessible, even for unprivileged services. OTOH the extended attribute is only accessible to privileged processes (this is because cgroupfs only supports the "trusted." xattr namespace, not "user."). The environment variable may be altered by services, the extended attribute may not be, hence is the better choice for the journal. Note that reading the invocation ID off the extended attribute from journald is racy, similar to the way reading the unit name for a logging process is. This patch adds APIs to read the invocation ID to sd-id128: sd_id128_get_invocation() may be used in a similar fashion to sd_id128_get_boot(). PID1's own logging is updated to always include the invocation ID when it logs information about a unit. A new bus call GetUnitByInvocationID() is added that allows retrieving a bus path to a unit by its invocation ID. The bus path is built using the invocation ID, thus providing a path for referring to a unit that is valid only for the current runtime cycleof it. Outlook for the future: should the kernel eventually allow passing of cgroup information along AF_UNIX/SOCK_DGRAM messages via a unique cgroup id, then we can alter the invocation ID to be generated as hash from that rather than entirely randomly. This way we can derive the invocation race-freely from the messages.	2016-10-07 20:14:38 +02:00
hbrueckner	6abfd30372	seccomp: add support for the s390 architecture (#4287 ) Add seccomp support for the s390 architecture (31-bit and 64-bit) to systemd. This requires libseccomp >= 2.3.1.	2016-10-05 13:58:55 +02:00
Stefan Schweter	cfaf4b75e0	man: remove consecutive duplicate words (#4268 ) This PR removes consecutive duplicate words from the man pages of: * `resolved.conf.xml` * `systemd.exec.xml` * `systemd.socket.xml`	2016-10-03 17:09:54 +02:00
Djalal Harouni	8f81a5f61b	core: Use @raw-io syscall group to filter I/O syscalls when PrivateDevices= is set Instead of having a local syscall list, use the @raw-io group which contains the same set of syscalls to filter.	2016-09-25 12:52:27 +02:00
Djalal Harouni	49accde7bd	core:sandbox: add more /proc/* entries to ProtectKernelTunables= Make ALSA entries, latency interface, mtrr, apm/acpi, suspend interface, filesystems configuration and IRQ tuning readonly. Most of these interfaces now days should be in /sys but they are still available through /proc, so just protect them. This patch does not touch /proc/net/...	2016-09-25 11:30:11 +02:00
Djalal Harouni	9221aec8d0	doc: explicitly document that /dev/mem and /dev/port are blocked by PrivateDevices=true	2016-09-25 11:25:44 +02:00
Djalal Harouni	e778185bb5	doc: documentation fixes for ReadWritePaths= and ProtectKernelTunables= Documentation fixes for ReadWritePaths= and ProtectKernelTunables= as reported by Evgeny Vereshchagin.	2016-09-25 11:25:31 +02:00
Lennart Poettering	6757c06a1a	man: shorten the exit status table a bit Let's merge a couple of columns, to make the table a bit shorter. This effectively just drops whitespace, not contents, but makes the currently humungous table much much more compact.	2016-09-25 10:52:57 +02:00
Lennart Poettering	81c8aceed4	man: the exit code/signal is stored in $EXIT_CODE, not $EXIT_STATUS	2016-09-25 10:52:57 +02:00
Lennart Poettering	effbd6d2ea	man: rework documentation for ReadOnlyPaths= and related settings This reworks the documentation for ReadOnlyPaths=, ReadWritePaths=, InaccessiblePaths=. It no longer claims that we'd follow symlinks relative to the host file system. (Which wasn't true actually, as we didn't follow symlinks at all in the most recent releases, and we know do follow them, but relative to RootDirectory=). This also replaces all references to the fact that all fs namespacing options can be undone with enough privileges and disable propagation by a single one in the documentation of ReadOnlyPaths= and friends, and then directs the read to this in all other places. Moreover a hint is added to the documentation of SystemCallFilter=, suggesting usage of ~@mount in case any of the fs namespacing related options are used.	2016-09-25 10:42:18 +02:00
Lennart Poettering	b2656f1b1c	man: in user-facing documentaiton don't reference C function names Let's drop the reference to the cap_from_name() function in the documentation for the capabilities setting, as it is hardly helpful. Our readers are not necessarily C hackers knowing the semantics of cap_from_name(). Moreover, the strings we accept are just the plain capability names as listed in capabilities(7) hence there's really no point in confusing the user with anything else.	2016-09-25 10:42:18 +02:00
Lennart Poettering	63bb64a056	core: imply ProtectHome=read-only and ProtectSystem=strict if DynamicUser=1 Let's make sure that services that use DynamicUser=1 cannot leave files in the file system should the system accidentally have a world-writable directory somewhere. This effectively ensures that directories need to be whitelisted rather than blacklisted for access when DynamicUser=1 is set.	2016-09-25 10:42:18 +02:00
Lennart Poettering	3f815163ff	core: introduce ProtectSystem=strict Let's tighten our sandbox a bit more: with this change ProtectSystem= gains a new setting "strict". If set, the entire directory tree of the system is mounted read-only, but the API file systems /proc, /dev, /sys are excluded (they may be managed with PrivateDevices= and ProtectKernelTunables=). Also, /home and /root are excluded as those are left for ProtectHome= to manage. In this mode, all "real" file systems (i.e. non-API file systems) are mounted read-only, and specific directories may only be excluded via ReadWriteDirectories=, thus implementing an effective whitelist instead of blacklist of writable directories. While we are at, also add /efi to the list of paths always affected by ProtectSystem=. This is a follow-up for `b52a109ad3` which added /efi as alternative for /boot. Our namespacing logic should respect that too.	2016-09-25 10:42:18 +02:00
Lennart Poettering	59eeb84ba6	core: add two new service settings ProtectKernelTunables= and ProtectControlGroups= If enabled, these will block write access to /sys, /proc/sys and /proc/sys/fs/cgroup.	2016-09-25 10:18:48 +02:00
Lennart Poettering	00d9ef8560	core: add RemoveIPC= setting This adds the boolean RemoveIPC= setting to service, socket, mount and swap units (i.e. all unit types that may invoke processes). if turned on, and the unit's user/group is not root, all IPC objects of the user/group are removed when the service is shut down. The life-cycle of the IPC objects is hence bound to the unit life-cycle. This is particularly relevant for units with dynamic users, as it is essential that no objects owned by the dynamic users survive the service exiting. In fact, this patch adds code to imply RemoveIPC= if DynamicUser= is set. In order to communicate the UID/GID of an executed process back to PID 1 this adds a new "user lookup" socket pair, that is inherited into the forked processes, and closed before the exec(). This is needed since we cannot do NSS from PID 1 due to deadlock risks, However need to know the used UID/GID in order to clean up IPC owned by it if the unit shuts down.	2016-08-19 00:37:25 +02:00
Zbigniew Jędrzejewski-Szmek	29df65f913	man: add "timeout" to status table (#3919 )	2016-08-11 10:51:49 +02:00
Lennart Poettering	56bf97e10f	Merge pull request #3914 from keszybz/fix-man-links Fix man links	2016-08-07 11:17:56 +02:00
Zbigniew Jędrzejewski-Szmek	e64e1bfd86	man: add a table of possible exit statuses (#3910 )	2016-08-07 11:14:40 +02:00
Zbigniew Jędrzejewski-Szmek	d87a2ef782	Merge pull request #3884 from poettering/private-users	2016-08-06 17:04:45 -04:00
Zbigniew Jędrzejewski-Szmek	0a07667d8d	man: provide html links to a bunch of external man pages	2016-08-06 16:39:53 -04:00
Lennart Poettering	136dc4c435	core: set $SERVICE_RESULT, $EXIT_CODE and $EXIT_STATUS in ExecStop=/ExecStopPost= commands This should simplify monitoring tools for services, by passing the most basic information about service result/exit information via environment variables, thus making it unnecessary to retrieve them explicitly via the bus.	2016-08-04 23:08:05 +02:00
Lennart Poettering	d251207d55	core: add new PrivateUsers= option to service execution This setting adds minimal user namespacing support to a service. When set the invoked processes will run in their own user namespace. Only a trivial mapping will be set up: the root user/group is mapped to root, and the user/group of the service will be mapped to itself, everything else is mapped to nobody. If this setting is used the service runs with no capabilities on the host, but configurable capabilities within the service. This setting is particularly useful in conjunction with RootDirectory= as the need to synchronize /etc/passwd and /etc/group between the host and the service OS tree is reduced, as only three UID/GIDs need to match: root, nobody and the user of the service itself. But even outside the RootDirectory= case this setting is useful to substantially reduce the attack surface of a service. Example command to test this: systemd-run -p PrivateUsers=1 -p User=foobar -t /bin/sh This runs a shell as user "foobar". When typing "ps" only processes owned by "root", by "foobar", and by "nobody" should be visible.	2016-08-03 20:42:04 +02:00
Zbigniew Jędrzejewski-Szmek	dadd6ecfa5	Merge pull request #3728 from poettering/dynamic-users	2016-07-25 16:40:26 -04:00
Lennart Poettering	43eb109aa9	core: change ExecStart=! syntax to ExecStart=+ (#3797 ) As suggested by @mbiebl we already use the "!" special char in unit file assignments for negation, hence we should not use it in a different context for privileged execution. Let's use "+" instead.	2016-07-25 16:53:33 +02:00
Lennart Poettering	29206d4619	core: add a concept of "dynamic" user ids, that are allocated as long as a service is running This adds a new boolean setting DynamicUser= to service files. If set, a new user will be allocated dynamically when the unit is started, and released when it is stopped. The user ID is allocated from the range 61184..65519. The user will not be added to /etc/passwd (but an NSS module to be added later should make it show up in getent passwd). For now, care should be taken that the service writes no files to disk, since this might result in files owned by UIDs that might get assigned dynamically to a different service later on. Later patches will tighten sandboxing in order to ensure that this cannot happen, except for a few selected directories. A simple way to test this is: systemd-run -p DynamicUser=1 /bin/sleep 99999	2016-07-22 15:53:45 +02:00
Alessandro Puccetti	2a624c36e6	doc,core: Read{Write,Only}Paths= and InaccessiblePaths= This patch renames Read{Write,Only}Directories= and InaccessibleDirectories= to Read{Write,Only}Paths= and InaccessiblePaths=, previous names are kept as aliases but they are not advertised in the documentation. Renamed variables: `read_write_dirs` --> `read_write_paths` `read_only_dirs` --> `read_only_paths` `inaccessible_dirs` --> `inaccessible_paths`	2016-07-19 17:22:02 +02:00
Alessandro Puccetti	c4b4170746	namespace: unify limit behavior on non-directory paths Despite the name, `Read{Write,Only}Directories=` already allows for regular file paths to be masked. This commit adds the same behavior to `InaccessibleDirectories=` and makes it explicit in the doc. This patch introduces `/run/systemd/inaccessible/{reg,dir,chr,blk,fifo,sock}` {dile,device}nodes and mounts on the appropriate one the paths specified in `InacessibleDirectories=`. Based on Luca's patch from https://github.com/systemd/systemd/pull/3327	2016-07-19 17:22:02 +02:00
Lennart Poettering	f4170c671b	execute: add a new easy-to-use RestrictRealtime= option to units It takes a boolean value. If true, access to SCHED_RR, SCHED_FIFO and SCHED_DEADLINE is blocked, which my be used to lock up the system.	2016-06-23 01:45:45 +02:00
Lennart Poettering	7bce046bcf	core: set $JOURNAL_STREAM to the dev_t/ino_t of the journal stream of executed services This permits services to detect whether their stdout/stderr is connected to the journal, and if so talk to the journal directly, thus permitting carrying of metadata. As requested by the gtk folks: #2473	2016-06-15 23:00:27 +02:00
Lennart Poettering	1f9ac68b5b	core: improve seccomp syscall grouping a bit This adds three new seccomp syscall groups: @keyring for kernel keyring access, @cpu-emulation for CPU emulation features, for exampe vm86() for dosemu and suchlike, and @debug for ptrace() and related calls. Also, the @clock group is updated with more syscalls that alter the system clock. capset() is added to @privileged, and pciconfig_iobase() is added to @raw-io. Finally, @obsolete is a cleaned up. A number of syscalls that never existed on Linux and have no number assigned on any architecture are removed, as they only exist in the man pages and other operating sytems, but not in code at all. create_module() is moved from @module to @obsolete, as it is an obsolete system call. mem_getpolicy() is removed from the @obsolete list, as it is not obsolete, but simply a NUMA API.	2016-06-13 16:25:54 +02:00
Alessandro Puccetti	cf677fe686	core/execute: add the magic character '!' to allow privileged execution (#3493 ) This patch implements the new magic character '!'. By putting '!' in front of a command, systemd executes it with full privileges ignoring paramters such as User, Group, SupplementaryGroups, CapabilityBoundingSet, AmbientCapabilities, SecureBits, SystemCallFilter, SELinuxContext, AppArmorProfile, SmackProcessLabel, and RestrictAddressFamilies. Fixes partially https://github.com/systemd/systemd/issues/3414 Related to https://github.com/coreos/rkt/issues/2482 Testing: 1. Create a user 'bob' 2. Create the unit file /etc/systemd/system/exec-perm.service (You can use the example below) 3. sudo systemctl start ext-perm.service 4. Verify that the commands starting with '!' were not executed as bob, 4.1 Looking to the output of ls -l /tmp/exec-perm 4.2 Each file contains the result of the id command. ````````````````````````````````````````````````````````````````` [Unit] Description=ext-perm [Service] Type=oneshot TimeoutStartSec=0 User=bob ExecStartPre=!/usr/bin/sh -c "/usr/bin/rm /tmp/exec-perm*" ; /usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-start-pre" ExecStart=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-start" ; !/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-star-2" ExecStartPost=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-start-post" ExecReload=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-reload" ExecStop=!/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-stop" ExecStopPost=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-stop-post" [Install] WantedBy=multi-user.target] `````````````````````````````````````````````````````````````````	2016-06-10 18:19:54 +02:00
Topi Miettinen	f3e4363593	core: Restrict mmap and mprotect with PAGE_WRITE\|PAGE_EXEC (#3319 ) (#3379 ) New exec boolean MemoryDenyWriteExecute, when set, installs a seccomp filter to reject mmap(2) with PAGE_WRITE\|PAGE_EXEC and mprotect(2) with PAGE_EXEC.	2016-06-03 17:58:18 +02:00
Topi Miettinen	201c1cc22a	core: add pre-defined syscall groups to SystemCallFilter= (#3053 ) (#3157 ) Implement sets of system calls to help constructing system call filters. A set starts with '@' to distinguish from a system call. Closes: #3053, #3157	2016-06-01 11:56:01 +02:00
Alessandro Puccetti	043cc71512	doc: clarify systemd.exec's paths definition (#3368 ) Definitions of ReadWriteDirectories=, ReadOnlyDirectories=, InaccessibleDirectories=, WorkingDirectory=, and RootDirecory= were not clear. This patch specifies when they are relative to the host's root directory and when they are relative to the service's root directory. Fixes #3248	2016-05-30 16:37:07 +02:00
Luca Bruno	008dce3875	man: fix recurring typo	2016-05-30 13:43:53 +02:00
topimiettinen	737ba3c82c	namespace: Make private /dev noexec and readonly (#3263 ) Private /dev will not be managed by udev or others, so we can make it noexec and readonly after we have made all device nodes. As /dev/shm needs to be writable, we can't use bind_remount_recursive().	2016-05-15 22:34:05 -04:00
Lennart Poettering	2985700185	core: make parsing of RLIMIT_NICE aware of actual nice levels	2016-04-29 16:27:49 +02:00
Lennart Poettering	dfe85b38d2	man: minor wording fixes As suggested in: https://github.com/systemd/systemd/pull/3124#discussion_r61068789	2016-04-29 12:23:34 +02:00
Lennart Poettering	28c75e2501	man: elaborate on the automatic systemd-journald.socket service dependencies Fixes: #1603	2016-04-26 12:00:49 +02:00
Nicolas Braud-Santoni	b50a16af8e	man: systemd.exec: Clarify InaccessibleDirectories (#3048 ) (#3048 )	2016-04-17 14:22:17 +02:00
Ronny Chevalier	19c0b0b9a5	core: set NoNewPrivileges for seccomp if we don't have CAP_SYS_ADMIN The manpage of seccomp specify that using seccomp with SECCOMP_SET_MODE_FILTER will return EACCES if the caller do not have CAP_SYS_ADMIN set, or if the no_new_privileges bit is not set. Hence, without NoNewPrivilege set, it is impossible to use a SystemCall* directive with a User directive set in system mode. Now, NoNewPrivileges is set if we are in user mode, or if we are in system mode and we don't have CAP_SYS_ADMIN, and SystemCall* directives are used.	2016-02-28 14:44:26 +01:00
Lennart Poettering	7882632d5a	man: extend the Personality= documentation Among other fixes, add information about more architectures that are supported these days.	2016-02-22 23:23:06 +01:00
Lennart Poettering	479050b363	core: drop Capabilities= setting The setting is hardly useful (since its effect is generally reduced to zero due to file system caps), and with the advent of ambient caps an actually useful replacement exists, hence let's get rid of this. I am pretty sure this was unused and our man page already recommended against its use, hence this should be a safe thing to remove.	2016-02-13 11:59:34 +01:00
Ismo Puustinen	ece87975a9	man: add AmbientCapabilities entry.	2016-01-12 12:14:50 +02:00
Karel Zak	91518d20dd	core: support <soft:hard> ranges for RLIMIT options The new parser supports: <value> - specify both limits to the same value <soft:hard> - specify both limits the size or time specific suffixes are supported, for example LimitRTTIME=1sec LimitAS=4G:16G The patch introduces parse_rlimit_range() and rlim type (size, sec, usec, etc.) specific parsers. No code is duplicated now. The patch also sync docs for DefaultLimitXXX= and LimitXXX=. References: https://github.com/systemd/systemd/issues/1769	2015-11-25 12:03:32 +01:00
Evgeny Vereshchagin	5c019cf260	man: systemd.exec: add missing variables	2015-11-19 13:37:16 +00:00
Tom Gundersen	fb5c8184a9	Merge pull request #1854 from poettering/unit-deps Dependency engine improvements	2015-11-11 23:14:12 +01:00
Lennart Poettering	c129bd5df3	man: document automatic dependencies For all units ensure there's an "Automatic Dependencies" section in the man page, and explain which dependencies are automatically added in all cases, and which ones are added on top if DefaultDependencies=yes is set. This is also done for systemd.exec(5), systemd.resource-control(5) and systemd.unit(5) as these pages describe common behaviour of various unit types.	2015-11-11 20:47:07 +01:00
Filipe Brandenburger	b4c14404b3	execute: Add new PassEnvironment= directive This directive allows passing environment variables from the system manager to spawned services. Variables in the system manager can be set inside a container by passing `--set-env=...` options to systemd-spawn. Tested with an on-disk test.service unit. Tested using multiple variable names on a single line, with an empty setting to clear the current list of variables, with non-existing variables. Tested using `systemd-run -p PassEnvironment=VARNAME` to confirm it works with transient units. Confirmed that `systemctl show` will display the PassEnvironment settings. Checked that man pages are generated correctly. No regressions in `make check`.	2015-11-11 07:55:23 -08:00
Lennart Poettering	a4c1800284	core: accept time units for time-based resource limits Let's make sure "LimitCPU=30min" can be parsed properly, following the usual logic how we parse time values. Similar for LimitRTTIME=. While we are at it, extend a bit on the man page section about resource limits. Fixes: #1772	2015-11-10 17:36:46 +01:00
Lennart Poettering	6c9e781eba	Merge pull request #1799 from jengelh/doc doc: typo and ortho fixes	2015-11-09 18:16:21 +01:00
Jan Engelhardt	a8eaaee72a	doc: correct orthography, word forms and missing/extraneous words	2015-11-06 13:45:21 +01:00
Jan Engelhardt	b938cb902c	doc: correct punctuation and improve typography in documentation	2015-11-06 13:00:02 +01:00
Karel Zak	412ea7a936	core: support IEC suffixes for RLIMIT stuff Let's make things more user-friendly and support for example LimitAS=16G rather than force users to always use LimitAS=16106127360. The change is relevant for options: [Default]Limit{FSIZE,DATA,STACK,CORE,RSS,AS,MEMLOCK,MSGQUEUE} The patch introduces config_parse_bytes_limit(), it's the same as config_parse_limit() but uses parse_size() tu support the suffixes. Addresses: https://github.com/systemd/systemd/issues/1772	2015-11-06 11:06:52 +01:00
Thomas Hindoe Paaboel Andersen	f2c624cb8b	man: various typos	2015-11-02 23:18:20 +01:00
Filipe Brandenburger	71b1c27a40	man: Update man page documentation for CPUAffinity Document support for commas as a separator and possibility of specifying ranges of CPU indices. Tested by regenerating the manpages locally and reading them on man.	2015-10-27 17:56:26 -07:00
Lennart Poettering	5f5d8eab1f	core: allow setting WorkingDirectory= to the special value ~ If set to ~ the working directory is set to the home directory of the user configured in User=. This change also exposes the existing switch for the working directory that allowed making missing working directories non-fatal. This also changes "machinectl shell" to make use of this to ensure that the invoked shell is by default in the user's home directory. Fixes #1268.	2015-09-29 21:55:51 +02:00
Lennart Poettering	6cd16034fc	man: add hyphen to improve man text	2015-08-25 18:37:53 +02:00
Lennart Poettering	023a4f6701	core: optionally create LOGIN_PROCESS or USER_PROCESS utmp entries When generating utmp/wtmp entries, optionally add both LOGIN_PROCESS and INIT_PROCESS entries or even all three of LOGIN_PROCESS, INIT_PROCESS and USER_PROCESS entries, instead of just a single INIT_PROCESS entry. With this change systemd may be used to not only invoke a getty directly in a SysV-compliant way but alternatively also a login(1) implementation or even forego getty and login entirely, and invoke arbitrary shells in a way that they appear in who(1) or w(1). This is preparation for a later commit that adds a "machinectl shell" operation to invoke a shell in a container, in a way that is compatible with who(1) and w(1).	2015-08-24 22:46:45 +02:00
Richard Maw	8f0d2981ca	man: Document invalid lines in EnvironmentFile If a line doesn't contain an = separator, it is skipped, rather than raising an error. This is potentially useful, so let's document this behaviour.	2015-08-04 09:58:50 +00:00
Christian Hesse	5833143708	man: ProtectHome= protects /root as well	2015-06-30 19:12:20 +02:00
Tom Gundersen	12b42c7667	man: revert dynamic paths for split-usr setups This did not really work out as we had hoped. Trying to do this upstream introduced several problems that probably makes it better suited as a downstream patch after all. At any rate, it is not releaseable in the current state, so we at least need to revert this before the release. * by adjusting the path to binaries, but not do the same thing to the search path we end up with inconsistent man-pages. Adjusting the search path too would be quite messy, and it is not at all obvious that this is worth the effort, but at any rate it would have to be done before we could ship this. * this means that distributed man-pages does not make sense as they depend on config options, and for better or worse we are still distributing man pages, so that is something that definitely needs sorting out before we could ship with this patch. * we have long held that split-usr is only minimally supported in order to boot, and something we hope will eventually go away. So before we start adding even more magic/effort in order to make this work nicely, we should probably question if it makes sense at all.	2015-06-18 19:47:44 +02:00
Filipe Brandenburger	681eb9cf2b	man: generate configured paths in manpages In particular, use /lib/systemd instead of /usr/lib/systemd in distributions like Debian which still have not adopted a /usr merge setup. Use XML entities from man/custom-entities.ent to replace configured paths while doing XSLT processing of the original XML files. There was precedent of some files (such as systemd.generator.xml) which were already using this approach. This addresses most of the (manual) fixes from this patch: http://anonscm.debian.org/cgit/pkg-systemd/systemd.git/tree/debian/patches/Fix-paths-in-man-pages.patch?h=experimental-220 The idea of using generic XML entities was presented here: http://lists.freedesktop.org/archives/systemd-devel/2015-May/032240.html This patch solves almost all the issues, with the exception of: - Path to /bin/mount and /bin/umount. - Generic statements about preference of /lib over /etc. These will be handled separately by follow up patches. Tested: - With default configure settings, ran "make install" to two separate directories and compared the output to confirm they matched exactly. - Used a set of configure flags including $CONFFLAGS from Debian: http://anonscm.debian.org/cgit/pkg-systemd/systemd.git/tree/debian/rules Installed the tree and confirmed the paths use /lib/systemd instead of /usr/lib/systemd and that no other unexpected differences exist. - Confirmed that `make distcheck` still passes.	2015-05-28 19:28:19 +02:00
Zbigniew Jędrzejewski-Szmek	b5c7d097ec	man: link to freebsd.org for inetd(8)	2015-03-13 23:42:18 -04:00
Zbigniew Jędrzejewski-Szmek	3ba3a79df4	man: fix a bunch of links All hail linkchecker!	2015-03-13 23:42:18 -04:00
David Herrmann	f407824d75	man: split paragraph Explicitly put the "multiple EnvironmentFile=" description into its own paragraph to make it much easier to find.	2015-03-12 12:48:22 +01:00
Zbigniew Jędrzejewski-Szmek	b975b0d514	man: boilerplate unification	2015-02-10 23:24:27 -05:00
Zbigniew Jędrzejewski-Szmek	798d3a524e	Reindent man pages to 2ch	2015-02-03 23:11:35 -05:00
Lennart Poettering	c51cbfdcc7	man: document that ProtectSystem= also covers /boot	2015-01-27 02:19:33 +01:00
Ronny Chevalier	6067b34a1f	man: document that we set both soft and hard limits for Limit directives See http://cgit.freedesktop.org/systemd/systemd/tree/src/core/load-fragment.c#n1100	2014-11-30 20:45:01 +01:00
Ronny Chevalier	536256fc91	man: fix typos	2014-11-30 20:20:59 +01:00
Ronny Chevalier	b8825fff7b	man: document equivalence between Limit directives and ulimit See https://bugs.freedesktop.org/show_bug.cgi?id=80341	2014-11-30 20:17:00 +01:00
WaLyong Cho	2ca620c4ed	smack: introduce new SmackProcessLabel option In service file, if the file has some of special SMACK label in ExecStart= and systemd has no permission for the special SMACK label then permission error will occurred. To resolve this, systemd should be able to set its SMACK label to something accessible of ExecStart=. So introduce new SmackProcessLabel. If label is specified with SmackProcessLabel= then the child systemd will set its label to that. To successfully execute the ExecStart=, accessible label should be specified with SmackProcessLabel=. Additionally, by SMACK policy, if the file in ExecStart= has no SMACK64EXEC then the executed process will have given label by SmackProcessLabel=. But if the file has SMACK64EXEC then the SMACK64EXEC label will be overridden. [zj: reword man page]	2014-11-24 10:20:53 -05:00
Lennart Poettering	2134b5ef6b	man: SyslogIdentifier= has an effect on journal logging too	2014-10-09 11:37:01 +02:00
Zbigniew Jędrzejewski-Szmek	e060073a8f	man: say that SecureBits= are space separated	2014-10-03 21:06:52 -04:00
Michael Biebl	67826132ad	man: fix references to systemctl man page which is now in section 1 https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=760613	2014-09-06 13:45:18 +02:00
Ruben Kerkhof	06b643e7f5	Fix a few more typos	2014-08-30 13:46:07 -04:00
Ronny Chevalier	8257df2767	man: fix typo	2014-08-18 21:02:07 +02:00
Lennart Poettering	79c1afc67f	man: improve documentation for StandardOutput= and StandardInput=	2014-08-11 19:29:25 +02:00
Ansgar Burchardt	ef392da6c5	Correct references to ProtectSystem and ProtectHome in documentation	2014-08-04 09:27:20 -04:00
Zbigniew Jędrzejewski-Szmek	8e8ba962c7	man: proper link for dmesg	2014-07-10 22:52:23 -04:00
Zbigniew Jędrzejewski-Szmek	5aded36978	man: add a mapping for external manpages It is annoying when we have dead links on fd.o. Add project='man-pages\|die-net\|archlinux' to <citerefentry>-ies. In generated html, add external links to http://man7.org/linux/man-pages/man, http://linux.die.net/man/, https://www.archlinux.org/. By default, pages in sections 2 and 4 go to man7, since Michael Kerrisk is the autorative source on kernel related stuff. The rest of links goes to linux.die.net, because they have the manpages. Except for the pacman stuff, since it seems to be only available from archlinux.org. Poor gummiboot gets no link, because gummitboot(8) ain't to be found on the net. According to common wisdom, that would mean that it does not exist. But I have seen Kay using it, so I know it does, and deserves to be found. Can somebody be nice and put it up somewhere?	2014-07-07 18:36:55 -04:00
Jan Engelhardt	8d0e0ddda6	doc: grammatical corrections	2014-06-28 00:06:30 -04:00
Lennart Poettering	d6797c920e	namespace: beef up read-only bind mount logic Instead of blindly creating another bind mount for read-only mounts, check if there's already one we can use, and if so, use it. Also, recursively mark all submounts read-only too. Also, ignore autofs mounts when remounting read-only unless they are already triggered.	2014-06-06 14:37:40 +02:00
Lennart Poettering	5331194c12	core: don't include /boot in effect of ProtectSystem= This would otherwise unconditionally trigger any /boot autofs mount, which we probably should avoid. ProtectSystem= will now only cover /usr and (optionally) /etc, both of which cannot be autofs anyway. ProtectHome will continue to cover /run/user and /home. The former cannot be autofs either. /home could be, however is frequently enough used (unlikey /boot) so that it isn't too problematic to simply trigger it unconditionally via ProtectHome=.	2014-06-05 10:03:26 +02:00
Lennart Poettering	1b8689f949	core: rename ReadOnlySystem= to ProtectSystem= and add a third value for also mounting /etc read-only Also, rename ProtectedHome= to ProtectHome=, to simplify things a bit. With this in place we now have two neat options ProtectSystem= and ProtectHome= for protecting the OS itself (and optionally its configuration), and for protecting the user's data.	2014-06-04 18:12:55 +02:00
Lennart Poettering	417116f234	core: add new ReadOnlySystem= and ProtectedHome= settings for service units ReadOnlySystem= uses fs namespaces to mount /usr and /boot read-only for a service. ProtectedHome= uses fs namespaces to mount /home and /run/user inaccessible or read-only for a service. This patch also enables these settings for all our long-running services. Together they should be good building block for a minimal service sandbox, removing the ability for services to modify the operating system or access the user's private data.	2014-06-03 23:57:51 +02:00
Nis Martensen	f1721625e7	fix spelling of privilege	2014-05-19 00:40:44 +09:00
Jan Engelhardt	b8bde11658	doc: comma placement corrections and word order Set commas where there should be some. Some improvements to word order.	2014-05-07 20:13:27 -04:00
Jan Engelhardt	dca348bcbb	doc: corrections to words and forms This patch exchange words which are inappropriate for a situation, deletes duplicated words, and adds particles where needed.	2014-05-07 20:13:26 -04:00
Jan Engelhardt	70a44afee3	doc: typographical fine tuning	2014-05-06 23:05:39 +02:00
Lennart Poettering	905826156d	man: be more specific when EnvironmentFile= is read http://lists.freedesktop.org/archives/systemd-devel/2014-March/018004.html	2014-03-25 00:26:09 +01:00
Lennart Poettering	7f8aa67131	core: remove tcpwrap support tcpwrap is legacy code, that is barely maintained upstream. It's APIs are awful, and the feature set it exposes (such as DNS and IDENT access control) questionnable. We should not support this natively in systemd. Hence, let's remove the code. If people want to continue making use of this, they can do so by plugging in "tcpd" for the processes they start. With that scheme things are as well or badly supported as they were from traditional inetd, hence no functionality is really lost.	2014-03-24 20:07:42 +01:00
Lennart Poettering	c2c13f2df4	unit: turn off mount propagation for udevd Keep mounts done by udev rules private to udevd. Also, document how MountFlags= may be used for this.	2014-03-20 04:16:39 +01:00
Lennart Poettering	907afa0682	man: improve documentation of fs namespace related settings	2014-03-19 22:26:08 +01:00
Lennart Poettering	f1660f96f5	core: drop CAP_MKNOD when PrivateDevices= is set	2014-03-18 17:58:19 +01:00
Lennart Poettering	e66cf1a3f9	core: introduce new RuntimeDirectory= and RuntimeDirectoryMode= unit settings As discussed on the ML these are useful to manage runtime directories below /run for services.	2014-03-03 17:55:32 +01:00
Lennart Poettering	f513e420c8	exec: imply NoNewPriviliges= only when seccomp filters are used in user mode	2014-02-26 02:28:52 +01:00
Lennart Poettering	4298d0b512	core: add new RestrictAddressFamilies= switch This new unit settings allows restricting which address families are available to processes. This is an effective way to minimize the attack surface of services, by turning off entire network stacks for them. This is based on seccomp, and does not work on x86-32, since seccomp cannot filter socketcall() syscalls on that platform.	2014-02-26 02:19:28 +01:00
Michael Scherer	eef65bf3ee	core: Add AppArmor profile switching This permit to switch to a specific apparmor profile when starting a daemon. This will result in a non operation if apparmor is disabled. It also add a new build requirement on libapparmor for using this feature.	2014-02-21 03:44:20 +01:00
Lennart Poettering	b67f562c9c	man: document $MAINPID	2014-02-19 03:27:03 +01:00
Lennart Poettering	ac45f971a1	core: add Personality= option for units to set the personality for spawned processes	2014-02-19 03:27:03 +01:00
Lennart Poettering	e9642be2cc	seccomp: add helper call to add all secondary archs to a seccomp filter And make use of it where appropriate for executing services and for nspawn.	2014-02-18 22:14:00 +01:00
Jan Engelhardt	66f756d437	doc: resolve missing/extraneous words or inappropriate forms Issues fixed: * missing words required by grammar * duplicated or extraneous words * inappropriate forms (e.g. singular/plural), and declinations * orthographic misspellings	2014-02-17 19:03:07 -05:00
Jan Engelhardt	73e231abde	doc: update punctuation Resolve spotted issues related to missing or extraneous commas, dashes.	2014-02-17 19:03:07 -05:00
Zbigniew Jędrzejewski-Szmek	6db2742802	man: replace STDOUT with standard output, etc. Actually 'STDOUT' is something that doesn't appear anywhere: in the stdlib we have 'stdin', and there's only the constant STDOUT_FILENO, so there's no reason to use capitals. When refering to code, STDOUT/STDOUT/STDERR are replaced with stdin/stdout/stderr, and in other places they are replaced with normal phrases like standard output, etc.	2014-02-14 22:03:40 -05:00
Jason St. John	bcddd5bf80	man: fix grammatical errors and other formatting issues * standardize capitalization of STDIN, STDOUT, and STDERR * reword some sentences for clarity * reflow some very long lines to be shorter than ~80 characters * add some missing <literal>, <constant>, <varname>, <option>, and <filename> tags	2014-02-14 22:03:40 -05:00
Lennart Poettering	57183d117a	core: add SystemCallArchitectures= unit setting to allow disabling of non-native architecture support for system calls Also, turn system call filter bus properties into complex types instead of concatenated strings.	2014-02-13 00:24:00 +01:00
Lennart Poettering	17df7223be	core: rework syscall filter - Allow configuration of an errno error to return from blacklisted syscalls, instead of immediately terminating a process. - Fix parsing logic when libseccomp support is turned off - Only keep the actual syscall set in the ExecContext, and generate the string version only on demand.	2014-02-12 18:30:36 +01:00
Ronny Chevalier	c0467cf387	syscallfilter: port to libseccomp	2014-02-12 18:30:36 +01:00
Lennart Poettering	82adf6af7c	nspawn,man: use a common vocabulary when referring to selinux security contexts Let's always call the security labels the same way: SMACK: "Smack Label" SELINUX: "SELinux Security Context" And the low-level encapsulation is called "seclabel". Now let's hope we stick to this vocabulary in future, too, and don't mix "label"s and "security contexts" and so on wildly.	2014-02-10 13:18:16 +01:00
Michael Scherer	0d3f7bb3a5	exec: Add support for ignoring errors on SELinuxContext by prefixing it with -, like for others settings. Also remove call to security_check_context, as this doesn't serve anything, since setexeccon will fail anyway.	2014-02-10 13:18:16 +01:00
Michael Scherer	7b52a628f8	exec: Add SELinuxContext configuration item This permit to let system administrators decide of the domain of a service. This can be used with templated units to have each service in a différent domain ( for example, a per customer database, using MLS or anything ), or can be used to force a non selinux enabled system (jvm, erlang, etc) to start in a different domain for each service.	2014-02-10 13:18:16 +01:00
Lennart Poettering	7f112f50fe	exec: introduce PrivateDevices= switch to provide services with a private /dev Similar to PrivateNetwork=, PrivateTmp= introduce PrivateDevices= that sets up a private /dev with only the API pseudo-devices like /dev/null, /dev/zero, /dev/random, but not any physical devices in them.	2014-01-20 21:28:37 +01:00
Zbigniew Jędrzejewski-Szmek	c5b37953b7	man: mention which variables will be expanded in ExecStart	2014-01-09 22:23:42 -05:00
Jan Engelhardt	e0e009c067	man: grammar and wording improvements This is a recurring submission and includes corrections to: - missing words, preposition choice. - change of /lib to /usr/lib, because that is what most distros are using as the system-wide location for systemd/udev files.	2013-12-25 22:53:46 -05:00
Jan Engelhardt	b040723ea4	man: improvements to comma placement This is a recurring submission and includes corrections to: comma placement.	2013-12-25 22:53:46 -05:00
Lennart Poettering	613b411c94	service: add the ability for units to join other unit's PrivateNetwork= and PrivateTmp= namespaces	2013-11-27 20:28:48 +01:00
Jan Engelhardt	72f4d9669c	man: wording and grammar updates This is a recurring submission and includes corrections to various issue spotted. I guess I can just skip over reporting ubiquitous comma placement fixes…	2013-10-15 08:19:49 -04:00
Zbigniew Jędrzejewski-Szmek	59fccd8211	execute.c: always set $SHELL In `e6dca81` $SHELL was added to user@.service. Let's instead provide it to all units which have a user.	2013-10-02 22:23:56 +02:00
Lennart Poettering	3fde5f30bd	man: drop references to "cgroup" wher appropriate Since cgroups are mostly now an implementation detail of systemd lets deemphasize it a bit in the man pages. This renames systemd.cgroup(5) to systemd.resource-control(5) and uses the term "resource control" rather than "cgroup" where appropriate. This leaves the word "cgroup" in at a couple of places though, like for example systemd-cgtop and systemd-cgls where cgroup stuff is at the core of what is happening.	2013-09-27 00:05:07 +02:00
Zbigniew Jędrzejewski-Szmek	43638332c4	man: add a list of environment variables	2013-09-17 10:26:30 -05:00
Jan Engelhardt	7964042405	man: wording and grammar updates This is a recurring submission and includes corrections to various issue spotted. I guess I can just skip over reporting ubiquitous comma placement fixes… Highligts in this particular commit: - the "unsigned" type qualifier is completed to form a full type "unsigned int" - alphabetic -> lexicographic (that way we automatically define how numbers get sorted)	2013-09-12 22:09:57 +02:00
Zbigniew Jędrzejewski-Szmek	f4ae69117b	man: Add a note about what environment variables are available by default	2013-09-12 09:29:01 -04:00
Jan Engelhardt	6b4991cfde	man: wording and grammar updates This includes regularly-submitted corrections to comma setting and orthographical mishaps that appeared in man/ in recent commits. In this particular commit: - the usual comma fixes - expand contractions (this is prose)	2013-09-10 18:34:41 +02:00
Maciej Wereski	ea92ae33e0	"-" prefix for InaccessibleDirectories and ReadOnlyDirectories	2013-08-23 12:48:14 -04:00
Lennart Poettering	dc7adf202b	man: drop the old cgroup settings from the man pages	2013-07-19 17:23:34 +02:00
Jason St. John	6ed80a4e34	man: use HTTPS links for links that support it	2013-07-16 17:42:56 +02:00
Jan Engelhardt	6a75304e41	man: wording and grammar update	2013-07-13 07:56:11 -04:00
Zbigniew Jędrzejewski-Szmek	d868475ad6	man: document the slice and scope units, add systemd.cgroup(5)	2013-07-12 01:10:04 -04:00
Zbigniew Jędrzejewski-Szmek	05cc726731	man: add more formatting markup	2013-07-02 23:06:22 -04:00
Jason St. John	e9dd9f9547	man: improve grammar and word formatting in numerous man pages Use proper grammar, word usage, adjective hyphenation, commas, capitalization, spelling, etc. To improve readability, some run-on sentences or sentence fragments were revised. [zj: remove the space from 'file name', 'host name', and 'time zone'.]	2013-07-02 23:06:22 -04:00
Zbigniew Jędrzejewski-Szmek	74d005783e	man: use <constant> for various constants which look ugly with quotes	2013-06-26 19:47:34 -04:00
Umut Tezduyar	97d0e5f83b	manager: add DefaultEnvironment option This complements existing functionality of setting variables through 'systemctl set-environment', the kernel command line, and through normal environment variables for systemd in session mode.	2013-06-20 16:27:45 -04:00
David Strauss	12f25b6e74	Standardize on 'file system' and 'namespace' in man pages. This change is based on existing usage in systemd and online. 'File-system' may make sense in adjectival form, but man pages seem to prefer 'file system' even in those situations.	2013-05-18 02:28:25 -07:00
Zbigniew Jędrzejewski-Szmek	845c53246f	man: add various filenames to the index Everything which is an absolute filename marked with <filename></filename> lands in the index, unless noindex= attribute is present. Should make it easier for people to find stuff when they are looking at a file on disk. Various formatting errors in manpages are fixed, kernel-install(1) is restored to formatting sanity.	2013-05-03 01:00:42 -04:00
Lennart Poettering	fbc15b7663	man: be clearer that it's not OK to manipulate systemd's own cgroup hirearchy	2013-04-08 20:35:25 +02:00
Lennart Poettering	d91c34f21f	exec: Assigning the empty string to CapabilityBoundSet= should drop all caps Previously, it would set all caps, but it should drop them all, anything else makes little sense. Also, document that this works as it does, and what to do in order to assign all caps to the bounding set. https://bugzilla.redhat.com/show_bug.cgi?id=914705	2013-03-22 23:28:44 +01:00
Michal Sekletar	c17ec25e4d	core: reuse the same /tmp, /var/tmp and inaccessible dir All Execs within the service, will get mounted the same /tmp and /var/tmp directories, if service is configured with PrivateTmp=yes. Temporary directories are cleaned up by service itself in addition to systemd-tmpfiles. Directory which is mounted as inaccessible is created at runtime in /run/systemd.	2013-03-15 22:56:40 -04:00
Zbigniew Jędrzejewski-Szmek	e670b166a0	man: use <replaceable> in various places	2013-02-13 23:09:00 -05:00
Zbigniew Jędrzejewski-Szmek	5f9cfd4c38	man: rename systemd.conf to systemd-system.conf Alias as systemd-user.conf is also provided. This should help users running systemd in session mode. https://bugzilla.redhat.com/show_bug.cgi?id=690868	2013-02-13 09:48:32 -05:00
Zbigniew Jędrzejewski-Szmek	ccc9a4f9ff	man: extend systemd.directives(7) to all manual pages New sections are added: PAM options, crypttab options, commandline options, miscellaneous. The last category will be used for all untagged <varname> elements. Commandline options sections is meant to be a developer tool: when adding an option it is sometimes useful to be able to check if similarly named options exist elsewhere.	2013-01-26 11:36:53 -05:00
Zbigniew Jędrzejewski-Szmek	652d0dd709	man: mention that PrivateTmp means /var/tmp too	2013-01-26 10:52:32 -05:00
Frederic Crozat	0ae9c92a93	man: systemd.exec - explicit Environment assignment Hi all, while working on another bug, I discovered the "strange" way systemd is parsing Environment= in .service and thought it was worth documenting (because I don't expect people to find this syntax by themselves unless they read the parsing code ;) Be more verbose about using space in Environment field and not using value of other variables Fixes https://bugzilla.redhat.com/show_bug.cgi?id=840260 [zj: expand and reformat the example a bit]	2013-01-24 19:36:47 -05:00
Michal Vyskocil	565d91fdf1	util: continuation support for load_env_file Variable definitions can be written on more than one line - if each ends with a backslash, then is concatenated with a previous one. Only backslash and unix end of line (\n) are treated as a continuation. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=58083 [zj: squashed two patches together; cleaned up grammar; removed comment about ignoring trailing backslash -- it is not ignored.] Document continuation support in systemd.exec	2013-01-18 11:06:15 -05:00
Lennart Poettering	74051b9b58	units: for all unit settings that take lists, allow the empty string for resetting the lists https://bugzilla.redhat.com/show_bug.cgi?id=756787	2013-01-17 02:50:05 +01:00
Zbigniew Jędrzejewski-Szmek	9cc2c8b763	man: add links to directive index to see-alsos systemd.directives(5) is renamed to systemd.directives(7). Section 7 is "Miscellaneous".	2013-01-15 11:30:42 -05:00
Pekka Lundstrom	2bef10ab36	Added globbing support to EnvironmentFile This patch allows globbing to be used with EnvironmentFile option. Example: EnvironmentFile=/etc/foo.d/*.conf t. Pekka	2013-01-04 01:11:50 +01:00
Kay Sievers	8050c22151	man: systemd.exec - mention mount(2) https://bugzilla.redhat.com/show_bug.cgi?id=880552	2012-11-27 11:40:08 +01:00
Holger Hans Peter Freyther	bb11271068	sched: Only setting CPUSchedulingPriority=rr doesn't work A service that only sets the scheduling policy to round-robin fails to be started. This is because the cpu_sched_priority is initialized to 0 and is not adjusted when the policy is changed. Clamp the cpu_sched_priority when the scheduler policy is set. Use the current policy to validate the new priority. Change the manual page to state that the given range only applies to the real-time scheduling policies. Add a testcase that verifies this change: $ make test-sched-prio; ./test-sched-prio [test/sched_idle_bad.service:6] CPU scheduling priority is out of range, ignoring: 1 [test/sched_rr_bad.service:7] CPU scheduling priority is out of range, ignoring: 0 [test/sched_rr_bad.service:8] CPU scheduling priority is out of range, ignoring: 100	2012-11-15 16:16:45 +01:00
Lennart Poettering	df688b23da	man: minor updates	2012-10-26 01:18:41 +02:00
Andrew Eikum	16dad32e43	Reword sentences that contain psuedo-English "resp." As you likely know, Arch Linux is in the process of moving to systemd. So I was reading through the various systemd docs and quickly became baffled by this new abbreviation "resp.", which I've never seen before in my English-mother-tongue life. Some quick Googling turned up a reference: <http://www.transblawg.eu/index.php?/archives/870-Resp.-and-other-non-existent-English-wordsNicht-existente-englische-Woerter.html> I guess it's a literal translation of the German "Beziehungsweise", but English doesn't work the same way. The word "respectively" is used exclusively to provide an ordering connection between two lists. E.g. "the prefixes k, M, and G refer to kilo-, mega-, and giga-, respectively." It is also never abbreviated to "resp." So the sentence "Sets the default output resp. error output for all services and sockets" makes no sense to a natural English speaker. This patch removes all instances of "resp." in the man pages and replaces them with sentences which are much more clear and, hopefully, grammatically valid. In almost all instances, it was simply replacing "resp." with "or," which the original author (Lennart?) could probably just do in the future. The only other instances of "resp." are in the src/ subtree, which I don't feel privileged to correct. Signed-off-by: Andrew Eikum <aeikum@codeweavers.com>	2012-10-16 01:03:01 +02:00
Thomas Hindoe Paaboel Andersen	c53158818d	man: fix a bunch of typos in docs https://bugs.freedesktop.org/show_bug.cgi?id=54501	2012-09-13 19:34:24 +02:00
Lennart Poettering	ac0930c892	namespace: rework namespace support - don't use pivot_root() anymore, just reuse root hierarchy - first create all mounts, then mark them read-only so that we get the right behaviour when people want writable mounts inside of read-only mounts - don't pass invalid combinations of MS_ constants to the kernel	2012-08-13 15:27:04 +02:00
Lennart Poettering	4819ff0358	unit: split off KillContext from ExecContext containing only kill definitions	2012-07-20 00:10:31 +02:00
Lennart Poettering	8351ceaea9	execute: support syscall filtering using seccomp filters	2012-07-17 04:17:53 +02:00
Lennart Poettering	34511ca7b1	man: reword man page titles Make sure the man page titles are similar in style and capitalization so that our man page index looks pretty.	2012-07-16 18:08:25 +02:00
Lennart Poettering	e06c73cc91	unit: set default working directory to the user's home directory when running in user mode	2012-07-16 12:44:42 +02:00
Ville Skyttä	49f43d5f91	Spelling fixes.	2012-07-16 12:16:29 +02:00
Lennart Poettering	cb07866b1b	man: move header file man pages from section 7 to 3 This way we can include documentation about minor macros/inline function within the introducionary man page in a sane way.	2012-07-13 01:50:05 +02:00
Lennart Poettering	d88a251b12	util: introduce a proper nsec_t and make use of it where appropriate	2012-05-31 04:27:03 +02:00
Lennart Poettering	ec8927ca59	main: add configuration option to alter capability bounding set for PID 1 This also ensures that caps dropped from the bounding set are also dropped from the inheritable set, to be extra-secure. Usually that should change very little though as the inheritable set is empty for all our uses anyway.	2012-05-24 04:00:56 +02:00
Lennart Poettering	5430f7f2bc	relicense to LGPLv2.1 (with exceptions) We finally got the OK from all contributors with non-trivial commits to relicense systemd from GPL2+ to LGPL2.1+. Some udev bits continue to be GPL2+ for now, but we are looking into relicensing them too, to allow free copy/paste of all code within systemd. The bits that used to be MIT continue to be MIT. The big benefit of the relicensing is that closed source code may now link against libsystemd-login.so and friends.	2012-04-12 00:24:39 +02:00
Lennart Poettering	169c4f6513	journalctl,loginctl: drop systemd- prefix in binary names Let's make things a bit easier to type, drop the systemd- prefix for journalctl and loginctl, but provide the old names for compat. All systemd binaries are hence now prefixed with "systemd-" with the exception of the three primary user interface binaries: systemctl loginctl journalctl For those three we do provide systemd-xyz names as well, via symlinks: systemd-systemctl → systemctl systemd-loginctl → loginctl systemd-journalctl → journalctl We do this only for the primary user tools, in order to avoid unnecessary namespace problems. That means tools like systemd-notify stay the way they are.	2012-03-26 20:58:47 +02:00
Lennart Poettering	353e12c2f4	service: ignore SIGPIPE by default	2012-02-09 03:18:04 +01:00
Lennart Poettering	9f056f4087	man: document that we support tcpwrappers only for access control We do not support, and explicitly never want to support environment variable settings and suchlike in tcpwrappers. https://bugs.freedesktop.org/show_bug.cgi?id=45143	2012-02-02 06:22:36 +01:00
Kay Sievers	891703e1ee	persistant -> persistent	2012-01-18 21:47:30 +01:00
Lennart Poettering	8d53b4534a	exec: introduce ControlGroupPersistant= to make cgroups persistant	2012-01-18 15:40:21 +01:00
Lennart Poettering	706343f492	journal: introduce log target 'journal' for executed processes	2012-01-06 02:48:38 +01:00
Barry Scott	7734f77373	man: for ExecStart= provide more details on env var substitution and how that turns into arguments. For EnvironmentFile= explain that double quotes can be used to protect whitespace.	2011-10-11 01:11:26 +02:00
Lennart Poettering	de6c78f879	service: change default stdout/stderr to syslog	2011-08-30 22:57:58 +02:00
Lennart Poettering	346bce1f4c	stdout-bridge: rename logger to stdout-syslog-bridge to make it more descriptive	2011-08-30 22:42:49 +02:00
Lennart Poettering	3377af3e22	man: fix securebits docs	2011-08-29 13:44:12 +02:00
Lennart Poettering	94959f0fa0	exec: allow passing arbitrary path names to blkio cgroup attributes If a device node is specified, then adjust the bandwidth/weight of it, otherwise find the backing block device of the file system the path refers to and adjust its bandwidth/weight.	2011-08-21 20:07:45 +02:00
Lennart Poettering	9e37286844	exec: add high-level controls for blkio cgroup attributes	2011-08-21 20:07:08 +02:00
Lennart Poettering	ab1f063390	exec: optionally apply cgroup attributes to the cgroups we create	2011-08-20 00:22:02 +02:00
Lennart Poettering	ff01d048b4	exec: introduce PrivateNetwork= process option to turn off network access to specific services	2011-08-02 05:24:58 +02:00

... 3 4 5 6 7 ...

473 Commits