systemd

mirror of https://github.com/systemd/systemd.git synced 2024-11-07 01:27:11 +03:00

Author	SHA1	Message	Date
Kyle Walker	1e706c8dff	manager: Fixing a debug printf formatting mistake (#3640 ) A 'llu' formatting statement was used in a debugging printf statement instead of a 'PRIu64'. Correcting that mistake here.	2016-07-01 20:03:35 +03:00
Lennart Poettering	b12cc5b0f8	Merge pull request #3634 from disneyworldguy/v2sigchld manager: Only invoke a single sigchld per unit within a cleanup cycle	2016-06-30 15:57:39 -07:00
Martin Pitt	f15461b2b2	Merge pull request #3596 from poettering/machine-clean make "machinectl clean" asynchronous, and open it up via PolicyKit	2016-06-30 21:30:35 +02:00
Kyle Walker	36f20ae3b2	manager: Only invoke a single sigchld per unit within a cleanup cycle By default, each iteration of manager_dispatch_sigchld() results in a unit level sigchld event being invoked. For scope units, this results in a scope_sigchld_event() which can seemingly stall for workloads that have a large number of PIDs within the scope. The stall exhibits itself as a SIG_0 being initiated for each u->pids entry as a result of pid_is_unwaited(). v2: This patch resolves this condition by only paying to cost of a sigchld in the underlying scope unit once per sigchld iteration. A new "sigchldgen" member resides within the Unit struct. The Manager is incremented via the sd event loop, accessed via sd_event_get_iteration, and the Unit member is set to the same value as the manager each time that a sigchld event is invoked. If the Manager iteration value and Unit member match, the sigchld event is not invoked for that iteration.	2016-06-30 15:16:47 -04:00
Franck Bui	6edefe0b06	pid1: restore console color support for containers (#3595 ) Commit `3a18b60489` introduced a regression that disabled the color mode for container. This patch fixes this.	2016-06-24 16:08:43 +02:00
Lennart Poettering	2b40998d3c	cgroup: minor coding style fix	2016-06-24 15:59:24 +02:00
Lennart Poettering	f4170c671b	execute: add a new easy-to-use RestrictRealtime= option to units It takes a boolean value. If true, access to SCHED_RR, SCHED_FIFO and SCHED_DEADLINE is blocked, which my be used to lock up the system.	2016-06-23 01:45:45 +02:00
Lennart Poettering	abd84d4d83	execute: be a little less drastic when MemoryDenyWriteExecute= hits Let's politely refuse with EPERM rather than kill the whole thing right-away.	2016-06-23 01:35:04 +02:00
Lennart Poettering	686d9ba614	execute: set PR_SET_NO_NEW_PRIVS also in case the exec memory protection is used This was forgotten when MemoryDenyWriteExecute= was added: we should set NNP in all cases when we set seccomp filters.	2016-06-23 01:33:07 +02:00
Lennart Poettering	03857c43ce	execute: use the return value of setrlimit_closest() properly It's a function defined by us, hence we should look for the error in its return value, not in "errno".	2016-06-23 01:31:24 +02:00
Lennart Poettering	fc40065bcd	core: when writing transient unit files, make sure all lines end with a newline This is a fix-up for `2a9a6f8ac0` which covered non-transient units, but missed the case for transient units.	2016-06-23 01:29:33 +02:00
Minkyung	2787d83c28	watchdog: Support changing watchdog_usec during runtime (#3492 ) Add sd_notify() parameter to change watchdog_usec during runtime. Application can change watchdog_usec value by sd_notify like this. Example. sd_notify(0, "WATCHDOG_USEC=20000000"). To reset watchdog_usec as configured value in service file, restart service. Notice. sd_event is not currently supported. If application uses sd_event_set_watchdog, or sd_watchdog_enabled, do not use "WATCHDOG_USEC" option through sd_notify.	2016-06-22 13:26:05 +02:00
Lennart Poettering	98471bf0fa	Merge pull request #3526 from fbuihuu/fix-console-log-color Fix console log color	2016-06-22 12:34:25 +02:00
Franck Bui	3a18b60489	pid1: initialize status color mode after setting up TERM Also we had to connect PID's stdio to null later since colors_enabled() assume that stdout is connected to the console.	2016-06-22 08:29:02 +02:00
Franck Bui	32391275c0	pid1: initialize TERM environment variable correctly When systemd is started by the kernel, the kernel set the TERM environment variable unconditionnally to "linux" no matter the console device used. This might be an issue for dumb devices with no colors support. This patch uses default_term_for_tty() for getting a more accurate value. But it makes sure to keep the user preferences (if any) which might be passed via the kernel command line. For that purpose /proc should be mounted.	2016-06-22 08:28:55 +02:00
Evgeny Vereshchagin	eee0a1e48e	core: log the right set of the supported controllers (#3558 ) Jun 16 05:12:08 systemd[1]: Controller 'io' supported: yes Jun 16 05:12:08 systemd[1]: Controller 'memory' supported: yes Jun 16 05:12:08 systemd[1]: Controller 'pids' supported: yes instead of Jun 16 04:06:50 systemd[1]: Controller 'memory' supported: yes Jun 16 04:06:50 systemd[1]: Controller 'devices' supported: yes Jun 16 04:06:50 systemd[1]: Controller 'pids' supported: yes	2016-06-20 20:40:46 +02:00
Franck Bui	8ce0611e42	Revert "do not pass-along the environment from the kernel or initrd" This reverts commit `ce8aba5681`. We should pass an environment as close as possible to what we originally got.	2016-06-20 18:55:09 +02:00
Franck Bui	affd7ed1a9	pid1: reconnect to the console before being re-executed When re-executed, reconnect the console to PID1's stdios as it was the case when PID1 was initially started by the kernel.	2016-06-20 18:40:51 +02:00
Dave Reisner	222953e87f	Ensure kdbus isn't used (#3501 ) Delete the dbus1 generator and some critical wiring. This prevents kdbus from being loaded or detected. As such, it will never be used, even if the user still has a useful kdbus module loaded on their system. Sort of fixes #3480. Not really, but it's better than the current state.	2016-06-18 17:24:23 -04:00
Lennart Poettering	616aab6085	Merge pull request #3481 from poettering/relative-memcg various changes, most importantly regarding memory metrics	2016-06-16 13:56:23 +02:00
Zbigniew Jędrzejewski-Szmek	732cd53eeb	Merge pull request #3537 from poettering/journal-stream-env Permit services to detect whether their stdout/stderr is connected to the journal.	2016-06-15 21:30:59 -04:00
Zbigniew Jędrzejewski-Szmek	a1feacf77f	load-fragment: ignore ENOTDIR/EACCES errors (#3510 ) If for whatever reason the file system is "corrupted", we want to be resilient and ignore the error, as long as we can load the units from a different place. Arch bug https://bugs.archlinux.org/task/49547. A user had an ntfs symlink (essentially a file) instead of a directory after restoring from backup. We should just ignore that like we would treat a missing directory, for general resiliency. We should treat permission errors similarly. For example an unreadable /usr/local/lib directory would prevent (user) instances of systemd from loading any units. It seems better to continue.	2016-06-15 23:02:27 +02:00
Lennart Poettering	7bce046bcf	core: set $JOURNAL_STREAM to the dev_t/ino_t of the journal stream of executed services This permits services to detect whether their stdout/stderr is connected to the journal, and if so talk to the journal directly, thus permitting carrying of metadata. As requested by the gtk folks: #2473	2016-06-15 23:00:27 +02:00
Lennart Poettering	fd1f9c89f7	execute: minor coding style improvements	2016-06-15 22:51:01 +02:00
Lennart Poettering	8e38570ebe	tree-wide: htonl() is weird, let's use htobe32() instead (#3538 ) Super-important change, yeah!	2016-06-15 01:26:01 +02:00
Lennart Poettering	3f71dec5d7	unit: properly comment generated comments in unit files Fix-up for `2a9a6f8ac0`	2016-06-14 20:01:45 +02:00
Lennart Poettering	d58d600efd	systemctl: allow percent-based MemoryLimit= settings via systemctl set-property The unit files already accept relative, percent-based memory limit specification, let's make sure "systemctl set-property" support this too. Since we want the physical memory size of the destination machine to apply we pass the percentage in a new set of properties that only exist for this purpose, and can only be set.	2016-06-14 20:01:45 +02:00
Lennart Poettering	d8cf2ac79b	util: introduce physical_memory_scale() to unify how we scale by physical memory The various bits of code did the scaling all different, let's unify this, given that the code is not trivial.	2016-06-14 20:01:45 +02:00
Lennart Poettering	799ec13412	core: make sure to use "infinity" in unit files, not "max" THe latter is a kernelism, we only understand "infinity".	2016-06-14 19:50:38 +02:00
Lennart Poettering	cd0a7a8e58	core: when receiving a memory limit via the bus, refuse 0 When parsing unit files we already refuse unit memory limits of zero, let's also refuse it when the value is set via the bus.	2016-06-14 19:50:38 +02:00
Lennart Poettering	875ae5661a	core: optionally, accept a percentage value for MemoryLimit= and related settings If a percentage is used, it is taken relative to the installed RAM size. This should make it easier to write generic unit files that adapt to the local system.	2016-06-14 19:50:38 +02:00
Lennart Poettering	9184ca48ea	util-lib: introduce parse_percent() for parsing percent specifications And port a couple of users over to it.	2016-06-14 19:50:38 +02:00
Lukáš Nykrýn	4892084f09	manager: reduce complexity of unit_gc_sweep (#3507 ) When unit is marked as UNSURE, we are trying to find if it state was changed over and over again. So lets not go through the UNSURE states again. Also when we find a GOOD unit lets propagate the GOOD state to all units that this unit reference. This is a problem on machines with a lot of initscripts with different starting priority, since those units will reference each other and the original algorithm might get to n! complexity. Thanks HATAYAMA Daisuke for the expand_good_state code.	2016-06-14 14:20:56 +02:00
Evgeny Vereshchagin	bbc85a16e1	core: on unified we don't need to check u->pids: we can use proper notifications (#3531 ) Fixes: #3483	2016-06-14 14:08:01 +02:00
Ivan Shapovalov	dcd6145002	core: parse `rd.rescue` and `rd.emergency` as initrd-specific shorthands (#3488 ) Typing `rd.rescue` is easier than `rd.systemd.unit=rescue.target`.	2016-06-13 16:28:42 +02:00
Jouke Witteveen	2065ca699b	core/execute: pass env vars to PAM session setup (#3503 ) Move the merger of environment variables before setting up the PAM session and pass the aggregate environment to PAM setup. This allows control over the PAM session hooks through environment variables. PAM session initiation may update the environment. On successful initiation of a PAM session, we adopt the environment of the PAM context.	2016-06-13 12:50:12 +02:00
Franck Bui	64c3610b55	core: disable colors when displaying cylon when systemd.log_color=off (#3495 )	2016-06-10 18:33:15 +02:00
Alessandro Puccetti	cf677fe686	core/execute: add the magic character '!' to allow privileged execution (#3493 ) This patch implements the new magic character '!'. By putting '!' in front of a command, systemd executes it with full privileges ignoring paramters such as User, Group, SupplementaryGroups, CapabilityBoundingSet, AmbientCapabilities, SecureBits, SystemCallFilter, SELinuxContext, AppArmorProfile, SmackProcessLabel, and RestrictAddressFamilies. Fixes partially https://github.com/systemd/systemd/issues/3414 Related to https://github.com/coreos/rkt/issues/2482 Testing: 1. Create a user 'bob' 2. Create the unit file /etc/systemd/system/exec-perm.service (You can use the example below) 3. sudo systemctl start ext-perm.service 4. Verify that the commands starting with '!' were not executed as bob, 4.1 Looking to the output of ls -l /tmp/exec-perm 4.2 Each file contains the result of the id command. ````````````````````````````````````````````````````````````````` [Unit] Description=ext-perm [Service] Type=oneshot TimeoutStartSec=0 User=bob ExecStartPre=!/usr/bin/sh -c "/usr/bin/rm /tmp/exec-perm*" ; /usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-start-pre" ExecStart=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-start" ; !/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-star-2" ExecStartPost=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-start-post" ExecReload=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-reload" ExecStop=!/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-stop" ExecStopPost=/usr/bin/sh -c "/usr/bin/id > /tmp/exec-perm-stop-post" [Install] WantedBy=multi-user.target] `````````````````````````````````````````````````````````````````	2016-06-10 18:19:54 +02:00
Lennart Poettering	9d3e340639	load-fragment: don't try to do a template instance replacement if we are not an instance (#3451 ) Corrects: `7aad67e7` Fixes: #3438	2016-06-09 10:49:36 +02:00
Lennart Poettering	1ff74fb6e3	execute: check whether the specified fd is a tty before chowning/chmoding it (#3457 ) Let's add an extra safety check before we chmod/chown a TTY to the right user, as we might end up having connected something to STDIN/STDOUT that is actually not a TTY, even though this might have been requested, due to permissive StandardInput= settings or transient service activation with fds passed in. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=85255	2016-06-09 10:01:16 +02:00
Lennart Poettering	cf6f7f66a4	core: add minor comment Let's explain #3444 briefly in the sources, too.	2016-06-06 22:03:31 +02:00
michaelolbrich	53203e5f8f	mount: make sure got into MOUNT_DEAD state after a successful umount (#3444 ) Without this code the following can happen: 1. Open a file to keep a mount busy 2. Try to stop the corresponding mount unit with systemctl -> umount fails and the failure is remembered in mount->result 3. Close the file and umount the filesystem manually -> mount_dispatch_io() calls "mount_enter_dead(mount, MOUNT_SUCCESS)" -> Old error in mount->result is reused and the mount unit enters a failed state Clear the old error result when 'mountinfo' reports a successful umount to fix this.	2016-06-06 21:59:51 +02:00
michaelolbrich	0a62f81045	automount: handle expire_tokens when the mount unit changes its state (#3434 ) This basically reverts `7b2fd9d512` ("core: remove duplicate code in automount_update_mount()"). This was not duplicate code. The expire_tokens need to be handled as well: Send 0 == success for MOUNT_DEAD (umount successful), do nothing for MOUNT_UNMOUNTING (not yet done) and an error for everything else. Otherwise the automount logic will assume unmounting is not done and will not send any new requests for mounting. As a result, the corresponding mount unit is never mounted. Without this, automounts with TimeoutIdleSec= are broken. Once the idle timeout triggered a umount, any access to the corresponding filesystem hangs forever. Fixes #3332.	2016-06-05 11:25:14 -04:00
Topi Miettinen	f3e4363593	core: Restrict mmap and mprotect with PAGE_WRITE\|PAGE_EXEC (#3319 ) (#3379 ) New exec boolean MemoryDenyWriteExecute, when set, installs a seccomp filter to reject mmap(2) with PAGE_WRITE\|PAGE_EXEC and mprotect(2) with PAGE_EXEC.	2016-06-03 17:58:18 +02:00
Lennart Poettering	18b9beb199	Merge pull request #3418 from htejun/cgroup2-log-compat core: log cgroup legacy and unified hierarchy setting translations	2016-06-03 17:49:50 +02:00
Tejun Heo	e57c9ce169	core: always use "infinity" for no upper limit instead of "max" (#3417 ) Recently added cgroup unified hierarchy support uses "max" in configurations for no upper limit. While consistent with what the kernel uses for no upper limit, it is inconsistent with what systemd uses for other controllers such as memory or pids. There's no point in introducing another term. Update cgroup unified hierarchy support so that "infinity" is the only term that systemd uses for no upper limit.	2016-06-03 17:49:05 +02:00
Tejun Heo	128fadc927	core: log cgroup legacy and unified hierarchy setting translations To accommodate changes in kernel interface, cgroup unified hierarchy support added several configuration items which overlap with the existing resource control settings and there is simple config translation between the overlapping settings to ease the transition. As why certain cgroup knobs are being configured can become confusing, this patch adds a master warning message which is printed once when such translation is first used and logs each translation with a debug message. v2: - Switched to log_unit*().	2016-06-02 13:02:53 -04:00
Tejun Heo	f29ff1159c	core: pass Unit into cgroup_context_apply() and use log_unit() cgroup_context_apply() and friends take CGroupContext and cgroup path as input and has no way of getting back to the associated Unit and thus uses raw cgroup path for logging. This makes the log messages difficult to track down. There's no reason to avoid passing in Unit into these functions. Pass in Unit and use log_unit() instead. While at it, make cgroup_context_apply(), which has no outside users, static. Also, drop cgroup path from log messages where the path itself isn't too interesting and can be easily obtained from the unit.	2016-06-02 13:02:49 -04:00
Topi Miettinen	201c1cc22a	core: add pre-defined syscall groups to SystemCallFilter= (#3053 ) (#3157 ) Implement sets of system calls to help constructing system call filters. A set starts with '@' to distinguish from a system call. Closes: #3053, #3157	2016-06-01 11:56:01 +02:00
Michael Karcher	8869a0b40b	util-lib: Add sparc64 support for process creation (#3348 ) The current raw_clone function takes two arguments, the cloning flags and a pointer to the stack for the cloned child. The raw cloning without passing a "thread main" function does not make sense if a new stack is specified, as it returns in both the parent and the child, which will fail in the child as the stack is virgin. All uses of raw_clone indeed pass NULL for the stack pointer which indicates that both processes should share the stack address (so you better don't pass CLONE_VM). This commit refactors the code to not require the caller to pass the stack address, as NULL is the only sensible option. It also adds the magic code needed to make raw_clone work on sparc64, which does not return 0 in %o0 for the child, but indicates the child process by setting %o1 to non-zero. This refactoring is not plain aesthetic, because non-NULL stack addresses need to get mangled before being passed to the clone syscall (you have to apply STACK_BIAS), whereas NULL must not be mangled. Implementing the conditional mangling of the stack address would needlessly complicate the code. raw_clone is moved to a separete header, because the burden of including the assert machinery and sched.h shouldn't be applied to every user of missing_syscalls.h	2016-05-29 20:03:51 -04:00
Zbigniew Jędrzejewski-Szmek	b27b4b51c6	tree-wide: remove newlines from unit_write_drop_in This reverts part of #3329, but all for a good cause.	2016-05-28 16:29:42 -04:00
Zbigniew Jędrzejewski-Szmek	2a9a6f8ac0	core/unit: append newline when writing drop ins unit_write_drop_in{,_private}{,_format} are all affected. We already append a header to the file (and section markers), so those functions can only be used to write a whole file at once. Including the newline at the end feels natural. After this commit newlines will be duplicated. They will be removed in subsequent commit. Also, rewrap the "autogenerated" header to fit within 80 columns.	2016-05-28 16:17:54 -04:00
Zbigniew Jędrzejewski-Szmek	16690d0e15	Merge pull request #3328 from poettering/ipv4ll-ipv4acd-fixes various sd-Ipv4ll/sd-ipv4acd fixes	2016-05-28 11:48:37 -04:00
Tejun Heo	da4d897e75	core: add cgroup memory controller support on the unified hierarchy (#3315 ) On the unified hierarchy, memory controller implements three control knobs - low, high and max which enables more useable and versatile control over memory usage. This patch implements support for the three control knobs. * MemoryLow, MemoryHigh and MemoryMax are added for memory.low, memory.high and memory.max, respectively. * As all absolute limits on the unified hierarchy use "max" for no limit, make memory limit parse functions accept "max" in addition to "infinity" and document "max" for the new knobs. * Implement compatibility translation between MemoryMax and MemoryLimit. v2: - Fixed missing else's in config_parse_memory_limit(). - Fixed missing newline when writing out drop-ins. - Coding style updates to use "val > 0" instead of "val". - Minor updates to documentation.	2016-05-27 18:10:18 +02:00
Lennart Poettering	ffe54bf4bf	Merge pull request #3329 from htejun/dbus-cgroup-fixes dbus-cgroup fixes	2016-05-27 18:06:18 +02:00
Tejun Heo	83afa09d97	core: fix missing newline when writing drop-in for WorkingDirectory (#3337 )	2016-05-27 17:41:45 +02:00
Lennart Poettering	3d0b8a55f2	manager: remove spurious newline	2016-05-26 15:34:41 +02:00
Tejun Heo	0c2d96f5f5	core: fix missing newlines when writing out drop-ins for cgroup settings Except for per-device BlockIO, IO and DeviceAllow/Deny settings, all were missing newline causing the next drop-in to be concatenated at the end of the line. Fix it.	2016-05-23 16:48:46 -04:00
Tejun Heo	6fb0926976	core: fix the reversed sanity check when setting StartupBlockIOWeight over dbus bus_cgroup_set_property() was rejecting if the input value was in range. Reverse it.	2016-05-23 16:48:46 -04:00
Tejun Heo	508c45daa4	core: put opening curly braces on the same line as function names (#3313 ) Recently added cgroup helper functions break the style convention. Fix them up.	2016-05-20 23:46:42 +03:00
Lennart Poettering	7673628066	Revert "core/dbus: further simplify branch code" (#3307 )	2016-05-20 23:36:22 +03:00
Lennart Poettering	f9e26ecc48	Merge pull request #3290 from htejun/cgroup2-io-compat Implement compat translation between IO* and BlockIO* settings	2016-05-20 18:53:11 +02:00
Jonathan Boulle	4e282d11b7	core/dbus: revert oversimplification (#3309 ) free_and_strdup handles NULL but not empty strings. See also: https://github.com/systemd/systemd/pull/3283#issuecomment-220603145 https://github.com/systemd/systemd/pull/3307	2016-05-20 16:20:00 +02:00
Jonathan Boulle	186ad4b1a0	core/dbus: expose SELinuxContext property (#3284 ) Adds support to core for systemd D-Bus clients to send the `SELinuxContext` property . This means `systemd-run -p SELinuxContext=foo` should now work.	2016-05-20 15:09:14 +02:00
Jonathan Boulle	42e1d23f32	core/dbus: further simplify branch code (#3283 ) free_and_strdup already handles the NULL case for us, so we can remove an extraneous conditional check. As noted in https://github.com/systemd/systemd/pull/3279/files#r63687717	2016-05-18 20:54:22 -04:00
Tejun Heo	538b48524c	core: translate between IO and BlockIO settings to ease transition Due to the substantial interface changes in cgroup unified hierarchy, new IO settings are introduced. Currently, IO settings apply only to unified hierarchy and BlockIO to legacy. While the transition is necessary, it's painful for users to have to provide configs for both. This patch implements translation from one config set to another for configs which make sense. * The translation takes place during application of the configs. Users won't see IO or BlockIO settings appearing without being explicitly created. * The translation takes place only if there is no config for the matching cgroup hierarchy type at all. While this doesn't provide comprehensive compatibility, it should considerably ease transition to the new IO settings which are a superset of BlockIO settings. v2: - Update test-cgroup-mask.c so that it accounts for the fact that CGROUP_MASK_IO and CGROUP_MASK_BLKIO move together. Also, test/parent.slice now sets IOWeight instead of BlockIOWeight.	2016-05-18 17:35:12 -07:00
Tejun Heo	64faf04c1f	core: factor out io and blkio helper functions from cgroup_context_apply() Factor out the following functions out of cgroup_context_apply() * cgroup_context_[blk]io_weight() * cgroup_apply_[blk]io_device_weight() * cgroup_apply_[blk]io_device_limit() This is pure refactoring and shouldn't cause any functional differences.	2016-05-18 13:51:46 -07:00
Tejun Heo	979d03117f	core: update CGroupBlockIODeviceBandwidth to record both rbps and wbps CGroupBlockIODeviceBandwith is used to keep track of IO bandwidth limits for legacy cgroup hierarchies. Unlike the unified hierarchy counterpart CGroupIODeviceLimit, a CGroupBlockIODeviceBandwiddth records either a read or write limit and has a couple issues. * There's no way to clear specific config entry. * When configs are cleared for an IO direction of a unit, the kernel settings aren't cleared accordingly creating discrepancies. This patch updates CGroupBlockIODeviceBandwidth so that it behaves similarly to CGroupIODeviceLimit - each entry records both rbps and wbps limits and is cleared if both are at default values after kernel settings are updated.	2016-05-18 13:51:46 -07:00
Tejun Heo	ac06a0cf8a	core: add support for IOReadIOPSMax and IOWriteIOPSMax cgroup IO controller supports maximum limits for both bandwidth and IOPS but systemd resource control currently only supports bandwidth limits. This patch adds support for IOReadIOPSMax and IOWriteIOPSMax when unified cgroup hierarchy is in use. It isn't difficult to also add BlockIOReadIOPS and BlockIOWriteIOPS for legacy hierarchies but IO control on legacy hierarchies is half-broken anyway, so let's leave it alone for now.	2016-05-18 13:50:56 -07:00
Tejun Heo	9be572497d	core: introduce CGroupIOLimitType enums Currently, there are two cgroup IO limits, bandwidth max for read and write, and they are hard-coded in various places. This is fine for two limits but IO is expected to grow more limits - low, high and max limits for bandwidth and IOPS - and hard-coding each limit won't make sense. This patch replaces hard-coded limits with an array indexed by CGroupIOLimitType and accompanying string and default value tables so that new limits can be added trivially.	2016-05-18 13:50:56 -07:00
Jonathan Boulle	a363a2e45f	core/dbus: use free_and_strdup to simplify code (#3279 ) Makes it consistent with the other branches here.	2016-05-17 19:30:37 -04:00
Lennart Poettering	3103459e90	Merge pull request #3193 from htejun/cgroup-io-controller core: add io controller support on the unified hierarchy	2016-05-16 22:05:27 +02:00
Michal Sekletar	833f92ad39	core: don't log job status message in case job was effectively NOP (#3199 ) We currently generate log message about unit being started even when unit was started already and job didn't do anything. This is because job was requested explicitly and hence became anchor job of the transaction thus we could not eliminate it. That is fine but, let's not pollute journal with useless log messages. $ systemctl start systemd-resolved $ systemctl start systemd-resolved $ systemctl start systemd-resolved Current state: $ journalctl -u systemd-resolved \| grep Started May 05 15:31:42 rawhide systemd[1]: Started Network Name Resolution. May 05 15:31:59 rawhide systemd[1]: Started Network Name Resolution. May 05 15:32:01 rawhide systemd[1]: Started Network Name Resolution. After patch applied: $ journalctl -u systemd-resolved \| grep Started May 05 16:42:12 rawhide systemd[1]: Started Network Name Resolution. Fixes #1723	2016-05-16 11:24:51 -04:00
topimiettinen	737ba3c82c	namespace: Make private /dev noexec and readonly (#3263 ) Private /dev will not be managed by udev or others, so we can make it noexec and readonly after we have made all device nodes. As /dev/shm needs to be writable, we can't use bind_remount_recursive().	2016-05-15 22:34:05 -04:00
Tejun Heo	99e66921c8	core: allow slice to be overriden if cgroups aren't realized (#3246 ) unit_set_slice() fails with -EBUSY if the unit already has a slice associated with it. This makes it impossible to override slice through dropin config or over dbus. There's no reason to disallow slice changes as long as cgroups aren't realized. Fix it. Fixes #3240. Signed-off-by: Tejun Heo <htejun@fb.com> Reported-by: Davide Cavalca <dcavalca@fb.com>	2016-05-14 15:56:53 -04:00
topimiettinen	9e5f825280	namespace: unmount old /dev under our new private /dev (#3254 ) Drop all dangling old /dev mounts before mounting a new private /dev tree.	2016-05-14 12:46:23 -04:00
kayrus	6900c740e1	core: added ListUnitsByNames dbus method (#3182 ) This new method returns information by unit names. Instead of ListUnitsByPatterns this method returns information of inactive and even unexisting units. Moved dbus unit reply logic into a separate shared function. Resolves https://github.com/coreos/fleet/pull/1418	2016-05-12 18:58:59 +02:00
Lennart Poettering	4b273d46bb	Merge pull request #3220 from keszybz/install-fixes Fix "preset-all" with dangling symlinks and install-section hint emitted too eagerly	2016-05-10 16:44:29 +02:00
Lennart Poettering	d31645adef	tree-wide: port more code to use ifname_valid()	2016-05-09 15:45:31 +02:00
Martin Pitt	d75103d4c6	Merge pull request #3202 from poettering/socket-fixes don't reopen socket fds when reloading the daemon	2016-05-08 21:09:35 +02:00
Zbigniew Jędrzejewski-Szmek	94ad3616c8	core/mount: add helper function for mount states	2016-05-07 16:19:53 -04:00
Zbigniew Jędrzejewski-Szmek	74ad38ff0e	Merge pull request #3160 from htejun/cgroup-fixes-rev2 Cgroup fixes.	2016-05-07 15:08:57 -04:00
Evgeny Vereshchagin	5ab42bc85a	Merge pull request #3191 from poettering/cgroups-agent-dgram core: use an AF_UNIX/SOCK_DGRAM socket for cgroup agent notification	2016-05-07 19:17:44 +03:00
Evgeny Vereshchagin	1745fa70e7	core: dump TriggerLimitIntervalSec and TriggerLimitBurst too	2016-05-06 21:03:16 +00:00
Evgeny Vereshchagin	5d105c4a4f	core: expose TriggerLimitIntervalUSec Before: $ systemctl show --property TriggerLimitIntervalSec test.socket TriggerLimitIntervalSec=2000000 After: $ systemctl show --property TriggerLimitIntervalUSec test.socket TriggerLimitIntervalUSec=2s	2016-05-06 20:14:06 +00:00
Lennart Poettering	f76707da45	core: update the right mtime after finishing writing of transient units (#3203 ) Fixes: #3194	2016-05-06 19:22:22 +03:00
Lennart Poettering	60d9771c59	core: rework how we flush incoming traffic when a socket unit goes down Previously, we'd simply close and reopen the socket file descriptors. This is problematic however, as we won't transition through the SOCKET_CHOWN state then, and thus the file ownership won't be correct for the sockets. Rework the flushing logic, and actually read any queued data from the sockets for flushing, and accept any queued messages and disconnect them.	2016-05-06 13:29:26 +02:00
Lennart Poettering	01a8b46757	core: don't implicit open missing socket fds on daemon reload Previously, when the daemon was reloaded and the configuration of a socket unit file was changed so that a different set of socket ports was defined for the socket we'd simply reopen the socket fds not yet open. This is problematic however, as this means the SOCKET_CHOWN state is not run for them, and thus their UID/GID is not corrected. With this change, don't open the missing file descriptors, but log about this issue, and ask the user to restart the socket explicit, to make sure all missing fds are opened. Fixes: #3171	2016-05-06 13:01:17 +02:00
Lennart Poettering	d24e561d96	core: split out selinux label retrieval logic into a function of its own This should bring no behavioural change.	2016-05-06 12:16:58 +02:00
Tejun Heo	13c31542cc	core: add io controller support on the unified hierarchy On the unified hierarchy, blkio controller is renamed to io and the interface is changed significantly. * blkio.weight and blkio.weight_device are consolidated into io.weight which uses the standardized weight range [1, 10000] with 100 as the default value. * blkio.throttle.{read\|write}_{bps\|iops}_device are consolidated into io.max. Expansion of throttling features is being worked on to support work-conserving absolute limits (io.low and io.high). * All stats are consolidated into io.stats. This patchset adds support for the new interface. As the interface has been revamped and new features are expected to be added, it seems best to treat it as a separate controller rather than trying to expand the blkio settings although we might add automatic translation if only blkio settings are specified. * io.weight handling is mostly identical to blkio.weight[_device] handling except that the weight range is different. * Both read and write bandwidth settings are consolidated into CGroupIODeviceLimit which describes all limits applicable to the device. This makes it less painful to add new limits. * "max" can be used to specify the maximum limit which is equivalent to no config for max limits and treated as such. If a given CGroupIODeviceLimit doesn't contain any non-default configs, the config struct is discarded once the no limit config is applied to cgroup. * lookup_blkio_device() is renamed to lookup_block_device(). Signed-off-by: Tejun Heo <htejun@fb.com>	2016-05-05 16:43:06 -04:00
Lennart Poettering	d2a50e3b52	core: fix owner user/group output in socket dump The unit file settings are called SocketUser= and SocketGroup= hence name these fields that way in the "systemd-analyze dump" output too. https://github.com/systemd/systemd/issues/3171#issuecomment-216216995	2016-05-05 22:34:47 +02:00
Lennart Poettering	1f15ce2846	core: change default trigger limits for socket units Let's lower the default values a bit, and pick different defaults for Accept=yes and Accept=no sockets. Fixes: #3167	2016-05-05 22:34:47 +02:00
Lennart Poettering	fc2fffe770	tree-wide: introduce new SOCKADDR_UN_LEN() macro, and use it everywhere The macro determines the right length of a AF_UNIX "struct sockaddr_un" to pass to connect() or bind(). It automatically figures out if the socket refers to an abstract namespace socket, or a socket in the file system, and properly handles the full length of the path field. This macro is not only safer, but also simpler to use, than the usual offsetof() + strlen() logic.	2016-05-05 22:24:36 +02:00
Lennart Poettering	d8fdc62037	core: use an AF_UNIX/SOCK_DGRAM socket for cgroup agent notification dbus-daemon currently uses a backlog of 30 on its D-bus system bus socket. On overloaded systems this means that only 30 connections may be queued without dbus-daemon processing them before further connection attempts fail. Our cgroups-agent binary so far used D-Bus for its messaging, and hitting this limit hence may result in us losing cgroup empty messages. This patch adds a seperate cgroup agent socket of type AF_UNIX/SOCK_DGRAM. Since sockets of these types need no connection set up, no listen() backlog applies. Our cgroup-agent binary will hence simply block as long as it can't enqueue its datagram message, so that we won't lose cgroup empty messages as likely anymore. This also rearranges the ordering of the processing of SIGCHLD signals, service notification messages (sd_notify()...) and the two types of cgroup notifications (inotify for the unified hierarchy support, and agent for the classic hierarchy support). We now always process events for these in the following order: 1. service notification messages (SD_EVENT_PRIORITY_NORMAL-7) 2. SIGCHLD signals (SD_EVENT_PRIORITY_NORMAL-6) 3. cgroup inotify and cgroup agent (SD_EVENT_PRIORITY_NORMAL-5) This is because when receiving SIGCHLD we invalidate PID information, which we need to process the service notification messages which are bound to PIDs. Hence the order between the first two items. And we want to process SIGCHLD metadata to detect whether a service is gone, before using cgroup notifications, to decide when a service is gone, since the former carries more useful metadata. Related to this: https://bugs.freedesktop.org/show_bug.cgi?id=95264 https://github.com/systemd/systemd/issues/1961	2016-05-05 12:37:04 +02:00
Lennart Poettering	1ed7ebcfca	Merge pull request #3170 from poettering/v230-preparation-fixes make virtualization detection quieter, rework unit start limit logic, detect unit file drop-in changes correctly, fix autofs state propagation	2016-05-04 10:46:13 +02:00
Zbigniew Jędrzejewski-Szmek	a82394c889	Merge pull request #2921 from keszybz/do-not-report-masked-units-as-changed	2016-05-03 14:08:39 -04:00
Zbigniew Jędrzejewski-Szmek	d43bbb52de	Revert "Do not report masked units as changed (#2921 )" This reverts commit `6d10d308c6`. It got squashed by mistake.	2016-05-03 14:08:23 -04:00
Lennart Poettering	5c6c275e43	Merge pull request #3162 from keszybz/alias-refusal Refuse Alias, DefaultInstance, templated units in install (as appropriate)	2016-05-02 20:40:54 +02:00
Lennart Poettering	dbb0578edc	automount: move resetting of expiry timeout to automount_set_state() that way we can be sure that there's no expiry timeout in place at any time when we aren't in the RUNNING state.	2016-05-02 17:12:35 +02:00
Lennart Poettering	fae03ed32a	automount: rework propagation between automount and mount units Port the progagation logic to the generic Unit->trigger_notify() callback logic in the unit vtable, that is called for a unit not only when the triggered unit of it changes state but also when a job for that unit finishes. This, firstly allows us to make the code a bit cleaner and more generic, but more importantly, allows us to notice correctly when a mount job fails, and propagate that back to autofs client processes. Fixes: #2181	2016-05-02 16:51:45 +02:00

1 2 3 4 5 ...

2611 Commits