mirror of
https://github.com/systemd/systemd-stable.git
synced 2025-01-11 05:17:44 +03:00
6b000af4f2
https://tools.ietf.org/html/draft-knodel-terminology-02 https://lwn.net/Articles/823224/ This gets rid of most but not occasions of these loaded terms: 1. scsi_id and friends are something that is supposed to be removed from our tree (see #7594) 2. The test suite defines an API used by the ubuntu CI. We can remove this too later, but this needs to be done in sync with the ubuntu CI. 3. In some cases the terms are part of APIs we call or where we expose concepts the kernel names the way it names them. (In particular all remaining uses of the word "slave" in our codebase are like this, it's used by the POSIX PTY layer, by the network subsystem, the mount API and the block device subsystem). Getting rid of the term in these contexts would mean doing some major fixes of the kernel ABI first. Regarding the replacements: when whitelist/blacklist is used as noun we replace with with allow list/deny list, and when used as verb with allow-list/deny-list.
170 lines
8.3 KiB
Markdown
170 lines
8.3 KiB
Markdown
---
|
|
title: User/Group Name Syntax
|
|
category: Users, Groups and Home Directories
|
|
layout: default
|
|
---
|
|
|
|
# User/Group Name Syntax
|
|
|
|
The precise set of allowed user and group names on Linux systems is weakly
|
|
defined. Depending on the distribution a different set of requirements and
|
|
restrictions on the syntax of user/group names are enforced — on some
|
|
distributions the accepted syntax is even configurable by the administrator. In
|
|
the interest of interoperability systemd enforces different rules when
|
|
processing users/group defined by other subsystems and when defining users/groups
|
|
itself, following the principle of "Be conservative in what you send, be
|
|
liberal in what you accept". Also in the interest of interoperability systemd
|
|
will enforce the same rules everywhere and not make them configurable or
|
|
distribution dependent. The precise rules are described below.
|
|
|
|
Generally, the same rules apply for user as for group names.
|
|
|
|
## Other Systems
|
|
|
|
* On POSIX the set of [valid user
|
|
names](https://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap03.html#tag_03_437)
|
|
is defined as [lower and upper case ASCII letters, digits, period,
|
|
underscore, and
|
|
hyphen](https://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap03.html#tag_03_282),
|
|
with the restriction that hyphen is not allowed as first character of the
|
|
user name. Interestingly no size limit is declared, i.e. in neither
|
|
direction, meaning that strictly speaking according to POSIX both the empty
|
|
string is a valid user name as well as a string of gigabytes in length.
|
|
|
|
* Debian/Ubuntu based systems enforce the regular expression
|
|
`^[a-z][-a-z0-9]*$`, i.e. only lower case ASCII letters, digits and
|
|
hyphens. As first character only lowercase ASCII letters are allowed. This
|
|
regular expression is configurable by the administrator at runtime
|
|
though. This rule enforces a minimum length of one character but no maximum
|
|
length.
|
|
|
|
* Upstream shadow-utils enforces the regular expression
|
|
`^[a-z_][a-z0-9_-]*[$]$`, i.e. is similar to the Debian/Ubuntu rule, but
|
|
allows underscores and hyphens, but the latter not as first character. Also,
|
|
an optional trailing dollar character is permitted.
|
|
|
|
* Fedora/Red Hat based systems enforce the regular expression of
|
|
`^[a-zA-Z0-9_.][a-zA-Z0-9_.-]{0,30}[a-zA-Z0-9_.$-]?$`, i.e. a size limit of
|
|
32 characters, with upper and lower case letters, digits, underscores,
|
|
hyphens and periods. No hyphen as first character though, and the last
|
|
character may be a dollar character. On top of that, `.` and `..` are not
|
|
allowed as user/group names.
|
|
|
|
* sssd is known to generate user names with embedded `@` and white-space
|
|
characters, as well as non-ASCII (i.e. UTF-8) user/group names.
|
|
|
|
* winbindd is known to generate user/group names with embedded `\` and
|
|
white-space characters, as well as non-ASCII (i.e. UTF-8) user/group names.
|
|
|
|
Other operating systems enforce different rules; in this documentation we'll
|
|
focus on Linux systems only however, hence those are out of scope. That said,
|
|
software like Samba is frequently deployed on Linux for providing compatibility
|
|
with Windows systems; on such systems it might be wise to stick to user/group
|
|
names also valid according to Windows rules.
|
|
|
|
## Rules systemd enforces
|
|
|
|
Distilled from the above, below are the rules systemd enforces on user/group
|
|
names. An additional, common rule between both modes listed below is that empty
|
|
strings are not valid user/group names.
|
|
|
|
Philosophically, the strict mode described below enforces an allow list of
|
|
what's allowed and prohibits everything else, while the relaxed mode described
|
|
below implements a deny list of what's not allowed and permits everything else.
|
|
|
|
### Strict mode
|
|
|
|
Strict user/group name syntax is enforced whenever a systemd component is used
|
|
to register a user or group in the system, for example a system user/group
|
|
using
|
|
[`systemd-sysusers.service`](https://www.freedesktop.org/software/systemd/man/systemd-sysusers.html)
|
|
or a regular user with
|
|
[`systemd-homed.service`](https://www.freedesktop.org/software/systemd/man/systemd-homed.html).
|
|
|
|
In strict mode, only uppercase and lowercase characters are allowed, as well as
|
|
digits, underscores and hyphens. The first character may not be a digit or
|
|
hyphen. A size limit is enforced: the minimum of `sysconf(_SC_LOGIN_NAME_MAX)`
|
|
(typically 256 on Linux; rationale: this is how POSIX suggests to detect the
|
|
limit), `UT_NAMESIZE-1` (typically 31 on Linux; rationale: names longer than
|
|
this cannot correctly appear in `utmp`/`wtmp` and create ambiguity with login
|
|
accounting) and `FILENAME_MAX` (4096 on Linux; rationale: user names typically
|
|
appear in directory names, i.e. the home directory), thus MIN(256, 31, 4096) =
|
|
31.
|
|
|
|
Note that these rules are both more strict and more relaxed than all of the
|
|
rules enforced by other systems listed above. A user/group name conforming to
|
|
systemd's strict rules will not necessarily pass a test by the rules enforced
|
|
by these other subsystems.
|
|
|
|
Written as regular expression the above is: `^[a-zA-Z_][a-zA-Z0-9_-]{0,30}$`
|
|
|
|
### Relaxed mode
|
|
|
|
Relaxed user/group name syntax is enforced whenever a systemd component accepts
|
|
and makes use of user/group names registered by other (non-systemd)
|
|
components of the system, for example in
|
|
[`systemd-logind.service`](https://www.freedesktop.org/software/systemd/man/systemd-logind.html).
|
|
|
|
Relaxed syntax is also enforced by the `User=` setting in service unit files,
|
|
i.e. for system services used for running services. Since these users may be
|
|
registered by a variety of tools relaxed mode is used, but since the primary
|
|
purpose of these users is to run a system service and thus a job for systemd a
|
|
warning is shown if the specified user name does not qualify by the strict
|
|
rules above.
|
|
|
|
* No embedded NUL bytes (rationale: handling in C must be possible and
|
|
straight-forward)
|
|
|
|
* No names consisting fully of digits (rationale: avoid confusion with numeric
|
|
UID/GID specifications)
|
|
|
|
* Similar, no names consisting of an initial hyphen and otherwise entirely made
|
|
up of digits (rationale: avoid confusion with negative, numeric UID/GID
|
|
specifications, e.g. `-1`)
|
|
|
|
* No strings that do not qualify as valid UTF-8 (rationale: we want to be able
|
|
to embed these strings in JSON, with permits only valid UTF-8 in its strings;
|
|
user names using other character sets, such as JIS/Shift-JIS will cause
|
|
validation errors)
|
|
|
|
* No control characters (i.e. characters in ASCII range 1…31; rationale: they
|
|
tend to have special meaning when output on a terminal in other contexts,
|
|
moreover the newline character — as a specific control character — is used as
|
|
record separator in `/etc/passwd`, and hence it's crucial to avoid
|
|
ambiguities here)
|
|
|
|
* No colon characters (rationale: it is used as field separator in `/etc/passwd`)
|
|
|
|
* The two strings `.` and `..` are not permitted, as these have special meaning
|
|
in file system paths, and user names are frequently included in file system
|
|
paths, in particular for the purpose of home directories.
|
|
|
|
* Similar, no slashes, as these have special meaning in file system paths
|
|
|
|
* No leading or trailing white-space is permitted; and hence no user/group names
|
|
consisting of white-space only either (rationale: this typically indicates
|
|
parsing errors, and creates confusion since not visible on screen)
|
|
|
|
Note that these relaxed rules are implied by the strict rules above, i.e. all
|
|
user/group names accepted by the strict rules are also accepted by the relaxed
|
|
rules, but not vice versa.
|
|
|
|
Note that this relaxed mode does not refuse a couple of very questionable
|
|
syntaxes. For example it permits a leading or embedded period. A leading period
|
|
is problematic because the matching home directory would typically be hidden
|
|
from the user's/administrator's view. An embedded period is problematic since
|
|
it creates ambiguity in traditional `chown` syntax (which is still accepted
|
|
today) that uses it to separate user and group names in the command's
|
|
parameter: without consulting the user/group databases it is not possible to
|
|
determine if a `chown` invocation would change just the owning user or both the
|
|
owning user and group. It also allows embedding `@` (which is confusing to
|
|
MTAs).
|
|
|
|
## Common Core
|
|
|
|
Combining all rules listed above, user/group names that shall be considered
|
|
valid in all systemd contexts and on all Linux systems should match the
|
|
following regular expression (at least according to our understanding):
|
|
|
|
`^[a-z][a-z0-9-]{0,30}$`
|