the clusterlog struct is a basic ring buffer:
struct clog_base {
    uint32_t size;   // total size of this clog_base
    uint32_t cpos;   // index into data, starts counting at start of clog_base, initially 0
    char data[];
};
an entry consists of indices of the next and previous entries and
various fields (fixed-length ones omitted here):
typedef struct {
    uint32_t prev;   // index of previous entry, or 0 if none exists
    uint32_t next;   // index of next entry
    [..]             // fixed-length fields
    uint8_t node_len;
    uint8_t ident_len;
    uint8_t tag_len;
    uint32_t msg_len;
    char data[];     // node+ident+tag+msg - variable-length fields
} clog_entry_t;
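like 'cpos', the 'prev'/'next' fields are indices counted from the start
of the clog_base, with 0 meaning "no entry". as a minimal sketch
(clog_entry_at is just an illustrative helper, not part of the actual
code), resolving such an index looks like this:

// illustrative helper (not in the actual code): resolve a ring buffer
// index to an entry pointer, treating index 0 as "no entry"
static inline clog_entry_t *
clog_entry_at(struct clog_base *clog, uint32_t index)
{
    return index ? (clog_entry_t *)((char *)clog + index) : NULL;
}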
the next and prev indices are calculated when allocating a new entry,
and the position of the current entry 'cpos' is updated accordingly
(clog_alloc_entry - see the sketch after this list):
- size of the entry is padded with up to 7 bytes
- first entry goes to index 8
- second and subsequent entries go to the current entry's 'next' index
- if the current entry's 'next' index is out of bounds, the first entry
is overwritten => wrap-around
- the 'prev' index of the new entry is set to cpos
- cpos is set to the index of the new entry
- the 'next' index of the new entry is set to its index+padded size
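putting these steps together, the allocation roughly looks like the
following (a simplified sketch of the behaviour described above, not the
actual clog_alloc_entry - size/sanity checks and the setup of the
fixed-length fields are omitted):

static clog_entry_t *
alloc_entry_sketch(struct clog_base *clog, uint32_t entry_size)
{
    uint32_t size = (entry_size + 7) & ~7U;    // pad to a multiple of 8

    uint32_t index = 8;                        // first entry goes to index 8
    if (clog->cpos) {
        clog_entry_t *cur = (clog_entry_t *)((char *)clog + clog->cpos);
        index = cur->next;                     // subsequent entries go to 'next'
        if ((index + size) > clog->size)
            index = 8;                         // out of bounds => wrap around
    }

    clog_entry_t *entry = (clog_entry_t *)((char *)clog + index);
    entry->prev = clog->cpos;                  // 'prev' of the new entry is cpos
    clog->cpos = index;                        // cpos now points at the new entry
    entry->next = index + size;                // where the next entry will go

    return entry;
}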
when iterating over the entries, the following bounds are used to follow
the 'prev' links starting at the current entry:
while (cpos && (cpos <= clog->cpos || cpos > (clog->cpos + CLOG_MAX_ENTRY_SIZE))) {
while this handles a not-yet-wrapped-around ring buffer (cpos becomes 0
once the first entry has been visited), and tries to handle wrap-arounds
by terminating when reaching a 'red zone' of CLOG_MAX_ENTRY_SIZE bytes
starting at the current entry (this covers the current entry, which was
already visited as the first entry of the iteration, and the entry right
after it, which might have been overwritten), it is possible that entries
line up so that the wrap-around 'prev' index of the first entry points to
a location *before* the current entry.
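to put the loop condition in context, the backwards iteration roughly
looks like this (simplified sketch, with visit_entry() standing in for
the per-entry work done by the respective caller):

uint32_t cpos = clog->cpos;
while (cpos && (cpos <= clog->cpos || cpos > (clog->cpos + CLOG_MAX_ENTRY_SIZE))) {
    clog_entry_t *cur = (clog_entry_t *)((char *)clog + cpos);
    visit_entry(cur);
    cpos = cur->prev;   // 0 terminates the loop after the first entry was visited
}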
for example, consider a clog_base where S marks the size field and C the
cpos field, followed by the actual data. N/P are the next/prev indices of
the entry at C, Q denotes the 'prev' index of the first entry in the data
array, and 'R' the red zone used for the loop check in case of
wrap-around.
first, fill up the buffer with six large entries:
Q P C N
| | | |
| | | |
v v v v
+-+-+------+------+------+------+------+------+-+
| | | | | | | | |x|
| | | 1 | 2 | 3 | 4 | 5 | 6 |x|
| | | | | | | | |x|
+-+-+------+------+------+------+------+------+-+
S C RRRRRRRRRRR
iterating backwards from C ends up at Q being 0, terminating the loop
without a wrap-around after having visited 6->1
now the next (in this example, smaller) entry that gets allocated/inserted
needs to wrap around, because the empty space at the end (denoted by 'x')
is too small:
C N QP
| | ||
| | ||
v v vv
+-+-+------+------+------+------+------+------+-+
| | | | | | | | |x|
| | | 7 | 2 | 3 | 4 | 5 | 6 |x|
| | | | | | | | |x|
+-+-+------+------+------+------+------+------+-+
S C RRRRRRRRRRR
iterating backwards from C terminates the loop when reaching the red
zone, with entry #2 no longer being considered since it partly overlaps
it. only 7->3 are visited.
adding more entries we end up with the following layout:
P QC N
| || |
| || |
v vv v
+-+-+------+---+---+---+---+---+---+---+---+--+-+
| | | | | | | | | | | |##|x|
| | | 7 | 8 | 9 |10 |11 |12 |13 |14 |15 |#6|x|
| | | | | | | | | | | |##|x|
+-+-+------+---+---+---+---+---+---+---+---+--+-+
S C RRRRRRRRRRR
with # denoting the part of the last large entry (#6) that is still
unmodified (the rest of that entry's data has been overwritten by entries
#14 and #15).
iterating from C (to the left, following P) the loop ends up at entry #7,
follows its link to Q (which satisfies the loop bounds, as Q < C), and the
data starting at the (invalid) index Q gets interpreted as an entry. it is
possible (though even more unlikely than the partial overwrite case) that
Q and C line up perfectly, which would turn this into an infinite loop.
the loop *should* terminate after having visited 15->7, without wrapping
around.
note that the actual sizes of the entries are not relevant, the
requirements are:
- the entry before the last wrap-around must be big enough that the entry
  at the current index can overtake it without another wrap-around
- the method doing the iteration must be called before the next
  wrap-around
the fix is obviously trivial once the issue is apparent - when wrapping
around during iteration, additionally check that we are not jumping
across the red zone into already invalidated parts of the data.
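one way to express that additional check (a sketch only, not necessarily
the exact patch): a 'prev' link that jumps forward in the buffer, i.e.
the iteration wrapping around, must land behind the red zone, since
everything up to that point may already have been overwritten:

uint32_t prev = cur->prev;
if (prev > cpos && prev <= (clog->cpos + CLOG_MAX_ENTRY_SIZE))
    break;  // would wrap into invalidated data - stop iterating
cpos = prev;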
clusterlog_merge is technically not affected since it aborts before a
wrap-around anyway, but it doesn't hurt to apply the check there
consistently, in case this ever changes.
thanks to @kev1904 on our community forums for reporting and providing the data
to nail the cause down fast!
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
see next commit for details.
get_state mimics the code path triggered in the wild; the other two are
affected just the same.
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
the recommended way is to first shut down the node, then delnode it, and
never let it come back online - in which case corosync-cfgtool won't be
able to kill the removed (offline) node.
also, the order was wrong - if we first update corosync.conf to remove
the node entry from the nodelist, corosync doesn't know about the nodeid
anymore, so killing will fail even if the node is still online.
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
currently, when veth or tap interfaces are plugged into a bridge, an
igmp v3 report is broadcast to the network with the bridge mac address.
Users have reported problems with Hetzner, for example, blocking the
server because of the unknown mac flooding the network.
https://forum.proxmox.com/threads/proxmox-claiming-mac-address.52601/page-6#post-421676
some traces:
ip addr:
190: fwbr109i0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 22:5f:0b:cb:ac:42 brd ff:ff:ff:ff:ff:ff
ebtables log:
Oct 6 09:46:24 kvmformation3 kernel: [437256.753355] MAC-FLOOD-F IN=fwpr109p0 OUT=eno1 MAC source = 22:5f:0b:cb:ac:42 MAC dest = 01:00:5e:00:00:16 proto = 0x0800 IP SRC=0.0.0.0 IP DST=224.0.0.22, IP tos=0xC0, IP proto=2
tcpdump -e -i eno1 igmp
09:53:23.914825 22:5f:0b:cb:ac:42 (oui Unknown) > 01:00:5e:00:00:16 (oui Unknown), ethertype IPv4 (0x0800), length 54: 0.0.0.0 > igmp.mcast.net: igmp v3 report, 1 group record(s)
Signed-off-by: Alexandre Derumier <aderumier@odiso.com>
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
following best-practices according to `sysctl.d(5)`:
* Packages should install their configuration files in /usr/lib/ ...
* It is recommended to prefix all filenames with a two-digit number
and a dash ...
the conffile removal is inspired by how it was done in `procps` (one of
the few packages in the Debian repository which did this transition) and
by following `dpkg-maintscript-helper(1)` and `deb-conffiles(5)` (the
former recommending the latter).
The choice of 10- as prefix is due to pve-container shipping its snippet
with that prefix already. Other packages use higher numbers (e.g.
systemd - 50-)
Tested on 2 VMs (one with modifications, the other without) - worked
as advertised (the modified file was kept as
/etc/sysctl.d/pve.conf.dpkg-old and the upgrade notified me of the
change)
Signed-off-by: Stoiko Ivanov <s.ivanov@proxmox.com>
allows an API client to more easily differentiate between this OK "error"
and an actual exception.
Note that I'd rather now just return undef or an empty object for the
no-cluster case (not too sure about the original reason for the die
anymore), but that would be a breaking change, and in fact it would
break current pve-manager versions out there, so schedule that for
the next major release (if we still want it then)
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Similar to notes for nodes.
datacenter.cfg normally uses key-value pairs defined in the schema.
We bypass this to allow potentially long comments at the top.
Signed-off-by: Dominic Jäger <d.jaeger@proxmox.com>
Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
We have some users running into issues in some cases, like syncing a
huge user base through LDAP into users.cfg or having a few thousand or
more HA services, as then the per-file limit is exhausted.
Bumping that one provides only half of the solution, as the total
limit of 30 MiB would only allow a few files to get that big, or it
would reduce the amount left over for actual guest configurations quite
a bit.
So also bump the total filesystem limit from 30 MiB to 128 MiB, i.e., by
a factor of ~4, and in the same spirit bump the maximal number of
inodes (i.e., different files) from 10k to 256k, which pmxcfs can still
handle rather easily (tested with touch) and which would allow maxing
out the full FS limit with 512 byte files - that fits small guest
configs, so it sounds like an OK proportioned limit.
That should give us quite some wiggle room again, and should be
relatively safe, as most of our access is rather small and touches only
a few files; only root has full access anyway, and that user can break
everything already, so not much is lost here.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
The ceph.service file has been removed in pve-manager commit be244f1.
Therefore, there is no need to reference it anymore. This also avoids
showing `ceph.service` as a `not found` unit.
Signed-off-by: Aaron Lauterer <a.lauterer@proxmox.com>
This was always an "issue", but with Perl 5.28, from our Debian Buster
based release, decode_json just ignored the \0 NUL byte.
For example:
```
perl -w -MJSON -e 'my $raw = "[]\0"; print to_json(decode_json($raw), {pretty=>1});'
```
will get you the following error on perl 5.32
```
garbage after JSON object, at character offset 2 (before "\x{0}") at -e line 1.
```
Note, I did not find anything related in the perldelta articles for
the 5.28 -> 5.30 or 5.30 -> 5.32 updates; the first one made a bigger
jump in the JSON module version used, so possibly a change there.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
dh_systemd has been enabled by default since compat level 10; with level
12 the compat plugin does not exist anymore, so enabling it manually
results in an error.
The dh_strip override is now obsolete too, as users need to go
through 5.4 AND 6.4 anyway on upgrade, and new installations do not
matter here.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
we have lots of forum posts where users think that the locking was
the error, not the actual error message from the called code.
This has limited value as a generally applied prefix; if some code
requires the lockid or whatever to be included in the error message, it
can already do so, so just re-raise the error and be done - at least if
it is an error from the code and not from the lock setup.
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
SQLITE_PREPARE_PERSISTENT

    The SQLITE_PREPARE_PERSISTENT flag is a hint to the query planner
    that the prepared statement will be retained for a long time and
    probably reused many times. Without this flag,
    sqlite3_prepare_v3() and sqlite3_prepare16_v3() assume that the
    prepared statement will be used just once or at most a few times
    and then destroyed using sqlite3_finalize() relatively soon. The
    current implementation acts on this hint by avoiding the use of
    lookaside memory so as not to deplete the limited store of
    lookaside memory. Future versions of SQLite may act on this hint
    differently.

    -- https://sqlite.org/c3ref/c_prepare_normalize.html#sqlitepreparepersistent
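for illustration, this is roughly how the flag is passed when preparing
a statement (a generic example of the sqlite3_prepare_v3() API, not the
actual pmxcfs code - 'db' and 'sql' are assumed to come from the
surrounding code):

#include <stdio.h>
#include <sqlite3.h>

// generic example: prepare a statement that will be kept and reused
static sqlite3_stmt *prepare_persistent(sqlite3 *db, const char *sql)
{
    sqlite3_stmt *stmt = NULL;
    int rc = sqlite3_prepare_v3(db, sql, -1, SQLITE_PREPARE_PERSISTENT, &stmt, NULL);
    if (rc != SQLITE_OK) {
        fprintf(stderr, "prepare failed: %s\n", sqlite3_errmsg(db));
        return NULL;
    }
    return stmt;
}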
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
we can trust that we own *value and *name until the sqlite statement has
been executed, so use the STATIC bind flag to tell sqlite that it does
not need to make its own copy when binding them.
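a generic example of what this looks like with the sqlite3 API
(illustration only, not the actual pmxcfs statement - 'stmt', 'name',
'value' and 'value_len' are assumed to come from the surrounding code):

// with SQLITE_STATIC sqlite uses the passed buffers directly instead of
// copying them, so they must stay valid until the statement was executed
sqlite3_bind_text(stmt, 1, name, -1, SQLITE_STATIC);
sqlite3_bind_blob(stmt, 2, value, value_len, SQLITE_STATIC);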
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>