mirror of git://git.proxmox.com/git/pve-guest-common.git synced 2024-12-22 13:34:00 +03:00
Commit Graph

286 Commits

Author SHA1 Message Date
Thomas Lamprecht
a5cbcc91c7 bump version to 4.2-2
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-17 13:14:03 +01:00
Thomas Lamprecht
a2fc394c75 d/control: bump versioned dependency for libpve-cluster-perl to >= 7-2-3
to ensure the tag functionality is available

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-17 13:11:48 +01:00
Dominik Csapak
2b00f6236d GuestHelpers: add tag related helpers
'get_allowed_tags':
returns the allowed tags for the given user

'assert_tag_permissions':
helper to check permissions for tag setting/updating/deleting,
for both container and qemu-server

gets the list of allowed tags from the DataCenterConfig and the current
user's permissions, and checks, for each tag that is added/removed,
whether the user has permission to modify it

'normal' tags require 'VM.Config.Options' on '/vms/<vmid>', but tags
outside the allowed set (restricted via either 'user-tag-access' or
'privileged-tags' in datacenter.cfg) require 'Sys.Modify' on '/'

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2022-11-17 10:24:33 +01:00
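
For illustration, a minimal Perl sketch of the permission model described
above; the helper name from the commit is real, but the data structures
and the RBAC check here are stand-ins (the actual implementation lives in
PVE::GuestHelpers and consults the datacenter config and RPC environment):

    #!/usr/bin/perl
    use strict;
    use warnings;

    # Stand-in for the set of restricted tags from datacenter.cfg
    # ('user-tag-access' / 'privileged-tags'); purely illustrative.
    my %restricted_tags = map { $_ => 1 } qw(production critical);

    # Stand-in for an RBAC permission check.
    sub user_has_priv {
        my ($user, $priv, $path) = @_;
        return $user eq 'root@pam'; # only root passes in this sketch
    }

    # For each added/removed tag: normal tags need 'VM.Config.Options'
    # on '/vms/<vmid>', restricted ones need 'Sys.Modify' on '/'.
    sub assert_tag_permissions_sketch {
        my ($user, $vmid, @changed_tags) = @_;
        for my $tag (@changed_tags) {
            if ($restricted_tags{$tag}) {
                die "'$user' may not modify restricted tag '$tag'\n"
                    if !user_has_priv($user, 'Sys.Modify', '/');
            } else {
                die "'$user' may not modify tag '$tag' on VM $vmid\n"
                    if !user_has_priv($user, 'VM.Config.Options', "/vms/$vmid");
            }
        }
    }

    assert_tag_permissions_sketch('root@pam', 100, 'production', 'web');
    print "tag change allowed\n";
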
Thomas Lamprecht
b208122a11 bump version to 4.2-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-12 16:52:14 +01:00
Thomas Lamprecht
fb8d2eeca7 d/control: bump versioned dependency for libpve-common-perl to 7.2-6
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-12 16:50:22 +01:00
Thomas Lamprecht
a90246abb1 vzdump: comment that vzdump.cron is the legacy parser
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-12 16:30:02 +01:00
Thomas Lamprecht
57f3baeb76 vzdump: handle new jobs.cfg when removing VMIDs from backup jobs
we use the relatively new SectionConfig functionality that allows
parsing/writing unknown config types; that way we can directly use the
available base job plugin for vzdump jobs and update only those,
keeping the other jobs untouched.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-12 16:25:39 +01:00
Thomas Lamprecht
1791dd4a71 add VZDump::JobBase, split out from manager
We need access to vzdump-type jobs at this level, otherwise we cannot
do things like removing VMIDs when their guest is purged.

So split out the independent part (all but the actual run method)
from pve-manager's PVE::Jobs::VZDump module.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-12 16:21:35 +01:00
Thomas Lamprecht
915e7a0216 vzdump: re-factor remove_vmid_from_jobs to checking a single job
that way we'll be able to re-use it to add support for cleaning out
vzdump jobs from the newer jobs.cfg

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-11-12 14:18:08 +01:00
Wolfgang Bumiller
b41392062f bump version to 4.1-4
Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>
2022-10-12 14:02:29 +02:00
Fiona Ebner
e8d7ef5be3 vzdump: add 'performance' property string as a setting
Initially, to be used for tuning backup performance with QEMU.

A few users reported IO-related issues during backup after upgrading
to PVE 7.x, and using a modified QEMU build with max-workers reduced
from 16 to 8 helped them [0].

This also generalizes the way vzdump property strings are handled, for
easier extension in the future.

[0]: https://forum.proxmox.com/threads/113790/

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2022-10-10 13:02:53 +02:00
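
For context, a hedged example of how such a property string might be
used once the rest of the stack supports it; 'max-workers' is the
sub-option mentioned above, but the exact accepted keys are defined by
the backup code in qemu-server/pve-manager, not by this schema commit:

    # /etc/vzdump.conf (node-wide backup defaults)
    performance: max-workers=8

    # or per invocation on the command line
    vzdump 100 --performance max-workers=8
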
Thomas Lamprecht
bdbc909757 bump version to 4.1-3
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-27 13:15:54 +02:00
Fiona Ebner
18b6c8d1fe replication: avoid "expected snapshot missing" warning when irrelevant
Only print it when there is a snapshot that would've been removed
without the safeguard. Mostly relevant when a new volume is added to
an already replicated guest.

Fixes replication tests in pve-manager.

Fixes: c0b2948 ("replication: prepare: safeguard against removal if expected snapshot is missing")
Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2022-09-26 11:23:17 +02:00
Fiona Ebner
c0b29481e4 replication: prepare: safeguard against removal if expected snapshot is missing
Such a check would also have prevented the issue in 1aa4d84
("ReplicationState: purge state from non local vms") and other
scenarios where state and disk state are inconsistent with regard to
the last_sync snapshot.

AFAICT, all existing callers intending to remove all snapshots use
last_sync=1, so changing the behavior for other (non-zero) values
should be fine.

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2022-08-02 11:05:52 +02:00
Fiona Ebner
1e6416f172 replication: also consider storages from replication state upon removal
This prevents left-over volume(s) in the following situation:
1. replication with volumes on different storages A and B
2. remove all volumes on storage B from the guest configuration
3. schedule full removal before the next normal replication runs

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2022-08-02 11:05:49 +02:00
Fiona Ebner
4c1bd50289 replication: rename last_snapshots to local_snapshots
because prepare() was changed in 8d1cd44 ("partially fix #3111:
replication: be less picky when selecting incremental base") to return
all local snapshots.

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2022-08-02 11:05:45 +02:00
Fiona Ebner
efe85efbb7 replication: prepare: adapt/expand function comment
Commit 8d1cd44 ("partially fix #3111: replication: be less picky when
selecting incremental base") changed prepare() to return all local
snapshots.

Special behavior regarding last_sync is also better mentioned
explicitly.

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2022-08-02 11:05:38 +02:00
Dominik Csapak
f1fc7d6c61 ReplicationState: deterministically order replication jobs
if we have multiple jobs for the same vmid with the same schedule,
the last_sync, next_sync and vmid will always be the same, so the order
depends on the order of the $jobs hash (which is random; thanks, perl)

to have a fixed order, also take the jobid into consideration

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
Reviewed-by: Fabian Ebner <f.ebner@proxmox.com>
2022-06-08 08:48:04 +02:00
Dominik Csapak
1aa4d844a1 ReplicationState: purge state from non local vms
when running replication, we don't want to keep replication states for
non-local vms. Normally this would not be a problem, since on migration,
we transfer the states anyway, but when the ha-manager steals a vm, it
cannot do that. In that case, having an old state lying around is
harmful, since the code does not expect the state to be out-of-sync
with the actual snapshots on disk.

One such problem is the following:

Replicate vm 100 from node A to node B and C, and activate HA. When node
A dies, it will be relocated to e.g. node B and start replicating from
there. If node B now had an old state lying around for its sync to node
C, it might delete the common base snapshots of B and C and cannot sync
again.

Deleting the state for all non local guests fixes that issue, since it
always starts fresh, and the potentially existing old state cannot be
valid anyway since we just relocated the vm here (from a dead node).

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
Reviewed-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
Reviewed-by: Fabian Ebner <f.ebner@proxmox.com>
2022-06-08 08:48:04 +02:00
Thomas Lamprecht
3802f3ddd0 vzdump config: limit notes template to maximal 1024 characters
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-06-08 08:47:59 +02:00
Fabian Ebner
8ec108b341 vzdump: update notes-template description
as the actual handling in pve-manager changed a bit.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2022-04-28 15:06:43 +02:00
Thomas Lamprecht
54fba1b4a9 bump version to 4.1-2
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-04-27 18:51:21 +02:00
Fabian Ebner
51cbaa3f93 vzdump: schema: add 'notes-template' and 'protected' properties
In command_line(), notes are printed, quoted, but otherwise as-is,
which is a bit ugly for multi-line notes. But it is part of the
command line, so print it.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2022-04-27 10:45:59 +02:00
Thomas Lamprecht
73a3e4cb23 replication config: retry first three failed times quicker before going to 30m
So the repeat frequency for a stuck job is now:
t0 -> fails
t1 = t0 +  5m -> repeat
t2 = t1 + 10m = t0 + 15m -> repeat
t3 = t2 + 15m = t0 + 30m -> repeat
t4 = t3 + 30m = t0 + 60m -> repeat
then
tx = t(x-1) + 30m -> repeat

So, we converge more naturally/stably to the 30m intervals than
before, when t3 would have been t0 + 45m.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-04-27 09:59:26 +02:00
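
The schedule above boils down to: the n-th consecutive failure is
retried after n * 5 minutes for the first three failures, and after 30
minutes from then on. A small illustrative Perl sketch (not the actual
PVE code):

    #!/usr/bin/perl
    use strict;
    use warnings;

    # n-th consecutive failure -> delay until the next retry, in minutes:
    # 5m, 10m, 15m for the first three failures, then always 30m.
    sub retry_delay_minutes {
        my ($fail_count) = @_;
        return $fail_count <= 3 ? $fail_count * 5 : 30;
    }

    printf "failure %d -> retry in %2d minutes\n", $_, retry_delay_minutes($_)
        for 1 .. 6;
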
Thomas Lamprecht
3bf8e49a94 replication config: code cleanup
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-04-27 09:58:41 +02:00
Thomas Lamprecht
e6e1550049 print snapshot tree: reduce indentation delta per level
previous:

> `-> foo                         2021-05-28 12:59:36 no-description
>   `-> bar                       2021-06-18 12:44:48 no-description
>     `-> current                                     You are here!

now:

> `-> foo                         2021-05-28 12:59:36 no-description
>  `-> bar                        2021-06-18 12:44:48 no-description
>   `-> current                                       You are here!

So it requires less space, allowing deeper snapshot trees to still be
displayed nicely, and looks even better while doing that - though the
latter may be subjective.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-03-01 13:06:35 +01:00
Dominik Csapak
9fca8f9d5e print snapshot tree: clamp indentation length to positive
If a user has many snapshots, the length heuristic can go negative
and produce wrong indentation, so clamp it at 0.

Reported in the forum: https://forum.proxmox.com/threads/105740/

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-03-01 12:59:38 +01:00
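
The fix amounts to clamping the computed field width at zero before it
is used for padding; a tiny illustrative sketch (the width formula here
is made up, only the clamping matters):

    #!/usr/bin/perl
    use strict;
    use warnings;
    use List::Util qw(max);

    # The name column shrinks with tree depth; for very deep trees the
    # heuristic can go negative, so clamp it at zero.
    sub name_field_width {
        my ($depth) = @_;
        return max(30 - 2 * $depth, 0);
    }

    printf "depth %2d -> width %2d\n", $_, name_field_width($_) for (0, 5, 20);
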
Thomas Lamprecht
6a5f25ee19 bump version to 4.1-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-02-09 18:28:20 +01:00
Fabian Grünbichler
1fa3dc1994 add storage tunnel module
encapsulating storage-related tunnel methods, currently
- source-side storage-migrate helper
- target-side disk-import handler
- target-side query-disk-import handler
- target-side bwlimit handler

to be extended further with replication-related handlers and helpers.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2022-02-09 18:22:29 +01:00
Fabian Grünbichler
d88d2066a5 add tunnel helper module
lifted from PVE::QemuMigrate, abstracting away use-case specific data.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2022-02-09 18:20:55 +01:00
Fabian Grünbichler
74c26370c0 migrate: add get_bwlimit helper
given a source and a target storage, query either locally only or both
locally and remotely, and merge the results.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2022-02-09 18:20:42 +01:00
Fabian Grünbichler
42a84dc9e1 migrate: handle migration_network with remote migration
remote migration always has an explicit endpoint from the start, which
gets used for everything.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2022-02-04 17:36:44 +01:00
Thomas Lamprecht
1a400a9ea8 abstract config: fix implement-me comment
this is internal and doesn't need to wait until the next major release

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-02-04 17:36:06 +01:00
Fabian Ebner
a68bfdb1ee config: activate affected storages for snapshot operations
For snapshot creation, the storage for the vmstate file is activated
via vdisk_alloc when the state file is created.

Do not activate the volumes themselves, as that has unnecessary side
effects (e.g. waiting for zvol device link for ZFS, mapping the volume
for RBD). If a storage can only do snapshot operations on a volume
that has been activated, it needs to activate the volume itself.

The actual implementation will be in the plugins to be able to skip
CD ROM drives and bind-mounts, etc.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2022-01-28 14:28:35 +01:00
Fabian Ebner
9643bddd3a config: remove unused variable
Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2022-01-28 14:28:35 +01:00
Thomas Lamprecht
2fb36c40c8 snapshot prepare: log on parent-cycle deletion
for new clones this should not happen anyway, as the API calls now
clean up such parent references, but for old ones it can still happen,
so log to inform.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2021-11-30 08:17:04 +01:00
Oguz Bektas
685a524ea3 snapshots: delete parent property if new snapshot name is already a parent to existing one
Signed-off-by: Oguz Bektas <o.bektas@proxmox.com>
Tested-by: Hannes Laimer <h.laimer@proxmox.com>
2021-11-30 07:59:55 +01:00
Fabian Ebner
244583a40b replication: prepare: simplify code
No functional change is intended.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-29 10:50:36 +01:00
Fabian Ebner
ff574bf8d2 replication: update last_sync before removing old replication snapshots
If pvesr was terminated after finishing with the new sync and after
removing old replication snapshots, but before it could write the new
state, the next replication would fail. It would wrongly interpret the
actual last replication snapshot as stale, remove it, and (if no other
snapshots are present) attempt a full sync, which would fail.

Reported in the community forum [0], this was brought to light by the
new pvescheduler before it learned graceful reload.

It's not possible to simply preserve a last remaining snapshot in
prepare(), because prepare() is also used for valid removals. Instead,
update last_sync early enough. Stale snapshots will still be removed
on the next run if there are any.

[0]: https://forum.proxmox.com/threads/100154

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-29 10:50:36 +01:00
Fabian Grünbichler
7d604b5bbd bump version to 4.0-3
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2021-11-09 13:17:37 +01:00
Fabian Ebner
2511f525f5 config: snapshot delete check: use volume_snapshot_info
Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:35:38 +01:00
Fabian Ebner
b20bf9bf7d replication: find common snapshot: use additional information
which is now available from the storage back-end.

The benefits are:

1. Ability to detect different snapshots even if they have the same
name. Rather hard to reach, but for example with:
Snapshots A -> B -> C -> __replicationXYZ
Remove B, rollback to C (causes __replicationXYZ to be removed),
create a new snapshot B. Previously, B was selected as replication
base, but it didn't match on source and target. Now, C is correctly
selected.
2. Smaller delta in some cases by not preferring replication snapshots
over config snapshots, but using the most recent possible one from
both types of snapshots.
3. Less code complexity for snapshot selection.

If the remote side is old, it's not possible to detect a mismatch of
distinct snapshots with the same name, but the timestamps from the
local side can still be used.

Still limit to our snapshots (config and replication), because we
don't have guarantees for other snapshots (could be deleted in the
meantime, name might not fit import/export regex, etc.).

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:35:34 +01:00
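
Conceptually, the selection picks the most recent snapshot that exists
on both sides with a matching timestamp; a simplified Perl sketch under
that assumption (the real logic in PVE::Replication also restricts
itself to config and replication snapshots and handles old remote
sides):

    #!/usr/bin/perl
    use strict;
    use warnings;

    # Snapshot info per side, roughly in the shape prepare() now returns:
    # name => { timestamp => ... }. Purely illustrative data.
    my %local  = (snapA => { timestamp => 100 }, snapB => { timestamp => 300 });
    my %remote = (snapA => { timestamp => 100 }, snapB => { timestamp => 250 });

    # A snapshot is a usable incremental base only if it exists on both
    # sides with the same timestamp; among those, take the most recent.
    my ($base) =
        sort { $local{$b}{timestamp} <=> $local{$a}{timestamp} }
        grep { $remote{$_} && $remote{$_}{timestamp} == $local{$_}{timestamp} }
        keys %local;

    print defined($base)
        ? "incremental base: $base\n"
        : "no common snapshot, full sync needed\n";

In this toy data, 'snapB' exists on both sides but with different
timestamps (like the recreated snapshot B in the example above), so the
older 'snapA' is correctly chosen as the base.
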
Fabian Ebner
3200c404a9 replication: prepare: return additional information about snapshots
This is backwards compatible, because existing users of prepare() only
rely on the elements to evaluate to true or be defined.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:35:34 +01:00
Fabian Ebner
84fc20aa37 replication: refactor finding most recent common replication snapshot
By using a single loop instead. This should make the code not only
more readable, but also more efficient.

Suggested-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:35:34 +01:00
Fabian Ebner
602ca77cdb fix #3111: config: snapshot delete: check if replication still needs it
and abort if it does and --force is not specified.

After rollback, the rollback snapshot might still be needed as the
base for incremental replication, because rollback removes (blocking)
replication snapshots.

It's not enough to limit the check to the most recent snapshot,
because new snapshots might've been created between rollback and
remove.

It's not enough to limit the check to snapshots without a parent (i.e.
in case of ZFS, the oldest), because some volumes might've been added
only after that, meaning the oldest snapshot is not an incremental
replication base for them.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:34:14 +01:00
Fabian Ebner
8d1cd44345 partially fix #3111: replication: be less picky when selecting incremental base
After rollback, it might be necessary to start the replication from an
earlier, possibly non-replication, snapshot, because the replication
snapshot might have been removed from the source node. Previously,
replication could only recover in case the current parent snapshot was
already replicated.

To get into the bad situation (with no replication happening between
the steps):
1. have existing replication
2. take new snapshot
3. rollback to that snapshot
In case the partial fix to only remove blocking replication snapshots
for rollback was already applied, an additional step is necessary to
get into the bad situation:
4. take a second new snapshot

Since non-replication snapshots are now also included, where no
timestamp is readily available, it is necessary to filter them out
when probing for replication snapshots.

If no common replication snapshot is present, iterate backwards
through the config snapshots.

The changes are backwards compatible:

If one side is running the old code, and the other the new code,
the fact that one of the two prepare() calls does not return the
new additional snapshot candidates, means that no new match is
possible. Since the new prepare() returns a superset, no previously
possible match is now impossible.

The branch with @desc_sorted_snap is now taken more often, but
it can still only be taken when the volume exists on the remote side
(and has snapshots). In such cases, it is safe to die if no
incremental base snapshot can be found, because a full sync would not
be possible.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:34:00 +01:00
Fabian Ebner
c05dc937d4 replication: pass guest config to find_common_replication_snapshot
in preparation to iterate over all config snapshots when necessary.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:34:00 +01:00
Fabian Ebner
fbbeb87225 replication: remove unused variable and style fixes
Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:34:00 +01:00
Fabian Ebner
45c0b7554c partially fix #3111: further improve removing replication snapshots
by using the new $blockers parameter. No longer remove all replication
snapshots from affected volumes unconditionally, but check first if
all blocking snapshots are replication snapshots. If they are, remove
them and proceed with rollback. If they are not, die without removing
any.

For backwards compatibility, it's still necessary to remove all
replication snapshots if $blockers is not available.

Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:34:00 +01:00
Fabian Ebner
a9bc9b3c89 config: rollback: factor out helper for removing replication snapshots
Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>
2021-11-08 10:34:00 +01:00