shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	e30f3c8410	metadata: simplify code	2021-10-06 15:39:58 +02:00
David Teigland	939b4bc587	handle bad metadata text in vg_read path Corrupt metadata text (with good mda header) was being handled in the label_scan phase, but not in the vg_read phase. This was sufficient because metadata areas would always be read and checksummed during label_scan (metadata parsing was skipped previously as an optimization.) This changed with the optimization in commit `61a6f9905e` "metadata: optimize reading metadata copies in scan" Now, some metadata areas will not be read and checksummed at all during the label_scan phase, only during the vg_read phase. This means that bad metadata text may first be detected in the vg_read phase. So, add equivalent bad metadata handling to the vg_read path to match the label_scan path.	2021-09-28 15:17:43 -05:00
Zdenek Kabelac	f74d30d411	cleanup: use already known lv size	2021-09-27 18:56:14 +02:00
Zdenek Kabelac	e585c10fad	make: fix compilation for undefined RAID_INTERNAL Reported-by: adamboardman of gemian	2021-09-27 18:56:14 +02:00
Zdenek Kabelac	24e90f9594	metadata: remember parsing size of VG metadata When creating lvm2 metadata for a VG, lvm2 allocates a buffer, and if the buffer is not big enough, it is reallocated bigger and the whole metadata creation is repeated until the metadata fits. We can try to use the previous metadata size as a hint to reduce looping here.	2021-09-27 18:49:41 +02:00
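The retry loop described in `24e90f9594` can be sketched roughly as follows. This is a minimal illustration, not the lvm2 code: `export_with_hint`, the 16-byte starting size, and the sample metadata text are all assumptions made for the example.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

/*
 * Hypothetical sketch: format metadata text into a heap buffer, doubling the
 * buffer and repeating the whole formatting step until the text fits.
 * *size_hint carries the size that worked last time (0 = no hint yet) and is
 * updated on success; *attempts reports how many formatting passes were needed.
 */
static char *export_with_hint(const char *vg_name, size_t *size_hint,
			      unsigned *attempts)
{
	size_t size;
	char *buf = NULL;

	assert(vg_name && size_hint && attempts);
	size = *size_hint ? *size_hint : 16;	/* assumed initial size */
	*attempts = 0;

	for (;;) {
		char *tmp = realloc(buf, size);
		int n;

		if (!tmp) {
			free(buf);
			return NULL;
		}
		buf = tmp;
		(*attempts)++;

		/* Stand-in for the real metadata export step. */
		n = snprintf(buf, size,
			     "%s {\n\tseqno = 1\n\tstatus = [\"RESIZEABLE\"]\n}\n",
			     vg_name);
		if (n < 0) {
			free(buf);
			return NULL;
		}
		if ((size_t) n < size) {
			*size_hint = (size_t) n + 1;	/* remember for the next call */
			return buf;
		}
		size *= 2;	/* too small: grow and repeat the whole pass */
	}
}
```

Called twice with the same `size_hint` variable, the first call needs several passes while the second fits on the first pass, which is the saving the commit is after.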
Zdenek Kabelac	6c87e98ee3	cov: check for possible NULL segtype Although likely impossible to ever miss ERROR segtype, make analyzer happier.	2021-09-20 14:26:09 +02:00
Zdenek Kabelac	3e21c8524e	gcc-fanalyzer: add extra check for origin_from_cow Make analyzer work easier with explicit check for internal error.	2021-09-20 13:58:57 +02:00
Zdenek Kabelac	5126ac7c3a	gcc-fanalyzer: explicit test null not pass Make analyzer explicitly aware we can't get NULL here.	2021-09-20 10:51:30 +02:00
Zdenek Kabelac	dd5f8b3f8c	clang: keep metaname initialized Never access uninitialized metaname buffer.	2021-09-15 15:24:56 +02:00
Zdenek Kabelac	63930f576a	cov: add some initializers	2021-09-13 12:34:41 +02:00
Zdenek Kabelac	a8ee13900d	cov: initialize attr	2021-09-13 12:34:41 +02:00
Zdenek Kabelac	d489445e5a	cache: implement better revert path When cache creation fails on the table reload path, implement a more advanced revert solution that tries to restore the LVM metadata to its state before the actual caching started.	2021-09-13 12:34:41 +02:00
Zdenek Kabelac	e6f735d411	vdo: read new sysfs path New versions of the kvdo module expose statistics at a new location: /sys/block/dm-XXX/vdo/statistics/... Enhance lvm2 to access this location first. Also if the statistic info is missing - make it 'debug' level info, so it is not failing 'lvs' command.	2021-09-09 15:24:15 +02:00
Zdenek Kabelac	79427151dc	vdo: add support for auto-unsafe writePolicy This vdoWritePolicy policy missed matching support in lvm2.	2021-09-06 15:19:51 +02:00
Zdenek Kabelac	419c93c873	vdo: support lvcreate with skipped activation Support creation of VDO LV for 'lvcreate -ky...'.	2021-08-31 22:05:47 +02:00
Zdenek Kabelac	88360b0c51	vdo: skip zeroing for VDO LV Since VDO always returns 'zero' on unprovisioned read and every provisioned block is always 'zeroed' on partial writes, we can avoid 'zeroing' of such LVs.	2021-08-31 22:05:47 +02:00
David Teigland	96b777167c	cov: clean up pvid and vgid usage pvid and vgid are sometimes a null-terminated string, and other times a 'struct id', and the two types were often cast between each other. When a struct id was cast to a char pointer, the resulting string would not necessarily be null terminated. Casting a null-terminated string id to a struct id is fine, but is still avoided when possible. A struct id is: int8_t uuid[ID_LEN] A string id is: char pvid[ID_LEN + 1] A convention is introduced to help distinguish them: - variables and struct fields named "pvid" or "vgid" should be null-terminated strings. - variables and struct fields named "pv_id" or "vg_id" should be struct id's. - examples: char pvid[ID_LEN + 1]; char vgid[ID_LEN + 1]; struct id pv_id; struct id vg_id; Function names also attempt to follow this convention. Avoid casting between the two types as much as possible, with limited exceptions when known to be safe and clearly commented. Avoid using variations of strcpy and strcmp, and instead use memcpy/memcmp with ID_LEN (with similar limited exceptions possible.)	2021-08-16 11:31:15 -05:00
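The naming convention from `96b777167c` can be illustrated with a small sketch. The helper names below are hypothetical and the value of `ID_LEN` is an assumption for the example. Only the two type shapes and the rule "use memcpy/memcmp with ID_LEN" come from the commit message.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define ID_LEN 32	/* assumed value for this sketch */

/* A "struct id": raw bytes, NOT null-terminated. */
struct id {
	int8_t uuid[ID_LEN];
};

/* struct id -> string id ("pvid"): copy ID_LEN bytes, then terminate. */
static void id_to_pvid(const struct id *pv_id, char pvid[ID_LEN + 1])
{
	assert(pv_id && pvid);
	memcpy(pvid, pv_id->uuid, ID_LEN);
	pvid[ID_LEN] = '\0';
}

/* String id -> struct id: copy exactly ID_LEN bytes, never the terminator. */
static void pvid_to_id(const char pvid[ID_LEN + 1], struct id *pv_id)
{
	assert(pv_id && pvid);
	memcpy(pv_id->uuid, pvid, ID_LEN);
}

/* Compare with memcmp and ID_LEN instead of strcmp, as the commit suggests. */
static int id_matches_pvid(const struct id *pv_id, const char *pvid)
{
	return !memcmp(pv_id->uuid, pvid, ID_LEN);
}
```

Copying through explicit helpers avoids the cast from `struct id *` to `char *` that could produce an unterminated string.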
Zdenek Kabelac	f4b49dae1e	cov: raid: no more checks with missing areas Since ->areas is directly dereferenced we need to stop validation right here.	2021-07-28 00:49:28 +02:00
Zdenek Kabelac	69379df8f3	cov: remove unneeded includes	2021-07-28 00:49:28 +02:00
Zdenek Kabelac	2132fdc11f	vgsplit: add support for option --poolmetadataspare When splitting VG with thin/cache pool volume, handle pmspare during such split and allocate new pmspare in new VG or extend existing pmspare there and eventually drop pmspare in original VG if it is no longer needed there.	2021-07-21 15:56:33 +02:00
David Teigland	d5a06f9a7d	pvscan: skip indexing devices used by LVs dev_cache_index_devs() is taking a large amount of time when there are many PVs. The index keeps track of devices that are currently in use by active LVs. This info is used to print warnings for users in some limited cases. The checks/warnings that are enabled by the index are not needed by pvscan --cache, so disable it in this case. This may be expanded to other cases in future commits. dev_cache_index_devs should also be improved in another commit to avoid the extreme delays with many devices.	2021-07-06 10:18:07 -05:00
Zdenek Kabelac	2c6a2b6e86	vdo: support vdo_pool_header_size Add profilable configurable setting for vdo pool header size, that is used as 'extra' empty space at the front and end of vdo-pool device to avoid having a disk in the system that may have the same data as a real vdo LV. For some conversion cases however we may need to allow using '0' header size. TODO: in this case we may eventually avoid adding 'linear' mapping layer in future - but this requires further modification over lvm code base.	2021-06-28 20:41:07 +02:00
Zdenek Kabelac	6e773bb196	lvconvert: fix vdo virtual size when specified Correctly use virtual size specified by: lvconvert --type vdo-pool --virtualsize	2021-06-28 20:41:07 +02:00
Zdenek Kabelac	bb45e33518	backup: automatically store data on vg_unlock Previously an explicit call of backup was necessary (and was often either forgotten or over-used). With this patch the necessity to store a backup is remembered at vg_commit, and once the VG is unlocked, the committed metadata is automatically stored in the backup file. This may possibly alter some printed messages from commands, as the backup is now taken later.	2021-06-09 14:56:13 +02:00
Zdenek Kabelac	ba3707d953	archiving: take archive automatically Instead of calling explicit archive with command processing logic, move this step towards 1st. vg_write() call, which will automatically store archive of committed metadata. This slightly changes some error path where the error in archiving was detected earlier in the command, while now some on going command 'actions' might have been, but will be simply scratched in case of error (since even new metadata would not have been even written). So general effect should be only some command message ordering.	2021-06-09 14:56:13 +02:00
David Teigland	247f69f9aa	writecache: fix lv_on_pmem dev_is_pmem on pv->dev requires a pv segment or it could segfault.	2021-06-02 10:51:12 -05:00
David Teigland	4a746f7ffc	lvremove: fix removing thin pool with writecache on data	2021-05-24 16:09:35 -05:00
Leo Yan	ef1c57e68f	lib: locking: Add new type "idm" We can consider the drive firmware a server to handle the locking request from nodes, this essentially is a client-server model. DLM uses the kernel as a central place to manage locks, so it also complies with client-server model for locking operations. This is why IDM and DLM are similar with each other for their wrappers. This patch largely works by generalizing the DLM code paths and then providing degeneralized functions as wrappers for both IDM and DLM. Signed-off-by: Leo Yan <leo.yan@linaro.org>	2021-05-20 16:01:05 -05:00
David Teigland	318bb3a06b	blkid: simplify fs block size check Only the LV path name is needed for blkid query, the step of getting a dev struct is not needed.	2021-05-05 16:15:10 -05:00
Zdenek Kabelac	2b3dcd754f	cov: check return value Log problems on fail path.	2021-04-23 23:00:55 +02:00
Zdenek Kabelac	86a3a0c765	cov: fix typo and reduce stack usage Buffer on stack was for single LV name plus some short text around. Use of 50* was a typo so use correctly 50+.	2021-04-23 23:00:55 +02:00
Zdenek Kabelac	7e77e250a9	cov: set error_vg only when pointer is non null	2021-04-23 22:58:45 +02:00
David Teigland	0a28e3c44b	Add metadata-based autoactivation property for VG and LV The autoactivation property can be specified in lvcreate or vgcreate for new LVs/VGs, and the property can be changed by lvchange or vgchange for existing LVs/VGs. --setautoactivation y\|n enables\|disables autoactivation of a VG or LV. Autoactivation is enabled by default, which is consistent with past behavior. The disabled state is stored as a new flag in the VG metadata, and the absence of the flag allows autoactivation. If autoactivation is disabled for the VG, then no LVs in the VG will be autoactivated (the LV autoactivation property will have no effect.) When autoactivation is enabled for the VG, then autoactivation can be controlled on individual LVs. The state of this property can be reported for LVs/VGs using the "-o autoactivation" option in lvs/vgs commands, which will report "enabled", or "" for the disabled state. Previous versions of lvm do not recognize this property. Since autoactivation is enabled by default, the disabled setting will have no effect in older lvm versions. If the VG is modified by older lvm versions, the disabled state will also be dropped from the metadata. The autoactivation property is an alternative to using the lvm.conf auto_activation_volume_list, which is still applied to VGs/LVs in addition to the new property. If VG or LV autoactivation is disabled either in metadata or in auto_activation_volume_list, it will not be autoactivated. An autoactivation command will silently skip activating an LV when the autoactivation property is disabled. To determine the effective autoactivation behavior for a specific LV, multiple settings would need to be checked: the VG autoactivation property, the LV autoactivation property, the auto_activation_volume_list. The "activation skip" property would also be relevant, since it applies to both normal and auto activation.	2021-04-07 15:32:49 -05:00
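The settings listed at the end of `0a28e3c44b` combine into one decision, sketched below. The struct and function names are hypothetical, and the way "activation skip" is folded in is an assumption based only on the message saying it "would also be relevant". The real check in lvm2 is spread over several code paths.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical summary of the settings the commit message lists. */
struct autoact_inputs {
	bool vg_autoactivation;		/* VG property (enabled by default) */
	bool lv_autoactivation;		/* LV property (enabled by default) */
	bool passes_volume_list;	/* allowed by auto_activation_volume_list,
					 * or the list is not set */
	bool activation_skip;		/* "activation skip" flag set on the LV */
};

/* Returns true if an autoactivation command would activate the LV. */
static bool lv_would_autoactivate(const struct autoact_inputs *in)
{
	assert(in);

	if (!in->vg_autoactivation)
		return false;	/* VG disabled: the LV property has no effect */
	if (!in->lv_autoactivation)
		return false;
	if (!in->passes_volume_list)
		return false;
	if (in->activation_skip)
		return false;	/* assumed: skip also blocks autoactivation */

	return true;
}
```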
Zdenek Kabelac	287565fd5d	lvreduce: support --yes Missed support for --yes with 'lvreduce' to answer 'y' to prompt.	2021-04-06 21:26:57 +02:00
Samanta Navarro	01d5e4d1ca	all: fix typos	2021-03-30 13:08:14 +02:00
Zdenek Kabelac	05920e3818	raid: restore mirror handling in _raid_in_sync Function is not having the best name since it does check not just raid LVs to be in sync. Restore the mirror percentage checking - although without retries, since only raid target is currently known to need it - for other types it would be ATM a bug to get inconsistent result.	2021-03-20 10:52:24 +01:00
Zdenek Kabelac	cc140f68a5	raid: resync cannot lose primary leg Prohibit dropping primary leg while resyncing.	2021-03-19 23:19:31 +01:00
Zdenek Kabelac	076e155697	raid: interruptible usleep when waiting for sync While waiting for raid to return consistent status, use interruptible sleep - so command can break quickly. Use lv_raid_status() to get percentage easily from status.	2021-03-19 23:17:03 +01:00
Zdenek Kabelac	7a9efc5fae	lvresize: allow mixing striped with errors or zero Enabled extension/mixing of stripes/linears, error and zero segtype LVs with stripes/linear, error and zero segtypes. It is not very useful in practice, as the user cannot store any real data on error or zero segtypes, but it may get some uses in some scenarios where i.e. some portion of the device should not be readable. Mixing of types happens on 'extent_size' level: lvcreate -L1 -n lv vg lvextend --type error -L+1 vg/lv lvextend --type zero -L+1 vg/lv lvextend --type linear -L+1 vg/lv lvextend --type striped -L+1 vg/lv lvs -o+segtype,seg_size vg Note: when the type is not specified, the last segment type is automatically selected. It's also a small 'can of worms' since we can't tell LVs if the LV is linear/error/zero or their mixtures. So the meaning behind them may need some updates. We already have these types of LV created i.e. by: vgreduce --removemissing --force where missing LV segments have been replaced by either error or zero segtype (lvm.conf). TODO: it might be worth adding a message while such device is activated.	2021-03-18 18:56:49 +01:00
Zdenek Kabelac	b35ef9d67c	segtypes: macros for error and zero segtypes	2021-03-18 18:34:57 +01:00
Zdenek Kabelac	22554c3ff0	lvremove: extra code for handling thinpool data Add some extra code to handle differently sized thin-pool from thin-pool data volume. ATM this can't really happen, but once we start to use multiple commits while resizing stacked LV, we may actually get into the position, where data LV has been already resized, but thin-pool stayed with old size. But for now - report difference as internal error.	2021-03-18 18:34:57 +01:00
Zdenek Kabelac	5a73399b73	lvresize: support resize of stacked virtual LV Update the LV stack with the size also for virtual LVs.	2021-03-18 18:34:57 +01:00
Zdenek Kabelac	a9b4acd511	dev_manager: add lv_raid_status Just like with other segtype use this function to get whole raid status info available per a single ioctl call. Also it nicely simplifies read of percentage info about in_sync portion of raid volume. TODO: drop use of other calls then lv_raid_status call, since all such calls could already use status - so it just adds unnecessary duplication.	2021-03-18 18:34:57 +01:00
Zdenek Kabelac	8cbe4a171e	thin: add extra protection Check explicitly created LV already has thin segment. As currently it's the only user - this patch should have no impact.	2021-03-18 18:34:57 +01:00
Zdenek Kabelac	d682ad619a	cleanup: simpler check first	2021-03-18 18:34:57 +01:00
Zdenek Kabelac	8b2cdd8d3a	debug: start with upper case Use upper case letter to start sentence. Also drop unneeded check for vg as it's already non-null.	2021-03-17 00:50:40 +01:00
Zdenek Kabelac	f69ff4b84a	debug: update message	2021-03-15 11:13:24 +01:00
Zdenek Kabelac	bc1bc4cffc	debug: drop stack from regular code flow	2021-03-15 11:13:24 +01:00
Zdenek Kabelac	5edb353062	lvremove: use to_remove for snapshot removal Reuse similar 'acceleration' as used for dependent volumes also for snapshot - so when origin is being removed with all thick snapshots, don't bother with individual 'COW' detachments and write&commits, and when possible handle this all within a single commit.	2021-03-15 11:11:35 +01:00
Zdenek Kabelac	0a2d7c57a1	lvremove: use common routine for prompting Move code for prompting about removed LV to a single function and use it also to prompt for removal of origin and all its thick snapshots and also when removing merging origin. Function does handle postponed write_and_commit so there is no 'in-flight' operation while waiting on [y\|n] answer.	2021-03-15 11:08:47 +01:00
Zdenek Kabelac	a18409b6d1	vg_validate: fix validation of merging thin origin Compat code and handle unusual case, where thin snapshot is also a 'thick snapshot origin' and such snapshot gets merged into a thin origin. However since now lv_is_visible() (which is complex function) replaced &VISIBLE_LV check, the whole this check seems to be no longer useful as sum of all 3 will always match??	2021-03-15 10:59:09 +01:00
Zdenek Kabelac	fab9987ad7	cleanup: move common condition	2021-03-14 16:34:38 +01:00
Zdenek Kabelac	664d3b0f22	lvremove: drop flushing dm cache before remove Since cached LV is going to be removed together with its cache, there is not much to gain if we try to flush cache first. User may use 'vgcfgrestore' to get back origin + cache. Assuming user is not using issue_discards. When data are discarded after remove there is nothing to restore! This change allows to further reduce number of commits during lvremove/vgremove.	2021-03-14 16:34:38 +01:00
Zdenek Kabelac	3608e8aee7	cache: use interruptible_usleep Reuse code for interruptible sleeping.	2021-03-14 16:34:38 +01:00
Zdenek Kabelac	bbac843268	thinpool: correct condition Actually we do want to flush thin-pool message for particular LV first. Existing condition evaluated to noop.	2021-03-12 12:59:55 +01:00
Zdenek Kabelac	a654148b76	gcc: adding const	2021-03-11 00:18:01 +01:00
Zdenek Kabelac	c4f5d93122	cleanup: eliminate unused assign	2021-03-11 00:18:01 +01:00
Zdenek Kabelac	f4543aca15	lvremove: support faster removal of thin-pools When lvremove/vgremove removes thin volumes with its thin-pool as well, try to skip any updates of such thin-pool, so when everything properly deactivates, there is no message sent to this thin-pool and whole thin-pool is removed with a single commit.	2021-03-11 00:18:01 +01:00
Zdenek Kabelac	131ca0eb95	activation: use existing LV as best effort Returning NULL for lv_committed is basically instant crash, so instead try with passed LV instead. It shouldn't matter as this is internal error path anyway, but coverity should be happier.	2021-03-10 01:29:06 +01:00
Zdenek Kabelac	d01c17ff22	debug: more use of display_lvname	2021-03-10 01:11:52 +01:00
Zdenek Kabelac	5f7a7af7f2	cleanup: no backtraces needed after log_error Reduce double backtracing.	2021-03-10 01:11:52 +01:00
Zdenek Kabelac	177b63becc	backup: set in vg_commit Another step towards better automatic handling of backup, and automatically setup needs_backup after commit. In some next step we should reduce number of backups and take them only at the command finish with vg_committed content.	2021-03-10 01:09:46 +01:00
Zdenek Kabelac	843ee943ab	lvremove: correct return code Need to return ECMD_FAILED from toollib code. Add missing stack traces.	2021-03-08 20:24:04 +01:00
Zdenek Kabelac	6d6e1ae887	cleanup: compare only LV uuid part Match VG uuid just once per list of all LVs in VG. TODO: maybe some more efficient tree or hash could be better here, but since it's used not so often, the total benefit is not so great, so ATM just reducing amount of checked bytes.	2021-03-08 15:43:27 +01:00
Zdenek Kabelac	e5456c259f	cleanup: simpler checks first Minor optimizations...	2021-03-08 15:43:27 +01:00
Zdenek Kabelac	2d64ffaee5	hash: use individual hint sizes Use different 'hint' size for dm_hash_create() call - so when debug info about hash is printed we can recognize which hash was in use. This patch doesn't change actual used size since that is always rounded to be power of 2 and >=16 - so as such is only a help to developer. We could eventually use 'name' arg, but since this would have changed API and this patchset will be routed to libdm & stable - we will just use this small trick.	2021-03-08 15:33:15 +01:00
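The rounding that `2d64ffaee5` relies on (hint sizes rounded up to a power of 2, never below 16) can be sketched like this. `rounded_hash_size` is a hypothetical helper written for illustration, not the libdm function itself.

```c
/*
 * Hypothetical helper: round a hash size hint up to the next power of two,
 * with a minimum of 16, as the commit message describes.
 */
static unsigned rounded_hash_size(unsigned hint)
{
	unsigned size = 16;

	/* Stop at 2^31 so the shift cannot overflow for huge hints. */
	while (size < hint && size < (1u << 31))
		size <<= 1;

	return size;
}
```

Hints such as 30 and 31 both round to 32 buckets, so distinct hint values can label individual hashes in debug output without changing the size actually used.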
Zdenek Kabelac	78c7ae7cd2	lvremove: reduce ioctl count Just like with deactivation, call of 'lv_is_not_in_use()' now has embedded report for inactive LV. Note: this patch cannot be backported to stable-2.02 - as there lv_is_active() has 'cluster' meaning and differs from lvinfo().	2021-03-08 15:32:10 +01:00
Zdenek Kabelac	936c7b5104	vg_read: reuse already parsed config tree When parsing VG metadata we can create from a single config tree also 'vg_committed' that is always created for writable VG. This avoids the extra unnecessary step of serializing and deserializing the just parsed VG.	2021-03-08 15:30:18 +01:00
Zdenek Kabelac	bc0cb66304	vg_write: optimize caching of precommitted VG Every vg_write stores new 'metadata' into precommitted slot. For this step we use 'serialized buffer' to ascii metadata. Instead of recreating this buffer after whole 'vg_write()' we use this buffer instantly for creating of precommitted VG. This has also the advantage of catching any problems with reparsing of ascii metadata back to VG early before any write.	2021-03-08 15:30:18 +01:00
Zdenek Kabelac	a125a3bb50	lv_remove: reduce commits for removed LVs This patch postpones update of lvm metadata for each removed LV for later moment depending on LV type. It also queues messages to be printed after such write & commit. As such there is some change in the behavior - although before prompt we do make write&commit happens automatically in some other error case we rather keep 'existing' state - so there could be difference in amount of removed & committed LVs. IMHO the introduced logic is slightly better and safer. But some cases still need the early commit - i.e. thin-removal and fixing this needs some more thinking. TODO: improve removal at least with the case of the whole thin-pool. i.e. we can simply recognize removal of 'all LVs/whole VG'.	2021-03-08 15:25:05 +01:00
Zdenek Kabelac	eb1160ee42	lvremove: backup at the end of loop Taking backup with each removed LV is slowing down the process considerably and is largely unneeded. We are supposed to take backup only on significant points and making sure the backup is correct when the command is finished. TODO: check how many other commands can be improved.	2021-03-02 22:54:40 +01:00
David Teigland	83fe6e720f	device usage based on devices file The LVM devices file lists devices that lvm can use. The default file is /etc/lvm/devices/system.devices, and the lvmdevices(8) command is used to add or remove device entries. If the file does not exist, or if lvm.conf includes use_devicesfile=0, then lvm will not use a devices file. When the devices file is in use, the regex filter is not used, and the filter settings in lvm.conf or on the command line are ignored. LVM records devices in the devices file using hardware-specific IDs, such as the WWID, and attempts to use subsystem-specific IDs for virtual device types. These device IDs are also written in the VG metadata. When no hardware or virtual ID is available, lvm falls back using the unstable device name as the device ID. When devnames are used, lvm performs extra scanning to find devices if their devname changes, e.g. after reboot. When proper device IDs are used, an lvm command will not look at devices outside the devices file, but when devnames are used as a fallback, lvm will scan devices outside the devices file to locate PVs on renamed devices. A config setting search_for_devnames can be used to control the scanning for renamed devname entries. Related to the devices file, the new command option --devices <devnames> allows a list of devices to be specified for the command to use, overriding the devices file. The listed devices act as a sort of devices file in terms of limiting which devices lvm will see and use. Devices that are not listed will appear to be missing to the lvm command. Multiple devices files can be kept in /etc/lvm/devices, which allows lvm to be used with different sets of devices, e.g. system devices do not need to be exposed to a specific application, and the application can use lvm on its own set of devices that are not exposed to the system. The option --devicesfile <filename> is used to select the devices file to use with the command. 
Without the option set, the default system devices file is used. Setting --devicesfile "" causes lvm to not use a devices file. An existing, empty devices file means lvm will see no devices. The new command vgimportdevices adds PVs from a VG to the devices file and updates the VG metadata to include the device IDs. vgimportdevices -a will import all VGs into the system devices file. LVM commands run by dmeventd do not use a devices file by default, and will look at all devices on the system. A devices file can be created for dmeventd (/etc/lvm/devices/dmeventd.devices) If this file exists, lvm commands run by dmeventd will use it. Internal implementation: - device_ids_read - read the devices file . add struct dev_use (du) to cmd->use_devices for each devices file entry - dev_cache_scan - get /dev entries . add struct device (dev) to dev_cache for each device on the system - device_ids_match - match devices file entries to /dev entries . match each du on cmd->use_devices to a dev in dev_cache, using device ID . on match, set du->dev, dev->id, dev->flags MATCHED_USE_ID - label_scan - read lvm headers and metadata from devices . filters are applied, those that do not need data from the device . filter-deviceid skips devs without MATCHED_USE_ID, i.e. skips /dev entries that are not listed in the devices file . read lvm label from dev . filters are applied, those that use data from the device . read lvm metadata from dev . add info/vginfo structs for PVs/VGs (info is "lvmcache") - device_ids_find_renamed_devs - handle devices with unstable devname ID where devname changed . this step only needed when devs do not have proper device IDs, and their dev names change, e.g. after reboot sdb becomes sdc. . detect incorrect match because PVID in the devices file entry does not match the PVID found when the device was read above . undo incorrect match between du and dev above . search system devices for new location of PVID . 
update devices file with new devnames for PVIDs on renamed devices . label_scan the renamed devs - continue with command processing	2021-02-23 16:43:32 -06:00
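The matching step from `83fe6e720f` (device_ids_match followed by the filter-deviceid check) can be sketched in miniature. The structs below are heavily simplified stand-ins, the function names are hypothetical, and the flag value is an assumption. Only the overall flow (match each `du` to a `dev` by device ID, set `du->dev` and the `MATCHED_USE_ID` flag, then skip unflagged devices) follows the commit message.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MATCHED_USE_ID 0x1	/* flag name from the message; value assumed */

struct device {			/* simplified stand-in for a dev_cache entry */
	const char *name;	/* e.g. "/dev/sdb" */
	const char *wwid;	/* hardware ID, or NULL if none is available */
	unsigned flags;
};

struct dev_use {		/* simplified stand-in for a devices file entry */
	const char *idname;	/* device ID recorded in the devices file */
	struct device *dev;	/* set when a matching device is found */
};

/* Match each devices file entry to one device; returns the match count. */
static size_t match_device_ids(struct dev_use *dus, size_t ndu,
			       struct device *devs, size_t ndev)
{
	size_t matched = 0;

	assert(dus && devs);

	for (size_t i = 0; i < ndu; i++) {
		for (size_t j = 0; j < ndev; j++) {
			/* Fall back to the unstable devname when no WWID exists. */
			const char *id = devs[j].wwid ? devs[j].wwid : devs[j].name;

			if (devs[j].flags & MATCHED_USE_ID)
				continue;	/* already claimed by another entry */
			if (strcmp(dus[i].idname, id))
				continue;

			dus[i].dev = &devs[j];
			devs[j].flags |= MATCHED_USE_ID;
			matched++;
			break;
		}
	}

	return matched;
}

/* Devices not listed in the devices file are skipped by later scanning. */
static int passes_deviceid_filter(const struct device *dev)
{
	return (dev->flags & MATCHED_USE_ID) != 0;
}
```

The sketch leaves out the renamed-device handling (device_ids_find_renamed_devs), which only applies to entries that fell back to devnames.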
Zdenek Kabelac	5d820b0201	cleanup: comment typo	2021-02-23 14:56:48 +01:00
Zdenek Kabelac	ac09fa08aa	lvextend: enable resize of writecached LV	2021-02-23 14:56:47 +01:00
Zdenek Kabelac	a915cd5a46	lvconvert: vdo may convert already formatted vdo Users can use 'lvconvert -Zn --type vdo-pool' to convert an existing vdo formatted volume and skip lvm2 internal formatting. This however requires that the user passes proper matching parameters. For them the user can use the --profile\|--metadataprofile option, whose support has also been enhanced. TODO: add support to read values directly from formatted volume.	2021-02-17 11:21:35 +01:00
Zdenek Kabelac	096edeee71	lv_manip: avoid removing LV when converting In some cases we use 'creation' also during conversion. Here it can be actually unwanted side effect we may remove not just newly created layers - but also original converted LV. So until we make clear how to properly revert from some errors in middle of conversion, disable removal for any 'lvconvert' commands.	2021-02-17 11:21:35 +01:00
Zdenek Kabelac	3cc9efc0ed	snapshot: create origin of virtual snap read only When creating old fashioned way thick virtual snapshot, use read-only 'zero' _vorigin device.	2021-02-10 15:39:03 +01:00
Zdenek Kabelac	e429e69b65	dev-type: dev_is_pmem reuses topology read code	2021-02-08 23:43:38 +01:00
Zdenek Kabelac	5ec24dfb0b	lv_resize: support resizing of cached volumes Automatically figure out resizable layer in the LV stack and resize it online. Split check for reshaped raids and postpone removal of unused space after finished reshaping after metadata archiving. Drop warning about unsupported automatic resize of monitored thin-pool. Currently there is not yet support for resize of writecache.	2021-02-08 23:43:10 +01:00
Zdenek Kabelac	39dec26508	lv_manip: reuse function also during reduction Move function _setup_lv_size() in front of _lv_reduce() so it can be reused also in this function. Avoid propagating 0 length to upper layer.	2021-02-08 23:18:44 +01:00
Zdenek Kabelac	bdc2f4c704	lv_resize: use 'bad' code path for error case	2021-02-08 23:18:44 +01:00
Zdenek Kabelac	eed060f040	thin: check for overprovisioning only once	2021-02-08 23:18:44 +01:00
Zdenek Kabelac	99e168162a	thinpool: use lv_config_profil for crop_metadata Better support for thin-pools with individual profiles introduced in the recent patch `b4212be2e7`.	2021-02-08 23:18:44 +01:00
David Teigland	87ee401eea	md component detection changes Move extra md component detection into the label scan phase. It had been in set_pv_devices which was deep within the vg_read phase, which wasn't a good place (better to detect that earlier.) Now that pv metadata info is available in the scan phase, the pv details (size and device_hint) can be used for extra md checking. Use the device_hint from the pv metadata to trigger a full md component check if the device_hint begins with /dev/md. Stop triggering full md component checks based on missing udev info for a dev. Changes to tests to reflect that the code is now detecting md components in some test case that it wasn't before.	2021-02-05 16:23:51 -06:00
Zdenek Kabelac	51c83f1483	lvcreate: use lv_passes_readonly_filter Check if created LV is going to be activated read-only because such LV cannot be zeroed (equals to use option '-pr').	2021-02-02 21:23:39 +01:00
Zdenek Kabelac	3acf6040b5	wipe: reformat message for failure case Use the same error message layout to match BLKZEROUT look. Makes testing easier.	2021-02-01 12:13:49 +01:00
Zdenek Kabelac	be0bf43d74	allocation: report allocation error instead of crash Current allocation limitation requires to fit metadata/log LV on a single PV. This is usually not a big problem, but since thin-pool and cache-pool is using this for allocating extents for their metadata LVs it might be eventually causing errors where the remaining free spaces for large metadata size is spread over several PV.	2021-02-01 12:13:49 +01:00
Zdenek Kabelac	45f0c48365	pvmove: automatically resolve whole stacked LV When passing 'pvmove --name arg' try to automatically move all associated dependencies with given LV. i.e. 'pvmove --name thinpool vg vgnew' moves all thins and data and metadata LV into a new VG vgnew.	2021-02-01 12:06:13 +01:00
Zdenek Kabelac	abc9265a06	cache: reuse code for metadata min_max Use update_pool_metadata_min_max() which is shared with thin-pool metadata min-max updating. Gives improved messages when converting volumes to metadata.	2021-02-01 12:06:13 +01:00
Zdenek Kabelac	f96b455506	pool: limit pmspare to 16GiB There is not much point to let allocate more than this size even when i.e. converted LV is bigger than 16GiB (%extent_size) ATM neither thin-pool nor cache-pool supports bigger metadata.	2021-02-01 12:06:13 +01:00
Zdenek Kabelac	b4212be2e7	thin: improve 16g support for thin pool metadata Initial support for thin-pool used slightly smaller max size 15.81GiB for thin-pool metadata. However the real limit later settled at 15.88GiB (difference is ~64MiB - 16448 4K blocks). lvm2 could not simply increase the size as it has been using hard cropping of the loaded metadata device to avoid kernel warnings when the size was bigger (i.e. due to bigger extent_size). This patch adds the new lvm.conf configurable setting: allocation/thin_pool_crop_metadata which defaults to 0 -> no crop of metadata beyond 15.81GiB. Only users with these sizes of metadata will be affected. Without cropping lvm2 now limits metadata allocation size to 15.88GiB. Any space beyond is currently not used by thin-pool target. Even if i.e. bigger LV is used for metadata via lvconvert, or allocated bigger because of too large extent size. With cropping enabled (=1) lvm2 preserves the old limitation 15.81GiB and should allow to work in the environment with older lvm2 tools (i.e. older distribution). Thin-pool metadata with size bigger than 15.81G is now using CROP_METADATA flag within lvm2 metadata, so older lvm2 recognizes an incompatible thin-pool and cannot activate such pool! Users should use uncropped version as it is not suffering from various issues between thin_repair results and allocated metadata LV as thin_repair limit is 15.88GiB. Users should use cropping only when really needed! Patch also better handles resize of thin-pool metadata and prevents resize beyond usable size 15.88GiB. Resize beyond 15.81GiB automatically switches pool to no-crop version. Even with existing bigger thin-pool metadata command 'lvextend -l+1 vg/pool_tmeta' does the change. Patch gives better controls 'converted' metadata LV and reports less confusing message during conversion. Patch set also moves the code for updating min/max into pool_manip.c for better sharing with cache_pool code.	2021-02-01 12:06:13 +01:00
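The two limits in `b4212be2e7` can be made concrete with a small sketch. The exact KiB values below are assumptions chosen to agree with the rounded figures in the message (15.88GiB, 15.81GiB) and with the stated difference of 16448 4K blocks. The macro and function names are hypothetical.

```c
#include <stdint.h>

/* Assumed exact limits, in KiB, consistent with the rounded values above. */
#define THIN_META_MAX_KB	UINT64_C(16646400)		/* 15.875 GiB */
#define THIN_META_CROP_DIFF_KB	(UINT64_C(16448) * 4)		/* 16448 4K blocks */
#define THIN_META_CROPPED_KB	(THIN_META_MAX_KB - THIN_META_CROP_DIFF_KB)

/*
 * Hypothetical helper: how much of a metadata LV the thin-pool can use.
 * crop != 0 models allocation/thin_pool_crop_metadata = 1 (old limit),
 * crop == 0 models the new default.
 */
static uint64_t usable_thin_metadata_kb(uint64_t lv_size_kb, int crop)
{
	uint64_t limit = crop ? THIN_META_CROPPED_KB : THIN_META_MAX_KB;

	return lv_size_kb < limit ? lv_size_kb : limit;
}
```

With these values the difference is 65792 KiB (64.25 MiB), and a 20 GiB metadata LV yields 16646400 KiB usable without cropping and 16580608 KiB (15.8125 GiB) with it.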
David Teigland	a690d16d29	writecache: use cleaner message instead of table reload When detaching writecache, make the first stage send a message to dm-writecache to set the cleaner option. This is instead of reloading the dm table with the cleaner option set. Reloading the table causes udev to process/probe the dm dev, which gets stalled because of the writeback activity, and the stalled udev in turn stalls the lvconvert command when it tries to sync with udev events. When getting writecache status we do not need to get open_count or read_head info, which can cause extra steps.	2021-01-28 15:14:25 -06:00
Heinz Mauelshagen	f08ef23856	lvdisplay: enhance LV status output for raid(0) In case legs of a raid0 LV are removed, the lvdisplay command still reports 'available' though raid0 is not providing any resilience compared to the other raid levels. Also lvdisplay does not display '(partial)' in case of missing raid0 legs as opposed to the lvs command. Enhance lvdisplay to report "NOT available" for any RaidLV type in case too many legs are inaccessible hence causing data loss. I.e. any leg for raid0, all for raid1, more than 1 for raid4/5, more than 2 for raid6 and in case of completely lost mirror groups for raid10. Add test/shell/lvdisplay-raid.sh. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1872678	2021-01-27 16:56:22 +01:00
Zdenek Kabelac	8532b1ca97	vdo: support online rename New VDO targets v6.2.3 corrects support for online rename of VDO device. If needed, it can be disabled via new lvm.conf setting: vdo_disabled_features = [ "online_rename" ]	2021-01-22 15:30:37 +01:00
Zdenek Kabelac	4b8e5ad595	pools: fix removal of spare volume When removing pool LV from a stacked LV setup, it's been possible to leak _pmspare and such hidden LV then required manual user removal. Fix it by moving automatic removal into _lv_reduce().	2021-01-22 15:30:37 +01:00
David Teigland	0534723a2d	integrity: fix segfault on error path when replacing images When adding replacement raid+integrity images (lvconvert --repair after a raid image is lost), various errors can cause the function to exit with an error. On this exit path, the function attempts to revert new images that had been created but not yet used. The cleanup failed to account for the fact that not all images needed to be reverted.	2021-01-13 13:39:33 -06:00
Zdenek Kabelac	0b6ee6a912	alloc: enhance estimation of sufficient_pes_free Since commit `77fdc17d70` always include log_len size into needed extents - however now we may need sometimes more extents than necessary - mainly when multiple PVs are involved in allocation. Add logs_still_needed into calculation of sufficient_pes_free()	2021-01-13 12:54:45 +01:00
David Teigland	b84a9927b7	partial flag for writecache and integrity When a writecache sublv or an integrity metadata sublv are partial (missing a dev), set the partial flag on the upper level LV also, as is done for other sublvs.	2020-12-11 16:25:25 -06:00
David Teigland	9fe7aba251	cache: activation cache_check on cachevol When using cache with a cachevol, the cache_check tool was not being run on the cache metadata during activation. cache_check clears the needs_check flag in the cache metadata, so if the flag was set due to an unclean shutdown, the activation would fail.	2020-12-09 17:36:09 -06:00
David Teigland	5fef89361d	integrity: display total mismatches at raid LV level Each integrity image in a raid LV reports its own number of integrity mismatches, e.g. lvs -o integritymismatches vg/lv_rimage_0 lvs -o integritymismatches vg/lv_rimage_1 In addition to this, allow the total number of integrity mismatches from all images to be displayed for the raid LV. lvs -o integritymismatches vg/lv shows the number of mismatches from both lv_rimage_0 and lv_rimage_1.	2020-11-11 15:10:15 -06:00
Zdenek Kabelac	7bafae48bb	gcc: cleanup warns from older gcc	2020-10-26 13:06:53 +01:00
Zdenek Kabelac	9740e98cbd	lv_manip: add space into message Just add space between %s(.	2020-10-24 01:42:16 +02:00
David Teigland	6226512ad2	get dev size when setting pv device In some cases the dev size may not have been read yet in set_pv_devices(). In this case get the dev size before comparing the dev size with the pv size.	2020-10-22 13:19:17 -05:00
Zdenek Kabelac	b75c2dfe1b	debug: shorten error message Just check for sigint during log_error().	2020-10-19 16:53:18 +02:00
Zdenek Kabelac	e7fff97b8d	wipe_lv: use BLKZEROOUT when possible Since the BLKZEROOUT ioctl is supposedly the fastest way to clear a block device, start using this ioctl for zeroing a device. Commonly we zero only a small portion of a device (8KiB) - however since we now also started to zero metadata devices, in the case of e.g. thin-pool metadata this can go up to ~16GiB and here the performance starts to be noticeable.	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	c65d3a6b8a	wipe_lv: interruptible wiping Since we now block signals and wiping may take unexpectedly long time - support breaking command while wipe is in progress.	2020-10-02 21:03:19 +02:00
Zdenek Kabelac	7396f1cfee	wipe_lv: drop label_scan_invalidate on error path Since dev_set_bytes() now closes dev on error path itself, remove this unneeded call now (introduced few commits back in history thus removing comment from WHATS_NEW)	2020-10-02 21:02:04 +02:00
David Teigland	c32d7fed4f	writecache: use two step detach When detaching a writecache, use the cleaner setting by default to writeback data prior to suspending the lv to detach the writecache. This avoids potentially blocking for a long period with the device suspended. Detaching a writecache first sets the cleaner option, waits for a short period of time (less than a second), and checks if the writecache has quickly become clean. If so, the writecache is detached immediately. This optimizes the case where little writeback is needed. If the writecache does not quickly become clean, then the detach command leaves the writecache attached with the cleaner option set. This leaves the LV in the same state as if the user had set the cleaner option directly with lvchange --cachesettings cleaner=1 LV. After leaving the LV with the cleaner option set, the detach command will wait and watch the writeback progress, and will finally detach the writecache when the writeback is finished. The detach command does not need to wait during the writeback phase, and can be canceled, in which case the LV will remain with the writecache attached and the cleaner option set. When the user runs the detach command again it will complete the detach. To detach a writecache directly, without using the cleaner step (which has been the approach previously), add the option --cachesettings cleaner=0 to the detach command.	2020-10-01 11:33:02 -05:00
David Teigland	2272a32e6f	lvmlockd vdo: add support lvmlockd handling for vdo lv and vdo pool is like thin lv and thin pool.	2020-09-29 14:43:27 -05:00
Zdenek Kabelac	bd0d4de4e2	active: fix compilation without devmapper Better support for compilation without device-mapper.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	4de6f58085	thin: use lv_status_thin and lv_status_thin_pool Introduce structures lv_status_thin_pool and lv_status_thin (pair to lv_status_cache, lv_status_vdo) Convert lv_thin_percent() -> lv_thin_status() and lv_thin_pool_percent() + lv_thin_pool_transaction_id() -> lv_thin_pool_status(). This way a function user can see not only percentages, but also other important status info about thin-pool. TODO: This patch tries to not change too many other things, but pool_below_threshold() now uses new thin-pool info to return failure if thin-pool cannot be actually modified. This should be handled separately in a better way.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	92c0e8c17f	writecache: archive before modification of metadata Archive before we start to modify metadata.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	08e838f488	cleanup: avoid unneeded check Since creation of thin snapshot already makes sure, the message list is empty, there is no need to check this again.	2020-09-29 10:43:56 +02:00
Heinz Mauelshagen	8952dcbff0	Revert "lvconvert: display warning if raid1 LV image count does not change" This reverts superfluous commit `3c9177fdc0` as _lv_raid_change_image_count() already checks for non-changed image count. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1872130	2020-09-28 17:14:03 +02:00
Zdenek Kabelac	e414ebef6e	thin: pass through whole code Instead of early 'return 0' let the whole code finish in case of an error with syncing.	2020-09-25 22:59:35 +02:00
Zdenek Kabelac	ef59c83f2d	thin: enhance lvcreate error paths Improve error response and reporting, when creating thin snapshots. If the thin pool kernel metadata already has a device with the ID lvm2 tries to create, give more meaningful error message and also properly restore transaction id to the value known to thin-pool in this case. Before it's been possible to divert by one from kernel TID value, and lvm2 stacked delete message for such thin device.	2020-09-25 22:56:40 +02:00
Zdenek Kabelac	e2eb1dc501	thin: no delete message for device_id 0 Since we always use device_id > 0, we could use device_id == 0 to actually mark thinLV as an LV we want to remove without delete message.	2020-09-25 22:54:07 +02:00
Zdenek Kabelac	7c19186271	vdo: disable support for online rename of vdopool LV Since ATM kernel does not support this operation, disable 'lvrename' of an active vdopool. As a workaround, user may simply deactivate, rename and activate.	2020-09-23 13:18:23 +02:00
Zdenek Kabelac	3a3307c0d8	vdo: enhance vdo pool extension When user tries to extend vdo pool - he needs to go always at least by 1 full VDO slab (defined as vdo_slab_size_mb). To avoid all trouble around finding a 'workable' size - lvm2 automatically increases the passed (or by --use-policies calculated) extension size (and informs a user about sometimes possibly large increase as slab size can go up to 32GiB). With VDO users need to always 'think-big' anyway and expect such operation to be in GiB domain range.	2020-09-22 23:28:43 +02:00
Zdenek Kabelac	f38b7afd62	vdo: extend vdo segment validation Try to catch all suspicious VDO segments in metadata early.	2020-09-22 23:25:16 +02:00
Zdenek Kabelac	642ef54399	vdo: correct message about policy extend support Policy extend is already supported for vdo pools as well, so correct the error message.	2020-09-22 23:25:16 +02:00
Zdenek Kabelac	5bc66532c7	activation: use revert_lv on tree suspend failure When the table reload fails during suspend() - we were only calling plain resume() - and this will reload only those devices, which were left suspended, but will not try to restore metadata state according to lvm2 reverted metadata. So if we were reloading device tree - we have restored only top-level LV and rest of reverted device manipulation were left alone and possibly mismatched what is in committed metadata. FIXME: There are several cases where such revert will likely not work properly anyway as some operations are currently handled in single commit, while they need multiple commits, but it's step towards better correctness. At least we catch these errors now earlier.	2020-09-22 21:02:14 +02:00
David Teigland	1404e5ee61	metadata: open rw fd before closing ro fd lvm opens devices readonly to scan them, but needs to open them readwrite to update the metadata. Previously, the ro fd was closed before the rw fd was opened, leaving a small gap where the dev was not held open, and during which the dev could possibly change which storage it referred to. With the bcache_change_fd() interface, lvm opens a rw fd on a device to be written, tells bcache to change to the new rw fd, and closes the ro fd. . open dev ro . read dev with the ro fd (label_scan) . lock vg (ex for writing) . open dev rw . close ro fd . rescan dev to check if the metadata changed between the scan and the lock . if the metadata did change, reread in full . write the metadata	2020-09-18 15:10:11 -05:00
David Teigland	1570e76233	bcache: use indirection table for fd Add a "device index" (di) for each device, and use this in the bcache api to the rest of lvm. This replaces the file descriptor (fd) in the api. The rest of lvm uses new functions bcache_set_fd(), bcache_clear_fd(), and bcache_change_fd() to control which fd bcache uses for io to a particular device. . lvm opens a dev and gets an fd. fd = open(dev); . lvm passes fd to the bcache layer and gets a di to use in the bcache api for the dev. di = bcache_set_fd(fd); . lvm uses bcache functions, passing di for the dev. bcache_write_bytes(di, ...), etc. . bcache translates di to fd to do io. . lvm closes the device and clears the di/fd bcache state. close(fd); bcache_clear_fd(di); In the bcache layer, a di-to-fd translation table (int *_fd_table) is added. When bcache needs to perform io on a di, it uses _fd_table[di]. In the following commit, lvm will make use of the new bcache_change_fd() function to change the fd that bcache uses for the dev, without dropping cached blocks.	2020-09-18 15:10:11 -05:00
Zdenek Kabelac	2b36542f41	wipe: dev_set_bytes resolves zeroing Since dev_write_zeros() is just subset of dev_set_bytes() use it directly and simplify code.	2020-09-15 23:07:06 +02:00
Zdenek Kabelac	d588de77aa	wipe: convert zero_value to uint8_t We always write this value as byte.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	ec4e8b5c0e	wipe: zeroing of 8 sectors is granted With do_zero min is always 8 sectors, so use 0 as default.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	187cc8d344	lvcreate: change error message Provide more useful error message.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	39198eb2ce	lvcreate: add extra synchronization at error path Put explicit udev synchronization before we try to deactivate devices.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	b2978efbff	cache: simplier signal handling Use just single sigint_allow()/restore() within flushing loop and avoid one extra signal manipulation.	2020-09-14 00:15:14 +02:00
Zdenek Kabelac	77fdc17d70	alloc: improve estimation of sufficient_pes_free Metadata size was calculated correctly only for raids. Fixes problem for crash during lvcreate when thin-pool was created on a VG where remaining free space had the size to only fit a single metadata LV and not also its _pmspare. Lvcreate crashed with this assert message: lvcreate: metadata/pv_map.c:198: consume_pv_area: Assertion `to_go <= pva->count' failed. Aborted (core dumped) TODO: there is probably too large overload of several alloc_handle variables. Reported-by: Wu Guanghao <wuguanghao3@huawei.com> Reported-by: Zhiqiang Liu <liuzhiqiang26@huawei.com>	2020-09-11 21:51:24 +02:00
Zdenek Kabelac	9f78acfee9	thin: compensate metadata size by extra percent When using --use-policy for automatic extension of thin-pool, the extension of thin-pool's metadata itself can actually take some extra space. Since I'm not aware of exact compensation formula, add just 1% extra to calculated amount and hope it fits. Wanted target is to always have usable thin-pool that fits below pool_metadata_min_threshold().	2020-09-11 21:42:37 +02:00
Zdenek Kabelac	b798554a20	lv_manip: even better rounding	2020-09-11 13:37:04 +02:00
Zdenek Kabelac	678951f635	cleanup: comment typo	2020-09-10 23:55:03 +02:00
Zdenek Kabelac	e7bd3ba22d	debug: drop debug trace from regular path Since we query on regular code these: lv_raid_has_integrity() lv_has_integrity_recalculate_metadata() without prior checking for lv_is_raid() - these 'return 0' should not use <stacktrace> as they are expected.	2020-09-10 23:55:03 +02:00
Zdenek Kabelac	bc09803628	lv_manip: relocate check to proper function	2020-09-10 23:54:33 +02:00
Zdenek Kabelac	e7f5acdfa6	lvextend: improve percentage estimation Correcting rounding rules for percentage evaluation. Validate supported range of percentage. (although ranges are already validated earlier on code path)	2020-09-10 23:54:31 +02:00
Zdenek Kabelac	3e6bb77228	lv_manip: add synchronization points	2020-09-08 21:23:03 +02:00
David Teigland	d1019a6434	integrity: improve lv type checks	2020-09-02 12:40:45 -05:00
David Teigland	9a7b81fb72	integrity: fix segfault for lv with no seg in lv_raid_has_integrity	2020-09-02 09:15:58 -05:00
David Teigland	ed249a2c53	integrity: report mismatches with lvs -o integritymismatches reported for integrity images, which may report different values	2020-09-01 17:13:21 -05:00
David Teigland	f2c1de783c	integrity: always default to journal mode lvconvert was defaulting to bitmap mode, and lvcreate was defaulting to journal mode.	2020-09-01 17:12:28 -05:00
Zdenek Kabelac	672d5ad98b	gcc: hide warn about possible uninitialized use of dev_ret Older gcc reports this fp problem.	2020-09-01 23:40:24 +02:00
Zdenek Kabelac	56c41b7522	cov: avoid duplicated assign	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	fd96f1014b	gcc: zero-sized array to fexlible array C99 Switch remaining zero sized struct to flexible arrays to be C99 compliant. These simple rules should apply: - The incomplete array type must be the last element within the structure. - There cannot be an array of structures that contain a flexible array member. - Structures that contain a flexible array member cannot be used as a member of another structure. - The structure must contain at least one named member in addition to the flexible array member. Although some of the code pieces should be still improved.	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	b722ce2f10	gcc: drop bogus ;	2020-08-28 21:43:03 +02:00
Zdenek Kabelac	ee0cb17608	gcc: use apropriate type for reading and printing values	2020-08-28 21:43:03 +02:00
Zdenek Kabelac	ff4827ffb1	lv_manip: get_default_region_size return uint32_t	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	03f9cd95b4	writecache: correct usage of const struct	2020-08-28 21:43:02 +02:00
David Teigland	9a88a9c4ce	Revert "lvdisplay: dispaly correct status when underlying devs missing" This reverts commit `1d0dc74f91`. We should avoid adding anything new to lvdisplay and report new information via lvs reporting fields.	2020-08-28 13:28:15 -05:00
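
The "bcache: use indirection table for fd" entry (1570e76233) describes a device-index-to-fd translation table. A minimal sketch of that idea follows, assuming a simple growable array. The `sketch_*` names, the slot-reuse policy, and the signatures are illustrative only and are not taken from the lvm2 sources; the real `bcache_set_fd()`, `bcache_change_fd()`, and `bcache_clear_fd()` may differ.

```c
#include <assert.h>
#include <stdlib.h>

#define NO_FD (-1)

/* Hypothetical stand-in for the di-to-fd table the commit calls _fd_table. */
static int *fd_table;
static int fd_table_size;

/* Double the table (or create it); new slots start out empty. */
static int grow_table(void)
{
	int new_size = fd_table_size ? fd_table_size * 2 : 8;
	int *t = realloc(fd_table, (size_t) new_size * sizeof(*t));

	if (!t)
		return 0;

	for (int i = fd_table_size; i < new_size; i++)
		t[i] = NO_FD;

	fd_table = t;
	fd_table_size = new_size;
	return 1;
}

/* Register fd and return its device index (di), or -1 on failure. */
int sketch_set_fd(int fd)
{
	int di;

	/* Reuse the first empty slot if there is one. */
	for (di = 0; di < fd_table_size; di++)
		if (fd_table[di] == NO_FD) {
			fd_table[di] = fd;
			return di;
		}

	/* No free slot: the first new slot after growing is at the old size. */
	di = fd_table_size;
	if (!grow_table())
		return -1;

	fd_table[di] = fd;
	return di;
}

/* Swap the fd behind an existing di; anything keyed by di stays valid. */
int sketch_change_fd(int di, int fd)
{
	if (di < 0 || di >= fd_table_size || fd_table[di] == NO_FD)
		return 0;

	fd_table[di] = fd;
	return 1;
}

/* Forget the fd for di; the caller remains responsible for close(fd). */
void sketch_clear_fd(int di)
{
	if (di >= 0 && di < fd_table_size)
		fd_table[di] = NO_FD;
}

/* Translate di to fd at io time; NO_FD if di is not registered. */
int sketch_lookup_fd(int di)
{
	return (di >= 0 && di < fd_table_size) ? fd_table[di] : NO_FD;
}
```

Because callers hold only the di, the read-only fd can be replaced by a read-write fd (as in 1404e5ee61) through one table update, without touching whatever is cached under that di.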

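
The "lvdisplay: enhance LV status output for raid(0)" entry (f08ef23856) lists how many lost legs cause data loss for each raid level. That rule can be written as a small predicate; this is only a sketch of the rule as stated in the commit message, with hypothetical names, not the lvdisplay code. raid10 is left out because it depends on which mirror groups the missing legs belong to.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical enum; lvm2 identifies raid levels through segment types. */
enum sketch_raid_level {
	SKETCH_RAID0,
	SKETCH_RAID1,
	SKETCH_RAID4,
	SKETCH_RAID5,
	SKETCH_RAID6
};

/*
 * Return true when 'missing' inaccessible legs out of 'legs' imply data loss:
 * any leg for raid0, all legs for raid1, more than 1 for raid4/5,
 * more than 2 for raid6.
 */
bool sketch_raid_data_lost(enum sketch_raid_level level,
			   unsigned legs, unsigned missing)
{
	switch (level) {
	case SKETCH_RAID0:
		return missing > 0;
	case SKETCH_RAID1:
		return legs > 0 && missing >= legs;
	case SKETCH_RAID4:
	case SKETCH_RAID5:
		return missing > 1;
	case SKETCH_RAID6:
		return missing > 2;
	}

	/* Unknown level: report loss rather than claim the LV is usable. */
	return true;
}
```

For example, a 3-leg raid1 with two legs missing still has one full copy, so the predicate returns false, while a raid0 with a single missing leg returns true.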
3145 Commits