shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	51c83f1483	lvcreate: use lv_passes_readonly_filter Check if created LV is going to be activated read-only because such LV cannot be zeroed (equals to use option '-pr').	2021-02-02 21:23:39 +01:00
David Teigland	bee9f4efdd	filter-mpath: work with nvme devices Recognize when a device is nvme, and apply filter-mpath to nvme devices in addition to scsi devices.	2021-02-02 13:01:20 -06:00
Zdenek Kabelac	026d94d882	gcc: hide uninitialized warning with older gcc	2021-02-01 14:54:49 +01:00
Zdenek Kabelac	3acf6040b5	wipe: reformat message for failure case Use the same error message layout to match BLKZEROUT look. Makes testing easier.	2021-02-01 12:13:49 +01:00
Zdenek Kabelac	be0bf43d74	allocation: report allocation error instead of crash Current allocation limitation requires to fit metadata/log LV on a single PV. This is usually not a big problem, but since thin-pool and cache-pool is using this for allocating extents for their metadata LVs it might be eventually causing errors where the remaining free spaces for large metadata size is spread over several PV.	2021-02-01 12:13:49 +01:00
Zdenek Kabelac	45f0c48365	pvmove: automatically resolve whole stacked LV When passing 'pvmove --name arg' try to automatically move all associated dependencies with given LV. i.e. 'pvmove --name thinpool vg vgnew' moves all thins and data and metadata LV into a new VG vgnew.	2021-02-01 12:06:13 +01:00
Zdenek Kabelac	abc9265a06	cache: reuse code for metadata min_max Use update_pool_metadata_min_max() which is shared with thin-pool metadata min-max updating. Gives improved messages when converting volumes to metadata.	2021-02-01 12:06:13 +01:00
Zdenek Kabelac	f96b455506	pool: limit pmspare to 16GiB There is not much point to let allocate more then this size even when i.e. converted LV is bigger then 16GiB (%extent_size) ATM neither thin-pool nor cache-pool supports bigger metadata.	2021-02-01 12:06:13 +01:00
Zdenek Kabelac	b4212be2e7	thin: improve 16g support for thin pool metadata Initial support for thin-pool used slightly smaller max size 15.81GiB for thin-pool metadata. However the real limit later settled at 15.88GiB (difference is ~64MiB - 16448 4K blocks). lvm2 could not simply increase the size as it has been using hard cropping of the loaded metadata device to avoid warnings printing warning of kernel when the size was bigger (i.e. due to bigger extent_size). This patch adds the new lvm.conf configurable setting: allocation/thin_pool_crop_metadata which defaults to 0 -> no crop of metadata beyond 15.81GiB. Only user with these sizes of metadata will be affected. Without cropping lvm2 now limits metadata allocation size to 15.88GiB. Any space beyond is currently not used by thin-pool target. Even if i.e. bigger LV is used for metadata via lvconvert, or allocated bigger because of to large extent size. With cropping enabled (=1) lvm2 preserves the old limitation 15.81GiB and should allow to work in the evironement with older lvm2 tools (i.e. older distribution). Thin-pool metadata with size bigger then 15.81G is now using CROP_METADATA flag within lvm2 metadata, so older lvm2 recognizes an incompatible thin-pool and cannot activate such pool! Users should use uncropped version as it is not suffering from various issues between thin_repair results and allocated metadata LV as thin_repair limit is 15.88GiB Users should use cropping only when really needed! Patch also better handles resize of thin-pool metadata and prevents resize beoyond usable size 15.88GiB. Resize beyond 15.81GiB automatically switches pool to no-crop version. Even with existing bigger thin-pool metadata command 'lvextend -l+1 vg/pool_tmeta' does the change. Patch gives better controls 'coverted' metadata LV and reports less confusing message during conversion. Patch set also moves the code for updating min/max into pool_manip.c for better sharing with cache_pool code.	2021-02-01 12:06:13 +01:00
David Teigland	a690d16d29	writecache: use cleaner message instead of table reload When detaching writecache, make the first stage send a message to dm-writecache to set the cleaner option. This is instead of reloading the dm table with the cleaner option set. Reloading the table causes udev to process/probe the dm dev, which gets stalled because of the writeback activity, and the stalled udev in turn stalls the lvconvert command when it tries to sync with udev events. When getting writecache status we do not need to get open_count or read_head info, which can cause extra steps.	2021-01-28 15:14:25 -06:00
Heinz Mauelshagen	f08ef23856	lvdisplay: enhance LV status output for raid(0) In case legs of a raid0 LV are removed, the lvdisplay command still reports 'available' though raid0 is not providing any resilience compared to the other raid levels. Also lvdisplay does not display '(partial)' in case of missing raid0 legs as oposed to the lvs command. Enhance lvdisplay to report "NOT available" for any RaidLV type in case too many legs are inaccessible hence causing data loss. I.e. any leg for raid0, all for raid1, more than 1 for raid4/5, more than 2 for raid6 and in case of completely lost mirror groups for raid10. Add test/shell/lvdisplay-raid.sh. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1872678	2021-01-27 16:56:22 +01:00
Zdenek Kabelac	ce6e74f485	filters: avoid duplicated // sysfs_dir already goes with '/sys/'.	2021-01-22 15:30:37 +01:00
Zdenek Kabelac	8532b1ca97	vdo: support online rename New VDO targets v6.2.3 corrects support for online rename of VDO device. If needed if can be disable via new lvm.conf setting: vdo_disabled_features = [ "online_rename" ]	2021-01-22 15:30:37 +01:00
Zdenek Kabelac	4b8e5ad595	pools: fix removal of spare volume When removing pool LV from a stacked LV setup, it's been possible to leak _pmspare and such hidden LV then required manual user removal. Fix it by moving automatic removal into _lv_reduce().	2021-01-22 15:30:37 +01:00
Zdenek Kabelac	9ae1935b74	config: avoid printing spaces before end of line Empty comments were generating unnecessary space char before new line.	2021-01-22 15:30:37 +01:00
David Teigland	2ec29d0677	label_scan: fix missing free of filtered_devs missing free of devl entries on filtered_devs list in commit `2c9bb67604`	2021-01-18 16:26:02 -06:00
David Teigland	37227b8ad6	devs: remove invalid path name aliases Make dev_cache_get() verify aliases and drop any that are invalid before returning a dev for a given name.	2021-01-15 16:31:50 -06:00
David Teigland	0534723a2d	integrity: fix segfault on error path when replacing images When adding replacement raid+integrity images (lvconvert --repair after a raid image is lost), various errors can cause the function to exit with an error. On this exit path, the function attempts to revert new images that had been created but not yet used. The cleanup failed to account for the fact that not all images needed to be reverted.	2021-01-13 13:39:33 -06:00
Zdenek Kabelac	0b6ee6a912	alloc: enhance estimation of sufficient_pes_free Since commit `77fdc17d70` always include log_len size into needed extents - however now we may need sometimes more extents then necessary - mainly when multiple PVs are involved into allocation. Add logs_still_needed into calculation of sufficient_pes_free()	2021-01-13 12:54:45 +01:00
David Teigland	b84a9927b7	partial flag for writecache and integrity When a writecache sublv or an integrity metadata sublv are partial (missing a dev), set the partial flag on the upper level LV also, as is done for other sublvs.	2020-12-11 16:25:25 -06:00
David Teigland	9fe7aba251	cache: activation cache_check on cachevol When using cache with a cachevol, the cache_check tool was not being run on the cache metadata during activation. cache_check clears the needs_check flag in the cache metadata, so if the flag was set due to an unclean shutdown, the activation would fail.	2020-12-09 17:36:09 -06:00
David Teigland	b68141a49d	lvm.conf: remove reference to locking_type	2020-11-17 11:19:55 -06:00
David Teigland	5fef89361d	integrity: display total mismatches at raid LV level Each integrity image in a raid LV reports its own number of integrity mismatches, e.g. lvs -o integritymismatches vg/lv_rimage_0 lvs -o integritymismatches vg/lv_rimage_1 In addition to this, allow the total number of integrity mismatches from all images to be displayed for the raid LV. lvs -o integritymismatches vg/lv shows the number of mismatches from both lv_rimage_0 and lv_rimage_1.	2020-11-11 15:10:15 -06:00
David Teigland	d7058cfa98	writecache: supported in dm-writecache version 3 not version 2	2020-11-09 09:47:01 -06:00
David Teigland	020d1edaa0	writecache: disallow partial or degraded activation when either main or fast lvs are incomplete	2020-10-26 15:48:58 -05:00
David Teigland	2c31939827	pvcreate: clean up opening and filtering of args The args for pvcreate/pvremove (and vgcreate/vgextend when applicable) were not efficiently opened, scanned, and filtered. This change reorganizes the opening and filtering in the following steps: - label scan and filter all devs . open ro . standard label scan at the start of command - label scan and filter dev args . open ro . uses full md component check . typically the first scan and filter of pvcreate devs - close and reopen dev args . open rw and excl - repeat label scan and filter dev args . using reopened rw excl fd - wipe and write new headers . using reopened rw excl fd	2020-10-26 11:13:27 -05:00
Zdenek Kabelac	7bafae48bb	gcc: cleanup warns from older gcc	2020-10-26 13:06:53 +01:00
Zdenek Kabelac	9740e98cbd	lv_manip: add space into message Just add space between %s(.	2020-10-24 01:42:16 +02:00
David Teigland	6226512ad2	get dev size when setting pv device In some cases the dev size may not have been read yet in set_pv_devices(). In this case get the dev size before comparing the dev size with the pv size.	2020-10-22 13:19:17 -05:00
David Teigland	a7f195b7e8	add label_scan_devs_cached label_scan_devs without invalidating data first for cases where the caller wants to use any bcache data they have already read.	2020-10-21 16:24:16 -05:00
David Teigland	677f829e54	add label_read_pvid To read the lvm headers and set dev->pvid if the device is a PV. Difference from label_scan_ functions is this does not read any vg metadata or add any info to lvmcache.	2020-10-21 16:24:16 -05:00
David Teigland	c7311d4722	lvmcache: rename label_read label_scan_dev for consistent naming with other similar functions	2020-10-21 16:24:16 -05:00
David Teigland	b3cdf0d881	lvmcache: add lvmcache_get_dev_mda for future patch	2020-10-21 16:24:16 -05:00
David Teigland	2c9bb67604	scanning: improve filtering control Filtering in label_scan was controlled indirectly by the fact that bcache was not yet set up when label_scan first ran. The result is that filters that needed data would not run and would return -EAGAIN, which would result in the dev flag FILTER_AFTER_SCAN being set. After the dev header was read for checking the label, filters would be rechecked because of FILTER_AFTER_SCAN. All filters would be checked this time because bcache was now set up, and the filters needing data would largely use data already scanned for reading the label. This design worked but is hard to adjust for future cases where bcache is already set up. Replace this method (based on setting up bcache, or not) with a new cmd flag filter_nodata_only. When this flag is set filters that need data will not run. This allows the same label_scan behavior when bcache has been set up. There are no expected changes in behavior.	2020-10-21 16:24:16 -05:00
David Teigland	c74ccd5201	filters: nodata option When filter_nodata_only is set, a filter that uses data is skipped.	2020-10-21 16:24:16 -05:00
David Teigland	c601ec0d6e	filters: allow filter wipe for one device as passes_filter already does	2020-10-21 16:24:16 -05:00
Zdenek Kabelac	fdec4cd3e6	memlock: allocate at most halve of rlimit stack Touch of stack allocation validated given size with rlimit and if the reserved_stack was above rlimit, its been completely ignored - now we will always touch stack upto rlimit/2 size.	2020-10-20 22:26:44 +02:00
Zdenek Kabelac	b75c2dfe1b	debug: shorten error message Just check for sigint during log_error().	2020-10-19 16:53:18 +02:00
Zdenek Kabelac	58976ccc34	properties: fix data_usage typo Patch `4de6f58085` introduce typo, we need to use data_usage. Note: this code was used by lvmapp library and currently is unused.	2020-10-19 16:53:18 +02:00
Zdenek Kabelac	73a3a0d347	debug: drop vgid from debug From the code can be seen the VGID will be always NULL here as vgid != NULL is already handled before. Thus drop from being displayed.	2020-10-02 22:27:00 +02:00
Zdenek Kabelac	117fc64e6e	debug: no backtrace As the path already printed verbose message drop backtrace.	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	1b8c6f09bc	debug: show actually reason for taking this code path Instead of not so useful backtrace, report what was the reason.	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	e1af80c81c	debug: drop FD from error message Since now the error path already has device close and set -1, there is not much in printing this info - actually shouldn't be there at all..	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	dd8212365d	debug: update messages	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	e7fff97b8d	wipe_lv: use BLKZEROOUT when possible Since BLKZEROOUT ioctl should be supposedly fastest way how to clear block device start using this ioctl for zeroing a device. Commonly we do zero typically small portion of a device (8KiB) - however since we now also started to zero metadata devices, in the case of i.e. thin-pool metadata this can go upto ~16GiB and here the performance starts to be noticable.	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	c65d3a6b8a	wipe_lv: interruptible wiping Since we now block signals and wiping may take unexpectedly long time - support breaking command while wipe is in progress.	2020-10-02 21:03:19 +02:00
Zdenek Kabelac	7396f1cfee	wipe_lv: drop label_scan_invalidate on error path Since dev_set_bytes() now closes dev on error path itself, remove this unneeded call now (introduced few commits back in history thus removing comment from WHATS_NEW)	2020-10-02 21:02:04 +02:00
Zdenek Kabelac	b44db5d1a7	bcache: use flexible arrays Cleanup, allocate whole struct with a single malloc call.	2020-10-02 21:00:26 +02:00
Zdenek Kabelac	b3c7a2b3f0	bcache: support interrupts when waiting on IO Since lvm2 normally block signals during protected phase where it does not want to be interrupted. Support interruptible processing when allowed in section between sigint_allow() ... sigint_restore()) and let the 'io_getenvents()' finish with EINTR.	2020-10-02 20:57:50 +02:00
Zdenek Kabelac	0fe58fc54f	bcache: fix busy loop with too many errors When bcache tries to write data to a faulty device, it may get out of caching blocks and then just busy-loops on a CPU - so this check protects this by checking if there is already max_io (~64) errored blocks.	2020-10-02 20:56:55 +02:00
Zdenek Kabelac	41f9e372c0	bcache: fix waiting problem for completed IO Call _wait_all() which does check whether there is still some pending IO before sleep. Otherwise it may happen our submitted IO operations have been already dispatched and this call then endlessly waits for IO which are all done. This can be reproduced when device returns quickly errors on write requests.	2020-10-02 20:53:41 +02:00
David Teigland	450f272b31	devices: support printing the filter that rejects a device Use of this new message function needs to be added to various commands to improve the output.	2020-10-01 12:00:09 -05:00
David Teigland	c32d7fed4f	writecache: use two step detach When detaching a writecache, use the cleaner setting by default to writeback data prior to suspending the lv to detach the writecache. This avoids potentially blocking for a long period with the device suspended. Detaching a writecache first sets the cleaner option, waits for a short period of time (less than a second), and checks if the writecache has quickly become clean. If so, the writecache is detached immediately. This optimizes the case where little writeback is needed. If the writecache does not quickly become clean, then the detach command leaves the writecache attached with the cleaner option set. This leaves the LV in the same state as if the user had set the cleaner option directly with lvchange --cachesettings cleaner=1 LV. After leaving the LV with the cleaner option set, the detach command will wait and watch the writeback progress, and will finally detach the writecache when the writeback is finished. The detach command does not need to wait during the writeback phase, and can be canceled, in which case the LV will remain with the writecache attached and the cleaner option set. When the user runs the detach command again it will complete the detach. To detach a writecache directly, without using the cleaner step (which has been the approach previously), add the option --cachesettings cleaner=0 to the detach command.	2020-10-01 11:33:02 -05:00
David Teigland	2272a32e6f	lvmlockd vdo: add support lvmlockd handling for vdo lv and vdo pool is like thin lv and thin pool.	2020-09-29 14:43:27 -05:00
David Teigland	82e270c18a	lvmlockd vdo: disallow use of shared lock on LV vdo cannot be active on multiple hosts concurrently	2020-09-29 14:43:26 -05:00
Zdenek Kabelac	6728788bf5	debug: remove stacktrace on regular path Here _insert is expected to also fail, so just regular 'return 0'.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	0c89c5a40f	debug: update debug message	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	bd0d4de4e2	active: fix compilation without devmapper Better support for compilation without device-mapper.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	4cd356b26b	thin: remove unneeded code test Since we detect already transaction if before starting to build dm tree - this extra check is a duplicate that would only capture very tiny 'race' and we later validate transaction_id with suspended snapshot origin.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	4de6f58085	thin: use lv_status_thin and lv_status_thin_pool Introduce structures lv_status_thin_pool and lv_status_thin (pair to lv_status_cache, lv_status_vdo) Convert lv_thin_percent() -> lv_thin_status() and lv_thin_pool_percent() + lv_thin_pool_transaction_id() -> lv_thin_pool_status(). This way a function user can see not only percentages, but also other important status info about thin-pool. TODO: This patch tries to not change too many other things, but pool_below_threshold() now uses new thin-pool info to return failure if thin-pool cannot be actually modified. This should be handle separately in a better way.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	92c0e8c17f	writecache: archive before modification of metadata Archive before we start to modify metadata.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	08e838f488	cleanup: avoid unneeded check Since creation of thin snapshot already makes sure, the message list is empty, there is no need to check this again.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	af5f29c7e2	activation: move locking of critical section Move begining of 'suspending' critical section closer to _lv_suspend_lv() for better correctness of error paths.	2020-09-29 10:43:56 +02:00
Bastian Germann	168e2ffbcd	lvm: add readline alternative editline LVM2 is distributed under GPLv2 only. The readline library changed its license long ago to GPLv3. Given that those licenses are incompatible and you follow the FSF in their interpretation that dynamically linking creates a derivative work, distributing LVM2 linked against a current readline version might be legally problematic. Add support for the BSD licensed editline library as an alternative for readline. Link: https://thrysoee.dk/editline	2020-09-29 10:13:24 +02:00
David Teigland	fb96e9ab21	tests: add case for metadata checksum differences Cover the case where two copies of metadata have the same seqno but different checksums. Also elaborate on an existing fixme in the code for this case, since we should be doing something better for this case. This had been uncovering an issue with reopening fds in readwrite mode.	2020-09-28 13:25:57 -05:00
David Teigland	da14cf68cb	scanning: keep open an lvm device with scanning problem The command may want to update it.	2020-09-28 13:25:57 -05:00
David Teigland	890c7ef451	devices: fix reopen for unopened device If there's a request to reopen rw a device that's not open, then just call the normal open function.	2020-09-28 13:25:57 -05:00
Heinz Mauelshagen	8952dcbff0	Revert "lvconvert: display warning if raid1 LV image count does not change" This reverts superfluous commit `3c9177fdc0` as _lv_raid_change_image_count() already checks for non-changed image count. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1872130	2020-09-28 17:14:03 +02:00
Zdenek Kabelac	e414ebef6e	thin: pass through whole code Instead of early 'return 0' let the whole code finish in case of an error with syncing.	2020-09-25 22:59:35 +02:00
Zdenek Kabelac	8b22e38087	thin: improve error message Add more info, explaing why the suspend of thin snapshot origin was omitted.	2020-09-25 22:59:35 +02:00
Zdenek Kabelac	ef59c83f2d	thin: enhance lvcreate error paths Improve error response and reporting, when creating thin snapshots. If the thin pool kernel metadata already have device with ID lvm2 tries to create, give more meanigful error message and also properly restore transaction id to the value known to thin-pool in this case. Before it's been possible to divert by one from kernel TID value, and lvm2 stacked delete message for such thin device.	2020-09-25 22:56:40 +02:00
Zdenek Kabelac	e2eb1dc501	thin: no delete message for device_id 0 Since we always use device_id > 0, we could use device_id == 0 to actually mark thinLV as an LV we want to remove without delete message.	2020-09-25 22:54:07 +02:00
Zdenek Kabelac	50a37948b5	vdo: allow passing renamed vdopool name to kernel Although kernel does not allow to load a new dm table with renamed vdopool, at least make lvm2 code ready it it every will get supported.	2020-09-23 13:20:28 +02:00
Zdenek Kabelac	7c19186271	vdo: disable support for online rename of vdopool LV Since ATM kernel does not support this operation, disable 'lvrename' of an active vdopool. As a workaround, user may simply deactivate, rename and activate.	2020-09-23 13:18:23 +02:00
Zdenek Kabelac	3a3307c0d8	vdo: enhance vdo pool extension When user tries to extend vdo pool - he needs to go always at least by 1 full VDO slab (defined as vdo_slab_size_mb). To avoid all trouble around find 'workable' size - lvm2 automatically increases the passed (or by --use-policies calculated) extension size (and informs a user about sometimes possibly large increase as slab size can go upto 32GiB) With VDO users need to always 'think-big' anyway and expect such operation to be in GiB domain range.	2020-09-22 23:28:43 +02:00
Zdenek Kabelac	f38b7afd62	vdo: extend vdo segment validation Try to catch all suspicious VDO segments in metadata early.	2020-09-22 23:25:16 +02:00
Zdenek Kabelac	642ef54399	vdo: correct message about policy extend support Policy extend is already supported for vdo pools as well, so correct the error message.	2020-09-22 23:25:16 +02:00
Zdenek Kabelac	e08a0421a3	vdo: drop unnecessary tabulator from metadata output	2020-09-22 23:25:16 +02:00
Zdenek Kabelac	5bc66532c7	activation: use revert_lv on tree suspend failure When thetable reload fails during suspend() - we were only calling plain resume() - and this will reload only those devices, which were left suspend, but will not try to restore metadata state according to lvm2 reverted metadata. So if we were reloading device tree - we have restored only top-level LV and rest of reverted device manipulation were left alone and possibly mismatched what is in committed metadata. FIXME: There are several cases were such revert will likely not work properly anyway as some operation are currenly handled in single commit, while they need multiple commits, but it's step towards better correctness. At least we catch there errors now earlier.	2020-09-22 21:02:14 +02:00
Zdenek Kabelac	6c769eb460	bache: fix error return value Return 0 as failure (as checked for). Also add INTERNAL_ERROR if 'DI' would be -1.	2020-09-19 23:00:50 +02:00
David Teigland	1404e5ee61	metadata: open rw fd before closing ro fd lvm opens devices readonly to scan them, but needs to open then readwrite to update the metadata. Previously, the ro fd was closed before the rw fd was opened, leaving a small gap where the dev was not held open, and during which the dev could possibly change which storage it referred to. With the bcache_change_fd() interface, lvm opens a rw fd on a device to be written, tells bcache to change to the new rw fd, and closes the ro fd. . open dev ro . read dev with the ro fd (label_scan) . lock vg (ex for writing) . open dev rw . close ro fd . rescan dev to check if the metadata changed between the scan and the lock . if the metadata did change, reread in full . write the metadata	2020-09-18 15:10:11 -05:00
David Teigland	1570e76233	bcache: use indirection table for fd Add a "device index" (di) for each device, and use this in the bcache api to the rest of lvm. This replaces the file descriptor (fd) in the api. The rest of lvm uses new functions bcache_set_fd(), bcache_clear_fd(), and bcache_change_fd() to control which fd bcache uses for io to a particular device. . lvm opens a dev and gets and fd. fd = open(dev); . lvm passes fd to the bcache layer and gets a di to use in the bcache api for the dev. di = bcache_set_fd(fd); . lvm uses bcache functions, passing di for the dev. bcache_write_bytes(di, ...), etc. . bcache translates di to fd to do io. . lvm closes the device and clears the di/fd bcache state. close(fd); bcache_clear_fd(di); In the bcache layer, a di-to-fd translation table (int *_fd_table) is added. When bcache needs to perform io on a di, it uses _fd_table[di]. In the following commit, lvm will make use of the new bcache_change_fd() function to change the fd that bcache uses for the dev, without dropping cached blocks.	2020-09-18 15:10:11 -05:00
David Teigland	46f43589d0	hints: enhance debug messages	2020-09-16 15:01:10 -05:00
David Teigland	491eb25832	label: cleanup set_byte error exit	2020-09-16 13:54:16 -05:00
David Teigland	37bcd7ce84	Revert "label: use formaters FMTu64 and FMTsize_t" This reverts commit `d0ccb2521b`.	2020-09-16 13:47:06 -05:00
Zdenek Kabelac	d0ccb2521b	label: use formaters FMTu64 and FMTsize_t Produces code without casts to differntly signed types and also shortens and enhances readbility.	2020-09-15 23:07:06 +02:00
Zdenek Kabelac	2b36542f41	wipe: dev_set_bytes resolves zeroing Since dev_write_zeros() is just subset of dev_set_bytes() use it directly and simplify code.	2020-09-15 23:07:06 +02:00
Zdenek Kabelac	d588de77aa	wipe: convert zero_value to uint8_t We always write this value as byte.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	ec4e8b5c0e	wipe: zeroing of 8 sectors is granted With do_zero min is always 8 sectors, so use 0 as default.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	7bcc994776	label: deduplicate dev_set_bytes As dev_write_zeros() is same as dev_set_bytes() reused the code directly.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	7b08133844	label: code deduplication	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	6d344b4ac0	hints: enhance debug with log_sys_debug	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	187cc8d344	lvcreate: change error message Provide more useful error message.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	39198eb2ce	lvcreate: add extra synchronization at error path Put explict udev synchronization before we try to deactive devices.	2020-09-15 22:52:25 +02:00
Zdenek Kabelac	b2978efbff	cache: simplier signal handling Use just single sigint_allow()/restore() within flushing loop and void one extra signal manipulation.	2020-09-14 00:15:14 +02:00
Zdenek Kabelac	2101e324f9	locking: restore blocking signal for VG_GLOBAL lck During removal of a lot of locking code the signal blocking got lost and signal processing got broken leading to unpredictable behavior of i.e. activation code the can get interrupted in the middle of DM table processing. lvm2 code always expects signals are blocked while lock is held unless it is explictelly placed into section of: sigint_allow();....;sigint_restore(); For checking catched interrupt there is sigint_catched();	2020-09-14 00:15:14 +02:00
Zdenek Kabelac	77fdc17d70	alloc: improve estimation of sufficient_pes_free Metadata size was calculated correctly only for raids. Fixes problem for crash during lvcreate when thin-pool was created on a VG where remaining free space had the size to only fit a single metadata LV and not also its _pmspare. Lvcreate crashed with this assert message: lvcreate: metadata/pv_map.c:198: consume_pv_area: Assertion `to_go <= pva->count' failed. Aborted (core dumped) TODO: there is probably to large overload of several alloc_handle variables. Reported-by: Wu Guanghao<wuguanghao3@huawei.com> Reported-by: Zhiqiang Liu <liuzhiqiang26@huawei.com>	2020-09-11 21:51:24 +02:00
Zdenek Kabelac	9f78acfee9	thin: compensate metadata size by extra percent When using --use-policy for automatic extension of thin-pool, the extension of thin-pool's metadata itself can actually take some extra space. Since I'm not aware of exact compensation formula, add just 1% extra to calculated amount and hope it fits. Wanted target is to always have usable thin-pool that fits bellow pool_metadata_min_threshold().	2020-09-11 21:42:37 +02:00
Zdenek Kabelac	b798554a20	lv_manip: even better rounding	2020-09-11 13:37:04 +02:00
Zdenek Kabelac	678951f635	cleanup: comment typo	2020-09-10 23:55:03 +02:00
Zdenek Kabelac	e7bd3ba22d	debug: drop debug trace from regular path Since we query on regular code these: lv_raid_has_integrity() lv_has_integrity_recalculate_metadata() without prior checking for lv_is_raid() - these 'return 0' should not use <stacktrace> as they are expected.	2020-09-10 23:55:03 +02:00
Zdenek Kabelac	bc09803628	lv_manip: relocate check to proper function	2020-09-10 23:54:33 +02:00
Zdenek Kabelac	e7f5acdfa6	lvextend: improve percentage estimation Correcting rounding rules for percentage evaluation. Validate supported range of percentage. (although ranges are already validated earlier on code path)	2020-09-10 23:54:31 +02:00
David Teigland	1f54129c4e	integrity: fix segfault reporting integrity for other lvs	2020-09-09 10:22:07 -05:00
Zdenek Kabelac	af33a00847	Revert "raid: add _rimage and _rmeta as origin_only" This reverts commit `3388e19489`. More thinking needed.	2020-09-09 00:58:52 +02:00
Zdenek Kabelac	a8ea1817ab	Revert "raid: do not enforce flushing of raids when it is not required" This reverts commit `ce5ea07411`. More thinking needed.	2020-09-09 00:58:32 +02:00
Zdenek Kabelac	ce5ea07411	raid: do not enforce flushing of raids when it is not required This is probably somewhat experimantal patch - but when i.e. raid device is just extend, there should not be a technical need for flush, unless the target would stricly need it. It should allow faster processing of lvm command not being blocked by possibly longer flush.	2020-09-08 21:23:03 +02:00
Zdenek Kabelac	3388e19489	raid: add _rimage and _rmeta as origin_only Since we do not support rimage & rmeta for snapshots - we can avoid quering for -cow devices and add them as origin_only - since their snapshots (-cow) could have never existed. This redumes several ioctl operation during table preloading.	2020-09-08 21:23:03 +02:00
Zdenek Kabelac	3e6bb77228	lv_manip: add synchronization points	2020-09-08 21:23:03 +02:00
David Teigland	8b9028bbe7	hints: remove warning when clearing hint file When the hint file cannot be accessed, silently ignore hints, like other instances do.	2020-09-02 14:06:46 -05:00
David Teigland	d1019a6434	integrity: improve lv type checks	2020-09-02 12:40:45 -05:00
David Teigland	9a7b81fb72	integrity: fix segfault for lv with no seg in lv_raid_has_integrity	2020-09-02 09:15:58 -05:00
David Teigland	ed249a2c53	integrity: report mismatches with lvs -o integritymismatches reported for integrity images, which may report different values	2020-09-01 17:13:21 -05:00
David Teigland	47b5fb138c	integrity: report raidintegritymode randintegrityblocksize reported for the raid lv and the integrity images	2020-09-01 17:12:36 -05:00
David Teigland	f2c1de783c	integrity: always default to journal mode lvconvert was defaulting to bitmap mode, and lvcreate was defaulting to journal mode.	2020-09-01 17:12:28 -05:00
Zdenek Kabelac	672d5ad98b	gcc: hide warn about possible uninitialized use of dev_ret Older gcc reports this fp problem.	2020-09-01 23:40:24 +02:00
Zdenek Kabelac	dbb19f6ace	cleanup: matching declaration order Cosmetic	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	56c41b7522	cov: avoid duplicated assign	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	a481f42630	cov: always initialized values Make sure values are initialized for all possible paths.	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	85e2c7e14d	cov: explicitely ignore function result	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	1705b439b1	cov: always sure we end with '0' Use easier dm_strncpy().	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	fd96f1014b	gcc: zero-sized array to fexlible array C99 Switch remaining zero sized struct to flexible arrays to be C99 complient. These simple rules should apply: - The incomplete array type must be the last element within the structure. - There cannot be an array of structures that contain a flexible array member. - Structures that contain a flexible array member cannot be used as a member of another structure. - The structure must contain at least one named member in addition to the flexible array member. Although some of the code pieces should be still improved.	2020-09-01 17:57:50 +02:00
Zhao Heming	cc2218b401	gcc: change zero-sized array to fexlible array this patch makes gcc happy with compiling option: [-Wstringop-overflow=] Signed-off-by: Zhao Heming <heming.zhao@suse.com>	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	b722ce2f10	gcc: drop bogus ;	2020-08-28 21:43:03 +02:00
Zdenek Kabelac	19e9c88faf	gcc: do not use return with void function Follow C norm and do not use 'return' in void function to call other functions.	2020-08-28 21:43:03 +02:00
Zdenek Kabelac	ee0cb17608	gcc: use apropriate type for reading and printing values	2020-08-28 21:43:03 +02:00
Zdenek Kabelac	7880896f0d	gcc: calc size in compile time	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	ce202c3b1c	gcc: keep unsigned arithmetic Avoid conversion to int.	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	ff4827ffb1	lv_manip: get_default_region_size return uint32_t	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	93c9055591	lvmcache: use uint32_t for seqno caching	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	03f9cd95b4	writecache: correct usage of const struct	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	e3e04b99f2	config: drop reading file with mmap While normally the 'mmap' file reading is better utilizing resources, it has also its odd side with handling errors - so while we normally use the mmap only for reading regular files from root filesystem (i.e. lvm.conf) we can't prevent error to happen during the read of these file - and such error unfortunately ends with SIGBUS error. Maintaing signal handler would be compilated - so switch to slightly less effiecient but more error resistant read() functinality.	2020-08-28 21:43:02 +02:00
David Teigland	9a88a9c4ce	Revert "lvdisplay: dispaly correct status when underlying devs missing" This reverts commit `1d0dc74f91`. We should avoid adding anything new to lvdisplay and report new information via lvs reporting fields.	2020-08-28 13:28:15 -05:00
Zhao Heming	1d0dc74f91	lvdisplay: dispaly correct status when underlying devs missing reproducible steps: 1. vgcreate vg1 /dev/sda /dev/sdb 2. lvcreate --type raid0 -l 100%FREE -n raid0lv vg1 3. do remove the /dev/sdb action 4. lvdisplay show wrong 'LV Status' After removing raid0 type LV underlying dev, lvdisplay still display 'available'. This is wrong status for raid0. This patch add a new function raid_is_available(), which will handle all raid case. With this patch, lvdisplay will show from: LV Status available to: LV Status NOT available (partial) Reviewed-by: Enzo Matsumiya <ematsumiya@suse.com> Signed-off-by: Zhao Heming <heming.zhao@suse.com>	2020-08-24 09:47:04 -05:00
Zdenek Kabelac	46d15b5e4d	wipe_lv: close devices on error path Device was kept open preventing its deactivated and removed on error path.	2020-08-19 15:09:09 +02:00
Zdenek Kabelac	7b41ea61b2	config: move some config setting into commented part It's better to set most of option as 'commented' with some documented defaults instead of providing strict values. This has the advantage we can eventually 'change' defualts and get them working in future. Otherwise once the setting is stored in lvm.conf in /etc, such setting has strictly defined value and that can be only change with file update.	2020-08-19 15:07:09 +02:00
Heinz Mauelshagen	3c9177fdc0	lvconvert: display warning if raid1 LV image count does not change Fix "lvconvert -mN $RaidLV" to display a warning in case the same number of images is being requested. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1774696	2020-07-20 15:42:15 +02:00
David Teigland	00c9a788cc	devices: simplify md superblock checking code	2020-07-09 10:48:34 -05:00
David Teigland	23774f997e	devices: detect md ddf and imsm superblocks	2020-07-09 10:48:21 -05:00
Heinz Mauelshagen	286a793c12	lvconvert: fix conversion to 'mirrored' mirror log with larger regionsize merge.c:_check_lv_segment() was checking regionsize vs. mirrored LV size on any 'mirror/raid1/raid10' segment type including type 'mirrored' mirror logs. Avoid the check only for 'mirrored' mirror logs to allow conversion from log type 'disk' with regionsize > mirror log SubLV size. As we disabled support for 'mirrored' mirror logs with commit `e82303fd6a` which still conditionally allows to enable it via global/support_mirrored_mirror_logs=1, patch is mandatory for all distributions. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1712983	2020-07-09 14:39:50 +02:00
Zdenek Kabelac	9b9bf8786f	raid: no wiping when zeroing raid metadata device Currently lvm2 is not wiping signatures when creating 'metadata' volumes and raid _rmeta was the only exception - so make the behavior consistent with other metadata devices and drop wiping ATM. Drop also some extra debug since they are now more explanatory in wipe_lv() function. Also note - although lvm2 now does not wipe signatures - the error from such wipping used to be actually 'ignored' before wipe_lv() started to return error (with recent commit) and raid creation continued with 'unzeroed' metadata device. TODO: Several issues to resolve: 1. We may want to flip to wipping with all LVs (in that case we need to support passing --yet & --force). 2. Also we may want to clear whole metadata device - however current function is also used for wipping i.e. snapshot COW device which is likely not a good candidate for full device zeroing. We may also need to think about better logic when extent size is enforcing very large LVs, when only a small portion of LV is ever being used. 3. Using TRIM instead of zeroing metadata device might be worth to implement. mm	2020-07-08 11:40:55 +02:00
Zdenek Kabelac	fe78cd4082	wipe_lv: always zero at least 4K When zero_sectors passed value like 1 - we could zero only 1 sector. Reinstantiate we always zero at least 4K block.	2020-07-08 11:12:54 +02:00
David Teigland	ad773511c5	integrity: add initial size to metadata size The metadata device size needs to include space for the dm-integrity "initial_sectors" which hold journals.	2020-06-30 16:43:05 -05:00
Zdenek Kabelac	cca2a652d1	cov: avoid double call of free_hints() on error path Since we 'free_hints()' on return error path from call of _read_hint_file(), avoid calling it twice in the middle of error path process.	2020-06-24 15:01:03 +02:00
Zdenek Kabelac	eb06832b37	cov: remove unused header	2020-06-24 15:01:03 +02:00
Zdenek Kabelac	dccaab3d79	cov: use 64bit arithmetic Although values of VDO block_map_cache_size, index_memory_size, slab_size should not overflow here - use proper 64bit math.	2020-06-24 15:01:03 +02:00
Zdenek Kabelac	bc39d5bec6	pool: zero metadata To avoid polution of metadata with some 'garbage' content or eventualy some leak of stale data in case user want to upload metadata somewhere, ensure upon allocation the metadata device is fully zeroed. Behaviour may slow down allocation of thin-pool or cache-pool a bit so the old behaviour can be restored with lvm.conf setting: allocation/zero_metadata=0 TODO: add zeroing for extension of metadata volume.	2020-06-24 15:01:03 +02:00
Zdenek Kabelac	edbc5a62b2	wipe_lv: make error a fatal event Failure in wiping/zeroing stop the command. If user wants to avoid command abortion he should use -Zn or -Wn to avoid wiping. Note: there is no easy way to distinguish which kind of failure has happend - so it's safe to not proceed any futher.	2020-06-24 15:01:03 +02:00
Zdenek Kabelac	6eb9eba59b	bcache: support longer writes When initiated larger write request, it may have happened, bcache got out of free chunks - fix the loop, that is supposed to wait until next free chunk becomes avain available.	2020-06-24 15:01:03 +02:00
Heinz Mauelshagen	04bba5ea42	lv{resize,extend,reduce}: also check for 2-legged raid4 Users can also convert 2-legged raid1 to raid4 thus causing 'Bus error' on resize requests. Related: https://bugzilla.redhat.com/show_bug.cgi?id=1784351	2020-06-24 14:02:31 +02:00
Heinz Mauelshagen	2cf0f90780	lv{resize,extend,reduce}: reject size change on 2-legged raid5* Reject size changing request in to avoid 'Bus error' and display hint to convert to more stripes. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1784351	2020-06-24 13:52:56 +02:00
David Teigland	2aed2a41f7	lvcreate: new cache or writecache lv with single command To create a new cache or writecache LV with a single command: lvcreate --type cache\|writecache -n Name -L Size --cachedevice PVfast VG [PVslow ...] - A new main linear\|striped LV is created as usual, using the specified -n Name and -L Size, and using the optionally specified PVslow devices. - Then, a new cachevol LV is created internally, using PVfast specified by the cachedevice option. - Then, the cachevol is attached to the main LV, converting the main LV to type cache\|writecache. Include --cachesize Size to specify the size of cache\|writecache to create from the specified --cachedevice PVs, otherwise the entire cachedevice PV is used. The --cachedevice option can be repeated to create the cache from multiple devices, or the cachedevice option can contain a tag name specifying a set of PVs to allocate the cache from. To create a new cache or writecache LV with a single command using an existing cachevol LV: lvcreate --type cache\|writecache -n Name -L Size --cachevol LVfast VG [PVslow ...] - A new main linear\|striped LV is created as usual, using the specified -n Name and -L Size, and using the optionally specified PVslow devices. - Then, the cachevol LVfast is attached to the main LV, converting the main LV to type cache\|writecache. In cases where more advanced types (for the main LV or cachevol LV) are needed, they should be created independently and then combined with lvconvert. Example ------- user creates a new VG with one slow device and one fast device: $ vgcreate vg /dev/slow1 /dev/fast1 user creates a new 8G main LV on /dev/slow1 that uses all of /dev/fast1 as a writecache: $ lvcreate --type writecache --cachedevice /dev/fast1 -n main -L 8G vg /dev/slow1 Example ------- user creates a new VG with two slow devs and two fast devs: $ vgcreate vg /dev/slow1 /dev/slow2 /dev/fast1 /dev/fast2 user creates a new 8G main LV on /dev/slow1 and /dev/slow2 that uses all of /dev/fast1 and /dev/fast2 as a writecache: $ lvcreate --type writecache --cachedevice /dev/fast1 --cachedevice /dev/fast2 -n main -L 8G vg /dev/slow1 /dev/slow2 Example ------- A user has several slow devices and several fast devices in their VG, the slow devs have tag @slow, the fast devs have tag @fast. user creates a new 8G main LV on the slow devs with a 2G writecache on the fast devs: $ lvcreate --type writecache -n main -L 8G --cachedevice @fast --cachesize 2G vg @slow	2020-06-16 13:46:51 -05:00
David Teigland	48872b0369	integrity: avoid increasing logical block size of active LV When adding integrity to an active LV, avoid choosing an integrity block size that would result in increasing the logical block size of the LV.	2020-06-16 12:27:22 -05:00
David Teigland	8e2938c963	improve get_fs_block_size string to number	2020-06-11 15:05:47 -05:00
David Teigland	b528a9ce90	integrity: fix block size check when inactive Checking fs block size requires the LV to be active.	2020-06-11 12:43:52 -05:00
David Teigland	9fbad5bb0f	fix libblkid BLOCK_SIZE check	2020-06-11 12:43:07 -05:00
David Teigland	38eaa1035b	writecache: allow snapshot of LV with writecache	2020-06-10 12:18:00 -05:00
David Teigland	712c9efbf6	fix bad result from _cache_min_metadata_size fixes regression from switching to use _cache_min_metadata_size (commit `c08704cee7`) which returns a bogus value when the cachevol size is 8MB.	2020-06-10 12:17:34 -05:00
David Teigland	a7b2fc8f57	writecache: add settings cleaner and max_age available in dm-writecache 1.2	2020-06-10 12:15:50 -05:00
David Teigland	1ee42f1391	writecache: cachesettings in lvchange and lvs lvchange --cachesettings lvs -o+cache_settings	2020-06-10 12:14:00 -05:00
David Teigland	ce772bfab9	writecache: show error in lv_health_status and lv_attr lv_attr is 'E' and lv_health_status is 'error' when dm-writecache status reports error.	2020-06-10 12:13:48 -05:00
David Teigland	240062a183	writecache: remove from an active lv	2020-06-10 12:13:31 -05:00
David Teigland	fa9eb76a5d	improve info about vgck updatemetadata Add man page info about this option, and add log messages pointing to this option.	2020-06-03 12:38:27 -05:00
Zhao Heming	b59127a838	Change dev->bcache_fd default value from 0 to -1 This fix can avoid bcache_fd will mistakenly open/close in later. Signed-off-by: Zhao Heming <heming.zhao@suse.com>	2020-06-01 12:22:15 -05:00
David Teigland	2a304d7a75	lvmcache: free vginfo lock_type	2020-05-14 10:20:08 -05:00
David Teigland	5c095400de	hints: free hint structs on exit and free on a couple error paths.	2020-05-13 17:20:16 -05:00
David Teigland	2f29765e7f	devs: add some checks for a dev with no path name It's possible for a dev-cache entry to remain after all paths for it have been removed, and other parts of the code expect that a dev always has a name. A better fix may be to remove a device from dev-cache after all paths to it have been removed.	2020-05-13 16:26:26 -05:00
David Teigland	2d1fe38d84	lvmlockd: use 4K sector size when any dev is 4K When either logical block size or physical block size is 4K, then lvmlockd creates sanlock leases based on 4K sectors, but the lvm client side would create the internal lvmlock LV based on the first logical block size it saw in the VG, which could be 512. This could cause the lvmlock LV to be too small to hold all the sanlock leases. Make the lvm client side use the same sizing logic as lvmlockd.	2020-05-11 13:14:55 -05:00
David Teigland	d945b53ff7	remove vg_read_error Once converted results to error numbers but is now just a null check.	2020-04-24 11:14:29 -05:00
David Teigland	4047a32128	use refresh_filters only where needed Filters are changed and need refresh in only one place (vgimportclone), so avoid doing the refresh for every other command that doesn't need it.	2020-04-22 14:08:54 -05:00
David Teigland	d79afd4084	lvmcache: rework handling of VGs with duplicate vgnames The previous method of managing duplicate vgnames prevented vgreduce from working if a foreign vg with the same name existed.	2020-04-21 14:40:34 -05:00
David Teigland	cc4051eec0	pass cmd struct through more functions no functional change	2020-04-21 10:58:05 -05:00
David Teigland	3854931aea	lvmcache_get_mda: remove unused function	2020-04-21 10:58:05 -05:00
David Teigland	d9e8895a96	Allow dm-integrity to be used for raid images dm-integrity stores checksums of the data written to an LV, and returns an error if data read from the LV does not match the previously saved checksum. When used on raid images, dm-raid will correct the error by reading the block from another image, and the device user sees no error. The integrity metadata (checksums) are stored on an internal LV allocated by lvm for each linear image. The internal LV is allocated on the same PV as the image. Create a raid LV with an integrity layer over each raid image (for raid levels 1,4,5,6,10): lvcreate --type raidN --raidintegrity y [options] Add an integrity layer to images of an existing raid LV: lvconvert --raidintegrity y LV Remove the integrity layer from images of a raid LV: lvconvert --raidintegrity n LV Settings Use --raidintegritymode journal\|bitmap (journal is default) to configure the method used by dm-integrity to ensure crash consistency. Initialization When integrity is added to an LV, the kernel needs to initialize the integrity metadata/checksums for all blocks in the LV. The data corruption checking performed by dm-integrity will only operate on areas of the LV that are already initialized. The progress of integrity initialization is reported by the "syncpercent" LV reporting field (and under the Cpy%Sync lvs column.) Example: create a raid1 LV with integrity: $ lvcreate --type raid1 -m1 --raidintegrity y -n rr -L1G foo Creating integrity metadata LV rr_rimage_0_imeta with size 12.00 MiB. Logical volume "rr_rimage_0_imeta" created. Creating integrity metadata LV rr_rimage_1_imeta with size 12.00 MiB. Logical volume "rr_rimage_1_imeta" created. Logical volume "rr" created. $ lvs -a foo LV VG Attr LSize Origin Cpy%Sync rr foo rwi-a-r--- 1.00g 4.93 [rr_rimage_0] foo gwi-aor--- 1.00g [rr_rimage_0_iorig] 41.02 [rr_rimage_0_imeta] foo ewi-ao---- 12.00m [rr_rimage_0_iorig] foo -wi-ao---- 1.00g [rr_rimage_1] foo gwi-aor--- 1.00g [rr_rimage_1_iorig] 39.45 [rr_rimage_1_imeta] foo ewi-ao---- 12.00m [rr_rimage_1_iorig] foo -wi-ao---- 1.00g [rr_rmeta_0] foo ewi-aor--- 4.00m [rr_rmeta_1] foo ewi-aor--- 4.00m	2020-04-15 12:10:32 -05:00
David Teigland	b6b4ad8e28	move pv_list code into lib	2020-04-13 10:04:14 -05:00
Zdenek Kabelac	caff31df19	vdo: make vdopool wrapping device is read-only When vdopool is activated standalone - we use a wrapping linear device to hold actual vdo device active - for this we can set-up read-only device to ensure there cannot be made write through this device to actual pool device.	2020-03-23 17:13:26 +01:00
David Teigland	957904933b	reduce device path error messsages When /dev entries or sysfs entries are changing due to concurrent lvm commands, it can cause warning/error messages about missing paths.	2020-03-12 10:18:51 -05:00
David Teigland	dd0fdd846d	lvmlockd: use transient LV lock when creating snapshot Creating a snapshot was using a persistent LV lock on the origin, so if the origin LV was inactive at the time of the snapshot the LV lock would remain. (Running lvchange -an on the inactive LV would clear the LV lock.) Use a transient LV lock so it will be dropped if it was not locked previously.	2020-03-09 12:25:26 -05:00
Zdenek Kabelac	f439716b75	container_of: use offsetof from stddef Use standardized offsetof() macro from stddef. Helps to build valid code with latest gcc10 with -O2.	2020-03-05 17:38:55 +01:00
David Teigland	f50e7ce76c	hints: free hint list in error exit path	2020-03-03 12:25:34 -06:00
Zdenek Kabelac	d02d7bc560	vdo: fix slab size bits calculation When formating VDO volume, the calculated amound of bits for 'vdoformat --slab-bits' parameter was shifted by 2 bits (calculated size was making 2MiB vdo_slab_size_mb value appear like if user would be specifying only 512KiB) Fixed by properly converting internal size_mb value to KiB.	2020-02-25 17:43:16 +01:00
David Teigland	81d0333067	writecache: allow removing wcorig lv like removing corig	2020-02-21 12:41:52 -06:00
David Teigland	8153c5f1e6	writecache: working real dm uuid suffix for wcorig lv	2020-02-20 17:13:43 -06:00
David Teigland	4829f27b76	writecache: drop real dm suffix fixes the problem of adding writecache to an active LV	2020-02-17 13:07:06 -06:00
Zdenek Kabelac	892a182975	cachevol: stop dm errors with uncaching cache with cachevol Fix the anoying kernel message reported: device-mapper: cache: 253:2: metadata operation 'dm_cache_commit' failed: error = -5 which has been reported while cachevol has been removed. Happened via confusing variable - so switch the variable to commonly user '_size' which presents a value in sector units and avoid 'scaling' this as extent length by vg extent size when placing 'error' target on removal path. Patch shouldn't have impact on actual users data, since at this moment of removal all date should have been already flushed to origin device. m	2020-02-11 17:19:57 +01:00
Zdenek Kabelac	3716aa848e	vdo: fix vdoformat when -V is specified The previous patch improved read of pipe when lvm2 was looking for default logical size, but we clearly must read pipe also for -V case, when the logical size is already defined.	2020-02-10 15:41:30 +01:00
Zdenek Kabelac	96985b1373	raid: better place for blocking reshapes Still the place can be better to block only particular reshape operations which ATM cause kernel problems. We check if the new number of images is higher - and prevent to take conversion if the volume is in use (i.e. thin-pool's data LV).	2020-02-07 16:48:48 +01:00
David Teigland	ffea7daec3	writecache: prevent snapshots there appear to be problems with taking a snapshot of an LV with a writecache, so block it until that is understood or fixed.	2020-02-06 11:27:33 -06:00
David Teigland	2a6078f961	writecache: fix splitcache when origin is raid	2020-02-04 16:12:09 -06:00
Zdenek Kabelac	336361b2f2	lv_manip: add extra check for existin origin_lv clang: it's supposedly impossible path to hit, as we should always have origin_lv defined when running this path, but adding protection isn't a big issue to make this obvious to analyzer.	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	67f627c8fb	raid: add internal error for no segment clang: capture internal error when data_seg would not be defined. (invalid LV with no areas)	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	409362c127	lv_manip: add error handling for _reserve_area Since _reserve_area() may fail due to error allocation failure, add support to report this already reported failure upward. FIXME: it's log_error() without causing direct command failure.	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	bcfe4993e5	lvmlockd: move eval of ENOENT To avoid logging 'errors' for no real error state (ENOENT), move this evaluation upward in the code.	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	d6ac039b65	cov: widen before calculating min_chunk_size Although we expect min_chunk_size to be 32bit value, for large size of caches it might be useful to do calcs 64bit. So to avoid doing shift as signed 32bit - use unsigned 64bit from the start.	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	de43527f94	cov: unused header file removal cov: unused header removed Also ensure library header file with config settings goes first. Move inclusion of format-text.h into layout.h	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	c5e5ae4c95	bcache: fix memleak on error path clang: free io on error path.	2020-02-04 17:22:06 +01:00
David Teigland	bddbbcb98c	writecache: report status fields reporting fields (-o) directly from kernel: writecache_total_blocks writecache_free_blocks writecache_writeback_blocks writecache_error The data_percent field shows used cache blocks / total cache blocks.	2020-01-31 11:52:49 -06:00
David Teigland	8810c11bc9	lvmlockd: use ret value in query function	2020-01-29 10:37:28 -06:00
Zdenek Kabelac	cf844941d4	vdo: adapt for multi line vdo_format output Do not close pipeline after 1st. line parsed from vdo_format. Also reprint the output for a user so new messages from vdo_format can be seen by users.	2020-01-23 10:32:15 +01:00
Zdenek Kabelac	d7bf7091c3	raid: more limitted prohibition of stacked raid usage We actually need to prohibit only reshaping cases which are running over multiple commands.	2020-01-23 10:32:15 +01:00
Zdenek Kabelac	7737ffb11c	raid: disallow reshape of stacked LVs Until we resolve reshape for 'stacked' devices, we need to disable it. So users can no longer reshape i.e. thin-pool data volumes, causing ATM bad thin-pool problems.	2020-01-13 17:42:31 +01:00
David Teigland	2173bdb821	drop warnings about missing pvs in foreign vgs When a foreign VG is ignored, don't print warnings that it is missing PVs.	2019-12-11 12:56:15 -06:00
Zdenek Kabelac	89d839e541	clenaup: simpler form	2019-12-10 15:44:16 +01:00
Zdenek Kabelac	abc0a8faba	vg_read: use else for 3 case Make it visible we check for ==, >, < of same var.	2019-12-10 15:44:16 +01:00
Zdenek Kabelac	5555765cfc	debug: enhance messages Drop 'extra' stack trace where errors are already logged from function. Add some missing dots in messages.	2019-12-10 15:44:16 +01:00
Zdenek Kabelac	cff16b062b	debug: avoid to slashes in debug message	2019-12-10 15:44:16 +01:00
Nikhil Kshirsagar	e70d5d470c	debug: print VG name in log messages for segment errors Signed-off-by: Nikhil Kshirsagar <nkshirsa@redhat.com>	2019-12-10 15:44:06 +01:00
David Teigland	94076245df	scan: add simple scan to find a pvid	2019-11-27 11:13:47 -06:00
David Teigland	74ad2cd76f	metadata: add vg_from_config_tree Add cmd/fmt args to import functions so that they can be used without the fid arg which.	2019-11-27 11:13:47 -06:00
David Teigland	98a8099da9	scanning: use bool type for _scan_text_mismatch	2019-11-27 09:26:49 -06:00
David Teigland	0c1316cda8	scanning: optimize by checking text offset and checksum After the VG lock is taken for vg_read, reread the mda_header and compare the metadata text offset and checksum to what was seen during label scan. If it is unchanged, then the metadata has not changed since the label scan, and the metadata does not need to be reread under the lock for command processing. For commands that do not make changes (e.g. reporting), the mda_header is reread and checked on one mda to decide if the full metadata rereading can be skipped. For other commands (e.g. modifying the vg) the mda_header is reread and checked from all PVs. (These could probably just check one mda also.)	2019-11-26 16:52:28 -06:00
David Teigland	56a295f78c	bcache: add invalidate_bytes function	2019-11-26 16:52:28 -06:00
David Teigland	2037476008	pvcreate,pvremove: fix reacquiring global lock after prompt When pvcreate/pvremove prompt the user, they first release the global lock, then acquire it again after the prompt, to avoid blocking other commands while waiting for a user response. This release/reacquire changes the locking order with respect to the hints flock (and potentially other locks). So, to avoid deadlock, use a nonblocking request when reacquiring the global lock.	2019-11-26 14:34:43 -06:00
David Teigland	1c9b36618e	writecache: modprobe dm-writecache	2019-11-26 11:21:09 -06:00
David Teigland	7ea71a9eb9	Revert "hints: rewrite function" This reverts commit `70fb31b5d6`.	2019-11-14 12:15:05 -06:00
David Teigland	31a862a6be	Revert "debug: enhance debug messages" This reverts commit `e92d3bd1f7`.	2019-11-14 12:11:53 -06:00
Zdenek Kabelac	e92d3bd1f7	debug: enhance debug messages	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	14e01d6316	hints: drop unneeded memset strncpy will zero buffer itself.	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	1760b96368	hints: no need to check for NULL before free free() itself checks for NULL.	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	33c1d2e921	cov: add explicit ret value ignoring We don't need to check for any error result codes here.	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	ad0343d8cb	cov: remove unused headers	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	1da5fd8226	cov: inline _build_desc_write Embed function into the code, since the function is actually simpler written this as there are no memleak troubles with failing allocation error path.	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	61a483a654	hints: check for _touch_hints Exit when !_touch_hints().	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	c38be06531	hints: fix mem leaking buffers	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	1349a52626	hints: validate allocation result	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	219fe72359	hints: validate sscanf results	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	d4d82dbb70	hints: allocate hint only when needed Avoid mem leaking hint on every loop continue and allocate hint only when it's going to be added into list. Switch to use 'dm_strncpy()' and validate sizes.	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	70fb31b5d6	hints: rewrite function	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	44bf9c9a6a	cov: fix memleak for duplicate device For dev_in_device_list() != 0 allocated 'devl' was actually leaking - so instead allocate 'devl' only when !dev_in_device_list() and indent code around.	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	82e6b820b8	cov: check for NULL Since we check for NULL pointers earlier we need to be consistent across function - since the NULL would applies across whole function. When dropping 'mda' check - we are actually already dereferencing it before - so it can't be NULL at that places (and it's validated before entering _read_mda_header_and_metadata).	2019-11-14 18:06:42 +01:00
Zdenek Kabelac	43f149526d	devtype: simplify code Update code with simpler form and check for fclose().	2019-11-14 18:06:14 +01:00
Heming Zhao	13c254fc05	fix dev_unset_last_byte after write error dev_unset_last_byte() must be called while the fd is still valid. After a write error, dev_unset_last_byte() must be called before closing the dev and resetting the fd. In the write error path, dev_unset_last_byte() was being called after label_scan_invalidate() which meant that it would not unset the last_byte values. After a write error, dev_unset_last_byte() is now called in dev_write_bytes() before label_scan_invalidate(), instead of by the caller of dev_write_bytes(). In the common case of a successful write, the sequence is still: dev_set_last_byte(); dev_write_bytes(); dev_unset_last_byte(); Signed-off-by: Zhao Heming <heming.zhao@suse.com>	2019-11-13 09:36:58 -06:00
Zdenek Kabelac	08f36dd093	lvextend: fix resizing volumes of different segtype When resizing 2 volumes like thin-pool and it's metadata and they would be of a different type - command would be actually expecting both LVs being of a same segtype - and would throw an error in case they are different. This patch fixes is by setting a new segtype from last segment of 2nd. extented device. Also it fixes the possible 'percentage' extension setup that might have been used for 'primary' volume - while the 'secondary' LV always goes with direct size - as we do not support 'percentage' setup for them This affects maily usage of thin-pool where the extension of thin-pool data size may also lead to extension of metadata size.	2019-11-11 22:44:25 +01:00
Zdenek Kabelac	43db8f8d5d	cov: ensure read_ahead is available Make sure read_ahead pointer is not NULL when quering for RA.	2019-11-11 22:44:25 +01:00
Zdenek Kabelac	8679d45917	gcc: avoid declaration shadowing dev_name is global in device.h	2019-11-11 22:44:18 +01:00
Zdenek Kabelac	cca5aec0ef	activation: drop removed declaration Seems this function has been removed long time ago with: `3e781ea446`	2019-10-31 15:33:09 +01:00
Zdenek Kabelac	8689b4ed82	raid: drop internal error Fix some internal error reports and debug trace returns	2019-10-31 15:31:30 +01:00
Zdenek Kabelac	3d9fc7d6f3	manip: optimize lvs_using_lv Instead of checking all LVs in a VG - do just a direct copy of LVs from the existing list ->segs_using_thin_lv. TODO: maybe it could be better to expose seg_list to /tools...	2019-10-31 15:31:30 +01:00
Zdenek Kabelac	c21440536d	mirror: remove unused code	2019-10-31 15:31:30 +01:00
Zdenek Kabelac	ab315e7a81	mirror: directly activate updated mirror	2019-10-31 15:31:30 +01:00
Zdenek Kabelac	80b2de9e6a	mirror: fix leg splitting Enhance lv_info with lv_info_with_name_check. This 'variant' not only check existance if UUID in DM table but also compares its DM name whether it's matching expected LV name. Otherwise activation may 'skip' activation with rename in case the DM UUID already exists, just device is different name. This change make fairly easier manipulation with i.e. detached mirror leg which ATM is using same UUID - just the LV name have been changed. Used code was not able to run 'activation' (and do a rename) and just skipped the call. So the code used to do a workaround and 'tried' to deactivate such LV firts - this however work only in non-clvmd case, as cluster was not having the lock for deactivated LV. With this extended lv_info code will run 'activation' and will synchronize the name to match expected LV name. Patch extends _lv_info() with new paramter 'with_name_check', which is later translated into 'name_check' argument for _info_run() which in case of name mismatch evaluates the check as if device does not exists. Such call is only used in one place _lv_activate() which then let activation run. All other invocation of _info() calls are left intact. TODO: fix mirror table manipulation (and raid)....	2019-10-31 15:31:30 +01:00
Joe Thornber	25e7bf021a	[bcache] bcache_invalidate_fd, only remove prefixes on success.	2019-10-29 15:21:11 +00:00
Joe Thornber	7e8296f478	[bcache] reverse earlier patch. It broke some unit tests, for v. little benefit	2019-10-29 15:14:07 +00:00
Joe Thornber	2b3c39e402	[bcache] pass up the error from io_submit rather than using generic -EIO Author: Heming Zhao	2019-10-29 10:39:20 +00:00
Joe Thornber	6b0d969b2a	[label] Use bcache_abort_fd() to ensure blocks are no longer in the cache. The return value from bcache_invalidate_fd() was not being checked. So I've introduced a little function, _invalidate_fd() that always calls bcache_abort_fd() if the write fails.	2019-10-28 15:01:47 +00:00
Joe Thornber	2938b4dcca	[bcache] add bcache_abort() This gives us a way to cope with write failures.	2019-10-28 15:00:53 +00:00
Zdenek Kabelac	0e5f39a5ac	snapshot: use single merging sequence The resume of 'released' 'COW' should preceed the resume of origin. The fact we need to do the sequence differently for merge was cause by bugs fixed in 2 previous commits - so we no longer need to recognize 'merging' and we should always go with single sequence. The importance of this order is - to properly remove '-real' device from origin LV. When COW is activated as 2nd. '-real' device is kept in table as it cannot be removed during 1st. resume of origin, and later activation of COW LV no longer builds tree associated with origin LV.	2019-10-26 00:49:16 +02:00
Zdenek Kabelac	855b16ce14	snapshot: fix checking of merged thin volume When merging of thin snapshot is taking place, the origin target will be of thin type.	2019-10-26 00:49:16 +02:00
Zdenek Kabelac	9968be55ed	snapshot: correctly check device id of merged thin When checking device id of a thin device that is just being merged - the snapshot actually could have been already finished which means '-real' suffix for the LV is already gone and just LV is there - so check explicitely for this condition and use correct UUID for this case.	2019-10-26 00:49:16 +02:00
David Teigland	6a8bd0c509	lvmlockd: fix cachevol locking When a cachevol LV is attached, have the LV keep it's lock allocated. The lock on the cachevol won't be used while it's attached. When the cachevol is split a new lock does not need to be allocated. (Applies to cachevol usage by both dm-cache and dm-writecache.)	2019-10-25 14:08:59 -05:00
Zdenek Kabelac	80ae7206a8	cache: _cpool is protected suffix now	2019-10-22 16:07:21 +02:00
Zdenek Kabelac	a7563dc6a1	gcc: older version can't see udev is always set	2019-10-22 13:39:22 +02:00
David Teigland	c08704cee7	cachevol: use cachepool code for metadata size Based on a more detailed calculation, but because of extent size rounding, the final result is about the same.	2019-10-21 12:13:33 -05:00
Zdenek Kabelac	0c01a4c2a6	gcc: avoid warning: declaration of xxx shadows a global declaration Fix some gcc complaints again shadowing global declarations	2019-10-21 15:32:35 +02:00
Zdenek Kabelac	f61d828c86	gcc: older compiler is happier with this initilizer	2019-10-21 15:32:35 +02:00
Zdenek Kabelac	dd7629ea09	cache: use _cpool for used cache-pools When LV gets cached and uses cache-pool - such cache-pool will now get _cpool suffix automatically. Thus 'Pool' column for cached LV will now show either _cvol or _cpool LV.	2019-10-21 15:31:33 +02:00
Zdenek Kabelac	766dedb628	lvm-string: add drop_lvname_suffix Internal function to drop suffix out of lvname.	2019-10-21 12:14:15 +02:00
Zdenek Kabelac	2266a1863f	lv_manip: add lv_uniq_rename_update Add function to rename LV to either passed name or if the name is already in use, generate new lvol% name.	2019-10-21 12:14:15 +02:00
Zdenek Kabelac	ec85dfe0f8	cachevol: support removal of cachevol Removal of cachevol is equivalent of lvconvert --uncache and works the same way as with cachepool.	2019-10-17 13:03:50 +02:00
Zdenek Kabelac	5938cde11b	cache: single code for removal of cached volume Use same routine for dropping cached LV for cachevol and cachepool.	2019-10-17 13:03:50 +02:00
Zdenek Kabelac	9969361b51	debug: missing trace	2019-10-17 13:03:50 +02:00
Zdenek Kabelac	dab4a2c893	cachevol: move flag setting after taking archive Before 'archive()' is called, lvm2 must not touch/modify metadata. So move setting CACHE_VOL related flags past this point. Also make sure reading of cache segtype always restores this flag properly (even if compatible flag would be lost).	2019-10-17 13:03:50 +02:00
Zdenek Kabelac	f63e20ebcc	cache: drop validation check Since now we can cache either with cache-pool LV or any other LV (being used as cachevol LV) drop the validation condition.	2019-10-17 13:03:49 +02:00
Zdenek Kabelac	af8cfa90d9	cache: add more comments for min meta size Enhance source code with better explanation how the minimal metadata size is evaluated from data size and chunk size.	2019-10-17 13:03:49 +02:00
Zdenek Kabelac	2a08d6d1d4	cachevol: use CVOL UUID for cdata and cmeta layered devices Since code is using -cdata and -cmeta UUID suffixes, it does not need any new 'extra' ID to be generated and stored in metadata. Since introduce of new 'segtype' cache+CACHE_USES_CACHEVOL we can safely assume 'new' cache with cachevol will now be created without extra metadata_id and data_id in metadata. For backward compatibility, code still reads them in case older version of metadata have them - so it still should be able to activate such volumes. Bonus is lowered size of lv structure used to store info about LV (noticable with big volume groups).	2019-10-17 13:03:49 +02:00
David Teigland	81fe045714	cache: change default cachevol metadata sizes The first part of a cachevol LV is used for metadata, and the rest of the space is used for data. The division of space between metadata and data depends on the total size of the cachevol. The previous division gave more space than needed to metadata, it was: cachevol size 8M to 128M -> metadata size 16M * cachevol size 128M to 1G -> metadata size 32M cachevol size 1G and up -> metadata size 64M (* if this resulted in over half the LV used as metadata, then half the cachevol would be used for metadata, and the other half for data.) The division of space now gives less space to metadata, it is: cachevol size 8M to 16M -> metadata size 4M cachevol size 16M to 4G -> metadata size 8M cachevol size 4G to 16G -> metadata size 16M cachevol size 16G to 32G -> metadata size 32M cachevol size 32G and up -> metadata size 64M	2019-10-15 14:36:03 -05:00
David Teigland	0443d00ff1	allow activating known LVs when other LVs have unknown segtypes When a VG contains some LVs with unknown segtypes, the user should still be allowed to activate other LVs in the VG that are understood. $ lvs foo WARNING: Unrecognised flag CACHE_USES_CACHEVOL in segment type cache+CACHE_USES_CACHEVOL. WARNING: Unrecognised segment type cache+CACHE_USES_CACHEVOL LV VG Attr LSize lvol0 foo -wi------- 4.00m other foo vwi---u--- 48.00m $ lvcreate -l1 foo WARNING: Unrecognised flag CACHE_USES_CACHEVOL in segment type cache+CACHE_USES_CACHEVOL. WARNING: Unrecognised segment type cache+CACHE_USES_CACHEVOL Cannot change VG foo with unknown segments in it! Cannot process volume group foo $ lvchange -ay foo/lvol0 WARNING: Unrecognised flag CACHE_USES_CACHEVOL in segment type cache+CACHE_USES_CACHEVOL. WARNING: Unrecognised segment type cache+CACHE_USES_CACHEVOL $ lvchange -ay foo/other WARNING: Unrecognised flag CACHE_USES_CACHEVOL in segment type cache+CACHE_USES_CACHEVOL. WARNING: Unrecognised segment type cache+CACHE_USES_CACHEVOL Refusing activation of LV foo/other containing an unrecognised segment. $ lvs foo WARNING: Unrecognised flag CACHE_USES_CACHEVOL in segment type cache+CACHE_USES_CACHEVOL. WARNING: Unrecognised segment type cache+CACHE_USES_CACHEVOL LV VG Attr LSize lvol0 foo -wi-a----- 4.00m other foo vwi---u--- 48.00m	2019-10-15 14:34:53 -05:00
David Teigland	91ee025d5b	cache: change cachevol flags for backward compat A cachevol LV had the CACHE_VOL status flag in metadata, and the cache LV using it had no new flag. This caused problems if the new metadata was used by an old version of lvm. An old version of lvm would have two problems processing the new metadata: . The old lvm would return an error when reading the VG metadata when it saw the unknown CACHE_VOL status flag. . The old lvm would return an error when reading the VG metadata because it would not find an expected cache pool attached to the cache LV (since the cache LV had a cachevol attached instead.) Change the use of flags: . Change the CACHE_VOL flag to be a COMPATIBLE flag (instead of a STATUS flag) so that old versions will not fail when they see it. . When a cache LV is using a cachevol, the cache LV gets a new SEGTYPE flag CACHE_USES_CACHEVOL. This flag is appended to the segtype name, so that old lvm versions will fail to use the LV because of an unknown segtype, as opposed to failing to read the VG.	2019-10-15 09:05:52 -05:00
Zdenek Kabelac	1cd308d640	cachevol: drop no longer needed functions Code is no longer used/needed.	2019-10-14 15:20:25 +02:00
Zdenek Kabelac	201ffbd04a	cachevol: use lv_cache_remove Use same routine for dropping cache.	2019-10-14 15:20:25 +02:00
Zdenek Kabelac	2825ad9dd2	cachevol: improve manipulation with dm tree Enhance activation of cached devices using cachevol. Correctly instatiace cachevol -cdata & -cmeta devices with '-' in name (as they are only layered devices). Code is also a bit more compacted (although still not ideal, as the usage of extra UUIDs stored in metadata is troublesome and will be repaired later). NOTE: this patch my brink potentially minor incompatiblity for 'runtime' upgrade	2019-10-14 15:17:50 +02:00
Zdenek Kabelac	a454a1b4ea	cachevol: put _cvol as protected suffix. This revert "drop cvol dm uuid suffix for cachevol LVs" commit `5191057d9d`. Start using -cvol for DM UUID.	2019-10-14 15:16:05 +02:00
Zdenek Kabelac	77deadd3af	cachevol: drop LV_CACHE_VOL on detach automatically Move dropping of cachevol flag into detach function. TODO: this flag should be internal to lvm2.	2019-10-14 15:15:14 +02:00
Zdenek Kabelac	615e18f5b2	cache: enhance removal function to work with cvol To keep things simple, use same code for all cache removal functions, not just for cachepools but also cachevols.	2019-10-14 15:14:25 +02:00
Zdenek Kabelac	6ee83f699b	cache: correct condition	2019-10-14 15:14:25 +02:00
Zdenek Kabelac	bc35ccd174	cache: recognize cachevol with lv_cache_remove	2019-10-14 15:14:25 +02:00
Zdenek Kabelac	36944e1009	cache: reload only when switched to cleaner policy Reload cache target only when lvm2 reload table with cache with clearer policy.	2019-10-14 15:14:22 +02:00
David Teigland	bd21736e8b	vgck: let updatemetadata repair mismatched metadata Let vgck --updatemetadata repair cases where different mdas hold indepedently valid but unmatching copies of the metadata, i.e. different text metadata checksums or text metadata sizes.	2019-10-11 12:57:39 -05:00
David Teigland	d6ffc99052	vgck: fix updatemetadata writing different descriptions vgck --updatemetadata would write the same correct metadata to good mdas, and then to bad mdas, but the sequence of vg_write/vg_commit calls betwen good and bad mdas could cause a different description field to be generated for good/bad mdas. (The description field describing the command was recently included in the ondisk copy of the metadata text.)	2019-10-11 12:57:32 -05:00
David Teigland	fe16d296b0	pvmove: remove some cmirror related code which is no longer used	2019-10-11 11:31:42 -05:00
David Teigland	b6240c9188	vgremove: remove internal lvmlock LV If a VG is forcibly changed from lock_type sanlock to lock_type none, the internal lvmlock LV is left behind. If that LV is not removed before vgremove is run on the VG, then an internal check will be triggered by the hidden lvmlock LV. So, check for and remove a left over lvmlock LV during vgremove.	2019-10-04 12:01:30 -05:00
Zdenek Kabelac	ca70dc4540	vdo: add lvs fields to query vdo volume properties Add lots of vdo fields: vdo_operating_mode - For vdo pools, its current operating mode. vdo_compression_state - For vdo pools, whether compression is running. vdo_index_state - For vdo pools, state of index for deduplication. vdo_used_size - For vdo pools, currently used space. vdo_saving_percent - For vdo pools, percentage of saved space. vdo_compression - Set for compressed LV (vdopool). vdo_deduplication - Set for deduplicated LV (vdopool). vdo_use_metadata_hints - Use REQ_SYNC for writes (vdopool). vdo_minimum_io_size - Minimum acceptable IO size (vdopool). vdo_block_map_cache_size - Allocated caching size (vdopool). vdo_block_map_era_length - Speed of cache writes (vdopool). vdo_use_sparse_index - Sparse indexing (vdopool). vdo_index_memory_size - Allocated indexing memory (vdopool). vdo_slab_size - Increment size for growing (vdopool). vdo_ack_threads - Acknowledging threads (vdopool). vdo_bio_threads - IO submitting threads (vdopool). vdo_bio_rotation - IO enqueue (vdopool). vdo_cpu_threads - CPU threads for compression and hashing (vdopool). vdo_hash_zone_threads - Threads for subdivide parts (vdopool). vdo_logical_threads - Logical threads for subdivide parts (vdopool). vdo_physical_threads - Physical threads for subdivide parts (vdopool). vdo_max_discard - Maximum discard size volume can recieve (vdopool). vdo_write_policy - Specified write policy (vdopool). vdo_header_size - Header size at front of vdopool. Previously only 'lvdisplay -m' was exposing them.	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	862aa06e5e	vdo: remember configure VDO write policy in metadata Store write_policy in vdopool metadata. In case it's not present 'auto' is selected.	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	7ca9be034f	vdo: field update	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	cf8aee096f	vdo: introduce get_vdo_write_policy_name	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	c756f76802	vdo: correct internal API for set_vdo_write_policy This is 'setting' function.	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	9d8a028e8c	vdo: keep minimum_io_size in sectors	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	aad91330fe	vdo: raise VDO default bio threads to 4 Since 'vdo create' tends to use this setting, update lvm2 to provide same default.	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	98419e0667	display: try to show status for inactive vdopool Since we now support activation of 'vdo' volume without explicit activation of 'vdopool' it's now possible to have active layer vdopool (-vpool) volume and having vdopool itself inactive - yet still in this case we can show available stats for this volume. But we need to show correct activation status and other standard info.	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	6a9a4b4534	resize: continue change for getting vdo status before resize Continue commit `a98b77c164`. There needs to be error reported when status can't be obtained.	2019-10-04 17:31:55 +02:00
Zdenek Kabelac	cb5f0bdba9	cache: report for succesful status	2019-10-04 17:31:55 +02:00
David Teigland	a68258339d	lvmlockd: set failure flag for test mode Set a failure flag when vg_read returns an error for test mode. The caller can segfault if there's an error with no flag set.	2019-10-04 10:09:49 -05:00
David Teigland	f836fe3836	scan: use PV device name hint for choosing duplicate PV Prefer a device if its name matches the PV device name hint.	2019-09-30 11:38:10 -05:00
David Teigland	4910a31f6d	scan: use PV size for choosing duplicate PV Prefer a device if it matches the size of the PV.	2019-09-30 11:38:10 -05:00
David Teigland	f3084ee2e5	scan: add PV summary info to lvmcache Expand the lvmcache info that is saved by the scan to include PV info from the metadata.	2019-09-30 11:38:10 -05:00
David Teigland	3a8e41a67b	metadata: import device name hint from metadata Start by using it in a comment for a missing PV.	2019-09-30 11:38:10 -05:00
David Teigland	fcfabb26a5	metadata: add args to metadata import functions instead of getting them through fid arg no functional change	2019-09-30 11:38:10 -05:00
Zdenek Kabelac	5c0264d689	vdo: restore monitoring of vdo pool Switch to -vpool layered name needs to monitor proper device.	2019-09-30 13:34:34 +02:00
Zdenek Kabelac	a98b77c164	vdo: properly check percentage for resize Avoid checking 'lv_is_active()' since special LV types does this validation anyway what calling _percent() function and call it ONLY when none of special types is queried. This restores support for VDO resize (as with support for separate VDO pool activation, plain query for lv_is_active() is not working in this case).	2019-09-30 13:34:34 +02:00
Zdenek Kabelac	c813db8fc2	vdo: deactivate forgotten vdo pool If the linear mapping is lost (for whatever reason, i.e. test suite forcible 'dmsetup remove' linear LV, lvm2 had hard times figuring out how to deactivate such DM table. So add function which is in case inactive VDO pool LV checks if the pool is actually still active (-vpool device present) and it has open count == 0. In this case deactivation is allowed to continue and cleanup DM table.	2019-09-30 13:34:34 +02:00
David Teigland	26596ce7fa	writecache: allow removing LV with attached writecache	2019-09-24 15:51:05 -05:00
David Teigland	76dd9b2b51	writecache: move code into new file put writecache specific code in writecache_manip.c should be no functional change	2019-09-24 15:51:05 -05:00
David Teigland	56aadd7fe2	lvremove: remove attached cachevol with removed LV When an LV is removed that has an attached cachevol, also remove the cachevol LV.	2019-09-24 15:51:05 -05:00
David Teigland	5191057d9d	drop cvol dm uuid suffix for cachevol LVs The "-cvol" suffix on the uuid is interfering with activation code, so drop the suffix for now.	2019-09-23 14:13:31 -05:00
David Teigland	27c3c1d7c8	writecache: display layout and role fields	2019-09-20 14:55:11 -05:00
David Teigland	6f7d7089b4	writecache: use dm suffixes and lv attributes - use internal CACHE_VOL flag on cachevol LV - add suffixes to dm uuids for internal LVs - display appropriate letters in the LV attr field - display writecache's cachevol in lvs output	2019-09-20 14:08:51 -05:00
David Teigland	5d3bced5ea	lvconvert: detaching cachevol with missing PVs . For dm-cache in writethrough, always allow splitcache, whether the cache is missing PVs or not. . For dm-cache in writeback, if the cache is missing PVs, allow splitcache with force and yes. . For dm-writecache, if the cache is missing PVs, allow splitcache with force and yes.	2019-09-20 09:59:37 -05:00
David Teigland	515e37b6dd	cachevol: add dm uuid suffixes to hidden lvs to indicate they are private lvm devs	2019-09-20 09:59:37 -05:00
David Teigland	d2c065789c	lvconvert: cachevol LV can have multiple segments	2019-09-20 09:59:37 -05:00
Zdenek Kabelac	6612d8dd5e	vdo: enhance activation with layer -vpool Enhance 'activation' experience for VDO pool to more closely match what happens for thin-pools where we do use a 'fake' LV to keep pool running even when no thinLVs are active. This gives user a choice whether he want to keep thin-pool running (wihout possibly lenghty activation/deactivation process) As we do plan to support multple VDO LVs to be mapped into a single VDO, we want to give user same experience and 'use-patter' as with thin-pools. This patch gives option to activate VDO pool only without activating VDO LV. Also due to 'fake' layering LV we can protect usage of VDO pool from command like 'mkfs' which do require exlusive access to the volume, which is no longer possible. Note: VDO pool contains 1024 initial sectors as 'empty' header - such header is also exposed in layered LV (as read-only LV). For blkid we are indentified as LV with UUID suffix - thus private DM device of lvm2 - so we do not need to store any extra info in this header space (aka zero is good enough).	2019-09-17 13:17:19 +02:00
Zdenek Kabelac	66f69e766e	thin: activate layer pool aas read-only LV When lvm2 is activating layered pool LV (to basically keep pool opened, the other function used to be 'locking' be in sync with DM table) use this LV in read-only mode - this prevents 'write' access into data volume content of thin-pool. Note: since EMPTY/unused thin-pool is created as 'public LV' for generic use by any user who i.e. wish to maintain thin-pool and thins himself. At this moment, thin-pool appears as writable LV. As soon as the 1st. thinLV is created, layer volume will appear is 'read-only' LV from this moment.	2019-09-17 13:16:50 +02:00
Zdenek Kabelac	693215716b	devices: crypto skip Devices with UUID signature CRYPT-SUBDEV are internal crypto devices.	2019-09-17 13:15:22 +02:00
David Teigland	fcbffbdbc0	bcache: change log level for prefetch message The "new new blocks" message was printed as an error but it's not an error condition.	2019-09-03 12:02:09 -05:00
David Teigland	25b58310e3	pvscan: avoid full scan for activation When an online PV completed a VG, the standard activation functions were used to activate the VG. These functions use a full scan of all devs. When many pvscans are run during startup and need to activate many VGs, scanning all devs from all the pvscans can take a long time. Optimize VG activation in pvscan to scan only the devs in the VG being activated. This makes use of the online file info that was used to determine the VG was complete. The downside of this approach is that pvscan activation will not detect duplicate PVs and block activation, where a normal activation command (which scans all devices) would.	2019-09-03 10:11:16 -05:00
David Teigland	98d420200e	vgextend: check missing device during block size check Checking the block size when a device is missing could trigger a segfault.	2019-09-03 10:07:56 -05:00
David Teigland	7cfbf3a394	fix segfault for invalid characters in vg name Fixes a regression from commit `ba7ff96faf` "improve reading and repairing vg metadata" where the error path for a vg name with invalid charaters was missing an error flag, which led to the caller not recognizing an error occured. Previously, an error flag was hidden in the old _vg_make_handle function.	2019-08-29 11:35:46 -05:00
David Teigland	5b3fbccab9	hints: check for malloc failure	2019-08-28 12:41:57 -05:00
David Teigland	12707adac8	hints: fix copy of filter Only the first entry of the filter array was being included in the copy of the filter, rather than the entire thing. The result is that hints would not be refreshed if the filter was changed but the first entry was unchanged.	2019-08-28 12:33:04 -05:00
David Teigland	dcbed38b33	fix duplicate pv size check Fixes a segfault in the recent commit `e01fddc57`: "improve duplicate pv handling for md components" While choosing between duplicates, the info struct is not always valid; it may have been dropped already. Remove the code that was still using the info struct for size comparisons. The size comparisons were a bogus check anyway because it was just preferring the dev that had already been chosen, it wasn't actually comparing the dev size to the PV size. It would be good to use a dev/PV size comparison in the duplicate handling code, but the PV size is not available until after vg_read, not from the scan.	2019-08-27 15:40:24 -05:00
Zdenek Kabelac	b2885b7103	activation: use cmd pending mem for pending_delete Since we need to preserve allocated strings across 2 separate activation calls of '_tree_action()' we need to use other mem pool them dm->mem - but since cmd->mem is released between individual lvm2 locking calls, we rather introduce a new separate mem pool just for pending deletes with easy to see life-span. (not using 'libmem' as it would basicaly keep allocations over the whole lifetime of clvmd) This patch is fixing previous commmit where the memory was improperly used after pool release.	2019-08-27 15:54:42 +02:00
Zdenek Kabelac	55f1d8a269	configure: check for prlimit Update configure and make code compilable if prlimit() is not present. Since the code is suspicious do not cope yet with it's replacement with set/getrlimit().	2019-08-26 17:24:37 +02:00
Zdenek Kabelac	4b1dcc2eeb	lv_manip: add synchronizations New udev in rawhide seems to be 'dropping' udev rule operations for devices that are no longer existing - while this is 'probably' a bug - it's revealing moments in lvm2 that likely should not run in a single transaction and we should wait for a cookie before submitting more work. TODO: it seem more 'error' paths should always include synchronization before starting deactivating 'just activated' devices. We should probably figure out some 'automatic' solution for this instead of placing sync_local_dev_name() all over the place...	2019-08-26 15:32:19 +02:00
Zdenek Kabelac	c98e34e4d0	cache: improve vgremove loop Support internal removal of 'cache origin' volume - which we do not normally expose to a user - however internal processing loops may hit this condition (depending on order of list LVs). So when this operation is internally requested - we automatically try to remove it's 'holding' LV (cache LV) - which will also remove the origin.	2019-08-26 15:32:12 +02:00
Zdenek Kabelac	af0b84ccc8	snapshot: always activate Drop the 'cluster-only' optimization so we do resume ALL device before we try to wait on cookie before 'removal' operation. It's more correct order of operation - alhtough possibly slightly less efficient - but until we have correct list of operations 'in-progress' we can't do anything better.	2019-08-26 15:23:44 +02:00
Zdenek Kabelac	7833c45fbe	activation: extend handling of pending_delete With previous patch `30a98e4d67` we started to put devices one pending_delete list instead of directly scheduling their removal. However we have operations like 'snapshot merge' where we are resuming device tree in 2 subsequent activation calls - so 1st such call will still have suspened devices and no chance to push 'remove' ioctl. Since we curently cannot easily solve this by doing just single activation call (which would be preferred solution) - we introduce a preservation of pending_delete via command structure and then restore it on next activation call. This way we keep to remove devices later - although it might be not the best moment - this may need futher tunning. Also we don't keep the list of operation in 1 trasaction (unless we do verify udev symlinks) - this could probably also make it more correct in terms of which 'remove' can be combined we already running 'resume'.	2019-08-26 15:16:38 +02:00
Zdenek Kabelac	30a98e4d67	activation: add synchronization point Resuming of 'error' table entry followed with it's dirrect removal is now troublesame with latest udev as it may skip processing of udev rules for already 'dropped' device nodes. As we cannot 'synchronize' with udev while we know we have devices in suspended state - rework 'cleanup' so it collects nodes for removal into pending_delete list and process the list with synchronization once we are without any suspended nodes.	2019-08-20 12:46:11 +02:00
Zdenek Kabelac	0451225c19	pvmove: correcting read_ahead setting When pvmove is finished, we do a tricky operation since we try to resume multiple different device that were all joined into 1 big tree. Currently we use the infromation from existing live DM table, where we can get list of all holders of pvmove device. We look for these nodes (by uuid) in new metadata, and we do now a full regular device add into dm tree structure. All devices should be already PRELOAD with correct table before entering suspend state, however for correctly working readahead we need to put correct info also into RESUME tree. Since table are preloaded, the same table is skip and resume, but correct read ahead is now set.	2019-08-20 12:37:32 +02:00
David Teigland	0534cd9cd4	pvscan: disable sleeping and retrying for udev When systemd is running pvscans, udev may not be entirely initialized, so the pvscan should not sleep and retry waiting for udev info.	2019-08-16 14:41:26 -05:00
David Teigland	61fce72a11	bcache: increase max allowed bcache size from 128MB to 512MB (the default remains 8MB)	2019-08-16 13:35:09 -05:00
David Teigland	e01fddc578	improve duplicate pv handling for md components Eliminate md components at the start so they don't interfere with actual duplicates, and don't need to be removed later. This also allows for choosing no copy of a PVID if they all happen to be md components.	2019-08-16 13:26:12 -05:00
David Teigland	ee4a32e992	lvmcache: use devl list helper	2019-08-16 13:26:12 -05:00
David Teigland	96dfad5022	lvmcache: replace found_duplicates variable With just checking if the duplicates lists are empty.	2019-08-16 13:26:11 -05:00
David Teigland	677833ce6f	lvmcache: renaming functions and variables related to duplicates, no functional changes.	2019-08-16 13:26:11 -05:00
David Teigland	65bcd16be2	md component detection addition in vg_read Usually md components are eliminated in label scan and/or duplicate resolution, but they could sometimes get into the vg_read stage, where set_pv_devices compares the device to the PV. If set_pv_devices runs an md component check and finds one, vg_read should eliminate the components. In set_pv_devices, run an md component check always if the PV is smaller than the device (this is not very common.) If the PV is larger than the device, (more common), do the component check when the config setting is "auto" (the default).	2019-08-16 13:24:34 -05:00
David Teigland	ecefcc9ca8	increase soft open file limit When there are more devices than the current soft open file limit (default 1024), raise the soft limit to the hard/max limit (default 4096). Do this prior to scanning in case enough of the devices are PVs that need to be kept open.	2019-08-08 15:45:03 -05:00
David Teigland	eb6aa5fefe	devices: put ifdef around BLKPBSZGET BLKPBSZGET is not defined before kernel version 2.6.32 (e.g. rhel5)	2019-08-08 15:45:03 -05:00
David Teigland	09bc2d0fd1	devices: clean up block size functions Replace calls to the old dev_get_block_size function with calls to the new dev_get_direct_block_size function, and remove the old function.	2019-08-07 11:48:10 -05:00
David Teigland	bec3088f85	Revert "config: cache_policy should be cfg_runtime" This reverts commit `29eee32ac2`. Some other changes are needed to make this runtime.	2019-08-07 11:35:45 -05:00
David Teigland	29eee32ac2	config: cache_policy should be cfg_runtime	2019-08-07 11:08:15 -05:00
David Teigland	682b6216df	config: set deprecated version for segment_libraries Stopped being used some time ago.	2019-08-07 11:08:11 -05:00
David Teigland	0404539edb	vgcreate/vgextend: restrict PVs with mixed block sizes Avoid having PVs with different logical block sizes in the same VG. This prevents LVs from having mixed block sizes, which can produce file system errors. The new config setting devices/allow_mixed_block_sizes (default 0) can be changed to 1 to return to the unrestricted mode.	2019-08-01 10:06:47 -05:00
David Teigland	7f347698e3	Fix rounding writes up to sector size Do this at two levels, although one would be enough to fix the problem seen recently: - Ignore any reported sector size other than 512 of 4096. If either sector size (physical or logical) is reported as 512, then use 512. If neither are reported as 512, and one or the other is reported as 4096, then use 4096. If neither is reported as either 512 or 4096, then use 512. - When rounding up a limited write in bcache to be a multiple of the sector size, check that the resulting write size is not larger than the bcache block itself. (This shouldn't happen if the sector size is 512 or 4096.)	2019-07-26 14:21:08 -05:00
David Teigland	c22ad12bab	metadata: extend writes to zero space Previously, consecutive copies of metadata would have garbage data in the space between them. After metadata wrapping, the garbage would be portions of old metadata. This made analysis of the metadata area more difficult. This would happen because the start of new copy of metadata is advanced from the end of the last copy to start at the next 512 byte boundary. Zero the space between consecutive copies of metadata by extending each metadata write to end at the next 512 byte boundary. The size of the metadata itself is not extended, only the write. The buffer being written contains the metadata text followed by the necessary number of zeros.	2019-07-12 15:00:12 -05:00
David Teigland	4567c6a2b2	enable full md component detection at the right time An active md device with an end superblock causes lvm to enable full md component detection. This was being done within the filter loop instead of before, so the full filtering of some devs could be missed. Also incorporate the recently added config setting that controls the md component detection.	2019-07-10 13:30:50 -05:00
David Teigland	f17353e3e6	md component detection for differing PV and device sizes This check was mistakenly removed when shifting code in commit "separate code for setting devices from metadata parsing". Put it back with some new conditions.	2019-07-09 13:40:41 -05:00
David Teigland	d2b88f2715	scan: remove unused arg to setup_bcache	2019-07-09 13:16:26 -05:00
David Teigland	b4402bd821	exported vg handling The exported VG checking/enforcement was scattered and inconsistent. This centralizes it and makes it consistent, following the existing approach for foreign and shared VGs/PVs, which are very similar to exported VGs/PVs. The access policy that now applies to foreign/shared/exported VGs/PVs, is that if a foreign/shared/exported VG/PV is named on the command line (i.e. explicitly requested by the user), and the command is not permitted to operate on it because it is foreign/shared/exported, then an access error is reported and the command exits with an error. But, if the command is processing all VGs/PVs, and happens to come across a foreign/shared/exported VG/PV (that is not explicitly named on the command line), then the command silently skips it and does not produce an error. A command using tags or --select handles inaccessible VGs/PVs the same way as a command processing all VGs/PVs, and will not report/return errors if these inaccessible VGs/PVs exist. The new policy fixes the exit codes on a somewhat random set of commands that previously exited with an error if they were looking at all VGs/PVs and an exported VG existed on the system. There should be no change to which commands are allowed/disallowed on exported VGs/PVs. Certain LV commands (lvs/lvdisplay/lvscan) would previously not display LVs from an exported VG (for unknown reasons). This has not changed. The lvm fullreport command would previously report info about an exported VG but not about the LVs in it. This has changed to include all info from the exported VG.	2019-06-25 15:39:08 -05:00
David Teigland	d16142f90f	scanning: open devs rw when rescanning for write When vg_read rescans devices with the intention of writing the VG, the label rescan can open the devs RW so they do not need to be closed and reopened RW in dev_write_bytes.	2019-06-21 10:57:49 -05:00
David Teigland	8fecd9c14e	metadata: include description with command in metadata areas Previously the VG metadata description field (which contains the command line) was only included in backup/archive copies of the metadata. Now also include it in the metadata written to the metadata areas.	2019-06-20 16:09:05 -05:00
Marian Csontos	556dcd2c6b	config: Fix default option which makes no sense Default value is either undefined or commented, never both.	2019-06-17 19:08:28 +02:00

... 5 6 7 8 9 ...

7034 Commits