shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2025-01-03 05:18:29 +03:00

Author	SHA1	Message	Date
David Teigland	47effdc025	vgck --updatemetadata is a new command uses vg_write to correct more common or less severe issues, and also adds the ability to repair some metadata corruption that couldn't be handled previously.	2019-06-07 15:54:04 -05:00
David Teigland	de3d3b11f4	move pv header repairs to vg_write Correct PV header in-use or version fields from vg_write instead of vg_read.	2019-06-07 15:54:04 -05:00
David Teigland	ab61a6d85d	move wipe_outdated_pvs to vg_write and implement it based on a device, not based on a pv struct (which is not available when the device is not a part of the vg.) currently only the vgremove command wipes outdated pvs until more advanced recovery is added in a subsequent commit	2019-06-07 15:54:04 -05:00
David Teigland	45b164f62c	create separate lvmcache update functions for read and write The vg read and vg write cases need to update lvmcache differently, so create separate functions for them. The read case now handles checking for outdated mdas and moves them aside into a new list to be repaired in a subsequent commit.	2019-06-07 15:54:04 -05:00
David Teigland	027e0e92e6	fix vg_commit return value The existing comment was desribing the correct behavior, but the code didn't match. The commit is successful if one mda was committed. Making it depend on the result of the internal lvmcache update was wrong.	2019-06-07 15:54:04 -05:00
David Teigland	650524b955	ability to keep track of bad mdas in lvmcache mda's that cannot be processed by lvm because of some corruption can be kept on a separate list. These will be used for more advanced repair in a subsequent commit.	2019-06-07 15:54:04 -05:00
David Teigland	aeafdc1f45	add flags to keep track of bad metadata When reading metadata headers and text, use a new set of flags to identify specific errors that are seen. These will be used for more advanced repair in a subsequent commit.	2019-06-07 15:54:04 -05:00
David Teigland	2b241eb1f6	pvck: use new dump routines for old output Use the recently added dump routines to produce the old/traditional pvck output, and remove the code that had been used for that. The validation/checking done by the new routines means that new lines prefixed with CHECK are printed for incorrect values.	2019-06-05 16:28:52 -05:00
Zdenek Kabelac	e3c4ab0cc7	cache: support no_discard_passdown Recent kernel version from kernel commit: de7180ff908b2bc0342e832dbdaa9a5f1ecaa33a started to report in cache status line new flag: no_discard_passdown Whenever lvm spots unknown status it reports: Unknown feature in status: So add reconginzing this feature flag and also report this with 'lvs -o+kernel_discards' When no_discard_passdown is found in status 'nopassdown' gets reported for this field (roughly matching what we report for thin-pools).	2019-06-05 15:48:41 +02:00
David Teigland	645dd27604	separate code for setting devices from metadata parsing Pull the code that sets devs for PVs out of the metadata parsing code and call it separately.	2019-05-23 11:57:38 -05:00
Zdenek Kabelac	d60d59a5f3	cleanup: use unsigned type	2019-05-03 13:17:22 +02:00
David Teigland	8c87dda195	locking: unify global lock for flock and lockd There have been two file locks used to protect lvm "global state": "ORPHANS" and "GLOBAL". Commands that used the ORPHAN flock in exclusive mode: pvcreate, pvremove, vgcreate, vgextend, vgremove, vgcfgrestore Commands that used the ORPHAN flock in shared mode: vgimportclone, pvs, pvscan, pvresize, pvmove, pvdisplay, pvchange, fullreport Commands that used the GLOBAL flock in exclusive mode: pvchange, pvscan, vgimportclone, vgscan Commands that used the GLOBAL flock in shared mode: pvscan --cache, pvs The ORPHAN lock covers the important cases of serializing the use of orphan PVs. It also partially covers the reporting of orphan PVs (although not correctly as explained below.) The GLOBAL lock doesn't seem to have a clear purpose (it may have eroded over time.) Neither lock correctly protects the VG namespace, or orphan PV properties. To simplify and correct these issues, the two separate flocks are combined into the one GLOBAL flock, and this flock is used from the locking sites that are in place for the lvmlockd global lock. The logic behind the lvmlockd (distributed) global lock is that any command that changes "global state" needs to take the global lock in ex mode. Global state in lvm is: the list of VG names, the set of orphan PVs, and any properties of orphan PVs. Reading this global state can use the global lock in sh mode to ensure it doesn't change while being reported. The locking of global state now looks like: lockd_global() previously named lockd_gl(), acquires the distributed global lock through lvmlockd. This is unchanged. It serializes distributed lvm commands that are changing global state. This is a no-op when lvmlockd is not in use. lockf_global() acquires an flock on a local file. It serializes local lvm commands that are changing global state. lock_global() first calls lockf_global() to acquire the local flock for global state, and if this succeeds, it calls lockd_global() to acquire the distributed lock for global state. Replace instances of lockd_gl() with lock_global(), so that the existing sites for lvmlockd global state locking are now also used for local file locking of global state. Remove the previous file locking calls lock_vol(GLOBAL) and lock_vol(ORPHAN). The following commands which change global state are now serialized with the exclusive global flock: pvchange (of orphan), pvresize (of orphan), pvcreate, pvremove, vgcreate, vgextend, vgremove, vgreduce, vgrename, vgcfgrestore, vgimportclone, vgmerge, vgsplit Commands that use a shared flock to read global state (and will be serialized against the prior list) are those that use process_each functions that are based on processing a list of all VG names, or all PVs. The list of all VGs or all PVs is global state and the shared lock prevents those lists from changing while the command is processing them. The ORPHAN lock previously attempted to produce an accurate listing of orphan PVs, but it was only acquired at the end of the command during the fake vg_read of the fake orphan vg. This is not when orphan PVs were determined; they were determined by elimination beforehand by processing all real VGs, and subtracting the PVs in the real VGs from the list of all PVs that had been identified during the initial scan. This is fixed by holding the single global lock in shared mode while processing all VGs to determine the list of orphan PVs.	2019-04-29 13:01:05 -05:00
David Teigland	ccd1386070	wipe_lv: initially open LV in writable mode wipe_lv knows it's going to write the device, so it can open rw from the start. It was opening readonly, and then dev_write needed to reopen it readwrite.	2019-04-26 14:49:27 -05:00
David Teigland	c33770c02d	lvmlockd: do not allow mirror LV to be activated shared This reverts `518a8e8cfb` "lvmlockd: activate mirror LVs in shared mode with cmirrord" because while activating a mirror LV with cmirrord worked, changes to the active cmirror did not work.	2019-04-04 13:21:38 -05:00
Zdenek Kabelac	fcec6691f0	thin: fix maintenance of _pmspare When metadata grows lvm2 may need to extend also _pmspare volume.	2019-04-03 13:28:54 +02:00
Zdenek Kabelac	e27d027155	thin: resize metadata with data When data are growing, adapt also size of metadata. As we get way too many reports from users doing huge growths of data portion while keep metadata small and avoiding using monitoring. So to enhance the user-experience in case user requests grown of thin-pool (without passing PV list for growth) - lvm2 will automaticaly grown also the metadata part of thin-pool (if possible).	2019-04-03 13:28:22 +02:00
Zdenek Kabelac	7c3de2fd93	thin: introduce estimate_thin_pool_metadata_size Add function for estimation of thin-pool metadata size for given size of data. Function is using already existing internal API so it can be reused for resize of thin-pool data.	2019-04-03 13:27:17 +02:00
David Teigland	85e68a8333	lvextend: refresh shared LV remotely using dlm/corosync When lvextend extends an LV that is active with a shared lock, use this as a signal that other hosts may also have the LV active, with gfs2 mounted, and should have the LV refreshed to reflect the new size. Use the libdlmcontrol run api, which uses dlm_controld/corosync to run an lvchange --refresh command on other cluster nodes.	2019-03-21 12:38:20 -05:00
David Teigland	d369de8399	lvextend: allow on LV active with a shared lock Detect when a shared lock exists, don't require the normal exclusive lock, and allow the lvextend.	2019-03-21 12:38:20 -05:00
Zdenek Kabelac	677aa84be3	vdo: enable caching for vdopool LV and vdo LV Allow using caching with VDO. User can either cache a single vdopool or a vdo LV - difference when the caching is put-in depends on a use-case and it's upto user to decide which kind of speed is expected.	2019-03-20 14:38:31 +01:00
Zdenek Kabelac	0db22c5f81	lv_manip: insert remove layer skips pools Fixing renaming of subLVs when removing and inserting layers - this got visible when using stacked VDO pools.	2019-03-20 14:38:05 +01:00
Zdenek Kabelac	1cc690e911	thin: max thin	2019-03-20 14:37:44 +01:00
David Teigland	4e20ebd6a1	pvscan: ignore online for shared and foreign PVs Activation would not be allowed anyway, but we can check for these cases early and avoid wasted time in pvscan managing online files an attempting activation.	2019-03-05 15:19:05 -06:00
David Teigland	a9eaab6beb	Use "cachevol" to refer to cache on a single LV and "cachepool" to refer to a cache on a cache pool object. The problem was that the --cachepool option was being used to refer to both a cache pool object, and to a standard LV used for caching. This could be somewhat confusing, and it made it less clear when each kind would be used. By separating them, it's clear when a cachepool or a cachevol should be used. Previously: - lvm would use the cache pool approach when the user passed a cache-pool LV to the --cachepool option. - lvm would use the cache vol approach when the user passed a standard LV in the --cachepool option. Now: - lvm will always use the cache pool approach when the user uses the --cachepool option. - lvm will always use the cache vol approach when the user uses the --cachevol option.	2019-02-27 08:52:34 -06:00
Zdenek Kabelac	d19e372795	cleanup: indent	2019-01-28 22:39:10 +01:00
Zdenek Kabelac	78dd9d820d	thin: select chunk size as power of 2 Whenever thin-pool chunk size is unspecified and left for lvm calculation try to select the size as nearest highest power-of-2 instead of just being a multiple of 64KiB.	2019-01-28 22:17:25 +01:00
Zdenek Kabelac	58ad831c72	cache: select chunk size as power of 2 When cache chunk size is not configured, and left for lvm deduction, select the value which is power-of-2.	2019-01-28 22:17:14 +01:00
Zdenek Kabelac	105a8edea1	lv_manip: better work with PERCENT_VG modifier with lvresize Fixing recent commit `022ebb0cfe` Resize already has size that needs to be counted with, otherwise upsizing operation could turn into size reduction one.	2019-01-21 15:39:24 +01:00
Zdenek Kabelac	f3c52a515b	vdo: enable dmeventd resize	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	a16d914d34	cleanup: better naming	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	08cabe9b83	vdo: allow resize of VDO and VDO pool volumes Now with newer VDO kvdo target we can start to use standard mechanism to enable resize of VDO volumes. VDO pool can be grown. Virtual volume grows on top of VDO pool when is not big enough. Reduced VDOLV is calling discard for reduced areas - this can take long time! TODO: implement some pollable mechanism for out-of-lock TRIM.	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	bd6709cec6	vdo: size reduction requires VDO to be active To be able to send discard to reduced areas - the VDO LV needs to be active.	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	f1ad4b0679	vdo: discard reduced area Implement sending discard to reduced LV area.	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	ca72d19691	vdo: estimate virtual size after resize	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	ab031d673d	vdo: introduce function for estimation of virtual size	2019-01-21 12:53:16 +01:00
Zdenek Kabelac	022ebb0cfe	lv_manip: better work with PERCENT_VG modifier When using 'lvcreate -l100%VG' and there is big disproportion between real available space and requested setting - automatically fallback to 100%FREE. Difference can be seen when VG is big and already most space was allocated, so the requestion 100%VG can end (and by spec for % modifier it's correct) as LV with size of 1%VG. Usually this is not a big problem - buit in some cases - like cache-pool allocation, this can result a big difference for chunksize selection. With this patch it's more closely match common-sense logic without the need of reitteration of too big changes in lvm2 core ATM. TODO: in the future there should be allocator solving all allocations in a single call.	2019-01-21 12:53:15 +01:00
Zdenek Kabelac	26ead4bf45	cov: extent_size cannot be 0 Make this obvious to coverity.	2018-12-21 21:45:08 +01:00
Zdenek Kabelac	9dfb1a11b7	cov: drop unneeded header file MAX macro no longer needed in pe_align.	2018-12-21 21:45:08 +01:00
Zdenek Kabelac	3320ab8334	lib: move towards v2 version of VDO format Drop very old original format of VDO target and focus on V2 version. So some variables were renamed or replaced. There is no compatibility preserved (with assumption so far this is experimental feature and there is no real user). Note - version currently VDO calls this version 6.2.	2018-12-20 13:26:55 +01:00
Heinz Mauelshagen	e82303fd6a	lvcreate/lvconvert: optionally reenable mirrored mirror log for testing purposes only This is a followup patch to commit `edb72cb70c` to support related lvm2 test suite tests. A 'global/support_mirrored_mirror_log' bool configuration variable gets introduced allowing the creation of, or conversion to mirrored 'mirror' logs if set. The capability to create these in turn allows the rest of the tests to perform activation of such existing LVs and their conversions to disk/core 'mirror' logs. Display a disclaimer warning if enabled that this is not for regular use. Add definition of the enabled config option to respective test scripts. Related: rhbz1643562	2018-12-17 19:28:54 +01:00
Ming-Hung Tsai	859feb81e5	lvmanip: uninitialized members in struct pv_list (#10 ) Scenario: Given an existed LV `lvol0`, I want to create another LV on the PVs used by `lvol0`. I use `build_parallel_areas_from_lv()` to obtain the `pv_list` of each segments. However, the returned `pv_list` is not properly initialized, which causes segfault in subsequent operations.	2018-12-14 15:23:18 +01:00
Heinz Mauelshagen	dd5716ddf2	raid: fix (de)activation of RaidLVs with visible SubLVs There's a small window during creation of a new RaidLV when rmeta SubLVs are made visible to wipe them in order to prevent erroneous discovery of stale RAID metadata. In case a crash prevents the SubLVs from being committed hidden after such wiping, the RaidLV can still be activated with the SubLVs visible. During deactivation though, a deadlock occurs because the visible SubLVs are deactivated before the RaidLV. The patch adds _check_raid_sublvs to the raid validation in merge.c, an activation check to activate.c (paranoid, because the merge.c check will prevent activation in case of visible SubLVs) and shares the existing wiping function _clear_lvs in raid_manip.c moved to lv_manip.c and renamed to activate_and_wipe_lvlist to remove code duplication. Whilst on it, introduce activate_and_wipe_lv to share with (lvconvert\|lvchange).c. Resolves: rhbz1633167	2018-12-11 16:35:34 +01:00
Heinz Mauelshagen	edb72cb70c	lvcreate/lvconvert: prohibit creation of/conversion to mirrored mirror logs In RHEL7 we marked mirrored mirror logs as deprecated and added a related message. This patch prohibits creating new 'mirror' LVs with that log type or converting existing LVs to have one. Existing LVs with mirrored mirror log can be activated and converted to disk/core logs. Avoid double deprecation message when running lvconvert. Resolves: rhbz1643562	2018-12-08 02:52:50 +01:00
David Teigland	904e1e3d26	Place the first PE at 1 MiB for all defaults . When using default settings, this commit should change nothing. The first PE continues to be placed at 1 MiB resulting in a metadata area size of 1020 KiB (for 4K page sizes; slightly smaller for larger page sizes.) . When default_data_alignment is disabled in lvm.conf, align pe_start at 1 MiB, based on a default metadata area size that adapts to the page size. Previously, disabling this option would result in mda_size that was too small for common use, and produced a 64 KiB aligned pe_start. . Customized pe_start and mda_size values continue to be set as before in lvm.conf and command line. . Remove the configure option for setting default_data_alignment at build time. . Improve alignment related option descriptions. . Add section about alignment to pvcreate man page. Previously, DEFAULT_PVMETADATASIZE was 255 sectors. However, the fact that the config setting named "default_data_alignment" has a default value of 1 (MiB) meant that DEFAULT_PVMETADATASIZE was having no effect. The metadata area size is the space between the start of the metadata area (page size offset from the start of the device) and the first PE (1 MiB by default due to default_data_alignment 1.) The result is a 1020 KiB metadata area on machines with 4KiB page size (1024 KiB - 4 KiB), and smaller on machines with larger page size. If default_data_alignment was set to 0 (disabled), then DEFAULT_PVMETADATASIZE 255 would take effect, and produce a metadata area that was 188 KiB and pe_start of 192 KiB. This was too small for common use. This is fixed by making the default metadata area size a computed value that matches the value produced by default_data_alignment.	2018-11-26 16:36:50 -06:00
David Teigland	3ae5569570	Add dm-writecache support dm-writecache is used like dm-cache with a standard LV as the cache. $ lvcreate -n main -L 128M -an foo /dev/loop0 $ lvcreate -n fast -L 32M -an foo /dev/pmem0 $ lvconvert --type writecache --cachepool fast foo/main $ lvs -a foo -o+devices LV VG Attr LSize Origin Devices [fast] foo -wi------- 32.00m /dev/pmem0(0) main foo Cwi------- 128.00m [main_wcorig] main_wcorig(0) [main_wcorig] foo -wi------- 128.00m /dev/loop0(0) $ lvchange -ay foo/main $ dmsetup table foo-main_wcorig: 0 262144 linear 7:0 2048 foo-main: 0 262144 writecache p 253:4 253:3 4096 0 foo-fast: 0 65536 linear 259:0 2048 $ lvchange -an foo/main $ lvconvert --splitcache foo/main $ lvs -a foo -o+devices LV VG Attr LSize Devices fast foo -wi------- 32.00m /dev/pmem0(0) main foo -wi------- 128.00m /dev/loop0(0)	2018-11-06 14:18:41 -06:00
David Teigland	cac4a9743a	Allow dm-cache cache device to be standard LV If a single, standard LV is specified as the cache, use it directly instead of converting it into a cache-pool object with two separate LVs (for data and metadata). With a single LV as the cache, lvm will use blocks at the beginning for metadata, and the rest for data. Separate dm linear devices are set up to point at the metadata and data areas of the LV. These dm devs are given to the dm-cache target to use. The single LV cache cannot be resized without recreating it. If the --poolmetadata option is used to specify an LV for metadata, then a cache pool will be created (with separate LVs for data and metadata.) Usage: $ lvcreate -n main -L 128M vg /dev/loop0 $ lvcreate -n fast -L 64M vg /dev/loop1 $ lvs -a vg LV VG Attr LSize Type Devices main vg -wi-a----- 128.00m linear /dev/loop0(0) fast vg -wi-a----- 64.00m linear /dev/loop1(0) $ lvconvert --type cache --cachepool fast vg/main $ lvs -a vg LV VG Attr LSize Origin Pool Type Devices [fast] vg Cwi---C--- 64.00m linear /dev/loop1(0) main vg Cwi---C--- 128.00m [main_corig] [fast] cache main_corig(0) [main_corig] vg owi---C--- 128.00m linear /dev/loop0(0) $ lvchange -ay vg/main $ dmsetup ls vg-fast_cdata (253:4) vg-fast_cmeta (253:5) vg-main_corig (253:6) vg-main (253:24) vg-fast (253:3) $ dmsetup table vg-fast_cdata: 0 98304 linear 253:3 32768 vg-fast_cmeta: 0 32768 linear 253:3 0 vg-main_corig: 0 262144 linear 7:0 2048 vg-main: 0 262144 cache 253:5 253:4 253:6 128 2 metadata2 writethrough mq 0 vg-fast: 0 131072 linear 7:1 2048 $ lvchange -an vg/min $ lvconvert --splitcache vg/main $ lvs -a vg LV VG Attr LSize Type Devices fast vg -wi------- 64.00m linear /dev/loop1(0) main vg -wi------- 128.00m linear /dev/loop0(0)	2018-11-06 13:44:54 -06:00
David Teigland	a686391eca	cache: reorganize cache_set_policy to prepare for future addition	2018-11-06 11:36:29 -06:00
David Teigland	23948e99b3	cache: improve error message about flush	2018-11-06 11:36:29 -06:00
David Teigland	3e547fa952	cache: improve warning message about cached thin data	2018-11-06 11:36:28 -06:00
David Teigland	e26dacf30a	cache: factor getting cache mode so part can be called separately	2018-11-06 11:36:28 -06:00
David Teigland	8d7075528f	cache: add cache_mode_num_to_str Requires only string and number, no specific lv/seg type.	2018-11-06 11:36:28 -06:00
Zdenek Kabelac	70e3d0a613	cov: remove unused assigns	2018-11-05 17:25:11 +01:00
David Teigland	aecf542126	metadata: prevent writing beyond metadata area lvm uses a bcache block size of 128K. A bcache block at the end of the metadata area will overlap the PEs from which LVs are allocated. How much depends on alignments. When lvm reads and writes one of these bcache blocks to update VG metadata, it can also be reading and writing PEs that belong to an LV. If these overlapping PEs are being written to by the LV user (e.g. filesystem) at the same time that lvm is modifying VG metadata in the overlapping bcache block, then the user's updates to the PEs can be lost. This patch is a quick hack to prevent lvm from writing past the end of the metadata area.	2018-10-29 16:53:17 -05:00
Heinz Mauelshagen	8df2dd66ce	Revert "raid: fix left behind SubLVs" This reverts commit `16ae968d24`. We need to come up with a better fix, because we fall short wiping all known signatures when not using the wipe_lv API.	2018-10-25 14:35:56 +02:00
Heinz Mauelshagen	16ae968d24	raid: fix left behind SubLVs lvm metadata writes, commits and activations are performed for (newly) allocated RAID metadata SubLVs to wipe any preexisiting data thus avoid false raid superblock positives on RaidLV activation. This process can be interrupted by command or system crashs thus leaving stale SubLVs in the lvm metadata as a problem. Because we hold an exclusive lock in this metadata SubLV wiping process, we can address this problem by avoiding aforementioned commits/writes/activations altogether wiping the respective first sector of the first physical extent allocated to any metadata SubLV directly via the existing dev_set() API. Succeeds all LVM RAID tests. Related: rhbz1633167	2018-10-24 16:35:30 +02:00
Zdenek Kabelac	fdd76da33d	cov: drop uneeded header files	2018-10-15 17:49:44 +02:00
Zdenek Kabelac	253989ecd9	cov: fix error path Avoid calling 'bad:' section since we have not set 'fd' yet and instead directly return failing 0 value.	2018-10-15 17:49:44 +02:00
Zdenek Kabelac	fbfbbf6d6a	cov: drop check for pointer Pointer must be always set and it's been already dereferenced.	2018-10-15 14:24:28 +02:00
Heinz Mauelshagen	989626926c	lvconvert: allow raid4 -> linear conversion request Allow "lvconvert --type linear RaidLV" on a raid4 LV providing convenient interim steps to convert to linear. Add respective new test lvconvert-raid-takeover-raid4_to_linear.sh and lvconvert-raid-takeover-linear_to_raid4.sh for linear to raid4 once on it.	2018-09-10 18:43:21 +02:00
Heinz Mauelshagen	e2e30a64ab	lvconvert: fix interim segtype regression on raid6 conversions When converting from striped/raid0/raid0_meta to raid6 with > 2 stripes, allow possible direct conversion (to raid6_n_6). In case of 2 stripes, first convert to raid5_n to restripe to at least 3 data stripes (the raid6 minimum in lvm2) in a second conversion before finally converting to raid6_n_6. As before, raid6_n_6 then can be converted to any other raid6 layout. Enhance lvconvert-raid-takeover.sh to test the 2 stripes conversions to raid6. Resolves: rhbz1624038	2018-09-07 13:48:19 +02:00
Heinz Mauelshagen	22a1304368	lvconvert: avoid superfluous interim raid type When converting striped/raid0*/raid6_n_6 <-> raid4, avoid superfluous interim raid5_n layout. Related: rhbz1447809	2018-08-31 19:04:19 +02:00
Heinz Mauelshagen	e83c4f07ca	lvconvert: fix conversion attempts to linear "lvconvert --type linear RaidLV" on striped and raid4/5/6/10 have to provide the convenient interim layouts. Fix involves a cleanup to the convenience type function. As a result of testing, add missing sync waits to lvconvert-raid-reshape-linear_to_raid6-single-type.sh. Resolves: rhbz1447809	2018-08-22 17:12:43 +02:00
Heinz Mauelshagen	4578411633	lvconvert: fix regression preventing direct striped conversion Conversion to striped from raid0/raid0_meta is directly possible. Fix a regression setting superfluous interim raid5_n conversion type introduced by commit `bd7cdd0b09`. Add new test script lvconvert-raid0-striped.sh. Resolves: rhbz1608067	2018-08-21 17:28:56 +02:00
Zdenek Kabelac	acab591378	mirror: fix splitmirrors for mirror type With improved mirror activation code --splitmirror issue poppedup since there was missing proper preload code and deactivation for splitted mirror leg.	2018-08-07 17:58:30 +02:00
Zdenek Kabelac	c34291e3bf	cache: drop metadata_format validation Allow to use any combination of cache metadata format for policy.	2018-08-07 17:57:00 +02:00
David Teigland	778ce8d808	lvconvert: improve text about splitmirrors in messages and man page.	2018-07-23 12:28:48 -05:00
David Teigland	117160b27e	Remove lvmetad Native disk scanning is now both reduced and async/parallel, which makes it comparable in performance (and often faster) when compared to lvm using lvmetad. Autoactivation now uses local temp files to record online PVs, and no longer requires lvmetad. There should be no apparent command-level change in behavior.	2018-07-11 11:26:42 -05:00
Zdenek Kabelac	12213445b5	vgchange: vdo support Support vgchange usage with VDO segtype. Also changing extent size need small update for vdo virtual extent. TODO: API needs enhancements so it's not about adding ifs() everywhere.	2018-07-09 15:29:16 +02:00
Zdenek Kabelac	c58733ca15	lvcreate: vdo support Supports basic: 'lvcreate --vdo -LXXXG -VYYYG vg/vdoname -n lvname' Allows to create basic VDO pool volume and virtual VDO volume.	2018-07-09 15:29:12 +02:00
Zdenek Kabelac	6945bbdbc6	lvresize: vdo support Unsupported ATM. Wait till VDO kernel target starts to use updated resize sequence, LOAD, SUSPEND, RESUME.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	44c99a8822	vdo: data percentage Display percentage of used virtual size of vdo-pool volume.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	5807993bbf	display: basic vdo segment lvdisplay and lvs support Print some basic info about vdo segment. 'lvdisplay -m' ATM shows the most. lvs shows usage percentage.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	493ffe7a0f	lv_manip: layout and role support for vdo segment	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	00990ed53e	check_lv_segment: internal vdo segment validation Check if settings for vdo segment are correct.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	0dafd159a8	vdo_manip: parsing status of VDO device	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	aa63dfbe39	vdo: support functions to map enums to string names Translate VDO enums to printable strings.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	aff69ecf39	vdo: component activation of VDO data LV Allow component activation of VDO data LV.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	4b7a57c9ed	vdo: with created names use vpool When user create vdo-pool - use different automatic name. So unlike with traditional LVs using lvol0, lvol1 use vpool0, vpool1... TODO: apply similar for thin-pool & cache-pool...	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	a8f84f7801	vdo: introduce segment types and manip functions Core functionality introducing lvm VDO support.	2018-07-09 15:28:35 +02:00
Zdenek Kabelac	e9d1f676b3	allocation: add check for passing log allocation Updates previous commit.	2018-07-09 00:59:34 +02:00
Zdenek Kabelac	6d1c983122	cleanup: use last_seg More readable code.	2018-07-09 00:23:35 +02:00
Zdenek Kabelac	b697aa9646	allocator: fix thin-pool allocation When allocating thin-pool with more then 1 device - try to allocate 'metadataLV' with reuse of log-type allocation for mirror LV. It should be naturally place on other device then 'dataLV'. However due to somewhat hard to follow allocation logic code, it's been rejected allocation in cases where there was not enough space for data or metadata on single PV, thus to successed, usage of segments was mandatory. While user may use: allocation/thin_pool_metadata_require_separate_pvs=1 to enforce separe meta and data LV - on default settings, this is not enable thus segment allocation is meant to work. NOTE: As already said - the original intention of this whole 'if()' is unclear, so try to split this test into multiple more simple tests that are more readable. TODO: more validation.	2018-07-09 00:19:30 +02:00
Zdenek Kabelac	f2b856c994	lv_manip: do not check extents for any virtual target Allow creation of any virtual segment type with just --virtualsize specified without any real extent size give. TODO: likely --type error,zero might be later enhanced to use -V (along with -L) - but since those targets do not allocate real space, supporting -V makes sense with them.	2018-07-02 10:24:23 +02:00
Zdenek Kabelac	2bb9627d01	lv_manip: add name of failing LV into error message	2018-07-02 10:24:23 +02:00
Zdenek Kabelac	cea88a9e4e	lv_manip: use vgmem pool Switch to vgmem pool for allocation associated with modification of particular VG.	2018-06-25 15:07:55 +02:00
Zdenek Kabelac	357e9f9572	cache: use new api function	2018-06-25 15:07:55 +02:00
Zdenek Kabelac	9c0d92d957	lv_manip: add new internal api function	2018-06-25 15:07:55 +02:00
Zdenek Kabelac	8949903fbb	cache: set areas count prior using it Set correct counter, so it's not failing on internal error check.	2018-06-25 15:07:32 +02:00
Zdenek Kabelac	106ee05ba0	lv_manip: add extra internal error Catch error early, when trying to store data into non-allocated area.	2018-06-22 23:37:02 +02:00
David Teigland	e166d2b14c	lvmlockd: fix another missing lock_type null check Same as `347c807f8`.	2018-06-21 09:24:51 -05:00
David Teigland	428514a07f	Drop --ignoreskippedcluster option It's no longer needed. Clustered VGs are now handled in the same way as foreign VGs, and as shared VGs that can't be accessed: - A command processing all VGs sees a clustered VG, prints a message ("Skipping clustered VG foo."), skips it, and does not fail. - A command where the clustered VG is explicitly named on the command line, prints a message and fails. "Cannot access clustered VG foo, see lvmlockd(8)." The option is listed in the set of ignored options for the commands that previously accepted it. (Removing it entirely would cause commands/scripts to fail if they set it.)	2018-06-15 15:59:34 -05:00
David Teigland	8eab37593e	Add cmd arg to more functions so that it can be used in the filter code	2018-06-15 11:03:55 -05:00
David Teigland	e53cfc6a88	lvmlockd: update method for changing clustered VG The previous method for forcibly changing a clustered VG to a local VG involved using -cn and locking_type 0. Since those options are deprecated, replace it with the same command used for other forced lock type changes: vgchange --locktype none --lockopt force.	2018-06-13 15:30:28 -05:00
David Teigland	17f5572bc9	Remove independent metadata areas in which metadata is stored in files on the local fs instead of on PVs.	2018-06-13 12:25:19 -05:00
David Teigland	981a3ba98e	Clean up repair and result values in vg_read Fix the confusing mix of input and output values in the single variable.	2018-06-12 11:08:26 -05:00
David Teigland	9a8c36b891	Fix use of orphan lock in commands vgreduce, vgremove and vgcfgrestore were acquiring the orphan lock in the midst of command processing instead of at the start of the command. (The orphan lock moved to being acquired at the start of the command back when pvcreate/vgcreate/vgextend were reworked based on pvcreate_each_device.) vgsplit also needed a small update to avoid reacquiring a VG lock that it already held (for the new VG name).	2018-06-12 09:46:11 -05:00
David Teigland	c4153a8dfc	Remove checking for locked VGs A few places were calling a function to check if a VG lock was held. The only place it was actually needed is for pvcreate which wants to do its own locking (and scanning) around process_each_pv. The locking/scanning exceptions for pvcreate in process_each_pv/vg_read can be enabled by just passing a couple of flags instead of checking if the VG is already locked. This also means that these special cases won't be enabled unknowingly in other places where they shouldn't be used.	2018-06-12 09:46:04 -05:00
David Teigland	3b6b7f8f9b	lvmlockd: skip repair lock upgrade for non shared vgs Only attempt lvmlockd lock upgrade for shared VGs.	2018-06-12 09:44:05 -05:00
Zdenek Kabelac	77d5caae90	snapshot: improve checking of merging snapshot Add runtime detection for 'lvs -o+seg_monitor' and 'vgchange --monitor'. This fix should avoid unnecessary timeout on systemd shutdown.	2018-06-11 22:25:42 +02:00
David Teigland	a8759dc7a6	Remove unused cache management from locking This code was for managing lvmcache for clvm and it no longer does anything.	2018-06-08 12:30:43 -05:00
David Teigland	669b1295ae	Remove header declarations for removed functions	2018-06-08 10:01:05 -05:00
David Teigland	73b7e6fde7	Remove more code that was only used by liblvm2app	2018-06-08 09:29:11 -05:00
Joe Thornber	7c4b19c335	Merge branch '2018-06-04-data-structs'	2018-06-08 14:21:07 +01:00
Joe Thornber	d5da55ed85	device_mapper: remove dbg_malloc. I wrote dbg_malloc before we had valgrind. These days there's just no need.	2018-06-08 13:40:53 +01:00
Zdenek Kabelac	5cb4b2a424	cache: cleaner policy also uses fmt2 Format 2 is also with cleaner policy.	2018-06-08 14:37:29 +02:00
Zdenek Kabelac	fb171edd45	pvresize: add missing return Log error path missed return 0. Also fix some unneded bactraces (since log_error already shows position).	2018-06-08 14:36:56 +02:00
Joe Thornber	286c1ba336	device_mapper: rename libdevmapper.h -> all.h I'm paranoid a file will include the global one in /usr/include by accident.	2018-06-08 12:31:45 +01:00
David Teigland	18259d5559	Remove unused clvm variations for active LVs Different flavors of activate_lv() and lv_is_active() which are meaningful in a clustered VG can be eliminated and replaced with whatever that flavor already falls back to in a local VG. e.g. lv_is_active_exclusive_locally() is distinct from lv_is_active() in a clustered VG, but in a local VG they are equivalent. So, all instances of the variant are replaced with the basic local equivalent. For local VGs, the same behavior remains as before. For shared VGs, lvmlockd was written with the explicit requirement of local behavior from these functions (lvmlockd requires locking_type 1), so the behavior in shared VGs also remains the same.	2018-06-07 16:17:04 +01:00
David Teigland	e4d9099e19	Remove more clvm code	2018-06-07 16:17:04 +01:00
David Teigland	d154dd6638	lvmlockd: fix missing lock_type null check Missed checking if vg->lock_type is NULL in commit `db8d3bdfa`: lvmlockd: enable mirror split and merge with dlm lock_type	2018-06-07 16:17:04 +01:00
David Teigland	3e781ea446	Remove clvmd and associated code More code reduction and simplification can follow.	2018-06-05 11:09:13 -05:00
Heinz Mauelshagen	bd7cdd0b09	lvconvert: support linear <-> striped convenience conversions "lvconvert --type {linear\|striped\|raid} ..." on a striped/linear LV provides convenience interim type to convert to the requested final layout similar to the given raid <-> raid* conveninece types. Whilst on it, add missing raid5_n convenince type from raid5* to raid10. Resolves: rhbz1439925 Resolves: rhbz1447809 Resolves: rhbz1573255	2018-06-05 16:23:18 +02:00
Heinz Mauelshagen	de66704253	segtype: add linear Add linear segtype addressing FIXME in preparation for linear <-> striped convenience conversion support	2018-06-05 16:23:18 +02:00
Zdenek Kabelac	1140d70893	build: fixes	2018-06-04 12:28:13 +02:00
Zdenek Kabelac	6a1f458bb7	build: compile fixes	2018-06-01 21:12:31 +02:00
David Teigland	09177b53dd	lvmlockd: clarify lock_type use for coverity Make it clearer when vg->lock_type will be used so coverity doesn't worry about it.	2018-06-01 13:15:22 -05:00
David Teigland	b6f0f20da2	lvmlockd: primarily use vg_is_shared to check if a vg uses an lvmlockd lock_type, instead of the equivalent but longer is_lockd_type.	2018-06-01 13:15:22 -05:00
Joe Thornber	dbba1e9b93	Merge branch 'master' into 2018-05-11-fork-libdm	2018-06-01 13:04:12 +01:00
David Teigland	b9c1cef817	lvmlockd: fix reverting new lv in error path The wrong name was being used to free the LV lock in lvmlockd in the error exit path.	2018-05-31 15:35:48 -05:00
David Teigland	fdaa7e2e87	vgs: add report field for shared equivalent to a non-empty -o locktype.	2018-05-31 10:23:03 -05:00
David Teigland	c516321325	lvmlockd: enable lvcreate of new LV plus existing cache pool In this command, lvcreate creates a new LV and then combines it with an existing cache pool, producing a cache LV. This command was previously not allowed in in a shared VG.	2018-05-30 15:24:24 -05:00
David Teigland	6cd0523337	lvmlockd: enable repairing shared VG while reading it When the lvmlockd lock is shared, upgrade it to ex when repair (writing) is needed during vg_read. Pass the lockd state through additional read-related functions so the instances of repair scattered through vg_read can be handled. (Temporary solution until the ad hoc repairs can be pulled out of vg_read into a top level, centralized repair function.)	2018-05-30 12:56:46 -05:00
David Teigland	948f2d9979	lvmlockd: enable lvcreate of thin pool and thin lv in one command Previously, thin pools and thin lvs need needed to be created with separate commands, now the combined command is permitted.	2018-05-30 09:25:45 -05:00
David Teigland	db8d3bdfa9	lvmlockd: enable mirror split and merge with dlm lock_type	2018-05-30 09:25:45 -05:00
David Teigland	0253f5a21d	fix id_write_format on non-uuid string orphan vgs using the vgname "#orphans" as the vgid, and valgrind complains about calling id_write_format on that invalid uuid.	2018-05-18 13:41:20 -05:00
David Teigland	286c9c78b4	liblvm2app: fix valgrind memory warning	2018-05-17 15:18:11 -05:00
Rick Elrod	8c453e2e5e	cleanup: fix grammar in output - less then -> less than This minor patch fixes grammar in a few messages which get printed to users. It also fixes the same grammar mistake in several comments. Signed-off-by: Rick Elrod <relrod@redhat.com> --	2018-05-17 10:37:45 +02:00
David Teigland	28d35e5c59	scan: fix missing close in lib lib was using dev_test_excl which wasn't closing the device. Switch code to new io layer with excl open. Also use exclusive open in some other places.	2018-05-16 14:48:30 -05:00
Joe Thornber	89fdc0b588	Merge branch 'master' into 2018-05-11-fork-libdm	2018-05-16 13:43:02 +01:00
Joe Thornber	ccc35e2647	device-mapper: Fork libdm internally. The device-mapper directory now holds a copy of libdm source. At the moment this code is identical to libdm. Over time code will migrate out to appropriate places (see doc/refactoring.txt). The libdm directory still exists, and contains the source for the libdevmapper shared library, which we will continue to ship (though not neccessarily update). All code using libdm should now use the version in device-mapper.	2018-05-16 13:00:50 +01:00
Joe Thornber	7f97c7ea9a	build: Don't generate symlinks in include/ dir As we start refactoring the code to break dependencies (see doc/refactoring.txt), I want us to use full paths in the includes (eg, #include "base/data-struct/list.h"). This makes it more obvious when we're breaking abstraction boundaries, eg, including a file in metadata/ from base/	2018-05-14 10:30:20 +01:00
David Teigland	5c9dcd99fd	scan: remove unused args from label_read	2018-05-11 14:16:49 -05:00
David Teigland	bbb8040456	dev_cache: drop open_list devices are now held open only in bcache, so drop the dev_cache list of open devices which is unused.	2018-05-11 12:47:56 -05:00
David Teigland	9ad42e5f06	io: write log header with bcache	2018-05-10 16:25:33 -05:00
David Teigland	57bb46c5e7	filter: use bcache for filter reads Filters are still applied before any device reading or the label scan, but any filter checks that want to read the device are skipped and the device is flagged. After bcache is populated, but before lvm looks for devices (i.e. before label scan), the filters are reapplied to the devices that were flagged above. The filters will then find the data they need in bcache.	2018-05-10 16:03:19 -05:00
Joe Thornber	39ce38eb88	label/lv_manip: squash some warnings	2018-05-10 15:14:39 +01:00
David Teigland	9a5bd01b0c	io: replace dev_set with bcache equivalents	2018-05-09 11:29:52 -05:00
David Teigland	c016b573ee	clvmd: separate saved_vg from vginfo The clvmd saved_vg data is independent from the normal lvm lvmcache vginfo data, so separate saved_vg from vginfo. Normal lvm doesn't need to use save_vg at all, and in clvmd, lvmcache changes on vginfo can be made without worrying about unwanted effects on saved_vg.	2018-05-03 14:54:48 -05:00
Heinz Mauelshagen	88fe07ad0a	raid: use new internal APIs Use APIs introduced with commit `4ebfd8e8eb` where appropriate to minimize redundant code.	2018-05-03 21:36:50 +02:00
Heinz Mauelshagen	4ebfd8e8eb	lvconvert: don't return success on degraded -m raid1 conversion In case "lvconvert -mN RaidLV" was used on a degraded raid1 LV, success was returned instead of an error. Provide message to inform about the need to repair first before changing number of mirrors and exit with error. Add new lvconvert-m-raid1-degraded.sh test. Resolves: rhbz1573960	2018-05-03 18:48:00 +02:00
David Teigland	c1cd18f21e	Remove lvm1 and pool disk formats There are likely more bits of code that can be removed, e.g. lvm1/pool-specific bits of code that were identified using FMT flags. The vgconvert command can likely be reduced further. The lvm1-specific config settings should probably have some other fields set for proper deprecation.	2018-04-30 16:55:02 -05:00
David Teigland	029a76b4f8	clvmd: don't repair vg from vg_read in clvmd The mixed up vg repair code in vg_read was trying to repair a vg when vg_read was called by clvmd. The clvmd daemon isn't supposed to be repairing or writing a vg. (This is a temporary workaround; vg repair will soon be pulled out of vg_read so it can be called in a controlled way and consolidated instead of spread around.)	2018-04-30 15:56:51 -05:00
Joe Thornber	65d6118e47	[metadata-liblvm.c] comment out some dead code and add a FIXME	2018-04-30 09:45:39 +01:00
David Teigland	5b6e62dc1f	clvmd: drop old saved_vg when returning new saved_vg In some pvmove tests, clvmd uses the new (precommitted) saved_vg, but then requests the old saved_vg, and expects that the new saved_vg be returned instead of the old. So, when returning the new saved_vg, forget the old one so we don't return it again.	2018-04-26 14:57:45 -05:00
David Teigland	47bfac21ca	clvmd: skip dev rescan after full scan When clvmd does a full label scan just prior to calling _vg_read(), pass a new flag into _vg_read to indicate that the normal rescan of VG devs is not needed.	2018-04-25 16:39:43 -05:00
David Teigland	1fec86571f	clvmd: reuse a vg struct for sequential LV operations After reading a VG, stash it in lvmcache as "saved_vg". Before reading the VG again, try to use the saved_vg. The saved_vg is dropped on VG lock operations.	2018-04-25 16:39:43 -05:00
Zdenek Kabelac	c492fbb51c	debug: more explanatory error message	2018-04-23 22:42:18 +02:00
David Teigland	1409c4a1c2	clvm: rescan when VG or PV not found Rescan devices to update lvmcache content when clvmd vg_read doesn't find a VG or PV.	2018-04-20 16:09:49 -05:00
David Teigland	aee27dc7ba	scan: skip device rescan in vg_read For reporting commands (pvs,vgs,lvs,pvdisplay,vgdisplay,lvdisplay) we do not need to repeat the label scan of devices in vg_read if they all had matching metadata in the initial label scan. The data read by label scan can just be reused for the vg_read. This cuts the amount of device i/o in half, from two reads of each device to one. We have to be careful to avoid repairing the VG if we've skipped rescanning. (The VG repair code is very poor, and will be redone soon.)	2018-04-20 11:23:14 -05:00
David Teigland	9b6a62f944	lvmcache: simplify Recent changes allow some major simplification of the way lvmcache works and is used. lvmcache_label_scan is now called in a controlled fashion at the start of commands, and not via various unpredictable side effects. Remove various calls to it from other places. lvmcache_label_scan should not be called from anywhere during a command, because it produces an incorrect representation of PVs with no MDAs, and misclassifies them as orphans. This has been a long standing problem. The invalid flag and rescanning based on that is no longer used and removed. The 'force' variation is no longer needed and removed.	2018-04-20 11:22:48 -05:00
David Teigland	a9b0aa5c17	lvmetad: more fixes related to bcache Need to open devs prior to bcache io.	2018-04-20 11:22:48 -05:00
David Teigland	ddb5de7a98	clvm: fix bcache scan handling We can't let clvmd keep all scanned devs open, which prevents them from being removed. So drop the bcache data (and close fds) affter doing a label scan. Also set up bcache before the clvm-specific vg_read (which needs to rescan the vg's devs using bcache) and destroy the bcache after.	2018-04-20 11:22:48 -05:00
David Teigland	e49b114f7e	bcache: use wrappers for bcache read write in lvm Using a wrapper makes it easier to disable bcache if needed.	2018-04-20 11:22:47 -05:00
David Teigland	8065492046	bcache: do all writes through bcache	2018-04-20 11:22:47 -05:00
David Teigland	37471bb477	scan: skip extra scan in vg_read Drop an extra label scan in the recovery part of vg_read. This is a temporary improvement until the pending replacement for the broken recovery code burried in vg_read.	2018-04-20 11:22:46 -05:00
David Teigland	6c67c7557c	scan: use separate fd for bcache Create a new dev->bcache_fd that the scanning code owns and is in charge of opening/closing. This prevents other parts of lvm code (which do various open/close) from interfering with the bcache fd. A number of dev_open and dev_close are removed from the reading path since the read path now uses the bcache. With that in place, open(O_EXCL) for pvcreate/pvremove can then be fixed. That wouldn't work previously because of other open fds.	2018-04-20 11:22:46 -05:00
David Teigland	d9a77e8bb4	lvmcache: simplify metadata cache The copy of VG metadata stored in lvmcache was not being used in general. It pretended to be a generic VG metadata cache, but was not being used except for clvmd activation. There it was used to avoid reading from disk while devices were suspended, i.e. in resume. This removes the code that attempted to make this look like a generic metadata cache, and replaces with with something narrowly targetted to what it's actually used for. This is a way of passing the VG from suspend to resume in clvmd. Since in the case of clvmd one caller can't simply pass the same VG to both suspend and resume, suspend needs to stash the VG somewhere that resume can grab it from. (resume doesn't want to read it from disk since devices are suspended.) The lvmcache vginfo struct is used as a convenient place to stash the VG to pass it from suspend to resume, even though it isn't related to the lvmcache or vginfo. These suspended_vg* vginfo fields should not be used or touched anywhere else, they are only to be used for passing the VG data from suspend to resume in clvmd. The VG data being passed between suspend and resume is never modified, and will only exist in the brief period between suspend and resume in clvmd. suspend has both old (current) and new (precommitted) copies of the VG metadata. It stashes both of these in the vginfo prior to suspending devices. When vg_commit is successful, it sets a flag in vginfo as before, signaling the transition from old to new metadata. resume grabs the VG stashed by suspend. If the vg_commit happened, it grabs the new VG, and if the vg_commit didn't happen it grabs the old VG. The VG is then used to resume LVs. This isolates clvmd-specific code and usage from the normal lvm vg_read code, making the code simpler and the behavior easier to verify. Sequence of operations: - lv_suspend() has both vg_old and vg_new and stashes a copy of each onto the vginfo: lvmcache_save_suspended_vg(vg_old); lvmcache_save_suspended_vg(vg_new); - vg_commit() happens, which causes all clvmd instances to call lvmcache_commit_metadata(vg). A flag is set in the vginfo indicating the transition from the old to new VG: vginfo->suspended_vg_committed = 1; - lv_resume() needs either vg_old or vg_new to use in resuming LVs. It doesn't want to read the VG from disk since devices are suspended, so it gets the VG stashed by lv_suspend: vg = lvmcache_get_suspended_vg(vgid); If the vg_commit did not happen, suspended_vg_committed will not be set, and in this case, lvmcache_get_suspended_vg() will return the old VG instead of the new VG, and it will resume LVs based on the old metadata.	2018-04-20 11:22:45 -05:00
David Teigland	79c4971210	label_scan: remove extra label scan and read for orphan PVs When process_each_pv() calls vg_read() on the orphan VG, the internal implementation was doing an unnecessary lvmcache_label_scan() and two unnecessary label_read() calls on each orphan. Some of those unnecessary label scans/reads would sometimes be skipped due to caching, but the code was always doing at least one unnecessary read on each orphan. The common format_text case was also unecessarily calling into the format-specific pv_read() function which actually did nothing. By analyzing each case in which vg_read() was being called on the orphan VG, we can say that all of the label scans/reads in vg_read_orphans are unnecessary: 1. reporting commands: the information saved in lvmcache by the original label scan can be reported. There is no advantage to repeating the label scan on the orphans a second time before reporting it. 2. pvcreate/vgcreate/vgextend: these all share a common implementation in pvcreate_each_device(). That function already rescans labels after acquiring the orphan VG lock, which ensures that the command is using valid lvmcache information.	2018-04-20 11:22:45 -05:00
David Teigland	748f29b42a	scan: do scanning at the start of a command Move the location of scans to make it clearer and avoid unnecessary repeated scanning. There should be one scan at the start of a command which is then used through the rest of command processing. Previously, the initial label scan was called as a side effect from various utility functions. This would lead to it being called unnecessarily. It is an expensive operation, and should only be called when necessary. Also, this is a primary step in the function of the command, and as such it should be called prominently at the top level of command processing, not as a hidden side effect of a utility function. lvm knows exactly where and when the label scan needs to be done. Because of this, move the label scan calls from the internal functions to the top level of processing. Other specific instances of lvmcache_label_scan() are still called unnecessarily or unclearly by specific commands that do not use the common process_each functions. These will be improved in future commits. During the processing phase, rescanning labels for devices in a VG needs to be done after the VG lock is acquired in case things have changed since the initial label scan. This was being done by way of rescanning devices that had the INVALID flag set in lvmcache. This usually approximated the right set of devices, but it was not exact, and obfuscated the real requirement. Correct this by using a new function that rescans the devices in the VG: lvmcache_label_rescan_vg(). Apart from being inexact, the rescanning was extremely well hidden. _vg_read() would call ->create_instance(), _text_create_text_instance(), _create_vg_text_instance() which would call lvmcache_label_scan() which would call _scan_invalid() which repeats the label scan on devices flagged INVALID. lvmcache_label_rescan_vg() is now called prominently by _vg_read() directly.	2018-04-20 11:21:38 -05:00
David Teigland	4507ba3596	scan: use new label_scan for lvmcache_label_scan To do label scanning, lvm code calls lvmcache_label_scan(). Change lvmcache_label_scan() to use the new label_scan() based on bcache. Also add lvmcache_label_rescan_vg() which calls the new label_scan_devs() which does label scanning on only the specified devices. This is for a subsequent commit and is not yet used.	2018-04-20 11:19:32 -05:00
David Teigland	a7cb76ae94	scan: use bcache for label scan and vg read New label_scan function populates bcache for each device on the system. The two read paths are updated to get data from bcache. The bcache is not yet used for writing. bcache blocks for a device are invalidated when the device is written.	2018-04-20 11:19:24 -05:00
Joe Thornber	00f1b208a1	[io paths] Unpick agk's aio stuff	2018-04-20 11:03:58 -05:00
Zdenek Kabelac	73cda0437f	cleanup: correcting macro wrapping Use proper do {} while(0) so ';' after macros are correctly interpretted..	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	9731d48691	cleanup: enhance debug message	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	d437bd86ff	cleanup: display_lvname update message Add more display_lvname usage. Update some error messages. Indent.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	7323557379	cleanup: add _mb_ to regiosize option Just like with others mentions default unit in function name.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	27a1a0e5c0	cleanup: reorder condition There is no point to wait for sync for non-locally active LV.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	d81e3f9b06	mirror: use vg mempool Use vg mempool with mirror log metadata update.	2018-04-20 12:16:14 +02:00
Zdenek Kabelac	05f954ee9b	mirror: checking for mirror segtype Checking more correctly for mirror segtype here instead of mirrored one which can be also 'raid'.	2018-04-20 12:16:14 +02:00
Zdenek Kabelac	79d214032b	mirror: validate region_size for mirrors Check for region size properties of mirror segments.	2018-04-20 12:16:13 +02:00
Zdenek Kabelac	1693fef529	mirror: properly reload table for log init Since mirror can be stacked, we need to properly reload whole table stack, otherwice we may mishandle devices in dm table.	2018-04-20 12:15:36 +02:00
Zdenek Kabelac	66400d003d	mirror: fix region_size for clustered VG When adjusting region size for clustered VG it always needs to fit 2 full bitset into 1MB due to old limits of CPG. This is relatively big amount of bits, but we have still limitation for region size to fit into 32bits (0x8000000). So for too big mirrors this operation needs to fail - so whenever function returns now 0, it means we can't find matching region_size. Since return 0 is now 'error' we need to also pass proper region_size when creating pvmove mirror.	2018-04-20 12:13:48 +02:00
Zdenek Kabelac	a19456b868	mirror: fix calcs for maximal region_size Since extent_size is no longer power_of_2 this max region size evalution was rather producing random bitsize as a combination of lowest bit from number of extents and extent size itself. Correct calculation to use whole LV size and pick biggest possible power of 2 value smaller then UINT32_MAX.	2018-04-20 12:13:08 +02:00
Zdenek Kabelac	91965af9b1	mirror: improve mirror log size estimation Drop mirrored mirror log limitation that applies only in very limited use-case and actually mirrored mirror log is deprecated anyway. So 'disk' mirror log is selecting the correct minimal size, and bigger size is only enforced with real mirrored mirror log. Also for mirrored mirror log we let use 'smalled' region size if needed so if user uses 1G region size, we still keep small mirror log with much smaller region size in this case when needed. Also mirror log extent calculation is now properly detecting error with too big mirrors where previosly trimmed uint32_t was applies unintentionally.	2018-04-20 12:11:42 +02:00
Zdenek Kabelac	73189170f5	mirror: fix 32bit size calculation On 32bit arch size_t remains 4-byte wide - so size can't get correct result for multiplication of 32bit numbers.	2018-04-20 12:08:57 +02:00
Zdenek Kabelac	8d7ece126b	cache: disallow to combine format 2 with mq Only policy 'smq' is meant to be used with format version 2. Code used to let pass 'mq' policy also with format 2. But 'mq' is obsoloted wth smq and kernel currently matches it. But this is incompatible with older original mq logic - so disallow creation of this rather useless combination.	2018-03-19 12:02:08 +01:00
Heinz Mauelshagen	d68d71013f	lvcreate: remove RaidLV on creation failure In case a newly created RaidLV is blacklisted using config \"activation { volume list = [ ... ] }\" (i.e. its SubLVs stay inactive), the metadata SubLVs can't get wiped thus failing the creation. As a result, the RaidLV together with its SubLVs is left behind in an inconsistent state. Fix by removing the RaidLV and provide a hint about volume_list reasoning. Resolves: rhbz1161347	2018-03-16 15:57:53 +01:00
Zdenek Kabelac	285413b502	cleanup: missing dots and indent	2018-03-15 11:01:04 +01:00
Zdenek Kabelac	d794444715	activation: check for prioritized_section Detect we are in prioritezed section instead of critical one, since these operation were supposed to NOT be happining during whole set of operation. This patch fixes verification of udev operations.	2018-03-15 11:01:04 +01:00
Zdenek Kabelac	29b2cfba06	mirror: correct locking for mirror log initialization The code was not acking proper lock holding LVs when trying to initialize mirror log to predefined values.	2018-03-13 12:58:27 +01:00
Zdenek Kabelac	e095586d9e	cleanup: use path on stack	2018-03-13 12:57:08 +01:00
Heinz Mauelshagen	dd88a0f05c	raid: support raid5_n convenience type on conversion to raid10 Fix requesting a conversion on raid5_{ls,rs,la,ra} -> raid10 not offering offering interim convenience type raid5_n. Resolves: rhbz1468600	2018-03-09 21:23:16 +01:00
Zdenek Kabelac	ee37838b11	cache: fix lock usage for cache conversion Just like with lvcreate, this lvconvert case also need to properly check which LV actually holds lock for cached origin - as it might be i.e. thin-pool tdata subLV.	2018-03-08 10:39:47 +01:00
Zdenek Kabelac	6134a71a90	lvconvert: support for convertsion with active component devices If componet devices could be activated alone, ensure they are not breaking common commands. TODO: mostly likely this is not a definite list of all needed checks and more will come later.	2018-03-06 15:42:07 +01:00
Zdenek Kabelac	f92b6f9930	lvremove: ensure no subLV is active Since component activation is going to be enabled, enusure, no subLV is active when we deactivate LV.	2018-03-06 15:42:07 +01:00
Zdenek Kabelac	73e93ef5e5	lvremove: validate removed component LV is not active This is the 'last' place where a LV is present in metadata. Any removed device should not be left active in dm table. So this check is an extra validation protection to capture any forgotten deactivation (adding 1 extra ioctl into lvremove path)	2018-03-06 15:42:07 +01:00
Zdenek Kabelac	ca9cbd92c4	activation: add base lv component function Introduce: lv_is_component() check is LV is actually a component device. lv_component_is_active() checking if any component device is active. lv_holder_is_active() is any component holding device is active.	2018-03-06 15:42:05 +01:00
Zdenek Kabelac	6481471c9d	debug: update comment	2018-03-06 15:40:34 +01:00
Zdenek Kabelac	f04abd1f8a	lvremove: drop duplicate check for active LV Since this code branch already tested LV is active, avoid repeating same query.	2018-03-06 15:40:31 +01:00
Zdenek Kabelac	b2f1254c14	raid: move VG update after archiving happened Update of LV le_count needs to happen after archive().	2018-03-06 15:38:15 +01:00
Zdenek Kabelac	406d6de651	cleanup: indent	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	16c209c613	cleanup: use lv_is_used_cache_pool Use lv_is_used_cache_pool() to simplify the code. Function was introduced later and this code missed to use it.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	6ba94fdd81	debug: change message severity Although it's internal issue - in this case command continue without any reported error - thus hide this internal error into debug.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	052f28746d	lvresize: check external origin with new size Instead of checking with existing size of external origin LV, use correctly the new 'wanted' size of this LV whether it fits the limitiation requirements for older thin-pool target. Otherwise code started to the the resize, updates metadata and just fails during 'resize' in case the LV was active. For inactive LV operation could have actually passed.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	b09ea3b6f7	lvremove: drop unneded check Checking here for cache_pool is not necessary and in effect the check is not even right - since there are internal states that do allow to active such LV.	2018-02-28 21:08:40 +01:00
Zdenek Kabelac	bc1adc32cb	lv_manip: enhance for_each_sub_lv Fix missing 'externalLV' traversing for thins with external origins. Replace extra for_each_sub_lv_except_pools() with better internal logic allowing selectively to cut of processed subLV tree. Extend error code for function 'fn()' when it returns -1 it will stop futher tree scan for given LV. Also a bit simplify code to have only one place that is calling 'fn()' and use level counter to know depth of traversing. Update renaming travering to skip trees for pools and external origins.	2018-02-28 21:08:38 +01:00
Zdenek Kabelac	a1195aaa66	cleanup: add missing WARNING ATM log_warn() is supposed to be used with WARNING: prefix.	2018-02-15 13:52:02 +01:00
Marian Csontos	d67f160200	mirror: Add deprecation warning for mirrored log	2018-02-14 13:32:04 +01:00
Zdenek Kabelac	e113df129e	cleanup: decode dso path just once Build dso plugin name during segtype initialisation and just use the string during command life-time. Also slightlt update message verbosity and make it very_verbose when operation is going to be made and 'verbose' when it's done.	2018-02-12 22:15:03 +01:00
Zdenek Kabelac	d90a647802	activation: separate reporting of error and monitoring status Avoid using same return code for reporting 2 different things and stricly report error code by return value and add new parameter for reporting monitoring status. This makes easier to recognize which error we got from dm_event and continue only with ENOENT.	2018-02-12 22:14:59 +01:00
Alasdair G Kergon	9194610f42	device: Add ioflags parameter to transfer additional state. Flags are set on the initial I/O and passed to any callbacks that may in turn issue further I/O using the inherited flags.	2018-01-21 21:10:23 +00:00
Zdenek Kabelac	e86910b052	lvconvert: use excl activation for conversion Use properly exclusive activation when reactivating origin after snapshot merge (since origin must have been previously also exlusively activated). Same applies when converting volumes to thin-pool or cache. Previously used 'only' local activation incorrectly allowed local activation of some targets (i.e. raid) - thus 'leaking' chance to activate same device on another node - which can be a problem for device types like raid.	2018-01-17 14:43:34 +01:00
Alasdair G Kergon	35cdd9cf48	label: Clean up storing of device and label sector. No longer use the external 'result' pointer internally to set up the cached label. The callback _set_label_read_result() is now given the internal label pointer directly Callers that don't need the result are no longer required to pass a label pointer into label_read().	2018-01-11 02:54:00 +00:00
Alasdair G Kergon	bacc942333	allocation: Avoid exceeding array bounds in allocation tag code If _limit_to_one_area_per_tag() changes nothing it writes beyond the array.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	946f07af3e	metadata: Use a consistent format for callback fn parameters	2018-01-05 14:24:56 +00:00
Alasdair G Kergon	b96862ee11	metadata: Consistently skip metadata areas that failed. Even after writing some metadata encountered problems, some commands continue (rightly or wrongly) and attempt to make further changes. Once an mda is marked MDA_FAILED, don't try to use it again. This also applies when reverting, where one loop already skips failed mdas but the other doesn't. This fixes some device open_count warnings on relevant failure paths.	2017-12-12 17:52:45 +00:00
Alasdair G Kergon	d591d04103	device: Tag I/O for each mda on a device separately in log messages. Mark the first metadata area on each text format PV as MDA_PRIMARY. Pass this information down to the device layer so that when there are two metadata areas on a block device, we can easily distinguish two independent streams of I/O.	2017-12-07 03:48:11 +00:00
Alasdair G Kergon	e4805e4883	device: categorise block i/o Introduce enum dev_io_reason to categorise block device I/O in debug messages so it's obvious what it is for. DEV_IO_SIGNATURES /* Scanning device signatures / DEV_IO_LABEL / LVM PV disk label / DEV_IO_MDA_HEADER / Text format metadata area header / DEV_IO_MDA_CONTENT / Text format metadata area content / DEV_IO_FMT1 / Original LVM1 metadata format / DEV_IO_POOL / Pool metadata format / DEV_IO_LV / Content written to an LV / DEV_IO_LOG / Logging messages */	2017-12-04 23:45:26 +00:00
Heinz Mauelshagen	4daad1cf11	lv_manip: allow extension on --nosync raid lv If the recovery of the repleced leg(s) of a RaidLV created without initial resynchronization (i.e. "lvcreate --nosync ...") got interrupted, it can't be extended because of the < 100% sync rate.	2017-12-01 18:38:18 +01:00
Heinz Mauelshagen	d3d18e637c	raid: ignore --stripesize on raid4/5 conversion to 1 stripe In case caller passes in changed stripe size when reshaping raid4/5 to 1 stripe aiming to convert to raid1 and optionally to linear, ignore it to prevent data corruption.	2017-12-01 15:00:09 +01:00
Zdenek Kabelac	c489dd2e17	pvmove: add missing segment merging When pvmove is finished and metadata are updated, the code missed to merge possible mergable segments - so add explicit merging call after pvmoved volumes are unlocked. This avoids weird results where i.e. lvs could have been reporting non-matching segments as lvs upon metadata read is doing silent segment merging while dm table left after pvmove was still preserving non-merged segments.	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	fbd8b456db	pvmove: move code from tools to lib Move code manipulating with locking flags into /lib part of lvm.	2017-12-01 12:18:32 +01:00
Zdenek Kabelac	02e934c444	cleanup: reuse existing macro Use existing macro to detect striped raid segment.	2017-11-27 10:34:30 +01:00
Zdenek Kabelac	5e88d3a89b	cache: use conditional in warning message In some cases the message could be slightly misleading so use here rather conditional. TODO: In future we may possibly further tune the message in case we are certain the level of redundancy protection has not been reduced.	2017-11-24 16:09:59 +01:00
Zdenek Kabelac	ddbe763eb8	mirror: use lv_update_and_reload_origin Replace complex code with standard lv_update_and_reload_origin(). Extra suspend should not be necessary. (If they would be - dependency tree would have bug for fixing).	2017-11-24 16:05:21 +01:00
Zdenek Kabelac	b5be7420d9	locking: pvmove is locking holding LV As we do get lock for pvmove LV - it's lockholder ATM.	2017-11-24 16:05:21 +01:00
Heinz Mauelshagen	93c02e2532	raid: add validation checks for reshape flags Enhance vg_validate() raid checking functions to check for flags LV_RESHAPE and LV_RESHAPE_DELTA_DISKS_(MINUS\|PLUS).	2017-11-15 21:24:44 +01:00
Zdenek Kabelac	eab9097b46	layers: collect only lock holding LVs	2017-11-15 12:11:33 +01:00
Zdenek Kabelac	cc854c0617	pvmove: return pvmove itself When find_pvmove_lv_in_lv() get already a 'pvmoving' LV - return it.	2017-11-15 11:51:53 +01:00
Alasdair G Kergon	b5f62a143d	metadata: Eliminate redundant nested VG metadata Only lv_committed() now uses vg->vg_committed and it appears redundant if its contents match the enclosing VG so don't waste cycles creating it when that's known to be true when no write lock is held so the struct won't get modified.	2017-11-14 15:38:55 +00:00
Heinz Mauelshagen	ebd0fed0ce	raid: correct raid6_n_6 -> raid5 convenience type Fix "lvconvert --type raid5 RaidLV" on a "raid6_n_6" LV offering false "raid6_ls_6" instead of "raid5_n".	2017-11-14 14:41:06 +00:00
Alasdair G Kergon	00acae12a4	metadata: Remove unused vg.cft_precommitted The precommitted metadata config_tree is now only referenced from a single function so just use a local variable instead.	2017-11-14 01:22:09 +00:00
Alasdair G Kergon	6bf0f04ae2	log: Improve various device-related messages - Use 'lvmcache' consistently instead of 'metadata cache' - Always use 5 characters for source line number - Remember to convert uuids into printable form - Use <no name> rather than (null) when VG has no name.	2017-11-13 19:45:33 +00:00
Zdenek Kabelac	dd06a0a4a6	lv_lock_holder: unused cache-pool is not lock holder Unused cache-pool is only a constainer for data and metadata, and does not present localble entity.	2017-11-11 00:59:46 +01:00
Zdenek Kabelac	52cee9dd83	lvremove: for unused cache deactive sublv	2017-11-11 00:59:19 +01:00
Zdenek Kabelac	55b8204ca3	reload: do not take backup with suspended devices If the suspend/resume sequence would leave some device in suspend for possible later resume, backup cannot be takes (fs holding backups could be still frozen in critical section())	2017-11-11 00:58:11 +01:00
Zdenek Kabelac	05f9acdc7f	raid: protect raid4 activation Move check for presence of raid4 into the right place so there is no way how to hit activation of any LV with raid4 on kernel which does not support it.	2017-11-11 00:56:10 +01:00
Heinz Mauelshagen	9958c41927	raid: reject message for 2-legged raid4/5 -> striped Commit `763db8aab0` rejects 2-legged conversions to striped/raid0 but different messages are displayed for raid0 or striped. This commit provides the same rejection messages.	2017-11-08 18:17:26 +01:00
Heinz Mauelshagen	763db8aab0	raid: reject conversion request to striped/raid0 on 2-legged raid4/5 raid4/5 LVs may only be converted to striped or raid0/raid0_meta in case they have at least 3 legs. 2-legged raid4/5 are a result of either converting a raid1 to raid4/5 (takeover) or converting a raid4/5 with more than 2 legs to raid1 with 2 legs (reshape). The raid4/5 personalities map those as raid1, thus reject conversion to striped/raid0. Resolves: rhbz1511047	2017-11-08 17:49:04 +01:00
Zdenek Kabelac	3076a839a5	cleanup: drop unneeded headerfiles Coverity reported these are no longer in use.	2017-11-07 21:26:11 +01:00
Zdenek Kabelac	2354fb3fe4	coverity: avoid overflow_before_widen TODO: it likely should be checked value is >0...	2017-11-07 21:26:11 +01:00
Zdenek Kabelac	7a394575fb	cleanup: use segtype_is_raid_with_meta Replace with common macro.	2017-11-01 00:59:22 +01:00
Zdenek Kabelac	373372c8ab	lv_manip: hide layered LV temporarily Since vg_validate() now rejects LVs without segments and insert_layer_for_segments_on_pv() gets just created 'layer_lv' without segment, it needs to be hidden from vg->lvs during processing of _align_segment_boundary_to_pe_range() as this function calls lv_validate() and now requires vg to be consistent. LV is then put back into vg->lvs.	2017-11-01 00:55:24 +01:00
Alasdair G Kergon	248144d066	liblvm: Fix segfault in lvm_pv_remove. Since `4fa5add6b1` ("pvcreate: Wipe cached bootloaderarea when wiping label.") label_remove is responsible for the lvmcache_del. (toollib and liblvm need fixing to share the code.)	2017-10-30 22:03:35 +00:00
Zdenek Kabelac	2b6391538c	raid: setup LV size earlier New validation code which does require to not store LV with no size (no segments) revealed this size setup code needs to happen earlier.	2017-10-30 17:23:56 +01:00
Zdenek Kabelac	83d5db056b	lvreduce: check LV has segment Before accessing content make sure LV has segment. This can be used in case code removes LV without segments (i.e. on some error path)	2017-10-30 14:39:16 +01:00
Zdenek Kabelac	0424410773	validation: capture store of LV without segment	2017-10-30 14:39:16 +01:00
Alasdair G Kergon	84aca4201e	vgsplit: Fix detection of moved PVs. vgsplit shares the vg_rename code so that must only set the PV_MOVED_VG flag introduced in commit `486ed10848` ("vgmerge: Fix intermediate metadata corruption") on PVs that moved.	2017-10-27 22:53:43 +01:00
Zdenek Kabelac	63c50ced89	snapshot: relocate common code validation for snapshot origin Since both lvcreate and lvconvert needs to check for same type of allowed origin for snapshot - move the code into a single function. This way we also fix several inconsitencies where snapshot has been allowed by mistake either through lvcreate or lvconvert path.	2017-10-27 17:07:42 +02:00
Heinz Mauelshagen	4a3884245d	raid: ignore --stripes/--stripesize on takeover Converting from one raid level to another, no changes of stripes or stripesize can be requested because those are subject to reshaping. I.e. the process requires to takeover first and secondly request raid algorithm, stripe or stripesize changes. Ignore any related changes display warninngs and proceed with the takeover. Without this patch, a takeover requesting stripesize change causes data corruption!	2017-10-26 17:16:23 +02:00
Zdenek Kabelac	0e7edd1d24	snapshot: improve validation Do not allow to take snapshot of mirror/raid leg or log or metadata LV. This was actually never supported, but user was able to create it, and this put device stack in hardly fixable state (needs manual work). This prevents such creation to pass. Also improve validation when recreating snapshot volume type from origin and COW volume.	2017-10-25 21:58:01 +02:00
Zdenek Kabelac	d6fcab900b	lvextend: detect stacked cache lv used for thinpool Ensure, that cacheLV is not tried to be resize until full support is added.	2017-10-23 12:00:43 +02:00
Alasdair G Kergon	f3ae99dcc0	liblvm: Move lib code used exclusively into metadata-liblvm.c Also remove some redundant function definitions from metadata.h.	2017-10-18 19:29:32 +01:00
Alasdair G Kergon	f1cc5b12fd	tidy: Add missing underscores to statics.	2017-10-18 15:58:13 +01:00
Alasdair G Kergon	146745ad88	device: Separate errors for dev not found and filtered. Replaced the confusing device error message "not found (or ignored by filtering)" by either "not found" or "excluded by a filter". (Later we should be able to say which filter.) Left the the liblvm code paths alone.	2017-10-17 02:12:41 +01:00
David Teigland	6ac1e04b3a	replicator: remove the code It has not been used in a long time and is not expected to be used further.	2017-10-13 16:20:42 -05:00
Heinz Mauelshagen	cf13a30eaa	lvcreate: allow 100%FREE creation of "--type mirror" to work Fixes the following case with 3PVs and 3 legs "mirror" LV: # lvcreate -l100%FREE --type mirror -m2 vg3 Insufficient free space for log allocation for logical volume . Unable to allocate extents for mirror log. Related: rhbz1269533	2017-10-12 17:43:24 +02:00
Alasdair G Kergon	22789563de	thin: Improve overprovisioning and repair warnings.	2017-10-09 19:48:00 +01:00
Heinz Mauelshagen	3a639d8144	raid: cleanup raid4/5/6/10 validation check	2017-10-09 16:13:45 +02:00
Heinz Mauelshagen	44275c763c	raid: fix validation check for raid0 segment data_offset member Commit `2f754b73ff` missed one.	2017-10-09 16:03:35 +02:00
Heinz Mauelshagen	5f13e33d54	lvcreate: fix region size on striped RaidLVs Creating striped RaidLVs with lv size not divisible by region size caused the region size to be adjusted: # lvcreate --type raid5 -n region_check.32.00m_3 -i 3 -L 1g --nosync -R 32.00m raid_sanity Using default stripesize 64.00 KiB. Rounding size 1.00 GiB (256 extents) up to stripe boundary size <1.01 GiB(258 extents). WARNING: New raid5 won't be synchronised. Don't read what you didn't write! Using reduced mirror region size of 8.00 MiB Logical volume region_check.32.00m_3 created. Fix by not imposing "mirror" constraints on "raid". Resolves: rhbz1404007	2017-10-09 14:35:06 +02:00
Heinz Mauelshagen	2f754b73ff	raid: fix validation checks for segment data_offset member Commit `222e1e3ace` was not valuing special case of data_ofset member equal to 1.	2017-10-09 14:01:23 +02:00
Heinz Mauelshagen	554a761db2	raid: return previous reshape space allocation properly Fix returning previous allocation of reshape space.	2017-10-09 13:55:01 +02:00
Alasdair G Kergon	486ed10848	vgmerge: Fix intermediate metadata corruption vgmerge suffers from a similar problem to the one fixed in commit `8146548d25` ("vgsplit: Fix intermediate metadata corruption.") When merging, splitting or renaming VGs, use a new PV status flag PV_MOVED_VG to mark the PVs that hold metadata with the old VG name and use this to provide PV-level granularity instead of incorrectly assuming all PVs in the VG are the same.	2017-10-06 02:20:45 +01:00
Heinz Mauelshagen	a95f656d0d	raid: enhance conversion rejection message Related: rhbz1439399	2017-10-04 17:05:59 +02:00
David Teigland	f2ee0e7aca	pvmove: require LV name in a shared VG In a shared VG, only allow pvmove with a named LV, so that only PE's used by the LV will be moved. The LV is then activated exclusively, ensuring that the PE's being moved are not used from another host. Previously, pvmove was mistakenly allowed on a full PV. This won't work when LVs using that PV are active on other hosts.	2017-09-20 09:56:51 -05:00
David Teigland	518a8e8cfb	lvmlockd: activate mirror LVs in shared mode with cmirrord Previously lvmlockd disallowed mirror LVs to be activated in shared mode.	2017-09-20 09:55:34 -05:00
David Teigland	3071837e21	lvmlockd: always disallow mirror splitting lv_raid_split() was correctly prevented in a shared VG, but lv_raid_split_and_track() was missing that check.	2017-09-05 10:28:33 -05:00
Heinz Mauelshagen	222e1e3ace	raid: more validation checks for segment data_offset member Upgrade commit `fb641c3423` with additional checks.	2017-08-14 15:00:15 +02:00
Zdenek Kabelac	8256170e6a	thin: warn about too big chunks size lvm2 warned about zeroing and too big chunksize (>=512KiB), but only during lvconvert, so lvcreate was creating thin-pools without any warning about possible slowness of thin provisioning because of zeroing.	2017-08-01 11:52:27 +02:00
Zdenek Kabelac	876c4a1b3b	tidy: declaration names match implementation Put in sync some naming used for function declaration and actual in-code implementation.	2017-07-20 19:16:41 +02:00
Zdenek Kabelac	39ebacdb5a	raid: reshape synchronization point Give udev time to get in sync and give md-core time to wake up after table reload.	2017-07-20 19:16:39 +02:00
Alasdair G Kergon	7ba0017468	raid: avoid lv_size compiler warning warning: declaration of ‘lv_size’ shadows a global declaration	2017-07-20 16:16:51 +01:00
Zdenek Kabelac	c78316b7a5	raid: move syncing with udev into function Since _deactivate_and_remove_lvs() is used in more then one place, move the needed udev synchronization into this function so other users automatically get correct fs state before next dm manipulation. Assumption here is that this udev synchronization 'delay' may also prevent to 'early' table reloads which might cause kernel problems for md-core - but we may need more generic time-limited reload frequency for raid devices. Note: on udev-less system there will be almost no delay.	2017-07-20 13:52:18 +02:00
Zdenek Kabelac	48ce8c7a49	tidy: drop unneeded cast Avoid casting to the same type.	2017-07-20 11:20:44 +02:00
Zdenek Kabelac	4a2994b7b1	tidy: name all parameters	2017-07-20 11:20:26 +02:00
Zdenek Kabelac	4ef6cfc882	tidy: else after continue Similar as with 'else' after 'return' unindent whole block for better readability of code.	2017-07-20 11:18:29 +02:00
Zdenek Kabelac	0bf836aa14	tidy: prefer not using else after return clang-tidy: avoid using 'else' after return - give more readable code, and also saves indention level.	2017-07-20 11:18:29 +02:00
Zdenek Kabelac	0d0a3397c2	cleanup: add braces in macro	2017-07-20 11:18:29 +02:00
Heinz Mauelshagen	fb641c3423	raid: add validation checks for segment data_offset member Commit `34504855a7` introduced flag LV_RESHAPE_DATA_OFFSET and used it to avoid incompatible activation on older runtime. Enhance vg_validate() raid checking functions with checks for it.	2017-07-15 00:51:43 +02:00
Heinz Mauelshagen	34504855a7	raid: add data_offset incompatibility segment type flag In order to reject out of place reshaping with segment data_offset field on old runtime, add a respective segment type incompatibility flag causing "+RESHAPE_DATA_OFFSET" to be suffixed to the segment type name.	2017-07-14 15:53:23 +02:00
Heinz Mauelshagen	1d69fc7c5e	raid: use return_0 for better backtracking	2017-07-14 15:53:23 +02:00
Heinz Mauelshagen	6685460f5a	lvconvert: allow reshaping in the cluster and on open devices The previous commit fixed allocation/activation of reshape space. Remove conditionals prohibiting reshaping in these cases. Related: rhbz1447812 Related: rhbz1448116 Related: rhbz1461562	2017-07-14 15:53:23 +02:00
Heinz Mauelshagen	f1b78665ef	raid: fix allocation/activation of reshape space When reshape space is allocated anew, an update and reload is needed to promote the new size to the cluster node with the exclusively active RaidLV or reloading the RaidLV will fail with a size related error. Additionally, store "data_offset <sectors>" with the RaidLV in the lvm2 metadata so that it can be retrieved on cluster nodes. Process allocation of reshape space on a 2-legged raid4/5 (interim layout to convert from/to linear via raid1) properly in the cluster. Resolves: rhbz1461562 Resolves: rhbz1448116	2017-07-14 15:53:23 +02:00
Eric Ren	4c94371005	comment: update Use 'is' for both forms.	2017-07-10 14:58:01 +02:00
David Teigland	3797f47ecf	lvmlockd: fix revert in lvcreate If the activation step in lvcreate fails (e.g. the specified minor number is already used), then the lvcreate is reverted, but the LV lock in lvmlockd was not being unlocked or properly freed.	2017-07-07 14:42:25 -05:00
Zdenek Kabelac	2ceb5a0abb	coverity: just make impossible division by zero Visible for analyzer code will not try to use 0 for division.	2017-06-30 20:39:23 +02:00
Zdenek Kabelac	ad286a3227	raid: ensure enum is defined Just making sure enum is always defined. TODO: code path using this enum needs closer inspection.	2017-06-30 20:39:02 +02:00
Zdenek Kabelac	e9c60f874e	coverity: extra check for find_pool_seg find_pool_seg may return NULL in some internal error stats. Handle it explicitely.	2017-06-27 12:15:15 +02:00
Zdenek Kabelac	b939ddf80c	debug: more display_lvname usage	2017-06-27 08:28:36 +02:00
Zdenek Kabelac	d444accdbf	debug: fail in backup is not traced nor error	2017-06-27 00:27:36 +02:00
Zdenek Kabelac	c440bb0742	debug: check for fail in id validation	2017-06-27 00:27:36 +02:00
Zdenek Kabelac	3e331c8e68	cleanup: remove unused code	2017-06-27 00:27:25 +02:00
Zdenek Kabelac	b1e21cf9ed	raid: fix write_commit_backup With commit `41c10034aa` we actually do require LV to be used with _vg_write_lv_suspend_commit_backup(). So write a proper separte single wrapper for write && commit && backup.	2017-06-27 00:27:25 +02:00
Zdenek Kabelac	c465ca6a3a	raid: allow more sync action for extraction Since we discovered status reporting from 'md' goes from large set of weird states we can't just decided based on this word. So let it pass for rebuild and idle as well and check for health devices afterwards.	2017-06-24 22:28:25 +02:00
Zdenek Kabelac	1bd4b0059b	cleanup: use display_percent Replace occurence of %.2f with call of display_percent function.	2017-06-24 17:44:42 +02:00
Zdenek Kabelac	2b18be87aa	raid: recognize transient failed raid leg When raid leg rimage device is marked as 'D'ead by mdcore, lvm2 was not able to replace such device with allocate policy, as device has not appared as missing. Add detection of transiently failing devices.	2017-06-23 23:27:07 +02:00
Zdenek Kabelac	cc03a872c0	cleanup: update messages	2017-06-23 18:44:01 +02:00
Zdenek Kabelac	a7c7d53543	debug: add missing internal error message Do not just 'return_0' log error would need to be shown.	2017-06-23 18:44:01 +02:00
Zdenek Kabelac	1bdcd156fd	cache: restore origin only reload Basically reverting commit `58a9f88b8c`. We can use origin_only in case we are snapshot's origin, as we do support this stack. So when we are 'uncaching' origin+snaps - we do need to reload only origin and we do not need to play with snaps.	2017-06-23 18:44:01 +02:00
Zdenek Kabelac	63ecbcd1b7	raid: switch message to verbose As this is not 'error' resulting query, decrease reported level.	2017-06-23 18:44:01 +02:00
Zdenek Kabelac	6d30350dd1	raid: improving messages for regionsize change Handle change of 'region size' better and follow also standard rule if the command can't success (i.e. size is already same) we return error for all such cases. Also log_pring more info about adjusted value (just like we do for rounding) Also avoid keep pointers on 'display_*' values - they are in ringbuffer for immediate use - not to be kept across multiple calls (as they could be already overwritten by later calls) - so dropped seg_region_size_str	2017-06-23 18:44:00 +02:00
Zdenek Kabelac	41c10034aa	debug: show message only when origin_only was set	2017-06-22 20:17:20 +02:00
Zdenek Kabelac	58a9f88b8c	cache: drop usage of origin_only Since cache LV can be a stacked device, there is no real reason trying to use slight optimised tree for origin_only cache reload (it could be even wrongly implemented in this case). We can easily go with stardard tree load here.	2017-06-22 20:14:31 +02:00
Zdenek Kabelac	ca9e6cec61	cache: make syncing abortable by user When user runs command like 'lvconvert --splitcache' the operation might be actually either slow or not making any progress in kernel, so lets give user a chance to abort such operation. When user press 'Ctrl+C' device table is restored to pre-flushing state.	2017-06-22 20:11:43 +02:00
Heinz Mauelshagen	2df9a78684	mirror: reformat conditional	2017-06-22 00:57:16 +02:00
Heinz Mauelshagen	64fac77e8a	raid: fix segfault Add missing else clause (already missing in initial commit `fe18e5e77a`). Resolves: rhbz1463794	2017-06-22 00:49:00 +02:00
Zdenek Kabelac	e3f63693a4	lvresize: support passing --yes to fsadm Since fsadm now needs --yes to pass prompting operations, we need to pass --yes from lvresize to fsadm.	2017-06-21 14:03:29 +02:00
Zdenek Kabelac	48f06005ab	raid: update path for repair Updating path from commit `61980bcf06`. When repair is running, no removing PVS are given so it shall return success in such case.	2017-06-21 14:00:50 +02:00
Zdenek Kabelac	5f4cfa7c4a	debug: missing traces	2017-06-21 12:36:01 +02:00
Zdenek Kabelac	07fe64b473	raid: use log_error on error path Converting log_warn to log_error since error must be logged when tool returns error.	2017-06-21 12:35:17 +02:00
Zdenek Kabelac	61980bcf06	raid: report error when specified devices are not contained lvm2 always return non-zero error code when action cannot happen.	2017-06-21 12:35:17 +02:00
Zdenek Kabelac	31d153ced0	raid: drop debug code	2017-06-21 12:35:16 +02:00
Zdenek Kabelac	49fa2bea1c	raid: more origin_only updates Seems the code is multiplied - so keep it consistent for now. TODO: drop all uneeded code	2017-06-21 12:35:16 +02:00
Heinz Mauelshagen	1766eaec4b	lvconvert: provide better reshape reject message for open RaidLV On commits `5e611c700b` and `601ad1c73f`. Related: rhbz1447812	2017-06-20 19:06:18 +02:00
Heinz Mauelshagen	76314183e2	raid: avoid explicit activation of SubLVs on reshape/takeover Remove explicit activation of SubLVs and let lv_update_and_reload() perform the proper (pre-)loading sequencing of tables. This avoids related callback functions which are removed. Related: rhbz1448116 Related: rhbz1461526 Related: rhbz1448123	2017-06-20 18:56:45 +02:00
Heinz Mauelshagen	0dfe1bc29d	raid: provide clickable URL BZ references	2017-06-20 18:43:26 +02:00
Zdenek Kabelac	1ea41b6d48	activation: fix usage of origin_only When lock-holding LV differs from actually request locked LV, we drop origin_only flag as it has no use - it'd be applied on completely different LV. Example of problem: Raid is thin-pool _tdata LV. Raid run origin_only locking on stacked device. As lock holder is discovered thinLV. Whole origin_only operation is then applied only on thinLV changing the meaning of whole operation. NOTE: this patch does not change anything for LV that are already top-level lock holding LVs (i.e. thinLVs, snahoshots/origins).	2017-06-20 18:23:24 +02:00
Heinz Mauelshagen	5e611c700b	lvconvert: check open count to disable reshaping of open RAID LV Also check LV open count in addition to opening the RaidLV exclusively as of commit `601ad1c73f`. Related: rhbz1447812	2017-06-20 17:59:10 +02:00
Heinz Mauelshagen	601ad1c73f	lvconvert: enhance disable reshaping of open RAID LV Enhance commit `9e9163618a` to use dev_open_flags/dev_close API. Related: rhbz1447812	2017-06-20 17:27:58 +02:00
Zdenek Kabelac	19cc03fa52	thin: restore conversion to raid Since commit `1bc546269a` we've disabled coversion of raid. This however already got fixed, so reenable commands like: 'lvconvert --type raid1 vg/pool_tdata'.	2017-06-19 23:30:08 +02:00
Heinz Mauelshagen	9e9163618a	lvconvert: disable reshaping of open RAID LV Disable until we have a proper fix for reshape space allocation, switching it to begin/end of rimages and activation. Related: rhbz1447812	2017-06-19 22:25:54 +02:00
Heinz Mauelshagen	e1a1c20e95	lvconvert: enhance message Enhance message introduced by last commit `f342e803ba`. Related: rhbz1439399	2017-06-19 21:40:38 +02:00
Heinz Mauelshagen	f342e803ba	lvconvert: disable conversion of RAID LV under snapshot Disable until we have a proper fix for reshape space allocation, switching it to begin/end of rimages and activation. Related: rhbz1439399	2017-06-19 21:08:52 +02:00
Heinz Mauelshagen	fb46175ce7	lvconvert: disable reshaping of RAID LVs in the cluster Disable until we have a proper fix for reshape space allocation, switching it to begin/end of rimages and activation in the cluster. Related: rhbz1448116 Related: rhbz1461526 Related: rhbz1448123	2017-06-19 21:06:53 +02:00
Zdenek Kabelac	fbb3bffb22	debug: passing non-raid seg would be internal error	2017-06-16 17:04:02 +02:00
Zdenek Kabelac	9e96f96a41	cleanup: drop unused parameter	2017-06-16 17:04:02 +02:00
Zdenek Kabelac	cdb55c19cd	cleanup: show what happens when passed prompt When we show prompt and user passes --yes - we still do tell user which action is going to happen.	2017-06-16 17:04:02 +02:00
Zdenek Kabelac	14816222a1	cleanup: improve debug tracing	2017-06-16 17:04:02 +02:00
Zdenek Kabelac	59d646167f	raid: report percent with segtype info Enhance reporting code, so it does not need to do 'extra' ioctl to get 'status' of normal raid and provide percentage directly. When we have 'merging' snapshot into raid origin, we still need to get this secondary number with extra status call - however, since 'raid' is always a single segment LV - we may skip 'copy_percent' call as we directly know the percent and also with better precision. NOTE: for mirror we still base reported number on the percetage of transferred extents which might get quite imprecisse if big size of extent is used while volume itself is smaller as reporting jump steps are much bigger the actual reported number provides. 2nd.NOTE: raid lvs line report already requires quite a few extra status calls for the same device - but fix will be need slight code improval.	2017-06-16 17:04:01 +02:00
Heinz Mauelshagen	ddf2a1d656	Revert "lvconvert: reject changing number of stripes on single core This reverts commit `3719f4bc54` to allow for single core testing on kernels with deadlock fixes relative to rhbz1443999."	2017-06-16 15:43:23 +02:00
Jonathan Brassow	6c4b2a6aa1	clean-up: Very picky update to comment - hopefully making it clearer	2017-06-14 15:22:04 -05:00
Jonathan Brassow	1f57a5263e	clean-ups: remove unused var, add 'static' for local fn, adjust test For the test clean-up, I was providing too many devices to the first command - possibly allowing it to allocate in the wrong place. I was also not providing a device for the second command - virtually ensuring the test was not performing correctly at times.	2017-06-14 14:49:42 -05:00
Jonathan Brassow	ddb14b6b05	lvconvert: Disallow removal of primary when up-converting (recovering) This patch ensures that under normal conditions (i.e. not during repair operations) that users are prevented from removing devices that would cause data loss. When a RAID1 is undergoing its initial sync, it is ok to remove all but one of the images because they have all existed since creation and contain all the data written since the array was created. OTOH, if the RAID1 was created as a result of an up-convert from linear, it is very important not to let the user remove the primary image (the source of all the data). They should be allowed to remove any devices they want and as many as they want as long as one original (primary) device is left during a "recover" (aka up-convert). This fixes bug 1461187 and includes the necessary regression tests.	2017-06-14 08:41:05 -05:00
Jonathan Brassow	4c0e908b0a	RAID (lvconvert/dmeventd): Cleanly handle primary failure during 'recover' op Add the checks necessary to distiguish the state of a RAID when the primary source for syncing fails during the "recover" process. It has been possible to hit this condition before (like when converting from 2-way RAID1 to 3-way and having the first two devices die during the "recover" process). However, this condition is now more likely since we treat linear -> RAID1 conversions as "recover" now - so it is especially important we cleanly handle this condition.	2017-06-14 08:39:50 -05:00
Jonathan Brassow	d34d2068dd	lvconvert: Don't require a 'force' option during RAID repair. Previously, we were treating non-RAID to RAID up-converts as a "resync" operation. (The most common example being 'linear -> RAID1'.) RAID to RAID up-converts or rebuilds of specific RAID images are properly treated as a "recover" operation. Since we were treating some up-convert operations as "resync", it was possible to have scenarios where data corruption or data loss were possibilities if the RAID hadn't been able to sync completely before a loss of the primary source devices. In order to ensure that the user took the proper precautions in such scenarios, we required a '--force' option to be present. Unfortuneately, the force option was rendered useless because there was no way to distiguish the failure state of a potentially destructive repair from a nominal one - making the '--force' option a requirement for any RAID1 repair! We now treat non-RAID to RAID up-converts properly as "recover" operations. This eliminates the scenarios that can potentially cause data loss or data corruption; and this eliminates the need for the '--force' requirement. This patch removes the requirement to specify '--force' for RAID repairs.	2017-06-14 08:39:07 -05:00
Jonathan Brassow	c87907dcd5	lvconvert: linear -> raid1 upconvert should cause "recover" not "resync" Two of the sync actions performed by the kernel (aka MD runtime) are "resync" and "recover". The "resync" refers to when an entirely new array is going through the process of initializing (or resynchronizing after an unexpected shutdown). The "recover" is the process of initializing a new member device to the array. So, a brand new array with all new devices will undergo "resync". An array with replaced or added sub-LVs will undergo "recover". These two states are treated very differently when failures happen. If any device is lost or replaced while "resync", there are no worries. This is because any writes created from the inception of the array have occurred to all the devices and can be safely recovered. Even though non-initialized portions will still be resync'ed with uninitialized data, it is ok. However, if a pre-existing device is lost (aka, the original linear device in a linear -> raid1 convert) during a "recover", data loss can be the result. Thus, writes are errored by the kernel and recovery is halted. The failed device must be restored or removed. This is the correct behavior. Unfortunately, we were treating an up-convert from linear as a "resync" when we should have been treating it as a "recover". This patch removes the special case for linear upconvert. It allows each new image sub-LV to be marked with a rebuild flag and treats the array as 'in-sync'. This has the correct effect of causing the upconvert to be treated as a "recover" rather than a "resync". There is no need to flag these two states differently in LVM metadata, because they are already considered differently by the kernel RAID metadata. (Any activation/deactivation will properly resume the "recover" process and not a "resync" process.) We make this behavior change based on the presense of dm-raid target version 1.9.0+.	2017-06-14 08:35:22 -05:00
Heinz Mauelshagen	08079ec420	lvconvert: fix detached SubLV deactivation in cluster On conversion from raid10 to raid0 (takeover), all rmeta devices and the rimage devices of mirrored stripes are detached from the raid10 LV. The remaining rimage areas are being shifted down into the slots of the detached ones hence requiring renames to show proper _N suffix sequences (e.g. 0,1,2,3 instead of 0,2,4,6). Only the top-level raid10 LV has a cluster lock, not the detached SubLVs thus their deactivation is impossible and e.g the rename from _rimage_6 to _rimage_3 will fail. Fix by activating exclusively before deactivating and removing. Resolves: rhbz1448123	2017-06-13 23:15:51 +02:00
Heinz Mauelshagen	1c916ec5ff	raid: add reshape segtype flag support Prohibit activation of reshaping RaidLVs on incompatible lvm2 runtime by storing e.g. 'raid5+RESHAPE' segment type strings in the lvm2 metadata. Incompatible runtime not supporting reshaping won't be able to activate those thus avoiding potential data corruption. Any new non-reshaping lvconvert command will reset the segment type string from 'raid5+RESHAPE' to 'raid5'. See commits `0299a7af1e` and `4141409eb0` for segtype flag support.	2017-06-09 22:23:04 +02:00
Zdenek Kabelac	57379157f4	cleanup: update message	2017-06-09 21:49:19 +02:00
Zdenek Kabelac	db5938a4f8	cleanup: define really uses KB Cleanup also units for DEFAULT_THIN_POOL_OPTIMAL_METADATA_SIZE define (128MB) and update calcs for it.	2017-06-09 21:49:19 +02:00
Zdenek Kabelac	48ffb996c5	thin: disallow creation of too big thin pools When a combination of thin-pool chunk size and thin-pool data size goes beyond addressable limit, such volume creation is directly prohibited. Maximum usable thin-pool size is calculated with use of maximal support metadata size (even when it's created smaller) and given chunk-size. If the value data size is found to be too big, the command reports error and operation fails. Previously thin-pool was created however lots of thin-pool data LV was not usable and this space in VG has been wasted.	2017-06-08 11:58:36 +02:00
Zdenek Kabelac	719d099693	cleanup: rename internal define More descriptive name of #define.	2017-06-08 11:07:18 +02:00
Heinz Mauelshagen	39703cb485	lvconvert: reject RAID conversions on inactive LVs Only support RAID conversions on active LVs. If we'd accept e.g. upconverting linear -> raid1 on inactive linear LVs, any LV flags passed to the kernel aren't properly cleared thus errouneously passing them on every activation. Add respective check to lv_raid_change_image_count() and move existing one in lv_raid_convert() for better messages.	2017-06-07 18:37:04 +02:00
Heinz Mauelshagen	3217e0cfea	lvconvert: choose direct path to desired raid level Remove superfluous raid5_n interim LV type from raid4 -> raid10 conversion. Resolves: rhbz1458006	2017-06-02 14:30:57 +02:00
David Teigland	c98a25aab1	print warning about in-use orphans Warn about a PV that has the in-use flag set, but appears in the orphan VG (no VG was found referencing it.) There are a number of conditions that could lead to this: . The PV was created with no mdas and is used in a VG with other PVs (with metadata) that have not yet appeared on the system. So, no VG metadata is found by lvm which references the in-use PV with no mdas. . vgremove could have failed after clearing mdas but before clearing the in-use flag. In this case, the in-use flag needs to be manually cleared on the PV. . The PV may have damanged/unrecognized VG metadata that lvm could not read. . The PV may have no mdas, and the PVs with the metadata may have damaged/unrecognized metadata.	2017-06-01 11:18:42 -05:00
David Teigland	f3c90e90f8	disable repairing in-use flag on orphan PVs A PV holding VG metadata that lvm can't understand (e.g. damaged, checksum error, unrecognized flag) will appear as an in-use orphan, and will be cleared by this repair code. Disable this repair until the code can keep track of these problematic PVs, and distinguish them from actual in-use orphans.	2017-06-01 09:53:14 -05:00
Heinz Mauelshagen	3719f4bc54	lvconvert: reject changing number of stripes on single core Reject any stripe adding/removing reshape on raid4/5/6/10 because of related MD kernel deadlock on single core systems until we get a proper fix in MD. Related: rhbz1443999	2017-05-30 19:14:32 +02:00
Heinz Mauelshagen	65b10281f8	Proper dm_snprintf return checks	2017-05-24 14:00:44 +02:00
Heinz Mauelshagen	3da5cdc5dc	Fix typo	2017-05-24 13:47:45 +02:00
David Teigland	7a0f46e2f8	add comment about PV in-use repair copied from commit message for `d97f1c89de`	2017-05-23 16:59:46 -05:00
Alasdair G Kergon	57492a6094	raid: Drop unnecessary/incorrect use of dm_pool_free	2017-05-23 01:51:04 +01:00
Alasdair G Kergon	fbe7464df5	metadata: Unlock VG on more _vg_make_handle error paths Internal error: VG lock vg0 must be requested before vg3, not after. Internal error: 3 device(s) were left open and have been closed.	2017-05-23 01:38:02 +01:00
Heinz Mauelshagen	2bf01c2f37	lvconvert: fix logic in automatic settings of possible (raid) LV types Commit `5fe07d3574` failed to set raid5 types properly on conversions from raid6. It always enforced raid6_ls_6 for types raid6/raid6_zr/raid6_nr/raid6_nc, thus requiring 3 conversions instead of 2 when asking for raid5_{la,rs,ra,n}. Related: rhbz1439403	2017-05-18 16:20:39 +02:00
Heinz Mauelshagen	9c651b146e	lvconvert: fix indent and typo in last commit	2017-05-18 00:43:20 +02:00
Heinz Mauelshagen	5fe07d3574	lvconvert: enhance automatic settings of possible (raid) LV types Offer possible interim LV types and display their aliases (e.g. raid5 and raid5_ls) for all conversions between striped and any raid LVs in case user requests a type not suitable to direct conversion. E.g. running "lvconvert --type raid5 LV" on a striped LV will replace raid5 aka raid5_ls (rotating parity) with raid5_n (dedicated parity on last image). User is asked to repeat the lvconvert command to get to the requested LV type (raid5 aka raid5_ls in this example) when such replacement occurs. Resolves: rhbz1439403	2017-05-18 00:18:15 +02:00
Alasdair G Kergon	80900dcf76	metadata: Fix metadata repair when devs still missing. _check_reappeared_pv() incorrectly clears the MISSING_PV flags of PVs with unknown devices. While one caller avoids passing such PVs into the function, the other doesn't. Move the check inside the function so it's not forgotten. Without this patch, if the normal VG reading code tries to repair inconsistent metadata while there is an unknown PV, it incorrectly considers the missing PVs no longer to be missing and produces incorrect 'pvs' output omitting the missing PV, for example. Easy reproducer: Create a VG with 3 PVs pv1, pv2, pv3. Hide pv2. Run vgreduce --removemissing. Reinstate the hidden PV pv2 and at the same time hide a different PV pv3. Run 'pvs' - incorrect output. Run 'pvs' again - correct output. See https://bugzilla.redhat.com/1434054	2017-05-11 02:17:34 +01:00
David Teigland	d45531712d	vg_read: check for NULL dev to avoid segfault There are certain situations (not fully understood) where is_missing_pv() is false, but pv->dev is NULL, so this adds a check for NULL pv->dev after is_missing_pv() to avoid a segfault.	2017-05-10 10:45:41 -05:00
Alasdair G Kergon	0e3c16af56	pvresize: Missing a message on error path.	2017-04-27 15:00:41 +01:00
Alasdair G Kergon	cbc69f8c69	pvresize: Prompt when non-default size supplied. Seek confirmation before changing the PV size to one that differs from the underlying block device.	2017-04-27 02:36:34 +01:00
Heinz Mauelshagen	8f305f025e	raid: handle insufficent PVs on takeover to/from raid4 Commit `7bc85177b0` felt short relative to striped/raid0* -> raid4 and raid4 -> raid6. Related: rhbz1438013	2017-04-22 01:19:44 +02:00
Heinz Mauelshagen	97a5fa4b87	raid: avoid superfluous variable	2017-04-22 00:50:36 +02:00
Heinz Mauelshagen	0c2fd133d7	raid: remove double minimum area check on takeover	2017-04-20 21:35:06 +02:00
Heinz Mauelshagen	d8a63f446e	raid: define return value on error paths	2017-04-20 21:32:40 +02:00
Heinz Mauelshagen	5fb5717402	raid: avoid superfluous reload on takeover Allow any reset rebuild flags to trigger the second update on takeover. Use descriptive callback names. Fix typo and add comments.	2017-04-20 21:18:27 +02:00
Heinz Mauelshagen	83cdba75bd	mirror/raid: display adjusted region size with units Display adjusted region size in units (e.g. "4.00 MiB") rather than sectors.	2017-04-20 20:42:21 +02:00
Heinz Mauelshagen	15c3ad9641	lvconvert: typo in message	2017-04-13 22:19:29 +02:00
Zdenek Kabelac	1e64386dc6	raid: use log_error Turn log_print into log_error for error path.	2017-04-12 23:05:50 +02:00
Heinz Mauelshagen	1f715ab3b2	lvconvert: return error without conversion lvconvert parameters not causing a conversion (i.e. no type, number of stripes, stripesize or regionsize changes) will remove any allocated reshape space in which case the command returns success. If reshape space does not exist though, return error.	2017-04-12 22:11:30 +02:00
Zdenek Kabelac	3018cdcaa7	fsadm: support configurable full path Just like with other tools lvm2 is using allow to define fully configurable path. Default is selected by $PREFIX/sbin/fsadm	2017-04-12 21:34:08 +02:00
Heinz Mauelshagen	51a31dbd79	lvconvert: better message on --regionsize Enhance message on "lvconvert --regionsize size RaidLV". in case the regionsize does not change and return error.	2017-04-12 19:34:18 +02:00
Jonathan Brassow	ba12a2e81a	Typo: change loose to lose loose (v): set free; release lose (v) : be deprived of or cease to have or retain We 'lose' redundancy or 'lose' meaning.	2017-04-12 10:28:19 -05:00
Heinz Mauelshagen	532388fad5	lvconvert: fix failing valid regionsize change Reshape check failed when regionsize changed and current raid type was provided with no other change requested (stripes or stripesize). E.g. "lvconvert --type raid6 --regionsize 256K" on a raid6 LV with != 256K regionsize. Enable --type in test script.	2017-04-12 14:38:49 +02:00
Heinz Mauelshagen	01b5820d03	lvconvert: add segment type raid10_near Introducing this alias for "raid10", avoid allocating reshape space when converting between them. Resolves: rhbz1441347	2017-04-12 01:28:22 +02:00
Heinz Mauelshagen	7bc85177b0	raid: handle insufficent PVs on takeover from striped/raid0 Remove any newly allocated sub LV (pair) remnants in case allocation fails due to lag of (parallel) free PV space and keep initial raid type. Resolves: rhbz1438013	2017-04-12 00:27:59 +02:00
David Teigland	69c3543855	raid_manip: fix typo warning message	2017-04-11 14:18:57 -05:00
Heinz Mauelshagen	ef3e1013aa	lvconvert: cleanup prompting	2017-04-06 19:59:57 +02:00
Heinz Mauelshagen	eb6302c8cb	lvconvert: fixe conversion message When selecting a convenience RAID type only display the selected type when it changed. Display proper current raid type when prompting.	2017-04-06 19:28:32 +02:00
Heinz Mauelshagen	653bca6811	lvconvert: raid1 -> linear prompt Avoid 2 prompts when downconverting raid1 to linear (related commit `0f65d7ec3a`).	2017-04-06 19:24:11 +02:00
Heinz Mauelshagen	3b1a96b9b3	lvconvert: avoid error message on raid1 -> raid4 conversion Avoid error message "Logical Volume *_rimage_0 already exists in volume group,,," on takeover conversion from a 2-legged raid1 to raid4 (aiming to reshape it adding images). Resolves: rhbz1439398	2017-04-06 19:09:05 +02:00
Heinz Mauelshagen	0f65d7ec3a	lvconvert: prompt on raid1 image changes Don't change resilience of raid1 LVs without --yes. Adjust respective tests.	2017-04-06 18:47:41 +02:00
Heinz Mauelshagen	e350b83d50	raid: reload on removing images Requesting _raid_remove_images() to commit the metadata missed to reload the origin causing a kernel takeover error converting a 2-legged raid1 (with previously removed images) to raid5.	2017-04-06 00:47:34 +02:00
Heinz Mauelshagen	d23cad16c9	raid: tidying	2017-04-06 00:06:52 +02:00
Heinz Mauelshagen	1ef1bdab27	lvconvert: allow --type with --regionsize Allow the combination of both arguments keeping the raid level but changing the regionssize (e.g. "lvconvert --type raid1 --regionsize 1M RaidLV" on an existing raid1 LV). Resolves: rhbz1438396	2017-04-06 00:03:16 +02:00
Heinz Mauelshagen	980e4f673e	raid: more coverity issues	2017-03-30 18:39:04 +02:00
Heinz Mauelshagen	c34ab29ec6	raid: favour dm_list_first()	2017-03-30 18:13:27 +02:00
Heinz Mauelshagen	2d75ef3b05	raid: address coverity issues	2017-03-30 18:09:06 +02:00
Alasdair G Kergon	396377bc03	pre-release Removing some unused new lines and changing some incorrect "can't release until this is fixed" comments. Rename license.txt to make it clear its merely an included file, not itself a licence.	2017-03-28 16:11:35 +01:00
Heinz Mauelshagen	1bf90dac77	Revert "raid: adjust to misordered raid table line output" This reverts commit `1e4462dbfb` in favour of an enhanced solution avoiding changes in liblvm completetly by checking the target versions in libdm and emitting the respective parameter lines.	2017-03-23 01:19:41 +01:00
Heinz Mauelshagen	7126fb13e7	metadata: cleanup flags definition to be consistent Use shift bitops throughout segtype.h.	2017-03-22 00:29:49 +01:00
Heinz Mauelshagen	1e4462dbfb	raid: adjust to misordered raid table line output The libdevmapper interface compares existing table line retrieved from the kernel to new table line created to decide if it can suppress a reload. Any difference between input and output of the table line is taken to be a change thus causing a table reload. The dm-raid target started to misorder the raid parameters (e.g. 'raid10_copies') starting with dm-raid target version 1.9.0 up to (excluding) 1.11.0. This causes runtime failures (limited to raid10 as of tests) and needs to be reversed to allow e.g. old lvm2 uspace to run properly. Check for the aforementioned version range and adjust creation of the table line to the respective (mis)ordered sequence inside and correct order outside the range (as described for the raid target in the kernels Documentation/device-mapper/dm-raid.txt).	2017-03-21 18:17:42 +01:00
Heinz Mauelshagen	fec2ea76cf	raid: check target version for shrink support Starting with dm-raid target version 1.9.0 shrinking of mapped devices is supported. Check for support being present in lvresize and lvreduce. Related: rhbz1394048	2017-03-17 16:46:33 +01:00
Heinz Mauelshagen	17a8f3d6f0	raid: conditionally reject convert to striped/raid; fix Fix a logic flaw introduced in commit `17bee733d1` preventing e.g. striped -> raid5 conversions. Related: rhbz1191935 Related: rhbz1366296	2017-03-17 16:03:35 +01:00
Heinz Mauelshagen	76709aaf39	raid: cleanup; remove unused function Remove unused function (lv_has_constant_stripes() is used instead).	2017-03-17 14:24:44 +01:00
Zdenek Kabelac	4a271e7ee7	properties: only thin-pool provides discards Quering non-thin-pool segment for discard property may lead to intenal error if the segment had set 'out-of-range' value, so only thin-pool is allowed, for other it returns NULL.	2017-03-17 14:22:33 +01:00
Heinz Mauelshagen	e0ea569045	raid: cleanup Move function _raid45_to_raid54_wrapper() to avoid superfluous declaration.	2017-03-17 14:14:42 +01:00
Heinz Mauelshagen	1520fec3e8	raid: name variables consistently Related: rhbz1191935 Related: rhbz1366296	2017-03-17 14:04:03 +01:00
Heinz Mauelshagen	17bee733d1	raid: conditionally reject convert to striped/raid0* If SubLVs to be removed still exist after an image removing conversion (i.e. "lvconvert --yes --force --stripes N " with N < total stripes) any request to convert to a different striped/raid* level has to be rejected until after those freed SubLVs got removed by running the aforementioned lvconvert again. Add tests to check conversion to striped/raid* gets rejected. Enhance a test comment. Related: rhbz1191935 Related: rhbz1366296	2017-03-17 13:58:54 +01:00
Heinz Mauelshagen	b0336e8b3c	lvconvert: ensure upconversion restrictions Ensure minimum number of 3 data stripes on conversions to raid6. Add test for it. Resolves: rhbz1432675	2017-03-16 22:10:32 +01:00
Zdenek Kabelac	4a727a3ccd	raid: use 64bit arithmetic Coverity - keep multiplication for size cals in 64bit (otherwise it's just 32b x 32b)	2017-03-16 01:02:10 +01:00
Zdenek Kabelac	e3a51537c5	coverity: make sure segtype pointer is valid	2017-03-16 01:02:10 +01:00
Zdenek Kabelac	2a139993b4	thin: remove unneeed test for NULL In this API NULL is not valid parameter so do not check for it.	2017-03-16 01:02:10 +01:00
Heinz Mauelshagen	5f2c942000	raid: check more cautious on region size changes Add additional checks to avoid calling _region_size_change_requested() with bogus actual arguments.	2017-03-13 17:46:56 +01:00
Heinz Mauelshagen	5d3e870946	raid: fix compile time warning	2017-03-10 20:38:16 +01:00
Zdenek Kabelac	d11b8eef89	cleanup: easier code	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	4d2b1a0660	cache: enable usage of --cachemetadataformat lvcreate and lvconvert may select cache metadata format when caching LV. By default lvm2 picks best available format.	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	64d3f05aa1	cache: validation for cache_metadata_format Only cache-pool segtype may store cache_metadata_format. Only supported values are 0,1,2 Format 2 requires LV status uses LV_METADATA_FORMAT. Format 0 (unselected) or 1 shall not set this 'incompatible' status.	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	518b814cdb	cache: LV supports cache segs with metadata format Cache pool read/writes metadata_format within its segment type.. For CachePoolLV unselected metadata format is NOT stored in metadata. For CacheLV when metadata format is not present/selected in lvm2 metadata, it's automatically assumed to be the version 1 (backward compatible). To ensure older lvm2 will not 'miss-read' metadata with new version 2, such LV is marked with METADATA_FORMAT status flag (segment is specifying metadata format). So when cache uses metadata format 2, it will become inaccesible on older system without such support. (kernel dm cache < 1.10, lvm2 < 2.02.169).	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	4a394f410d	cache: introduce allocation/cache_metadata_format Add new profilable configation setting to let user select which metadata format of a created cache pool he wish to use. By default the 'best' available format is autodetected at runtime, but user may enforce format 1 or 2 ATM. Code also detects availability for metadata2 supporting cache target. In case of troubles user may easily Disable usage of this feature by placing 'metadata2' into global/cache_disabled_features list.	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	a9b78d26b1	cleanup: minor cosmetics Update some return value to match return type. Drop unused function and declaration.	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	21c265adcf	cache: improve profile support for cache_set_policy	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	4d0793f0ec	pool: rework handling of passed args As now we can properly recognize all paramerters for pool creation, we may drop PASS_ARG_ defines and rely on '_UNSELECTED' or 0 entries as being those without user given args. When setting are not given on command line - 'update' function fill them from profiles or configuration. For this 'profile' arg was needed to be passed around and since 'VG' itself is not needed, it's been all replaced with 'cmd, profile, extents_size' args.	2017-03-10 19:33:01 +01:00
Zdenek Kabelac	7c52d550e9	thin: single formula for estimation Share the same formula for estimation chunk size or metadata size. Use uint32_t matching type.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	298d12c459	lvcreate: do not round cache volumes on cache chunks Since cache chunk might be huge and there is no technical need to enforce rounding and there is actually more 'real' VG space used then necessary - keep rounding on 'chunk' bounrary only for thin volumes - where it's the space used anyway. NB: we support conversion of any-size 'existing' LV into cached LV.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	f24a1f06b2	lvcreate: respecting profile settings	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	36003df7e3	cache: extend usability of cache_set_params Fix missing reset of '*settings' pointer when no args were given. Handle cache_chunk settings like all other settings, so it is properly updated only with non-zero settings and the existing cache-pool chunk_size is not being reconfigured.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	dcf038c7a6	cache: improve support for profile for cache settings User can specify metadata profile which stores important cache geometry data for easy configuration. Fix missing support for getting chunk_size, cache_mode, cache_policy for a cache/cache pools volumes from configuration or metadata profile.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	2d11fc695e	cache: set chunk_size as first param	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	4184331965	cache: use UNSELECTED enum Switch from _UNDEFINED to _UNSELECTED which is more describing its value 0, while value -1 is better match for UNDEFINED.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	b8cd0f4808	thin: add new ZERO/DISCARDS_UNSELECTED To more easily recognize unselected state from select '0' state add new 'THIN_ZERO_UNSELECTED' enum. Same applies to THIN_DISCARDS_UNSELECTED. For those we no longer need to use PASS_ARG_ZERO or PASS_ARG_DISCARDS.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	acfc82ae29	pool: split chunk size validation Move cache and thin bits into their respective manipulation files. When possible directly call respective chunk_size validator.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	375e4bb3da	thin: getting default chunk_size from single place Basically code moving operation to have a single place resolving thin_pool_chunk_size_policy. Supported are generic & performance profiles. Function is now shared between thin manipulation code and configuration _CFG logic to obtain defaults and handle correct reporting upward coding stack.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	50441f2433	cache: properly translate DM_THIN_DISCARDS DM status uses DM defines which need to be translated to LVM enum.	2017-03-10 19:33:00 +01:00
Zdenek Kabelac	7ad57d55af	lvconvert: indent and code simplification Simple modifications to existing _lvconvert_to_pool().	2017-03-10 19:33:00 +01:00
Heinz Mauelshagen	dd2881f277	raid: enhance lv_raid_convert() header relative to reshaping	2017-03-10 19:26:02 +01:00
Heinz Mauelshagen	bc3bec6c54	raid: fix compile time warning	2017-03-10 14:43:37 +01:00
Heinz Mauelshagen	f2d7a48418	lvconvert: add raid1 <-> raid4 conversion In addition to the already supported conversion between 2-legged raid1 and raid5, raid1 and raid4 can be also converted into each other with 2 legs (raid4/5 are limited to map a 2-legged raid1). This patch supports the missing raid4 conversion in the sequence linear -> 2-legged raid1 -> raid4/5, then restripe to more than one data stripes for performance and resilience reasons and optionally convert to striped/raid0. The other conversion sequence is also possible by converting N-way striped/raid0 to raid4/5, then restripe to 2 legs followed by a conversion to raid1 and optionally to linear (loosing all resilience).	2017-03-09 23:18:13 +01:00
Heinz Mauelshagen	66fff1d774	raid: add missing lv_merge_segments() call On conversion from striped to raid0, data LVs are created and all segments and their respective areas of the striped LV are moved across to new segments allocated for the raid0 image LVs. This can cause non-canonical segments to be added to the image LVs. Add a call to lv_merge_segments() once all segments have been added to an image LV to compensate for that. This avoids unsafe table loads on activation. Fix comments.	2017-03-09 22:18:34 +01:00
Heinz Mauelshagen	6dfe1ce251	lvconvert: prompt when splitting off LV of a 2-legged raid1 LV Splitting off an image LV of a 2-legged raid1 LV causes loss of resilience. Ask user to avoid uninformed loss of all resilience. Don't ask for N > 2 legged raid1 LVs. Adjust tests.	2017-03-09 13:59:47 +01:00
Heinz Mauelshagen	d250aa7208	lvconvert: prompt when splitting off a tracked LV of a 2-legged raid1 LV Splitting off an image LV of a 2-legged raid1 LV tracking changes causes loosing partial resilience for any newly written data set. Full resilience will be provided again after the split off image LV got merged back in and the new data set got fully synchronized. Reason being that the data is only stored on the remaining single writable image during the split. Ask user to avoid uninformed loss of such partial resilience. Don't ask for N > 2 legged raid1 LVs.	2017-03-09 03:22:55 +01:00
Heinz Mauelshagen	7fbe6ef16b	lvconvert: prompt when converting raid1 to linear Ask user when converting raid1 to linear to avoid uninformed loss of all resilience.	2017-03-09 02:39:49 +01:00
Heinz Mauelshagen	90ed3d5e8c	raid: fix function description	2017-03-09 02:16:03 +01:00
Heinz Mauelshagen	921b496fff	lvconvert: fix --repair after vgreduce In case N images fail (N <= parity chunks) _and_ a "vgreduce --removemissing --force VG" was applied a following repair of the RaidLV fails: Unable to remove N images: Only 0 devices given. Failed to remove the specified images from tb/r. Failed to replace faulty devices in tb/r. Fix as of this commit results in correct repair: Faulty devices in tb/r successfully replaced.	2017-03-09 02:11:52 +01:00
Heinz Mauelshagen	ed58672029	metadata: comments log_count,nosync,stripes,stripe_size,,... are also used for raid.	2017-03-08 15:13:59 +01:00
Heinz Mauelshagen	3a5561e5ab	raid: define seg->extents_copied seg->extents_copied has to be defined properly on reducing the size of a raid LV or conversion from raid5 with 1 stripe to raid1 will fail. Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-03-07 23:28:09 +01:00
Heinz Mauelshagen	18bbeec825	raid: fix raid LV resizing The lv_extend/_lv_reduce API doesn't cope with resizing RaidLVs with allocated reshape space and ongoing conversions. Prohibit resizing during conversions and remove the reshape space before processing resize. Add missing seg->data_copies initialisation. Fix typo/comment.	2017-03-07 22:05:23 +01:00
Heinz Mauelshagen	9ed11e9191	raid: cleanup _lv_set_image_lvs_start_les() Avoid second loop.	2017-03-07 21:55:19 +01:00
Heinz Mauelshagen	05aceaffbd	lvconvert: adjust --stripes on raid10 convert For the time being raid10 is limited to even number of total stripes as is and 2 data copies. The number of stripes provided on creation of a raid10(_near) LV with -i/--stripes gets doubled to define that even total number of stripes (i.e. images). Apply the same on disk adding conversions (reshapes) with "lvconvert --stripes RaidLV" (e.g. 2 stripes = 4 images total converted to 3 stripes = 6 images total). Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-03-07 21:36:03 +01:00
Heinz Mauelshagen	c5b6c9ad44	report: raid enhancements for --select Enhance the raid report functions for the recently added LV fields reshape_len, reshape_len_le, data_offset, new_data_offset, data_copies, data_stripes and parity_chunks to cope with "lvs --select". Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-03-03 22:29:50 +01:00
Heinz Mauelshagen	7a064303fe	lvconvert: add missing reshape_len initialization An initialization was missing when converting striped to raid0(_meta) causing unitialized reshape_len in the new component LVs first segment. Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-28 23:29:03 +01:00
Heinz Mauelshagen	964114950c	lvconvert: adjust mininum region size check The imposed minimum region size can cause rejection on disk removing reshapes. Lower it to avoid that. Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-28 23:10:37 +01:00
Heinz Mauelshagen	ce1e5b9991	lvconvert: adjust reshaping check to target version https://git.kernel.org/cgit/linux/kernel/git/device-mapper/linux-dm.git/commit/?h=dm-4.11&id=b08c6076782 sets the dm-raid target version to 1.10.1. Adjust the condition to set RAID_RESHAPE_FEATURE to it. Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-28 22:46:25 +01:00
Heinz Mauelshagen	189fa64793	lvconvert: impose region size constraints When requesting a regionsize change during conversions, check for constraints or the command may fail in the kernel n case the region size is too smalle or too large thus leaving any new SubLVs behind. Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 07:27:43 +01:00
Heinz Mauelshagen	3bdc4045c2	lvconvert: fix 2 issues identified in intesting Allow regionsize on upconvert from linear: fix related commit `2574d3257a` to actually work Related: rhbz1394427 Remove setting raid5_n on conversions from raid1 as of commit `932db3db53` because any raid5 mapping may be requested. Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:58:45 +01:00
Heinz Mauelshagen	2574d3257a	lvconvert: allow regionsize on upconvert from linear Allow to provide regionsize with "lvconvert -m1 -R N " on upconverts from linear and on N -> M raid1 leg conversions. Resolves: rhbz1394427	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	34caf83172	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces the changes to call the reshaping infratructure from lv_raid_convert(). Changes: - add reshaping calls from lv_raid_convert() - add command definitons for reshaping to tools/command-lines.in - fix raid_rimage_extents() - add 2 new test scripts lvconvert-raid-reshape-linear_to_striped.sh and lvconvert-raid-reshape-striped_to_linear.sh to test the linear <-> striped multi-step conversions - add lvconvert-raid-reshape.sh reshaping tests - enhance lvconvert-raid-takeover.sh with new raid10 tests Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	f79bd30a8b	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Change: - allow raid_rimage_extents() to calculate raid10 - remove an __unused__ attribute Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	1784cc990e	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Change: - add missing raid1 <-> raid5 conversions to support linear <-> raid5 <-> raid0(_meta)/striped conversions - rename related new takeover functions to _takeover_from_raid1_to_raid5 and _takeover_from_raid5_to_raid1, because a reshape to > 2 legs is only possible with raid5 layout Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	2d74de3f05	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Change: - enhance _clear_meta_lvs() to support raid0 allowing raid0_meta -> raid10 conversions to succeed by clearing the raid0 rmeta images or the kernel will fail because of discovering reordered raid devices Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	34a8d3c2fd	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - enhance _raid45610_to_raid0_or_striped_wrapper() to support raid5_n with 2 areas to raid1 conversion to allow for striped/raid0(_meta)/raid4/5/6 -> raid1/linear conversions; rename it to _takeover_downconvert_wrapper to discontinue the illegible function name - enhance _striped_or_raid0_to_raid45610_wrapper() to support raid1 with 2 areas to raid5* conversions to allow for linear/raid1 -> striped/raid0(_meta)/raid4/5/6 conversions; rename it to _takeover_upconvert_wrapper for the same reason Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	932db3db53	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - add missing possible reshape conversions and conversion options to allow/prohibit changing stripesize or number fo stripes - enhance setting convenient riad types in reshape conversions (e.g. raid1 with 2 legs -> radi5_n) Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	fe18e5e77a	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - add _raid_reshape() using the pre/post callbacks and the stripes add/remove reshape functions introduced before - and _reshape_requested function checking if a reshape was requested Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	929cf4b73c	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - add vg metadata update functions - add pre and post activation callback functions for proper sequencing of sub lv activations during reshaping - move and enhance _lv_update_reload_fns_reset_eliminate_lvs() to support pre and post activation callbacks - add _reset_flags_passed_to_kernel() which resets anyxi rebuild/reshape flags after they have been passed into the kernel and sets the SubLV remove after reshape flags on legs to be removed Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	4de0e692db	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - add function to support disk adding reshapes - add function to support disk removing reshapes - add function to support layout (e.g. raid5ls -> raid5_rs) or stripesize reshaping Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	7d39b4d5e7	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - add function providing state of a reshaped RaidLV - add function to adjust the size of a RaidLV was reshaped to add/remove stripes Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	92691e345d	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces more local infrastructure to raid_manip.c used by followup patches. Changes: - add lv_raid_data_copies returning raid type specific number; needed for raid10 with more than 2 data copies - remove _shift_and_rename_image_components() constraint to support more than 10 raid legs - add function to calculate total rimage length used by out-of-place reshape space allocation - add out-of-place reshape space alloc/relocate/free functions - move _data_rimages_count() used by reshape space alloc/realocate functions Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	c1865b0a86	raid: typo	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	b499d96215	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces local infrastructure to raid_manip.c used by followup patches. Add functions: - to check reshaping is supported in target attibute - to return device health string needed to check the raid device is ready to reshape Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	e2354ea344	lvconvert: add infrastructure for RaidLV reshaping support In order to support striped raid5/6/10 LV reshaping (change of LV type, stripesize or number of legs), this patch introduces infrastructure prerequisites to be used by raid_manip.c extensions in followup patches. This base is needed for allocation of out-of-place reshape space required by the MD raid personalities to avoid writing over data in-place when reading off the current RAID layout or number of legs and writing out the new layout or to a different number of legs (i.e. restripe) Changes: - add members reshape_len to 'struct lv_segment' to store out-of-place reshape length per component rimage - add member data_copies to struct lv_segment to support more than 2 raid10 data copies - make alloc_lv_segment() aware of both reshape_len and data_copies - adjust all alloc_lv_segment() callers to the new API - add functions to retrieve the current data offset (needed for out-of-place reshaping space allocation) and the devices count from the kernel - make libdm deptree code aware of reshape_len - add LV flags for disk add/remove reshaping - support import/export of the new 'struct lv_segment' members - enhance lv_extend/_lv_reduce to cope with reshape_len - add seg_is_/segtype_is_ macros related to reshaping - add target version check for reshaping - grow rebuilds/writemostly bitmaps to 246 bit to support kernel maximal - enhance libdm deptree code to support data_offset (out-of-place reshaping) and delta_disk (legs add/remove reshaping) target arguments Related: rhbz834579 Related: rhbz1191935 Related: rhbz1191978	2017-02-24 05:20:58 +01:00
Heinz Mauelshagen	8ab0725077	lvchange: reject writemostly/writebehind on raid1 during resync The MD kernel raid1 personality does no use any writemostly leg as the primary. In case a previous linear LV holding data gets upconverted to raid1 it becomes the primary leg of the new raid1 LV and a full resynchronization is started to update the new legs. No writemostly and/or writebehind setting may be allowed during this initial, full synchronization period of this new raid1 LV (using the lvchange(8) command), because that would change the primary (i.e the previous linear LV) thus causing data loss. lvchange has a bug not preventing this scenario. Fix rejects setting writemostly and/or writebehind on resychronizing raid1 LVs. Once we have status in the lvm2 metadata about the linear -> raid upconversion, we may relax this constraint for other types of resynchronization (e.g. for user requested "lvchange --resync "). New lvchange-raid1-writemostly.sh test is added to the test suite. Resolves: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=855895	2017-02-23 15:09:29 +01:00

... 7 8 9 10 11 ...

3177 Commits