The maximum stripe size is equal to the volume group PE size. If that
size falls below STRIPE_SIZE_MIN, the creation of RAID 4/5/6 volumes
becomes impossible. (The kernel will fail to load a RAID 4/5/6 mapping
table with a stripe size less than STRIPE_SIZE_MIN.) So we now report an
error if such a creation is attempted.
This is very rare in practice, because reducing the PE size that far
limits the size of the PV to below that of modern devices.
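A minimal sketch of the kind of check involved, with hypothetical names
(STRIPE_SIZE_MIN_SECTORS, validate_raid_stripe_size) standing in for the
real LVM code:

  /* Sketch only: reject RAID 4/5/6 stripe sizes below the kernel minimum.
   * The constant and argument names are illustrative. */
  #include <stdio.h>

  #define STRIPE_SIZE_MIN_SECTORS 8  /* assumed minimum, e.g. 4 KiB in 512-byte sectors */

  int validate_raid_stripe_size(unsigned stripe_size_sectors,
                                unsigned vg_extent_size_sectors)
  {
      /* The stripe size can never exceed the VG physical extent size. */
      if (stripe_size_sectors > vg_extent_size_sectors)
          stripe_size_sectors = vg_extent_size_sectors;

      if (stripe_size_sectors < STRIPE_SIZE_MIN_SECTORS) {
          fprintf(stderr, "Stripe size %u sectors is below minimum %u; "
                  "RAID 4/5/6 creation is not possible.\n",
                  stripe_size_sectors, STRIPE_SIZE_MIN_SECTORS);
          return 0;
      }
      return 1;
  }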
metadata/lv_manip.c:269: warning: declaration of "snapshot_count" shadows a global declaration
There's an existing function called "snapshot_count", so rename the
variable to "snap_count".
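The general shape of the fix, shown as a generic illustration rather than
the actual lv_manip.c code:

  /* Illustrative only - not the actual lv_manip.c code. */
  int snapshot_count(void);        /* pre-existing global declaration */

  int count_snapshots(void)
  {
      /* Before: "int snapshot_count = 0;" shadowed the global above and
       * triggered the warning. Renaming the local avoids the clash. */
      int snap_count = 0;
      /* ... counting logic ... */
      return snap_count;
  }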
The lv_layout and lv_type fields together help with LV identification.
We can do basic identification using the lv_attr field, which provides
a very condensed view. In contrast, the new lv_layout and lv_type
fields provide more detailed information on the exact layout and type
used for LVs.
For top-level LVs which are pure types not combined with any
other LV types, the lv_layout value is equal to the lv_type value.
For non-top-level LVs which may be combined with other types,
the lv_layout describes the underlying layout used, while the
lv_type describes the use/type/usage of the LV.
These two new fields are both string lists so selection (-S/--select)
criteria can be defined using the list operators easily:
[] for strict matching
{} for subset matching.
For example, let's consider this:
$ lvs -a -o name,vg_name,lv_attr,layout,type
  LV                    VG Attr       Layout       Type
  [lvol1_pmspare]       vg ewi------- linear       metadata,pool,spare
  pool                  vg twi-a-tz-- pool,thin    pool,thin
  [pool_tdata]          vg rwi-aor--- level10,raid data,pool,thin
  [pool_tdata_rimage_0] vg iwi-aor--- linear       image,raid
  [pool_tdata_rimage_1] vg iwi-aor--- linear       image,raid
  [pool_tdata_rimage_2] vg iwi-aor--- linear       image,raid
  [pool_tdata_rimage_3] vg iwi-aor--- linear       image,raid
  [pool_tdata_rmeta_0]  vg ewi-aor--- linear       metadata,raid
  [pool_tdata_rmeta_1]  vg ewi-aor--- linear       metadata,raid
  [pool_tdata_rmeta_2]  vg ewi-aor--- linear       metadata,raid
  [pool_tdata_rmeta_3]  vg ewi-aor--- linear       metadata,raid
  [pool_tmeta]          vg ewi-aor--- level1,raid  metadata,pool,thin
  [pool_tmeta_rimage_0] vg iwi-aor--- linear       image,raid
  [pool_tmeta_rimage_1] vg iwi-aor--- linear       image,raid
  [pool_tmeta_rmeta_0]  vg ewi-aor--- linear       metadata,raid
  [pool_tmeta_rmeta_1]  vg ewi-aor--- linear       metadata,raid
  thin_snap1            vg Vwi---tz-k thin         snapshot,thin
  thin_snap2            vg Vwi---tz-k thin         snapshot,thin
  thin_vol1             vg Vwi-a-tz-- thin         thin
  thin_vol2             vg Vwi-a-tz-- thin         multiple,origin,thin
This is a situation with a thin pool, thin volumes and thin snapshots.
We can see that the internal 'pool_tdata' volume that makes up the thin
pool actually has a level10 raid layout and the internal 'pool_tmeta' has
a level1 raid layout. Also, we can see that 'thin_snap1' and 'thin_snap2'
are both thin snapshots while 'thin_vol2' is a thin origin (having
multiple snapshots).
Such a reporting scheme provides a much better basis for selection
criteria, in addition to providing more detailed information. For example:
$ lvs -a -o name,vg_name,lv_attr,layout,type -S 'type=metadata'
  LV                   VG Attr       Layout      Type
  [lvol1_pmspare]      vg ewi------- linear      metadata,pool,spare
  [pool_tdata_rmeta_0] vg ewi-aor--- linear      metadata,raid
  [pool_tdata_rmeta_1] vg ewi-aor--- linear      metadata,raid
  [pool_tdata_rmeta_2] vg ewi-aor--- linear      metadata,raid
  [pool_tdata_rmeta_3] vg ewi-aor--- linear      metadata,raid
  [pool_tmeta]         vg ewi-aor--- level1,raid metadata,pool,thin
  [pool_tmeta_rmeta_0] vg ewi-aor--- linear      metadata,raid
  [pool_tmeta_rmeta_1] vg ewi-aor--- linear      metadata,raid
(selected all LVs which are related to metadata of any type)
lvs -a -o name,vg_name,lv_attr,layout,type -S 'type={metadata,thin}'
  LV           VG Attr       Layout      Type
  [pool_tmeta] vg ewi-aor--- level1,raid metadata,pool,thin
(selected all LVs which hold metadata related to thin)
lvs -a -o name,vg_name,lv_attr,layout,type -S 'type={thin,snapshot}'
  LV         VG Attr       Layout Type
  thin_snap1 vg Vwi---tz-k thin   snapshot,thin
  thin_snap2 vg Vwi---tz-k thin   snapshot,thin
(selected all LVs which are thin snapshots)
lvs -a -o name,vg_name,lv_attr,layout,type -S 'layout=raid'
  LV           VG Attr       Layout       Type
  [pool_tdata] vg rwi-aor--- level10,raid data,pool,thin
  [pool_tmeta] vg ewi-aor--- level1,raid  metadata,pool,thin
(selected all LVs with raid layout, any raid layout)
lvs -a -o name,vg_name,lv_attr,layout,type -S 'layout={raid,level1}'
  LV           VG Attr       Layout      Type
  [pool_tmeta] vg ewi-aor--- level1,raid metadata,pool,thin
(selected all LVs with raid level1 layout exactly)
And so on...
_pvcreate_check() has two missing requirements:
After refreshing filters there must be a rescan.
(Otherwise the persistent filter may remain empty.)
After wiping a signature, the filters must be refreshed.
(A device that was previously excluded by the filter due to
its signature might now need to be included.)
If several devices are added at once, the repeated scanning isn't
strictly needed, but we can address that later as part of the command
processing restructuring (by grouping the devices).
Replace the new pvcreate code added by commit
54685c20fc "filters: fix regression caused
by commit e80884cd080cad7e10be4588e3493b9000649426"
with this change to _pvcreate_check().
The filter refresh problem dates back to commit
acb4b5e4de "Fix pvcreate device check."
The message "Cannot deactivate remotely exclusive device locally." makes
sense only for a clustered LV. If the LV is non-clustered, it is
always exclusive by definition, and if it is already deactivated, this
message pops up inappropriately because both conditions are met.
So issue the message only if the conditions are met AND the VG is clustered.
Commit e80884cd08 tried to dump the filters
so they would be re-evaluated when creating a PV, to avoid overwriting
any existing signature that may have been created after the last
scan/filtering.
However, we need to call refresh_filters instead of
persistent_filter->dump, since dump requires a proper rescan to fill
up the persistent filter again. This is true only for pvcreate,
not for vgcreate with PV creation, where the scanning happens before
the PV creation and hence the next rescan (if not a full scan) does not
fill the persistent filter.
Also, move refresh_filters so that it's called sooner and only for
pvcreate; vgcreate already calls lvmcache_label_scan(cmd, 2), which
then calls refresh_filters itself, so there's no need to re-evaluate it again.
This caused the persistent filter (/etc/lvm/cache/.cache file) to be
wrong and contain only the PV just being processed with
vgcreate <vg_name> <pv_name_to_create>.
This regression caused other block devices to be filtered out when
vgcreate with PV creation was used and the persistent filter was then
used by any other LVM command afterwards.
Make lvresize -l+%FREE support approximate allocation.
Move the existing 'Reducing/Extending' message to verbose level
and change it to say 'up to' if approximate allocation is being used.
Replace it with a new message that gives the actual old and new size or
says 'unchanged'.
This is addendum to commit 2e82a070f3
which fixed these spurious messages that appeared after commit
651d5093ed ("avoid pv_read in
find_pv_by_name").
There was one more "not found" message issued when the device
could not be found in the device cache (commit 2e82a07 fixed this only
for the PV lookup itself). But if we "allow_unformatted" for
find_pv_by_name, we should not issue this message even when
the device can't be found in the dev cache: we just need to know
whether there's a PV or not so the code can decide on the next steps,
and we don't want to issue any messages if either the device itself
or the PV is not found.
For example, when we were creating a new PV (and so allow_unformatted = 1)
and the device had a signature on it which caused it to be filtered out
by a device filter (e.g. an MD signature if MD filtering is enabled),
or it was part of some other subsystem (e.g. multipath), this message
was issued on the find_pv_by_name call, which was misleading.
Also, remove the misleading "stack" call in case find_pv_by_name
returns NULL in pvcreate_check - any error state is reported
later by the pvcreate_check code, so there's no need to "stack" here.
There's one more, proper check that issues the "not found" message if
the device can't be found in the device cache within the pvcreate_check fn,
so this situation is still covered properly later in the code.
Before this patch (/dev/sda contains MD signature and is therefore filtered):
$ pvcreate /dev/sda
Physical volume /dev/sda not found
WARNING: linux_raid_member signature detected on /dev/sda at offset 4096. Wipe it? [y/n]:
With this patch applied:
$ pvcreate /dev/sda
WARNING: linux_raid_member signature detected on /dev/sda at offset 4096. Wipe it? [y/n]:
Non-existent devices are still caught properly:
$ pvcreate /dev/sdx
Device /dev/sdx not found (or ignored by filtering).
Fix get_pool_params to only read params.
Add poolmetadataspare option to get_pool_params.
Move all profile code into update_pool_params.
Move recalculate code into pool_manip.c
Cache pools are handled similarly to thin pools.
Add "(needs %s)" to the message - cache currently has
a somewhat odd need for a few extra kilobytes over
our default 4M extent size, so make that more obvious.
This is an addendum to commit 6dc7b783c8.
The LVM1 format stores the ALLOCATABLE flag directly in the PV header, not
in the VG metadata. So the code needs to be fixed further to work
properly for the lvm1 format so that the correct PV header is written
(the flag is set only if the PV is in some VG, unset otherwise).
Before the patch:
$ lvs -o name,active vg/lvol1 --driverloaded n
WARNING: Activation disabled. No device-mapper interaction will be attempted.
LV Active
lvol1 active
With this patch applied:
$ lvs -o name,active vg/lvol1 --driverloaded n
WARNING: Activation disabled. No device-mapper interaction will be attempted.
LV Active
lvol1 unknown
The same for active_{locally,remotely,exclusively} fields.
Also, rename headings for these fields (ActLocal/ActRemote/ActExcl).
The get_lv_type_name function helps with translating a volume type
to a human-readable form (it can be used in reports or
various messages if needed).
The lv_is_linear and lv_is_striped functions complete the set of
lv_is_* functions that identify exact volume types.
Mention the parent LV as well as the LV triggering the warning.
This still leaves some confusing cases, but it's not worth fixing them
at the moment.
(Thin pool inactive but a thin volume active => deactivate the thin volume.
Inactive mirror/raid with pvmove in progress => complete the pvmove and
activate & deactivate the mirror/raid.
If the new VG already exists, it requires some LVs to be inactive
unnecessarily.)
Support removal of thin volumes with --force --force
when the thin pool is damaged.
This way it's possible to remove a thin pool with
unrepairable metadata without having to
manually edit the lvm2 metadata.
lvremove -ff vg/pool
removes all thin volumes and the pool even when
the thin pool cannot be activated (to accept
removal of the thin volumes in the kernel metadata).
Since vg_name inside the /lib functions has already been mostly ignored
except for a few debug prints, make this an official internal API
feature.
vg_name is used only in /tools while the VG is not yet opened;
once the lvresize/lvcreate /lib function is called with a VG pointer
already in use, vg_name becomes irrelevant (it was never
validated anyway).
So any internal user of lvcreate_params and lvresize_params does not
need to set the vg_name pointer and may leave it NULL.
When creating a pool's metadata, create the initial LV for clearing with
some generic name, and after the volume is created & cleared, rename it
to the reserved name '_tmeta'/'_cmeta'.
We should not expose 'reserved' names for public LVs.
When repairing RAID LVs that have multiple PVs per image, allow
replacement images to be reallocated from the PVs that have not
failed in the image if there is sufficient space.
This allows for scenarios where a 2-way RAID1 is spread across 4 PVs,
where each image lives on two PVs but doesn't use the entire space
on any of them. If one PV fails and there is sufficient space on the
remaining PV in the image, the image can be reallocated on just the
remaining PV.
Previously, the seg_pvs used to track free and allocated space were left
in place after 'release_pv_segment' was called to free space from an LV.
Now, an attempt is made to combine any adjacent seg_pvs that also track
free space. Usually, this doesn't provide much benefit, but in a case
where one command might free some space and then do an allocation, it
can make a difference. One such case is during a repair of a RAID LV,
where one PV of a multi-PV image fails. This new behavior is used when
the replacement image can be allocated from the remaining space of the
PV that did not fail. (First the entire image with the failed PV is
removed. Then the image is reallocated from the remaining PVs.)
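A simplified sketch of the coalescing idea, using a hypothetical
array-based representation (pv_seg_info, merge_adjacent_free_segs) rather
than LVM's actual pv_segment lists:

  /* Sketch only: coalesce adjacent free PV segments after space is released. */
  #include <stddef.h>

  struct pv_seg_info {
      unsigned pe_start;   /* first physical extent of the segment */
      unsigned pe_count;   /* number of extents in the segment */
      int      is_free;    /* 1 if the segment tracks unallocated space */
  };

  /* Merge neighbouring free segments in place; returns the new count. */
  size_t merge_adjacent_free_segs(struct pv_seg_info *segs, size_t count)
  {
      size_t out = 0;

      for (size_t i = 0; i < count; i++) {
          if (out && segs[out - 1].is_free && segs[i].is_free &&
              segs[out - 1].pe_start + segs[out - 1].pe_count == segs[i].pe_start) {
              segs[out - 1].pe_count += segs[i].pe_count;  /* absorb neighbour */
              continue;
          }
          segs[out++] = segs[i];
      }

      return out;
  }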
I've changed build_parallel_areas_from_lv to take a new parameter
that allows the caller to build parallel areas by LV vs by segment.
Previously, the function created a list of parallel areas for each
segment in the given LV. When it came time for allocation, the
parallel areas were honored on a segment basis. This was problematic
for RAID because any new RAID image must avoid being placed on any
PVs used by other images in the RAID. For example, if we have a
linear LV that has half its space on one PV and half on another, we
do not want an up-convert to use either of those PVs. It should
especially not wind up with the following, where the first portion
of one LV is paired up with the second portion of the other:
------PV1------- ------PV2-------
[ 2of2 image_1 ] [ 1of2 image_1 ]
[ 1of2 image_0 ] [ 2of2 image_0 ]
---------------- ----------------
Previously, it was possible for this to happen. The change makes
it so that the returned parallel areas list contains one "super"
segment (seg_pvs) with a list of all the PVs from every actual
segment in the given LV and covering the entire logical extent range.
This change allows RAID conversions to function properly when there
are existing images that contain multiple segments that span more
than one PV.
...to avoid using a cached value (the persistent filter) and therefore
not noticing any change made after the last scan/filtering - the state
of the device may have changed, for example new signatures may have been added.
$ lvm dumpconfig --type diff
allocation {
use_blkid_wiping=0
}
devices {
obtain_device_list_from_udev=0
}
$ cat /etc/lvm/cache/.cache | grep sda
$ vgscan
Reading all physical volumes. This may take a while...
Found volume group "fedora" using metadata type lvm2
$ cat /etc/lvm/cache/.cache | grep sda
"/dev/sda",
$ parted /dev/sda mklabel gpt
Information: You may need to update /etc/fstab.
$ parted /dev/sda print
Model: QEMU QEMU HARDDISK (scsi)
Disk /dev/sda: 134MB
Sector size (logical/physical): 512B/512B
Partition Table: gpt
Disk Flags:
Number Start End Size File system Name Flags
$ cat /etc/lvm/cache/.cache | grep sda
"/dev/sda",
====
Before this patch:
$ pvcreate /dev/sda
Physical volume "/dev/sda" successfully created
With this patch applied:
$ pvcreate /dev/sda
Physical volume /dev/sda not found
Device /dev/sda not found (or ignored by filtering).
'lvs' would segfault when trying to display the "move pv" field if the
pvmove had been run with '--atomic'. The structure of an atomic pvmove
is different and requires us to descend another level in the
LV tree to retrieve the PV information.
In 'find_pvmove_lv', separate the code that searches the atomic
pvmove LVs from the code that searches the normal pvmove LVs. This
cleans up the segment iterator code a bit.
pvmove can be used to move single LVs by name or multiple LVs that
lie within the specified PV range (e.g. /dev/sdb1:0-1000). When
moving more than one LV, the portions of those LVs that are in the
range to be moved are added to a new temporary pvmove LV. The LVs
then point to the range in the pvmove LV, rather than the PV
range.
Example 1:
We have two LVs in this example. After they were
created, the first LV was grown, yielding two segments
in LV1. So, there are two LVs with a total of three
segments.
Before pvmove:
--------- --------- ---------
| LV1s0 | | LV2s0 | | LV1s1 |
--------- --------- ---------
| | |
-------------------------------------
PV | 000 - 255 | 256 - 511 | 512 - 767 |
-------------------------------------
After pvmove inserts the temporary pvmove LV:
--------- --------- ---------
| LV1s0 | | LV2s0 | | LV1s1 |
--------- --------- ---------
| | |
-------------------------------------
pvmove0 | seg 0 | seg 1 | seg 2 |
-------------------------------------
| | |
-------------------------------------
PV | 000 - 255 | 256 - 511 | 512 - 767 |
-------------------------------------
Each of the affected LV segments now points to a
range of blocks in the pvmove LV, which purposefully
corresponds to the segments moved from the original
LVs into the temporary pvmove LV.
The current implementation goes on from here to mirror the temporary
pvmove LV by segment. Further, as the pvmove LV is activated, only
one of its segments is actually mirrored (i.e. "moving") at a time.
The rest are either complete or not addressed yet. If the pvmove
is aborted, those segments that are completed will remain on the
destination and those that are not yet addressed or in the process
of moving will stay on the source PV. Thus, it is possible to have
a partially completed move - some LVs (or certain segments of LVs)
on the source PV and some on the destination.
Example 2:
What 'example 1' might look like if it were half-way
through the move.
--------- --------- ---------
| LV1s0 | | LV2s0 | | LV1s1 |
--------- --------- ---------
| | |
-------------------------------------
pvmove0 | seg 0 | seg 1 | seg 2 |
-------------------------------------
| | |
| -------------------------
source PV | | 256 - 511 | 512 - 767 |
| -------------------------
| ||
-------------------------
dest PV | 000 - 255 | 256 - 511 |
-------------------------
This update allows the user to specify that they would like the
pvmove mirror created "by LV" rather than "by segment". That is,
the pvmove LV becomes an image in an encapsulating mirror along
with the allocated copy image.
Example 3:
A pvmove that is performed "by LV" rather than "by segment".
--------- ---------
| LV1s0 | | LV2s0 |
--------- ---------
| |
-------------------------
pvmove0 | * LV-level mirror * |
-------------------------
/ \
pvmove_mimage0 / pvmove_mimage1
------------------------- -------------------------
| seg 0 | seg 1 | | seg 0 | seg 1 |
------------------------- -------------------------
| | | |
------------------------- -------------------------
| 000 - 255 | 256 - 511 | | 000 - 255 | 256 - 511 |
------------------------- -------------------------
source PV dest PV
The thing that differentiates a pvmove done in this way from a simple
"up-convert" from linear to mirror is the preservation of the
distinct segments. A normal up-convert would simply allocate the
necessary space with no regard for segment boundaries. The pvmove
operation must preserve the segments because they are the critical
boundary between the segments of the LVs being moved. So, when the
pvmove copy image is allocated, all corresponding segments must be
allocated. The code that merges adjoining segments that are part of
the same LV when the metadata is written must also be avoided in
this case. This method of mirroring is unique enough to warrant its
own definitional macro, MIRROR_BY_SEGMENTED_LV. This joins the two
existing macros: MIRROR_BY_SEG (for original pvmove) and MIRROR_BY_LV
(for user created mirrors).
The advantage of performing pvmove in this way is that all of the
affected LVs can be moved together. It is an all-or-nothing approach
that leaves all LV segments on the source PV if the move is aborted.
Additionally, a mirror log can be used (in the future) to provide tracking
of progress, allowing the copy to continue where it left off in the event
of a deactivation.
The list of strings is used quite frequently and we'd like to reuse
this simple structure for report selection support too. Make it part
of libdevmapper for general reuse throughout the code.
This also simplifies the LVM code a bit since we don't need to
include and manage lvm-types.h anymore (the string list was the
only structure defined there).
When creating a cache LV with a RAID origin, we need to ensure that
the sub-LVs of that origin properly change their names to include
the "_corig" extension of the top-level LV. We do this by first
performing a 'lv_rename_update' before making the call to
'insert_layer_for_lv'.
- When defining a configuration source, the code now uses separate
CONFIG_PROFILE_COMMAND and CONFIG_PROFILE_METADATA markers
(before, there was just CONFIG_PROFILE, which did not distinguish
between the two). This helps when checking whether the
configuration contains the correct set of options, all in either
the command-profilable or the metadata-profilable group, without
mixing these groups together - so it's a firm
distinction. A "command profile" can't contain
"metadata profile" settings and vice versa! This is strictly checked,
and if the settings are mixed, such a profile is rejected and
not used. So in the end, the CONFIG_PROFILE_COMMAND
set of options and the CONFIG_PROFILE_METADATA set are mutually
exclusive.
- Marking configuration with one or the other marker will also
determine the way these configuration sources are positioned
in the configuration cascade which is now:
CONFIG_STRING -> CONFIG_PROFILE_COMMAND -> CONFIG_PROFILE_METADATA -> CONFIG_FILE/CONFIG_MERGED_FILES
- Marking configuration with one or the other marker will also make
it possible to issue a command context refresh (which will probably
be part of a future patch) if needed for settings in the global profile
set. For settings in the metadata profile set this is impossible, since
we can't refresh the cmd context in the middle of reading VG/LV metadata,
nor for each VG/LV separately, because each VG/LV can have a different
metadata profile assigned and it's not possible to change these
settings at this level.
- When a command profile is incorrect, it's rejected *and* the
command exits immediately - the profile *must* be correct for the
command that was run with a profile to be executed. Before this
patch, when the profile was found to be incorrect, there was just a
warning message and the command continued without the profile applied.
But it's more correct to exit immediately in this case.
- When a metadata profile is incorrect, we reject it during command
runtime (as we know the profile name from the metadata, not early
from the command line as is the case for command profiles) and we
*do continue* with the command, as we're in the middle of an operation.
Also, the metadata profile is applied directly and on the fly on the
find_config_tree_* fn call, and even if the metadata profile is
found to be incorrect, we still need to return the non-profiled value
as found in the other configuration provided, or the default value.
To exit immediately even in this case, we'd need to refactor the
existing find_config_tree_* fns so they can return an error. Currently,
these fns return only config values (which end up as default
values if the config is not found).
- To check a profile's validity before use and be sure it's correct,
one can run:
lvm dumpconfig --commandprofile/--metadataprofile ProfileName --validate
(the --commandprofile/--metadataprofile for dumpconfig will come
as part of the subsequent patch)
- This patch also adds a reference to --commandprofile and
--metadataprofile in the cmd help string (which was missing before
for --profile in some commands). We do not mention --profile
anymore, as people should use --commandprofile or --metadataprofile
directly. However, --profile is still supported for backward
compatibility and is translated as:
--profile == --metadataprofile for lvcreate, vgcreate, lvchange and vgchange
(as these commands are able to attach profile to metadata)
--profile == --commandprofile for all the other commands
(--metadataprofile is not allowed there as it makes no sense)
- This patch also contains some cleanups to make the code handling
the profiles more readable...
When querying for dmeventd monitoring status, first check whether
lvm2 is configured to monitor at all, to avoid an unwanted start
of the dmeventd process just to answer the monitoring status.
Given a named mirror LV, vgsplit will look for the PVs that compose it
and move them to a new VG. It does this by first looking at the log
and then the legs. If the log is on the same device as one of the mirror
images, a problem occurs. This is because the PV is moved to the new VG
as the log is processed and thus cannot be found in the current VG when
the image is processed. The solution is to check and see if the PV we are
looking for has already been moved to the new VG. If so, it is not an
error.
Perform two allocation attempts with cling if maximise_cling is set,
first with then without positional fill.
Avoid segfaults from confusion between positional and sorted sequential
allocation when number of stripes varies as reported here:
https://www.redhat.com/archives/linux-lvm/2014-March/msg00001.html
Set A_POSITIONAL_FILL if the array of areas is being filled
positionally (with a slot corresponding to each 'leg') rather
than sequentially (with all suitable areas found, to be sorted
and selected from).
When pvmove0 is finished, it is temporarily replaced
with an error segment; however, in this case pvmove0 remains
unremovable if pvmove --abort is interrupted at this
moment - since it's not a pvmove anymore, normal
lvremove can't be used to remove the LOCKED LV.
When down-converting a RAID1 LV, if the user specifies too few devices,
they will get a confusing message.
Ex:
[root]# lvcreate -m 2 --type raid1 -n raid -L 500M taft
Logical volume "raid" created
[root]# lvconvert -m 0 taft/raid /dev/sdd1
Unable to extract enough images to satisfy request
Failed to extract images from taft/raid
This patch makes the error message a bit clearer by telling the user
the count they are trying to remove and the number of devices they
supplied.
[root@bp-01 lvm2]# lvcreate --type raid1 -m 3 -L 200M -n lv vg
Logical volume "lv" created
[root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sdb1
Unable to remove 3 images: Only 1 device given.
Failed to extract images from vg/lv
[root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sd[bc]1
Unable to remove 3 images: Only 2 devices given.
Failed to extract images from vg/lv
[root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sd[bcd]1
[root@bp-01 lvm2]# lvs -a -o name,attr,devices vg
LV Attr Devices
lv -wi-a----- /dev/sde1(1)
This patch doesn't work in all cases. The user can specify the right
number of devices, but not a sufficient number of them from the LV.
This will produce the old error message:
[root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sd[bcf]1
Unable to extract enough images to satisfy request
Failed to extract images from vg/lv
However, I think this error message is sufficient for this case.
Since the usability problems were fixed, we can use this function.
Clean up orphan LVs with TEMPORARY flags.
(This reduces a couple of blkid error reports, but a couple of them
are still left...)
Since the cache segment is a purely virtual mapping, it has nothing to
discard. What is discardable here is the cache origin, which is now
properly removed in the 'delete' phase.
A plain lv_empty() call needs to only detach the cache origin and leave
the origin unchanged.
Drop the unused cmd pointer passed to the function.
TODO:
We have two similar functions (though not identical)
lv_manip.c: for_each_sub_lv()
metadata.c: _lv_each_dependency()
They don't seem to always match - we should probably convert
the code to use only a single function.
Use the proper vgmem memory pool for allocation of the LV name in the VG,
and check whether the new name for the renamed LV is valid.
TODO: validation should really also use the VG name, otherwise we cannot
tell whether "vgname-lvname" will be valid.
Create a separate function to validate the snapshot minimum chunk value
and relocate the code into the snapshot_manip file.
This function will then be shared with lvconvert.
When we created a thin pool, we used a trick to keep
the volume active; but since we now support the TEMPORARY flag,
we can just locally activate & deactivate the metadata LV
and let the thin pool go through the normal activation process.
When pool_has_message() is queried with a NULL lv and a device_id of 0,
it should just return 'true' when there is any message queued.
So it needs to return the negation of dm_list_empty().
Since there is currently no user of this code path,
the bug has not been triggered.
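A minimal sketch of the corrected return value; the list type and helper
below stand in for dm_list_empty() and are not LVM's API:

  #include <stdbool.h>
  #include <stddef.h>

  /* Illustrative stand-in for a dm_list of queued pool messages. */
  struct msg_list { size_t n_messages; };

  static bool list_empty(const struct msg_list *l) { return l->n_messages == 0; }

  /* With no lv and device_id 0, "has a message" must be the *negation*
   * of "the list is empty" - the missing negation was the bug. */
  bool pool_has_any_message(const struct msg_list *queued)
  {
      return !list_empty(queued);
  }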
The same as for allocation/thin_pool_chunk_size - the default value
used is just a starting point. The calculation continues using the
properties of the devices actually used.
The allocation/thin_pool_chunk_size is a bit more complex. Its default
value is evaluated at runtime based on the selected thin_pool_chunk_size_policy.
But that value is just a starting point; the calculation then continues
based on the properties of the devices used. This means that for
such a default value, we know only the starting value.
Move the flags for segments to the segtype header, where they seem more
closely related, as the features are related to the segtype and not to
activation.
Use unsigned #defines, since that's more common in the lvm2 source code
for bit flags.
The condition was swapped - however, since it was based on 'random'
memory content, it went unnoticed because the attribute had not been set.
So now we have quite a few possible results when testing:
we have the old status without separate metadata, and
we have kernels with the snapshot leak bug fixed.
(in-release update)
The code uses the target driver version for a better estimation of
the maximum size of the COW device for a snapshot.
The bug can be tested with this script:
VG=vg1
lvremove -f $VG/origin
set -e
lvcreate -L 2143289344b -n origin $VG
lvcreate -n snap -c 8k -L 2304M -s $VG/origin
dd if=/dev/zero of=/dev/$VG/snap bs=1M count=2044 oflag=direct
The bug happens when these two conditions are met:
* origin size is divisible by (chunk_size/16) - so that the last
metadata area is filled completely
* the miscalculated snapshot metadata size is divisible by extent size -
so that there is no padding to extent boundary which would otherwise
save us
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
When the stripe size is twice the physical extent size,
the original code would not reduce the stripe size to the maximum
(the physical extent size).
Signed-off-by: Zhiqing Zhang <zhangzq.fnst@cn.fujitsu.com>
Start to convert percentage size handling in lvresize to the new
standard. Note in the man pages that this code is incomplete.
Fix a regression in non-percentage allocation in my last check in.
This is what I am aiming for:
-l<extents>
-l<percent> LV/ORIGIN
sets or changes the LV size based on the specified quantity
of logical extents (that might be backed by
a higher number of physical extents)
-l<percent> PVS/VG/FREE
sets or changes the LV size so as to allocate or free the
desired quantity of physical extents (that might amount to a
lower number of logical extents for the LV concerned)
-l+50%FREE - Use up half the remaining free space in the VG when
carrying out this operation.
-l50%VG - After this operation, this LV should be using up half the
space in the VG.
-l200%LV - Double the logical size of this LV.
-l+100%LV - Double the logical size of this LV.
-l-50%LV - Reduce the logical size of this LV by half.
Parsing the vg structure during suspend/commit/resume may require a lot of
memory - so move this into vg_write.
FIXME: there are now multiple cache layers which are doing some things
multiple times at different levels. Moreover, there are now different
caching paths with and without lvmetad - this should be unified
and both paths should use the same mechanism.
Several fixes for the recent changes that treat allocation percentages
as upper limits.
Improve messages to make it easier to see what is happening.
Fix some cases that failed with errors when they didn't need to.
Fix crashes when first_seg() returns NULL.
Remove a couple of log_errors that were actually debugging messages.
Remove the 'skip' argument passed into the function.
We always used '0', as -K is the only supported
option and there is no complementary option.
Also add some testing for the behaviour of skipping.
When an origin exists and the 'lvcreate' command is used to create
a cache pool + cache LV, the table is loaded into the kernel but
never instantiated (suspend/resume was never called). A user running
LVM commands would never know that the kernel did not have the
proper state unless they also ran the dmsetup 'table/status' command.
The solution is to suspend/resume the cache LV to make the loaded
tables become active.
Introduce a new parameter called "approx_alloc" that is set when the
desired size of a new LV is specified in percentage terms. If set,
the allocation code tries to get as much space as it can, but does not
fail if it can at least get some.
One of the practical implications is that users can now specify 100%FREE
when creating RAID LVs, like this:
~> lvcreate --type raid5 -i 2 -l 100%FREE -n lv vg
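A minimal sketch of the intended clamping behaviour, with hypothetical
names (resolve_extent_request); the real logic lives in the allocation code:

  /* Sketch only: with approx_alloc set, a percentage-sized request is
   * clamped to the space actually available instead of failing outright. */
  #include <stdio.h>

  unsigned resolve_extent_request(unsigned requested_extents,
                                  unsigned free_extents,
                                  int approx_alloc)
  {
      if (requested_extents > free_extents) {
          if (!approx_alloc)
              return 0;  /* hard failure: not enough space */
          printf("Reducing request from %u to %u available extents.\n",
                 requested_extents, free_extents);
          requested_extents = free_extents;
      }
      return requested_extents;
  }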
Users now have the ability to convert their existing logical volumes
into cached logical volumes. A cache pool LV must be specified using
the '--cachepool' argument. The cachepool is the small, fast LV used
to cache the large, slow LV that is being converted.
lv_active_change will enforce proper activation.
The modification of activation was wrong and led to misuse of
autoactivation. The fix allows proper local exclusive activation to be
used, while the removed code turned this into just exclusive
activation (losing the required local property).
libblkid can detect the DM_snapshot_cow signature when creating
new LVs with blkid wiping used (the allocation/use_blkid_wiping=1 lvm.conf
setting and --wipe y used at the same time - which is the default).
Do not issue any prompts about this signature when a new LV is created;
just wipe it right away without asking questions. Still keep the
log message in verbose mode though.
gcc reports:
metadata/merge.c:229:58: warning: suggest parentheses around '&&' within '||' [-Wparentheses]
metadata/merge.c:232:58: warning: suggest parentheses around '&&' within '||' [-Wparentheses]
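The generic shape of the fix (illustrative; not the actual merge.c
expressions):

  /* gcc's -Wparentheses asks for explicit grouping when '&&' and '||' are
   * mixed, because '&&' binds tighter than '||'. */
  int check(int a, int b, int c)
  {
      /* before: return a && b || c;   -- warning: suggest parentheses */
      return (a && b) || c;            /* same logic, explicit intent  */
  }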
This patch allows users to create cache LVs with 'lvcreate'. An origin
or a cache pool LV must be created first. Then, while supplying the
origin or cache pool to the lvcreate command, the cache can be created.
Ex1:
Here the cache pool is created first, followed by the origin which will
be cached.
~> lvcreate --type cache_pool -L 500M -n cachepool vg /dev/small_n_fast
~> lvcreate --type cache -L 1G -n lv vg/cachepool /dev/large_n_slow
Ex2:
Here the origin is created first, followed by the cache pool - allowing
a cache LV to be created covering the origin.
~> lvcreate -L 1G -n lv vg /dev/large_n_slow
~> lvcreate --type cache -L 500M -n cachepool vg/lv /dev/small_n_fast
The code determines which type of LV was supplied (cache pool or origin)
by checking its type. It sanity-checks that the right argument was given
by verifying that the origin is larger than the cache pool.
If the user wants to remove just the cache for an LV, they specify
the LV's associated cache pool when removing:
~> lvremove vg/cachepool
If the user wishes to remove the origin, but leave the cachepool to be
used for another LV, they specify the cache LV.
~> lvremove vg/lv
In order to remove it all, specify both LVs.
This patch also includes tests to create and remove cache pools and
cache LVs.
This patch allows the creation and removal of cache pools. Users are not
yet able to create cache LVs. They are only able to define the space used
for the cache and its characteristics (chunk_size and cache mode ATM) by
creating the cache pool.
A cache LV - from LVM's perspective - is a user-accessible device that
links the cachepool LV and the origin LV. The following functions
were added to facilitate the creation and removal of this top-level
LV:
1) 'lv_cache_create' - takes a cachepool and an origin device and links
them into a new top-level LV of 'cache' segment type. No allocation
is necessary in this function, as the sub-LVs contain all of the
necessary allocated space. Only the top-level layer needs to be
created.
2) 'lv_cache_remove' - this function removes the top-level LV of a
cache LV - promoting the cachepool and origin sub-LVs to top-level
devices and leaving them exposed to the user. That is, the
cachepool is unlinked and free to be used with another origin to
form a new cache LV; and the origin is no longer cached.
(Currently, if the cache needs to be flushed, it is done in this
function and the function waits for it to complete before proceeding.
This will be taken out in a future patch in favor of polling.)
Cache pools require a data area and a metadata area (like thin pools).
Unlike with thin pools, if 'cache_pool_metadata_require_separate_pvs' is
not set to '1', the metadata and data areas will be allocated from the
same device. This is also done in a manner similar to RAID, where a single
chunk of space is allocated and then split to form the metadata and data
devices - ensuring that they are together.
Building on the new DM function that parses DM cache status, we
introduce the following LVM-level functions to acquire information
about cache devices:
- lv_cache_block_info: retrieves information on the cache's block/chunk usage
- lv_cache_policy_info: retrieves information on the cache's policy
Avoid using an external origin whose size is unaligned with or
incompatible with the thin pool chunk size, since the last chunk is not
correctly provisioned when it is overwritten.
Since we are currently incapable of providing zeroes for a
re-extended thin volume area, disable extension of
such already-reduced thin volumes.
(in-release change)
This patch adds the new cachepool segment type - the first of two
necessary to eventually create 'cache' logical volumes. In addition
to the new segment type, it updates the makefiles, configure files, the
lv_segment struct, and some necessary libdevmapper flags.
The cachepool is the LV and corresponding segment type that will hold
all information pertinent to the cache itself - its size, cachemode,
cache policy, core arguments (like migration_threshold), etc.
When a thin volume is using an external origin, the current thin target
is not able to supply the 'extended' size with empty pages.
lvm2 detects the target version and disables extension of the LV past the
external origin size in this case.
The thin LV can, however, still be reduced and extended freely below
this size.
In preparation for other segment types that create and use "pools", we
s/create_thin_pool/create_pool/. This way it is not awkward when creating
a cachepool, for example, to use "create_thin_pool".
Functions that handle set-up, tear-down and creation of thin pool
volumes will be more generally applicable when more targets exist
that make use of device-mapper's persistent data format. One of
these targets is the dm-cache target. I've selected some functions
that will be useful for the cache segment type to be moved, since
they will no longer be thin pool specific but are more broadly
useful to any segment type that makes use of a 'pool' LV.
Only flag a thin LV for no scanning in udev if the LV is about
to be wiped. This happens only when the thin LV's pool was not
created with zeroing of new blocks enabled.
Several fields used to display 0 if undefined. Recent changes
to the way the fields are reported threw away some tests for
valid pointers, leading to segfaults with 'pvs -o all'.
Reinstate the original behaviour.
Introduce FMT_OBSOLETE to identify pool metadata and use it and FMT_MDAS
instead of hard-coded format names.
Explain device accesses in the pvscan --cache man page.
When an LV is scanned for its dependencies, also scan the origin's
snapshots and thin external origins.
So if any PV from a snapshot or external origin device is missing, lvm2
will avoid trying to activate such a device.
Replacement of pv_read by find_pv_by_name in commit
651d5093ed caused spurious
error messages when running pvcreate or vgextend against an
unformatted device.
Physical volume /dev/loop4 not found
Physical volume "/dev/loop4" successfully created
Physical volume /dev/loop4 not found
Physical volume /dev/loop4 not found
Physical volume "/dev/loop4" successfully created
Volume group "vg1" successfully extended
If we're calling pvcreate on a device that already has a PV label,
blkid detects the existing PV; we then consider it for wiping
before we continue creating the new PV label, and we issue a warning
with a prompt asking whether the old PV label should be removed. We don't
do this with the native signature detection code, so make it consistent
with the old behaviour.
But still keep this "PV" detection (identified as "LVM1_member" or
"LVM2_member" by blkid) when creating new LVs, to avoid an unexpected PV
label appearing inside an LV.
Optimize and clean up the recently introduced wipe_lv function.
Use compound literals to get a nicely initialized wipe_params struct.
Pass in lv as an explicit argument for wipe_lv.
Use cmd from the lv structure.
Initialize only non-null members so it's easy to see which argument
is special.
Drop the find_merging_snapshot() function. Use find_snapshot()
called after a check for lv_is_merging_origin(), which
is the commonly used code path - so we avoid duplicated
tests and the potential risk of dereferencing a NULL pointer
in an unhandled error path.
This is in fact the wipefs functionality
(wipefs uses the same libblkid calls).
libblkid is richer when it comes to detecting various
signatures, including filesystems, and users can better
decide what to erase and what should be kept.
The code is shared by both pvcreate (where wiping is necessary
to complete the pvcreate operation) and lvcreate, where it's up
to the user to decide.
The verbose output contains a bit more information about the
signature, such as LABEL and UUID.
For example:
raw/~ # lvcreate -L16m vg
WARNING: linux_raid_member signature detected on /dev/vg/lvol0 at offset 4096. Wipe it? [y/n]
or more verbose one:
raw/~ # lvcreate -L16m vg -v
...
Found existing signature on /dev/vg/lvol0 at offset 4096: LABEL="raw.virt:0" UUID="da6af139-8403-5d06-b8c4-13f6f24b73b1" TYPE="linux_raid_member" USAGE="raid"
WARNING: linux_raid_member signature detected on /dev/vg/lvol0 at offset 4096. Wipe it? [y/n]
The verbose output is the same output as found in blkid.
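For reference, a minimal standalone libblkid probe that prints the same
TYPE/LABEL/UUID fields; this is only a sketch of the library usage, not
LVM's wiping code:

  /* Build with: cc probe.c -lblkid */
  #include <stdio.h>
  #include <blkid/blkid.h>

  int main(int argc, char **argv)
  {
      if (argc != 2) {
          fprintf(stderr, "usage: %s /dev/XXX\n", argv[0]);
          return 1;
      }

      blkid_probe pr = blkid_new_probe_from_filename(argv[1]);
      if (!pr) {
          fprintf(stderr, "cannot open %s\n", argv[1]);
          return 1;
      }

      blkid_probe_enable_superblocks(pr, 1);
      blkid_probe_set_superblocks_flags(pr, BLKID_SUBLKS_LABEL | BLKID_SUBLKS_UUID |
                                            BLKID_SUBLKS_TYPE | BLKID_SUBLKS_USAGE);

      if (blkid_do_safeprobe(pr) == 0) {
          const char *type = NULL, *label = NULL, *uuid = NULL;
          blkid_probe_lookup_value(pr, "TYPE", &type, NULL);
          blkid_probe_lookup_value(pr, "LABEL", &label, NULL);
          blkid_probe_lookup_value(pr, "UUID", &uuid, NULL);
          printf("TYPE=%s LABEL=%s UUID=%s\n",
                 type ? type : "-", label ? label : "-", uuid ? uuid : "-");
      } else {
          printf("no known signature detected\n");
      }

      blkid_free_probe(pr);
      return 0;
  }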
Use the common wipe_lv (formerly set_lv) fn to do zeroing as well as
signature wiping if needed. Provide a new struct wipe_lv_params to define
the functionality.
Bind "lvcreate -W/--wipesignatures y" to the proper wipe_lv call.
Also, add "yes" and "force" to lvcreate_params so it's possible
to apply them to the prompt: "WARNING: %s detected on %s. Wipe it? [y/n]".
The wipe_known_signatures fn now wraps the _wipe_signature fn, which is
called for each known signature (currently md, swap and luks). This makes
the code more readable and avoids repeating the same sequence wherever it
is used in the code. We're going to reuse this code later...
Revert 4777eb6872, which put
the target_present check into init_snapshot_merge(). However,
this function is also used when parsing metadata, so the
presence test would be performed even when the target is not really
needed. Move the target_present test directly into lvconvert instead.
Add a PV create which takes a parameters object that
has get/set methods to configure PV creation.
Current get/set operations include:
- size
- pvmetadatacopies
- pvmetadatasize
- data_alignment
- data_alignment_offset
- zero
Reference: https://bugzilla.redhat.com/show_bug.cgi?id=880395
Signed-off-by: Tony Asleson <tasleson@redhat.com>
Replace the code with the refactored vgreduce_single instead
of calling its own implementation.
Corrects bug: https://bugzilla.redhat.com/show_bug.cgi?id=989174
Signed-off-by: Tony Asleson <tasleson@redhat.com>
Move the core functionality of vgreduce single into
lib/metadata/vg.c so that the command line and the lvm2app library
can call the same core functionality. The new function is
vgreduce_single.
Signed-off-by: Tony Asleson <tasleson@redhat.com>
This patch mostly fixes cluster behavior, but it also updates
the non-cluster behavior where calls like 'lvchange -aln'
led to incorrect errors for some segment types.
Fix the implicit activation rules where some segment types could
be activated only in exclusive mode in a cluster.
The lvm2 command was not preserving the 'local' property and incorrectly
converted local activations into plain exclusive ones, so a local
activation could have activated volumes exclusively, but remotely.
If the volume_list filters a volume out of activation,
that is still a success result for this function.
Change the error message back to verbose level.
Detect whether the volume is active locally before zeroing,
so we report the error a bit later for cases where the volume
could not be activated because it doesn't pass the volume
list (the user can still create the volume if zeroing
is disabled).
Add an LV_TEMPORARY flag for LVs with limited existence during command
execution. Such LVs are temporary in the sense that they need to be
activated, have some action done on them, and then be removed immediately.
Such LVs are just like any normal LV - the only difference is that they
are removed during LVM command execution. This is also the case for LVs
representing future pool metadata spare LVs, which we need to initialize
via a usual LV before they are declared as pool metadata spares.
We can optimize some other parts, like udev, to do a better job if
it knows that the LV is temporary and any processing on it is just
useless.
This flag is orthogonal to the recently introduced LV_NOSCAN flag,
as LV_NOSCAN is primarily used to mark an LV so that scanning
is avoided before the zeroing of the device happens. The LV_TEMPORARY
flag distinguishes between a full-fledged LV visible in the system
and an LV just used as a temporary overlay for some action that needs to
be done on the underlying PVs.
For example: lvcreate --thinpool POOL --zero n -L 1G vg
- first, the usual LV is created to do a clean up for pool metadata
spare. The LV is activated, zeroed, deactivated.
- between the "activated" and "zeroed" stages, the LV_NOSCAN flag is used
to avoid any scanning in udev
- between the "zeroed" and "deactivated" stages, we need to avoid the WATCH
udev rule, but since the LV is just a usual LV, we can't tell the
difference. The LV_TEMPORARY internal LV flag helps here. If we
create the LV with this flag, the DM_UDEV_DISABLE_DISK_RULES
and DM_UDEV_DISABLE_OTHER_RULES flags are set (just as they are
for "invisible" and non-top-level LVs) - udev is directed to
skip WATCH rule use.
- if the LV_TEMPORARY flag were not used, there would normally be
a WATCH event generated once the LV is closed after the "zeroed"
stage. This would cause problems with the immediate deactivation that
follows.
A split image should always have the out-of-sync attr ('I'). Even if
the RAID LV has not been written to since the LV was split off, it is
still not part of the group that makes up the RAID and is therefore
"out-of-sync".
Since the virtual snapshot has no reason to stay alive once we
detach the related snapshot, deactivate the whole thing before
snapshot removal - otherwise the code would get tricky to
support in a cluster.
The correct full solution would require transactions
for libdm operations.
Also enable the check for the snapshot being open prior to
the origin deactivation, otherwise we could easily end up
with the origin deactivated but the snapshot still kept
active, desynchronizing the locking state in the cluster.
A common scenario is during new LV creation, when we need to wipe the
newly created LV and avoid any udev scanning before this stage; otherwise
the device (the LV) could be claimed by some other subsystem
for which there was stale metadata within the LV data.
This patch adds the possibility to mark the LV we're just about to wipe
with a flag that gets passed to udev via DM_COOKIE as a subsystem-specific
flag - DM_SUBSYSTEM_UDEV_FLAG0 (in this case the subsystem is "LVM") -
so the LVM udev rules will take care of handling that.
Accept --ignoreskippedcluster with pvs, vgs, lvs, pvdisplay, vgdisplay,
lvdisplay, vgchange and lvchange to avoid the 'Skipping clustered
VG' errors when requesting information about a clustered VG
without using clustered locking and still exit with success.
The messages can still be seen with -v.
lib/metadata/lv_manip.c:_sufficient_pes_free() was calculating the
required space for RAID allocations incorrectly due to double
accounting. This resulted in failure to allocate when available
space was tight.
When RAID data and metadata areas are allocated together, the total
amount is stored in ah->new_extents and ah->alloc_and_split_meta is
set. '_sufficient_pes_free' was adding the necessary metadata extents
to ah->new_extents without ever checking ah->alloc_and_split_meta.
This often led to double accounting of the metadata extents. This
patch checks 'ah->alloc_and_split_meta' to perform proper calculations
for RAID.
This error is only present in the function that checks for the needed
space, not in the functions that do the actual allocation.
If the "default" thin pool chunk size calculation method is selected,
use minimum_io_size; otherwise, for the "performance" method, use the
optimal_io_size device hint exposed in sysfs. If there appear to be PVs
with different hints presented, use their least common multiple.
If the hint is less than the default value defined for the
calculation method, use the default value instead.
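A sketch of the hint-combination rule described above; the function names
and sector-based units are assumptions, not LVM's internal API:

  #include <stdint.h>

  static uint64_t gcd64(uint64_t a, uint64_t b)
  {
      while (b) { uint64_t t = a % b; a = b; b = t; }
      return a;
  }

  static uint64_t lcm64(uint64_t a, uint64_t b)
  {
      return (a && b) ? (a / gcd64(a, b)) * b : 0;
  }

  /* hints[] holds minimum_io_size or optimal_io_size per PV, depending on
   * whether the "default" or "performance" policy is selected. */
  uint64_t thin_chunk_size_hint(const uint64_t *hints, unsigned n_pvs,
                                uint64_t policy_default)
  {
      uint64_t combined = 0;

      for (unsigned i = 0; i < n_pvs; i++)
          if (hints[i])
              combined = combined ? lcm64(combined, hints[i]) : hints[i];

      /* Fall back to the policy default when the hint is absent or too small. */
      return (combined < policy_default) ? policy_default : combined;
  }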
A previous commit (b6bfddcd0a), which
was designed to prevent segfaults during lvextend when trying to
extend striped logical volumes, forgot to include calculations for
RAID 4/5/6 parity devices. This was causing the 'contiguous' and
'cling_by_tags' allocation policies to fail for RAID 4/5/6.
The solution is to remember that while we can compare
ah->area_count == prev_lvseg->area_count
for non-RAID, we should compare
(ah->area_count + ah->parity_count) == prev_lvseg->area_count
for a general solution.
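In code form, the generalized check looks something like this sketch, with
the structs reduced to just the fields the comparison needs:

  struct alloc_info { unsigned area_count, parity_count; };
  struct seg_info   { unsigned area_count; };

  int area_counts_match(const struct alloc_info *ah, const struct seg_info *prev_lvseg)
  {
      /* Non-RAID: parity_count is 0, so this reduces to the old comparison.
       * RAID 4/5/6: the parity devices must be counted on the allocation side. */
      return (ah->area_count + ah->parity_count) == prev_lvseg->area_count;
  }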
Older gcc gives a misleading warning:
metadata/lv_manip.c:4018: warning: ‘seg’ may be used uninitialized in
this function
But warning-free compilation is better.
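The usual shape of such a fix, shown as a generic illustration rather than
the actual lv_manip.c code:

  struct lv_segment;   /* opaque for this illustration */

  void pick_segment(int use_first, struct lv_segment *first,
                    struct lv_segment *second, struct lv_segment **out)
  {
      struct lv_segment *seg = NULL;   /* was: "struct lv_segment *seg;" */

      if (use_first)
          seg = first;
      else
          seg = second;

      *out = seg;   /* initializing seg at declaration quiets the older gcc warning */
  }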
Creation, deletion, [de]activation, repair, conversion, scrubbing
and changing operations are all now available for RAID LVs in a
cluster - provided that they are activated exclusively.
The code has been changed to ensure that no LV or sub-LV activation
is attempted cluster-wide. This includes the often overlooked
operations of activating metadata areas for the brief time it takes
to clear them. Additionally, some 'resume_lv' operations were
replaced with 'activate_lv_excl_local' when sub-LVs were promoted
to top-level LVs for removal, clearing or extraction. This was
necessary because it forces the appropriate renaming actions that
occur via resume in the single-machine case, but that won't happen in
a cluster due to the necessity of acquiring a lock first.
The *raid* tests have been updated to allow testing in a cluster.
For the most part, this meant creating devices with '-aey' if they
were to be converted to RAID. (RAID requires the converting LV to
be EX because it is a condition of activation for the RAID LV in
a cluster.)
When images and their associated metadata are removed from a RAID1 LV,
the remaining sub-LVs are "shifted" down to fill the gaps. For
example, if there is a 3-way mirror:
[0][1][2]
and we remove device#0, the devices will be shifted down
[1][2]
and renamed.
[0][1]
This can create a problem for resume_lv (specifically,
dm_tree_activate_children) during the renaming process though. This
is because it will attempt to rename the higher indexed sub-LVs first
and find that it cannot because there are currently other sub-LVs with
that name. The solution is to check for a conflicting name before
attempting to rename. If a conflict is found and that conflicting
sub-LV is also in the process of renaming, we can defer the current
rename until the conflicting sub-LV has renamed and cleared the
conflict.
Now that resume_lv can handle these types of rename conflicts, we can
remove the workaround in RAID that was attempting to resume a RAID1
LV from the bottom up in order to force a proper rename in ascending
order before attempting a resume on the top-level LV. This "hack"
only worked for single machine use-cases of LVM. Clearing this up
paves the way for exclusive activation of RAID LVs in a cluster.
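A simplified sketch of the conflict-deferral idea (hypothetical types and
names; the real logic in dm_tree_activate_children is considerably more
involved):

  #include <stdbool.h>
  #include <string.h>

  struct rename_job {
      const char *old_name;
      const char *new_name;
      bool        done;
  };

  /* Returns true if some pending job still owns 'name' as its old name. */
  static bool name_held_by_pending_rename(const struct rename_job *jobs,
                                          unsigned n, const char *name)
  {
      for (unsigned i = 0; i < n; i++)
          if (!jobs[i].done && !strcmp(jobs[i].old_name, name))
              return true;
      return false;
  }

  /* Process jobs in passes, deferring any rename whose target name is still
   * in use by a not-yet-renamed sub-LV. (A full rename cycle would need an
   * extra temporary name; that case is omitted here.) */
  void process_renames(struct rename_job *jobs, unsigned n,
                       void (*do_rename)(const char *, const char *))
  {
      bool progress = true;

      while (progress) {
          progress = false;
          for (unsigned i = 0; i < n; i++) {
              if (jobs[i].done)
                  continue;
              if (name_held_by_pending_rename(jobs, n, jobs[i].new_name))
                  continue;               /* defer: conflict still pending */
              do_rename(jobs[i].old_name, jobs[i].new_name);
              jobs[i].done = true;
              progress = true;
          }
      }
  }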
When the pool is created from a non-linear target, more complex rules
have to be used and the stacking needs to properly decode the args for
the _tdata LV. Proper allocation policies are also used according to
those set in the lvm2 metadata for the data and metadata LVs.
Also properly check for an active pool, and add extra code to activate it
temporarily.
With this fix it's now possible to use:
lvcreate -L20 -m2 -n pool vg --alloc anywhere
lvcreate -L10 -m2 -n poolm vg --alloc anywhere
lvconvert --thinpool vg/pool --poolmetadata vg/poolm
lvresize -L+10 vg/pool
The pool metadata LV must be accounted for when determining what PVs
are in a thin-pool. The pool LV must also be accounted for when
checking thin volumes.
This is a prerequisite for pvmove working with thin types.
The function 'get_pv_list_for_lv' will assemble all the PVs that are
used by the specified LV. It uses 'for_each_sub_lv' to traverse all
of the sub-lvs which may compose it.
When creating a new thin pool and no profile is requested via
"lvcreate --profile ...", inherit any VG profile if one is attached.
Currently this applies to these settings (an illustrative profile
follows the list):
allocation/thin_pool_chunk_size
allocation/thin_pool_discards
allocation/thin_pool_zero
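An illustrative metadata profile carrying these settings might look like
this (the file name and values are examples only, not recommended defaults):

  # /etc/lvm/profile/thin_example.profile
  allocation {
      thin_pool_chunk_size = 512        # KiB
      thin_pool_discards = "passdown"
      thin_pool_zero = 0
  }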
The PREFERRED allocation mechanism requires the number of areas in the
previous LV segment to match the number in the new segment being
allocated. If they do not match, the code may crash.
E.g. https://bugzilla.redhat.com/989347
Introduce A_AREA_COUNT_MATCHES and when not set avoid referring
to the previous segment with the contiguous and cling policies.
commit d00d45a8b6 introduced changes
that are causing cluster mirror tests to fail. Ultimately, I think
the change was right, but a proper clean-up will have to wait.
The portion of the commit we are reverting correlates to the
following commit comment:
2) lib/metadata/mirror.c:_delete_lv() - should have been calling
_activate_lv_like_model() with 'mirror_lv'. This is because
'mirror_lv' is the LV that the overall operation is being
performed on. We need to use this LV as the basis for
determining whether to activate locally, or across the
cluster, etc.
It appears that when legs or logs are removed from a mirror, they
are being activated before they are deleted in order to make them
top-level LVs that can be acted upon. When doing this, it appears
they are not activated based on the characteristics of the mirror
from which they came. IOW, if the mirror was exclusively active,
the sub-LVs are activated globally. This is a no-no. This then
made it impossible to use activate_lv_like_model if the model was
"mirror_lv" instead of "lv" in _delete_lv(). Thus, at some point
this change should probably be put back and the locations where
the sub-LVs are being improperly activated "shared" instead of
EX should be corrected.
Three fixme's addressed in this commit:
1) lib/metadata/lv_manip.c:_calc_area_multiple() - this could be
safely changed to a comment explaining that currently because
RAID10 can only have a 2-way mirror, we don't need to know the
number of stripes. However, we will need to know that in the
future if RAID10 is to support more than 2-way mirroring.
2) lib/metadata/mirror.c:_delete_lv() - should have been calling
_activate_lv_like_model() with 'mirror_lv'. This is because
'mirror_lv' is the LV that the overall operation is being
performed on. We need to use this LV as the basis for
determining whether to activate locally, or across the
cluster, etc.
3) tools/lvcreate.c:_lvcreate_params() - Minor clean-up. If
'-m 0' is given, treat it as though the mirroring argument
was not given (i.e. as though the requested segment type
was 'stripe' and not mirror).
Activation is needed only for a clustered VG.
For a non-clustered VG, skip activation, since deactivate_lv()
can be called without problems (no testing for lock presence).
(updates f6ded62291)
When the merging of a snapshot is finished, we need to clean the dm table
entries for the snapshot and the -cow device. So for a merging snapshot
we have to activate_lv the plain 'cow' LV and let the table
resolver do its work - a deactivate_lv() request
will shortly follow - in a cluster this needs the LV lock to be held by clvmd.
Also update a test - add a small wait - in case lvremove is not 'fast enough'
and the merging process has not been stopped and $lv1 removed in the background.
Otherwise the following lvcreate occasionally finds the name $lv1 still in use.
(in-release fix)
Add the --poolmetadataspare option, which creates and handles
a pool metadata spare LV when a thin pool is created.
With the default setting 'y' it tries to ensure the spare has
at least the size of the created LV.
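For example (illustrative names), the spare can be suppressed at creation time:
$ lvcreate -L1G -T vg/pool --poolmetadataspare n
With the default 'y', a pool metadata spare LV is kept in the VG instead.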
Pool creation involves clearing of the metadata device,
which triggers a udev watch rule we cannot synchronize with
in the current code.
This metadata device needs to be activated locally,
so in cluster mode deactivation and reactivation
is always needed.
However for non-clustered mode we may reload the table
via the suspend/resume path, which avoids a collision with
the udev watch rule that was occasionally triggering
a retry deactivation loop.
The code has also been split into 2 separate code paths
for thin pools and thin volumes, which improves the readability
of the code as well.
Deactivation has been moved out of extend_pool() and
decision is now in _lv_create_an_lv() which knows
the change mode.
Since we vg_write&commit metadata LV inside lv_extend() call,
proper restore is needed in case something fails.
So add bad: section which deactivates activated LV
and removes it from VG.
Also check early for metadata LV name lengh fail.
Remove backup() call from update_pool_lv() as it's been there
duplicated and preperly order backup() call after lvresize,
so there is just one such call.
If the thin pool is known to be active, messages can be passed
to the pool even when the created thin volume is not going to be
activated.
So we do not need to stack a large list of messages, and we can
validate and catch creation errors earlier in this case.
Replace the test for a valid activation combination with a simpler list of
deactivation combinations.
The activation/auto_set_activation_skip enables/disables automatic
adding of the ACTIVATION_SKIP LV flag. By default thin snapshots
are flagged to be skipped during activation.
And by default, the auto_set_activation_skip is enabled.
Also add -k/--setactivationskip y/n and -K/--ignoreactivationskip
options to lvcreate.
The --setactivationskip y sets the flag in metadata for an LV to
skip the LV during activation. Also, the newly created LV is not
activated.
Thin snapshots have this flag set automatically if not specified
directly by the --setactivationskip y/n option.
The --ignoreactivationskip overrides the activation skip flag set
in metadata for an LV (just for the run of the command - the flag
is not changed in metadata!)
A few examples for the lvcreate with the new options:
(non-thin snap LV => skip flag not set in MDA + LV activated)
raw/~ $ lvcreate -l1 vg
Logical volume "lvol0" created
raw/~ $ lvs -o lv_name,attr vg/lvol0
LV Attr
lvol0 -wi-a----
(non-thin snap LV + -ky => skip flag set in MDA + LV not activated)
raw/~ $ lvcreate -l1 -ky vg
Logical volume "lvol1" created
raw/~ $ lvs -o lv_name,attr vg/lvol1
LV Attr
lvol1 -wi------
(non-thin snap LV + -ky + -K => skip flag set in MDA + LV activated)
raw/~ $ lvcreate -l1 -ky -K vg
Logical volume "lvol2" created
raw/~ $ lvs -o lv_name,attr vg/lvol2
LV Attr
lvol2 -wi-a----
(thin snap LV => skip flag set in MDA (default behaviour) + LV not activated)
raw/~ $ lvcreate -L100M -T vg/pool -V 1T -n thin_lv
Logical volume "thin_lv" created
raw/~ $ lvcreate -s vg/thin_lv -n thin_snap
Logical volume "thin_snap" created
raw/~ $ lvs -o name,attr vg
LV Attr
pool twi-a-tz-
thin_lv Vwi-a-tz-
thin_snap Vwi---tz-
(thin snap LV + -K => skip flag set in MDA (default behaviour) + LV activated)
raw/~ $ lvcreate -s vg/thin_lv -n thin_snap -K
Logical volume "thin_snap" created
raw/~ $ lvs -o name,attr vg/thin_lv
LV Attr
thin_lv Vwi-a-tz-
(thin snap LV + -kn => no skip flag in MDA (default behaviour overridden) + LV activated)
[0] raw/~ # lvcreate -s vg/thin_lv -n thin_snap -kn
Logical volume "thin_snap" created
[0] raw/~ # lvs -o name,attr vg/thin_snap
LV Attr
thin_snap Vwi-a-tz-
Start separating the validation from the action in the basic lvresize
code moved to the library.
Remove incorrect use of command line error codes from lvresize library
functions. Move errors.h to tools directory to reinforce this,
exporting public versions of the error codes in lvm2cmd.h for dmeventd
plugins to use.
Fix and improve handling of SIGINT.
Always check for signal presence *before* calling a command,
so the command is not called when break was hit.
If the command has finished successfully, there is
no problem marking the command ok and not reporting the interrupt at all.
Fix a couple of related stack reports and assignments.
The pv resize code required that an lvm_vg_write be done
to commit the change. When the method was added to provide the ability
to list all PVs, including ones that are not associated with
a VG, we had no way for the user to make the change persistent.
Thus additional resize code was moved, and now liblvm calls into
a resize function that does indeed write the changes out, thus
not requiring the user to explicitly write out the changes.
Signed-off-by: Tony Asleson <tasleson@redhat.com>
Code move and changes to support calling code from
command line and from library interface.
V2 Change lock_vol call
Signed-off-by: Tony Asleson <tasleson@redhat.com>
As locks are held, you need to call the included function
to release the memory and locks when done traversing the
list of physical volumes.
V2: Rebase fix
V3: Prevent VGs from getting cached and then write protected.
Signed-off-by: Tony Asleson <tasleson@redhat.com>
Simplified version of lv resize.
v3: Rebase changes to make this work. Needed to set sizeargs = 1
to indicate to resize that we are asking for a size based
resize.
Signed-off-by: Tony Asleson <tasleson@redhat.com>
These settings are customizable by profiles:
allocation/thin_pool_zero
allocation/thin_pool_discards
allocation/thin_pool_chunk_size
activation/thin_pool_autoextend_threshold
activation/thin_pool_autoextend_percent
If "vgcreate/lvcreate --profile <profile_name>" is used, the profile
name is automatically stored in metadata for making it possible to
load it automatically next time the VG/LV is used.
This is per VG/LV profile loading on demand. The profile itself is saved
in struct volume_group/logical_volume as "profile" field so we can
reference it whenever needed.
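A minimal profile might look like this (assuming the default profile directory
/etc/lvm/profile and a hypothetical profile name):
$ cat /etc/lvm/profile/thin_small_chunks.profile
allocation {
    thin_pool_chunk_size = 128
    thin_pool_zero = 0
}
$ lvcreate --profile thin_small_chunks -L1G -T vg/pool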
Do not keep multiple archives for the executed command.
Reuse the ALLOCATABLE_PV bit from pv status for the
ARCHIVED_VG vg status. Mark the VG with the bit on the
first archiving.
Since the message has an informational character and doesn't lead
to an exit of the command, reduce the log level to the info print we
use for other similar types of message.
Reindent next print message.
Changes:
- move device type registration out of "type filter" (filter.c)
to a separate and new dev-type.[ch] for common use throughout the code
- the structure for keeping the major numbers detected for available
device types and available partitioning support is stored in the
"dev_types" structure now
- move common partitioning detection code to dev-type.[ch] as well
together with other device-related functions bound to dev_types
(see dev-type.h for the interface)
The dev-type interface contains all common functions used to detect
subsystems/device types, signature/superblock recognition code,
type-specific device properties and other common device properties
(bound to dev_types), including partitioning support.
- add dev_types instance to cmd context as cmd->dev_types for common use
- use cmd->dev_types throughout as a central point for providing
information about device types
Giving volume type information about being a 'metadata' type of volume
has higher priority than e.g. the 'mirror' or 'thin' flag - for those
types we have the 'target attr' (7th field).
The special suspend/resume code in lv_remove for LVM1 snapshots was interspersed
with a vg_commit call. However, while with LVM1 metadata, vg_commit is
technically a no-op, the activation code relied on the ondisk and incore
metadata being the same, since on LVM1, a "commit" happens in vg_write
already. Since the "ondisk" metadata was previously not available with format1
(and incore was silently used instead, via lvmcache), the problem was masked.
Previously, we have relied on UUIDs alone, and on lvmcache to make getting a
"new copy" of VG metadata fast. If the code which triggers the activation has
the correct VG metadata at hand (the version which is currently on disk), it can
now hand it to the activation code directly.
This allows us to get the current on-disk version of the metadata whenever we
have the current in-flight version, without recourse to scanning or lvmcache.
This patch adds the ability to set the minimum and maximum I/O rate for
sync operations in RAID LVs. The options are available for 'lvcreate' and
'lvchange' and are as follows:
--minrecoveryrate <Rate> [bBsSkKmMgG]
--maxrecoveryrate <Rate> [bBsSkKmMgG]
The rate is specified in size/sec/device. If a suffix is not given,
kiB/sec/device is assumed. Setting the rate to 0 removes the preference.
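For instance (illustrative values and LV name):
$ lvcreate --type raid1 -m 1 -L1G -n raid1_lv --maxrecoveryrate 4M vg
$ lvchange --minrecoveryrate 128k vg/raid1_lv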
There is no point in creation of a 2-chunk snapshot,
since the snapshot is invalidated immediately with the first write,
as there is no free chunk for COW blocks
(2 chunks are used by the snap header and the 1st metadata chunk).
Enhance error message about the lowest usable size.
Avoid hitting memory corruption (double free) in the code path
where the PV FID has already been destroyed and the released pointer
was left in the PV structure and could have been released
from there a 2nd time on final context destruction.
There are places where 'lv_is_active' was being used where it was
more correct to use 'lv_is_active_locally'. For example, when checking
for the existence of a kernel instance before asking for its status.
Most of the time these would work correctly. (RAID is only allowed on
non-clustered VGs at the moment, which means that 'lv_is_active' and
'lv_is_active_locally' would give the same result.) However, it is
more correct to use the proper variant and it helps with future
scenarios where targets might be allowed exclusively (or clustered) in
a cluster VG.
This fixes a long standing regression since LVM2 2.02.74 (commit 4efb1d9c,
"Update heuristic used for default and detected data alignment.")
The default PE alignment could be used (via MAX()) even if it was
determined that the device's MD stripe width, or minimal_io_size or
optimal_io_size were not factors of the default PE alignment (either 64K
or the newer default of 1MB, etc). This bug would manifest if the
default PE alignment was larger than the overriding hint that the
device provided (e.g. default of 1MB vs optimal_io_size of 768K).
Signed-off-by: Mike Snitzer <snitzer@redhat.com>
For reporting stacked or joined devices properly in a cluster,
we need to report their activation state according to the lock
which activated this device tree.
This is getting a bit complex - the current code tries a simple approach -
For a snapshot - return the status of the origin.
For a thin pool - return the status of the first known active thin volume.
For the rest of them - try to use the dependency list of LVs and skip
known exceptions. This should be able to recursively deduce the top level
device for a given LV.
(in release fix)
Add new lvs segment field 'Monitor' showing 3 states:
"monitored" - LV is monitored by dmeventd.
"not monitored" - LV is currently not being monitored by dmeventd
"" (empty) - LV does not support monitoring, or dmeventd support
is not compiled in.
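For illustration (assuming the new field is selectable as 'seg_monitor';
names and output are hypothetical):
$ lvs -o name,seg_monitor vg
LV    Monitor
lv1   monitored
snap1 not monitored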
Support for exclusive activation of snapshots revealed some problems.
When snapshot is created, COW LV is activated first (for clearing) and
then it's transformed into snapshot's COW LV, but it has left the lock
for such LV active in cluster and this lock could not have been removed
from dlm, unless snapshot has been removed within same dlm session.
If the user tried to remove snapshot after rebooting node, the lock was
missing, and COW LV could not have been detached.
The patch modifies the approach in this way:
Always deactivate the COW LV for a clustered VG after clearing (so it's
activated again via the implicit snapshot activation rule when the snapshot is activated).
When the snapshot is removed, activate the COW LV as an independent LV, so the lock
will exist for such an LV, but only when the snapshot is active.
Also add test case for testing snapshot removal after cluster reboot.
'lvchange' is used to alter a RAID 1 logical volume's write-mostly and
write-behind characteristics. The '--writemostly' parameter takes a
PV as an argument with an optional trailing character to specify whether
to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing
character is given, it will set the flag.
Synopsis:
lvchange [--writemostly <PV>:{t|y|n}] [--writebehind <count>] vg/lv
Example:
lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv
The last character in the 'lv_attr' field is used to show whether a device
has the WriteMostly flag set. It is signified with a 'w'. If the device
has failed, the 'p'artial flag has priority.
Example ("nosync" raid1 with mismatch_cnt and writemostly):
[~]# lvs -a --segment vg
LV VG Attr #Str Type SSize
raid1 vg Rwi---r-m 2 raid1 500.00m
[raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m
[raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m
[raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m
[raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m
Example (raid1 with mismatch_cnt, writemostly - but failed drive):
[~]# lvs -a --segment vg
LV VG Attr #Str Type SSize
raid1 vg rwi---r-p 2 raid1 500.00m
[raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m
[raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m
[raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m
[raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m
A new reportable field has been added for writebehind as well. If
write-behind has not been set or the LV is not RAID1, the field will
be blank.
Example (writebehind is set):
[~]# lvs -a -o name,attr,writebehind vg
LV Attr WBehind
lv rwi-a-r-- 512
[lv_rimage_0] iwi-aor-w
[lv_rimage_1] iwi-aor--
[lv_rmeta_0] ewi-aor--
[lv_rmeta_1] ewi-aor--
Example (writebehind is not set):
[~]# lvs -a -o name,attr,writebehind vg
LV Attr WBehind
lv rwi-a-r--
[lv_rimage_0] iwi-aor-w
[lv_rimage_1] iwi-aor--
[lv_rmeta_0] ewi-aor--
[lv_rmeta_1] ewi-aor--
New options to 'lvchange' allow users to scrub their RAID LVs.
Synopsis:
lvchange --syncaction {check|repair} vg/raid_lv
RAID scrubbing is the process of reading all the data and parity blocks in
an array and checking to see whether they are coherent. 'lvchange' can
now initiate the two scrubbing operations: "check" and "repair". "check"
will go over the array and record the number of discrepancies but not
repair them. "repair" will correct the discrepancies as it finds them.
'lvchange --syncaction repair vg/raid_lv' is not to be confused with
'lvconvert --repair vg/raid_lv'. The former initiates a background
synchronization operation on the array, while the latter is designed to
repair/replace failed devices in a mirror or RAID logical volume.
Additional reporting has been added for 'lvs' to support the new
operations. Two new printable fields (which are not printed by
default) have been added: "syncaction" and "mismatches". These
can be accessed using the '-o' option to 'lvs', like:
lvs -o +syncaction,mismatches vg/lv
"syncaction" will print the current synchronization operation that the
RAID volume is performing. It can be one of the following:
- idle: All sync operations complete (doing nothing)
- resync: Initializing an array or recovering after a machine failure
- recover: Replacing a device in the array
- check: Looking for array inconsistencies
- repair: Looking for and repairing inconsistencies
The "mismatches" field with print the number of descrepancies found during
a check or repair operation.
The 'Cpy%Sync' field already available to 'lvs' will print the progress
of any of the above syncactions, including check and repair.
Finally, the lv_attr field has changed to accommodate the scrubbing operations
as well. The role of the 'p'artial character in the lv_attr report field
has expanded. "Partial" is really an indicator for the health of a
logical volume and it makes sense to extend this to include other health
indicators as well, specifically:
'm'ismatches: Indicates that there are discrepancies in a RAID
LV. This character is shown after a scrubbing
operation has detected that portions of the RAID
are not coherent.
'r'efresh : Indicates that a device in a RAID array has suffered
a failure and the kernel regards it as failed -
even though LVM can read the device label and
considers the device to be ok. The LV should be
'r'efreshed to notify the kernel that the device is
now available, or the device should be 'r'eplaced
if it is suspected of failing.
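A typical scrubbing session might then look like this (illustrative LV name
and output):
$ lvchange --syncaction check vg/raid_lv
$ lvs -o name,syncaction,mismatches vg/raid_lv
LV      SyncAction Mismatches
raid_lv idle       0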
The pv_by_path might also be dangerous to use as it does not
take into account any metadata areas other than the ones found on the PV
itself. If metadata was not found on the PV referenced by the path,
it returned no PV though it might have been referenced by metadata
elsewhere (on other PVs...).
If extending a VG and including a PV with 0 MDAs that was already
a part of a VG, the vgextend allowed that PV to be added and we
ended up *with one PV in two VGs*!
The vgextend code used the 'pv_by_path' fn that returned a PV for
a given path. However, when the PV did not have any metadata areas,
the fn just returned a PV without any reference to existing VG.
Consequently, any checks for the existing VG failed.
[0] raw/~ # pvcreate --metadatacopies 0 /dev/sda
Physical volume "/dev/sda" successfully created
[0] raw/~ # pvcreate --metadatacopies 1 /dev/sdb
Physical volume "/dev/sdb" successfully created
[0] raw/~ # vgcreate vg1 /dev/sda /dev/sdb
Volume group "vg1" successfully created
[0] raw/~ # pvcreate --metadatacopies 1 /dev/sdc
Physical volume "/dev/sdc" successfully created
[0] raw/~ # vgcreate vg2 /dev/sdc
Volume group "vg2" successfully created
Before this patch (incorrect):
[0] raw/~ # vgextend vg2 /dev/sda
Volume group "vg2" successfully extended
With this patch (correct):
[0] raw/~ # vgextend vg2 /dev/sda
Physical volume '/dev/sda' is already in volume group 'vg1'
Unable to add physical volume '/dev/sda' to volume group 'vg2'.
Before, the find_pv_by_name call always failed if the PV found was an orphan.
However, we might use this function even for a PV that is not part of any VG.
This patch adds 'allow_orphan' arg to find_pv_by_name fn that allows that.
_find_pv_by_name -> find_pv_by_name
_find_pv_in_vg -> find_pv_in_vg
_find_pv_in_vg_by_uuid -> find_pv_in_vg_by_uuid
The only callers of the underscored variants were their wrappers
without the underscore. No other part of the code referenced the
underscored variants.
Keep a flag recording whether the given thin pool argument has been given on the
command line or has been 'estimated'.
A call of update_pool_params() must not change cmdline-given args and
needs to know this info.
Since there is a need to move this update function into /lib, we cannot
use arg_count().
FIXME: we need some generic mechanism here.
For example, the old call and reference:
find_config_tree_str(cmd, "devices/dir", DEFAULT_DEV_DIR)
...now becomes:
find_config_tree_str(cmd, devices_dir_CFG)
So we're referring to the named configuration ID instead
of passing the configuration path and the default value
is taken from central config definition in config_settings.h
automatically.
The PV header extension information (PV header extension version, flags
and list of Embedding Area locations) is stored just beyond the PV header base.
When calculating the Embedding Area start value (ea_start), the same logic is
used as when calculating the pe_start value for Data Area - the value must
follow exactly the same alignment restrictions for its start value
(the alignment detected automatically or provided via command line using
the --dataalignment and --dataalignmentoffset arguments).
The Embedding Area is placed at the very start of the PV, starting at
ea_start. The Data Area starting at pe_start is placed next. The pe_start is
still properly aligned. Due to the pe_start alignment, it's possible that the
resulting Embedding Area size (ea_size) ends up bigger in size than requested
(but never less than requested).
New tools with PV header extension support will read the extension
if it exists and it's not an error if it does not exist (so old PVs
will still work seamlessly with new tools).
Old tools without PV header extension support will just ignore any
extension.
As for the Embedding Area location information (its start and size),
there are actually two places where this is stored:
- PV header extension
- VG metadata
The VG metadata contains a copy of what's written in the PV header
extension about the Embedding Area location (NULL value is not copied):
physical_volumes {
pv0 {
id = "AkSSRf-difg-fCCZ-NjAN-qP49-1zzg-S0Fd4T"
device = "/dev/sda" # Hint only
status = ["ALLOCATABLE"]
flags = []
dev_size = 262144 # 128 Megabytes
pe_start = 67584
pe_count = 23 # 92 Megabytes
ea_start = 2048
ea_size = 65536 # 32 Megabytes
}
}
The new metadata fields are "ea_start" and "ea_size".
This is mostly useful when restoring the PV by using existing
metadata backups (e.g. pvcreate --restorefile ...).
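An illustrative restore (hypothetical backup path and device, with the UUID taken
from the metadata example above) would be:
$ pvcreate --restorefile /etc/lvm/backup/vg \
           --uuid AkSSRf-difg-fCCZ-NjAN-qP49-1zzg-S0Fd4T /dev/sda
$ vgcfgrestore vg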
New tools do not require these two fields to exist in VG metadata,
they're not compulsory. Therefore, reading old VG metadata which doesn't
contain any Embedding Area information will not end up with any kind
of error but only a debug message that the ea_start and ea_size values
were not found.
Old tools just ignore these extra fields in VG metadata.
PV header extension comes just beyond the existing PV header base:
PV header base (existing):
- uuid
- device size
- null-terminated list of Data Areas
- null-terminated list of Metadata Areas
PV header extension:
- extension version
- flags
- null-terminated list of Embedding Areas
This patch also adds "eas" (Embedding Areas) list to lvmcache (lvmcache_info)
and it also adds support for common operations on the list (just like for
already existing "das" - Data Areas list):
- lvmcache_add_ea
- lvmcache_update_eas
- lvmcache_foreach_ea
- lvmcache_del_eas
Also, add ea_start and ea_size to struct physical_volume for processing
PV Embedding Area location throughout the code (currently only one
Embedding Area is supported, though the definition on disk allows for
more if needed in the future...).
Also, define FMT_EAS format flag to mark that the format actually
supports Embedding Areas (currently format-text only).
Extract restorable PV creation parameters from struct pvcreate_params into
a separate struct pvcreate_restorable_params for clarity and also for better
maintainability when adding any new items later.
Use the field 'origin' for reporting external origin lv name.
For thin volumes with external origin, report the size of
external origin size via:
lvs -o+origin_size
If zero metadata copies are used, there's no further recalculation of
PV alignment that happens when adding metadata areas to the PV and
which actually calculates the alignment correctly as a matter of fact.
So fix this for the "PV without MDA" case as well.
Before this patch:
[1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m
--metadatacopies 1 /dev/sda
Physical volume "/dev/sda" successfully created
[1] raw/~ # pvs -o pv_name,pe_start
PV 1st PE
/dev/sda 12.00m
[1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m
--metadatacopies 0 /dev/sda
Physical volume "/dev/sda" successfully created
[1] raw/~ # pvs -o pv_name,pe_start
PV 1st PE
/dev/sda 8.00m
After this patch:
[1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m
--metadatacopies 1 /dev/sda
Physical volume "/dev/sda" successfully created
[1] raw/~ # pvs -o pv_name,pe_start
PV 1st PE
/dev/sda 12.00m
[1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m
--metadatacopies 0 /dev/sda
Physical volume "/dev/sda" successfully created
[1] raw/~ # pvs -o pv_name,pe_start
PV 1st PE
/dev/sda 12.00m
Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in:
if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align)
pv->pe_start = pv->pe_align ...
This part of the condition is not reachable as with the PV_PE_START_CALC,
we always have pv->pe_start set to 0 from the PV struct initialisation
(...the pv->pe_start value is just being calculated).
When a device fails, we may wish to replace those segments with an
error segment. (Like when a 'vgreduce --removemissing' removes a
failed device that happens to be a RAID image/meta.) We are then left
with images that we will eventually want to remove or replace.
This patch allows us to pull out these virtual "error" sub-LVs. This
allows a user to 'lvconvert -m -1 vg/lv' to extract the bad sub-LVs.
Sub-LVs with error segments are considered for extraction before other
possible devices so that good devices are not accidentally removed.
This patch also adds the ability to replace RAID images that contain error
segments. The user will still be unable to run 'lvconvert --replace'
because there is no way to address the 'error' segment (i.e. no PV
that it is associated with). However, 'lvconvert --repair' can be
used to replace the image's error segment with a new PV. This is also
the most appropriate way to do it, since the LV will continue to be
reported as 'partial'.
Currently it is impossible to remove a failed PV which has a RAID LV
on it. This patch fixes the issue by replacing the failed PV with an
'error' segment within the affected sub-LVs. Once there is no longer
a RAID LV using the PV, it can be removed.
Most often, it is better to replace a failed RAID device with a spare.
(You can use 'lvconvert --repair <vg>/<LV>' to accomplish that.)
However, if there are no spares in the volume group and none will be
added, it is useful to be able to remove the failed device.
Following patches address the ability to perform 'lvconvert' operations
on RAID LVs that contain sub-LVs composed of 'error' segments.
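The intended flow, roughly (illustrative names), once these patches are in place:
$ vgreduce --removemissing vg      # failed PV replaced by 'error' segments in the RAID sub-LVs
$ lvconvert --repair vg/raid_lv    # later, replace the error segments with a new PV if available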
We have been using 'mirror_region_size' in lvm.conf as the default region
size for RAID logical volumes as well as mirror logical volumes. Since,
"raid" is more inclusive and representative than "mirror", I have changed
the name of this setting. We must still check for the old setting and warn
the user if we are overriding it with the new setting if both happen to be
present.
This internal function checks for an active pool device.
For a clustered VG it checks every thin volume;
on a non-clustered VG we need to check just
for the presence of the -tpool device.
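On a non-clustered host the presence of the -tpool device can be checked
directly, e.g. (illustrative names and output):
$ dmsetup ls | grep tpool
vg-pool-tpool   (253:3)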
There are currently a few issues with the reporting done on RAID LVs and
sub-LVs. The most concerning is that 'lvs' does not always report the
correct failure status of individual RAID sub-LVs (devices). This can
occur when a device fails and is restored after the failure has been
detected by the kernel. In this case, 'lvs' would report all devices are
fine because it can read the labels on each device just fine.
Example:
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0)
[lv_rimage_0] vg iwi-aor-- /dev/sda1(1)
[lv_rimage_1] vg iwi-aor-- /dev/sdb1(1)
[lv_rmeta_0] vg ewi-aor-- /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0)
However, 'dmsetup status' on the device tells us a different story:
[root@bp-01 lvm2]# dmsetup status vg-lv
0 1024000 raid raid1 2 DA 1024000/1024000
In this case, we must also be sure to check the RAID LVs kernel status
in order to get the proper information. Here is an example of the correct
output that is displayed after this patch is applied:
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg rwi-a-r-p 100.00 lv_rimage_0(0),lv_rimage_1(0)
[lv_rimage_0] vg iwi-aor-p /dev/sda1(1)
[lv_rimage_1] vg iwi-aor-- /dev/sdb1(1)
[lv_rmeta_0] vg ewi-aor-p /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0)
The other case where 'lvs' gives incomplete or improper output is when a
device is replaced or added to a RAID LV. It should display that the RAID
LV is in the process of sync'ing and that the new device is the only one
that is not-in-sync - as indicated by a leading 'I' in the Attr column.
(Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates
an (I)mage that is not in sync.) Here's an example of the old incorrect
behaviour:
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0)
[lv_rimage_0] vg iwi-aor-- /dev/sda1(1)
[lv_rimage_1] vg iwi-aor-- /dev/sdb1(1)
[lv_rmeta_0] vg ewi-aor-- /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0)
[root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0)
[lv_rimage_0] vg Iwi-aor-- /dev/sda1(1)
[lv_rimage_1] vg Iwi-aor-- /dev/sdb1(1)
[lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1)
[lv_rmeta_0] vg ewi-aor-- /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0)
[lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0)
** Note that all the images currently are marked as 'I' even though it is
only the last device that has been added that should be marked.
Here is an example of the correct output after this patch is applied:
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0)
[lv_rimage_0] vg iwi-aor-- /dev/sda1(1)
[lv_rimage_1] vg iwi-aor-- /dev/sdb1(1)
[lv_rmeta_0] vg ewi-aor-- /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0)
[root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0)
[lv_rimage_0] vg iwi-aor-- /dev/sda1(1)
[lv_rimage_1] vg iwi-aor-- /dev/sdb1(1)
[lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1)
[lv_rmeta_0] vg ewi-aor-- /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0)
[lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0)
** Note only the last image is marked with an 'I'. This is correct and we can
tell that it isn't the whole array that is sync'ing, but just the new
device.
It also works under snapshots...
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
LV VG Attr Cpy%Sync Devices
lv vg owi-a-r-p 33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0)
[lv_rimage_0] vg iwi-aor-- /dev/sda1(1)
[lv_rimage_1] vg Iwi-aor-p /dev/sdb1(1)
[lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1)
[lv_rmeta_0] vg ewi-aor-- /dev/sda1(0)
[lv_rmeta_1] vg ewi-aor-p /dev/sdb1(0)
[lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0)
snap vg swi-a-s-- /dev/sda1(51201)
fmt1 doesn't have a separate commit function: updates take effect
immediately when vg_write is called, so we must update lvmetad at this
point if we're going to go on and ask lvmetad for the VG metadata
again before calling the commit function (though that's probably an
unsupported and pointless thing to do anyway as the client must
already have that data and it cannot have changed because it's locked
and with devs suspended we shouldn't be communicating with lvmetad;
so when that's fixed properly, this fix here can be reverted).
This problem showed up as an internal error when lvremoving an LVM1
snapshot.
> Internal error: LV snap1 (00000000000000000000000000000001) missing from preload metadata
https://bugzilla.redhat.com/891855
If a RAID array is not in-sync, replacing devices should not be allowed
as a general rule. This is because the contents used to populate the
incoming device may be undefined because the devices being read were
not in-sync. The kernel enforces this rule, unless overridden, by not
allowing the creation of an array that is not in-sync and includes a
device that needs to be rebuilt.
Since we cannot know the sync state of an LV if it is inactive, we must
also enforce the rule that an array must be active to replace devices.
That leaves us with the following conditions:
1) never allow replacement or repair of devices if the LV is inactive
2) never allow replacement if the LV is not in-sync
3) allow repair if the LV is not in-sync, but warn that contents may
not be recoverable.
In the case where a user is performing the repair on the command line via
'lvconvert --repair', the warning is printed before the user is prompted
if they would like to replace the device(s). If the repair is automated
(i.e. via dmeventd and policy is "allocate"), then the device is replaced
if possible and the warning is printed.
If the lvmcache_info_from_pvid() fails to find valid
info, invoke the lookup by dev, and only in this case
call lvmcache_info_from_pvid() again.
Also check for the result of info and return
error directly, so the NULL is not passed
to lvmcache_get_label().
Commit bf2741376d started to use
lv_is_active() instead of call for lv_info & info.exists so
we cover also cluster activated devices.
For snapshots the conversion was not correct and introduced
a regression by blocking creation of a snapshot of an inactive LV.
Fix it by assigning lv_is_active() directly.
Note: we still have minor issue to fix - to make
lv_is_???? function able to return error states since
lv_info() may fail.
The target tells us its version, and we may allow a different set of options
to be supported with different versions of the driver.
Idea is to provide individual feature flags and later be
able to query for them.
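The versions the kernel reports can be inspected with dmsetup, e.g.
(illustrative output):
$ dmsetup targets
thin-pool        v1.9.0
thin             v1.9.0
raid             v1.5.2
mirror           v1.13.2
...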
The 'copy_percent' function takes the 'extents_copied' field from each
segment in an LV to create the numerator for the ratio that is to
become the copy_percent. (Otherwise known as the 'sync' percent for
non-pvmove uses, like mirror LVs and RAID LVs.) This function safely
works on RAID - not just mirrors - so it is better to have it in
lv_manip.c rather than mirror.c.
There's a lot of different functions that do a lot of different things
in lv_manip.c, so I placed the function near a function in lv_manip.c
that it was close to in metadata-exported.h. Different placement in the
file or a different name for the function may be useful.
Use log_warn to print non-fatal warning messages.
Use of log_error would confuse the checker that tests
whether a proper error has been reported for some real error.