shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2025-01-03 05:18:29 +03:00

Author	SHA1	Message	Date
Jonathan Brassow	2e0740f7ef	RAID: Add writemostly/writebehind support for RAID1 'lvchange' is used to alter a RAID 1 logical volume's write-mostly and write-behind characteristics. The '--writemostly' parameter takes a PV as an argument with an optional trailing character to specify whether to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing character is given, it will set the flag. Synopsis: lvchange [--writemostly <PV>:{t\|y\|n}] [--writebehind <count>] vg/lv Example: lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv The last character in the 'lv_attr' field is used to show whether a device has the WriteMostly flag set. It is signified with a 'w'. If the device has failed, the 'p'artial flag has priority. Example ("nosync" raid1 with mismatch_cnt and writemostly): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg Rwi---r-m 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m Example (raid1 with mismatch_cnt, writemostly - but failed drive): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg rwi---r-p 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m A new reportable field has been added for writebehind as well. If write-behind has not been set or the LV is not RAID1, the field will be blank. Example (writebehind is set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- 512 [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor-- Example (writebehind is not set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor--	2013-04-15 13:59:46 -05:00
Zdenek Kabelac	2e39392daf	cleanup: remove unused lvl_idx	2013-04-12 11:26:31 +02:00
Jonathan Brassow	ff64e3500f	RAID: Add scrubbing support for RAID LVs New options to 'lvchange' allow users to scrub their RAID LVs. Synopsis: lvchange --syncaction {check\|repair} vg/raid_lv RAID scrubbing is the process of reading all the data and parity blocks in an array and checking to see whether they are coherent. 'lvchange' can now initaite the two scrubbing operations: "check" and "repair". "check" will go over the array and recored the number of discrepancies but not repair them. "repair" will correct the discrepancies as it finds them. 'lvchange --syncaction repair vg/raid_lv' is not to be confused with 'lvconvert --repair vg/raid_lv'. The former initiates a background synchronization operation on the array, while the latter is designed to repair/replace failed devices in a mirror or RAID logical volume. Additional reporting has been added for 'lvs' to support the new operations. Two new printable fields (which are not printed by default) have been added: "syncaction" and "mismatches". These can be accessed using the '-o' option to 'lvs', like: lvs -o +syncaction,mismatches vg/lv "syncaction" will print the current synchronization operation that the RAID volume is performing. It can be one of the following: - idle: All sync operations complete (doing nothing) - resync: Initializing an array or recovering after a machine failure - recover: Replacing a device in the array - check: Looking for array inconsistencies - repair: Looking for and repairing inconsistencies The "mismatches" field with print the number of descrepancies found during a check or repair operation. The 'Cpy%Sync' field already available to 'lvs' will print the progress of any of the above syncactions, including check and repair. Finally, the lv_attr field has changed to accomadate the scrubbing operations as well. The role of the 'p'artial character in the lv_attr report field as expanded. "Partial" is really an indicator for the health of a logical volume and it makes sense to extend this include other health indicators as well, specifically: 'm'ismatches: Indicates that there are discrepancies in a RAID LV. This character is shown after a scrubbing operation has detected that portions of the RAID are not coherent. 'r'efresh : Indicates that a device in a RAID array has suffered a failure and the kernel regards it as failed - even though LVM can read the device label and considers the device to be ok. The LV should be 'r'efreshed to notify the kernel that the device is now available, or the device should be 'r'eplaced if it is suspected of failing.	2013-04-11 15:33:59 -05:00
Petr Rockai	382fc878d7	lvmetad: Check for reappeared PVs.	2013-04-03 12:48:28 +02:00
Zdenek Kabelac	d24c01a414	thin: lvcreate external origin snapshot support	2013-04-02 15:17:31 +02:00
Zdenek Kabelac	435e0bb608	cleanup: indent line	2013-04-02 15:17:05 +02:00
Peter Rajnoha	5c93f3997b	metadata: use PV's internal UNLABELLED_PV flag more consistently Set when new PV created, cleared on PV write.	2013-03-25 16:21:59 +01:00
Peter Rajnoha	ea36d0501e	cleanup: remove unused 'pv_by_path' fn The pv_by_path might be also dangerous to use as it does not count with any other metadata areas but the ones found on the PV itself. If metadata was not found on the PV referenced by the path, it returned no PV though it might have been referenced by metadata elsewhere (on other PVs...).	2013-03-19 14:57:36 +01:00
Peter Rajnoha	7e5e2dd4ee	vgextend: do not allow PV with 0 MDAs to be added while already in a VG If extending a VG and including a PV with 0 MDAs that was already a part of a VG, the vgextend allowed that PV to be added and we ended up with one PV in two VGs! The vgextend code used the 'pv_by_path' fn that returned a PV for a given path. However, when the PV did not have any metadata areas, the fn just returned a PV without any reference to existing VG. Consequently, any checks for the existing VG failed. [0] raw/~ # pvcreate --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [0] raw/~ # pvcreate --metadatacopies 1 /dev/sdb Physical volume "/dev/sdb" successfully created [0] raw/~ # vgcreate vg1 /dev/sda /dev/sdb Volume group "vg1" successfully created [0] raw/~ # pvcreate --metadatacopies 1 /dev/sdc Physical volume "/dev/sdc" successfully created [0] raw/~ # vgcreate vg2 /dev/sdc Volume group "vg2" successfully created Before this patch (incorrect): [0] raw/~ # vgextend vg2 /dev/sda Volume group "vg2" successfully extended With this patch (correct): [0] raw/~ # vgextend vg2 /dev/sda Physical volume '/dev/sda' is already in volume group 'vg1' Unable to add physical volume '/dev/sda' to volume group 'vg2'.	2013-03-19 14:57:36 +01:00
Peter Rajnoha	59878d0129	metadata: add 'allow_orphan' arg to find_pv_by_name fn Before, the find_pv_by_name call always failed if the PV found was orphan. However, we might use this function even for a PV that is not part of any VG. This patch adds 'allow_orphan' arg to find_pv_by_name fn that allows that.	2013-03-19 14:57:31 +01:00
Peter Rajnoha	5b6bab2e30	cleanup: remove superfluous wrappers _find_pv_by_name -> find_pv_by_name _find_pv_in_vg -> find_pv_in_vg _find_pv_in_vg_by_uuid -> find_pv_in_vg_by_uuid The only callers of the underscored variants were their wrappers without the underscore. No other part of the code referenced the underscored variants.	2013-03-19 13:58:02 +01:00
Zdenek Kabelac	b36a776a7f	thin: move update_pool_params Now we may recongnize preset arguments, move the code for updating thin pool related values into /lib portion of the code.	2013-03-13 15:13:54 +01:00
Zdenek Kabelac	f06dd8725a	thin: mark passed args Keep the flag whether given thin pool argument has been given on command line or it's been 'estimated' Call of update_pool_params() must not change cmdline given args and needs to know this info. Since there is a need to move this update function into /lib, we cannot use arg_count(). FIXME: we need some generic mechanism here.	2013-03-13 15:13:54 +01:00
Peter Rajnoha	386886f71c	config: refer to config nodes using assigned IDs For example, the old call and reference: find_config_tree_str(cmd, "devices/dir", DEFAULT_DEV_DIR) ...now becomes: find_config_tree_str(cmd, devices_dir_CFG) So we're referring to the named configuration ID instead of passing the configuration path and the default value is taken from central config definition in config_settings.h automatically.	2013-03-06 10:14:33 +01:00
Peter Rajnoha	b778653f03	pv_header_extension: add support for writing PV header extension (flags & Embedding Area) The PV header extension information (PV header extension version, flags and list of Embedding Area locations) is stored just beyond the PV header base. When calculating the Embedding Area start value (ea_start), the same logic is used as when calculating the pe_start value for Data Area - the value must follow exactly the same alignment restrictions for its start value (the alignment detected automatically or provided via command line using the --dataalignment and --dataalignmentoffset arguments). The Embedding Area is placed at the very start of the PV, starting at ea_start. The Data Area starting at pe_start is placed next. The pe_start is still properly aligned. Due to the pe_start alignment, it's possible that the resulting Embedding Area size (ea_size) ends up bigger in size than requested (but never less than requested).	2013-02-26 11:28:00 +01:00
Peter Rajnoha	9dbe25709e	pv_header_extension: add support for reading PV header extension (flags & Embedding Area) New tools with PV header extension support will read the extension if it exists and it's not an error if it does not exist (so old PVs will still work seamlessly with new tools). Old tools without PV header extension support will just ignore any extension. As for the Embedding Area location information (its start and size), there are actually two places where this is stored: - PV header extension - VG metadata The VG metadata contains a copy of what's written in the PV header extension about the Embedding Area location (NULL value is not copied): physical_volumes { pv0 { id = "AkSSRf-difg-fCCZ-NjAN-qP49-1zzg-S0Fd4T" device = "/dev/sda" # Hint only status = ["ALLOCATABLE"] flags = [] dev_size = 262144 # 128 Megabytes pe_start = 67584 pe_count = 23 # 92 Megabytes ea_start = 2048 ea_size = 65536 # 32 Megabytes } } The new metadata fields are "ea_start" and "ea_size". This is mostly useful when restoring the PV by using existing metadata backups (e.g. pvcreate --restorefile ...). New tools does not require these two fields to exist in VG metadata, they're not compulsory. Therefore, reading old VG metadata which doesn't contain any Embedding Area information will not end up with any kind of error but only a debug message that the ea_start and ea_size values were not found. Old tools just ignore these extra fields in VG metadata.	2013-02-26 11:27:23 +01:00
Peter Rajnoha	60c5d4c42f	pv_header_extension: add supporting infrastructure for PV header extension (flags & Embedding Area) PV header extension comes just beyond the existing PV header base: PV header base (existing): - uuid - device size - null-terminated list of Data Areas - null-terminater list of MetaData Areas PV header extension: - extension version - flags - null-terminated list of Embedding Areas This patch also adds "eas" (Embedding Areas) list to lvmcache (lvmcache_info) and it also adds support for common operations on the list (just like for already existing "das" - Data Areas list): - lvmcache_add_ea - lvmcache_update_eas - lvmcache_foreach_ea - lvmcache_del_eas Also, add ea_start and ea_size to struct physical_volume for processing PV Embedding Area location throughout the code (currently only one Embedding Area is supported, though the definition on disk allows for more if needed in the future...). Also, define FMT_EAS format flag to mark that the format actually supports Embedding Areas (currently format-text only).	2013-02-26 11:25:16 +01:00
Peter Rajnoha	6d8de3638c	cleanup: use struct pvcreate_restorable_params throughout	2013-02-26 11:25:11 +01:00
Peter Rajnoha	6692b17777	cleanup: add struct pvcreate_restorable_params and move relevant items from pvcreate_params Extract restorable PV creation parameters from struct pvcreate_params into a separate struct pvcreate_restorable_params for clarity and also for better maintainability when adding any new items later.	2013-02-26 11:24:38 +01:00
Zdenek Kabelac	2cba0ea9f9	thin: removal of external_origin	2013-02-23 10:37:01 +01:00
Zdenek Kabelac	30c13eff37	thin: report external origin Use the field 'origin' for reporting external origin lv name. For thin volumes with external origin, report the size of external origin size via: lvs -o+origin_size	2013-02-23 10:37:01 +01:00
Zdenek Kabelac	87331dc419	thin: add support for external origin Add internal support for thin volume's external origin.	2013-02-23 10:36:58 +01:00
Zdenek Kabelac	d023b2d12f	lvremove: easier removal of dependent lvs Add function to remove lvs which are depending on removed lv prior the lv is removed. User is asked for confirmation.	2013-02-23 10:31:05 +01:00
Peter Rajnoha	303e86adc8	pvcreate: fix alignment to incorporate alignment offset if PV has 0 MDAs If zero metadata copies are used, there's no further recalculation of PV alignment that happens when adding metadata areas to the PV and which actually calculates the alignment correctly as a matter of fact. So fix this for "PV without MDA" case as well. Before this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 8.00m After this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in: if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align) pv->pe_start = pv->pe_align ... This part of the condition is not reachable as with the PV_PE_START_CALC, we always have pv->pe_start set to 0 from the PV struct initialisation (...the pv->pe_start value is just being calculated).	2013-02-21 14:51:19 +01:00
Jonathan Brassow	dc2ce71313	clean-up: Remove a FIXME question that has been settled It is ok for us to use the shorthand 'lv_is_virtual' to detect error targets in a RAID LV when searching for candidates for device replacement.	2013-02-20 15:03:58 -06:00
Jonathan Brassow	bd0ee420b5	RAID: Allow remove/replace of sub-LVs composed of error segments. When a device fails, we may wish to replace those segments with an error segment. (Like when a 'vgreduce --removemissing' removes a failed device that happens to be a RAID image/meta.) We are then left with images that we will eventually want to remove or replace. This patch allows us to pull out these virtual "error" sub-LVs. This allows a user to 'lvconvert -m -1 vg/lv' to extract the bad sub-LVs. Sub-LVs with error segments are considered for extraction before other possible devices so that good devices are not accidentally removed. This patch also adds the ability to replace RAID images that contain error segments. The user will still be unable to run 'lvconvert --replace' because there is no way to address the 'error' segment (i.e. no PV that it is associated with). However, 'lvconvert --repair' can be used to replace the image's error segment with a new PV. This is also the most appropriate way to do it, since the LV will continue to be reported as 'partial'.	2013-02-20 14:58:56 -06:00
Jonathan Brassow	845852d6b4	RAID: Make 'vgreduce --removemissing' work with RAID LVs Currently it is impossible to remove a failed PV which has a RAID LV on it. This patch fixes the issue by replacing the failed PV with an 'error' segment within the affected sub-LVs. Once there is no longer a RAID LV using the PV, it can be removed. Most often, it is better to replace a failed RAID device with a spare. (You can use 'lvconvert --repair <vg>/<LV>' to accomplish that.) However, if there are no spares in the volume group and none will be added, it is useful to be able to removed the failed device. Following patches address the ability to perform 'lvconvert' operations on RAID LVs that contain sub-LVs composed of 'error' segments.	2013-02-20 14:52:46 -06:00
Jonathan Brassow	0e4ffd9d3b	clean-up: Rename lvm.conf setting 'mirror_region_size' to 'raid_region_size' We have been using 'mirror_region_size' in lvm.conf as the default region size for RAID logical volumes as well as mirror logical volumes. Since, "raid" is more inclusive and representative than "mirror", I have changed the name of this setting. We must still check for the old setting and warn the user if we are overriding it with the new setting if both happen to be present.	2013-02-20 14:40:17 -06:00
Peter Rajnoha	722ca363f0	report: fix pvs -o pv_free reporting for PVs with 0 PEs [0] raw/~ # lsblk -o NAME,SIZE /dev/sda NAME SIZE sda 128M [0] raw/~ # pvcreate --dataalignment 128m /dev/sda Physical volume "/dev/sda" successfully created [0] raw/~ # vgcreate vg /dev/sda Volume group "vg" successfully created [0] raw/~ # lvcreate -l1 vg Volume group "vg" has insufficient free space (0 extents): 1 required. Before this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 128.00m After this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 0	2013-02-21 13:28:07 +01:00
Zdenek Kabelac	7910b6c0ba	thin: update pool_is_active Change it to take LV and move it to exported header - seems to be a better fit for usability from tools/ directory.	2013-02-05 16:54:11 +01:00
Zdenek Kabelac	c984d8fbab	thin: properly unmark volume after detach When the volume is detached form thin pool, unmask THIN_VOLUME flag and reset related pointers.	2013-02-05 14:40:37 +01:00
Zdenek Kabelac	11eaf1c98c	thin: add function pool_is_active This internal function check for active pool device. For cluster it checks every thin volume, On the non-clustered VG we need to check just for presence of -tpool device.	2013-02-05 14:35:44 +01:00
Zdenek Kabelac	ddeb37f282	cleanup: add internal error check Check if 'is_removable' is defined and report internal error, if it's missing.	2013-02-05 14:27:24 +01:00
Zdenek Kabelac	153ce89af3	cleanup: comment update Just update code comment and use single line if().	2013-02-04 19:05:43 +01:00
Zdenek Kabelac	ca7abbce8a	activate: add lv_layer function Add function to return layer name for LV.	2013-02-04 19:01:10 +01:00
Jonathan Brassow	801d4f96a8	RAID: Improve 'lvs' attribute reporting of RAID LVs and sub-LVs There are currently a few issues with the reporting done on RAID LVs and sub-LVs. The most concerning is that 'lvs' does not always report the correct failure status of individual RAID sub-LVs (devices). This can occur when a device fails and is restored after the failure has been detected by the kernel. In this case, 'lvs' would report all devices are fine because it can read the labels on each device just fine. Example: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) However, 'dmsetup status' on the device tells us a different story: [root@bp-01 lvm2]# dmsetup status vg-lv 0 1024000 raid raid1 2 DA 1024000/1024000 In this case, we must also be sure to check the RAID LVs kernel status in order to get the proper information. Here is an example of the correct output that is displayed after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-p 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-p /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-p /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) The other case where 'lvs' gives incomplete or improper output is when a device is replaced or added to a RAID LV. It should display that the RAID LV is in the process of sync'ing and that the new device is the only one that is not-in-sync - as indicated by a leading 'I' in the Attr column. (Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates an (I)mage that is not in sync.) Here's an example of the old incorrect behaviour: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg Iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note that all the images currently are marked as 'I' even though it is only the last device that has been added that should be marked. Here is an example of the correct output after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note only the last image is marked with an 'I'. This is correct and we can tell that it isn't the whole array that is sync'ing, but just the new device. It also works under snapshots... [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg owi-a-r-p 33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-p /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-p /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) snap vg swi-a-s-- /dev/sda1(51201)	2013-02-01 11:33:54 -06:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Alasdair G Kergon	b617109fff	lvmetad: fix format1 updates fmt1 doesn't have a separate commit function: updates take effect immediately vg_write is called, so we must update lvmetad at this point if we're going to go on and ask lvmetad for the VG metadata again before calling the commit function (though that's probably an unsupported and pointless thing to do anyway as the client must already have that data and it cannot have changed because it's locked and with devs suspended we shouldn't be communicating with lvmetad; so when that's fixed properly, this fix here can be reverted). This problem showed up as an internal error when lvremoving an LVM1 snapshot. > Internal error: LV snap1 (00000000000000000000000000000001) missing from preload metadata https://bugzilla.redhat.com/891855	2013-01-05 03:17:35 +00:00
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read where not in-sync. The kernel enforces this rule unless overridden by not allowing the creation of an array that is not in-sync and includes a devices that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is in-active 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
Zdenek Kabelac	401c9aba4a	pv_read: add missing check for valid info If the lvmcache_info_from_pvid() fails to find valid info, invoke the lookup by dev, and only in this case call lvmcache_info_from_pvid() again. Also check for the result of info and return error directly, so the NULL is not passed to lvmcache_get_label().	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	6987a353de	thin: add detach_pool_metadata_lv Add internal function detach_pool_metadata_lv().	2012-12-02 17:56:29 +01:00
Zdenek Kabelac	0387e70d76	thin: fix property discard for lvm2api Discards property is string and may have these values: ignore, nopassdown, passdown	2012-11-27 14:09:49 +01:00
Jonathan Brassow	fb0cee9a66	RAID: Do not allow --splitmirrors on RAID10 logical volumes. RAID10 does not have the ability to split off images for independent use. So, 'lvconvert --splitmirrors' will not work and must be disallowed.	2012-11-21 18:39:26 -06:00
Zdenek Kabelac	400f644286	lv_manip: fix regresion from `bf2741376d` Commit `bf2741376d` started to use lv_is_active() instead of call for lv_info & info.exists so we cover also cluster activated devices. For snapshost the conversion was not correct and introduced regression by blocking creation of snapshot of inactive LV. Fix it by assigning lv_is_active() directly. Note: we still have minor issue to fix - to make lv_is_???? function able to return error states since lv_info() may fail.	2012-11-21 12:15:09 +01:00
Zdenek Kabelac	2e96ea4a89	liblvm: internal API change Return LV/NULL instead of 1/0 which saves lookup for created LV.	2012-11-19 14:37:30 +01:00
Zdenek Kabelac	cf5242a670	lvconvert: store target attributes Target tells us its version, and we may allow different set of options to be supported with different version of driver. Idea is to provide individual feature flags and later be able to query for them.	2012-11-19 14:17:10 +01:00
Jonathan Brassow	6db461e3b0	mirror/raid: Move 'copy_percent' to common code (mirror.c -> lv_manip.c) The 'copy_percent' function takes the 'extents_copied' field from each segment in an LV to create the numerator for the ratio that is to become the copy_percent. (Otherwise known as the 'sync' percent for non-pvmove uses, like mirror LVs and RAID LVs.) This function safely works on RAID - not just mirrors - so it is better to have it in lv_manip.c rather than mirror.c. There's a lot of different functions that do a lot of different things in lv_manip.c, so I placed the function near a function in lv_manip.c that it was close to in metadata-exported.h. Different placement in the file or a different name for the function may be useful.	2012-10-23 20:33:54 -05:00
Zdenek Kabelac	bf2741376d	Use lv_is_active instead of lv_info() Usage of lv_is_active makes it more obvious what is being checked.	2012-10-17 15:42:31 +02:00
Zdenek Kabelac	e431b19bac	cleanup: move log_error upward in code stack Report log_error earlier.	2012-10-17 15:41:44 +02:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Zdenek Kabelac	b89963a7c3	cleanup: swap return values Use lvm standard return code for success/fail 1/0.	2012-10-17 15:37:26 +02:00
Jonathan Brassow	7519d881ef	Clean-up: Adjust message to be clearer on action taken and why A message is printed when the region_size of a RAID LV is adjusted to allow for large (> ~1TB) LVs. The message wasn't very clear. Hopefully, this is better.	2012-10-15 15:09:05 -05:00
Zdenek Kabelac	6595cae6e9	cleanup: resolve dereferencing type-punned pointer fix gcc warning: dereferencing type-punned pointer will break strict-aliasing rules Replace call by value and pass just const pointer to pvid.	2012-10-14 23:14:00 +02:00
Zdenek Kabelac	be291e1064	thin: lvm2api return origin property for thin LV	2012-10-12 12:20:55 +02:00
Zdenek Kabelac	9ee071705b	cleanup: fix compiler warnings remove unused vars move var declarations into the front of functions. fix some sign warnings	2012-10-12 10:25:07 +02:00
Zdenek Kabelac	b6512b10ae	cleanup: fix typos	2012-10-10 21:22:11 +02:00
Jonathan Brassow	9efd3fb604	RAID: Do not allow RAID LVs in a cluster volume group. It would be possible to activate a RAID LV exclusively in a cluster volume group, but for now we do not allow RAID LVs to exist in a clustered volume group at all. This has two components: 1) Do not allow RAID LVs to be created in a clustered VG 2) Do not allow changing a VG from single-machine to clustered if there are RAID LVs present.	2012-10-03 15:52:54 -05:00
Zdenek Kabelac	d442c3ef0c	liblvm: insert layer with subvolume renames Rename also subvolumes if we are inserting _tdata layer. (Currently it breaks mirrors if it would be generic, needs fixing).	2012-10-03 15:13:32 +02:00
Zdenek Kabelac	21c401006c	liblvm: add lv_rename_update Support lv_rename without directly updating metatata. It can save some metadata commits in some cases, i.e. when LVs are offline.	2012-10-03 15:03:49 +02:00
Jonathan Brassow	886656e4ac	RAID: Fix problems with creating, extending and converting large RAID LVs MD's bitmaps can handle 2^21 regions at most. The RAID code has always used a region_size of 1024 sectors. That means the size of a RAID LV was limited to 1TiB. (The user can adjust the region_size when creating a RAID LV, which can affect the maximum size.) Thus, creating, extending or converting to a RAID LV greater than 1TiB would result in a failure to load the new device-mapper table. Again, the size of the RAID LV is not limited by how much space is allocated for the metadata area, but by the limitations of the MD bitmap. Therefore, we must adjust the 'region_size' to ensure that the number of regions does not exceed the limit. I've added code to do this when extending a RAID LV (which covers 'create' and 'extend' operations) and when up-converting - specifically from linear to RAID1.	2012-09-27 16:51:22 -05:00
Petr Rockai	5f5832e318	lvremove: Ask before discarding data areas.	2012-09-26 17:26:23 +02:00
Petr Rockai	2276379a71	lib/cache/lvmetad: Refactor to use dm_config_tree in requests. We were using daemon_send_simple until now, but it is no longer adequate, since we need to manipulate requests in a generic way (adding a validity token to each request), and the tree-based request interface is much more suitable for this.	2012-09-26 14:49:15 +02:00
Alasdair G Kergon	290ae4791e	lvs: add partial attribute	2012-09-19 12:49:40 +01:00
Alasdair G Kergon	b737ff01e4	discards: skip when removing LVs on missing PVs Don't try to issue discards to a missing PV to avoid segfault. Prevent lvremove from removing LVs that have any part missing. https://bugzilla.redhat.com/857554	2012-09-19 12:48:56 +01:00
Jonathan Brassow	2a6712ddef	RAID1: Clear the LV_NOTSYNCED flag when a RAID1 LV is converted to linear Failing to clear the LV_NOTSYNCED flag when converting a RAID1 LV to linear can result in the flag being present after an upconvert - even if the sync is performed when upconverting.	2012-09-14 16:26:53 -05:00
Jonathan Brassow	116bcb3ea4	RAID1: Like mirrors, do not allow adding images to LV created w/ --nosync Mirrors do not allow upconverting if the LV has been created with --nosync. We will enforce the same rule for RAID1. It isn't hugely critical, since the portions that have been written will be copied over to the new device identically from either of the existing images. However, the unwritten sections may be different, causing the added image to be a hybrid of the existing images. Also, we are disallowing the addition of new images to a RAID1 LV that has not completed the initial sync. This may be different from mirroring, but that is due to the fact that the 'mirror' segment type "stacks" when adding a new image and RAID1 does not. RAID1 will rebuild a newly added image "inline" from the existant images, so they should be in-sync.	2012-09-14 16:12:52 -05:00
Jonathan Brassow	cdb0339319	RAID: Disallow addition of RAID images while array is not in-sync We cannot add images to a RAID array while it is not in-sync. The kernel will simply reject the table, saying: 'rebuild' specified while array is not in-sync Now we check to ensure the LV is in-sync before attempting image additions.	2012-09-10 17:15:20 -05:00
Jonathan Brassow	b49b98d50c	RAID: '--test' should not cause a valid create command to fail It is necessary when creating a RAID LV to clear the new metadata areas. Failure to do so could result in a prepopulated bitmap that would cause the new array to skip syncing portions of the array. It is a requirement that the metadata LVs be activated and cleared in the process of creating. However in test mode, this requirement should be lifted - no new LVs should be created or written to.	2012-09-05 14:32:06 -05:00
Jonathan Brassow	c3eb3a7687	cleanup: Use segtype->ops->name() instead of segtype->name where applicable When printing a message for the user and the lv_segment pointer is available, use segtype->ops->name() instead of segtype->name. This gives a better user-readable name for the segment. This is especially true for the 'striped' segment type, which prints "linear" if there is an area_count of one.	2012-09-05 11:35:54 -05:00
Alasdair G Kergon	3acc85caa8	buffering: use unbuffered silent mode for liblvm Disable private buffering when using liblvm. When private stdin/stdout buffering is not used always use silent mode.	2012-08-26 00:15:45 +01:00
Alasdair G Kergon	438e0050df	config: add silent mode Accept -q as the short form of --quiet. Suppress non-essential standard output if -q is given twice. Treat log/silent in lvm.conf as equivalent to -qq. Review all log_print messages and change some to log_print_unless_silent. When silent, the following commands still produce output: dumpconfig, lvdisplay, lvmdiskscan, lvs, pvck, pvdisplay, pvs, version, vgcfgrestore -l, vgdisplay, vgs. [Needs checking.] Non-essential messages are shifted from log level 4 to log level 5 for syslog and lvm2_log_fn purposes.	2012-08-25 20:35:48 +01:00
Jonathan Brassow	4047e4dfb1	RAID: Add support for RAID10 This patch adds support for RAID10. It is not the default at this stage. The user needs to specify '--type raid10' if they would like RAID10 instead of stacked mirror over stripe.	2012-08-24 15:34:19 -05:00
Zdenek Kabelac	7b300a803c	cleanup: add some missing stack backtraces	2012-08-23 14:38:48 +02:00
Zdenek Kabelac	fd417db274	check: add internal errors for unexpected paths Adding couple INTERNAL_ERROR reports for unwanted parameters: Ensure the 'top' metadata node cannot be NULL for lvmetad. Make obvious vginfo2 cannot be NULL. Report internal error if handler and vg is undefined. Check for handle in poll_vg(). Ensure seg is not NULL in dev_manager_transient(). Report missing read_ahead for _lv_read_ahead_single(). Check for report handler in dm_report_object(). Check missing VG in _vgreduce_single().	2012-08-23 14:37:52 +02:00
Zdenek Kabelac	195fe03075	cleanup: use proper activation_change_t	2012-08-23 14:37:38 +02:00
Zdenek Kabelac	bd67a3151a	cleanup: uint64_t casts	2012-08-23 14:37:21 +02:00
Zdenek Kabelac	286cd2006b	cleanup: drop unneeded included header files This headers were not resolving anything used for compiled .c files. Remove unused util.c file.	2012-08-23 14:37:20 +02:00
Peter Rajnoha	00877fe47b	mirror: reconfigure_mirror_images not used	2012-08-15 10:44:19 +02:00
Zdenek Kabelac	54c24193f5	thin: lvcreate --discards	2012-08-09 16:25:52 +02:00
Alasdair G Kergon	701b4a8363	thin: use discards as plural rather than singular Global change from --discard to --discards, as that feels more natural.	2012-08-07 21:24:41 +01:00
Alasdair G Kergon	7b5ea9a5a8	thin: tighten discard string conversions Respond with "unknown" rather than a NULL pointer if there's an internal error and the discard value is invalid. Don't accept 'no_passdown' or 'no-passdown' variants in the LVM metadata: this is written by the program so should only ever contain "nopassdown" and should be validated strictly against that.	2012-08-07 18:37:35 +01:00
Alasdair G Kergon	adfa778a58	thin: order discard enum alphabetically	2012-08-07 18:36:40 +01:00
Alasdair G Kergon	4dbf872a9f	reports: invalid snaps do not capitalise lv_attr No longer capitalise first LV attribute char for invalid snapshots. This state is available from the 5th char now (I or S).	2012-07-27 20:19:28 +01:00
Jonathan Brassow	5555d2a000	RAID: Fix segfault when attempting to replace RAID 4/5/6 device Commit `8767435ef8` allowed RAID 4/5/6 LV to be extended properly, but introduced a regression in device replacement - a critical component of fault tolerance. When only 1 or 2 drives are being replaced, the 'area_count' needed can be equal to the parity_count. The 'area_multiple' for RAID 4/5/6 was computed as 'area_count - parity_devs', which could result in 'area_multiple' being 0. This would ultimately lead to a division by zero error. Therefore, in calc_area_multiple, it is important to take into account the number of areas that are being requested - just as we already do in _alloc_init.	2012-07-24 19:02:06 -05:00
Zdenek Kabelac	ebbf7d8e68	thin: add discard support for thin pool Add arg support for discard. Add discard ignore, nopassdown, passdown (=default) support. Flags could be set per pool. lvcreate [--discard {ignore\|no_passdown\|passdown}] vg/thinlv	2012-07-18 14:36:57 +02:00
Jonathan Brassow	8767435ef8	RAID: Fix extending size of RAID 4/5/6 logical volumes. Reducing a RAID 4/5/6 LV or extending it with a different number of stripes is still not implemented. This patch covers the "simple" case where the LV is extended with the same number of stripes as the orginal.	2012-06-26 09:44:54 -05:00
Alasdair G Kergon	2cec4b4a77	alloc: fix raid --alloc anywhere double allocs If _alloc_parallel_area for raid devices chooses an area already used up, it doesn't notice that it has no space left in it and leaves later code trying to place a zero-length area into the LV. https://bugzilla.redhat.com/832596	2012-06-28 23:26:42 +01:00
Peter Rajnoha	a2f4ccd839	lvcreate: add --activate ay (autoactivate) One can use "lvcreate --aay" to have the newly created volume activated or not activated based on the activation/auto_activation_volume_list this way. Note: -Z/--zero is not compatible with -aay, zeroing is not used in this case! When using lvcreate -aay, a default warning message is also issued that zeroing is not done.	2012-06-28 09:44:07 -04:00
Peter Rajnoha	95ced7a7be	activate: add autoactivation hooks Define an 'activation_handler' that gets called automatically on PV appearance/disappearance while processing the lvmetad_pv_found and lvmetad_pv_gone functions that are supposed to update the lvmetad state based on PV availability state. For now, the actual support is for PV appearance only, leaving room for PV disappearance support as well (which is a more complex problem to solve as this needs to count with possible device stack). Add a new activation change mode - CHANGE_AAY exposed as '--activate ay/-aay' argument ('activate automatically'). Factor out the vgchange activation functionality for use in other tools (like pvscan...).	2012-06-28 09:42:47 -04:00
Peter Rajnoha	2729720fd3	args: add --activate synonym for --available arg We're refererring to 'activation' all over the code and we're talking about 'LVs being activated' all the time so let's use 'activation/activate' everywhere for clarity and consistency (still providing the old 'available' keyword as a synonym for backward compatibility with existing environments).	2012-06-28 09:42:44 -04:00
Alasdair G Kergon	07a25c249b	discards: don't discard reconfigured extents Update release_lv_segment_area not to discard any PV extents, as it also gets used when moving extents between LVs. Instead, call a new function release_and_discard_lv_segment_area() in the two places where data should be discarded - lv_reduce() and remove_mirrors_from_segments().	2012-06-27 22:12:01 +01:00
Alasdair G Kergon	e59f6981e6	discards: split discard from release_pv_segment Separate discard_pv_segment out of release_pv_segment	2012-06-27 22:11:54 +01:00
Alasdair G Kergon	a5ddb347e5	allocation: allow release_lv_segment_area to fail Allow release_lv_segment_area to fail as functions it calls can fail.	2012-06-27 22:11:49 +01:00
Zdenek Kabelac	6f3cd63551	cleanup: replace memset with struct initilization Simplifies the code, properly detects too long socket paths, drops unused parameter.	2012-06-22 13:23:03 +02:00
Alasdair Kergon	e0ed1b458d	Warn of deadlock risk when using snapshots of mirror segment type.	2012-05-14 16:18:57 +00:00
Alasdair Kergon	8b59522d67	Fix cling policy not to behave like normal policy if no previous LV seg. Fix alloc cling to cling to PVs already found with contiguous policy.	2012-05-11 22:53:13 +00:00
Alasdair Kergon	8a689fd04d	Fix allocation policy loop so it doesn't continue beyond cling using later policies it shouldn't be using when --alloc cling is specified but no tags are defined.	2012-05-11 22:19:12 +00:00
Alasdair Kergon	01cfbe14f1	Append _TO_LVSEG to names of internal A_CONTIGUOUS and A_CLING flags. Remove some unnecesary prev_lvseg checks.	2012-05-11 18:59:01 +00:00
Alasdair Kergon	51514ae62f	Always include debug mesg when cling to allocated is set.	2012-05-11 15:32:19 +00:00
Alasdair Kergon	086829459b	Refactor _has_matching_pv_tag to provide a fn that takes PV structs.	2012-05-11 15:26:30 +00:00

1 2 3 4 5 ...

1326 Commits