shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	a5b9b4bf02	thin: fix forbidden discards checks Instead of check for lv_is_active() for thin pool LV, query the whole pool via new pool_is_active(). Fixes a problem when we cannot change discards settings for active pool device where the actual layer for pool device was inactive, but thin volumes using thin pool have been active.	2013-02-05 14:38:16 +01:00
Zdenek Kabelac	11eaf1c98c	thin: add function pool_is_active This internal function check for active pool device. For cluster it checks every thin volume, On the non-clustered VG we need to check just for presence of -tpool device.	2013-02-05 14:35:44 +01:00
Zdenek Kabelac	9d445f371c	report: leave empty report field for 0 Since we do not support LVs with 0 size, use this value as 'error' value for devices without origin, and leave this field blank as in other cases.	2013-02-05 14:32:37 +01:00
Zdenek Kabelac	be5ad90703	lvconvert: fix accepting second lv name Do not allow to accept second LV name on lvconvert --thinpool command line.	2013-02-05 14:31:17 +01:00
Zdenek Kabelac	a4870c79ca	thin: use noflush for obtaining transaction_id Do not flush thin pool data, when reading transation_id status.	2013-02-04 19:05:56 +01:00
Zdenek Kabelac	ca7abbce8a	activate: add lv_layer function Add function to return layer name for LV.	2013-02-04 19:01:10 +01:00
Jonathan Brassow	38e7b37c89	WHATS_NEW: Better description of previous change	2013-02-01 11:52:25 -06:00
Jonathan Brassow	801d4f96a8	RAID: Improve 'lvs' attribute reporting of RAID LVs and sub-LVs There are currently a few issues with the reporting done on RAID LVs and sub-LVs. The most concerning is that 'lvs' does not always report the correct failure status of individual RAID sub-LVs (devices). This can occur when a device fails and is restored after the failure has been detected by the kernel. In this case, 'lvs' would report all devices are fine because it can read the labels on each device just fine. Example: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) However, 'dmsetup status' on the device tells us a different story: [root@bp-01 lvm2]# dmsetup status vg-lv 0 1024000 raid raid1 2 DA 1024000/1024000 In this case, we must also be sure to check the RAID LVs kernel status in order to get the proper information. Here is an example of the correct output that is displayed after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-p 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-p /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-p /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) The other case where 'lvs' gives incomplete or improper output is when a device is replaced or added to a RAID LV. It should display that the RAID LV is in the process of sync'ing and that the new device is the only one that is not-in-sync - as indicated by a leading 'I' in the Attr column. (Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates an (I)mage that is not in sync.) Here's an example of the old incorrect behaviour: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg Iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note that all the images currently are marked as 'I' even though it is only the last device that has been added that should be marked. Here is an example of the correct output after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note only the last image is marked with an 'I'. This is correct and we can tell that it isn't the whole array that is sync'ing, but just the new device. It also works under snapshots... [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg owi-a-r-p 33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-p /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-p /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) snap vg swi-a-s-- /dev/sda1(51201)	2013-02-01 11:33:54 -06:00
Peter Rajnoha	f7da1caf8d	blkdeactivate: fix handling of nested mountpoints and mangled mount paths. If there was a nested mountpoint inside an existing mount path, blkdeactivate could fail to unmount such a mountpoint as it needs to deactivate the deepest path first and continue upwards. For example the simplest reproducer: [root@rhel6-a ~]# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 4G 0 disk \|-vg-lvol0 (dm-2) 253:2 0 32M 0 lvm /mnt/a `-vg-lvol1 (dm-3) 253:3 0 32M 0 lvm /mnt/a/b Before this patch: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a umount: /mnt/a: device is busy. (In some cases useful info about processes that use the device is found by lsof(8) or fuser(1)) UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b LVM: deactivating Logical Volume vg/lvol1 (deactivation of vg/lvol0 is skipped as /mnt/a that is on lvol0 can't be unmounted - it still has /mnt/a/b as nested mountpoint!) With this patch applied: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a LVM: deactivating Logical Volume vg/lvol0 LVM: deactivating Logical Volume vg/lvol1 === Also, this patch contains a fix for processing mangled mount paths: [root@rhel6-a ~]# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 4G 0 disk `-vg-lvol0 (dm-2) 253:2 0 32M 0 lvm /mnt/x y z [root@rhel6-a ~]# lsblk -r vg-lvol0 253:2 0 32M 0 lvm /mnt/x\x20y\x20z (the mount path is mangled with \xNN that is visible in raw lsblk output only and which is used in blkdeactive as well) Before this patch: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: umount: /mnt/x\x20y\x20z: not found After this patch applied: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/x\x20y\x20z LVM: deactivating Logical Volume vg/lvol0	2013-01-23 14:45:41 +01:00
Zdenek Kabelac	8bcc1da2f3	locales: use higher prio LC_ALL variable For reseting locale environment into significantly less memory consuming version 'C' - use LC_ALL instead of LANG since it has higher priority in locale settings. Otherwise we may observe whole locale-archive which might be over 100MB on i.e. Fedora systems locked in memory with some daemons.	2013-01-22 11:25:02 +01:00
Petr Rockai	142c4bf9f0	Update WHATS_NEW.	2013-01-16 11:22:08 +01:00
Zdenek Kabelac	2b760a7fa7	whatsnew	2013-01-11 09:26:51 +01:00
Alasdair G Kergon	7f747a0d73	logging: add debug classes Add log/debug_classes to lvm.conf to allow debug messages to be classified and filtered at runtime. The dm_errno field is only used by log_error(), so I've redefined it for log_debug() messages to hold the message class. By default, all existing messages appear, but we can add categories that generate high volumes of data, such as logging all traffic to/from lvmetad.	2013-01-07 22:25:19 +00:00
Peter Rajnoha	ad85b0c526	pvscan: synchronize with udev if pvscan --cache is used. We need to call sync_local_dev_names directly as pvscan uses VG_GLOBAL lock and this one does not cause the synchronization (sync_dev_names) to be called on unlock (VG_GLOBAL is not a real VG): define unlock_vg(cmd, vol) do { \ if (is_real_vg(vol)) \ sync_dev_names(cmd); \ (void) lock_vol(cmd, vol, LCK_VG_UNLOCK); \ } while (0) Without this fix, we end up without udev synchronization for the pvscan --cache (mainly for -aay that causes the VGs/LVs to be autoactivated) and also udev synchronization cookies are then left in the system since they're not managed properly (code before sets up udev sync cookies, but we have to call dm_udev_wait at least once after that to do the wait and cleanup).	2012-12-21 11:15:46 +01:00
Peter Rajnoha	756bcabbfe	activation: fix autoactivation to not trigger on each PV change Before, the pvscan --cache -aay was called on each ADD and CHANGE uevent (for a device that is not a device-mapper device) and each CHANGE event (for a PV that is a device-mapper device). This causes troubles with autoactivation in some cases as CHANGE event may originate from using the OPTION+="watch" udev rule that is defined in 60-persistent-storage.rules (part of the rules provided by udev directly) and it's used for all block devices (except fd\|mtd\|nbd\|gnbd\|btibm\|dm-\|md* devices). For example, the following sequence incorrectly activates the rest of LVs in a VG if one of the LVs in the VG is being removed: [root@rhel6-a ~]# pvcreate /dev/sda Physical volume "/dev/sda" successfully created [root@rhel6-a ~]# vgcreate vg /dev/sda Volume group "vg" successfully created [root@rhel6-a ~]# lvcreate -l1 vg Logical volume "lvol0" created [root@rhel6-a ~]# lvcreate -l1 vg Logical volume "lvol1" created [root@rhel6-a ~]# vgchange -an vg 0 logical volume(s) in volume group "vg" now active [root@rhel6-a ~]# lvs LV VG Attr LSize Pool Origin Data% Move Log Cpy%Sync Convert lvol0 vg -wi------ 4.00m lvol1 vg -wi------ 4.00m [root@rhel6-a ~]# lvremove -ff vg/lvol1 Logical volume "lvol1" successfully removed [root@rhel6-a ~]# lvs LV VG Attr LSize Pool Origin Data% Move Log Cpy%Sync Convert lvol0 vg -wi-a---- 4.00m ...so the vg was deactivated, then lvol1 removed, and we end up with lvol1 removed (which is ok) BUT with lvol0 activated (which is wrong)!!! This is because after lvol1 removal, we need to write metadata to the underlying device /dev/sda and that causes the CHANGE event to be generated (because of the WATCH udev rule set on this device) and this causes the pvscan --cache -aay to be reevaluated. We have to limit this and call pvscan --cache -aay to autoactivate VGs/LVs only in these cases: --> if the PV is not a dm device, scan only after proper device addition (ADD event) and not with any other changes (CHANGE event) --> if the PV is a dm device, scan only after proper mapping activation (CHANGE event + the underlying PV in a state "just activated")	2012-12-21 10:34:48 +01:00
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read where not in-sync. The kernel enforces this rule unless overridden by not allowing the creation of an array that is not in-sync and includes a devices that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is in-active 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
Peter Rajnoha	0379c480e0	WHATS_NEW: changelog for `fae1a611d2` and `5294a6f77a`	2012-12-18 12:12:58 +01:00
Zdenek Kabelac	401c9aba4a	pv_read: add missing check for valid info If the lvmcache_info_from_pvid() fails to find valid info, invoke the lookup by dev, and only in this case call lvmcache_info_from_pvid() again. Also check for the result of info and return error directly, so the NULL is not passed to lvmcache_get_label().	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	3e8dbfaecf	lvmetad: add check for failure dm_config_write_node Detect if dm_config_write_node failed and fail correctly.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	4008f4f891	lvmetad: fix socket leak in handle_connect Close socket_fd and report error on malloc failure.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	e012d0635d	lvmetad: check id_read_format error status Detect error from id_read_format() function.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	ba3f37c9e4	lvmetad: fix memleak on pv_found error path Free resources allocated in pv_found when going out through error path.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	a4269aadf3	lvmetad: unlock vg on out-of-memory path If we fail to get memory for mutex, hash the mutex or fail somewhere along pthread function calls return allocated resources back and unlock vg_lock_map mutex.	2012-12-15 17:23:26 +01:00
Zdenek Kabelac	788ac7fa54	libdaemon: check for strdup result Detect failure of dm_pool_strdup() and print error in fail path. Save one extra strchr call - since we already know the distance for the '=' character. Drop stack trace from return after log_error().	2012-12-15 17:23:26 +01:00
Zdenek Kabelac	ff5612c0c3	format-text: check for _text_create_text_instance Test if 'fid' creation failed and report stack trace, break the loop and do not pass NULL fid further.	2012-12-15 17:23:23 +01:00
Zdenek Kabelac	740ab81d03	log: move abort past syslog When the abort_on_internal_errors is enabled, we aborted prior the syslog logging output. Since such fatal error gets level _LOG_FATAL it should not be blocked by debug_level() check so lets move it further, to get abort error logged also via syslog.	2012-12-15 17:22:48 +01:00
Andy Grover	58b61c252a	python-lvm: Small fixups to new create_lv_snapshot Tabify Remove use of asize, unneeded. Don't initialize lvobj->parent_vgobj to NULL, the object ctor already zeroed everything on alloc. Redo call to lvm_lv_snapshot to use the liblvm snapshot implementation we went with. Add {}s to silence warning in lv_dealloc. Rename snapshot function for consistency. Update WHATS_NEW. Signed-off-by: Andy Grover <agrover@redhat.com>	2012-12-14 10:30:26 -08:00
Petr Rockai	c089029b70	Update WHATS_NEW.	2012-12-12 15:17:08 +01:00
Peter Rajnoha	e5709a32be	lvmetad: fix compiler warning and add WHATS_NEW line for previous commit	2012-12-12 13:27:25 +01:00
Peter Rajnoha	cad22be394	lvconvert: allow lvconvert --stripes/stripesize only with -mirrors/--repair/--thinpool Also, update lvconvert man page to reflect this and make clear that the --stripes/stripesize is applied to newly allocated space only.	2012-12-11 15:50:25 +01:00
Zdenek Kabelac	ec49f07b0d	mirrors: fix leak in device_is_usable mirror check Function _ignore_blocked_mirror_devices was not release allocated strings images_health and log_health. In error paths it was also not releasing dm_task structure. Swaped return code of _ignore_blocked_mirror_devices and use 1 as success. In _parse_mirror_status use log_error if memory allocation fails and few more errors so they are no going unnoticed as debug messages. On error path always clear return values and free strings. For dev_create_file use cache mem pool to avoid memleak.	2012-12-11 11:15:22 +01:00
Peter Rajnoha	f942ae4a7a	lvconvert: do not ignore -f in lvconvert --repair -y -f	2012-12-11 09:52:54 +01:00
Jonathan Brassow	3835755259	pvmove/RAID: Disallow pvmove on RAID LVs until properly handled Attempting pvmove on RAID LVs replaces the kernel RAID target with a temporary pvmove target, ultimately destroying the RAID LV. pvmove must be prevented on RAID LVs for now. Use 'lvconvert --replace old_pv vg/lv new_pv' if you want to move an image of the RAID LV.	2012-12-04 17:47:47 -06:00
Peter Rajnoha	e2be2652ad	Allow empty activation/{auto_activation\|read_only\|}_volume_list config option. In case we don't want to activate, autoactivate or have the VG/LV read-only. Primarily targeted for the auto_activation_volume_list, but it makes no harm for other settings (the part of the code that reads these three settings is shared, but there's no reason to separate it only for this change).	2012-12-04 10:33:54 +01:00
Zdenek Kabelac	5ec20e267f	thin: reworked thin feature detection Rework thin feature detection to support runtime section to allow to disable them selectively. New lvm.conf option is born: global/thin_disabled_features	2012-12-03 11:57:40 +01:00
Zdenek Kabelac	99018b37ee	thin: lvconvert supports swapping metadata device Support swapping of metadata device if the thin pool already exists. This way it's easy to i.e. resize metadata or their repair operation. User may create some empty LV, replace existing metadata or dump and restore them into bigger LV.	2012-12-02 18:01:27 +01:00
Zdenek Kabelac	6987a353de	thin: add detach_pool_metadata_lv Add internal function detach_pool_metadata_lv().	2012-12-02 17:56:29 +01:00
Zdenek Kabelac	9ec474f38a	lvm2api: fix size reporting API is reporting all sizes as 64bit integers in bytes. Fix at those places, where sectors were returned to remain consistent.	2012-12-02 17:55:08 +01:00
Peter Rajnoha	4891a735d3	udev: recognize DM_DISABLE_UDEV environment variable Setting this environment variable will cause a full fallback to old direct node and symlink management in libdevmapper and lvm2. It means: - disabling udev synchronization (--noudevsync in dmsetup and --noudevsync + activation/udev_sync=0 lvm2 config) - disabling dm and any subsystem related udev rules (--noudevrules in dmsetup and activation/udev_rules=0 lvm2 config) - management of nodes/symlinks under /dev directly by libdevmapper/lvm2 (--verifyudev in dmsetup and activation/verify_udev_operations=1 lvm2 config) - not obtaining any device list from udev database (devices/obtain_device_list_from_udev=0 lvm2 config) Note: we could set all of these before - there's no functional change! However the DM_DISABLE_UDEV environment variable is a nice shortcut to make it easier for libdevmapper users so that one can switch off all of the udev management off at one go directly on the command line, without a need to modify any source or add any extra switches.	2012-11-29 14:03:48 +01:00
Peter Rajnoha	fb8cc7c63f	udev: do not verify udev operations for --noudevsync If udev synchronization is disabled by means of --noudevsync option, we should disable just the synchronization and nothing else. The udev fallback (verifying udev operations and fixing the nodes/symlinks if found incorrect) is orthogonal and controlled by a separate activation/verify_udev_operations configuration option.	2012-11-29 13:59:12 +01:00
Zdenek Kabelac	0387e70d76	thin: fix property discard for lvm2api Discards property is string and may have these values: ignore, nopassdown, passdown	2012-11-27 14:09:49 +01:00
Zdenek Kabelac	09b7ceea95	thin: allow restore with --force Allow restoring metadata with thin pool volumes. No validation is done for this case within vgcfgrestore tool - thus incorrect metadata may lead to destruction of pool content.	2012-11-27 14:08:24 +01:00
Alasdair G Kergon	8c49aa79e7	filters: Add STEC skd and Violin vtms devices	2012-11-26 14:55:17 +00:00
Zdenek Kabelac	1ef9831018	thin: support configurable thin pool defaults Configurable settings for thin pool create if they are not specified on command line. New supported lvm.conf options are: allocation/thin_pool_chunk_size allocation/thin_pool_discards allocation/thin_pool_zero	2012-11-26 12:16:47 +01:00
Zdenek Kabelac	683b1f0625	thin: detect discards for non-power-2 Check if target supports discards for chunk sizes, that are not power of 2 (just multiple of 64K), and enable it in case it's supported by thin kernel target.	2012-11-26 12:14:47 +01:00
Petr Rockai	60668f823e	Automatically restore MISSING PVs with no MDAs.	2012-11-25 20:41:56 +01:00
Jonathan Brassow	b3e9a09abe	RAID: If no stripes argument is given for RAID10 create, default to 2 Similar to the way the 'mirror', 'raid1' and 'raid10' segment types set the number of mirrors to 2 ('-m 1') if the argument is not specified, here we set the number of stripes to 2 if not given on the command line when creating a RAID10 LV.	2012-11-21 18:46:52 -06:00
Jonathan Brassow	fb0cee9a66	RAID: Do not allow --splitmirrors on RAID10 logical volumes. RAID10 does not have the ability to split off images for independent use. So, 'lvconvert --splitmirrors' will not work and must be disallowed.	2012-11-21 18:39:26 -06:00
Zdenek Kabelac	d5697b29ee	mm: skip mlocking [vectors] Somehow forgotten: https://www.redhat.com/archives/linux-lvm/2012-June/msg00019.html Need for arm architecture support.	2012-11-20 10:02:51 +01:00
Zdenek Kabelac	b21d3e3592	thin: lvconvert update Use common function from toollib and support allocation of metadata LV with give thin pool data LV.	2012-11-19 14:38:17 +01:00
Zdenek Kabelac	f4137640f6	thin: add common pool functions Move common functions for lvcreate and lvconvert. get_pool_params() - read thin pool args. update_pool_params() - updates/validates some thin args. It is getting complicated and even few more things will be implemented, so to avoid reimplementing things differently in lvcreate and lvconvert code has been splitted into 2 common functions that allow some future extension.	2012-11-19 14:38:17 +01:00
Jonathan Brassow	54c73b7723	mirror: Mirrored log should be fixed before mirror when double fault occurs This patch is intended to fix bug 825323 - FS turns read-only during a double fault of a mirror leg and mirrored log's leg at the same time. It only affects a 2-way mirror with a mirrored log. 3+-way mirrors and mirrors without a mirrored log are not affected. The problem resulted from the fact that the top level mirror was not using 'noflush' when suspending before its "down-convert". When a mirror image fails, the bios are queue until a suspend is recieved. If it is a 'noflush' suspend, the bios can be safely requeued in the DM core. If 'noflush' is not used, the bios must be pushed through the target and if a device is failed for a mirror, that means issuing an error. When an error is received by a file system, it results in it turning read-only (depending on the FS). Part of the problem was is due to the nature of the stacking involved in using a mirror as a mirror's log. When an image in each fail, the top level mirror stalls because it is waiting for a log flush. The other stalls waiting for corrective action. When the repair command is issued, the entire stacked arrangement is collapsed to a linear LV. The log flush then fails (somewhat uncleanly) and the top-level mirror is suspended without 'noflush' because it is a linear device. This patch allows the log to be repaired first, which in turn allows the top-level mirror's log flush to complete cleanly. The top-level mirror is then secondarily reduced to a linear device - at which time this mirror is suspended properly with 'noflush'.	2012-11-14 14:58:47 -06:00
Tony Asleson	7a34db0cfd	python-lvm: Initial check-in of python-lvm unit test case. Signed-off-by: Tony Asleson <tasleson@redhat.com>	2012-11-14 13:18:37 -06:00
Peter Rajnoha	fc2644ae71	pvscan: exit --cache immediately if locking_type=3 \|\| use_lvmetad=0	2012-11-09 15:56:57 +01:00
Peter Rajnoha	360c569ce8	systemd: various updates and fixes Don't use lvmetad in lvm2-monitor.service ExecStop to avoid a systemd issue. - a systemd design issue while processing dependencies with socket-based activation that ends up with a hang - https://bugzilla.redhat.com/show_bug.cgi?id=843587 (also tracker bug https://bugzilla.redhat.com/show_bug.cgi?id=871527) - not using lvmetad in this case is just a workaround, once the bug above is resolved, we should enable the lvmetad in that specific case Remove dependency on fedora-storage-init.service in lvm2 systemd units. - fedora-storage-init.service and fedora-storage-init-late.service is going to be separated into respective units that belong to each block device subsystem: - mpath + mdraid activated via udev solely - dmraid with its own dmraid-activation.service unit - lvm2 with the lvm2-activation-generator to generate the activation units runtime if lvmetad disabled (global/use_lvmetad=0 set in lvm.conf) and activation done via udev+lvmetad if lvmetad enabled (global/use_lvmetad=1 set in lvm.conf) Depend on lvm2-lvmetad.socket in lvm2-monitor.service systemd unit. - as lvm2-monitor uses lvmetad if lvmetad is enabled	2012-10-30 20:55:50 +01:00
Peter Rajnoha	10492b238d	lvmetad: whats_new + more explanation for previous commit	2012-10-25 14:47:45 +02:00
Jonathan Brassow	b248ba0a39	mirror: Avoid reading mirrors with failed devices in mirrored log Commit `9fd7ac7d03` did not handle mirrors that contained mirrored logs. This is because the status line of the mirror does not give an indication of the health of the mirrored log, as you can see here: [root@bp-01 lvm2]# dmsetup status vg-lv vg-lv_mlog vg-lv: 0 409600 mirror 2 253:6 253:7 400/400 1 AA 3 disk 253:5 A vg-lv_mlog: 0 8192 mirror 2 253:3 253:4 7/8 1 AD 1 core Thus, the possibility for LVM commands to hang still persists when mirror have mirrored logs. I discovered this while performing some testing that does polling with 'pvs' while doing I/O and killing devices. The 'pvs' managed to get between the mirrored log device failure and the attempt by dmeventd to repair it. The result was a very nasty block in LVM commands that is very difficult to remove - even for someone who knows what is going on. Thus, it is absolutely essential that the log of a mirror be recursively checked for mirror devices which may be failed as well. Despite what the code comment says in the aforementioned commit... + * _mirrored_transient_status(). FIXME: It is unable to handle mirrors + * with mirrored logs because it does not have a way to get the status of + * the mirror that forms the log, which could be blocked. ... it is possible to get the status of the log because the log device major/minor is given to us by the status output of the top-level mirror. We can use that to query the log device for any DM status and see if it is a mirror that needs to be bypassed. This patch does just that and is now able to avoid reading from mirrors that have failed devices in a mirrored log.	2012-10-25 00:42:45 -05:00
Jonathan Brassow	9fd7ac7d03	mirror: Avoid reading from mirrors that have failed devices Addresses: rhbz855398 (Allow VGs to be built on cluster mirrors), and other issues. The LVM code attempts to avoid reading labels from devices that are suspended to try to avoid situations that may cause the commands to block indefinitely. When scanning devices, 'ignore_suspended_devices' can be set so the code (lib/activate/dev_manager.c:device_is_usable()) checks any DM devices it finds and avoids them if they are suspended. The mirror target has an additional mechanism that can cause I/O to be blocked. If a device in a mirror fails, all I/O will be blocked by the kernel until a new table (a linear target or a mirror with replacement devices) is loaded. The mirror indicates that this condition has happened by marking a 'D' for the faulty device in its status output. This condition must also be checked by 'device_is_usable()' to avoid the possibility of blocking LVM commands indefinitely due to an attempt to read the blocked mirror for labels. Until now, mirrors were avoided if the 'ignore_suspended_devices' condition was set. This check seemed to suggest, "if we are concerned about suspended devices, then let's ignore mirrors altogether just in case". This is insufficient and doesn't solve any problems. All devices that are suspended are already avoided if 'ignore_suspended_devices' is set; and if a mirror is blocking because of an error condition, it will block the LVM command regardless of the setting of that variable. Rather than avoiding mirrors whenever 'ignore_suspended_devices' is set, this patch causes mirrors to be avoided whenever they are blocking due to an error. (As mentioned above, the case where a DM device is suspended is already covered.) This solves a number of issues that weren't handled before. For example, pvcreate (or any command that does a pv_read or vg_read, which eventually call device_is_usable()) will be protected from blocked mirrors regardless of how 'ignore_suspended_devices' is set. Additionally, a mirror that is neither suspended nor blocking is /allowed/ to be read regardless of how 'ignore_suspended_devices' is set. (The latter point being the source of the fix for rhbz855398.)	2012-10-23 23:10:33 -05:00
Jonathan Brassow	b873fc54ba	WHATS_NEW: Entry for commit `e191780947` WHATS_NEW commit for 'lvs' output change to add RAID 4/5/6 sync %age to s/Copy%/Cpy%Sync/ output.	2012-10-23 21:38:37 -05:00
Zdenek Kabelac	13fe333b54	clvmd: fix parsing of -d argument clvmd -d option parsing was not working properly. clvmd -d 2 (with space) has been ignored because of '::' used in getopt string, and as failsafe it's been used '1'. Later this debug_arg has been ignored and debug_opt was used instead which happend to have value '1'. Submitted-by: Robert Milasan <rmilasan at suse.com> Reported-by: Robert Milasan <rmilasan at suse.com>	2012-10-19 15:35:56 +02:00
Zdenek Kabelac	5f5a5d1f53	lvchange: support --yes option for --persistent Support using command: lvchange --yes --persistent to skip y\|n prompt.	2012-10-19 15:33:46 +02:00
Zdenek Kabelac	c7c53ad41d	pvcreate: fix leak on error path Missing vg release on error path. Add tests for few more error cases.	2012-10-19 15:32:21 +02:00
Zdenek Kabelac	bf2741376d	Use lv_is_active instead of lv_info() Usage of lv_is_active makes it more obvious what is being checked.	2012-10-17 15:42:31 +02:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Alasdair G Kergon	ea6a8078b4	release: prepare for release	2012-10-15 15:19:32 +01:00
Zdenek Kabelac	b3899056d9	thin: disable conversion of thin-pool to read-only This change is not yet supported.	2012-10-15 14:09:11 +02:00
Zdenek Kabelac	2fc1fc3a93	thin: allow to create read-only thin-volumes Useful for i.e. read-only thin snapshots.	2012-10-15 14:07:03 +02:00
Peter Rajnoha	4dace48f51	Remove pvscan --cache from lvm2-lvmetad init script. This is not needed anymore as the scan is called transparently within the first LVM command that queries lvmetad.	2012-10-15 12:58:23 +02:00
Alasdair G Kergon	78dafcba99	lvmetad: use -l for logging level not -d	2012-10-15 10:44:43 +01:00
Alasdair G Kergon	a0e60d27ff	lvmetad: document and tidy cmdline args Try to bring the lvmetad usage text and man page closer to the code. There seem to be 3 useful ways to use -d with lvmetad at the moment: -d all -d wire -d debug (They can also be comma-separated like -d wire,debug.) Prior to the last release, -d, -dd and -ddd were supported. Fail if an unrecognised debug arg is supplied on the command line. Change -V to report the same version as the lvm binary: previously it just reported version 0.	2012-10-15 02:06:27 +01:00
Zdenek Kabelac	16060b101b	thin: lvextend will fail is autoextend is 0% Since extending by 0% will not increase the size of pool, return failure.	2012-10-14 23:17:30 +02:00
Peter Rajnoha	2679c68689	WHATS_NEW: update	2012-10-12 14:47:40 +02:00
Petr Rockai	141f26035d	Update WHATS_NEW.	2012-10-12 13:24:06 +02:00
Zdenek Kabelac	3058f662cf	thin: prohibit lvcreate --thinpool with mirrors Disable --thinpool to be used with mirror on lvcreate.	2012-10-12 12:21:45 +02:00
Zdenek Kabelac	be291e1064	thin: lvm2api return origin property for thin LV	2012-10-12 12:20:55 +02:00
Alasdair G Kergon	ee3cfa4184	python: Add bindings for liblvm2app. Use configure --enable-python_bindings to generate them. Note that the Makefiles do not yet control the owner or permissions of the two new files on installation.	2012-10-12 02:08:47 +01:00
Zdenek Kabelac	0a46160d94	lvm2api: add defined lvm_percent_to_float Implement function which was somehow missing from it's original placement in the header file lvm2api.h.	2012-10-11 17:29:56 +02:00
Zdenek Kabelac	ca09c9ab4c	thin: support non power of 2 chunk size Support thin chunk size with multiple of 64KiB if user has thin-pool target version at least 1.2.	2012-10-10 21:21:00 +02:00
Jonathan Brassow	3501f17fd0	[lv\|vg]change: Allow limited metadata changes when PVs are missing A while back, the behavior of LVM changed from allowing metadata changes when PVs were missing to not allowing changes. Until recently, this change was tolerated by HA-LVM by forcing a 'vgreduce --removemissing' before trying (again) to add tags to an LV and then activate it. LVM mirroring requires that failed devices are removed anyway, so this was largely harmless. However, RAID LVs do not require devices to be removed from the array in order to be activated. In fact, in an HA-LVM environment this would be very undesirable. Device failures in such an environment can often be transient and it would be much better to restore the device to the array than synchronize an entirely new device. There are two methods that can be used to setup an HA-LVM environment: "clvm" or "tagging". For RAID LVs, "clvm" is out of the question because RAID LVs are not supported in clustered VGs - not even in an exclusively activated manner. That leaves "tagging". HA-LVM uses tagging - coupled with 'volume_list' - to ensure that only one machine can have an LV active at a time. If updates are not allowed when a PV is missing, it is impossible to add or remove tags to allow for activation. This removes one of the most basic functionalities of HA-LVM - site redundancy. If mirroring or RAID is used to replicate the storage in two data centers and one of them goes down, a server and a storage device are lost. When the service fails-over to the alternate site, the VG will be "partial". Unable to add a tag to the VG/LV, the RAID device will be unable to activate. The solution is to allow vgchange and lvchange to alter the LVM metadata for a limited set of options - --[add\|del]tag included. The set of allowable options are ones that do not cause changes to the DM kernel target (like --resync would) or could alter the structure of the LV (like allocation or conversion).	2012-10-10 11:33:10 -05:00
Zdenek Kabelac	cdb7502e54	lvchange: do not start dmevent for resyn If monitoring is disabled in lvm.conf, avoid its starting and preserve DMEVENTD_MONITOR_IGNORE settings internally.	2012-10-09 12:22:26 +02:00
Peter Rajnoha	7a64fff948	systemd: remove ExecStartPost from lvm2-lvmetad.service. The ExecStartPost with pvscan --cache in lvm2-lvmetad.service is not needed now as this is called transparently within the first LVM command that queries lvmetad.	2012-10-08 16:49:54 +02:00
Zdenek Kabelac	ff13206c7e	report: call snapshot percent with cow only Ensure lv_snapshot_percent is used only with snapshot LVs.	2012-10-08 12:16:53 +02:00
Zdenek Kabelac	5b07bd3f91	lvconvert: disable convertion of thin to mirrors For now this convertions is not supported, thus disabled. The only supported conversion for now is to create mirrored thin pools from mirrored devices.	2012-10-08 12:16:53 +02:00
Zdenek Kabelac	1da6c1495a	lvm2api: fix data percent reporting for thin, snap Use same logic for lvm2api as we use lvs reporting. data_percent is meant to be superset for snap_percent.	2012-10-05 10:37:09 +02:00
Jonathan Brassow	9efd3fb604	RAID: Do not allow RAID LVs in a cluster volume group. It would be possible to activate a RAID LV exclusively in a cluster volume group, but for now we do not allow RAID LVs to exist in a clustered volume group at all. This has two components: 1) Do not allow RAID LVs to be created in a clustered VG 2) Do not allow changing a VG from single-machine to clustered if there are RAID LVs present.	2012-10-03 15:52:54 -05:00
Zdenek Kabelac	a27650cc98	thin: lvconvert Update code for lvconvert. Change the lvconvert user interface a bit - now we require 2 specifiers --thinpool takes LV name for data device (and makes the name) --poolmetadata takes LV name for metadata device. Fix type in thin help text -z -> -Z. Supported is also new flag --discards for thinpools.	2012-10-03 15:13:33 +02:00
Zdenek Kabelac	e9f83147d5	thin: lvchange allows to change perms of thin snap Thin snapshots are individual thin volumes so they can have its own control for rw permissions.	2012-10-03 15:13:32 +02:00
Zdenek Kabelac	d442c3ef0c	liblvm: insert layer with subvolume renames Rename also subvolumes if we are inserting _tdata layer. (Currently it breaks mirrors if it would be generic, needs fixing).	2012-10-03 15:13:32 +02:00
Zdenek Kabelac	cf8e1a0093	thin: origin only suspend Skip tree creating when used with origin_only flag.	2012-10-03 15:05:55 +02:00
Zdenek Kabelac	21c401006c	liblvm: add lv_rename_update Support lv_rename without directly updating metatata. It can save some metadata commits in some cases, i.e. when LVs are offline.	2012-10-03 15:03:49 +02:00
Zdenek Kabelac	739092e64a	liblvm2cmd: ensure standard descriptors are ready Check if FDs 0,1,2 are available, and in case they are missing, use /dev/null for them.	2012-10-03 15:02:26 +02:00
Zdenek Kabelac	1f30e048bd	liblvm2cmd: add return code for _close_stray_fds Close fds via /proc/self/fd parsing Return error code if _close_stray_fds fails and quit application if system is in some nonstandard state.	2012-10-03 15:01:23 +02:00
Zdenek Kabelac	98bcfdca83	configure: fix --enable-testing Add missing pkg init for configure --enable-testing.	2012-10-03 14:59:59 +02:00
Jonathan Brassow	886656e4ac	RAID: Fix problems with creating, extending and converting large RAID LVs MD's bitmaps can handle 2^21 regions at most. The RAID code has always used a region_size of 1024 sectors. That means the size of a RAID LV was limited to 1TiB. (The user can adjust the region_size when creating a RAID LV, which can affect the maximum size.) Thus, creating, extending or converting to a RAID LV greater than 1TiB would result in a failure to load the new device-mapper table. Again, the size of the RAID LV is not limited by how much space is allocated for the metadata area, but by the limitations of the MD bitmap. Therefore, we must adjust the 'region_size' to ensure that the number of regions does not exceed the limit. I've added code to do this when extending a RAID LV (which covers 'create' and 'extend' operations) and when up-converting - specifically from linear to RAID1.	2012-09-27 16:51:22 -05:00
Alasdair G Kergon	290ae4791e	lvs: add partial attribute	2012-09-19 12:49:40 +01:00
Alasdair G Kergon	b737ff01e4	discards: skip when removing LVs on missing PVs Don't try to issue discards to a missing PV to avoid segfault. Prevent lvremove from removing LVs that have any part missing. https://bugzilla.redhat.com/857554	2012-09-19 12:48:56 +01:00
Jonathan Brassow	2a6712ddef	RAID1: Clear the LV_NOTSYNCED flag when a RAID1 LV is converted to linear Failing to clear the LV_NOTSYNCED flag when converting a RAID1 LV to linear can result in the flag being present after an upconvert - even if the sync is performed when upconverting.	2012-09-14 16:26:53 -05:00
Jonathan Brassow	116bcb3ea4	RAID1: Like mirrors, do not allow adding images to LV created w/ --nosync Mirrors do not allow upconverting if the LV has been created with --nosync. We will enforce the same rule for RAID1. It isn't hugely critical, since the portions that have been written will be copied over to the new device identically from either of the existing images. However, the unwritten sections may be different, causing the added image to be a hybrid of the existing images. Also, we are disallowing the addition of new images to a RAID1 LV that has not completed the initial sync. This may be different from mirroring, but that is due to the fact that the 'mirror' segment type "stacks" when adding a new image and RAID1 does not. RAID1 will rebuild a newly added image "inline" from the existant images, so they should be in-sync.	2012-09-14 16:12:52 -05:00
Peter Rajnoha	6d75ff138c	systemd: depend on systemd-udev-settle unit in activation unit The "fedora-wait-storage.service" that the "lvm2-activation.service" had as a dependency (which was fedora-specific solution anyway) is obsolete now as this unit called "modprobe scsi_wait_scan" which is not used anymore. The "fedora-wait-storage.service" had "systemd-udev-settle" as its dependency, so let's depend on this one directly now, bypassing the out-dated "fedora-wait-storage.service".	2012-09-12 11:30:13 +02:00
Peter Rajnoha	3127160626	vgchange: fix -aay to activate proper volumes Using 'activation/auto_activation_volume_list = [ "vg/lvol1" ]'. Before this patch: 3 logical volume(s) in volume group "vg" now active LV VG Attr LSize Pool Origin Data% Move Log Copy% Convert lvol0 vg -wi----- 4.00m lvol1 vg -wi-a--- 4.00m lvol2 vg -wi-a--- 4.00m lvol3 vg -wi-a--- 4.00m (vg/lvol1 activated as it passes the list and all subsequent volumes too - wrong!) With this patch: 1 logical volume(s) in volume group "vg" now active LV VG Attr LSize Pool Origin Data% Move Log Copy% Convert lvol0 vg -wi----- 4.00m lvol1 vg -wi-a--- 4.00m lvol2 vg -wi----- 4.00m lvol3 vg -wi----- 4.00m (only vg/lvol1 activated as it passes the list and no other - correct!)	2012-09-12 09:47:40 +02:00

1 2 3 4 5 ...

2620 Commits