shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
Jonathan Brassow	801d4f96a8	RAID: Improve 'lvs' attribute reporting of RAID LVs and sub-LVs

There are currently a few issues with the reporting done on RAID LVs and sub-LVs. The most concerning is that 'lvs' does not always report the correct failure status of individual RAID sub-LVs (devices). This can occur when a device fails and is restored after the failure has been detected by the kernel. In this case, 'lvs' would report all devices are fine because it can read the labels on each device just fine.

Example:

  [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   rwi-a-r--   100.00 lv_rimage_0(0),lv_rimage_1(0)
    [lv_rimage_0] vg   iwi-aor--          /dev/sda1(1)
    [lv_rimage_1] vg   iwi-aor--          /dev/sdb1(1)
    [lv_rmeta_0]  vg   ewi-aor--          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor--          /dev/sdb1(0)

However, 'dmsetup status' on the device tells us a different story:

  [root@bp-01 lvm2]# dmsetup status vg-lv
  0 1024000 raid raid1 2 DA 1024000/1024000

In this case, we must also be sure to check the RAID LVs kernel status in order to get the proper information. Here is an example of the correct output that is displayed after this patch is applied:

  [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   rwi-a-r-p   100.00 lv_rimage_0(0),lv_rimage_1(0)
    [lv_rimage_0] vg   iwi-aor-p          /dev/sda1(1)
    [lv_rimage_1] vg   iwi-aor--          /dev/sdb1(1)
    [lv_rmeta_0]  vg   ewi-aor-p          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor--          /dev/sdb1(0)

The other case where 'lvs' gives incomplete or improper output is when a device is replaced or added to a RAID LV. It should display that the RAID LV is in the process of sync'ing and that the new device is the only one that is not-in-sync - as indicated by a leading 'I' in the Attr column. (Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates an (I)mage that is not in sync.)

Here's an example of the old incorrect behaviour:

  [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   rwi-a-r--   100.00 lv_rimage_0(0),lv_rimage_1(0)
    [lv_rimage_0] vg   iwi-aor--          /dev/sda1(1)
    [lv_rimage_1] vg   iwi-aor--          /dev/sdb1(1)
    [lv_rmeta_0]  vg   ewi-aor--          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor--          /dev/sdb1(0)
  [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   rwi-a-r--     0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0)
    [lv_rimage_0] vg   Iwi-aor--          /dev/sda1(1)
    [lv_rimage_1] vg   Iwi-aor--          /dev/sdb1(1)
    [lv_rimage_2] vg   Iwi-aor--          /dev/sdc1(1)
    [lv_rmeta_0]  vg   ewi-aor--          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor--          /dev/sdb1(0)
    [lv_rmeta_2]  vg   ewi-aor--          /dev/sdc1(0)

Note that all the images currently are marked as 'I' even though it is only the last device that has been added that should be marked.

Here is an example of the correct output after this patch is applied:

  [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   rwi-a-r--   100.00 lv_rimage_0(0),lv_rimage_1(0)
    [lv_rimage_0] vg   iwi-aor--          /dev/sda1(1)
    [lv_rimage_1] vg   iwi-aor--          /dev/sdb1(1)
    [lv_rmeta_0]  vg   ewi-aor--          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor--          /dev/sdb1(0)
  [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   rwi-a-r--     0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0)
    [lv_rimage_0] vg   iwi-aor--          /dev/sda1(1)
    [lv_rimage_1] vg   iwi-aor--          /dev/sdb1(1)
    [lv_rimage_2] vg   Iwi-aor--          /dev/sdc1(1)
    [lv_rmeta_0]  vg   ewi-aor--          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor--          /dev/sdb1(0)
    [lv_rmeta_2]  vg   ewi-aor--          /dev/sdc1(0)

Note only the last image is marked with an 'I'. This is correct and we can tell that it isn't the whole array that is sync'ing, but just the new device.

It also works under snapshots...

  [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg
    LV            VG   Attr      Cpy%Sync Devices
    lv            vg   owi-a-r-p    33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0)
    [lv_rimage_0] vg   iwi-aor--          /dev/sda1(1)
    [lv_rimage_1] vg   Iwi-aor-p          /dev/sdb1(1)
    [lv_rimage_2] vg   Iwi-aor--          /dev/sdc1(1)
    [lv_rmeta_0]  vg   ewi-aor--          /dev/sda1(0)
    [lv_rmeta_1]  vg   ewi-aor-p          /dev/sdb1(0)
    [lv_rmeta_2]  vg   ewi-aor--          /dev/sdc1(0)
    snap          vg   swi-a-s--          /dev/sda1(51201)
	2013-02-01 11:33:54 -06:00
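A simplified sketch of the reporting rule from commit 801d4f96a8, written in C for illustration only. It is not the patch itself: `image_attr_char` and `image_is_failed` are hypothetical names, and the mapping assumes the dm-raid health characters ('A' alive and in-sync, 'a' alive but not in-sync, 'D' dead) as seen in the `DA` field of the `dmsetup status` output quoted in the commit message. The real code also consults LVM metadata, which this sketch ignores.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: first Attr character for RAID image `idx`,
 * derived from the kernel's per-device health string (e.g. "DA").
 * 'i' = image in sync, 'I' = image not in sync. */
char image_attr_char(const char *health, size_t idx)
{
	assert(health != NULL);
	if (idx >= strlen(health))
		return 'i';	/* assumption: no kernel info means "in sync" */
	return (health[idx] == 'a') ? 'I' : 'i';
}

/* Hypothetical helper: 1 if the kernel reports image `idx` as failed ('D'),
 * the case in which 'lvs' shows 'p' in the last Attr position. */
int image_is_failed(const char *health, size_t idx)
{
	assert(health != NULL);
	return idx < strlen(health) && health[idx] == 'D';
}
```

With the health string `DA` from the example, image 0 is reported as failed and image 1 as healthy, which matches the `iwi-aor-p` / `iwi-aor--` pair in the corrected output.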
Jonathan Brassow	37ffe6a13a	RAID: Cache previous results of lv_raid_dev_health for future use We can avoid many dev_manager (ioctl) calls by caching the results of previous calls to lv_raid_dev_health. Just considering the case where 'lvs -a' is called to get the attributes of a RAID LV and its sub-lvs, this function would be called many times. (It would be called at least 7 times for a 3-way RAID1 - once for the health of each sub-LV and once for the health of the top-level LV.) This is a good idea because the sub-LVs are processed in groups along with their parent RAID LV and in each case, it is the parent LV whose status will be queried. Therefore, there only needs to be one trip through dev_manager for each time the group is processed.	2013-02-01 11:32:18 -06:00
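The caching idea in commit 37ffe6a13a can be illustrated with a single-entry cache keyed on the parent RAID LV. This is a sketch under assumptions, not the lvm2 implementation: `query_kernel_health` is a hypothetical stand-in for the dev_manager (ioctl) round trip, and `raid_dev_health_cached` is an invented name.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Counts simulated kernel round trips so the effect of caching is visible. */
static int kernel_queries;

/* Hypothetical stand-in for the expensive dev_manager (ioctl) call. */
static const char *query_kernel_health(const char *raid_lv)
{
	(void) raid_lv;
	kernel_queries++;
	return "AAA";
}

/* Single-entry cache: the last RAID LV asked about and its health string. */
static char cached_lv[128];
static char cached_health[64];

const char *raid_dev_health_cached(const char *raid_lv)
{
	assert(raid_lv != NULL);
	if (cached_lv[0] && !strcmp(cached_lv, raid_lv))
		return cached_health;	/* cache hit: no kernel round trip */

	snprintf(cached_health, sizeof(cached_health), "%s",
		 query_kernel_health(raid_lv));
	snprintf(cached_lv, sizeof(cached_lv), "%s", raid_lv);
	return cached_health;
}
```

Because sub-LVs are processed together with their parent, seven consecutive lookups for the same 3-way RAID1 hit the kernel once in this sketch. A single entry is enough only as long as that grouping holds.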
Jonathan Brassow	c8242e5cf4	RAID: Add RAID status accessibility functions Similar to the way thin* accesses its kernel status, we add a method for RAID to grab the various values in its status output without the higher levels (LVM) having to understand how to parse the output. Added functions include: - lib/activate/dev_manager.c:dev_manager_raid_status() Pulls the status line from the kernel - libdm/libdm-deptree.c:dm_get_status_raid() Parses status line and puts components into dm_status_raid struct - lib/activate/activate.c:lv_raid_dev_health() Accesses dm_status_raid to deliver raid dev_health string The new structure and functions can provide a more unified way to access status information. ('lv_raid_percent' could switch to using these functions, for example.)	2013-02-01 11:31:47 -06:00
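To make the parsing step from commit c8242e5cf4 concrete, here is a minimal C sketch that splits a `dmsetup status` line for a raid target into its components. The struct and function names are illustrative assumptions; the real `dm_status_raid` struct and `dm_get_status_raid()` in libdevmapper have their own layout and signature, which are not reproduced here.

```c
#include <assert.h>
#include <stdio.h>

/* Illustrative struct only; not the layout of the real dm_status_raid. */
struct raid_status {
	char raid_type[16];
	unsigned dev_count;
	char dev_health[64];
	unsigned long long insync_regions;
	unsigned long long total_regions;
};

/* Parse "<start> <len> raid <type> <#devs> <health> <insync>/<total>".
 * Returns 1 on success, 0 if the line is not a well-formed raid status. */
int parse_raid_status(const char *line, struct raid_status *s)
{
	unsigned long long start, len;

	assert(line != NULL && s != NULL);
	return sscanf(line, "%llu %llu raid %15s %u %63s %llu/%llu",
		      &start, &len, s->raid_type, &s->dev_count,
		      s->dev_health, &s->insync_regions,
		      &s->total_regions) == 7;
}
```

Callers can then read `dev_health` or the region counters without knowing the status line format, which is the separation the commit message describes.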
Jonathan Brassow	a3cfe9d9b7	Test (RAID): Test for RAID10 activations when devices are missing Test the fix for bug 889358. RAID10 had been failing to activate when there were devices that had failed in more than one mirror set.	2013-01-28 12:32:33 -06:00
Peter Rajnoha	2be83f4543	blkdeactivate: prevent trying to unmount the same mountpoint more times An addendum to previous commit 1052863a1b35f7488758c78b3a9ebef5c63392bc.	2013-01-23 16:57:44 +01:00
Peter Rajnoha	f7da1caf8d	blkdeactivate: fix handling of nested mountpoints and mangled mount paths.

If there was a nested mountpoint inside an existing mount path, blkdeactivate could fail to unmount such a mountpoint as it needs to deactivate the deepest path first and continue upwards.

For example the simplest reproducer:

  [root@rhel6-a ~]# lsblk
  NAME               MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
  sda                  8:0    0    4G  0 disk
  |-vg-lvol0 (dm-2)  253:2    0   32M  0 lvm  /mnt/a
  `-vg-lvol1 (dm-3)  253:3    0   32M  0 lvm  /mnt/a/b

Before this patch:

  [root@rhel6-a ~]# blkdeactivate -u
  Deactivating block devices:
    UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a
  umount: /mnt/a: device is busy. (In some cases useful info about processes that use the device is found by lsof(8) or fuser(1))
    UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b
    LVM: deactivating Logical Volume vg/lvol1

(deactivation of vg/lvol0 is skipped as /mnt/a that is on lvol0 can't be unmounted - it still has /mnt/a/b as nested mountpoint!)

With this patch applied:

  [root@rhel6-a ~]# blkdeactivate -u
  Deactivating block devices:
    UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b
    UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a
    LVM: deactivating Logical Volume vg/lvol0
    LVM: deactivating Logical Volume vg/lvol1

===

Also, this patch contains a fix for processing mangled mount paths:

  [root@rhel6-a ~]# lsblk
  NAME               MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
  sda                  8:0    0    4G  0 disk
  `-vg-lvol0 (dm-2)  253:2    0   32M  0 lvm  /mnt/x y z

  [root@rhel6-a ~]# lsblk -r
  vg-lvol0 253:2 0 32M 0 lvm /mnt/x\x20y\x20z

(the mount path is mangled with \xNN that is visible in raw lsblk output only and which is used in blkdeactivate as well)

Before this patch:

  [root@rhel6-a ~]# blkdeactivate -u
  Deactivating block devices:
  umount: /mnt/x\x20y\x20z: not found

After this patch is applied:

  [root@rhel6-a ~]# blkdeactivate -u
  Deactivating block devices:
    UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/x\x20y\x20z
    LVM: deactivating Logical Volume vg/lvol0
	2013-01-23 14:45:41 +01:00
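The two fixes in commit f7da1caf8d come down to decoding `\xNN` escapes and unmounting the deepest path first. blkdeactivate itself is a shell script, so the C below is only an illustration of the logic, with hypothetical helper names (`unmangle_path`, `cmp_deepest_first`) and the simplifying assumption that nesting depth equals the number of `/` separators.

```c
#include <assert.h>
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/* Decode "\xNN" escapes (as printed by 'lsblk -r') in place. */
void unmangle_path(char *path)
{
	char *in, *out;

	assert(path != NULL);
	for (in = out = path; *in; out++) {
		if (in[0] == '\\' && in[1] == 'x' &&
		    isxdigit((unsigned char) in[2]) &&
		    isxdigit((unsigned char) in[3])) {
			char hex[3] = { in[2], in[3], '\0' };

			*out = (char) strtol(hex, NULL, 16);
			in += 4;
		} else
			*out = *in++;
	}
	*out = '\0';
}

/* Number of '/' separators, used as a simple nesting depth. */
static size_t path_depth(const char *p)
{
	size_t n = 0;

	for (p = strchr(p, '/'); p; p = strchr(p + 1, '/'))
		n++;
	return n;
}

/* qsort comparator over an array of strings: deeper mount points first. */
int cmp_deepest_first(const void *a, const void *b)
{
	size_t da = path_depth(*(const char *const *) a);
	size_t db = path_depth(*(const char *const *) b);

	return (da < db) - (da > db);
}
```

Sorting `{ "/mnt/a", "/mnt/a/b" }` with `qsort` and this comparator puts `/mnt/a/b` first, the order in which the patched tool unmounts them in the transcript above.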
Zdenek Kabelac	8bcc1da2f3	locales: use higher prio LC_ALL variable For resetting the locale environment to the significantly less memory-consuming 'C' locale, use LC_ALL instead of LANG since it has higher priority in locale settings. Otherwise the whole locale-archive, which might be over 100MB on e.g. Fedora systems, may end up locked in memory by some daemons.	2013-01-22 11:25:02 +01:00
Petr Rockai	142c4bf9f0	Update WHATS_NEW.	2013-01-16 11:22:08 +01:00
Petr Rockai	1e4a9534f4	lvmetad: Call _lvmetad_handle_reply in lvmetad_vg_lookup.	2013-01-16 11:19:33 +01:00
Petr Rockai	15fdd5c90d	lvmetad: Fix a race in metadata update. The idea is to avoid a period when an existing VG is not mapped to either the old or the new name. (Note that the brief "blackout" was present even if the name did not actually change.) We instead allow a brief overlap of a VG existing under both names, i.e. a query for a VG might succeed but before a lock is acquired the VG disappears.	2013-01-16 11:19:33 +01:00
Peter Rajnoha	6fc596ca90	dmeventd: close dmeventd FIFO FDs on exec (add FD_CLOEXEC).	2013-01-15 14:59:54 +01:00
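For reference, setting close-on-exec on an already open descriptor is usually done with `fcntl()`. The helper below is a generic POSIX sketch, not the code from commit 6fc596ca90; `set_cloexec` is a hypothetical name.

```c
#include <assert.h>
#include <fcntl.h>

/* Mark fd close-on-exec, preserving any other descriptor flags.
 * Returns 0 on success, -1 on error (errno is set by fcntl). */
int set_cloexec(int fd)
{
	int flags;

	assert(fd >= 0);
	flags = fcntl(fd, F_GETFD);
	if (flags == -1)
		return -1;
	return fcntl(fd, F_SETFD, flags | FD_CLOEXEC) == -1 ? -1 : 0;
}
```

A descriptor marked this way is closed automatically in any program started via `exec*()`, so child processes do not inherit the FIFO ends.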
Zdenek Kabelac	2b760a7fa7	whatsnew	2013-01-11 09:26:51 +01:00
Sebastian Ott	9602e68577	filters: add scm devices Fix this: pvcreate /dev/scma Device /dev/scma not found (or ignored by filtering). Reported-by: Peter Oberparleiter <peter.oberparleiter@de.ibm.com> Signed-off-by: Sebastian Ott <sebott@linux.vnet.ibm.com>	2013-01-11 09:24:07 +01:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Alasdair G Kergon	7f747a0d73	logging: add debug classes Add log/debug_classes to lvm.conf to allow debug messages to be classified and filtered at runtime. The dm_errno field is only used by log_error(), so I've redefined it for log_debug() messages to hold the message class. By default, all existing messages appear, but we can add categories that generate high volumes of data, such as logging all traffic to/from lvmetad.	2013-01-07 22:25:19 +00:00
Alasdair G Kergon	b617109fff	lvmetad: fix format1 updates fmt1 doesn't have a separate commit function: updates take effect immediately vg_write is called, so we must update lvmetad at this point if we're going to go on and ask lvmetad for the VG metadata again before calling the commit function (though that's probably an unsupported and pointless thing to do anyway as the client must already have that data and it cannot have changed because it's locked and with devs suspended we shouldn't be communicating with lvmetad; so when that's fixed properly, this fix here can be reverted). This problem showed up as an internal error when lvremoving an LVM1 snapshot. > Internal error: LV snap1 (00000000000000000000000000000001) missing from preload metadata https://bugzilla.redhat.com/891855	2013-01-05 03:17:35 +00:00
Alasdair G Kergon	48e1ae7f6a	lvmetad: add basic client-side debug logging First attempt at showing precisely what use any command is making of lvmetad in the -vvvv trace information.	2013-01-05 00:35:50 +00:00
Alasdair G Kergon	41e7f45258	lvmetad: rename device vars and move _token_update Move _token_update() to avoid the need for _lvmetad_send prototype. Use 'dev' consistently for a struct device * variable. Use 'devno' for a dev_t.	2013-01-04 23:45:22 +00:00
Alasdair G Kergon	981962b339	libdaemon: add logging to daemon_open Log all conditions encountered in daemon_open(). Only store errno when known to be set.	2013-01-04 23:29:59 +00:00
Alasdair G Kergon	6d760b2c63	lvmetad: improve client logging when connecting Rename lvmetad_warning() to lvmetad_connect_or_warn(). Log all connection attempts on the client side, whether successful or not. Reduce some nesting and remove a redundant assertion.	2013-01-04 23:22:30 +00:00
Alasdair G Kergon	a527a3b8c2	lvmetad: lvm depends on libdaemonclient.a Rebuild lvm binary if libdaemonclient.a changes.	2013-01-04 23:10:38 +00:00
Peter Rajnoha	ad85b0c526	pvscan: synchronize with udev if pvscan --cache is used.

We need to call sync_local_dev_names directly as pvscan uses VG_GLOBAL lock and this one does not cause the synchronization (sync_dev_names) to be called on unlock (VG_GLOBAL is not a real VG):

  #define unlock_vg(cmd, vol) \
          do { \
                  if (is_real_vg(vol)) \
                          sync_dev_names(cmd); \
                  (void) lock_vol(cmd, vol, LCK_VG_UNLOCK); \
          } while (0)

Without this fix, we end up without udev synchronization for the pvscan --cache (mainly for -aay that causes the VGs/LVs to be autoactivated) and also udev synchronization cookies are then left in the system since they're not managed properly (code before sets up udev sync cookies, but we have to call dm_udev_wait at least once after that to do the wait and cleanup).	2012-12-21 11:15:46 +01:00
Peter Rajnoha	756bcabbfe	activation: fix autoactivation to not trigger on each PV change

Before, the pvscan --cache -aay was called on each ADD and CHANGE uevent (for a device that is not a device-mapper device) and each CHANGE event (for a PV that is a device-mapper device).

This causes troubles with autoactivation in some cases as CHANGE event may originate from using the OPTION+="watch" udev rule that is defined in 60-persistent-storage.rules (part of the rules provided by udev directly) and it's used for all block devices (except fd|mtd|nbd|gnbd|btibm|dm-|md* devices).

For example, the following sequence incorrectly activates the rest of LVs in a VG if one of the LVs in the VG is being removed:

  [root@rhel6-a ~]# pvcreate /dev/sda
    Physical volume "/dev/sda" successfully created
  [root@rhel6-a ~]# vgcreate vg /dev/sda
    Volume group "vg" successfully created
  [root@rhel6-a ~]# lvcreate -l1 vg
    Logical volume "lvol0" created
  [root@rhel6-a ~]# lvcreate -l1 vg
    Logical volume "lvol1" created
  [root@rhel6-a ~]# vgchange -an vg
    0 logical volume(s) in volume group "vg" now active
  [root@rhel6-a ~]# lvs
    LV    VG   Attr      LSize Pool Origin Data%  Move Log Cpy%Sync Convert
    lvol0 vg   -wi------ 4.00m
    lvol1 vg   -wi------ 4.00m
  [root@rhel6-a ~]# lvremove -ff vg/lvol1
    Logical volume "lvol1" successfully removed
  [root@rhel6-a ~]# lvs
    LV    VG   Attr      LSize Pool Origin Data%  Move Log Cpy%Sync Convert
    lvol0 vg   -wi-a---- 4.00m

...so the vg was deactivated, then lvol1 removed, and we end up with lvol1 removed (which is ok) BUT with lvol0 activated (which is wrong)!!!

This is because after lvol1 removal, we need to write metadata to the underlying device /dev/sda and that causes the CHANGE event to be generated (because of the WATCH udev rule set on this device) and this causes the pvscan --cache -aay to be reevaluated.

We have to limit this and call pvscan --cache -aay to autoactivate VGs/LVs only in these cases:

  --> if the PV is not a dm device, scan only after proper device addition (ADD event) and not with any other changes (CHANGE event)

  --> if the PV is a dm device, scan only after proper mapping activation (CHANGE event + the underlying PV in a state "just activated")
	2012-12-21 10:34:48 +01:00
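The two conditions at the end of commit 756bcabbfe can be written as a small predicate. In lvm2 this logic lives in udev rules, so the C function below is only a sketch of the decision. `should_autoactivate` and its parameters are hypothetical, and the lowercase action strings are an assumption about how uevent actions are spelled.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical decision helper mirroring the two rules in the commit message.
 * action:         uevent action string, assumed to be "add", "change", ...
 * is_dm:          nonzero if the PV sits on a device-mapper device
 * just_activated: nonzero if that dm mapping has just been activated */
int should_autoactivate(const char *action, int is_dm, int just_activated)
{
	assert(action != NULL);
	if (!is_dm)
		return strcmp(action, "add") == 0;	/* ADD only */
	return strcmp(action, "change") == 0 && just_activated;
}
```

Under these rules the CHANGE event triggered by the metadata write in the lvremove example no longer causes a rescan, because /dev/sda is not a dm device.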
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read were not in-sync. Unless overridden, the kernel enforces this rule by not allowing the creation of an array that is not in-sync and includes a device that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is inactive 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
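The three conditions in commit 970dfbcd69 form a small decision table, sketched here in C. The enum and function names are hypothetical and do not come from the lvm2 sources.

```c
#include <assert.h>

enum raid_op { RAID_REPLACE, RAID_REPAIR };
enum raid_verdict { RAID_DENY, RAID_ALLOW, RAID_ALLOW_WITH_WARNING };

/* Hypothetical check applying the three rules from the commit message. */
enum raid_verdict raid_op_allowed(enum raid_op op, int lv_active, int in_sync)
{
	assert(op == RAID_REPLACE || op == RAID_REPAIR);
	if (!lv_active)
		return RAID_DENY;		/* rule 1: LV must be active */
	if (in_sync)
		return RAID_ALLOW;
	/* Not in-sync: repair is allowed with a warning (rule 3),
	 * replacement is refused (rule 2). */
	return op == RAID_REPAIR ? RAID_ALLOW_WITH_WARNING : RAID_DENY;
}
```

The activity check comes first because the sync state of an inactive LV cannot be known, as the commit message notes.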
Peter Rajnoha	0379c480e0	WHATS_NEW: changelog for `fae1a611d2` and `5294a6f77a`	2012-12-18 12:12:58 +01:00
Andy Grover	86e528c667	lvm2app: No special behavior for 0 for max_snap_size in lvm_lv_snapshot() It isn't possible to choose a sane default for snapshot size, so just play it straight and use the passed size instead of adding special behavior for 0. Also revert change to Python lib, size parameter must be supplied. Signed-off-by: Andy Grover <agrover@redhat.com>	2012-12-17 14:14:38 -08:00
Zdenek Kabelac	69099e7ef5	Revert "lvmetad: simplify pvid memory allocation." This reverts commit `ed23da95b6`. Hash table device_to_pvid seems to contain references to already deleted pvids and so revert to the older behaviour using allocated memory.	2012-12-17 13:49:19 +01:00
Petr Rockai	5294a6f77a	lvmetad: Fix a possible race in remove_metadata. All operations on shared hash tables need to be protected by mutexes. Moreover, lookup and subsequent key removal need to happen atomically, to avoid races (and possible double free-ing) between multiple threads trying to manipulate the same VG.	2012-12-17 00:47:55 +01:00
Petr Rockai	fae1a611d2	lvmetad: Fix a possible deadlock. If an update and a query were running in parallel, there was a slim but non-zero chance of a deadlock due to (unnecessary) mutex nesting.	2012-12-17 00:47:55 +01:00
Zdenek Kabelac	ed23da95b6	lvmetad: simplify pvid memory allocation. Since pvid_dup and the cft config appear to be tightly bound together, reuse its memory pool for the string. Simplifies release of hashes.	2012-12-15 17:23:28 +01:00
Zdenek Kabelac	6f9e26f5c0	thin: dmeventd fix memleak on error path Some error paths on _umount have leaked bitset.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	401c9aba4a	pv_read: add missing check for valid info If the lvmcache_info_from_pvid() fails to find valid info, invoke the lookup by dev, and only in this case call lvmcache_info_from_pvid() again. Also check for the result of info and return error directly, so the NULL is not passed to lvmcache_get_label().	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	3e8dbfaecf	lvmetad: add check for failure dm_config_write_node Detect if dm_config_write_node failed and fail correctly.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	4008f4f891	lvmetad: fix socket leak in handle_connect Close socket_fd and report error on malloc failure.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	e012d0635d	lvmetad: check id_read_format error status Detect error from id_read_format() function.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	ba3f37c9e4	lvmetad: fix memleak on pv_found error path Free resources allocated in pv_found when going out through error path.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	399fc1bb33	lvmetad: keep returned struct fully initialized Always clear the response structure. Simplify daemon_reply initialization.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	a4269aadf3	lvmetad: unlock vg on out-of-memory path If we fail to get memory for the mutex, fail to hash the mutex, or fail somewhere along the pthread function calls, return the allocated resources and unlock the vg_lock_map mutex.	2012-12-15 17:23:26 +01:00
Zdenek Kabelac	788ac7fa54	libdaemon: check for strdup result Detect failure of dm_pool_strdup() and print error in fail path. Save one extra strchr call - since we already know the distance for the '=' character. Drop stack trace from return after log_error().	2012-12-15 17:23:26 +01:00
Zdenek Kabelac	ff5612c0c3	format-text: check for _text_create_text_instance Test if 'fid' creation failed and report stack trace, break the loop and do not pass NULL fid further.	2012-12-15 17:23:23 +01:00
Zdenek Kabelac	740ab81d03	log: move abort past syslog When abort_on_internal_errors is enabled, we aborted prior to the syslog logging output. Since such a fatal error gets level _LOG_FATAL, it should not be blocked by the debug_level() check, so let's move the abort further down to get the abort error logged via syslog as well.	2012-12-15 17:22:48 +01:00
Zdenek Kabelac	575c4ed964	cleanup: use proper const in apply_lvname_restrictions Better constness used for reserved prefixes and strings. Also simplify validate_name a bit and use direct char checks instead of 2 strcmp() calls.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	21f6511bc2	cleanup: reorder code Swap if() test condition and check for failure and use traditional 'stack' trace.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	13835d04ac	cleanup: skip assignment env is reassigned without being used, so drop this assignment.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	0396ade38b	cleanup: simplify option matching function Avoid using sprintf and strncmp call, when we really want to compare just one character.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	a266154e1f	cleanup: singlenode minor change Use strcpy instead of sprintf for plain string. And use dm_strncpy for safer strncpy. TODO: Fix API return values for cluster functions.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	8ab4334505	cleanup: ignore return values These dm_snprintfs should not fail, since enough space is reserved. So the return value is intentionally ignored.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	1b05438fcb	cleanup: ignore errors Since we are doing just a dump and the function doesn't report any error, explicitly ignore return values from dm_config_write_node and dm_asprintf. The same applies to the logging function.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	8b8065a870	cleanup: drop unused header This header does not resolve any symbols here.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	1d774e5667	cleanup: drop test for optarg NULL Since -d takes an argument, we do not need to check for optarg being NULL here.	2012-12-15 14:57:40 +01:00

7323 Commits