shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	8e1f2e733e	gcc: fix comparing floating point warning Since we enabled some more gcc warnings - let's adapt for it and check for double equals with DBL_EPSILON. Current close_enough() is far from perfect for more details see i.e. here: http://randomascii.wordpress.com/2012/01/11/tricks-with-the-floating-point-format/ but fairly enough for lvm2 use-case.	2013-10-25 10:43:32 +02:00
Jonathan Brassow	d5896f0afd	Mirror: Fix hangs and lock-ups caused by attempting label reads of mirrors There is a problem with the way mirrors have been designed to handle failures that is resulting in stuck LVM processes and hung I/O. When mirrors encounter a write failure, they block I/O and notify userspace to reconfigure the mirror to remove failed devices. This process is open to a couple races: 1) Any LVM process other than the one that is meant to deal with the mirror failure can attempt to read the mirror, fail, and block other LVM commands (including the repair command) from proceeding due to holding a lock on the volume group. 2) If there are multiple mirrors that suffer a failure in the same volume group, a repair can block while attempting to read the LVM label from one mirror while trying to repair the other. Mitigation of these races has been attempted by disallowing label reading of mirrors that are either suspended or are indicated as blocking by the kernel. While this has closed the window of opportunity for hitting the above problems considerably, it hasn't closed it completely. This is because it is still possible to start an LVM command, read the status of the mirror as healthy, and then perform the read for the label at the moment after a the failure is discovered by the kernel. I can see two solutions to this problem: 1) Allow users to configure whether mirrors can be candidates for LVM labels (i.e. whether PVs can be created on mirror LVs). If the user chooses to allow label scanning of mirror LVs, it will be at the expense of a possible hang in I/O or LVM processes. 2) Instrument a way to allow asynchronous label reading - allowing blocked label reads to be ignored while continuing to process the LVM command. This would action would allow LVM commands to continue even though they would have otherwise blocked trying to read a mirror. They can then release their lock and allow a repair command to commence. In the event of #2 above, the repair command already in progress can continue and repair the failed mirror. This patch brings solution #1. If solution #2 is developed later on, the configuration option created in #1 can be negated - allowing mirrors to be scanned for labels by default once again.	2013-10-22 19:14:33 -05:00
Peter Rajnoha	039bdad732	activation: flag temporary LVs internally Add LV_TEMPORARY flag for LVs with limited existence during command execution. Such LVs are temporary in way that they need to be activated, some action done and then removed immediately. Such LVs are just like any normal LV - the only difference is that they are removed during LVM command execution. This is also the case for LVs representing future pool metadata spare LVs which we need to initialize by using the usual LV before they are declared as pool metadata spare. We can optimize some other parts like udev to do a better job if it knows that the LV is temporary and any processing on it is just useless. This flag is orthogonal to LV_NOSCAN flag introduced recently as LV_NOSCAN flag is primarily used to mark an LV for the scanning to be avoided before the zeroing of the device happens. The LV_TEMPORARY flag makes a difference between a full-fledged LV visible in the system and the LV just used as a temporary overlay for some action that needs to be done on underlying PVs. For example: lvcreate --thinpool POOL --zero n -L 1G vg - first, the usual LV is created to do a clean up for pool metadata spare. The LV is activated, zeroed, deactivated. - between "activated" and "zeroed" stage, the LV_NOSCAN flag is used to avoid any scanning in udev - betwen "zeroed" and "deactivated" stage, we need to avoid the WATCH udev rule, but since the LV is just a usual LV, we can't make a difference. The LV_TEMPORARY internal LV flag helps here. If we create the LV with this flag, the DM_UDEV_DISABLE_DISK_RULES and DM_UDEV_DISABLE_OTHER_RULES flag are set (just like as it is with "invisible" and non-top-level LVs) - udev is directed to skip WATCH rule use. - if the LV_TEMPORARY flag was not used, there would normally be a WATCH event generated once the LV is closed after "zeroed" stage. This will make problems with immediated deactivation that follows.	2013-10-23 14:09:37 +02:00
Peter Rajnoha	ee878bc52c	coverity: assigned variable not used and reassigned later	2013-10-16 15:06:43 +02:00
Peter Rajnoha	48df36b8c5	activation: check for open count with a timeout before removal/deactivation of an LV This patch reinstates the lv_info call to check for open count of the LV we're removing/deactivating - this was changed with commit `125712b` some time ago and we relied on the ioctl retry logic deeper in the libdm while calling the exact 'remove' ioctl. However, there are still some situations in which it's still required to check for open count before we do any 'remove' actions - this mainly applies to LVs which consist of several sub LVs, like it is for virtual snapshot devices. The commit `1146691` fixed the issue with ordering of actions during virtual snapshot removal while the snapshot is still open. But the check for the open status of the snapshot is still prone to marking the snapshot as in use with an immediate exit even though this could be a temporary asynchronous open only, most notably because of udev and its WATCH udev rule with accompanying scans for the event which is asynchronous. The situation where this crops up most often is when we're closing the LV that was open for read-write and then calling lvremove immediately. This patch reinstates the original lv_info call for the open status of the LV in the lv_check_not_in_use fn that gets called before we do any LV removal/deactivation. In addition to original logic, this patch adds its own retry loop with a delay (25x0.2 seconds) besides the existing ioctl retry loop.	2013-10-15 12:44:42 +02:00
Jonathan Brassow	d97583cfd3	RAID: Better error message when attempting scrubbing op on thinpool LV Component LVs of a thinpool can be RAID LVs. Users who attempt a scrubbing operation directly on a thinpool will be prompted to specify the sub-LV they wish the operation to be performed on. If neither of the sub-LVs are RAID, then a message telling them that the operation can only be performed on a RAID LV will be given.	2013-10-14 15:14:16 -05:00
Jonathan Brassow	f58b26b633	RAID: Report RAID images split with tracking as out-of-sync ("I"). Split image should have an out-of-sync attr ('I') - always. Even if the RAID LV has not been written to since the LV was split off, it is still not part of the group that makes up the RAID and is therefore "out-of-sync".	2013-10-14 10:48:44 -05:00
Zdenek Kabelac	851bba258c	snapshot: rework parsing of snapshot metadata Add better parsing code for snapshot metadata, which describe properly errors found for snapshot segment.	2013-10-14 00:26:58 +02:00
Zdenek Kabelac	1146691afc	snapshot: deactivate virtual snapshot first Since the virtual snapshot has no reason to stay alive once we detach related snapshot - deactivate whole thing in front of snapshot removal - otherwice the code would get tricky for support in cluster. The correct full solution would require to have transactions for libdm operations. Also enable to the check for snapshot being opened prior the origin deactivation, otherwise we could easily end with the origin being deactivate, but snapshot still kept active, desynchronizing locking state in cluster.	2013-10-14 00:25:15 +02:00
Zdenek Kabelac	81504ba70c	snapshot: move virtsnap code from tool to lib Move code for removal dependency from tool's remove.c into lib's manipulation code. Same code then works with lvm2app.	2013-10-12 00:14:52 +02:00
Peter Rajnoha	6b35c70e8b	metadata: add INTERNAL_ERROR to "Metadata inconsistency" msg So we can spot it better if it occurs.	2013-10-10 13:34:43 +02:00
Peter Rajnoha	029b8fbe76	metadata: properly register LV_NOSCAN flag Addendum to commit `ce7489e` which introduced a new internal LV_NOSCAN flag and so it needs to be marked that way properly otherwise it ends up unrecognized and improperly handled during metadata export.	2013-10-10 13:24:32 +02:00
Alasdair G Kergon	7bed6d1263	filters: Add NVM Express (nvme).	2013-10-09 20:08:07 +01:00
Petr Rockai	0decd7553a	metadata: Fix metadata repair paths when lvmetad is used.	2013-10-09 14:44:01 +02:00
Peter Rajnoha	ce7489ed22	activation: add support for flagging an LV to skip udev scanning during activation A common scenario is during new LV creation when we need to wipe the newly created LV and avoid any udev scanning before this stage otherwise it could cause the device (the LV) to be claimed by some other subsystem for which there were stale metadata within LV data. This patch adds possibility to mark the LV we're just about to wipe with a flag that gets passed to udev via DM_COOKIE as a subsystem specific flag - DM_SUBSYSTEM_UDEV_FLAG0 (in this case the subsystem is "LVM") so LVM udev rules will take care of handling that.	2013-10-08 13:43:14 +02:00
Alasdair G Kergon	04d9a52684	release 2.02.103 52 files changed, 598 insertions(+), 264 deletions(-)	2013-10-04 14:32:23 +01:00
Peter Rajnoha	8cf0810d57	thin: rename thin_pool_chunk_size_calculation -> ..size_policy and rename "default" policy to "generic" Just to be consistent with existing naming we use.	2013-10-04 12:30:33 +02:00
Alasdair G Kergon	baf95bbff7	cmdline: Add --ignoreskippedcluster. Accept --ignoreskippedcluster with pvs, vgs, lvs, pvdisplay, vgdisplay, lvdisplay, vgchange and lvchange to avoid the 'Skipping clustered VG' errors when requesting information about a clustered VG without using clustered locking and still exit with success. The messages can still be seen with -v.	2013-10-01 21:20:10 +01:00
Peter Rajnoha	24ffd5244f	thin: better dbg msgs and avoid uninit. value on chunk size recalc	2013-09-30 08:58:57 +02:00
Peter Rajnoha	e02ff32260	fix: also make commit b4637 work without dmeventd	2013-09-30 08:17:56 +02:00
Peter Rajnoha	b4637bd298	fix: make it possible to compile with --disable-devmapper again Some code has been added recently which makes it impossible to compile when "configure --disable-devmapper" is used. This patch just shuffles the code around so it's under proper #ifdef DEVMAPPER_SUPPORT.	2013-09-27 13:58:55 +02:00
Jonathan Brassow	acdc731e83	RAID: Fix _sufficient_pes_free calculation for RAID lib/metadata/lv_manip.c:_sufficient_pes_free() was calculating the required space for RAID allocations incorrectly due to double accounting. This resulted in failure to allocate when available space was tight. When RAID data and metadata areas are allocated together, the total amount is stored in ah->new_extents and ah->alloc_and_split_meta is set. '_sufficient_pes_free' was adding the necessary metadata extents to ah->new_extents without ever checking ah->alloc_and_split_meta. This often led to double accounting of the metadata extents. This patch checks 'ah->alloc_and_split_meta' to perform proper calculations for RAID. This error is only present in the function that checks for the needed space, not in the functions that do the actual allocation.	2013-09-26 11:30:07 -05:00
Peter Rajnoha	78cba8eb3f	thin: calculate thin pool chunk size based on device IO hints If "default" thin pool chunk size calculation method is selected, use minimum_io_size, otherwise optimal_io_size for "performance" device hint exposed in sysfs. If there appear to be PVs with different hints presented, use their least common multiple. If the hint is less than the default value defined for the calculation method, use the default value instead.	2013-09-25 16:06:38 +02:00
Peter Rajnoha	cc9e65c391	thin: use appropriate default value based on allocation/thin_pool_chunk_size_calculation setting If thin_pool_chunk_size_calculation is set to "default", use 64KiB, otheriwse 512KiB for "performance".	2013-09-25 16:06:38 +02:00
Peter Rajnoha	8bf425005c	conf: add allocation/thin_pool_chunk_size_calculation Add allocation/thin_pool_chunk_size_calculation lvm.conf option to select a method for calculating thin pool chunk sizes and define two possible values - "default" and "performance".	2013-09-25 16:06:38 +02:00
Jonathan Brassow	c37c59e155	Test/clean-up: Indent clean-up and additional RAID resize test Better indenting and a test for bug 1005434 (parity RAID should extend in a contiguous fashion).	2013-09-24 21:32:53 -05:00
Jonathan Brassow	5ded7314ae	RAID: Fix broken allocation policies for parity RAID types A previous commit (`b6bfddcd0a`) which was designed to prevent segfaults during lvextend when trying to extend striped logical volumes forgot to include calculations for RAID4/5/6 parity devices. This was causing the 'contiguous' and 'cling_by_tags' allocation policies to fail for RAID 4/5/6. The solution is to remember that while we can compare ah->area_count == prev_lvseg->area_count for non-RAID, we should compare (ah->area_count + ah->parity_count) == prev_lvseg->area_count for a general solution.	2013-09-24 21:32:10 -05:00
Alasdair G Kergon	11dc6a03c4	lvs: Add seg_size_pe field. Requested https://www.redhat.com/archives/linux-lvm/2013-July/msg00112.html	2013-09-23 21:50:14 +01:00
Alasdair G Kergon	c8057aec36	release 2.02.102 18 files changed, 137 insertions(+), 203 deletions(-)	2013-09-23 15:43:37 +01:00
Zdenek Kabelac	1fdead8d97	activation: use improved lv_info Call lv_info() with info == NULL to query for local active presence.	2013-09-23 12:13:08 +02:00
Zdenek Kabelac	3b604e5c8e	lvinfo: allow to use lv_info with NULL info When NULL info struct is passed in - function is usable as a quick query for lv_is_active_locally() - with a bonus we may query for layered device. So it could be seen as a more efficient lv_is_active_locally().	2013-09-23 12:13:06 +02:00
Zdenek Kabelac	b29adbbc4d	raid: add lv_is_raid() More readable then status bit flag masking...	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	b33b618fc8	cleanup: drop unused report parameter Do not pass unused dm_report pointer.	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	9b4bfca219	cleanup: add log_error error path messages	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	d2d61955a3	cleanup: shorter code Put assignment on declaration line.	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	85b9c12e92	cleanup: release all memory in error path Just ensure no memory will stay in pool even in error path.	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	30432bd604	cleanup: skip call of detect... SInce we know the pool was locked and we want to reloc pool again, just use '1' directly.	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	861a3b2f19	cleanup: monitoring more readable Put continue path into one code segment.	2013-09-23 11:35:15 +02:00
Jonathan Brassow	2c41c8b886	RAID: Don't allow syncaction changes on non-RAID LVs Don't allow syncaction or other RAID-type messages on non-RAID logical volumes.	2013-09-19 22:33:01 -05:00
Petr Rockai	3df50d822b	vgconvert: Do not call lvmetad_vg_remove (path shared with vgcfgbackup).	2013-09-18 12:53:11 +02:00
Petr Rockai	054cf25b5f	vgcfgrestore: Remove VG rom lvmetad later, to better deal with errors.	2013-09-18 11:24:58 +02:00
Alasdair G Kergon	a3a5f58c21	reporting: Add devtypes command. Add internal devtypes reporting command to display built-in recognised block device types. (The output does not include any additional types added by a configuration file.) > lvm devtypes -o help Device Types Fields ------------------- devtype_all - All fields in this section. devtype_name - Name of Device Type exactly as it appears in /proc/devices. devtype_max_partitions - Maximum number of partitions. (How many device minor numbers get reserved for each device.) devtype_description - Description of Device Type. > lvm devtypes DevType MaxParts Description aoe 16 ATA over Ethernet ataraid 16 ATA Raid bcache 1 bcache block device cache blkext 1 Extended device partitions ...	2013-09-18 01:09:15 +01:00
Alasdair G Kergon	97ba18f4cb	filters: Add bcache. N.B. Using bcache devices as PVs is still experimental. Problems should be reported to the appropriate mailing lists.	2013-09-16 16:56:55 +01:00
Zdenek Kabelac	b8ea27ac97	cleanup: hide gcc warning Older gcc is giving misleading warning: metadata/lv_manip.c:4018: warning: ‘seg’ may be used uninitialized in this function But warning free compilation is better.	2013-09-11 23:40:45 +02:00
Jonathan Brassow	2691f1d764	RAID: Make RAID single-machine-exclusive capable in a cluster Creation, deletion, [de]activation, repair, conversion, scrubbing and changing operations are all now available for RAID LVs in a cluster - provided that they are activated exclusively. The code has been changed to ensure that no LV or sub-LV activation is attempted cluster-wide. This includes the often overlooked operations of activating metadata areas for the brief time it takes to clear them. Additionally, some 'resume_lv' operations were replaced with 'activate_lv_excl_local' when sub-LVs were promoted to top-level LVs for removal, clearing or extraction. This was necessary because it forces the appropriate renaming actions the occur via resume in the single-machine case, but won't happen in a cluster due to the necessity of acquiring a lock first. The raid tests have been updated to allow testing in a cluster. For the most part, this meant creating devices with '-aey' if they were to be converted to RAID. (RAID requires the converting LV to be EX because it is a condition of activation for the RAID LV in a cluster.)	2013-09-10 16:33:22 -05:00
Jonathan Brassow	ca51435153	Misc/RAID: Enable resume_lv to handle some renaming conflicts. When images and their associated metadata are removed from a RAID1 LV, the remaining sub-LVs are "shifted" down to fill the gaps. For example, if there is a 3-way mirror: [0][1][2] and we remove device#0, the devices will be shifted down [1][2] and renamed. [0][1] This can create a problem for resume_lv (specifically, dm_tree_activate_children) during the renaming process though. This is because it will attempt to rename the higher indexed sub-LVs first and find that it cannot because there are currently other sub-LVs with that name. The solution is to check for a conflicting name before attempting to rename. If a conflict is found and that conflicting sub-LV is also in the process of renaming, we can defer the current rename until the conflicting sub-LV has renamed and cleared the conflict. Now that resume_lv can handle these types of rename conflicts, we can remove the workaround in RAID that was attempting to resume a RAID1 LV from the bottom-up in order to force a proper rename in assending order before attempting a resume on the top-level LV. This "hack" only worked for single machine use-cases of LVM. Clearing this up paves the way for exclusive activation of RAID LVs in a cluster.	2013-09-09 15:07:28 -05:00
Zdenek Kabelac	f5832d8c49	deactivate: drop readahead calc in deactivation Skip readahead when device will be deactivated.	2013-09-07 09:13:20 +02:00
Zdenek Kabelac	0670bfeb59	thin: validation catch multiseg thin pool/volumes Multisegment thin pools and volumes are not supported. Catch such error code path early.	2013-09-07 03:32:07 +02:00
Zdenek Kabelac	655296609e	thin: fix monitoring of thin pool volume Properly skip unmonitoring of thin pool volume in deactivation code path. Code makes sure if there is just any thin pool user it stays monitored with all its resources.	2013-09-07 03:31:04 +02:00
Zdenek Kabelac	4c001a7854	thin: fix resize of stacked thin pool volume When the pool is created from non-linear target the more complex rules have to be used and stacking needs to properly decode args for _tdata LV. Also proper allocation policies are being used according to those set in lvm2 metadata for data and metadata LVs. Also properly check for active pool and extra code to active it temporarily. With this fix it's now possible to use: lvcreate -L20 -m2 -n pool vg --alloc anywhere lvcreate -L10 -m2 -n poolm vg --alloc anywhere lvconvert --thinpool vg/pool --poolmetadata vg/poolm lvresize -L+10 vg/pool	2013-09-07 03:24:48 +02:00

1 2 3 4 5 ...

3421 Commits