mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00
Commit Graph

584 Commits

Author SHA1 Message Date
Alasdair G Kergon
527db4645f gcc: replace #ifdef linux with __linux__ 2013-11-13 13:56:29 +00:00
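
A minimal sketch of what such a change looks like (the include shown is only a placeholder, not taken from the actual patch):

        /* before: relies on the non-standard "linux" macro, which is not
         * defined in strict ISO C modes (e.g. gcc -std=c99) */
        #ifdef linux
        #  include <malloc.h>   /* placeholder include */
        #endif

        /* after: the compiler-defined __linux__ macro is always available
         * when targeting Linux */
        #ifdef __linux__
        #  include <malloc.h>   /* placeholder include */
        #endif
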
Zdenek Kabelac
c3e674ad30 activation: _lv_activate is ok when filtered.
If the volume_list setting filters a volume out of activation,
that is still a success result for this function.
Change the error message back to verbose level.

Detect whether the volume is active locally before zeroing,
so the error is reported a bit later for cases where the volume
could not be activated because it doesn't pass the volume_list
filter (the user can still create the volume by disabling
zeroing).
2013-11-01 13:02:36 +01:00
Jonathan Brassow
d5896f0afd Mirror: Fix hangs and lock-ups caused by attempting label reads of mirrors
There is a problem with the way mirrors have been designed to handle
failures that is resulting in stuck LVM processes and hung I/O.  When
mirrors encounter a write failure, they block I/O and notify userspace
to reconfigure the mirror to remove failed devices.  This process is
open to a couple of races:
1) Any LVM process other than the one that is meant to deal with the
mirror failure can attempt to read the mirror, fail, and block other
LVM commands (including the repair command) from proceeding due to
holding a lock on the volume group.
2) If there are multiple mirrors that suffer a failure in the same
volume group, a repair can block while attempting to read the LVM
label from one mirror while trying to repair the other.

Mitigation of these races has been attempted by disallowing label reading
of mirrors that are either suspended or are indicated as blocking by
the kernel.  While this has closed the window of opportunity for hitting
the above problems considerably, it hasn't closed it completely.  This is
because it is still possible to start an LVM command, read the status of
the mirror as healthy, and then perform the read for the label at the
moment after the failure is discovered by the kernel.

I can see two solutions to this problem:
1) Allow users to configure whether mirrors can be candidates for LVM
labels (i.e. whether PVs can be created on mirror LVs).  If the user
chooses to allow label scanning of mirror LVs, it will be at the expense
of a possible hang in I/O or LVM processes.
2) Instrument a way to allow asynchronous label reading - allowing
blocked label reads to be ignored while continuing to process the LVM
command.  This would allow LVM commands to continue even
though they would have otherwise blocked trying to read a mirror.  They
can then release their lock and allow a repair command to commence.  In
the event of #2 above, the repair command already in progress can continue
and repair the failed mirror.

This patch brings solution #1.  If solution #2 is developed later on, the
configuration option created in #1 can be negated - allowing mirrors to
be scanned for labels by default once again.
2013-10-22 19:14:33 -05:00
Peter Rajnoha
039bdad732 activation: flag temporary LVs internally
Add an LV_TEMPORARY flag for LVs with limited existence during command
execution. Such LVs are temporary in the sense that they need to be
activated, have some action done and then be removed immediately. Such
LVs are just like any normal LV - the only difference is that they are
removed during LVM command execution. This is also the case for LVs
representing future pool metadata spare LVs, which we need to initialize
using a usual LV before they are declared pool metadata spare.

We can optimize some other parts like udev to do a better job if
it knows that the LV is temporary and any processing on it is just
useless.

This flag is orthogonal to the LV_NOSCAN flag introduced recently,
as LV_NOSCAN is primarily used to mark an LV so that scanning is
avoided before the device is zeroed. The LV_TEMPORARY flag
distinguishes between a full-fledged LV visible in the system
and an LV used only as a temporary overlay for some action that needs
to be done on the underlying PVs.

For example: lvcreate --thinpool POOL --zero n -L 1G vg

- first, a usual LV is created to do a clean-up for the pool metadata
  spare. The LV is activated, zeroed and deactivated.

- between the "activated" and "zeroed" stages, the LV_NOSCAN flag is used
  to avoid any scanning in udev

- between the "zeroed" and "deactivated" stages, we need to avoid the WATCH
  udev rule, but since the LV is just a usual LV, we can't tell the
  difference. The LV_TEMPORARY internal LV flag helps here. If we
  create the LV with this flag, the DM_UDEV_DISABLE_DISK_RULES
  and DM_UDEV_DISABLE_OTHER_RULES flags are set (just as they are
  for "invisible" and non-top-level LVs) - udev is directed to
  skip the WATCH rule.

- if the LV_TEMPORARY flag were not used, a WATCH event would normally
  be generated once the LV is closed after the "zeroed" stage. This
  would cause problems with the immediate deactivation that follows.
2013-10-23 14:09:37 +02:00
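
A hedged sketch of how such a flag could be turned into udev flags when building the activation cookie - DM_UDEV_DISABLE_DISK_RULES_FLAG and DM_UDEV_DISABLE_OTHER_RULES_FLAG are libdevmapper constants, while the LV_TEMPORARY test and surrounding code are illustrative only:

        /* sketch: a temporary LV asks udev to skip disk and other
         * (non-essential) rules, which also avoids the WATCH rule */
        uint16_t udev_flags = 0;

        if (lv->status & LV_TEMPORARY)
                udev_flags |= DM_UDEV_DISABLE_DISK_RULES_FLAG |
                              DM_UDEV_DISABLE_OTHER_RULES_FLAG;
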
Peter Rajnoha
48df36b8c5 activation: check for open count with a timeout before removal/deactivation of an LV
This patch reinstates the lv_info call to check the open count of
the LV we're removing/deactivating - this was changed with commit 125712b
some time ago and we relied on the ioctl retry logic deeper in libdm
while calling the exact 'remove' ioctl.

However, there are still situations in which it's required to
check the open count before we do any 'remove' actions - this mainly
applies to LVs which consist of several sub-LVs, as is the case for
virtual snapshot devices.

Commit 1146691 fixed the ordering of actions during virtual
snapshot removal while the snapshot is still open. But the check
for the open status of the snapshot is still prone to marking the
snapshot as in use and exiting immediately, even though the open
could be only a temporary asynchronous one, most notably caused by
udev and its WATCH rule with its accompanying asynchronous scans.
The situation where this crops up most often is when we close an LV
that was open for read-write and then call lvremove immediately.

This patch reinstates the original lv_info call for the open status
of the LV in the lv_check_not_in_use fn that gets called before
we do any LV removal/deactivation. In addition to the original logic,
this patch adds its own retry loop with a delay (25 x 0.2 seconds)
besides the existing ioctl retry loop.
2013-10-15 12:44:42 +02:00
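
A rough sketch of the retry loop described above, assuming the lv_info() signature used elsewhere in this log (cmd, lv, use_layer, info, with_open_count, with_read_ahead); the helper name and constants are illustrative, not the exact lv_check_not_in_use code:

        /* sketch: poll the open count up to 25 times, 0.2 s apart, to
         * ride out short asynchronous opens (e.g. udev WATCH-rule scans);
         * usleep() comes from <unistd.h> */
        static int _wait_until_closed(struct cmd_context *cmd,
                                      const struct logical_volume *lv)
        {
                struct lvinfo info;
                int retries = 25;

                while (retries--) {
                        if (!lv_info(cmd, lv, 0, &info, 1, 0))
                                return 0;       /* query failed */
                        if (!info.open_count)
                                return 1;       /* safe to remove/deactivate */
                        usleep(200000);         /* 0.2 second delay */
                }

                return 0;                       /* still in use */
        }
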
Jonathan Brassow
d97583cfd3 RAID: Better error message when attempting scrubbing op on thinpool LV
Component LVs of a thinpool can be RAID LVs.  Users who attempt a
scrubbing operation directly on a thinpool will be prompted to
specify the sub-LV they wish the operation to be performed on.  If
neither of the sub-LVs is RAID, then a message telling them that
the operation can only be performed on a RAID LV will be given.
2013-10-14 15:14:16 -05:00
Zdenek Kabelac
1146691afc snapshot: deactivate virtual snapshot first
Since the virtual snapshot has no reason to stay alive once we
detach the related snapshot, deactivate the whole thing before
snapshot removal - otherwise the code would get tricky to
support in a cluster.

The correct full solution would require transactions
for libdm operations.

Also enable the check for the snapshot being open prior to
the origin deactivation, otherwise we could easily end up
with the origin deactivated but the snapshot still kept
active, desynchronizing the locking state in the cluster.
2013-10-14 00:25:15 +02:00
Peter Rajnoha
ce7489ed22 activation: add support for flagging an LV to skip udev scanning during activation
A common scenario is new LV creation, when we need to wipe the
newly created LV and avoid any udev scanning before this stage, otherwise
the device (the LV) could be claimed by some other subsystem
for which stale metadata remained within the LV data.

This patch adds the possibility to mark the LV we're just about to wipe
with a flag that gets passed to udev via DM_COOKIE as a subsystem-specific
flag - DM_SUBSYSTEM_UDEV_FLAG0 (in this case the subsystem is "LVM") -
so LVM udev rules will take care of handling it.
2013-10-08 13:43:14 +02:00
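
A hedged sketch of passing a subsystem-specific flag through the udev cookie - dm_task_set_cookie() and DM_SUBSYSTEM_UDEV_FLAG0 are libdevmapper names, but the fragment itself is illustrative:

        /* sketch: mark the activation so LVM udev rules can see the
         * subsystem flag and skip scanning of the about-to-be-wiped LV */
        uint32_t cookie = 0;
        uint16_t udev_flags = DM_SUBSYSTEM_UDEV_FLAG0;

        if (!dm_task_set_cookie(dmt, &cookie, udev_flags))
                return 0;
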
Peter Rajnoha
b4637bd298 fix: make it possible to compile with --disable-devmapper again
Some code has been added recently which makes it impossible to compile
when "configure --disable-devmapper" is used. This patch just shuffles
the code around so it's under the proper #ifdef DEVMAPPER_SUPPORT.
2013-09-27 13:58:55 +02:00
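
A minimal sketch of the kind of guard involved (the function below is hypothetical; only the DEVMAPPER_SUPPORT macro comes from the build system):

        #ifdef DEVMAPPER_SUPPORT
        /* anything that needs libdevmapper lives only inside this guard,
         * so "configure --disable-devmapper" builds still compile */
        static int _devmapper_available(void)
        {
                return 1;       /* placeholder body */
        }
        #else
        static int _devmapper_available(void)
        {
                return 0;       /* no device-mapper support compiled in */
        }
        #endif
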
Zdenek Kabelac
1fdead8d97 activation: use improved lv_info
Call lv_info() with info == NULL to query for local active presence.
2013-09-23 12:13:08 +02:00
Zdenek Kabelac
3b604e5c8e lvinfo: allow to use lv_info with NULL info
When a NULL info struct is passed in, the function is usable
as a quick query for lv_is_active_locally() - with the bonus that
we may query for the layered device.

So it can be seen as a more efficient lv_is_active_locally().
2013-09-23 12:13:06 +02:00
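
A hedged sketch of the intended usage; the lv_info() parameters follow the call quoted further down this log (cmd, lv, use_layer, info, with_open_count, with_read_ahead), but treat the fragment as illustrative:

        /* sketch: info == NULL turns lv_info() into a cheap local-activity
         * query; use_layer = 1 asks about the layered (-real) device */
        if (lv_info(cmd, lv, 1, NULL, 0, 0))
                log_debug("Layered device of %s is active locally.", lv->name);
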
Zdenek Kabelac
85b9c12e92 cleanup: release all memory in error path
Just ensure no memory stays in the pool, even in the error path.
2013-09-23 11:35:15 +02:00
Zdenek Kabelac
861a3b2f19 cleanup: monitoring more readable
Put the continue path into one code segment.
2013-09-23 11:35:15 +02:00
Jonathan Brassow
2c41c8b886 RAID: Don't allow syncaction changes on non-RAID LVs
Don't allow syncaction or other RAID-type messages on non-RAID
logical volumes.
2013-09-19 22:33:01 -05:00
Zdenek Kabelac
f5832d8c49 deactivate: drop readahead calc in deactivation
Skip the readahead calculation when the device will be deactivated.
2013-09-07 09:13:20 +02:00
Zdenek Kabelac
655296609e thin: fix monitoring of thin pool volume
Properly skip unmonitoring of the thin pool volume in the deactivation
code path. The code makes sure that if there is any thin pool user,
the pool stays monitored with all its resources.
2013-09-07 03:31:04 +02:00
Alasdair G Kergon
c0f987949b activation: Fix segfault with inactive pvmove LV.
Set flag to avoid recursion back through an inactive pvmove LV when
populating deptree.
2013-08-28 22:56:23 +01:00
Jonathan Brassow
c95f17ea64 Mirror: Fix issue preventing PV creation on mirror LVs
Commit b248ba0a39 attempted to
prevent mirror devices which had a failed device in their
mirrored log from being usable/readable by LVM.  This was to
protect against circular dependencies where one LVM command
could be blocked trying to read one of these affected mirrors
while the LVM command to fix/unblock that mirror was stuck
behind the currently running command.

The above commit went wrong when it used 'device_is_usable()' to
recurse on the mirrored log device to check if it was suspended
or blocked.  The 'device_is_usable' function also contains a check
for reserved names - like *_mlog, etc.  This last check always
triggered when checking a mirror's log simply because of the name,
not because it was suspended or blocked - a false positive.

The solution is to create a new function like 'device_is_usable',
but without the check for reserved names.  Using this new function
(device_is_suspended_or_blocked), we can check the status of a
mirror's log device properly.
2013-08-07 17:42:26 -05:00
Zdenek Kabelac
aed4e9c703 coverity: pointer validation
Check for metadata_lv and make sure we have a proper thin pool segment.

Check we are working with a merging snapshot when adding the merging target.
2013-07-22 12:41:21 +02:00
Jonathan Brassow
4eea660191 RAID: Fix segfault when reporting raid_syncaction field on older kernel
The status printed for dm-raid targets on older kernels does not include
the syncaction field.  This is handled by dev_manager_raid_status() just
fine by populating the raid status structure with NULL for that field.
However, lv_raid_sync_action() does not properly handle that field being
NULL.  So, check for it and return 0 if it is NULL.
2013-07-19 10:01:48 -05:00
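
A minimal sketch of the guard described above (the sync_action field name follows the status structure described later in this log; the surrounding fragment is illustrative):

        /* sketch: older dm-raid targets report no syncaction, so the parsed
         * status carries NULL there - treat that as "not available" */
        if (!status->sync_action)
                return 0;

        *sync_action = status->sync_action;
        return 1;
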
Zdenek Kabelac
fd31cc9dfc cleanup: stack and remove braces
Add stack trace for error path.
Remove unneeded braces.
2013-07-18 18:16:17 +02:00
Zdenek Kabelac
9a06094824 thin: improve external origin tree creation
When the tree for thin LVs used external_lv, the solution was
far less optimal: it tried to add certain existing dependencies
only when a new node was added.
However, this led to overly complex tree construction, since many
repeated checks were made during such a tree build.

This patch moves this detection to the proper _partial_tree generation
code and uses a new 'activation' flag for it, which is set when the
tree for ACTIVATION or PRELOAD is generated.

It increases performance when thins with external origins are used.

(in release update)
2013-07-15 16:00:06 +02:00
Zdenek Kabelac
57be501aa3 dev_manager: lower memory usage
The dlid created for the test is not needed afterwards, so lower the
memory usage of this call, which is used repeatedly for building some
large tree.

TODO: create a function that uses a given buffer on the stack, as that
is much cheaper.
2013-07-15 15:59:20 +02:00
Zdenek Kabelac
0443c42e3b thin: add sub volumes as whole volumes
Do not use origin_only when adding log_lv and metadata as a sub-volume.
The stacked volume needs to access the whole volume in this case.
2013-07-15 15:58:07 +02:00
Zdenek Kabelac
97d36d5750 thin: check and use layered origin lv
The code needs to check if the layer origin device is suspended.
It's valid to create a thin volume snapshot of a thin volume which is
also used as an old-style snapshot. In this case we need to check
whether -real is suspended.

When adding origin_only, add only the layer thin volume.
(In case it's also an old snapshot, add only the -real device.)
2013-07-15 15:51:39 +02:00
Zdenek Kabelac
55d90b6420 cleanup: update commented-out code part
Just make it in sync with the latest proposal.
2013-07-15 15:40:46 +02:00
Mike Snitzer
f9e0adcce5 snapshot: Rename snapshot segment returning methods from find_*_cow to find_*_snapshot
find_cow -> find_snapshot, find_merging_cow -> find_merging_snapshot.
This will allow thin snapshot code to reuse these methods without confusion.
2013-07-02 16:26:03 -04:00
Peter Rajnoha
d6a91da4be config: add profile arg to find_config_tree_bool 2013-07-02 15:19:09 +02:00
Peter Rajnoha
8ac4fcf8ff config: add profile arg to find_config_tree_str_allow_empty 2013-07-02 15:19:09 +02:00
Peter Rajnoha
06dd66af54 config: add profile arg to find_config_tree_str 2013-07-02 15:19:09 +02:00
Peter Rajnoha
eeb7b0f7fa config: add profile arg to find_config_tree_node 2013-07-02 15:19:09 +02:00
Jonathan Brassow
1acad23d68 RAID: Remove optimization using static vars in lv_raid_dev_health
Revert commit 37ffe6a.  If static variables are to be used then we
will put them elsewhere and limit the optimization to reporting
code, rather than have it be used in the general case.
2013-06-17 13:03:15 -05:00
Zdenek Kabelac
17a3ddf89e cleanup: drop unused headers
Drop headers which do not provide any used symbols.
2013-06-16 00:07:32 +02:00
Alasdair G Kergon
c2dc21d89f text: miscellaneous comments & message tweaks 2013-06-15 01:28:54 +01:00
Zdenek Kabelac
55a3859632 thin: detect online metadata resize support 2013-06-11 14:03:28 +02:00
Petr Rockai
7d644443e0 activation: Pass both ondisk and incore LV to suspend. 2013-06-10 17:26:38 +02:00
Petr Rockai
f65dd341a5 locking: Make it possible to pass down an LV to activation code.
Previously, we relied on UUIDs alone, and on lvmcache to make getting a
"new copy" of VG metadata fast. If the code which triggers the activation has
the correct VG metadata at hand (the version which is currently on disk), it can
now hand it to the activation code directly.
2013-06-10 17:26:38 +02:00
Zdenek Kabelac
9966842810 snapshot: skip monitor for large cows
If the snapshot COW device is already big enough to
cover the whole origin, do not monitor it.
2013-05-27 10:35:43 +02:00
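
A hedged sketch of the condition; lv_is_cow() and origin_from_cow() exist in the LVM2 tree, but _cow_covers_origin() is a hypothetical helper standing in for the actual size comparison:

        /* sketch: a COW big enough to hold the whole origin can never
         * overflow, so there is nothing for dmeventd to watch */
        if (lv_is_cow(lv) && _cow_covers_origin(lv, origin_from_cow(lv)))
                return 1;       /* skip monitoring */
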
Zdenek Kabelac
3ba3bc0d66 cleanup: drop backtrace
After log_error/log_warn there is no point in showing <backtrace>
in the debug log trace from the next code line.
2013-05-27 10:28:32 +02:00
Jonathan Brassow
06ac797f42 Clean-up: Replace 'lv_is_active' with more correct/specific variants
There are places where 'lv_is_active' was being used where it was
more correct to use 'lv_is_active_locally'.  For example, when checking
for the existence of a kernel instance before asking for its status.
Most of the time these would work correctly.  (RAID is only allowed on
non-clustered VGs at the moment, which means that 'lv_is_active' and
'lv_is_active_locally' would give the same result.)  However, it is
more correct to use the proper variant and it helps with future
scenarios where targets might be allowed exclusively (or clustered) in
a cluster VG.
2013-05-16 10:36:56 -05:00
Alasdair G Kergon
f12d88f840 activation: fix lv_is_active regressions
Try to fix commit bf2741376d.

lv_is_active is not the same as lv_info(cmd, org, 0, &info, 0, 0).

Introduce and use lv_is_active_locally.
2013-05-15 02:13:31 +01:00
Alasdair G Kergon
2fbe1e6e00 rephrasing: miscellaneous changes
Miscellaneous changes to messages, man pages, comments and WHATS_NEW.
2013-05-15 01:50:42 +01:00
Peter Rajnoha
4407133113 toolcontext: check dm version lazily for udev_fallback setting
Setting cmd->default_settings.udev_fallback also requires a DM
driver version check. However, this caused useless mapper/control
access via ioctl when it was not actually needed. For example, if
we're not using activation code, we don't need to know udev_fallback
as there's no node and symlink processing.

For example, this premature mapper/control access caused problems
when using lvm2app even when no activation happens - there are
situations in which we don't need to use mapper/control, but still
need some of the lvm2app functionality. This is also the case for
lvm2-activation systemd generator which just needs to look at the
lvm2 configuration, but it shouldn't touch mapper/control.
2013-05-13 11:53:53 +02:00
Zdenek Kabelac
f39f5b86c3 cleanup: use dm_list_iterate_items 2013-04-25 17:33:24 +02:00
Zdenek Kabelac
4e1ac7faf1 cleanup: add some FIXMEs 2013-04-21 23:14:05 +02:00
Zdenek Kabelac
dd4fdce16c cleanup: drop unused assignment
Assigned values are unused.
2013-04-21 23:14:04 +02:00
Jonathan Brassow
c363c74a25 CLEAN-UP: Better string checking to avoid substring matches
Commit 9fd7ac7d03 introduced a method of
avoiding reading from mirrors with a device failure.  If
a device was found to be dead, the mapping table was checked for
'handle_errors' or 'block_on_error'.  These strings were checked for
in the table string via 'strstr', which could also match on strings
like, 'no_handle_errors' or 'no_block_on_error'.  No such strings
exist, but we don't want to have problems in the future if they do.
So, we check for ' <string>{'\0'|' '}'.
2013-04-12 11:30:04 -05:00
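
A sketch of a boundary-aware check of the table parameters (hypothetical helper, not the exact code from the commit):

        #include <string.h>

        /* sketch: accept "handle_errors"/"block_on_error" only as a whole
         * word - preceded by start-of-string or ' ', followed by ' ' or
         * NUL - so substrings like "no_handle_errors" cannot match */
        static int _has_table_feature(const char *params, const char *feature)
        {
                size_t len = strlen(feature);
                const char *p = params;

                while ((p = strstr(p, feature))) {
                        if ((p == params || *(p - 1) == ' ') &&
                            (p[len] == ' ' || p[len] == '\0'))
                                return 1;
                        p++;    /* keep scanning past a partial match */
                }

                return 0;
        }
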
Jonathan Brassow
ff64e3500f RAID: Add scrubbing support for RAID LVs
New options to 'lvchange' allow users to scrub their RAID LVs.
Synopsis:
	lvchange --syncaction {check|repair} vg/raid_lv

RAID scrubbing is the process of reading all the data and parity blocks in
an array and checking to see whether they are coherent.  'lvchange' can
now initiate the two scrubbing operations: "check" and "repair".  "check"
will go over the array and record the number of discrepancies but not
repair them.  "repair" will correct the discrepancies as it finds them.

'lvchange --syncaction repair vg/raid_lv' is not to be confused with
'lvconvert --repair vg/raid_lv'.  The former initiates a background
synchronization operation on the array, while the latter is designed to
repair/replace failed devices in a mirror or RAID logical volume.

Additional reporting has been added for 'lvs' to support the new
operations.  Two new printable fields (which are not printed by
default) have been added: "syncaction" and "mismatches".  These
can be accessed using the '-o' option to 'lvs', like:
	lvs -o +syncaction,mismatches vg/lv
"syncaction" will print the current synchronization operation that the
RAID volume is performing.  It can be one of the following:
        - idle:   All sync operations complete (doing nothing)
        - resync: Initializing an array or recovering after a machine failure
        - recover: Replacing a device in the array
        - check: Looking for array inconsistencies
        - repair: Looking for and repairing inconsistencies
The "mismatches" field with print the number of descrepancies found during
a check or repair operation.

The 'Cpy%Sync' field already available to 'lvs' will print the progress
of any of the above syncactions, including check and repair.

Finally, the lv_attr field has changed to accommodate the scrubbing operations
as well.  The role of the 'p'artial character in the lv_attr report field
has been expanded.  "Partial" is really an indicator for the health of a
logical volume and it makes sense to extend this to include other health
indicators as well, specifically:
        'm'ismatches:  Indicates that there are discrepancies in a RAID
                       LV.  This character is shown after a scrubbing
                       operation has detected that portions of the RAID
                       are not coherent.
        'r'efresh   :  Indicates that a device in a RAID array has suffered
                       a failure and the kernel regards it as failed -
                       even though LVM can read the device label and
                       considers the device to be ok.  The LV should be
                       'r'efreshed to notify the kernel that the device is
                       now available, or the device should be 'r'eplaced
                       if it is suspected of failing.
2013-04-11 15:33:59 -05:00
Jonathan Brassow
38f8f4a958 RAID: Capture new RAID kernel sync_action status fields
I've updated the dm_status_raid structure and dm_get_status_raid()
function to make it handle the new kernel status fields that will
be coming in dm-raid v1.5.0.  It is backwards compatible with the
old status line - initializing the new fields to '0'.  The new
structure is also more amenable to future changes.  It includes a
'reserved' field that is currently initialized to zero but could
be used to hold flags describing new features.  It also now uses
pointers for the character strings instead of attempting to allocate
their space along with the structure (causing the size of the
structure to be variable).  This allows future fields to be appended.

The new fields that are available are:
 - sync_action : shows what the sync thread in the kernel is doing
                 (idle, frozen, resync, recover, check, repair, or
                 reshape)
 - mismatch_count: shows the number of discrepancies which were
                   found or repaired by a "check" or "repair"
                   process, respectively.
2013-04-08 15:04:08 -05:00
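
A hedged sketch of the extended structure based on the description above - sync_action, mismatch_count and reserved come from the commit message; the remaining fields and the exact layout are illustrative (see libdevmapper.h for the authoritative definition):

        struct dm_status_raid {
                uint64_t reserved;       /* zero for now; room for future flags */
                uint32_t dev_count;
                char *raid_type;         /* strings are pointers now, so the    */
                char *dev_health;        /*   structure itself has a fixed size */
                char *sync_action;       /* NULL on kernels older than          */
                uint64_t mismatch_count; /*   dm-raid v1.5.0; 0 likewise        */
        };
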
Zdenek Kabelac
b9fe52e811 cleanup: move comment 2013-03-13 15:13:50 +01:00