shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	e30028004b	archiver: do not archive vg more than once Do not keep multiple archives for the executed command. Reuse the ALLOCATABLE_PV from pv status for ARCHIVED_VG vg status. Mark VG with the bit on the first archiving.	2013-07-01 23:09:26 +02:00
Jonathan Brassow	8215846aa5	Clean-up: WHATS_NEW Choosing between two entries and forgot to remove one.	2013-06-19 19:55:34 -05:00
Jonathan Brassow	a6d13308ec	RAID/MIRROR: Honor mirror_segtype_default when upconverting linear LVs If the user upconverted a linear LV to a mirror without specifying the segment type ("--type mirror" vs "--type raid1"), the "mirror" segment type was chosen without consulting the 'mirror_segtype_default' setting in lvm.conf. This setting is now used as the basis for determining which segment type should be used if left unspecified.	2013-06-19 17:50:10 -05:00
Zdenek Kabelac	155841c349	lvmetad: fix compare function Check for enough space in preallocated buffer. Fixes problem, when lvm code started to suddenly allocate too big memory chunks. TODO: lvmetad protocol should announce needed size ahead, so if metadata have 1MB we are not reallocating memory...	2013-06-18 22:12:51 +02:00
Zdenek Kabelac	2562968864	vgcfgrestore: fix crash on restore of wrong vgname When the vgname did not exist in the metadata, the command crashed on a double free in format_instance destroy(): the VG was created, used the FID, and was released, which also released the FID, so further use accessed bad memory. Fix it for this code path before release_vg() so the FID still exists when _vg_read_file_name() returns NULL.	2013-06-18 22:11:21 +02:00
Alasdair G Kergon	c2dc21d89f	text: miscellaneous comments & message tweaks	2013-06-15 01:28:54 +01:00
Peter Rajnoha	dba53681a5	man: refine lvm.conf and man page documentation for autoactivation feature	2013-06-14 10:02:56 +02:00
Zdenek Kabelac	fe22089edf	thin: vgsplit support for thins Support vgsplit for VGs with thin pools and thin volumes. In case the thin data and thin metadata volumes are moved to a new VG, move there also all related thin volumes and check that external origins are also present in this new VG.	2013-06-13 14:51:00 +02:00
Peter Rajnoha	966d4f36d7	filter-mpath: detect partitions of mpath components We use mpath filtering (enabled by devices/multipath_component_detection=1 lvm.conf setting) to avoid a situation in which we could end up with duplicate PVs found. We need to filter out the mpath components and use only the top-level multipath mapping instead for PV scans. However, if there are partitions on multipath components, we need to filter out these partitions. This patch fixes it so those partitions found on multipath components are filtered as well. For example, let's consider following configuration: The sda and sdb are mpath components, sda1 and sdb1 the partitions on these components, mpath-test the mpath mapping and mpath-test1 the partition mapping - created automatically by kpartx right after mpath-test creation. The PV resides on top. (LVM PV) \| mpath-test1 \| mpath-test \| sda1 ---------- sdb1 \ \| \|/ sda sdb E.g. for sda1 and sdb1, the code will detect this and it skips the partition that belongs to the multipath component: <snippet from the log> #filters/filter-mpath.c:156 /dev/sda1: Device is a partition, using primary device /dev/sda for mpath component detection 130 #ioctl/libdm-iface.c:1724 dm status (253:2) OF[16384](*1) 131 #filters/filter-mpath.c:196 /dev/sda1: Skipping mpath component device </snippet from the log> Otherwise, we'd see the same PV label on sda1/sdb1 and mpath-test1 at the same time ending up with "Duplicate PV found...".	2013-06-12 13:13:38 +02:00
Zdenek Kabelac	87aca628d6	thin: lvresize supports pool metadata resize Add support for lvresize of thin pool metadata device. lvresize --poolmetadatasize +20 vgname/thinpool_lv or lvresize -L +20 vgname/thinpool_lv_tmeta Where the second one allows all the args for resize (striping...) and the first option resizes according to the last metadata lv segment.	2013-06-11 14:05:20 +02:00
Zdenek Kabelac	72c3ae253e	thin: add helper functions Add find_pool_lv() and pool_can_resize_metadata().	2013-06-11 14:03:30 +02:00
Zdenek Kabelac	55a3859632	thin: detect online metadata resize support	2013-06-11 14:03:28 +02:00
Zdenek Kabelac	01ef97fcbb	thin: report 'e' metadata type with higher priority Giving volume type information about being 'metadata' type of volume has higher priority than e.g. the 'mirror' or 'thin' flag - for those types we have 'target attr' (7th field).	2013-06-11 14:03:08 +02:00
Zdenek Kabelac	c0290489c3	thin: report o as volume type for external origin Reuse 'o' attr for lvs report also for external origin.	2013-06-11 14:02:41 +02:00
Zdenek Kabelac	7151ede767	thin: report t for thin pool and volume Do not mark internal device _tdata and _tmeta as having target type 't'. They have the target type on their own (i.e. mirror, raid).	2013-06-11 13:58:16 +02:00
Zdenek Kabelac	272f5ae208	snapshots: check for active state Fix testing if the snapshot could be resized and use lv_is_active() to get correct answer in cluster.	2013-06-11 13:57:18 +02:00
Zdenek Kabelac	f05c5a97c3	filters: dump filter returns error code Add int return value from dump() function. Report stack for error case. Update composable filter.	2013-06-03 08:42:25 +02:00
Zdenek Kabelac	5467a3b2b7	filters: update composable filter Last commit made dump filter only partially composable. Add remaining functionality and also support composable wipe, which is needed when e.g. vgscan needs to remove cache. (in release fix)	2013-06-02 22:46:06 +02:00
Petr Rockai	1f73e992ef	lvmetad: no use of persistent filter with lvmetad	2013-06-02 00:49:55 +02:00
Petr Rockai	e7878da921	filters: toplevel filter not persistent Add a generic dump operation to filters and make the composite filter call through to its components. Previously, when global filter was set, the code would treat the toplevel composite filter's private area as if it belonged to a persistent filter, trying to write nonsense into a non-sensical file. Also deal with NULL cmd->filter gracefully.	2013-06-02 00:48:58 +02:00
Petr Rockai	05bf4b8cc3	vgimportclone: override global_filter in lvm.conf The global filter in system's lvm.conf may conflict with the custom filter we set up in vgimportclone (they can easily fail to intersect). Since we explicitly avoid talking to lvmetad in vgimportclone, it is safe and reasonable to do so.	2013-06-02 00:47:17 +02:00
Zdenek Kabelac	3ced1bf694	lvresize: check for max snapshot size As for lvcreate, lvresize also doesn't need to grow bigger than needed.	2013-05-30 17:35:23 +02:00
Zdenek Kabelac	bd3ece0128	lvcreate: reduce too large cow Detect maximum usable size of snapshot COW device, and do not waste more space for such LV than needed.	2013-05-30 17:35:14 +02:00
Zdenek Kabelac	eb7e206a73	snapshot: add cow_max_extents Add more precise calculation of the maximum usable snapshot size. Using only percentage fails for small size of snapshot and extents.	2013-05-30 17:30:15 +02:00
Zdenek Kabelac	59962d8d3e	snapshot: require 3 chunks for creation There is no point in creating a 2-chunk snapshot, since the snapshot is invalidated immediately with the first write as there is no free chunk for COW blocks (2 chunks are used by the snap header and the 1st metadata chunk). Enhance error message about the lowest usable size.	2013-05-30 17:28:03 +02:00
Zdenek Kabelac	56779c32c5	snapshot: fix resize of 100% full cow When the COW area is using all the available space (100%) it can be still a valid snapshot which may need a resize. So support it.	2013-05-30 17:26:20 +02:00
Zdenek Kabelac	99f0483580	args: do not accept >=16EiB sizes Instead of seeing weird overflows inside the lvm code, giving false error messages, kill the user experiment in the beginning. Who needs to use more than 16EiB with lvm2 and 64bit anyway...	2013-05-30 17:23:51 +02:00
Zdenek Kabelac	2f1a571c97	fid: fix reset of PV fid Avoid hitting memory corruption (double free) in code path, where PV FID has been already destroyed and the released pointer was left in PV structure and could have been tried to be released from there 2nd. time with final context destruction.	2013-05-30 16:52:39 +02:00
Peter Rajnoha	be25f7ac83	WHATS_NEW: ea_start,ea_size -> ba_start,ba_size	2013-05-28 12:43:26 +02:00
Peter Rajnoha	732859d21f	refactor: rename embedding area -> bootloader area	2013-05-28 12:37:22 +02:00
Zdenek Kabelac	9966842810	snapshot: skip monitor for large cows If snapshot cow device is already big enough to cover whole origin, do not monitor it.	2013-05-27 10:35:43 +02:00
Zdenek Kabelac	77952151af	snapshot: add lv_is_cow_covering_origin Add function to check whether the size of cow is already big enough to cover whole origin.	2013-05-27 10:34:53 +02:00
Zdenek Kabelac	06e8ff29ff	snapshot: use dm_get_status_snapshot() Replace code with libdm call to dm_get_status_snapshot().	2013-05-27 10:32:02 +02:00
Zdenek Kabelac	2ada982e73	vgchange: check for mounted fs Check for mounted fs also for vgchange command, not just lvchange. NOTE: Code is using lv_info() just like lvs_in_vg_opened(). It should be probably converted into lv_is_active_locally().	2013-05-20 16:47:33 +02:00
Jonathan Brassow	06ac797f42	Clean-up: Replace 'lv_is_active' with more correct/specific variants There are places where 'lv_is_active' was being used where it was more correct to use 'lv_is_active_locally'. For example, when checking for the existence of a kernel instance before asking for its status. Most of the time these would work correctly. (RAID is only allowed on non-clustered VGs at the moment, which means that 'lv_is_active' and 'lv_is_active_locally' would give the same result.) However, it is more correct to use the proper variant and it helps with future scenarios where targets might be allowed exclusively (or clustered) in a cluster VG.	2013-05-16 10:36:56 -05:00
Peter Rajnoha	b3b551a93e	WHATS_NEW: bad day	2013-05-16 11:02:38 +02:00
Peter Rajnoha	cb0d817fb5	WHATS_NEW: for commit `4f6c2951d6`	2013-05-16 08:38:27 +02:00
Alasdair G Kergon	f12d88f840	activation: fix lv_is_active regressions Try to fix commit `bf2741376d`. lv_is_active is not the same as lv_info(cmd, org, 0, &info, 0, 0). Introduce and use lv_is_active_locally.	2013-05-15 02:13:31 +01:00
Alasdair G Kergon	2fbe1e6e00	rephrasing: miscellaneous changes Miscellaneous changes to messages, man pages, comments and WHATS_NEW.	2013-05-15 01:50:42 +01:00
Alasdair G Kergon	2e4a66a761	make: fix exported symbols regex for non-GNU sed Remove a couple of incorrect backslashes from expressions used to generate lists of exported symbols so it works with busybox sed. [John Spencer]	2013-05-14 19:29:26 +01:00
Alasdair G Kergon	c6cf2ed7fd	commands: accept --yes globally Accept --yes on all commands, even ones that don't today have prompts, so that test scripts that don't care about interactive prompts no longer need to deal with them. But continue to mention --yes only in the command prototypes that actually use it.	2013-05-14 18:45:37 +01:00
Mike Snitzer	8ad7865b42	Fix alignment of PV data area if detected alignment less than 1 MB This fixes a long standing regression since LVM2 2.02.74 (commit `4efb1d9c`, "Update heuristic used for default and detected data alignment.") The default PE alignment could be used (via MAX()) even if it was determined that the device's MD stripe width, or minimal_io_size or optimal_io_size were not factors of the default PE alignment (either 64K or the newer default of 1MB, etc). This bug would manifest if the default PE alignment was larger than the overriding hint that the device provided (e.g. default of 1MB vs optimal_io_size of 768K). Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2013-05-13 15:56:47 -04:00
Zdenek Kabelac	55fe07ad98	mm: fix leak in fail path If the dm_realloc would fail, the already allocated _maps_buffer memory would have been lost (overwritten with NULL). Fix this by using temporary line buffer. Also add a minor cleanup to set end of buffer to '\0', only when we really know the file size fits the preallocated buffer.	2013-05-13 13:13:20 +02:00
Peter Rajnoha	4407133113	toolcontext: check dm version lazily for udev_fallback setting Setting the cmd->default_settings.udev_fallback also requires DM driver version check. However, this caused useless mapper/control access with ioctl if not needed actually. For example if we're not using activation code, we don't need to know the udev_fallback as there's no node and symlink processing. For example, this premature mapper/control access caused problems when using lvm2app even when no activation happens - there are situations in which we don't need to use mapper/control, but still need some of the lvm2app functionality. This is also the case for lvm2-activation systemd generator which just needs to look at the lvm2 configuration, but it shouldn't touch mapper/control.	2013-05-13 11:53:53 +02:00
Zdenek Kabelac	8d004b5127	report: show active state of LV For non clustered VG - show "active"/"" For clustered VG it's more complex: "local exclusive" "remote exclusive" "locally" "remotely"	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	8b18ab76d2	report: show dmeventd monitoring status Add new lvs segment field 'Monitor' showing 3 states: "monitored" - LV is monitored by dmeventd. "not monitored" - LV is currently not being monitored by dmeventd "" (empty) - LV does not support monitoring, or dmeventd support is not compiled in.	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	3f7de58e96	man: lvextend --use-policies Add missing man info.	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	f84f12a6a3	snapshot: rework cluster creation and removal Support for exclusive activation of snapshots revealed some problems. When snapshot is created, COW LV is activated first (for clearing) and then it's transformed into snapshot's COW LV, but it has left the lock for such LV active in cluster and this lock could not have been removed from dlm, unless snapshot has been removed within same dlm session. If the user tried to remove snapshot after rebooting node, the lock was missing, and COW LV could not have been detached. Patch modifies the approach in this way: Always deactivate COW LV for clustered vg after clearing (so it's activated again via implicit snapshot activation rule when snapshot is activated). When snapshot is removed, activate COW LV as independent LV, so the lock will exist for such LV, but only when the snapshot is active. Also add test case for testing snapshot removal after cluster reboot.	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	d51b7e5404	clvmd: avoid pretesting of dev availability Patch fixes hidden problem with lvm metadata caching. When the pretest was made, only the committed data have been cached back since the call lv_info_by_lvid() triggers mda read operation. However call of lv_suspend_if_active() also reads precommitted metadata. The problem is visible in this sequence of calls: vg_write(), suspend_lv(), vg_commit(), resume_lv() which may end with leaving outdated mda in lvm cache, since vg_write() drops cached metadata and vg_commit() only transforms precommitted to committed metadata, but in the case of pretesting we have no precommitted mda available so the cache will continue to use old metadata. This happens, when suspend LV is inactive.	2013-04-25 17:33:22 +02:00
Zdenek Kabelac	45eeb70b02	config: merge timestamps Merging multiple config files together needs to know newest (highest) timestamp of merged files. Persistent cache file is being used only in case the config file is older than .cache file.	2013-04-23 12:31:16 +02:00
Zdenek Kabelac	1951798d72	vgread: fix fid transfer for lvm1 and pool format Assign fid as the last step before returning VG. Make the format reader for 'lvm1' and 'pool' equal to 'lvm2' format reader. It has caused memory corruption to lvmetad as it later calls destroy_instance() to allocated fid. This patch should fix problems with crashing test lvmetad-lvm1.sh.	2013-04-21 23:13:57 +02:00
Zdenek Kabelac	a2b76a6f02	thin: fix resource leak in err path If the devices list could not have been obtained, FILE* was leaked.	2013-04-21 23:10:30 +02:00
Zdenek Kabelac	17a6915054	thin: explicitly avoid pvmove operation So far we do not support pvmove for thin volumes and thin pools.	2013-04-21 23:09:11 +02:00
Zdenek Kabelac	f787b575b5	lvmetad: fix error paths Also add missing goto out on error. Error path missed return NULL leading to double free of enc_value.	2013-04-21 23:04:53 +02:00
Zdenek Kabelac	c9d8d22224	clvmd: fix response status Failing status code is expected to be 0. Also do not return '*response' as pointer which has been already free().	2013-04-21 22:54:42 +02:00
Jonathan Brassow	2e0740f7ef	RAID: Add writemostly/writebehind support for RAID1 'lvchange' is used to alter a RAID 1 logical volume's write-mostly and write-behind characteristics. The '--writemostly' parameter takes a PV as an argument with an optional trailing character to specify whether to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing character is given, it will set the flag. Synopsis: lvchange [--writemostly <PV>:{t\|y\|n}] [--writebehind <count>] vg/lv Example: lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv The last character in the 'lv_attr' field is used to show whether a device has the WriteMostly flag set. It is signified with a 'w'. If the device has failed, the 'p'artial flag has priority. Example ("nosync" raid1 with mismatch_cnt and writemostly): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg Rwi---r-m 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m Example (raid1 with mismatch_cnt, writemostly - but failed drive): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg rwi---r-p 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m A new reportable field has been added for writebehind as well. If write-behind has not been set or the LV is not RAID1, the field will be blank. Example (writebehind is set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- 512 [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor-- Example (writebehind is not set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor--	2013-04-15 13:59:46 -05:00
Zdenek Kabelac	a81a2406f1	tools: add common lv_change_activate Move common code for changing activation state from vgchange and lvchange to one function. Fix the order of checks - so we always implicitly activate snapshots and thin volumes in exclusive mode, and we do not allow local deactivation for them.	2013-04-12 11:30:07 +02:00
Jonathan Brassow	719e908bc0	WHATS_NEW: Add WHATS_NEW entry for previous commit.	2013-04-11 16:03:24 -05:00
Jonathan Brassow	ff64e3500f	RAID: Add scrubbing support for RAID LVs New options to 'lvchange' allow users to scrub their RAID LVs. Synopsis: lvchange --syncaction {check\|repair} vg/raid_lv RAID scrubbing is the process of reading all the data and parity blocks in an array and checking to see whether they are coherent. 'lvchange' can now initiate the two scrubbing operations: "check" and "repair". "check" will go over the array and record the number of discrepancies but not repair them. "repair" will correct the discrepancies as it finds them. 'lvchange --syncaction repair vg/raid_lv' is not to be confused with 'lvconvert --repair vg/raid_lv'. The former initiates a background synchronization operation on the array, while the latter is designed to repair/replace failed devices in a mirror or RAID logical volume. Additional reporting has been added for 'lvs' to support the new operations. Two new printable fields (which are not printed by default) have been added: "syncaction" and "mismatches". These can be accessed using the '-o' option to 'lvs', like: lvs -o +syncaction,mismatches vg/lv "syncaction" will print the current synchronization operation that the RAID volume is performing. It can be one of the following: - idle: All sync operations complete (doing nothing) - resync: Initializing an array or recovering after a machine failure - recover: Replacing a device in the array - check: Looking for array inconsistencies - repair: Looking for and repairing inconsistencies The "mismatches" field will print the number of discrepancies found during a check or repair operation. The 'Cpy%Sync' field already available to 'lvs' will print the progress of any of the above syncactions, including check and repair. Finally, the lv_attr field has changed to accommodate the scrubbing operations as well. The role of the 'p'artial character in the lv_attr report field has expanded. 
"Partial" is really an indicator for the health of a logical volume and it makes sense to extend this to include other health indicators as well, specifically: 'm'ismatches: Indicates that there are discrepancies in a RAID LV. This character is shown after a scrubbing operation has detected that portions of the RAID are not coherent. 'r'efresh : Indicates that a device in a RAID array has suffered a failure and the kernel regards it as failed - even though LVM can read the device label and considers the device to be ok. The LV should be 'r'efreshed to notify the kernel that the device is now available, or the device should be 'r'eplaced if it is suspected of failing.	2013-04-11 15:33:59 -05:00
Jonathan Brassow	95d28735ea	WHATS_NEW: Include entry for RAID status func improvements	2013-04-08 15:17:12 -05:00
Zdenek Kabelac	c22e925ce4	man: lvcreate document external origin snapshot Document added support for external origin.	2013-04-05 14:15:03 +02:00
Zdenek Kabelac	ddafa0115e	man: updates for lvconvert and lvcreate Cleanup and improvement on man pages.	2013-04-05 14:14:20 +02:00
Peter Rajnoha	32ae07cef1	pv_write: clean up non-orphan format1 PV write ...to not pollute the common and format-independent code in the abstraction layer above. The format1 pv_write has common code for writing metadata and PV header by calling the "write_disks" fn and when rewriting the header itself only (e.g. just for the purpose of changing the PV UUID) during the pvchange operation, we had to tweak this functionality for the format1 case and we had to assign the PV the orphan state temporarily. This patch removes the need for this format1 tweak and it calls the write_disks with appropriate flag indicating whether this is a PV write call or a VG write call, allowing for metadata update for the latter one. Also, a side effect of the former tweak was that it effectively invalidated the cache (even for the non-format1 PVs) as we assigned it the orphan state temporarily just for the format1 PV write to pass. Also, that tweak made it difficult to directly detect whether a PV was part of a VG or not because the state was incorrect. Also, it's not necessary to backup and restore some PV fields when doing a PV write: orig_pe_size = pv_pe_size(pv); orig_pe_start = pv_pe_start(pv); orig_pe_count = pv_pe_count(pv); ... pv_write(pv) ... pv->pe_size = orig_pe_size; pv->pe_start = orig_pe_start; pv->pe_count = orig_pe_count; ...this is already done by the layer below itself (the _format1_pv_write fn). So let's have this cleaned up so we don't need to be bothered about any 'format1 special case for pv_write' anymore.	2013-03-25 15:08:26 +01:00
Peter Rajnoha	784867d5bd	WHATS_NEW: vgextend and PV with 0 MDAs	2013-03-19 15:41:34 +01:00
Zdenek Kabelac	b36a776a7f	thin: move update_pool_params Now we may recognize preset arguments, move the code for updating thin pool related values into /lib portion of the code.	2013-03-13 15:13:54 +01:00
Alasdair G Kergon	cbfb5a98b5	filters: power2 devs get precedence if PVIDs match Give precedence to EMC "power2" devices with duplicate PVIDs like we already do with "emcpower" devices.	2013-03-11 20:10:49 +00:00
Peter Rajnoha	03b5c51730	WHATS_NEW: add lines for config validation support	2013-03-06 11:00:30 +01:00
Peter Rajnoha	b3776468fa	WHATS_NEW: add lines for embedding area support	2013-02-26 15:50:43 +01:00
Zdenek Kabelac	b73de73151	thin: lvconvert support for external origin Add basic support for converting LV into an external origin volume. Syntax: lvconvert --thinpool vg/pool --originname renamed_origin -T origin It will convert volume 'origin' into a thin volume, which will use 'renamed_origin' as an external read-only origin. All read/write into origin will go via 'pool'. renamed_origin volume is read-only volume, that could be activated only in read-only mode, and cannot be modified.	2013-02-23 10:38:20 +01:00
Zdenek Kabelac	d023b2d12f	lvremove: easier removal of dependent lvs Add function to remove lvs which depend on the removed lv before that lv is removed. User is asked for confirmation.	2013-02-23 10:31:05 +01:00
Zdenek Kabelac	3679bb1cd9	activation: simplify activation code Reorder activation code to look similar for preload tree and activation tree. It also gives much better support for device stacking, since now we also support activation of snapshot which might be then used for other devices.	2013-02-23 10:30:03 +01:00
Zdenek Kabelac	0631d233d8	activation: add _add_layer_target_to_dtree Add function for creation of simple linear mapping over layer device.	2013-02-23 10:29:08 +01:00
Zdenek Kabelac	78b23f3595	activation: extend _cached_info Add layer string to support check of layered devices.	2013-02-23 10:28:01 +01:00
Jonathan Brassow	bbc6378b73	RAID: Make 'lvchange --refresh' restore transiently failed RAID PVs A new function (dm_tree_node_force_identical_table_reload) was added to avoid the suppression of identical table reloads. This allows RAID LVs to reload the on-disk superblock information that contains which devices have failed and the bitmaps. If the failed device has returned, this has the effect of restoring the device and initiating recovery. Without this patch, the user had to completely deactivate their RAID LV and re-activate it in order to restore the failed device. Now they simply need to suspend and resume (which is done by 'lvchange --refresh'). The identical table suppression is only avoided if the LV is not PARTIAL (i.e. all of its devices can be seen and read by LVM) and the kernel status of the array contains failed devices. In other words, the function will only be called in the case where we may have success in restoring a failed device in the array.	2013-02-21 11:31:36 -06:00
Jonathan Brassow	3ab46449f4	vgimport: Allow '--force' to import VGs with missing PVs. When there are missing PVs in a volume group, most operations that alter the LVM metadata are disallowed. It turns out that 'vgimport' is one of those disallowed operations. This is bad because it creates a circular dependency. 'vgimport' will complain that the VG is inconsistent and that 'vgreduce --removemissing' must be run. However, 'vgreduce' cannot be run because it has not been imported. Therefore, 'vgimport' must be one of the operations allowed to change the metadata when PVs are missing. The '--force' option is the way to make 'vgimport' happen in spite of the missing PVs.	2013-02-20 16:37:41 -06:00
Peter Rajnoha	303e86adc8	pvcreate: fix alignment to incorporate alignment offset if PV has 0 MDAs If zero metadata copies are used, there's no further recalculation of PV alignment that happens when adding metadata areas to the PV and which actually calculates the alignment correctly as a matter of fact. So fix this for "PV without MDA" case as well. Before this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 8.00m After this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in: if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align) pv->pe_start = pv->pe_align ... This part of the condition is not reachable as with the PV_PE_START_CALC, we always have pv->pe_start set to 0 from the PV struct initialisation (...the pv->pe_start value is just being calculated).	2013-02-21 14:51:19 +01:00
Jonathan Brassow	bd0ee420b5	RAID: Allow remove/replace of sub-LVs composed of error segments. When a device fails, we may wish to replace those segments with an error segment. (Like when a 'vgreduce --removemissing' removes a failed device that happens to be a RAID image/meta.) We are then left with images that we will eventually want to remove or replace. This patch allows us to pull out these virtual "error" sub-LVs. This allows a user to 'lvconvert -m -1 vg/lv' to extract the bad sub-LVs. Sub-LVs with error segments are considered for extraction before other possible devices so that good devices are not accidentally removed. This patch also adds the ability to replace RAID images that contain error segments. The user will still be unable to run 'lvconvert --replace' because there is no way to address the 'error' segment (i.e. no PV that it is associated with). However, 'lvconvert --repair' can be used to replace the image's error segment with a new PV. This is also the most appropriate way to do it, since the LV will continue to be reported as 'partial'.	2013-02-20 14:58:56 -06:00
Jonathan Brassow	845852d6b4	RAID: Make 'vgreduce --removemissing' work with RAID LVs Currently it is impossible to remove a failed PV which has a RAID LV on it. This patch fixes the issue by replacing the failed PV with an 'error' segment within the affected sub-LVs. Once there is no longer a RAID LV using the PV, it can be removed. Most often, it is better to replace a failed RAID device with a spare. (You can use 'lvconvert --repair <vg>/<LV>' to accomplish that.) However, if there are no spares in the volume group and none will be added, it is useful to be able to remove the failed device. Following patches address the ability to perform 'lvconvert' operations on RAID LVs that contain sub-LVs composed of 'error' segments.	2013-02-20 14:52:46 -06:00
Jonathan Brassow	0e4ffd9d3b	clean-up: Rename lvm.conf setting 'mirror_region_size' to 'raid_region_size' We have been using 'mirror_region_size' in lvm.conf as the default region size for RAID logical volumes as well as mirror logical volumes. Since, "raid" is more inclusive and representative than "mirror", I have changed the name of this setting. We must still check for the old setting and warn the user if we are overriding it with the new setting if both happen to be present.	2013-02-20 14:40:17 -06:00
Peter Rajnoha	722ca363f0	report: fix pvs -o pv_free reporting for PVs with 0 PEs [0] raw/~ # lsblk -o NAME,SIZE /dev/sda NAME SIZE sda 128M [0] raw/~ # pvcreate --dataalignment 128m /dev/sda Physical volume "/dev/sda" successfully created [0] raw/~ # vgcreate vg /dev/sda Volume group "vg" successfully created [0] raw/~ # lvcreate -l1 vg Volume group "vg" has insufficient free space (0 extents): 1 required. Before this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 128.00m After this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 0	2013-02-21 13:28:07 +01:00
Zdenek Kabelac	c984d8fbab	thin: properly unmark volume after detach When the volume is detached from thin pool, unmask THIN_VOLUME flag and reset related pointers.	2013-02-05 14:40:37 +01:00
Zdenek Kabelac	a5b9b4bf02	thin: fix forbidden discards checks Instead of check for lv_is_active() for thin pool LV, query the whole pool via new pool_is_active(). Fixes a problem when we cannot change discards settings for active pool device where the actual layer for pool device was inactive, but thin volumes using thin pool have been active.	2013-02-05 14:38:16 +01:00
Zdenek Kabelac	11eaf1c98c	thin: add function pool_is_active This internal function checks for active pool device. For cluster it checks every thin volume; on the non-clustered VG we need to check just for presence of -tpool device.	2013-02-05 14:35:44 +01:00
Zdenek Kabelac	9d445f371c	report: leave empty report field for 0 Since we do not support LVs with 0 size, use this value as 'error' value for devices without origin, and leave this field blank as in other cases.	2013-02-05 14:32:37 +01:00
Zdenek Kabelac	be5ad90703	lvconvert: fix accepting second lv name Do not allow to accept second LV name on lvconvert --thinpool command line.	2013-02-05 14:31:17 +01:00
Zdenek Kabelac	a4870c79ca	thin: use noflush for obtaining transaction_id Do not flush thin pool data, when reading transaction_id status.	2013-02-04 19:05:56 +01:00
Zdenek Kabelac	ca7abbce8a	activate: add lv_layer function Add function to return layer name for LV.	2013-02-04 19:01:10 +01:00
Jonathan Brassow	38e7b37c89	WHATS_NEW: Better description of previous change	2013-02-01 11:52:25 -06:00
Jonathan Brassow	801d4f96a8	RAID: Improve 'lvs' attribute reporting of RAID LVs and sub-LVs There are currently a few issues with the reporting done on RAID LVs and sub-LVs. The most concerning is that 'lvs' does not always report the correct failure status of individual RAID sub-LVs (devices). This can occur when a device fails and is restored after the failure has been detected by the kernel. In this case, 'lvs' would report all devices are fine because it can read the labels on each device just fine. Example: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) However, 'dmsetup status' on the device tells us a different story: [root@bp-01 lvm2]# dmsetup status vg-lv 0 1024000 raid raid1 2 DA 1024000/1024000 In this case, we must also be sure to check the RAID LVs kernel status in order to get the proper information. Here is an example of the correct output that is displayed after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-p 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-p /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-p /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) The other case where 'lvs' gives incomplete or improper output is when a device is replaced or added to a RAID LV. It should display that the RAID LV is in the process of sync'ing and that the new device is the only one that is not-in-sync - as indicated by a leading 'I' in the Attr column. (Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates an (I)mage that is not in sync.) 
Here's an example of the old incorrect behaviour: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg Iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note that all the images currently are marked as 'I' even though it is only the last device that has been added that should be marked. Here is an example of the correct output after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note only the last image is marked with an 'I'. This is correct and we can tell that it isn't the whole array that is sync'ing, but just the new device. It also works under snapshots... 
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg owi-a-r-p 33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-p /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-p /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) snap vg swi-a-s-- /dev/sda1(51201)	2013-02-01 11:33:54 -06:00
Peter Rajnoha	f7da1caf8d	blkdeactivate: fix handling of nested mountpoints and mangled mount paths. If there was a nested mountpoint inside an existing mount path, blkdeactivate could fail to unmount such a mountpoint as it needs to deactivate the deepest path first and continue upwards. For example the simplest reproducer: [root@rhel6-a ~]# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 4G 0 disk |-vg-lvol0 (dm-2) 253:2 0 32M 0 lvm /mnt/a `-vg-lvol1 (dm-3) 253:3 0 32M 0 lvm /mnt/a/b Before this patch: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a umount: /mnt/a: device is busy. (In some cases useful info about processes that use the device is found by lsof(8) or fuser(1)) UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b LVM: deactivating Logical Volume vg/lvol1 (deactivation of vg/lvol0 is skipped as /mnt/a that is on lvol0 can't be unmounted - it still has /mnt/a/b as nested mountpoint!) With this patch applied: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a LVM: deactivating Logical Volume vg/lvol0 LVM: deactivating Logical Volume vg/lvol1 === Also, this patch contains a fix for processing mangled mount paths: [root@rhel6-a ~]# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 4G 0 disk `-vg-lvol0 (dm-2) 253:2 0 32M 0 lvm /mnt/x y z [root@rhel6-a ~]# lsblk -r vg-lvol0 253:2 0 32M 0 lvm /mnt/x\x20y\x20z (the mount path is mangled with \xNN that is visible in raw lsblk output only and which is used in blkdeactivate as well) Before this patch: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: umount: /mnt/x\x20y\x20z: not found After this patch applied: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/x\x20y\x20z LVM: deactivating Logical Volume vg/lvol0	2013-01-23 14:45:41 +01:00
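The two ideas in the blkdeactivate commit above (unmount the deepest path first, and decode `\xNN` escapes from raw `lsblk` output) can be illustrated with a short Python sketch. blkdeactivate itself is a shell script, so this is not its code; the helper names `unmangle` and `umount_order` are hypothetical, and counting `/` separators is just one simple way to approximate nesting depth.

```python
import re


def unmangle(path: str) -> str:
    """Decode \\xNN escapes such as those printed by `lsblk -r`."""
    return re.sub(
        r"\\x([0-9a-fA-F]{2})",
        lambda m: chr(int(m.group(1), 16)),
        path,
    )


def umount_order(mountpoints):
    """Return mountpoints with the most deeply nested paths first."""
    return sorted(
        mountpoints,
        key=lambda p: p.rstrip("/").count("/"),
        reverse=True,
    )
```

With the reproducer from the commit message, `umount_order(["/mnt/a", "/mnt/a/b"])` puts `/mnt/a/b` ahead of `/mnt/a`, matching the order shown in the "With this patch applied" output.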
Zdenek Kabelac	8bcc1da2f3	locales: use higher prio LC_ALL variable For resetting locale environment into significantly less memory consuming version 'C' - use LC_ALL instead of LANG since it has higher priority in locale settings. Otherwise we may observe whole locale-archive which might be over 100MB on e.g. Fedora systems locked in memory with some daemons.	2013-01-22 11:25:02 +01:00
Petr Rockai	142c4bf9f0	Update WHATS_NEW.	2013-01-16 11:22:08 +01:00
Zdenek Kabelac	2b760a7fa7	whatsnew	2013-01-11 09:26:51 +01:00
Alasdair G Kergon	7f747a0d73	logging: add debug classes Add log/debug_classes to lvm.conf to allow debug messages to be classified and filtered at runtime. The dm_errno field is only used by log_error(), so I've redefined it for log_debug() messages to hold the message class. By default, all existing messages appear, but we can add categories that generate high volumes of data, such as logging all traffic to/from lvmetad.	2013-01-07 22:25:19 +00:00
Peter Rajnoha	ad85b0c526	pvscan: synchronize with udev if pvscan --cache is used. We need to call sync_local_dev_names directly as pvscan uses VG_GLOBAL lock and this one does not cause the synchronization (sync_dev_names) to be called on unlock (VG_GLOBAL is not a real VG): #define unlock_vg(cmd, vol) do { \ if (is_real_vg(vol)) \ sync_dev_names(cmd); \ (void) lock_vol(cmd, vol, LCK_VG_UNLOCK); \ } while (0) Without this fix, we end up without udev synchronization for the pvscan --cache (mainly for -aay that causes the VGs/LVs to be autoactivated) and also udev synchronization cookies are then left in the system since they're not managed properly (code before sets up udev sync cookies, but we have to call dm_udev_wait at least once after that to do the wait and cleanup).	2012-12-21 11:15:46 +01:00
Peter Rajnoha	756bcabbfe	activation: fix autoactivation to not trigger on each PV change Before, the pvscan --cache -aay was called on each ADD and CHANGE uevent (for a device that is not a device-mapper device) and each CHANGE event (for a PV that is a device-mapper device). This causes troubles with autoactivation in some cases as CHANGE event may originate from using the OPTION+="watch" udev rule that is defined in 60-persistent-storage.rules (part of the rules provided by udev directly) and it's used for all block devices (except fd|mtd|nbd|gnbd|btibm|dm-|md* devices). For example, the following sequence incorrectly activates the rest of LVs in a VG if one of the LVs in the VG is being removed: [root@rhel6-a ~]# pvcreate /dev/sda Physical volume "/dev/sda" successfully created [root@rhel6-a ~]# vgcreate vg /dev/sda Volume group "vg" successfully created [root@rhel6-a ~]# lvcreate -l1 vg Logical volume "lvol0" created [root@rhel6-a ~]# lvcreate -l1 vg Logical volume "lvol1" created [root@rhel6-a ~]# vgchange -an vg 0 logical volume(s) in volume group "vg" now active [root@rhel6-a ~]# lvs LV VG Attr LSize Pool Origin Data% Move Log Cpy%Sync Convert lvol0 vg -wi------ 4.00m lvol1 vg -wi------ 4.00m [root@rhel6-a ~]# lvremove -ff vg/lvol1 Logical volume "lvol1" successfully removed [root@rhel6-a ~]# lvs LV VG Attr LSize Pool Origin Data% Move Log Cpy%Sync Convert lvol0 vg -wi-a---- 4.00m ...so the vg was deactivated, then lvol1 removed, and we end up with lvol1 removed (which is ok) BUT with lvol0 activated (which is wrong)!!! This is because after lvol1 removal, we need to write metadata to the underlying device /dev/sda and that causes the CHANGE event to be generated (because of the WATCH udev rule set on this device) and this causes the pvscan --cache -aay to be reevaluated. 
We have to limit this and call pvscan --cache -aay to autoactivate VGs/LVs only in these cases: --> if the PV is not a dm device, scan only after proper device addition (ADD event) and not with any other changes (CHANGE event) --> if the PV is a dm device, scan only after proper mapping activation (CHANGE event + the underlying PV in a state "just activated")	2012-12-21 10:34:48 +01:00
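The two cases listed in the commit above reduce to a small decision rule. The following Python sketch only restates that rule; the real logic lives in udev rules, and the function name `should_autoactivate` and its parameters are hypothetical.

```python
def should_autoactivate(action: str, is_dm_device: bool,
                        just_activated: bool = False) -> bool:
    """Decide whether a uevent should trigger `pvscan --cache -aay`.

    action         -- uevent action, e.g. "add" or "change"
    is_dm_device   -- True if the PV sits on a device-mapper device
    just_activated -- for dm devices, True if the mapping was just activated
    """
    if not is_dm_device:
        # Non-dm PV: scan only on proper device addition.
        return action == "add"
    # dm PV: scan only on CHANGE when the mapping has just been activated.
    return action == "change" and just_activated
```

Under this rule the CHANGE event produced by the metadata write in the `lvremove` example no longer triggers autoactivation, because `/dev/sda` is not a dm device and the action is not ADD.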
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read were not in-sync. The kernel enforces this rule unless overridden by not allowing the creation of an array that is not in-sync and includes a device that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is inactive 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
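The three conditions in the commit above can be written out as a small decision table. This Python sketch is an illustration of the stated rules only, not the lvm2 code; the `Decision` enum and the `check_raid_operation` function are hypothetical names.

```python
from enum import Enum


class Decision(Enum):
    ALLOW = "allow"
    ALLOW_WITH_WARNING = "allow, contents may not be recoverable"
    DENY = "deny"


def check_raid_operation(operation: str, lv_active: bool,
                         in_sync: bool) -> Decision:
    """Apply the replace/repair rules; operation is "replace" or "repair"."""
    if operation not in ("replace", "repair"):
        raise ValueError(f"unknown operation: {operation}")
    if not lv_active:
        # 1) sync state is unknown for an inactive LV, so refuse both.
        return Decision.DENY
    if in_sync:
        return Decision.ALLOW
    if operation == "repair":
        # 3) repair of an out-of-sync LV is allowed, with a warning.
        return Decision.ALLOW_WITH_WARNING
    # 2) replacement of an out-of-sync LV is never allowed.
    return Decision.DENY
```

For instance, `check_raid_operation("repair", lv_active=True, in_sync=False)` yields `Decision.ALLOW_WITH_WARNING`, while the same call with `"replace"` yields `Decision.DENY`.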
Peter Rajnoha	0379c480e0	WHATS_NEW: changelog for `fae1a611d2` and `5294a6f77a`	2012-12-18 12:12:58 +01:00
Zdenek Kabelac	401c9aba4a	pv_read: add missing check for valid info If the lvmcache_info_from_pvid() fails to find valid info, invoke the lookup by dev, and only in this case call lvmcache_info_from_pvid() again. Also check for the result of info and return error directly, so the NULL is not passed to lvmcache_get_label().	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	3e8dbfaecf	lvmetad: add check for failure dm_config_write_node Detect if dm_config_write_node failed and fail correctly.	2012-12-15 17:23:27 +01:00


2701 Commits