shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	3b604e5c8e	lvinfo: allow to use lv_info with NULL info When NULL info struct is passed in - function is usable as a quick query for lv_is_active_locally() - with a bonus we may query for layered device. So it could be seen as a more efficient lv_is_active_locally().	2013-09-23 12:13:06 +02:00
Alasdair G Kergon	bd75844024	release 2.02.101 112 files changed, 4131 insertions(+), 1312 deletions(-)	2013-09-20 13:56:29 +01:00
Alasdair G Kergon	6e912d949b	tools: Avoid overflow in _get_int_arg. Use strtoull instead of strtol so that argument size is not cut to 31 bits on machines with 32-bit long. (Mikulas)	2013-09-18 01:16:48 +01:00
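The overflow described in 6e912d949b can be illustrated with a minimal sketch. `parse_u64` is a hypothetical helper written for this note, not the lvm2 `_get_int_arg` code; it only shows why parsing through `strtoull` keeps the full 64-bit range where a 32-bit `long` would not.

```c
#include <assert.h>
#include <errno.h>
#include <stdint.h>
#include <stdlib.h>

/*
 * Hypothetical helper (not lvm2 code): parse an unsigned decimal string
 * into a 64-bit value. Returns 1 on success, 0 on malformed or
 * out-of-range input.
 */
static int parse_u64(const char *s, uint64_t *out)
{
	char *end;
	unsigned long long v;

	assert(s && out);

	/* strtoull accepts leading whitespace and signs; reject both here */
	if (*s < '0' || *s > '9')
		return 0;

	errno = 0;
	v = strtoull(s, &end, 10);

	if (*end != '\0')       /* trailing junk after the digits */
		return 0;
	if (errno == ERANGE)    /* value does not fit in unsigned long long */
		return 0;

	*out = (uint64_t) v;
	return 1;
}
```

With this shape, a value such as 4294967296 (2^32) parses correctly even where `long` is 32 bits wide.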
Alasdair G Kergon	a3a5f58c21	reporting: Add devtypes command. Add internal devtypes reporting command to display built-in recognised block device types. (The output does not include any additional types added by a configuration file.) > lvm devtypes -o help Device Types Fields ------------------- devtype_all - All fields in this section. devtype_name - Name of Device Type exactly as it appears in /proc/devices. devtype_max_partitions - Maximum number of partitions. (How many device minor numbers get reserved for each device.) devtype_description - Description of Device Type. > lvm devtypes DevType MaxParts Description aoe 16 ATA over Ethernet ataraid 16 ATA Raid bcache 1 bcache block device cache blkext 1 Extended device partitions ...	2013-09-18 01:09:15 +01:00
Jonathan Brassow	d1bcb21e02	WHATS_NEW: Better description for commit `82228ac` More correct description of changes made to disallow thin+mirror.	2013-09-16 15:37:48 -05:00
Alasdair G Kergon	36c5bb40a2	Makefiles: Fix CC variable override. The CC override in commit `f42b2d4bbf` caused the built-in value to be used instead of the configured value when it wasn't being overridden. The behaviour is explained here: http://stackoverflow.com/questions/18007326/how-to-change-default-values-of-variables-like-cc-in-makefile	2013-09-16 19:57:14 +01:00
Alasdair G Kergon	97ba18f4cb	filters: Add bcache. N.B. Using bcache devices as PVs is still experimental. Problems should be reported to the appropriate mailing lists.	2013-09-16 16:56:55 +01:00
Peter Rajnoha	9742c5192e	systemd: run lvm2-activation-net.service after lvm2-activation.service The lvm2-activation-net.service was ordered only with respect to iscsi and fcoe service before. In addition to that, we also need ordering with respect to lvm2-activation.service to prevent parallel vgchange -aay runs which may cause some problems during activation. See also https://bugs.gentoo.org/show_bug.cgi?id=480066. With this patch, the ordering is firmly set to: lvm2-activation-early.service -> lvm2-activation.service -> lvm2-activation-net.service Thanks to Alexander Tsoy for the original patch (modified a bit here): https://www.redhat.com/archives/lvm-devel/2013-September/msg00049.html	2013-09-16 11:47:09 +02:00
Peter Rajnoha	e2268aeb92	WHATS_NEW: some missing lines	2013-09-12 14:16:54 +02:00
Zdenek Kabelac	2a6abcb80a	tests: singlenode updates Add more 'realistic' simulation of dlm locking. Previous version was not capable to maintain multiple locks. Current version doesn't handle multiqueues for locks, so the ordering is different.	2013-09-12 10:40:39 +02:00
Jonathan Brassow	82228acfc9	Mirror/Thin: Disallow thinpools on mirror logical volumes The same corner cases that exist for snapshots on mirrors exist for any logical volume layered on top of mirror. (One example is when a mirror image fails and a non-repair LVM command is the first to detect it via label reading. In this case, the LVM command will hang and prevent the necessary LVM repair command from running.) When a better alternative exists, it makes no sense to allow a new target to stack on mirrors as a new feature. Since, RAID is now capable of running EX in a cluster and thin is not active-active aware, it makes sense to pair these two rather than mirror+thinpool. As further background, here are some additional comments that I made when addressing a bug related to mirror+thinpool: (https://bugzilla.redhat.com/show_bug.cgi?id=919604#c9) I am going to disallow thin* on top of mirror logical volumes. Users will have to use the "raid1" segment type if they want this. This bug has come down to a choice between: 1) Disallowing thin-LVs from being used as PVs. 2) Disallowing thinpools on top of mirrors. The problem is that the code in dev_manager.c:device_is_usable() is unable to tell whether there is a mirror device lower in the stack from the device being checked. Pretty much anything layered on top of a mirror will suffer from this problem. (Snapshots are a good example of this; and option #1 above has been chosen to deal with them. This can also be seen in dev_manager.c:device_is_usable().) When a mirror failure occurs, the kernel blocks all I/O to it. If there is an LVM command that comes along to do the repair (or a different operation that requires label reading), it would normally avoid the mirror when it sees that it is blocked. However, if there is a snapshot or a thin-LV that is on a mirror, the above code will not detect the mirror underneath and will issue label reading I/O. This causes the command to hang. 
Choosing #1 would mean that thin-LVs could never be used as PVs - even if they are stacked on something other than mirrors. Choosing #2 means that thinpools can never be placed on mirrors. This is probably better than we think, since it is preferred that people use the "raid1" segment type in the first place. However, RAID* cannot currently be used in a cluster volume group - even in EX-only mode. Thus, a complete solution for option #2 must include the ability to activate RAID logical volumes (and perform RAID operations) in a cluster volume group. I've already begun working on this.	2013-09-11 15:58:44 -05:00
Peter Rajnoha	1f4bc637b4	WHATS_NEW: one more for commit `8d1d835`	2013-09-11 13:16:36 +02:00
Peter Rajnoha	fe227d14a5	WHATS_NEW: for commit `8d1d8350` and `72a9d4f`	2013-09-11 13:01:01 +02:00
Jonathan Brassow	2691f1d764	RAID: Make RAID single-machine-exclusive capable in a cluster Creation, deletion, [de]activation, repair, conversion, scrubbing and changing operations are all now available for RAID LVs in a cluster - provided that they are activated exclusively. The code has been changed to ensure that no LV or sub-LV activation is attempted cluster-wide. This includes the often overlooked operations of activating metadata areas for the brief time it takes to clear them. Additionally, some 'resume_lv' operations were replaced with 'activate_lv_excl_local' when sub-LVs were promoted to top-level LVs for removal, clearing or extraction. This was necessary because it forces the appropriate renaming actions to occur; these happen via resume in the single-machine case, but won't happen in a cluster due to the necessity of acquiring a lock first. The raid tests have been updated to allow testing in a cluster. For the most part, this meant creating devices with '-aey' if they were to be converted to RAID. (RAID requires the converting LV to be EX because it is a condition of activation for the RAID LV in a cluster.)	2013-09-10 16:33:22 -05:00
Zdenek Kabelac	f5832d8c49	deactivate: drop readahead calc in deactivation Skip readahead when device will be deactivated.	2013-09-07 09:13:20 +02:00
Zdenek Kabelac	0670bfeb59	thin: validation catch multiseg thin pool/volumes Multisegment thin pools and volumes are not supported. Catch such error code path early.	2013-09-07 03:32:07 +02:00
Zdenek Kabelac	655296609e	thin: fix monitoring of thin pool volume Properly skip unmonitoring of thin pool volume in deactivation code path. Code makes sure if there is just any thin pool user it stays monitored with all its resources.	2013-09-07 03:31:04 +02:00
Zdenek Kabelac	4c001a7854	thin: fix resize of stacked thin pool volume When the pool is created from non-linear target the more complex rules have to be used and stacking needs to properly decode args for _tdata LV. Also proper allocation policies are being used according to those set in lvm2 metadata for data and metadata LVs. Also properly check for active pool and extra code to active it temporarily. With this fix it's now possible to use: lvcreate -L20 -m2 -n pool vg --alloc anywhere lvcreate -L10 -m2 -n poolm vg --alloc anywhere lvconvert --thinpool vg/pool --poolmetadata vg/poolm lvresize -L+10 vg/pool	2013-09-07 03:24:48 +02:00
Alasdair G Kergon	96880102a3	logging: Write Completed message before resetting.	2013-09-06 01:47:41 +01:00
Jonathan Brassow	cc66dedc0e	pvmove: Skip pvmove of RAID, thin, snapshot, origin, and mirror LVs in cluster pvmove of the above types should only have been enabled in single machine mode.	2013-09-03 13:17:01 -05:00
Peter Rajnoha	3b51f298bb	reinstate: commit `82d83a01ce` It now works as supposed. The source of the problem is fixed by previous commit d2d6a9da52e04f28e1916bcea3f9fda356b6df29.	2013-09-03 16:49:21 +02:00
Peter Rajnoha	008c33a21b	tools: add -b/--background for pvscan --cache -aay Udev daemon has recently introduced a limit on the number of udev processes (there was no limit before). This causes a problem when calling pvscan --cache -aay in lvmetad udev rules which is supposed to activate the volumes. This activation is itself synced with udev and so it waits for the activation to complete before the pvscan finishes. The event processing can't continue until this pvscan call is finished. But if we're at the limit with the udev process count, we can't instantiate any more udev processes, all such events are queued and so we can't process the lvm activation event for which the pvscan is waiting. Then we're in a deadlock since the udev process with the pvscan --cache -aay call waits for the lvm activation udev processing to complete, but that will never happen as there's this limit hit with the number of udev processes. The process with pvscan --cache -aay actually times out eventually (3min or 30sec, depends on the version of udev). This patch makes it possible to run the pvscan --cache -aay in the background so the udev processing can continue and hence we can avoid the deadlock mentioned above.	2013-09-03 16:49:21 +02:00
Peter Rajnoha	44c1a02c18	revert: commit `82d83a01ce` The commit `82d83a01ce` "autoactivation: refresh existing VG before autoactivation" causes problems (dangling udev_sync cookies, slow processing of the pvscan --cache --major --minor call from udev rules) when the autoactivation handler is run in parallel on several PVs that belong to the same VG. Revert this patch until the exact source of the problem is found and then properly fixed and handled.	2013-09-02 13:53:27 +02:00
Alasdair G Kergon	78647da1c6	toolcontext: Only reopen stdin if readable. Don't fail when running lvm commands under versions of nohup that set up stdin as O_WRONLY!	2013-08-28 23:55:14 +01:00
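The check behind 78647da1c6 can be sketched as follows. `fd_is_readable` is a hypothetical name chosen for this illustration, and the actual toolcontext code may be structured differently; the sketch only shows how `fcntl(F_GETFL)` reveals a descriptor opened `O_WRONLY`.

```c
#include <assert.h>
#include <fcntl.h>
#include <unistd.h>

/*
 * Hypothetical helper (not lvm2 code): return 1 if fd is open with
 * read access, 0 if it is write-only, closed or otherwise invalid.
 */
static int fd_is_readable(int fd)
{
	int flags;

	assert(fd >= 0);

	flags = fcntl(fd, F_GETFL);
	if (flags < 0)
		return 0;       /* descriptor is not open */

	/* O_ACCMODE masks the access mode: O_RDONLY, O_WRONLY or O_RDWR */
	return (flags & O_ACCMODE) != O_WRONLY;
}
```

A caller would test `fd_is_readable(STDIN_FILENO)` before attempting to reopen stdin for reading.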
Alasdair G Kergon	c0f987949b	activation: Fix segfault with inactive pvmove LV. Set flag to avoid recursion back through an inactive pvmove LV when populating deptree.	2013-08-28 22:56:23 +01:00
Peter Rajnoha	0acd7173d1	systemd: lvm2-activation-generator: remove default dir if args not specified and require all args to be given Remove default "/tmp" as destination directory if no args specified for lvm2-activation-generator. Require all the args to be specified directly for proper functionality.	2013-08-28 16:06:51 +02:00
Jonathan Brassow	2ef48b91ed	pvmove: Allow moving snapshot/origin. Disallow converting and merging LVs The patch allows the user to also pvmove snapshots and origin logical volumes. This means pvmove should be able to move all segment types. I have, however, disallowed moving converting or merging logical volumes.	2013-08-26 16:36:30 -05:00
Jonathan Brassow	caa77b33f2	pvmove: Fix inability to specify LV name when moving RAID, mirror, or thin LV Top-level LVs (like RAID, mirror or thin) are ignored when determining which portions of an LV to pvmove. If the user specified the name of an LV to move and it was one of the above types, it would be skipped. The code would never move on to check whether its sub-LVs needed moving because their names did not match what the user specified. The solution is to check whether a sub-LV is part of the LV whose name was specified by the user - not just if there was a name match.	2013-08-26 14:12:31 -05:00
Peter Rajnoha	d34ab5e0d3	WHATS_NEW: for `4d3b5724e0`	2013-08-26 15:52:15 +02:00
Zdenek Kabelac	6b416f837f	thin: support lvchange for data and metadata Support lvchange operation on stacked thin pool data and metadata volumes.	2013-08-26 14:55:22 +02:00
Jonathan Brassow	c59167ec13	pvmove: Add support for RAID, mirror, and thin This patch allows pvmove to operate on RAID, mirror and thin LVs. The key component is the ability to avoid moving a RAID or mirror sub-LV onto a PV that already has another RAID sub-LV on it. (e.g. Avoid placing both images of a RAID1 LV on the same PV.) Top-level LVs are processed to determine which PVs to avoid for the sake of redundancy, while bottom-level LVs are processed to determine which segments/extents to move. This approach does have some drawbacks. By eliminating whole PVs from the allocation list, we might miss the opportunity to perform pvmove in some scenarios. For example, if we have 3 devices and a linear uses half of the first, a RAID1 uses half of the first and half of the second, and a linear uses half of the third (FIGURE 1); we should be able to pvmove the first device (FIGURE 2). FIGURE 1: [ linear ] [ -RAID- ] [ linear ] [ -RAID- ] [ ] [ ] FIGURE 2: [ moved ] [ -RAID- ] [ linear ] [ moved ] [ linear ] [ -RAID- ] However, the approach we are using would eliminate the second device from consideration and would leave us with too little space for allocation. In these situations, the user does have the ability to specify LVs and move them one at a time.	2013-08-23 08:57:16 -05:00
Peter Rajnoha	99fe3b88d2	systemd: lvm2-activation-generator: report only error otherwise be silent Do not print success status for lvm2-activation-generator: "LVM: Activation generator successfully completed." "LVM: Logical Volume autoactivation enabled." (if use_lvmetad=1) Though this information is quite useful during boot, it may be confusing for users if it happens anytime later and it actually happens if systemd reloads. This is usually on package update to update the systemd state and load any new units that are newly installed in the system. The systemd reload is global and so any existing generators are rerun at that moment too.	2013-08-22 08:27:51 +02:00
Peter Rajnoha	c8daa15270	filter-mpath: remove superfluous error message about mpath major not equal to dm major This is a regression caused by commit `3bd9048854`. The error message added with that commit "mpath major %d is not dm major %d" is superfluous. When scanning for mpath components, we're looking for a parent device. But this parent device is not necessarily an mpath device (so the dm device) if it exists - it can be any other device layered on top (e.g. an MD RAID device).	2013-08-21 14:07:01 +02:00
Jonathan Brassow	f0be9ac904	cmirrord: Prevent secondary checkpoints from corrupting bitmaps The bug addressed by this patch manifested itself during testing by showing a mirror that never became 'in-sync' after creation. The bug is isolated to distributions that do not have support for openAIS checkpointing (i.e. > RHEL6, > F16). When a node joins a group that is managing a mirror log, the other machines in the group send it a checkpoint representing the current state of the bitmap. More than one machine can send a checkpoint, but only the initial one should be imported. Once the bitmap state has been imported from the initial checkpoint, operations (such as resync, mark, and clear operations) can begin. When subsequent checkpoints are allowed to be imported, it has the effect of erasing all the log operations between the initial checkpoint and the ones that follow. When cmirrord was updated to handle the absence of openAIS checkpointing (commit `62e38da133`), the new import_checkpoint() function failed to honor the 'no_read' parameter. This parameter was designed to avoid reading all but the initial checkpoint. Honoring this parameter has solved the issue of corrupting bitmap data with secondary checkpoints.	2013-08-20 13:21:09 -05:00
Peter Rajnoha	cac49725c9	udev: fix lvmetad rules to not ignore loop device configuration If loop device is first configured on systems where /dev/loop-control is used to dynamically create the loop device itself, there's an ADD+CHANGE event generated. But next time the existing /dev/loop[0-9]* is reused, there's only a CHANGE event since the device representing it is already present in kernel (so no ADD event in this case). We can't ignore this CHANGE event for loop devices! This is a regression caused by `756bcabbfe`. We already had a similar problem with MD devices which was fixed by `2ac217d408` (but that one was only an intra-release fix).	2013-08-16 15:45:00 +02:00
Michael Stapelberg	8cbbe851a8	systemd: use LVM_PATH instead of hardcoded value in activation generator	2013-08-15 09:59:19 +02:00
Peter Rajnoha	82d83a01ce	autoactivation: refresh existing VG before autoactivation When autoactivating a VG, there could be an existing VG with exactly the same PV UUIDs. The PVs could have reappeared after a previous loss/disconnect (for example disconnecting and reconnecting iscsi). Since there's no "autodeactivation" yet, the mappings for the LVs from the VG were left in the system even if the device was disconnected. These mappings also hold the major:minor of the underlying device. So if the device reappears, it is assigned a different major:minor pair (...and kernel name). We need to cope with this during autoactivation so any existing mappings are corrected for any changes. The VG refresh does that (the vgchange --refresh functionality) - call this before VG autoactivation. (If the VG does not exist yet, the VG refresh is NOP)	2013-08-14 14:04:58 +02:00
Peter Rajnoha	fcbb34bdcc	WHATS_NEW: for `0da72743ca`	2013-08-14 10:18:02 +02:00
Alasdair G Kergon	80bcdb93ff	filters: check for mpath before opening devs Split out the partitioned device filter that needs to open the device and move the multipath filter in front of it. When a device is multipathed, sending I/O to the underlying paths may cause problems, the most obvious being I/O errors visible to lvm if a path is down. Revert the incorrect <backtrace> messages added when a device doesn't pass a filter. Log each filter initialisation to show sequence. Avoid duplicate 'Using $device' debug messages.	2013-08-13 23:26:58 +01:00
Alasdair G Kergon	1a1d3a10ff	vgchange: require confirmation with -c and no VGs Too many people have been running 'vgchange -cy' by mistake so add a confirmation prompt. Use --yes to bypass this.	2013-08-13 18:20:11 +01:00
Peter Rajnoha	fd7cac15bc	WHATS_NEW: be more precise	2013-08-13 18:25:54 +02:00
Peter Rajnoha	e166c00ac6	WHATS_NEW: one more for a85439	2013-08-13 18:16:05 +02:00
Peter Rajnoha	268b370e24	blkdeactivate: add support for bind mounts Recent version of util-linux/umount (v2.23+) provides umount --all-targets that can unmount all the mount targets of the same device (the bind mounts). Use this if available when calling the umount blkdeactivate. Otherwise, for older versions of util-linux, use findmnt (that is also a part of the util-linux) to iterate over all mount targets of the same device - this is the manual way.	2013-08-13 17:51:40 +02:00
Peter Rajnoha	a854398764	blkdeactivate: change the way blkdeactivate reports status The blkdeactivate now suppresses error messages from external tools that are called. Instead, only a summary message "done" or "skipped" is issued by blkdeactivate as any error in calling the external tool (e.g. unmounting or deactivating a device) causes the device to be skipped and the blkdeactivate continues with the next device in the tree. Add new -e/--errors switch to display any error messages from external tools. Also, suppress any output given by the external tools and add new -v/--verbose switch to display it including the verbose output of the tools called (this will enable error reporting as well). Also add blkdeactivate -vv for even more debug (the script's debug).	2013-08-13 17:51:23 +02:00
Alasdair G Kergon	32148369d1	post-release	2013-08-13 11:54:48 +01:00
Alasdair G Kergon	297907899c	release 2.02.100 84 files changed, 1540 insertions(+), 442 deletions(-) Mostly bug fixes this time. Also note: md raid replaces dm mirroring as the default implementation. Can call out to thin_repair to fix thin metadata. Improved clvmd error detection/debugging information.	2013-08-13 11:29:21 +01:00
Jonathan Brassow	abc89422af	Mirror: Fix inability to remove VG's cluster flag if it contains a mirror According to bug 995193, if a volume group 1) contains a mirror 2) is clustered 3) 'locking_type' = 0 is used then it is not possible to remove the 'c'luster flag from the VG. This is due to the way _lv_is_active behaves. We shouldn't allow the cluster flag to be flipped unless the mirrors in the cluster are not active. This is because different kernel modules are used depending on whether a mirror is cluster or not. When we attempt to see if the mirror is active, we first check locally. If it is not, then we attempt to check for remotely active instances if the VG is clustered. Since the no_lock locking type is LCK_CLUSTERED, but does not implement 'query_resource', remote_lock_held will always return an error in this case. An error from remote_lock_held is treated as though the lock _is_ held (i.e. the LV is active remotely). This blocks the cluster flag from changing. The solution is to implement 'query_resource' for the no_lock type. It will report a message and return 1. This will allow _lv_is_active to function properly. The LV would be considered not active remotely and the VG can change its flag.	2013-08-12 13:56:47 -05:00
Alasdair G Kergon	28760275e6	logging: tidy log_sys_error when string empty	2013-08-12 18:40:41 +01:00
Jonathan Brassow	cba228f856	WHATSNEW: typo	2013-08-09 17:17:53 -05:00
Jonathan Brassow	8615234c0f	RAID: Fix bug making lvchange unable to change recovery rate for RAID 1) Since the min\|maxrecoveryrate args are size_kb_ARGs and they are recorded (and sent to the kernel) in terms of kB/sec/disk, we must back out the factor multiple done by size_kb_arg. This is already performed by 'lvcreate' for these arguments. 2) Allow all RAID types, not just RAID1, to change these values. 3) Add min\|maxrecoveryrate_ARG to the list of 'update_partial_unsafe' commands so that lvchange will not complain about needing at least one of a certain set of arguments and failing. 4) Add tests that check that these values can be set via lvchange and lvcreate and that 'lvs' reports back the proper results.	2013-08-09 17:09:47 -05:00
Zdenek Kabelac	e583ff3d2c	thin: thin pool can't be external origin Avoid trying to convert thin-pool to external origin.	2013-08-09 23:04:30 +02:00
Peter Rajnoha	2f61478436	workaround: gcc v4.8 on 32 bit param. passing bug when -O2 optimization used gcc -O2 v4.8 on 32 bit architecture is causing a bug in parameter passing. It does not happen with -O1 nor -O0. The problematic part of the code was strlen use in config.c in the config_def_check fn and the call for _config_def_check_tree in it: <snip> rplen = strlen(rp); if (!_config_def_check_tree(handle, vp, vp + strlen(vp), rp, rp + rplen, CFG_PATH_MAX_LEN - rplen, cn, cmd->cft_def_hash)) ... </snip> If compiled with -O0 (correct): Breakpoint 1, config_def_check (cmd=0x819b050, handle=0x81a04f8) at config/config.c:775 (gdb) p vp $1 = 0x8189ee0 <_cfg_path> "config" (gdb) p strlen(vp) $2 = 6 (gdb) _config_def_check_tree (handle=0x81a04f8, vp=0x8189ee0 <_cfg_path> "config", pvp=0x8189ee6 <_cfg_path+6> "", rp=0xbfffe1e8 "config", prp=0xbfffe1ee "", buf_size=58, root=0x81a2568, ht=0x81a6548) at config/config.c:680 (gdb) p vp $4 = 0x8189ee0 <_cfg_path> "config" (gdb) p pvp $5 = 0x8189ee6 <_cfg_path+6> "" If compiled with -O2 (incorrect): Breakpoint 1, config_def_check (cmd=cmd@entry=0x8183050, handle=0x81884f8) at config/config.c:775 (gdb) p vp $1 = 0x8172fc0 <_cfg_path> "config" (gdb) p strlen(vp) $2 = 6 (gdb) p vp + strlen(vp) $3 = 0x8172fc6 <_cfg_path+6> "" (gdb) _config_def_check_tree (handle=handle@entry=0x81884f8, pvp=0x8172fc7 <_cfg_path+7> "host_list", rp=rp@entry=0xbffff190 "config", prp=prp@entry=0xbffff196 "", buf_size=buf_size@entry=58, ht=0x818e548, root=0x818a568, vp=0x8172fc0 <_cfg_path> "config") at config/config.c:674 (gdb) p pvp $4 = 0x8172fc7 <_cfg_path+7> "host_list" The difference is in passing the "pvp" arg for _config_def_check_tree. While in the correct case, the value of _cfg_path+6 is passed (the result of vp + strlen(vp) - see the snippet of the code above), in the incorrect case, this value is increased by 1 to _cfg_path+7, hence totally malforming the string that is being processed. 
This ends up with an incorrect validation check, and incorrect warning messages are issued like: "Configuration setting "config/checks" has invalid type. Found integer, expected section." To work around this issue, remove the "static" qualifier from the "static char _cfg_path[CFG_PATH_MAX_LEN]". This causes the optimizer to be less aggressive (also shuffling the arg list for _config_def_check_tree call helps).	2013-08-09 13:24:50 +02:00
Peter Rajnoha	8d3347f70b	WHATS_NEW: entry for `19baf84290`	2013-08-08 10:04:53 +02:00
Jonathan Brassow	68c2d352ec	WHATS_NEW: update WHATS_NEW for previous commit	2013-08-07 17:51:21 -05:00
Jonathan Brassow	b15278c3dc	Mirror/RAID1: When up\|down-converting default to segtype of current LV If there is no RAID support in the kernel but the default mirror segtype is "raid1", converting legacy mirrors can be problematic. For example, changing the log type or converting a mirror to a linear LV does not require the RAID modules to be present. However, because lp->segtype is set to be RAID1 by the configuration file, the command fails. We should only be setting lp->segtype when converting mirrors if it is going to change (e.g. to linear or between mirror types).	2013-08-07 16:01:45 -05:00
Jonathan Brassow	7e1083c985	RAID: Make "raid1" the default mirror segment type	2013-08-06 14:13:55 -05:00
Zdenek Kabelac	003f08c164	clogd: fix descriptor leak when daemonizing	2013-08-06 16:21:51 +02:00
Zdenek Kabelac	7b1315411f	clvmd: fix descriptor leak on restart Do not leave descriptor used for dup2() opened.	2013-08-06 16:20:36 +02:00
Zdenek Kabelac	f6dd5a294b	exec: pipe open Function replaces popen() system and avoids shell execution and argument parsing (no surprises).	2013-08-06 16:18:43 +02:00
Peter Rajnoha	61e7dc833c	WHATS_NEW: previous commit	2013-08-06 14:03:43 +02:00
Peter Rajnoha	e195b5227e	thin: apply VG profile if creating a new thin pool When creating a new thin pool and there's no profile requested via "lvcreate --profile ...", inherit any VG profile if it's attached. Currently this applies to these settings: allocation/thin_pool_chunk_size allocation/thin_pool_discards allocation/thin_pool_zero	2013-08-06 11:42:40 +02:00
Peter Rajnoha	1cdd563b6c	WHATS_NEW: move line to WHATS_NEW_DM	2013-08-06 11:42:01 +02:00
Jonathan Brassow	5ca54c4f0b	dmeventd: Fix memory leak When creating a timeout thread for snapshots, the thread is not tracked and thus never joined. This means that the exit status of the timeout thread is held indefinitely. Saves a bit of memory to set PTHREAD_CREATE_DETACHED when creating this thread. I've also added pthread_attr_init\|destroy to setup the creation pthread_attr_t. Reported-by: NeilBrown <neilb@suse.de> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2013-07-31 15:23:13 -05:00
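The pattern named in 5ca54c4f0b (initialise a `pthread_attr_t`, set `PTHREAD_CREATE_DETACHED`, create the thread, destroy the attribute) can be sketched as below. `start_detached` and `notify_and_exit` are hypothetical names for this illustration and are not taken from dmeventd.

```c
#include <assert.h>
#include <pthread.h>
#include <semaphore.h>

/*
 * Hypothetical helper (not dmeventd code): start fn(arg) in a detached
 * thread. Returns 0 on success or a pthread error number. A detached
 * thread releases its resources on exit, so nobody has to join it.
 */
static int start_detached(void *(*fn)(void *), void *arg)
{
	pthread_attr_t attr;
	pthread_t tid;
	int r;

	assert(fn);

	if ((r = pthread_attr_init(&attr)))
		return r;

	r = pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_DETACHED);
	if (!r)
		r = pthread_create(&tid, &attr, fn, arg);

	/* the attribute object is no longer needed once the thread exists */
	pthread_attr_destroy(&attr);

	return r;
}

/* Example thread body: posts the semaphore passed in arg, then exits. */
static void *notify_and_exit(void *arg)
{
	sem_post((sem_t *) arg);
	return NULL;
}
```

The trade-off is that a detached thread's exit status can no longer be collected with `pthread_join()`, which is acceptable when, as in the commit, the status was never read anyway.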
Zdenek Kabelac	de0cba0e2d	thin: initial --repair support for pools Initial basic support for repair. It currently takes pool metadata spare volume, which is used for recovery. New spare is created if the volume is successfully repaired. After the operation the previous _tmeta volume is moved into _tmeta%d volume and if everything is ok, this volume could be removed. New _tmeta needs to be pvmoved to proper place and also converted to e.g. mirror if it should be mirrored. Later version will try to automate some steps here.	2013-07-31 15:32:36 +02:00
Zdenek Kabelac	22fc80982a	thin: add thin_repair and thin_dump options Add new configure lvm.conf options for binaries thin_repair and thin_dump. Those are part of device-mapper-persistent-data package and will be used for recovery of thin_pool.	2013-07-31 15:30:47 +02:00
Zdenek Kabelac	ea605d1ec7	thin: metadata resize needs 1.9 version Version 1.8 is not yet fully usable for metadata resize.	2013-07-31 15:29:27 +02:00
Alasdair G Kergon	b6bfddcd0a	alloc: fix lvextend when stripe number varies The PREFERRED allocation mechanism requires the number of areas in the previous LV segment to match the number in the new segment being allocated. If they do not match, the code may crash. E.g. https://bugzilla.redhat.com/989347 Introduce A_AREA_COUNT_MATCHES and when not set avoid referring to the previous segment with the contiguous and cling policies.	2013-07-29 19:35:45 +01:00
Peter Rajnoha	ecc9f74988	filters: fix segfault on incorrect global_filter When using a global_filter and if this filter is incorrectly specified, we ended up with a segfault: raw/~ $ pvs Invalid filter pattern "r\|/dev/sda". Segmentation fault (core dumped) In the example above a closing '\|' character is missing at the end of the regex. The segfault itself was caused by trying to destroy the same filter twice in _init_filters fn within the error path (the "bad" goto target): bad: if (f3) f3->destroy(f3); if (f4) f4->destroy(f4); Where f3 is the composite filter (sysfs + regex + type + md + mpath filter) and f4 is the persistent filter which encompasses this composite filter within persistent filter's 'real' field in 'struct pfilter'. So in the end, we need to destroy the persistent filter only as this will also destroy any 'real' filter attached to it.	2013-07-26 13:04:53 +02:00
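The ownership rule described in ecc9f74988 (a wrapping filter destroys the filter it wraps, so the error path must destroy only the outermost one) can be sketched as follows. The struct layout, the function names and the `destroyed` counter are simplified assumptions made for this illustration; they are not the lvm2 definitions.

```c
#include <assert.h>
#include <stdlib.h>

/* Simplified stand-in for a device filter (not the lvm2 struct). */
struct filter {
	void (*destroy)(struct filter *f);
	struct filter *real;    /* wrapped filter, NULL for a plain one */
};

static int destroyed;           /* counts destroy calls, for illustration */

static void plain_destroy(struct filter *f)
{
	destroyed++;
	free(f);
}

/* A wrapper owns its inner filter and destroys it as well. */
static void wrapper_destroy(struct filter *f)
{
	if (f->real)
		f->real->destroy(f->real);
	destroyed++;
	free(f);
}

static struct filter *filter_create(void (*destroy)(struct filter *),
				    struct filter *real)
{
	struct filter *f;

	assert(destroy);

	if (!(f = malloc(sizeof(*f))))
		return NULL;
	f->destroy = destroy;
	f->real = real;

	return f;
}

/* Error-path cleanup: once inner is wrapped, destroy only the wrapper. */
static void cleanup(struct filter *inner, struct filter *outer)
{
	if (outer)
		outer->destroy(outer);  /* also destroys inner */
	else if (inner)
		inner->destroy(inner);  /* wrapper was never created */
}
```

Calling both `inner->destroy(inner)` and `outer->destroy(outer)` unconditionally would free the inner filter twice, which is the kind of double destroy the commit message describes.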
Alasdair G Kergon	06dce7d539	post-release	2013-07-25 00:38:53 +01:00
Alasdair G Kergon	76e617b158	release 2.02.99 363 files changed, 19863 insertions(+), 9055 deletions(-) This is a very large release - so expect bugs! Please treat this release like a release candidate. Changes to the external interfaces since 2.02.98 are not yet frozen. Updated releases will follow quickly (days not weeks) as any problems are handled.	2013-07-24 23:59:03 +01:00
Alasdair G Kergon	d13e87b9ef	cleanup: comments and a message	2013-07-24 22:10:37 +01:00
Zdenek Kabelac	5597dc3652	thin: not zeroing for non-zeroed thin pool snaps Do not zero initial 4KB of thin snapshot volume for thin pool with disabled zeroing.	2013-07-24 01:15:31 +02:00
Peter Rajnoha	31de670318	lvconvert: add more checks for lvconvert --type The --type mirror requires -m/--mirrors: lvconvert --type mirror vg/lvol0 --type mirror requires -m/--mirrors Run `lvconvert --help' for more information. The --type raid* is allowed (the checks already existed): lvconvert --type raid10 vg/lvol0 Converting the segment type for vg/lvol0 from linear to raid10 is not yet supported. The --type snapshot is a synonym to -s/--snapshot: lvconvert -s vg/lvol0 vg/lvol1 Logical volume lvol1 converted to snapshot. lvconvert --type snapshot vg/lvol0 vg/lvol1 Logical volume lvol1 converted to snapshot. All the other segment types are not supported, e.g.: lvconvert --type zero vg/lvol0 Conversion using --type zero is not supported. Run `lvconvert --help' for more information.	2013-07-23 17:13:54 +02:00
Peter Rajnoha	9b1834f075	man: pvs -o ba_start,ba_size -> pv_ba_start,pv_ba_size	2013-07-23 14:45:30 +02:00
Peter Rajnoha	ea333a894e	systemd: lvm2-activation-generator: use LOG_DEBUG/ERR severity for kmsg	2013-07-22 14:04:12 +02:00
Alasdair G Kergon	ccc29f17b6	cmdline: support ARG_GROUPABLE in merge_synonym	2013-07-19 20:37:43 +01:00
Alasdair G Kergon	90a09559ed	commandline: add prefix aliases for raid options Accept --raidwritemostly as well as --writemostly etc.	2013-07-19 19:24:54 +01:00
Jonathan Brassow	4eea660191	RAID: Fix segfault when reporting raid_syncaction field on older kernel The status printed for dm-raid targets on older kernels does not include the syncaction field. This is handled by dev_manager_raid_status() just fine by populating the raid status structure with NULL for that field. However, lv_raid_sync_action() does not properly handle that field being NULL. So, check for it and return 0 if it is NULL.	2013-07-19 10:01:48 -05:00
Alasdair G Kergon	da79fe4c1d	reporting: tidy recent new fields Add underscores and prefixes to recently-added fields. (Might add more alias functionality in future.)	2013-07-19 01:30:02 +01:00
Alasdair G Kergon	357df34133	display: fix units for sizes <1k	2013-07-18 17:55:58 +01:00
Zdenek Kabelac	460d0254eb	thin: add pool metadata spare lv support Add support for pool's metadata spare volume.	2013-07-18 18:22:43 +02:00
Zdenek Kabelac	08df7ba844	thin: improve pool creation activation order Pool creation involves clearing of the metadata device, which triggers a udev watch rule that we cannot udev-synchronize with in the current code. This metadata device needs to be activated locally, so in cluster mode deactivation and reactivation is always needed. However, for non-clustered mode we may reload the table via the suspend/resume path, which avoids the collision with the udev watch rule that was occasionally triggering a retry deactivation loop. The code has also been split into 2 separate code paths for thin pools and thin volumes, which improves readability as well. Deactivation has been moved out of extend_pool() and the decision is now made in _lv_create_an_lv(), which knows the change mode.	2013-07-18 18:22:43 +02:00
Zdenek Kabelac	4e724f5f52	thin: for thin volumes properly list modules thin volume needs thin-pool and thin kernel modules so print them both for lvs -o+modules	2013-07-18 18:22:43 +02:00
Zdenek Kabelac	7afa9cebcb	thin: fix error path in creation path Remove some calls to revert_new_lv when no LV has been created/committed so far. When the pool update failed - then only revert is needed.	2013-07-18 18:22:43 +02:00
Zdenek Kabelac	7b4b97b731	snapshot: local activation for clear COW device To clear snapshot cow device in cluster enforce local activation here.	2013-07-18 18:22:43 +02:00
Peter Rajnoha	8606bf316a	systemd: generator: add lvm2-activation-net.service The new lvm2-activation-net.service activates LVM volumes after network-attached devices are set up (iSCSI and FCoE) if lvmetad is disabled and hence the autoactivation is not used.	2013-07-17 16:54:52 +02:00
Zdenek Kabelac	57be501aa3	dev_manager: lower memory usage The dlid created for the test is not needed afterwards, so lower the memory usage, since this call is used repeatedly when building a large tree. TODO: create a function that uses a given buffer on the stack, which is much cheaper.	2013-07-15 15:59:20 +02:00
Zdenek Kabelac	0443c42e3b	thin: add sub volumes as whole volumes Do not use origin_only when add log_lv and metadata as a subvolume. The stacked volume needs to access whole volume in this case.	2013-07-15 15:58:07 +02:00
Zdenek Kabelac	97d36d5750	thin: check and use layered origin lv Code needs to check if the layer origin device is suspended, It's valid to create thinvolume snapshot of thinvolume which is also used as an old-style snapshot. In this case we need to check -real is suspended. When adding origin_only - add only layer thin volume. (in case it's also old-snapshot add only -real device)	2013-07-15 15:51:39 +02:00
Zdenek Kabelac	925701d9f3	thin: write back when command successfully finished Remove the backup() call from update_pool_lv(), as it was duplicated there, and properly order the backup() call after lvresize, so there is just one such call.	2013-07-15 15:48:32 +02:00
Zdenek Kabelac	42881c8877	thin: send messages to active pool If the thin pool is known to be active, messages can be passed to the pool even when the created thin volume is not going to be activated. So we do not need to stack large list of message and validate and catch creation errors earlier in this case. Replace the test for valid activation combination with simpler list of deactivation combinations.	2013-07-15 15:47:25 +02:00
Peter Rajnoha	9a44ba94a5	WHATS_NEW: support for LV activation skip flag	2013-07-12 21:37:58 +02:00
Peter Rajnoha	953a438e93	dumpconfig: add --type profilable The --type profilable shows all config settings that are customizable by profiles: raw/~ $ lvm dumpconfig --type profilable allocation { thin_pool_zero=1 thin_pool_discards="passdown" thin_pool_chunk_size=64 } activation { thin_pool_autoextend_threshold=100 thin_pool_autoextend_percent=20 }	2013-07-09 10:00:47 +02:00
Peter Rajnoha	5ed7d0cf1d	dumpconfig: add --mergedconfig option Normally, the lvm dumpconfig processes only the configuration tree that is at the top of the cascade. Considering the cascade is: CONFIG_STRING -> CONFIG_PROFILE -> CONFIG_MERGED_FILES/CONFIG_FILE ...then: (dumpconfig of lvm.conf only) raw/~ $ lvm dumpconfig allocation allocation { maximise_cling=1 mirror_logs_require_separate_pvs=0 thin_pool_metadata_require_separate_pvs=0 thin_pool_chunk_size=64 } (dumpconfig of selected profile configuration only) raw/~ $ lvm dumpconfig --profile test allocation allocation { thin_pool_chunk_size=8 thin_pool_discards="passdown" thin_pool_zero=1 } (dumpconfig of given --config configuration only) raw/~ $ lvm dumpconfig --config 'allocation{thin_pool_chunk_size=16}' allocation allocation { thin_pool_chunk_size=16 } The --mergedconfig option causes the configuration cascade to be merged before processing it with dumpconfig: (dumpconfig of merged selected profile and lvm.conf) raw/~ $ lvm dumpconfig --profile test allocation --mergedconfig allocation { maximise_cling=1 thin_pool_zero=1 thin_pool_discards="passdown" mirror_logs_require_separate_pvs=0 thin_pool_metadata_require_separate_pvs=0 thin_pool_chunk_size=8 } (dumpconfig merged given --config and selected profile and lvm.conf) raw/~ $ lvm dumpconfig --profile test --config 'allocation{thin_pool_chunk_size=16}' allocation --mergedconfig allocation { maximise_cling=1 thin_pool_zero=1 thin_pool_discards="passdown" mirror_logs_require_separate_pvs=0 thin_pool_metadata_require_separate_pvs=0 thin_pool_chunk_size=16 } Hence with the --mergedconfig, we are able to see the configuration that is actually used when processing any LVM command while using any combination of --config/--profile options together with lvm.conf file.	2013-07-08 16:05:56 +02:00
Zdenek Kabelac	985251c8f3	locking: unlock memory on error path Unlock memory and unblock signals on error path.	2013-07-08 14:02:49 +02:00
Alasdair G Kergon	7c6526aae2	lvresize: separate validation from action Start separating the validation from the action in the basic lvresize code moved to the library. Remove incorrect use of command line error codes from lvresize library functions. Move errors.h to tools directory to reinforce this, exporting public versions of the error codes in lvm2cmd.h for dmeventd plugins to use.	2013-07-06 03:28:21 +01:00
Peter Rajnoha	0a5b68e87b	man: document profile config and related options Document following items: configuration cascade (man lvm.conf) --profile ProfileName (man lvm) --detachprofile (man vgchange/lvchange) -o vg_profile/lv_profile (man vgs/lvs) Also document --config a bit so we can see where it fits in the configuration cascade - will be documented more in following commit...	2013-07-03 16:49:26 +02:00
Zdenek Kabelac	6f335ffa35	sigint: improve logic for sigint reaction Fix and improve handling of sigint. Always check for signal presence before calling the command, so it will not call the command when break was hit. If the command has finished successfully, there is no problem in marking the command ok and not reporting the interrupt at all. Fix a couple of related stack; reports and assignments.	2013-07-03 14:46:42 +02:00
Mike Snitzer	fe09d84668	lvconvert: Rename _swap_lv to _swap_lv_identifiers and move to allow an additional user	2013-07-02 17:02:25 -04:00
Mike Snitzer	f9e0adcce5	snapshot: Rename snapshot segment returning methods from find_*_cow to find_*_snapshot find_cow -> find_snapshot, find_merging_cow -> find_merging_snapshot. Will allow thin snapshot code to reuse these methods without confusion.	2013-07-02 16:26:03 -04:00
Tony Asleson	79106f7394	liblvm/python-lvm New additions Signed-off-by: Tony Asleson <tasleson@redhat.com>	2013-07-02 14:24:34 -05:00
Peter Rajnoha	fd94dd3b7e	WHATS_NEW: config/profile_dir	2013-07-02 15:57:46 +02:00
Peter Rajnoha	7ce0c6777b	WHATS_NEW: configuration profile support	2013-07-02 15:51:21 +02:00
Zdenek Kabelac	e30028004b	archiver: do not archive vg more than once Do not keep multiple archives for the executed command. Reuse the ALLOCATABLE_PV from pv status for ARCHIVED_VG vg status. Mark VG with the bit with the first archiving.	2013-07-01 23:09:26 +02:00
Jonathan Brassow	8215846aa5	Clean-up: WHATS_NEW Choosing between two entries and forgot to remove one.	2013-06-19 19:55:34 -05:00
Jonathan Brassow	a6d13308ec	RAID/MIRROR: Honor mirror_segtype_default when upconverting linear LVs If the user would upconvert a linear LV to a mirror without specifying the segment type ("--type mirror" vs "--type raid1"), the "mirror" segment type would be chosen without consulting the 'mirror_segtype_default' setting in lvm.conf. This is now used as the basis for determining which should be used if left unspecified.	2013-06-19 17:50:10 -05:00
Zdenek Kabelac	155841c349	lvmetad: fix compare function Check for enough space in preallocated buffer. Fixes problem, when lvm code started to suddenly allocate too big memory chunks. TODO: lvmetad protocol should announce needed size ahead, so if metadata have 1MB we are not reallocating memory...	2013-06-18 22:12:51 +02:00
Zdenek Kabelac	2562968864	vgcfgrestore: fix crash on restore of wrong vgname When the vgname did not exist in the metadata, it crashed on a double free in format_instance destroy() - since the VG was created, used the FID and was released - which also released the FID, so further use was accessing bad memory. Fix it for this code path before release_vg() so the FID will exist when _vg_read_file_name() returns NULL.	2013-06-18 22:11:21 +02:00
Alasdair G Kergon	c2dc21d89f	text: miscellaneous comments & message tweaks	2013-06-15 01:28:54 +01:00
Peter Rajnoha	dba53681a5	man: refine lvm.conf and man page documentation for autoactivation feature	2013-06-14 10:02:56 +02:00
Zdenek Kabelac	fe22089edf	thin: vgsplit support for thins Support vgsplit for VGs with thin pools and thin volumes. In case the thin data and thin metadata volumes are moved to a new VG, move there also all related thin volumes and check that external origins are also present in this new VG.	2013-06-13 14:51:00 +02:00
Peter Rajnoha	966d4f36d7	filter-mpath: detect partitions of mpath components We use mpath filtering (enabled by devices/multipath_component_detection=1 lvm.conf setting) to avoid a situation in which we could end up with duplicate PVs found. We need to filter out the mpath components and use only the top-level multipath mapping instead for PV scans. However, if there are partitions on multipath components, we need to filter out these partitions. This patch fixes it so those partitions found on multipath components are filtered as well. For example, let's consider the following configuration: The sda and sdb are mpath components, sda1 and sdb1 the partitions on these components, mpath-test the mpath mapping and mpath-test1 the partition mapping - created automatically by kpartx right after mpath-test creation. The PV resides on top. (LVM PV) \| mpath-test1 \| mpath-test \| sda1 ---------- sdb1 \ \| \|/ sda sdb E.g. for sda1 and sdb1, the code will detect this and it skips the partition that belongs to the multipath component: <snippet from the log> #filters/filter-mpath.c:156 /dev/sda1: Device is a partition, using primary device /dev/sda for mpath component detection 130 #ioctl/libdm-iface.c:1724 dm status (253:2) OF[16384](*1) 131 #filters/filter-mpath.c:196 /dev/sda1: Skipping mpath component device </snippet from the log> Otherwise, we'd see the same PV label on sda1/sdb1 and mpath-test1 at the same time ending up with "Duplicate PV found...".	2013-06-12 13:13:38 +02:00
Zdenek Kabelac	87aca628d6	thin: lvresize supports pool metadata resize Add support for lvresize of thin pool metadata device. lvresize --poolmetadatasize +20 vgname/thinpool_lv or lvresize -L +20 vgname/thinpool_lv_tmeta Where the second one allows all the args for resize (striping...) and the first option resizes according to the last metadata lv segment.	2013-06-11 14:05:20 +02:00
Zdenek Kabelac	72c3ae253e	thin: add helper functions Add find_pool_lv() and pool_can_resize_metadata().	2013-06-11 14:03:30 +02:00
Zdenek Kabelac	55a3859632	thin: detect online metadata resize support	2013-06-11 14:03:28 +02:00
Zdenek Kabelac	01ef97fcbb	thin: report 'e' metadata type with higher priority Giving volume type information about being 'metadata' type of volume has higher priority than e.g. 'mirror' or 'thin' flag - for those types we have 'target attr' (7th. field).	2013-06-11 14:03:08 +02:00
Zdenek Kabelac	c0290489c3	thin: report o as volume type for external origin Reuse 'o' attr for lvs report also for external origin.	2013-06-11 14:02:41 +02:00
Zdenek Kabelac	7151ede767	thin: report t for thin pool and volume Do not mark internal device _tdata and _tmeta as having target type 't'. They have the target type on their own (i.e. mirror, raid).	2013-06-11 13:58:16 +02:00
Zdenek Kabelac	272f5ae208	snapshots: check for active state Fix testing if the snapshot could be resized and use lv_is_active() to get correct answer in cluster.	2013-06-11 13:57:18 +02:00
Zdenek Kabelac	f05c5a97c3	filters: dump filter returns error code Add int return value from dump() function. Report stack for error case. Update composable filter.	2013-06-03 08:42:25 +02:00
Zdenek Kabelac	5467a3b2b7	filters: update composable filter Last commit made dump filter only partially composable. Add remaining functionality and also support composable wipe, which is needed, when i.e. vgscan needs to remove cache. (in release fix)	2013-06-02 22:46:06 +02:00
Petr Rockai	1f73e992ef	lvmetad: no use of persistent filter with lvmetad	2013-06-02 00:49:55 +02:00
Petr Rockai	e7878da921	filters: toplevel filter not persistent Add a generic dump operation to filters and make the composite filter call through to its components. Previously, when global filter was set, the code would treat the toplevel composite filter's private area as if it belonged to a persistent filter, trying to write nonsense into a non-sensical file. Also deal with NULL cmd->filter gracefully.	2013-06-02 00:48:58 +02:00
Petr Rockai	05bf4b8cc3	vgimportclone: override global_filter in lvm.conf The global filter in system's lvm.conf may conflict with the custom filter we set up in vgimportclone (they can easily fail to intersect). Since we explicitly avoid talking to lvmetad in vgimportclone, it is safe and reasonable to do so.	2013-06-02 00:47:17 +02:00
Zdenek Kabelac	3ced1bf694	lvresize: check for max snapshot size As for lvcreate, lvresize also doesn't need to grow bigger than needed.	2013-05-30 17:35:23 +02:00
Zdenek Kabelac	bd3ece0128	lvcreate: reduce too large cow Detect maximum usable size of snapshot COW device, and do not waste more space for such LV than needed.	2013-05-30 17:35:14 +02:00
Zdenek Kabelac	eb7e206a73	snapshot: add cow_max_extents Add more precise calculation of the maximum usable snapshot size. Using only percentage fails for small size of snapshot and extents.	2013-05-30 17:30:15 +02:00
Zdenek Kabelac	59962d8d3e	snapshot: require 3 chunks for creation There is no point in creating a 2-chunk snapshot, since the snapshot is invalidated immediately with the first write as there is no free chunk for COW blocks (2 chunks are used by the snap header and the 1st. metadata chunk). Enhance error message about the lowest usable size.	2013-05-30 17:28:03 +02:00
Zdenek Kabelac	56779c32c5	snapshot: fix resize of 100% full cow When the COW area is using all the available space (100%) it can be still a valid snapshot which may need a resize. So support it.	2013-05-30 17:26:20 +02:00
Zdenek Kabelac	99f0483580	args: do not accept >=16EiB sizes Instead of seeing weird overflows inside the lvm code, giving false error messages, kill the user experiment in the beginning. Who needs to use more than 16EiB with lvm2 and 64bit anyway...	2013-05-30 17:23:51 +02:00
Zdenek Kabelac	2f1a571c97	fid: fix reset of PV fid Avoid hitting memory corruption (double free) in code path, where PV FID has been already destroyed and the released pointer was left in PV structure and could have been tried to be released from there 2nd. time with final context destruction.	2013-05-30 16:52:39 +02:00
Peter Rajnoha	be25f7ac83	WHATS_NEW: ea_start,ea_size -> ba_start,ba_size	2013-05-28 12:43:26 +02:00
Peter Rajnoha	732859d21f	refactor: rename embedding area -> bootloader area	2013-05-28 12:37:22 +02:00
Zdenek Kabelac	9966842810	snapshot: skip monitor for large cows If snapshot cow device is already big enough to cover whole origin, do not monitor it.	2013-05-27 10:35:43 +02:00
Zdenek Kabelac	77952151af	snapshot: add lv_is_cow_covering_origin Add function to check if size of cow is already big enough to cover whole origin.	2013-05-27 10:34:53 +02:00
Zdenek Kabelac	06e8ff29ff	snapshot: use dm_get_status_snapshot() Replace code with libdm call to dm_get_status_snapshot().	2013-05-27 10:32:02 +02:00
Zdenek Kabelac	2ada982e73	vgchange: check for mounted fs Check for mounted fs also for vgchange command, not just lvchange. NOTE: Code is using lv_info() just like lvs_in_vg_opened(). It should be probably converted into lv_is_active_locally().	2013-05-20 16:47:33 +02:00
Jonathan Brassow	06ac797f42	Clean-up: Replace 'lv_is_active' with more correct/specific variants There are places where 'lv_is_active' was being used where it was more correct to use 'lv_is_active_locally'. For example, when checking for the existence of a kernel instance before asking for its status. Most of the time these would work correctly. (RAID is only allowed on non-clustered VGs at the moment, which means that 'lv_is_active' and 'lv_is_active_locally' would give the same result.) However, it is more correct to use the proper variant and it helps with future scenarios where targets might be allowed exclusively (or clustered) in a cluster VG.	2013-05-16 10:36:56 -05:00
Peter Rajnoha	b3b551a93e	WHATS_NEW: bad day	2013-05-16 11:02:38 +02:00
Peter Rajnoha	cb0d817fb5	WHATS_NEW: for commit `4f6c2951d6`	2013-05-16 08:38:27 +02:00
Alasdair G Kergon	f12d88f840	activation: fix lv_is_active regressions Try to fix commit `bf2741376d`. lv_is_active is not the same as lv_info(cmd, org, 0, &info, 0, 0). Introduce and use lv_is_active_locally.	2013-05-15 02:13:31 +01:00
Alasdair G Kergon	2fbe1e6e00	rephrasing: miscellaneous changes Miscellaneous changes to messages, man pages, comments and WHATS_NEW.	2013-05-15 01:50:42 +01:00
Alasdair G Kergon	2e4a66a761	make: fix exported symbols regex for non-GNU sed Remove a couple of incorrect backslashes from expressions used to generate lists of exported symbols so it works with busybox sed. [John Spencer]	2013-05-14 19:29:26 +01:00
Alasdair G Kergon	c6cf2ed7fd	commands: accept --yes globally Accept --yes on all commands, even ones that don't today have prompts, so that test scripts that don't care about interactive prompts no longer need to deal with them. But continue to mention --yes only in the command prototypes that actually use it.	2013-05-14 18:45:37 +01:00
Mike Snitzer	8ad7865b42	Fix alignment of PV data area if detected alignment less than 1 MB This fixes a long standing regression since LVM2 2.02.74 (commit `4efb1d9c`, "Update heuristic used for default and detected data alignment.") The default PE alignment could be used (via MAX()) even if it was determined that the device's MD stripe width, or minimal_io_size or optimal_io_size were not factors of the default PE alignment (either 64K or the newer default of 1MB, etc). This bug would manifest if the default PE alignment was larger than the overriding hint that the device provided (e.g. default of 1MB vs optimal_io_size of 768K). Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2013-05-13 15:56:47 -04:00
Zdenek Kabelac	55fe07ad98	mm: fix leak in fail path If the dm_realloc would fail, the already allocated _maps_buffer memory would have been lost (overwritten with NULL). Fix this by using temporary line buffer. Also add a minor cleanup to set end of buffer to '\0', only when we really know the file size fits the preallocated buffer.	2013-05-13 13:13:20 +02:00
Peter Rajnoha	4407133113	toolcontext: check dm version lazily for udev_fallback setting Setting the cmd->default_settings.udev_fallback also requires DM driver version check. However, this caused useless mapper/control access with ioctl if not needed actually. For example if we're not using activation code, we don't need to know the udev_fallback as there's no node and symlink processing. For example, this premature mapper/control access caused problems when using lvm2app even when no activation happens - there are situations in which we don't need to use mapper/control, but still need some of the lvm2app functionality. This is also the case for lvm2-activation systemd generator which just needs to look at the lvm2 configuration, but it shouldn't touch mapper/control.	2013-05-13 11:53:53 +02:00
Zdenek Kabelac	8d004b5127	report: show active state of LV For non clustered VG - show "active"/"" For clustered VG its more complex: "local exclusive" "remote exclusive" "locally" "remotely"	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	8b18ab76d2	report: show dmeventd monitoring status Add new lvs segment field 'Monitor' showing 3 states: "monitored" - LV is monitored by dmeventd. "not monitored" - LV is currently not being monitored by dmeventd "" (empty) - LV does not support monitoring, or dmeventd support is not compiled in.	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	3f7de58e96	man: lvextend --use-policies Add missing man info.	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	f84f12a6a3	snapshot: rework cluster creation and removal Support for exclusive activation of snapshots revealed some problems. When snapshot is created, COW LV is activated first (for clearing) and then it's transformed into snapshot's COW LV, but it has left the lock for such LV active in cluster and this lock could not have been removed from dlm, unless snapshot has been removed within same dlm session. If the user tried to remove snapshot after rebooting node, the lock was missing, and COW LV could not have been detached. Patch modifies the approach in this way: Always deactivate COW LV for clustered vg after clearing (so it's activated again via implicit snapshot activation rule when snapshot is activated). When snapshot is removed, activate COW LV as independent LV, so the lock will exist for such LV, but only when the snapshot is active. Also add test case for testing snapshot removal after cluster reboot.	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	d51b7e5404	clvmd: avoid pretesting of dev availability Patch fixes hidden problem with lvm metadata caching. When the pretest was made, only the committed data have been cached back since the call lv_info_by_lvid() triggers mda read operation. However call of lv_suspend_if_active() also reads precommitted metadata. The problem is visible in this sequence of calls: vg_write(), suspend_lv(), vg_commit(), resume_lv() which may end with leaving outdated mda in lvm cache, since vg_write() drops cached metadata and vg_commit() only transforms precommitted to committed metadata, but in the case of pretesting we have no precommitted mda available so the cache will continue to use old metadata. This happens, when suspend LV is inactive.	2013-04-25 17:33:22 +02:00
Zdenek Kabelac	45eeb70b02	config: merge timestamps Merging multiple config files together needs to know newest (highest) timestamp of merged files. Persistent cache file is being used only in case, the config file is older than .cache file.	2013-04-23 12:31:16 +02:00
Zdenek Kabelac	1951798d72	vgread: fix fid transfer for lvm1 and pool format Assign fid as the last step before returning VG. Make the format reader for 'lvm1' and 'pool' equal to 'lvm2' format reader. It has caused memory corruption to lvmetad as it later calls destroy_instance() to allocated fid. This patch should fix problems with crashing test lvmetad-lvm1.sh.	2013-04-21 23:13:57 +02:00
Zdenek Kabelac	a2b76a6f02	thin: fix resource leak in err path If the devices list could not have been obtained, FILE* was leaked.	2013-04-21 23:10:30 +02:00
Zdenek Kabelac	17a6915054	thin: explicitly avoid pvmove operation So far we do not support pvmove for thin volumes and thin pools.	2013-04-21 23:09:11 +02:00
Zdenek Kabelac	f787b575b5	lvmetad: fix error paths Also add missing goto out on error. Error path missed return NULL leading to double free of enc_value.	2013-04-21 23:04:53 +02:00
Zdenek Kabelac	c9d8d22224	clvmd: fix response status Failing status code is expected to be 0. Also do not return '*response' as a pointer which has already been freed.	2013-04-21 22:54:42 +02:00
Jonathan Brassow	2e0740f7ef	RAID: Add writemostly/writebehind support for RAID1 'lvchange' is used to alter a RAID 1 logical volume's write-mostly and write-behind characteristics. The '--writemostly' parameter takes a PV as an argument with an optional trailing character to specify whether to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing character is given, it will set the flag. Synopsis: lvchange [--writemostly <PV>:{t\|y\|n}] [--writebehind <count>] vg/lv Example: lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv The last character in the 'lv_attr' field is used to show whether a device has the WriteMostly flag set. It is signified with a 'w'. If the device has failed, the 'p'artial flag has priority. Example ("nosync" raid1 with mismatch_cnt and writemostly): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg Rwi---r-m 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m Example (raid1 with mismatch_cnt, writemostly - but failed drive): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg rwi---r-p 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m A new reportable field has been added for writebehind as well. If write-behind has not been set or the LV is not RAID1, the field will be blank. Example (writebehind is set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- 512 [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor-- Example (writebehind is not set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor--	2013-04-15 13:59:46 -05:00
Zdenek Kabelac	a81a2406f1	tools: add common lv_change_activate Move common code for changing activation state from vgchange and lvchange to one function. Fix the order of checks - so we always implicitly activate snapshots and thin volumes in exclusive mode, and we do not allow local deactivation for them.	2013-04-12 11:30:07 +02:00
Jonathan Brassow	719e908bc0	WHATS_NEW: Add WHATS_NEW entry for previous commit.	2013-04-11 16:03:24 -05:00
Jonathan Brassow	ff64e3500f	RAID: Add scrubbing support for RAID LVs New options to 'lvchange' allow users to scrub their RAID LVs. Synopsis: lvchange --syncaction {check\|repair} vg/raid_lv RAID scrubbing is the process of reading all the data and parity blocks in an array and checking to see whether they are coherent. 'lvchange' can now initiate the two scrubbing operations: "check" and "repair". "check" will go over the array and record the number of discrepancies but not repair them. "repair" will correct the discrepancies as it finds them. 'lvchange --syncaction repair vg/raid_lv' is not to be confused with 'lvconvert --repair vg/raid_lv'. The former initiates a background synchronization operation on the array, while the latter is designed to repair/replace failed devices in a mirror or RAID logical volume. Additional reporting has been added for 'lvs' to support the new operations. Two new printable fields (which are not printed by default) have been added: "syncaction" and "mismatches". These can be accessed using the '-o' option to 'lvs', like: lvs -o +syncaction,mismatches vg/lv "syncaction" will print the current synchronization operation that the RAID volume is performing. It can be one of the following: - idle: All sync operations complete (doing nothing) - resync: Initializing an array or recovering after a machine failure - recover: Replacing a device in the array - check: Looking for array inconsistencies - repair: Looking for and repairing inconsistencies The "mismatches" field will print the number of discrepancies found during a check or repair operation. The 'Cpy%Sync' field already available to 'lvs' will print the progress of any of the above syncactions, including check and repair. Finally, the lv_attr field has changed to accommodate the scrubbing operations as well. The role of the 'p'artial character in the lv_attr report field has expanded. 
"Partial" is really an indicator for the health of a logical volume and it makes sense to extend this to include other health indicators as well, specifically: 'm'ismatches: Indicates that there are discrepancies in a RAID LV. This character is shown after a scrubbing operation has detected that portions of the RAID are not coherent. 'r'efresh : Indicates that a device in a RAID array has suffered a failure and the kernel regards it as failed - even though LVM can read the device label and considers the device to be ok. The LV should be 'r'efreshed to notify the kernel that the device is now available, or the device should be 'r'eplaced if it is suspected of failing.	2013-04-11 15:33:59 -05:00
Jonathan Brassow	95d28735ea	WHATS_NEW: Include entry for RAID status func improvements	2013-04-08 15:17:12 -05:00
Zdenek Kabelac	c22e925ce4	man: lvcreate document external origin snapshot Document added support for external origin.	2013-04-05 14:15:03 +02:00
Zdenek Kabelac	ddafa0115e	man: updates for lvconvert and lvcreate Cleanup and improvement on man pages.	2013-04-05 14:14:20 +02:00
Peter Rajnoha	32ae07cef1	pv_write: clean up non-orphan format1 PV write ...to not pollute the common and format-independent code in the abstraction layer above. The format1 pv_write has common code for writing metadata and PV header by calling the "write_disks" fn and when rewriting the header itself only (e.g. just for the purpose of changing the PV UUID) during the pvchange operation, we had to tweak this functionality for the format1 case and we had to assign the PV the orphan state temporarily. This patch removes the need for this format1 tweak and it calls the write_disks with appropriate flag indicating whether this is a PV write call or a VG write call, allowing for metadata update for the latter one. Also, a side effect of the former tweak was that it effectively invalidated the cache (even for the non-format1 PVs) as we assigned it the orphan state temporarily just for the format1 PV write to pass. Also, that tweak made it difficult to directly detect whether a PV was part of a VG or not because the state was incorrect. Also, it's not necessary to backup and restore some PV fields when doing a PV write: orig_pe_size = pv_pe_size(pv); orig_pe_start = pv_pe_start(pv); orig_pe_count = pv_pe_count(pv); ... pv_write(pv) ... pv->pe_size = orig_pe_size; pv->pe_start = orig_pe_start; pv->pe_count = orig_pe_count; ...this is already done by the layer below itself (the _format1_pv_write fn). So let's have this cleaned up so we don't need to be bothered about any 'format1 special case for pv_write' anymore.	2013-03-25 15:08:26 +01:00
Peter Rajnoha	784867d5bd	WHATS_NEW: vgextend and PV with 0 MDAs	2013-03-19 15:41:34 +01:00
Zdenek Kabelac	b36a776a7f	thin: move update_pool_params Now we may recognize preset arguments, move the code for updating thin pool related values into /lib portion of the code.	2013-03-13 15:13:54 +01:00
Alasdair G Kergon	cbfb5a98b5	filters: power2 devs get precedence if PVIDs match Give precedence to EMC "power2" devices with duplicate PVIDs like we already do with "emcpower" devices.	2013-03-11 20:10:49 +00:00
Peter Rajnoha	03b5c51730	WHATS_NEW: add lines for config validation support	2013-03-06 11:00:30 +01:00
Peter Rajnoha	b3776468fa	WHATS_NEW: add lines for embedding area support	2013-02-26 15:50:43 +01:00
Zdenek Kabelac	b73de73151	thin: lvconvert support for external origin Add basic support for converting LV into an external origin volume. Syntax: lvconvert --thinpool vg/pool --originname renamed_origin -T origin It will convert volume 'origin' into a thin volume, which will use 'renamed_origin' as an external read-only origin. All read/write into origin will go via 'pool'. renamed_origin volume is read-only volume, that could be activated only in read-only mode, and cannot be modified.	2013-02-23 10:38:20 +01:00
Zdenek Kabelac	d023b2d12f	lvremove: easier removal of dependent lvs Add function to remove LVs that depend on the removed LV before that LV is removed. User is asked for confirmation.	2013-02-23 10:31:05 +01:00
Zdenek Kabelac	3679bb1cd9	activation: simplify activation code Reorder activation code to look similar for preload tree and activation tree. It also gives much better support for device stacking, since now we also support activation of a snapshot which might then be used for other devices.	2013-02-23 10:30:03 +01:00
Zdenek Kabelac	0631d233d8	activation: add _add_layer_target_to_dtree Add function for creation of simple linear mapping over layer device.	2013-02-23 10:29:08 +01:00
Zdenek Kabelac	78b23f3595	activation: extend _cached_info Add layer string to support check of layered devices.	2013-02-23 10:28:01 +01:00
Jonathan Brassow	bbc6378b73	RAID: Make 'lvchange --refresh' restore transiently failed RAID PVs A new function (dm_tree_node_force_identical_table_reload) was added to avoid the suppression of identical table reloads. This allows RAID LVs to reload the on-disk superblock information that contains which devices have failed and the bitmaps. If the failed device has returned, this has the effect of restoring the device and initiating recovery. Without this patch, the user had to completely deactivate their RAID LV and re-activate it in order to restore the failed device. Now they simply need to suspend and resume (which is done by 'lvchange --refresh'). The identical table suppression is only avoided if the LV is not PARTIAL (i.e. all of its devices can be seen and read by LVM) and the kernel status of the array contains failed devices. In other words, the function will only be called in the case where we may have success in restoring a failed device in the array.	2013-02-21 11:31:36 -06:00
Jonathan Brassow	3ab46449f4	vgimport: Allow '--force' to import VGs with missing PVs. When there are missing PVs in a volume group, most operations that alter the LVM metadata are disallowed. It turns out that 'vgimport' is one of those disallowed operations. This is bad because it creates a circular dependency. 'vgimport' will complain that the VG is inconsistent and that 'vgreduce --removemissing' must be run. However, 'vgreduce' cannot be run because it has not been imported. Therefore, 'vgimport' must be one of the operations allowed to change the metadata when PVs are missing. The '--force' option is the way to make 'vgimport' happen in spite of the missing PVs.	2013-02-20 16:37:41 -06:00
Peter Rajnoha	303e86adc8	pvcreate: fix alignment to incorporate alignment offset if PV has 0 MDAs If zero metadata copies are used, there's no further recalculation of PV alignment that happens when adding metadata areas to the PV and which actually calculates the alignment correctly as a matter of fact. So fix this for "PV without MDA" case as well. Before this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 8.00m After this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in: if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align) pv->pe_start = pv->pe_align ... This part of the condition is not reachable as with the PV_PE_START_CALC, we always have pv->pe_start set to 0 from the PV struct initialisation (...the pv->pe_start value is just being calculated).	2013-02-21 14:51:19 +01:00
Jonathan Brassow	bd0ee420b5	RAID: Allow remove/replace of sub-LVs composed of error segments. When a device fails, we may wish to replace those segments with an error segment. (Like when a 'vgreduce --removemissing' removes a failed device that happens to be a RAID image/meta.) We are then left with images that we will eventually want to remove or replace. This patch allows us to pull out these virtual "error" sub-LVs. This allows a user to 'lvconvert -m -1 vg/lv' to extract the bad sub-LVs. Sub-LVs with error segments are considered for extraction before other possible devices so that good devices are not accidentally removed. This patch also adds the ability to replace RAID images that contain error segments. The user will still be unable to run 'lvconvert --replace' because there is no way to address the 'error' segment (i.e. no PV that it is associated with). However, 'lvconvert --repair' can be used to replace the image's error segment with a new PV. This is also the most appropriate way to do it, since the LV will continue to be reported as 'partial'.	2013-02-20 14:58:56 -06:00
Jonathan Brassow	845852d6b4	RAID: Make 'vgreduce --removemissing' work with RAID LVs Currently it is impossible to remove a failed PV which has a RAID LV on it. This patch fixes the issue by replacing the failed PV with an 'error' segment within the affected sub-LVs. Once there is no longer a RAID LV using the PV, it can be removed. Most often, it is better to replace a failed RAID device with a spare. (You can use 'lvconvert --repair <vg>/<LV>' to accomplish that.) However, if there are no spares in the volume group and none will be added, it is useful to be able to remove the failed device. Following patches address the ability to perform 'lvconvert' operations on RAID LVs that contain sub-LVs composed of 'error' segments.	2013-02-20 14:52:46 -06:00
Jonathan Brassow	0e4ffd9d3b	clean-up: Rename lvm.conf setting 'mirror_region_size' to 'raid_region_size' We have been using 'mirror_region_size' in lvm.conf as the default region size for RAID logical volumes as well as mirror logical volumes. Since, "raid" is more inclusive and representative than "mirror", I have changed the name of this setting. We must still check for the old setting and warn the user if we are overriding it with the new setting if both happen to be present.	2013-02-20 14:40:17 -06:00
Peter Rajnoha	722ca363f0	report: fix pvs -o pv_free reporting for PVs with 0 PEs [0] raw/~ # lsblk -o NAME,SIZE /dev/sda NAME SIZE sda 128M [0] raw/~ # pvcreate --dataalignment 128m /dev/sda Physical volume "/dev/sda" successfully created [0] raw/~ # vgcreate vg /dev/sda Volume group "vg" successfully created [0] raw/~ # lvcreate -l1 vg Volume group "vg" has insufficient free space (0 extents): 1 required. Before this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 128.00m After this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 0	2013-02-21 13:28:07 +01:00
Zdenek Kabelac	c984d8fbab	thin: properly unmark volume after detach When the volume is detached from thin pool, unmask THIN_VOLUME flag and reset related pointers.	2013-02-05 14:40:37 +01:00
Zdenek Kabelac	a5b9b4bf02	thin: fix forbidden discards checks Instead of check for lv_is_active() for thin pool LV, query the whole pool via new pool_is_active(). Fixes a problem when we cannot change discards settings for active pool device where the actual layer for pool device was inactive, but thin volumes using thin pool have been active.	2013-02-05 14:38:16 +01:00
Zdenek Kabelac	11eaf1c98c	thin: add function pool_is_active This internal function checks for an active pool device. For a clustered VG it checks every thin volume; on a non-clustered VG we need to check just for presence of the -tpool device.	2013-02-05 14:35:44 +01:00
Zdenek Kabelac	9d445f371c	report: leave empty report field for 0 Since we do not support LVs with 0 size, use this value as 'error' value for devices without origin, and leave this field blank as in other cases.	2013-02-05 14:32:37 +01:00
Zdenek Kabelac	be5ad90703	lvconvert: fix accepting second lv name Do not allow to accept second LV name on lvconvert --thinpool command line.	2013-02-05 14:31:17 +01:00
Zdenek Kabelac	a4870c79ca	thin: use noflush for obtaining transaction_id Do not flush thin pool data when reading transaction_id status.	2013-02-04 19:05:56 +01:00
Zdenek Kabelac	ca7abbce8a	activate: add lv_layer function Add function to return layer name for LV.	2013-02-04 19:01:10 +01:00
Jonathan Brassow	38e7b37c89	WHATS_NEW: Better description of previous change	2013-02-01 11:52:25 -06:00
Jonathan Brassow	801d4f96a8	RAID: Improve 'lvs' attribute reporting of RAID LVs and sub-LVs There are currently a few issues with the reporting done on RAID LVs and sub-LVs. The most concerning is that 'lvs' does not always report the correct failure status of individual RAID sub-LVs (devices). This can occur when a device fails and is restored after the failure has been detected by the kernel. In this case, 'lvs' would report all devices are fine because it can read the labels on each device just fine. Example: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) However, 'dmsetup status' on the device tells us a different story: [root@bp-01 lvm2]# dmsetup status vg-lv 0 1024000 raid raid1 2 DA 1024000/1024000 In this case, we must also be sure to check the RAID LVs kernel status in order to get the proper information. Here is an example of the correct output that is displayed after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-p 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-p /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-p /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) The other case where 'lvs' gives incomplete or improper output is when a device is replaced or added to a RAID LV. It should display that the RAID LV is in the process of sync'ing and that the new device is the only one that is not-in-sync - as indicated by a leading 'I' in the Attr column. (Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates an (I)mage that is not in sync.) 
Here's an example of the old incorrect behaviour: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg Iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note that all the images currently are marked as 'I' even though it is only the last device that has been added that should be marked. Here is an example of the correct output after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note only the last image is marked with an 'I'. This is correct and we can tell that it isn't the whole array that is sync'ing, but just the new device. It also works under snapshots... 
[root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg owi-a-r-p 33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-p /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-p /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) snap vg swi-a-s-- /dev/sda1(51201)	2013-02-01 11:33:54 -06:00
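The attribute strings in the listings above can be read mechanically. The following is a minimal Python sketch, not code from lvm2: the function name is hypothetical, and it assumes the layout seen in these listings only (the first character is 'i' or 'I' for a RAID image, and the last character is 'p' when the LV is reported as partial).

```python
def describe_raid_attr(attr: str) -> dict:
    """Interpret an 'lvs' Attr string as shown in the listings above.

    Assumption (taken from the sample output only): the first character is
    'i' for an in-sync image and 'I' for an image that is not in sync, and
    the last character is 'p' when the LV is partial.
    """
    if not attr:
        raise ValueError("empty attribute string")
    return {
        "image_in_sync": attr[0] == "i",
        "image_not_in_sync": attr[0] == "I",
        "partial": attr[-1] == "p",
    }


# Rows taken from the snapshot example above.
for name, attr in [("lv_rimage_0", "iwi-aor--"), ("lv_rimage_1", "Iwi-aor-p")]:
    print(name, describe_raid_attr(attr))
```

Applied to the last listing, lv_rimage_1 is the only image flagged both as not in sync and as partial.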
Peter Rajnoha	f7da1caf8d	blkdeactivate: fix handling of nested mountpoints and mangled mount paths. If there was a nested mountpoint inside an existing mount path, blkdeactivate could fail to unmount such a mountpoint as it needs to deactivate the deepest path first and continue upwards. For example the simplest reproducer: [root@rhel6-a ~]# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 4G 0 disk \|-vg-lvol0 (dm-2) 253:2 0 32M 0 lvm /mnt/a `-vg-lvol1 (dm-3) 253:3 0 32M 0 lvm /mnt/a/b Before this patch: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a umount: /mnt/a: device is busy. (In some cases useful info about processes that use the device is found by lsof(8) or fuser(1)) UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b LVM: deactivating Logical Volume vg/lvol1 (deactivation of vg/lvol0 is skipped as /mnt/a that is on lvol0 can't be unmounted - it still has /mnt/a/b as nested mountpoint!) With this patch applied: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol1 (dm-3) mounted on /mnt/a/b UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/a LVM: deactivating Logical Volume vg/lvol0 LVM: deactivating Logical Volume vg/lvol1 === Also, this patch contains a fix for processing mangled mount paths: [root@rhel6-a ~]# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 4G 0 disk `-vg-lvol0 (dm-2) 253:2 0 32M 0 lvm /mnt/x y z [root@rhel6-a ~]# lsblk -r vg-lvol0 253:2 0 32M 0 lvm /mnt/x\x20y\x20z (the mount path is mangled with \xNN that is visible in raw lsblk output only and which is used in blkdeactivate as well) Before this patch: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: umount: /mnt/x\x20y\x20z: not found After this patch is applied: [root@rhel6-a ~]# blkdeactivate -u Deactivating block devices: UMOUNT: unmounting vg-lvol0 (dm-2) mounted on /mnt/x\x20y\x20z LVM: deactivating Logical Volume vg/lvol0	2013-01-23 14:45:41 +01:00
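The two ideas in the blkdeactivate message above (unmount the deepest path first, and decode \xNN-mangled mount paths before use) can be illustrated with a short Python sketch. This is not how blkdeactivate is implemented; the helper names unmangle_path and unmount_order are hypothetical, and depth is approximated by counting path separators.

```python
import re


def unmangle_path(path: str) -> str:
    """Decode \\xNN escapes as they appear in raw 'lsblk -r' output."""
    return re.sub(
        r"\\x([0-9a-fA-F]{2})",
        lambda match: chr(int(match.group(1), 16)),
        path,
    )


def unmount_order(mounts):
    """Return (device, mountpoint) pairs with the deepest mountpoint first.

    Depth is the number of '/' separators in the decoded path, so a nested
    mountpoint such as /mnt/a/b is handled before its parent /mnt/a.
    """
    def depth(entry):
        return unmangle_path(entry[1]).rstrip("/").count("/")

    return sorted(mounts, key=depth, reverse=True)


mounts = [("vg-lvol0", "/mnt/a"), ("vg-lvol1", "/mnt/a/b")]
for device, mountpoint in unmount_order(mounts):
    print("UMOUNT:", device, "mounted on", unmangle_path(mountpoint))
```

With the reproducer from the message, vg-lvol1 on /mnt/a/b is listed before vg-lvol0 on /mnt/a, which matches the order shown in the "with this patch applied" output.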
Zdenek Kabelac	8bcc1da2f3	locales: use higher prio LC_ALL variable For resetting the locale environment to the significantly less memory-consuming 'C' locale, use LC_ALL instead of LANG since it has higher priority in locale settings. Otherwise the whole locale-archive, which might be over 100MB on e.g. Fedora systems, may end up locked in memory by some daemons.	2013-01-22 11:25:02 +01:00
Petr Rockai	142c4bf9f0	Update WHATS_NEW.	2013-01-16 11:22:08 +01:00
Zdenek Kabelac	2b760a7fa7	whatsnew	2013-01-11 09:26:51 +01:00
Alasdair G Kergon	7f747a0d73	logging: add debug classes Add log/debug_classes to lvm.conf to allow debug messages to be classified and filtered at runtime. The dm_errno field is only used by log_error(), so I've redefined it for log_debug() messages to hold the message class. By default, all existing messages appear, but we can add categories that generate high volumes of data, such as logging all traffic to/from lvmetad.	2013-01-07 22:25:19 +00:00
Peter Rajnoha	ad85b0c526	pvscan: synchronize with udev if pvscan --cache is used. We need to call sync_local_dev_names directly as pvscan uses VG_GLOBAL lock and this one does not cause the synchronization (sync_dev_names) to be called on unlock (VG_GLOBAL is not a real VG): #define unlock_vg(cmd, vol) do { \ if (is_real_vg(vol)) \ sync_dev_names(cmd); \ (void) lock_vol(cmd, vol, LCK_VG_UNLOCK); \ } while (0) Without this fix, we end up without udev synchronization for the pvscan --cache (mainly for -aay that causes the VGs/LVs to be autoactivated) and also udev synchronization cookies are then left in the system since they're not managed properly (code before sets up udev sync cookies, but we have to call dm_udev_wait at least once after that to do the wait and cleanup).	2012-12-21 11:15:46 +01:00
Peter Rajnoha	756bcabbfe	activation: fix autoactivation to not trigger on each PV change Before, the pvscan --cache -aay was called on each ADD and CHANGE uevent (for a device that is not a device-mapper device) and each CHANGE event (for a PV that is a device-mapper device). This causes troubles with autoactivation in some cases as CHANGE event may originate from using the OPTION+="watch" udev rule that is defined in 60-persistent-storage.rules (part of the rules provided by udev directly) and it's used for all block devices (except fd\|mtd\|nbd\|gnbd\|btibm\|dm-\|md* devices). For example, the following sequence incorrectly activates the rest of LVs in a VG if one of the LVs in the VG is being removed: [root@rhel6-a ~]# pvcreate /dev/sda Physical volume "/dev/sda" successfully created [root@rhel6-a ~]# vgcreate vg /dev/sda Volume group "vg" successfully created [root@rhel6-a ~]# lvcreate -l1 vg Logical volume "lvol0" created [root@rhel6-a ~]# lvcreate -l1 vg Logical volume "lvol1" created [root@rhel6-a ~]# vgchange -an vg 0 logical volume(s) in volume group "vg" now active [root@rhel6-a ~]# lvs LV VG Attr LSize Pool Origin Data% Move Log Cpy%Sync Convert lvol0 vg -wi------ 4.00m lvol1 vg -wi------ 4.00m [root@rhel6-a ~]# lvremove -ff vg/lvol1 Logical volume "lvol1" successfully removed [root@rhel6-a ~]# lvs LV VG Attr LSize Pool Origin Data% Move Log Cpy%Sync Convert lvol0 vg -wi-a---- 4.00m ...so the vg was deactivated, then lvol1 removed, and we end up with lvol1 removed (which is ok) BUT with lvol0 activated (which is wrong)!!! This is because after lvol1 removal, we need to write metadata to the underlying device /dev/sda and that causes the CHANGE event to be generated (because of the WATCH udev rule set on this device) and this causes the pvscan --cache -aay to be reevaluated. 
We have to limit this and call pvscan --cache -aay to autoactivate VGs/LVs only in these cases: --> if the PV is not a dm device, scan only after proper device addition (ADD event) and not with any other changes (CHANGE event) --> if the PV is a dm device, scan only after proper mapping activation (CHANGE event + the underlying PV in a state "just activated")	2012-12-21 10:34:48 +01:00
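The two conditions listed in the message above amount to a small decision rule. The sketch below expresses it in Python for illustration only: the real logic lives in udev rules, and the function name and its parameters are hypothetical.

```python
def should_autoactivate(action: str, is_dm_device: bool,
                        just_activated: bool = False) -> bool:
    """Decide whether 'pvscan --cache -aay' should run for a uevent.

    Mirrors the two cases from the commit message:
    - PV is not a dm device: only on proper device addition (ADD event).
    - PV is a dm device: only on a CHANGE event where the underlying
      mapping has just been activated.
    """
    action = action.lower()
    if is_dm_device:
        return action == "change" and just_activated
    return action == "add"


# A CHANGE event caused by a metadata write to /dev/sda (the WATCH rule case)
# no longer triggers autoactivation.
print(should_autoactivate("change", is_dm_device=False))
```

Under this rule, the metadata write that follows 'lvremove' in the example no longer re-activates lvol0, because a CHANGE event on a non-dm PV is ignored.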
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read were not in-sync. The kernel enforces this rule unless overridden by not allowing the creation of an array that is not in-sync and includes a device that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is inactive 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
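The three conditions in the message above can be written as a small policy check. This is a Python sketch for illustration, not the lvm2 implementation; the Decision type, the function name, and the warning text are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Decision:
    allowed: bool
    warning: Optional[str] = None


def check_raid_device_change(operation: str, lv_active: bool,
                             lv_in_sync: bool) -> Decision:
    """Apply the three rules listed in the commit message.

    operation is "replace" or "repair" (hypothetical labels for
    'lvconvert --replace' and 'lvconvert --repair').
    """
    if operation not in ("replace", "repair"):
        raise ValueError("unknown operation: " + operation)
    # 1) Never allow replacement or repair if the LV is inactive,
    #    because the sync state cannot be known.
    if not lv_active:
        return Decision(allowed=False)
    if lv_in_sync:
        return Decision(allowed=True)
    # 2) Never allow replacement if the LV is not in-sync.
    if operation == "replace":
        return Decision(allowed=False)
    # 3) Allow repair if the LV is not in-sync, but warn.
    return Decision(allowed=True, warning="contents may not be recoverable")


print(check_raid_device_change("repair", lv_active=True, lv_in_sync=False))
```

The ordering matters: the activity check comes first because the in-sync state is only meaningful for an active LV.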
Peter Rajnoha	0379c480e0	WHATS_NEW: changelog for `fae1a611d2` and `5294a6f77a`	2012-12-18 12:12:58 +01:00
Zdenek Kabelac	401c9aba4a	pv_read: add missing check for valid info If the lvmcache_info_from_pvid() fails to find valid info, invoke the lookup by dev, and only in this case call lvmcache_info_from_pvid() again. Also check for the result of info and return error directly, so the NULL is not passed to lvmcache_get_label().	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	3e8dbfaecf	lvmetad: add check for failure dm_config_write_node Detect if dm_config_write_node failed and fail correctly.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	4008f4f891	lvmetad: fix socket leak in handle_connect Close socket_fd and report error on malloc failure.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	e012d0635d	lvmetad: check id_read_format error status Detect error from id_read_format() function.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	ba3f37c9e4	lvmetad: fix memleak on pv_found error path Free resources allocated in pv_found when going out through error path.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	a4269aadf3	lvmetad: unlock vg on out-of-memory path If we fail to get memory for mutex, hash the mutex or fail somewhere along pthread function calls return allocated resources back and unlock vg_lock_map mutex.	2012-12-15 17:23:26 +01:00
Zdenek Kabelac	788ac7fa54	libdaemon: check for strdup result Detect failure of dm_pool_strdup() and print error in fail path. Save one extra strchr call - since we already know the distance for the '=' character. Drop stack trace from return after log_error().	2012-12-15 17:23:26 +01:00
Zdenek Kabelac	ff5612c0c3	format-text: check for _text_create_text_instance Test if 'fid' creation failed and report stack trace, break the loop and do not pass NULL fid further.	2012-12-15 17:23:23 +01:00
Zdenek Kabelac	740ab81d03	log: move abort past syslog When abort_on_internal_errors is enabled, we aborted prior to the syslog logging output. Since such a fatal error gets level _LOG_FATAL, it should not be blocked by the debug_level() check, so let's move the abort further down so that the error is also logged via syslog.	2012-12-15 17:22:48 +01:00
Andy Grover	58b61c252a	python-lvm: Small fixups to new create_lv_snapshot Tabify Remove use of asize, unneeded. Don't initialize lvobj->parent_vgobj to NULL, the object ctor already zeroed everything on alloc. Redo call to lvm_lv_snapshot to use the liblvm snapshot implementation we went with. Add {}s to silence warning in lv_dealloc. Rename snapshot function for consistency. Update WHATS_NEW. Signed-off-by: Andy Grover <agrover@redhat.com>	2012-12-14 10:30:26 -08:00
Petr Rockai	c089029b70	Update WHATS_NEW.	2012-12-12 15:17:08 +01:00
Peter Rajnoha	e5709a32be	lvmetad: fix compiler warning and add WHATS_NEW line for previous commit	2012-12-12 13:27:25 +01:00
Peter Rajnoha	cad22be394	lvconvert: allow lvconvert --stripes/stripesize only with -mirrors/--repair/--thinpool Also, update lvconvert man page to reflect this and make clear that the --stripes/stripesize is applied to newly allocated space only.	2012-12-11 15:50:25 +01:00
Zdenek Kabelac	ec49f07b0d	mirrors: fix leak in device_is_usable mirror check Function _ignore_blocked_mirror_devices was not releasing the allocated strings images_health and log_health. In error paths it was also not releasing the dm_task structure. Swapped the return code of _ignore_blocked_mirror_devices to use 1 as success. In _parse_mirror_status use log_error if memory allocation fails, and for a few more errors, so they do not go unnoticed as debug messages. On error path always clear return values and free strings. For dev_create_file use cache mem pool to avoid memleak.	2012-12-11 11:15:22 +01:00
Peter Rajnoha	f942ae4a7a	lvconvert: do not ignore -f in lvconvert --repair -y -f	2012-12-11 09:52:54 +01:00
Jonathan Brassow	3835755259	pvmove/RAID: Disallow pvmove on RAID LVs until properly handled Attempting pvmove on RAID LVs replaces the kernel RAID target with a temporary pvmove target, ultimately destroying the RAID LV. pvmove must be prevented on RAID LVs for now. Use 'lvconvert --replace old_pv vg/lv new_pv' if you want to move an image of the RAID LV.	2012-12-04 17:47:47 -06:00
Peter Rajnoha	e2be2652ad	Allow empty activation/{auto_activation\|read_only\|}_volume_list config option. In case we don't want to activate, autoactivate or have the VG/LV read-only. Primarily targeted for the auto_activation_volume_list, but it does no harm for other settings (the part of the code that reads these three settings is shared, but there's no reason to separate it only for this change).	2012-12-04 10:33:54 +01:00
Zdenek Kabelac	5ec20e267f	thin: reworked thin feature detection Rework thin feature detection to support runtime section to allow to disable them selectively. New lvm.conf option is born: global/thin_disabled_features	2012-12-03 11:57:40 +01:00
Zdenek Kabelac	99018b37ee	thin: lvconvert supports swapping metadata device Support swapping of metadata device if the thin pool already exists. This makes it easy to, e.g., resize or repair the metadata. User may create some empty LV, replace existing metadata or dump and restore them into bigger LV.	2012-12-02 18:01:27 +01:00
Zdenek Kabelac	6987a353de	thin: add detach_pool_metadata_lv Add internal function detach_pool_metadata_lv().	2012-12-02 17:56:29 +01:00
Zdenek Kabelac	9ec474f38a	lvm2api: fix size reporting API is reporting all sizes as 64bit integers in bytes. Fix at those places, where sectors were returned to remain consistent.	2012-12-02 17:55:08 +01:00
Peter Rajnoha	4891a735d3	udev: recognize DM_DISABLE_UDEV environment variable Setting this environment variable will cause a full fallback to old direct node and symlink management in libdevmapper and lvm2. It means: - disabling udev synchronization (--noudevsync in dmsetup and --noudevsync + activation/udev_sync=0 lvm2 config) - disabling dm and any subsystem related udev rules (--noudevrules in dmsetup and activation/udev_rules=0 lvm2 config) - management of nodes/symlinks under /dev directly by libdevmapper/lvm2 (--verifyudev in dmsetup and activation/verify_udev_operations=1 lvm2 config) - not obtaining any device list from udev database (devices/obtain_device_list_from_udev=0 lvm2 config) Note: we could set all of these before - there's no functional change! However the DM_DISABLE_UDEV environment variable is a nice shortcut to make it easier for libdevmapper users so that one can switch off all of the udev management off at one go directly on the command line, without a need to modify any source or add any extra switches.	2012-11-29 14:03:48 +01:00
Peter Rajnoha	fb8cc7c63f	udev: do not verify udev operations for --noudevsync If udev synchronization is disabled by means of --noudevsync option, we should disable just the synchronization and nothing else. The udev fallback (verifying udev operations and fixing the nodes/symlinks if found incorrect) is orthogonal and controlled by a separate activation/verify_udev_operations configuration option.	2012-11-29 13:59:12 +01:00
Zdenek Kabelac	0387e70d76	thin: fix property discard for lvm2api Discards property is string and may have these values: ignore, nopassdown, passdown	2012-11-27 14:09:49 +01:00
Zdenek Kabelac	09b7ceea95	thin: allow restore with --force Allow restoring metadata with thin pool volumes. No validation is done for this case within vgcfgrestore tool - thus incorrect metadata may lead to destruction of pool content.	2012-11-27 14:08:24 +01:00
Alasdair G Kergon	8c49aa79e7	filters: Add STEC skd and Violin vtms devices	2012-11-26 14:55:17 +00:00
Zdenek Kabelac	1ef9831018	thin: support configurable thin pool defaults Configurable settings for thin pool create if they are not specified on command line. New supported lvm.conf options are: allocation/thin_pool_chunk_size allocation/thin_pool_discards allocation/thin_pool_zero	2012-11-26 12:16:47 +01:00
Zdenek Kabelac	683b1f0625	thin: detect discards for non-power-2 Check if target supports discards for chunk sizes, that are not power of 2 (just multiple of 64K), and enable it in case it's supported by thin kernel target.	2012-11-26 12:14:47 +01:00
Petr Rockai	60668f823e	Automatically restore MISSING PVs with no MDAs.	2012-11-25 20:41:56 +01:00
Jonathan Brassow	b3e9a09abe	RAID: If no stripes argument is given for RAID10 create, default to 2 Similar to the way the 'mirror', 'raid1' and 'raid10' segment types set the number of mirrors to 2 ('-m 1') if the argument is not specified, here we set the number of stripes to 2 if not given on the command line when creating a RAID10 LV.	2012-11-21 18:46:52 -06:00
Jonathan Brassow	fb0cee9a66	RAID: Do not allow --splitmirrors on RAID10 logical volumes. RAID10 does not have the ability to split off images for independent use. So, 'lvconvert --splitmirrors' will not work and must be disallowed.	2012-11-21 18:39:26 -06:00
Zdenek Kabelac	d5697b29ee	mm: skip mlocking [vectors] Somehow forgotten: https://www.redhat.com/archives/linux-lvm/2012-June/msg00019.html Needed for arm architecture support.	2012-11-20 10:02:51 +01:00
Zdenek Kabelac	b21d3e3592	thin: lvconvert update Use common function from toollib and support allocation of metadata LV with given thin pool data LV.	2012-11-19 14:38:17 +01:00
Zdenek Kabelac	f4137640f6	thin: add common pool functions Move common functions for lvcreate and lvconvert. get_pool_params() - read thin pool args. update_pool_params() - updates/validates some thin args. It is getting complicated and a few more things will be implemented, so to avoid reimplementing things differently in lvcreate and lvconvert, the code has been split into 2 common functions that allow some future extension.	2012-11-19 14:38:17 +01:00
Jonathan Brassow	54c73b7723	mirror: Mirrored log should be fixed before mirror when double fault occurs This patch is intended to fix bug 825323 - FS turns read-only during a double fault of a mirror leg and mirrored log's leg at the same time. It only affects a 2-way mirror with a mirrored log. 3+-way mirrors and mirrors without a mirrored log are not affected. The problem resulted from the fact that the top level mirror was not using 'noflush' when suspending before its "down-convert". When a mirror image fails, the bios are queued until a suspend is received. If it is a 'noflush' suspend, the bios can be safely requeued in the DM core. If 'noflush' is not used, the bios must be pushed through the target and if a device is failed for a mirror, that means issuing an error. When an error is received by a file system, it results in it turning read-only (depending on the FS). Part of the problem is due to the nature of the stacking involved in using a mirror as a mirror's log. When an image in each fails, the top level mirror stalls because it is waiting for a log flush. The other stalls waiting for corrective action. When the repair command is issued, the entire stacked arrangement is collapsed to a linear LV. The log flush then fails (somewhat uncleanly) and the top-level mirror is suspended without 'noflush' because it is a linear device. This patch allows the log to be repaired first, which in turn allows the top-level mirror's log flush to complete cleanly. The top-level mirror is then secondarily reduced to a linear device - at which time this mirror is suspended properly with 'noflush'.	2012-11-14 14:58:47 -06:00
Tony Asleson	7a34db0cfd	python-lvm: Initial check-in of python-lvm unit test case. Signed-off-by: Tony Asleson <tasleson@redhat.com>	2012-11-14 13:18:37 -06:00
Peter Rajnoha	fc2644ae71	pvscan: exit --cache immediately if locking_type=3 || use_lvmetad=0	2012-11-09 15:56:57 +01:00
Peter Rajnoha	360c569ce8	systemd: various updates and fixes Don't use lvmetad in lvm2-monitor.service ExecStop to avoid a systemd issue. - a systemd design issue while processing dependencies with socket-based activation that ends up with a hang - https://bugzilla.redhat.com/show_bug.cgi?id=843587 (also tracker bug https://bugzilla.redhat.com/show_bug.cgi?id=871527) - not using lvmetad in this case is just a workaround, once the bug above is resolved, we should enable the lvmetad in that specific case Remove dependency on fedora-storage-init.service in lvm2 systemd units. - fedora-storage-init.service and fedora-storage-init-late.service are going to be separated into respective units that belong to each block device subsystem: - mpath + mdraid activated via udev solely - dmraid with its own dmraid-activation.service unit - lvm2 with the lvm2-activation-generator to generate the activation units at runtime if lvmetad is disabled (global/use_lvmetad=0 set in lvm.conf) and activation done via udev+lvmetad if lvmetad is enabled (global/use_lvmetad=1 set in lvm.conf) Depend on lvm2-lvmetad.socket in lvm2-monitor.service systemd unit. - as lvm2-monitor uses lvmetad if lvmetad is enabled	2012-10-30 20:55:50 +01:00
Peter Rajnoha	10492b238d	lvmetad: whats_new + more explanation for previous commit	2012-10-25 14:47:45 +02:00
Jonathan Brassow	b248ba0a39	mirror: Avoid reading mirrors with failed devices in mirrored log Commit `9fd7ac7d03` did not handle mirrors that contained mirrored logs. This is because the status line of the mirror does not give an indication of the health of the mirrored log, as you can see here: [root@bp-01 lvm2]# dmsetup status vg-lv vg-lv_mlog vg-lv: 0 409600 mirror 2 253:6 253:7 400/400 1 AA 3 disk 253:5 A vg-lv_mlog: 0 8192 mirror 2 253:3 253:4 7/8 1 AD 1 core Thus, the possibility for LVM commands to hang still persists when mirrors have mirrored logs. I discovered this while performing some testing that does polling with 'pvs' while doing I/O and killing devices. The 'pvs' managed to get between the mirrored log device failure and the attempt by dmeventd to repair it. The result was a very nasty block in LVM commands that is very difficult to remove - even for someone who knows what is going on. Thus, it is absolutely essential that the log of a mirror be recursively checked for mirror devices which may be failed as well. Despite what the code comment says in the aforementioned commit... + * _mirrored_transient_status(). FIXME: It is unable to handle mirrors + * with mirrored logs because it does not have a way to get the status of + * the mirror that forms the log, which could be blocked. ... it is possible to get the status of the log because the log device major/minor is given to us by the status output of the top-level mirror. We can use that to query the log device for any DM status and see if it is a mirror that needs to be bypassed. This patch does just that and is now able to avoid reading from mirrors that have failed devices in a mirrored log.	2012-10-25 00:42:45 -05:00
Jonathan Brassow	9fd7ac7d03	mirror: Avoid reading from mirrors that have failed devices Addresses: rhbz855398 (Allow VGs to be built on cluster mirrors), and other issues. The LVM code attempts to avoid reading labels from devices that are suspended to try to avoid situations that may cause the commands to block indefinitely. When scanning devices, 'ignore_suspended_devices' can be set so the code (lib/activate/dev_manager.c:device_is_usable()) checks any DM devices it finds and avoids them if they are suspended. The mirror target has an additional mechanism that can cause I/O to be blocked. If a device in a mirror fails, all I/O will be blocked by the kernel until a new table (a linear target or a mirror with replacement devices) is loaded. The mirror indicates that this condition has happened by marking a 'D' for the faulty device in its status output. This condition must also be checked by 'device_is_usable()' to avoid the possibility of blocking LVM commands indefinitely due to an attempt to read the blocked mirror for labels. Until now, mirrors were avoided if the 'ignore_suspended_devices' condition was set. This check seemed to suggest, "if we are concerned about suspended devices, then let's ignore mirrors altogether just in case". This is insufficient and doesn't solve any problems. All devices that are suspended are already avoided if 'ignore_suspended_devices' is set; and if a mirror is blocking because of an error condition, it will block the LVM command regardless of the setting of that variable. Rather than avoiding mirrors whenever 'ignore_suspended_devices' is set, this patch causes mirrors to be avoided whenever they are blocking due to an error. (As mentioned above, the case where a DM device is suspended is already covered.) This solves a number of issues that weren't handled before. 
For example, pvcreate (or any command that does a pv_read or vg_read, which eventually call device_is_usable()) will be protected from blocked mirrors regardless of how 'ignore_suspended_devices' is set. Additionally, a mirror that is neither suspended nor blocking is /allowed/ to be read regardless of how 'ignore_suspended_devices' is set. (The latter point being the source of the fix for rhbz855398.)	2012-10-23 23:10:33 -05:00
Jonathan Brassow	b873fc54ba	WHATS_NEW: Entry for commit `e191780947` WHATS_NEW commit for 'lvs' output change to add RAID 4/5/6 sync %age to s/Copy%/Cpy%Sync/ output.	2012-10-23 21:38:37 -05:00
Zdenek Kabelac	13fe333b54	clvmd: fix parsing of -d argument clvmd -d option parsing was not working properly. clvmd -d 2 (with space) has been ignored because of '::' used in getopt string, and as a failsafe '1' has been used. Later this debug_arg has been ignored and debug_opt was used instead, which happened to have value '1'. Submitted-by: Robert Milasan <rmilasan at suse.com> Reported-by: Robert Milasan <rmilasan at suse.com>	2012-10-19 15:35:56 +02:00
Zdenek Kabelac	5f5a5d1f53	lvchange: support --yes option for --persistent Support using command: lvchange --yes --persistent to skip y|n prompt.	2012-10-19 15:33:46 +02:00
Zdenek Kabelac	c7c53ad41d	pvcreate: fix leak on error path Missing vg release on error path. Add tests for a few more error cases.	2012-10-19 15:32:21 +02:00
Zdenek Kabelac	bf2741376d	Use lv_is_active instead of lv_info() Usage of lv_is_active makes it more obvious what is being checked.	2012-10-17 15:42:31 +02:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Alasdair G Kergon	ea6a8078b4	release: prepare for release	2012-10-15 15:19:32 +01:00
Zdenek Kabelac	b3899056d9	thin: disable conversion of thin-pool to read-only This change is not yet supported.	2012-10-15 14:09:11 +02:00
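
The recursive log check described in commit `b248ba0a39` above can be illustrated with a minimal Python sketch. It is not the LVM implementation (which is C code in lib/activate/dev_manager.c); the function names, the `status_by_devno` lookup table, and the assumed field layout are hypothetical, with the layout inferred only from the two `dmsetup status` sample lines quoted in that commit message.

```python
def parse_mirror_status(line):
    """Parse one 'dmsetup status' line of a mirror target (layout assumed
    from the sample lines in commit b248ba0a39, not from a specification)."""
    tokens = line.split()
    # Drop an optional leading "name:" column, as printed by dmsetup status.
    if tokens and tokens[0].endswith(":"):
        tokens = tokens[1:]
    if len(tokens) < 4 or tokens[2] != "mirror":
        raise ValueError("not a mirror status line")
    n = int(tokens[3])                      # number of mirror images
    images = tokens[4:4 + n]                # image devices as major:minor
    health = tokens[4 + n + 2]              # one char per image, 'D' = failed
    log_argc = int(tokens[4 + n + 3])       # number of log fields that follow
    log_args = tokens[4 + n + 4:4 + n + 4 + log_argc]
    log_type = log_args[0]
    # Only a 'disk' log names a device; a 'core' log lives in memory.
    log_dev = log_args[1] if log_type == "disk" else None
    return {"images": images, "health": health,
            "log_type": log_type, "log_dev": log_dev}


def mirror_needs_bypass(line, status_by_devno):
    """Return True if the mirror, or a mirror forming its log, has a failed
    device. status_by_devno is a hypothetical map: major:minor -> status line."""
    status = parse_mirror_status(line)
    if "D" in status["health"]:
        return True
    log_line = status_by_devno.get(status["log_dev"]) if status["log_dev"] else None
    if log_line is None:
        return False
    try:
        # The log may itself be a mirror: check it recursively.
        return mirror_needs_bypass(log_line, status_by_devno)
    except ValueError:
        return False                        # log device is not a mirror


top = "vg-lv: 0 409600 mirror 2 253:6 253:7 400/400 1 AA 3 disk 253:5 A"
log = "vg-lv_mlog: 0 8192 mirror 2 253:3 253:4 7/8 1 AD 1 core"
print(mirror_needs_bypass(top, {}))              # top-level status alone
print(mirror_needs_bypass(top, {"253:5": log}))  # with the mirrored log checked
```

With the sample lines from the commit message, the top-level status alone shows both images as alive, while following the log device 253:5 reveals the failed leg in the mirrored log.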


2954 Commits