shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Peter Rajnoha	10492b238d	lvmetad: whats_new + more explanation for previous commit	2012-10-25 14:47:45 +02:00
Jonathan Brassow	b248ba0a39	mirror: Avoid reading mirrors with failed devices in mirrored log Commit `9fd7ac7d03` did not handle mirrors that contained mirrored logs. This is because the status line of the mirror does not give an indication of the health of the mirrored log, as you can see here: [root@bp-01 lvm2]# dmsetup status vg-lv vg-lv_mlog vg-lv: 0 409600 mirror 2 253:6 253:7 400/400 1 AA 3 disk 253:5 A vg-lv_mlog: 0 8192 mirror 2 253:3 253:4 7/8 1 AD 1 core Thus, the possibility for LVM commands to hang still persists when mirror have mirrored logs. I discovered this while performing some testing that does polling with 'pvs' while doing I/O and killing devices. The 'pvs' managed to get between the mirrored log device failure and the attempt by dmeventd to repair it. The result was a very nasty block in LVM commands that is very difficult to remove - even for someone who knows what is going on. Thus, it is absolutely essential that the log of a mirror be recursively checked for mirror devices which may be failed as well. Despite what the code comment says in the aforementioned commit... + * _mirrored_transient_status(). FIXME: It is unable to handle mirrors + * with mirrored logs because it does not have a way to get the status of + * the mirror that forms the log, which could be blocked. ... it is possible to get the status of the log because the log device major/minor is given to us by the status output of the top-level mirror. We can use that to query the log device for any DM status and see if it is a mirror that needs to be bypassed. This patch does just that and is now able to avoid reading from mirrors that have failed devices in a mirrored log.	2012-10-25 00:42:45 -05:00
Jonathan Brassow	9fd7ac7d03	mirror: Avoid reading from mirrors that have failed devices Addresses: rhbz855398 (Allow VGs to be built on cluster mirrors), and other issues. The LVM code attempts to avoid reading labels from devices that are suspended to try to avoid situations that may cause the commands to block indefinitely. When scanning devices, 'ignore_suspended_devices' can be set so the code (lib/activate/dev_manager.c:device_is_usable()) checks any DM devices it finds and avoids them if they are suspended. The mirror target has an additional mechanism that can cause I/O to be blocked. If a device in a mirror fails, all I/O will be blocked by the kernel until a new table (a linear target or a mirror with replacement devices) is loaded. The mirror indicates that this condition has happened by marking a 'D' for the faulty device in its status output. This condition must also be checked by 'device_is_usable()' to avoid the possibility of blocking LVM commands indefinitely due to an attempt to read the blocked mirror for labels. Until now, mirrors were avoided if the 'ignore_suspended_devices' condition was set. This check seemed to suggest, "if we are concerned about suspended devices, then let's ignore mirrors altogether just in case". This is insufficient and doesn't solve any problems. All devices that are suspended are already avoided if 'ignore_suspended_devices' is set; and if a mirror is blocking because of an error condition, it will block the LVM command regardless of the setting of that variable. Rather than avoiding mirrors whenever 'ignore_suspended_devices' is set, this patch causes mirrors to be avoided whenever they are blocking due to an error. (As mentioned above, the case where a DM device is suspended is already covered.) This solves a number of issues that weren't handled before. For example, pvcreate (or any command that does a pv_read or vg_read, which eventually call device_is_usable()) will be protected from blocked mirrors regardless of how 'ignore_suspended_devices' is set. Additionally, a mirror that is neither suspended nor blocking is /allowed/ to be read regardless of how 'ignore_suspended_devices' is set. (The latter point being the source of the fix for rhbz855398.)	2012-10-23 23:10:33 -05:00
Jonathan Brassow	b873fc54ba	WHATS_NEW: Entry for commit `e191780947` WHATS_NEW commit for 'lvs' output change to add RAID 4/5/6 sync %age to s/Copy%/Cpy%Sync/ output.	2012-10-23 21:38:37 -05:00
Zdenek Kabelac	13fe333b54	clvmd: fix parsing of -d argument clvmd -d option parsing was not working properly. clvmd -d 2 (with space) has been ignored because of '::' used in getopt string, and as failsafe it's been used '1'. Later this debug_arg has been ignored and debug_opt was used instead which happend to have value '1'. Submitted-by: Robert Milasan <rmilasan at suse.com> Reported-by: Robert Milasan <rmilasan at suse.com>	2012-10-19 15:35:56 +02:00
Zdenek Kabelac	5f5a5d1f53	lvchange: support --yes option for --persistent Support using command: lvchange --yes --persistent to skip y\|n prompt.	2012-10-19 15:33:46 +02:00
Zdenek Kabelac	c7c53ad41d	pvcreate: fix leak on error path Missing vg release on error path. Add tests for few more error cases.	2012-10-19 15:32:21 +02:00
Zdenek Kabelac	bf2741376d	Use lv_is_active instead of lv_info() Usage of lv_is_active makes it more obvious what is being checked.	2012-10-17 15:42:31 +02:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Alasdair G Kergon	ea6a8078b4	release: prepare for release	2012-10-15 15:19:32 +01:00
Zdenek Kabelac	b3899056d9	thin: disable conversion of thin-pool to read-only This change is not yet supported.	2012-10-15 14:09:11 +02:00
Zdenek Kabelac	2fc1fc3a93	thin: allow to create read-only thin-volumes Useful for i.e. read-only thin snapshots.	2012-10-15 14:07:03 +02:00
Peter Rajnoha	4dace48f51	Remove pvscan --cache from lvm2-lvmetad init script. This is not needed anymore as the scan is called transparently within the first LVM command that queries lvmetad.	2012-10-15 12:58:23 +02:00
Alasdair G Kergon	78dafcba99	lvmetad: use -l for logging level not -d	2012-10-15 10:44:43 +01:00
Alasdair G Kergon	a0e60d27ff	lvmetad: document and tidy cmdline args Try to bring the lvmetad usage text and man page closer to the code. There seem to be 3 useful ways to use -d with lvmetad at the moment: -d all -d wire -d debug (They can also be comma-separated like -d wire,debug.) Prior to the last release, -d, -dd and -ddd were supported. Fail if an unrecognised debug arg is supplied on the command line. Change -V to report the same version as the lvm binary: previously it just reported version 0.	2012-10-15 02:06:27 +01:00
Zdenek Kabelac	16060b101b	thin: lvextend will fail is autoextend is 0% Since extending by 0% will not increase the size of pool, return failure.	2012-10-14 23:17:30 +02:00
Peter Rajnoha	2679c68689	WHATS_NEW: update	2012-10-12 14:47:40 +02:00
Petr Rockai	141f26035d	Update WHATS_NEW.	2012-10-12 13:24:06 +02:00
Zdenek Kabelac	3058f662cf	thin: prohibit lvcreate --thinpool with mirrors Disable --thinpool to be used with mirror on lvcreate.	2012-10-12 12:21:45 +02:00
Zdenek Kabelac	be291e1064	thin: lvm2api return origin property for thin LV	2012-10-12 12:20:55 +02:00
Alasdair G Kergon	ee3cfa4184	python: Add bindings for liblvm2app. Use configure --enable-python_bindings to generate them. Note that the Makefiles do not yet control the owner or permissions of the two new files on installation.	2012-10-12 02:08:47 +01:00
Zdenek Kabelac	0a46160d94	lvm2api: add defined lvm_percent_to_float Implement function which was somehow missing from it's original placement in the header file lvm2api.h.	2012-10-11 17:29:56 +02:00
Zdenek Kabelac	ca09c9ab4c	thin: support non power of 2 chunk size Support thin chunk size with multiple of 64KiB if user has thin-pool target version at least 1.2.	2012-10-10 21:21:00 +02:00
Jonathan Brassow	3501f17fd0	[lv\|vg]change: Allow limited metadata changes when PVs are missing A while back, the behavior of LVM changed from allowing metadata changes when PVs were missing to not allowing changes. Until recently, this change was tolerated by HA-LVM by forcing a 'vgreduce --removemissing' before trying (again) to add tags to an LV and then activate it. LVM mirroring requires that failed devices are removed anyway, so this was largely harmless. However, RAID LVs do not require devices to be removed from the array in order to be activated. In fact, in an HA-LVM environment this would be very undesirable. Device failures in such an environment can often be transient and it would be much better to restore the device to the array than synchronize an entirely new device. There are two methods that can be used to setup an HA-LVM environment: "clvm" or "tagging". For RAID LVs, "clvm" is out of the question because RAID LVs are not supported in clustered VGs - not even in an exclusively activated manner. That leaves "tagging". HA-LVM uses tagging - coupled with 'volume_list' - to ensure that only one machine can have an LV active at a time. If updates are not allowed when a PV is missing, it is impossible to add or remove tags to allow for activation. This removes one of the most basic functionalities of HA-LVM - site redundancy. If mirroring or RAID is used to replicate the storage in two data centers and one of them goes down, a server and a storage device are lost. When the service fails-over to the alternate site, the VG will be "partial". Unable to add a tag to the VG/LV, the RAID device will be unable to activate. The solution is to allow vgchange and lvchange to alter the LVM metadata for a limited set of options - --[add\|del]tag included. The set of allowable options are ones that do not cause changes to the DM kernel target (like --resync would) or could alter the structure of the LV (like allocation or conversion).	2012-10-10 11:33:10 -05:00
Zdenek Kabelac	cdb7502e54	lvchange: do not start dmevent for resyn If monitoring is disabled in lvm.conf, avoid its starting and preserve DMEVENTD_MONITOR_IGNORE settings internally.	2012-10-09 12:22:26 +02:00
Peter Rajnoha	7a64fff948	systemd: remove ExecStartPost from lvm2-lvmetad.service. The ExecStartPost with pvscan --cache in lvm2-lvmetad.service is not needed now as this is called transparently within the first LVM command that queries lvmetad.	2012-10-08 16:49:54 +02:00
Zdenek Kabelac	ff13206c7e	report: call snapshot percent with cow only Ensure lv_snapshot_percent is used only with snapshot LVs.	2012-10-08 12:16:53 +02:00
Zdenek Kabelac	5b07bd3f91	lvconvert: disable convertion of thin to mirrors For now this convertions is not supported, thus disabled. The only supported conversion for now is to create mirrored thin pools from mirrored devices.	2012-10-08 12:16:53 +02:00
Zdenek Kabelac	1da6c1495a	lvm2api: fix data percent reporting for thin, snap Use same logic for lvm2api as we use lvs reporting. data_percent is meant to be superset for snap_percent.	2012-10-05 10:37:09 +02:00
Jonathan Brassow	9efd3fb604	RAID: Do not allow RAID LVs in a cluster volume group. It would be possible to activate a RAID LV exclusively in a cluster volume group, but for now we do not allow RAID LVs to exist in a clustered volume group at all. This has two components: 1) Do not allow RAID LVs to be created in a clustered VG 2) Do not allow changing a VG from single-machine to clustered if there are RAID LVs present.	2012-10-03 15:52:54 -05:00
Zdenek Kabelac	a27650cc98	thin: lvconvert Update code for lvconvert. Change the lvconvert user interface a bit - now we require 2 specifiers --thinpool takes LV name for data device (and makes the name) --poolmetadata takes LV name for metadata device. Fix type in thin help text -z -> -Z. Supported is also new flag --discards for thinpools.	2012-10-03 15:13:33 +02:00
Zdenek Kabelac	e9f83147d5	thin: lvchange allows to change perms of thin snap Thin snapshots are individual thin volumes so they can have its own control for rw permissions.	2012-10-03 15:13:32 +02:00
Zdenek Kabelac	d442c3ef0c	liblvm: insert layer with subvolume renames Rename also subvolumes if we are inserting _tdata layer. (Currently it breaks mirrors if it would be generic, needs fixing).	2012-10-03 15:13:32 +02:00
Zdenek Kabelac	cf8e1a0093	thin: origin only suspend Skip tree creating when used with origin_only flag.	2012-10-03 15:05:55 +02:00
Zdenek Kabelac	21c401006c	liblvm: add lv_rename_update Support lv_rename without directly updating metatata. It can save some metadata commits in some cases, i.e. when LVs are offline.	2012-10-03 15:03:49 +02:00
Zdenek Kabelac	739092e64a	liblvm2cmd: ensure standard descriptors are ready Check if FDs 0,1,2 are available, and in case they are missing, use /dev/null for them.	2012-10-03 15:02:26 +02:00
Zdenek Kabelac	1f30e048bd	liblvm2cmd: add return code for _close_stray_fds Close fds via /proc/self/fd parsing Return error code if _close_stray_fds fails and quit application if system is in some nonstandard state.	2012-10-03 15:01:23 +02:00
Zdenek Kabelac	98bcfdca83	configure: fix --enable-testing Add missing pkg init for configure --enable-testing.	2012-10-03 14:59:59 +02:00
Jonathan Brassow	886656e4ac	RAID: Fix problems with creating, extending and converting large RAID LVs MD's bitmaps can handle 2^21 regions at most. The RAID code has always used a region_size of 1024 sectors. That means the size of a RAID LV was limited to 1TiB. (The user can adjust the region_size when creating a RAID LV, which can affect the maximum size.) Thus, creating, extending or converting to a RAID LV greater than 1TiB would result in a failure to load the new device-mapper table. Again, the size of the RAID LV is not limited by how much space is allocated for the metadata area, but by the limitations of the MD bitmap. Therefore, we must adjust the 'region_size' to ensure that the number of regions does not exceed the limit. I've added code to do this when extending a RAID LV (which covers 'create' and 'extend' operations) and when up-converting - specifically from linear to RAID1.	2012-09-27 16:51:22 -05:00
Alasdair G Kergon	290ae4791e	lvs: add partial attribute	2012-09-19 12:49:40 +01:00
Alasdair G Kergon	b737ff01e4	discards: skip when removing LVs on missing PVs Don't try to issue discards to a missing PV to avoid segfault. Prevent lvremove from removing LVs that have any part missing. https://bugzilla.redhat.com/857554	2012-09-19 12:48:56 +01:00
Jonathan Brassow	2a6712ddef	RAID1: Clear the LV_NOTSYNCED flag when a RAID1 LV is converted to linear Failing to clear the LV_NOTSYNCED flag when converting a RAID1 LV to linear can result in the flag being present after an upconvert - even if the sync is performed when upconverting.	2012-09-14 16:26:53 -05:00
Jonathan Brassow	116bcb3ea4	RAID1: Like mirrors, do not allow adding images to LV created w/ --nosync Mirrors do not allow upconverting if the LV has been created with --nosync. We will enforce the same rule for RAID1. It isn't hugely critical, since the portions that have been written will be copied over to the new device identically from either of the existing images. However, the unwritten sections may be different, causing the added image to be a hybrid of the existing images. Also, we are disallowing the addition of new images to a RAID1 LV that has not completed the initial sync. This may be different from mirroring, but that is due to the fact that the 'mirror' segment type "stacks" when adding a new image and RAID1 does not. RAID1 will rebuild a newly added image "inline" from the existant images, so they should be in-sync.	2012-09-14 16:12:52 -05:00
Peter Rajnoha	6d75ff138c	systemd: depend on systemd-udev-settle unit in activation unit The "fedora-wait-storage.service" that the "lvm2-activation.service" had as a dependency (which was fedora-specific solution anyway) is obsolete now as this unit called "modprobe scsi_wait_scan" which is not used anymore. The "fedora-wait-storage.service" had "systemd-udev-settle" as its dependency, so let's depend on this one directly now, bypassing the out-dated "fedora-wait-storage.service".	2012-09-12 11:30:13 +02:00
Peter Rajnoha	3127160626	vgchange: fix -aay to activate proper volumes Using 'activation/auto_activation_volume_list = [ "vg/lvol1" ]'. Before this patch: 3 logical volume(s) in volume group "vg" now active LV VG Attr LSize Pool Origin Data% Move Log Copy% Convert lvol0 vg -wi----- 4.00m lvol1 vg -wi-a--- 4.00m lvol2 vg -wi-a--- 4.00m lvol3 vg -wi-a--- 4.00m (vg/lvol1 activated as it passes the list and all subsequent volumes too - wrong!) With this patch: 1 logical volume(s) in volume group "vg" now active LV VG Attr LSize Pool Origin Data% Move Log Copy% Convert lvol0 vg -wi----- 4.00m lvol1 vg -wi-a--- 4.00m lvol2 vg -wi----- 4.00m lvol3 vg -wi----- 4.00m (only vg/lvol1 activated as it passes the list and no other - correct!)	2012-09-12 09:47:40 +02:00
Jonathan Brassow	4ededc698f	RAID: Properly handle resync of RAID LVs Issuing a 'lvchange --resync <VG>/<RAID_LV>' had no effect. This is because the code to handle RAID LVs was not present. This patch adds the code that will clear the metadata areas of RAID LVs - causing them to resync upon activation.	2012-09-11 13:09:35 -05:00
Jonathan Brassow	05131f5853	cleanup: Reduce indentation by short-circuiting function By changing the conditional for resyncing mirrors with core-logs a bit, we can short-circuit the rest of the function for that case and reduce the amount of indenting in the rest of the function. This cleanup will simplify future patches aimed at properly handling the resync of RAID LVs.	2012-09-11 12:55:17 -05:00
Jonathan Brassow	b49b98d50c	RAID: '--test' should not cause a valid create command to fail It is necessary when creating a RAID LV to clear the new metadata areas. Failure to do so could result in a prepopulated bitmap that would cause the new array to skip syncing portions of the array. It is a requirement that the metadata LVs be activated and cleared in the process of creating. However in test mode, this requirement should be lifted - no new LVs should be created or written to.	2012-09-05 14:32:06 -05:00
Zdenek Kabelac	e52d316751	lvm2api: extend lvm2api with lvm_lv_rename Add support for LV rename.	2012-08-27 13:02:42 +02:00
Alasdair G Kergon	92330ba9c8	setvbuf: close and reopen stream before change Fix setvbuf code by closing and reopening stream before changing buffer. But we need to review what this code is doing embedded inside a library function rather than the simpler original form being run independently at the top of main() by tools that need it.	2012-08-26 00:19:52 +01:00
Alasdair G Kergon	3acc85caa8	buffering: use unbuffered silent mode for liblvm Disable private buffering when using liblvm. When private stdin/stdout buffering is not used always use silent mode.	2012-08-26 00:15:45 +01:00
Alasdair G Kergon	438e0050df	config: add silent mode Accept -q as the short form of --quiet. Suppress non-essential standard output if -q is given twice. Treat log/silent in lvm.conf as equivalent to -qq. Review all log_print messages and change some to log_print_unless_silent. When silent, the following commands still produce output: dumpconfig, lvdisplay, lvmdiskscan, lvs, pvck, pvdisplay, pvs, version, vgcfgrestore -l, vgdisplay, vgs. [Needs checking.] Non-essential messages are shifted from log level 4 to log level 5 for syslog and lvm2_log_fn purposes.	2012-08-25 20:35:48 +01:00
Jonathan Brassow	4047e4dfb1	RAID: Add support for RAID10 This patch adds support for RAID10. It is not the default at this stage. The user needs to specify '--type raid10' if they would like RAID10 instead of stacked mirror over stripe.	2012-08-24 15:34:19 -05:00
Zdenek Kabelac	57c0f72b1d	lvconvert: use _reload_lv on more places Use common subroutine.	2012-08-23 14:38:45 +02:00
Zdenek Kabelac	ed53b4b674	lvmetad: do not deref NULL pointer Call log only for req.cft != NULL.	2012-08-23 14:34:54 +02:00
Zdenek Kabelac	8edc0e450d	lvmetad: fix memleaks Release allocated buffers in daemon_logf, daemon_log_parse	2012-08-23 14:33:23 +02:00
Zdenek Kabelac	54c24193f5	thin: lvcreate --discards	2012-08-09 16:25:52 +02:00
Zdenek Kabelac	80bf4eb035	thin: fix man page for lvs Renamed discard -> discards	2012-08-09 16:25:25 +02:00
Zdenek Kabelac	b8a6efbcc0	thin: fix condition for kernels without discards Report warning if the kernel is not support given discards settings. (In this case the behavior is equal to IGNORE.)	2012-08-09 16:24:42 +02:00
Zdenek Kabelac	1f1c664b78	thin: default discards for old mda is IGNORE If the discard was not set in metadata, use IGNORE, as this is the equivalent behavior for this case.	2012-08-09 16:23:32 +02:00
Jonathan Brassow	de3b1c4506	RAID: Improve RAID argument handling. Disallow '-m' for RAID types that have no mirror component and disallow '-i' argument for RAID types that have no stripe component.	2012-08-08 12:32:27 -05:00
Alasdair G Kergon	701b4a8363	thin: use discards as plural rather than singular Global change from --discard to --discards, as that feels more natural.	2012-08-07 21:24:41 +01:00
Alasdair G Kergon	df452b47a1	release: update version/WHATS_NEW	2012-08-07 20:41:45 +01:00
Alasdair G Kergon	016997acaf	man: document allocation process in lvm.8	2012-08-07 02:06:42 +01:00
Peter Rajnoha	6e55201144	args: increase major:minor limit to 4095:1048575 Remove the limit for major and minor number arguments used while specifying persistent numbers via -My --major <major> --minor <minor> option which was set to 255 before. Follow the kernel limit instead which is 12 bits for major and 20 bits for minor number (kernel >= 2.6 and LVM formats that does not have FMT_RESTRICTED_LVIDS - so still keep the old limit of 255 for lvm1 format).	2012-08-06 18:01:01 +02:00
Peter Rajnoha	fa68466e90	systemd: integrate lvm2 activation generator with conf+make	2012-07-31 16:46:24 +02:00
Peter Rajnoha	f64f22e2d6	lvm2app: add lvm_config_find_bool function To effectively retrieve the setting of anything that could be enabled or disabled.	2012-07-31 16:18:01 +02:00
Petr Rockai	6997943f22	lvmetad: Implement --test (fixes #832033 ).	2012-07-30 11:19:02 +02:00
Alasdair G Kergon	4dbf872a9f	reports: invalid snaps do not capitalise lv_attr No longer capitalise first LV attribute char for invalid snapshots. This state is available from the 5th char now (I or S).	2012-07-27 20:19:28 +01:00
Jonathan Brassow	186a2772e8	vgextend: Allow PVs to be added to VGs that have PVs missing Allowing people to add devices to a VG that has PVs missing helps people avoid the inability to repair RAID LVs in certain cases. For example, if a user creates a RAID 4/5/6 LV using all of the available devices in a VG, there will be no spare devices to repair the LV with if a device should fail. Further, because the VG is missing a device, new devices cannot be added to allow the repair. If 'vgreduce --removemissing' were attempted, the "MISSING" PV could not be removed without also destroying the RAID LV. Allowing vgextend to operate solves the circular dependency. When the PV is added by a vgextend operation, the sequence number is incremented and the 'MISSING' flag is put on the PVs which are missing.	2012-07-26 17:06:06 -05:00
Alasdair G Kergon	7803756e97	filters: Add Micron PCIe SSDs (mtip32xx) Recognise Micron PCIe SSDs in filter and move array out to device-types.h.	2012-07-26 02:26:40 +01:00
Jonathan Brassow	1b60789020	Forgot to update WHATS_NEW for commit `5555d2a000`	2012-07-24 22:28:23 -05:00
Peter Rajnoha	5e36b86c46	config: fix one-node dumpconfig, add dm_config_write_one_node A regression introduced in 2.02.89 (`11e520256b`) caused the lvm dumpconfig <node> to print out the node as well as its subsequent siblings. The information about "only_one" mode got lost. Before this patch (just an example node): # lvm dumpconfig global/use_lvmetad use_lvmetad=1 thin_check_executable="/usr/sbin/thin_check" thin_check_options="-q" (...all nodes to the end of the section) With this patch applied: # lvm dumpconfig global/use_lvmetad use_lvmetad=1	2012-07-20 15:53:04 +02:00
Peter Rajnoha	8d5ae472e5	daemon-server: fix error message on daemon shutdown If a daemon (like lvmetad that is using common daemon-server code) received a kill signal that was supposed to shut the daemon down, a spurious message was issued: "Failed to handle a client connection". This happened if the kill signal came just in the middle of waiting for a client request in "select" - the request that was supposed to be handled was blank at that moment of course.	2012-07-19 16:45:08 +02:00
Zdenek Kabelac	48367c5be9	thin: add lvchange for discard and zero change Update lvchange to allow change of 'zero' flag for thinpool. Add support for changing discard handling. N.B. from/to ignore could be only changed for inactive pool.	2012-07-18 14:38:34 +02:00
Zdenek Kabelac	46b9cc1248	thin: add reporting of discard for thin pool New field "discard" is added for lvs reporting of lv segment. Reported as one character: (i)gnore (n)opassdown (p)assdown lvs -o+discard	2012-07-18 14:37:44 +02:00
Zdenek Kabelac	ebbf7d8e68	thin: add discard support for thin pool Add arg support for discard. Add discard ignore, nopassdown, passdown (=default) support. Flags could be set per pool. lvcreate [--discard {ignore\|no_passdown\|passdown}] vg/thinlv	2012-07-18 14:36:57 +02:00
Zdenek Kabelac	260e8f2476	thin: detect supported features from thinp target Add shell variable to override reported min version for testing: LVM_THIN_VERSION_MIN	2012-07-18 14:35:17 +02:00
Peter Rajnoha	07e4ac7b00	lvconvert: count % upwards when merging a snapshot Before: # lvconvert --merge -i 1 vg/lvol1 Merging of volume lvol1 started. lvol0: Merged: 36.7% lvol0: Merged: 21.3% lvol0: Merged: 5.8% lvol0: Merged: 0.0% Merge of snapshot into logical volume lvol0 has finished. Logical volume "lvol1" successfully removed After: # lvconvert --merge -i 1 vg/lvol1 Merging of volume lvol1 started. lvol0: Merged: 61.4% lvol0: Merged: 73.0% lvol0: Merged: 88.4% lvol0: Merged: 100.0% Merge of snapshot into logical volume lvol0 has finished. Logical volume "lvol1" successfully removed	2012-07-10 15:30:18 +02:00
Peter Rajnoha	cd8ea8b437	activate: skip manual activation for --sysinit -aay When --sysinit -a ay is used with vg/lvchange and lvmetad is up and running, we should skip manual activation as that would be a useless step - all volumes are autoactivated once all the PVs for a VG are present. If lvmetad is not active at the time of the vgchange --sysinit -a ay call, the activation proceeds in standard 'manual' way. This way, we can still have vg/lvchange --sysinit -a ay called unconditionally in system initialization scripts no matter if lvmetad is used or not.	2012-07-10 14:01:33 +02:00
Jonathan Brassow	8767435ef8	RAID: Fix extending size of RAID 4/5/6 logical volumes. Reducing a RAID 4/5/6 LV or extending it with a different number of stripes is still not implemented. This patch covers the "simple" case where the LV is extended with the same number of stripes as the orginal.	2012-06-26 09:44:54 -05:00
Alasdair G Kergon	1d0a2b919f	toollib: fix ignored_mdas process_each_pv rescan In process_each_pv() if we haven't yet scanned and the PV appears to be an orphan, we must scan the other PVs looking for mdas that reference it to find out what VG it is in. 1. If the PV has no mdas, we must scan. 2. If the PV has an mda that is not ignored we do not need to scan. 3. If the PV has an mda that is ignored, we do need to scan. This patch fixes case 3. > pvs -o +mda_count,vg_mda_count /dev/loop[0123] PV VG Fmt Attr PSize PFree #PMda #VMda /dev/loop0 vg3 lvm2 a- 96.00m 96.00m 0 1 /dev/loop1 vg3 lvm2 a- 96.00m 96.00m 1 1 /dev/loop2 vg2 lvm2 a- 96.00m 96.00m 1 2 /dev/loop3 vg2 lvm2 a- 28.00m 28.00m 1 2 Before: > pvs /dev/loop2 /dev/loop3 /dev/loop0 /dev/loop1 --unbuffered PV VG Fmt Attr PSize PFree /dev/loop2 lvm2 a-- 100.00m 100.00m /dev/loop3 vg2 lvm2 a-- 28.00m 28.00m /dev/loop0 lvm2 a-- 100.00m 100.00m /dev/loop1 vg3 lvm2 a-- 96.00m 96.00m After: > pvs /dev/loop2 /dev/loop3 /dev/loop0 /dev/loop1 --unbuffered PV VG Fmt Attr PSize PFree /dev/loop2 vg2 lvm2 a-- 96.00m 96.00m /dev/loop3 vg2 lvm2 a-- 28.00m 28.00m /dev/loop0 vg3 lvm2 a-- 96.00m 96.00m /dev/loop1 vg3 lvm2 a-- 96.00m 96.00m	2012-06-29 21:22:09 +01:00
Peter Rajnoha	a54285a715	man: --activate ay and auto_activation_volume_list	2012-06-29 12:40:26 +02:00
Alasdair G Kergon	2cec4b4a77	alloc: fix raid --alloc anywhere double allocs If _alloc_parallel_area for raid devices chooses an area already used up, it doesn't notice that it has no space left in it and leaves later code trying to place a zero-length area into the LV. https://bugzilla.redhat.com/832596	2012-06-28 23:26:42 +01:00
Alasdair G Kergon	2f201d0e5e	WHATS_NEW: update Update WHATS_NEW.	2012-06-28 23:14:27 +01:00
Peter Rajnoha	2d5adc5823	initscript: call vgchange -aay instead of -aly The clmvd init script called "vgchange -aly" before to activate all VGs in cluster environment. This activated all VGs, no matter if it was clustered or not. Auto activation for clustered VGs is not supported yet so the behaviour of -aay is still the same as before for clustered VGs. However, for non-clustered VGs, we need to check with the activation/auto_activation_volume_list whether the VG/LV should be activated on boot or not.	2012-06-28 09:44:07 -04:00
Peter Rajnoha	f6a3ef4490	conf: add activation/auto_activation_volume_list	2012-06-28 09:44:07 -04:00
Peter Rajnoha	a2f4ccd839	lvcreate: add --activate ay (autoactivate) One can use "lvcreate --aay" to have the newly created volume activated or not activated based on the activation/auto_activation_volume_list this way. Note: -Z/--zero is not compatible with -aay, zeroing is not used in this case! When using lvcreate -aay, a default warning message is also issued that zeroing is not done.	2012-06-28 09:44:07 -04:00
Peter Rajnoha	c9b9077b44	lvchange: add --activate ay (autoactivate) The same as for vgchange...	2012-06-28 09:44:07 -04:00
Peter Rajnoha	d2df8dddc8	pvscan: add --activate ay option (autoactivate) Define auto_activation_handler that activates VGs/LVs automatically based on the activation/auto_activation_volume_list (activating all volumes by default if the list is not defined). The autoactivation is done within the pvscan call in 69-dm-lvmetad.rules that watches for udev events (device appearance/removal). For now, this works for non-clustered and complete VGs only.	2012-06-28 09:44:03 -04:00
Peter Rajnoha	215a314f19	vgchange: add --activate ay option (autoactivate) Normally, the 'vgchange -ay' activates all volume groups (that pass the activation/volume_list filter if set). This call can appear in two scenarios: - system boot (so activation within a script in general) - manual call on command line (so activaton on user's direct request) For the former one, we would like to select which VGs should be actually activated. One can define the list of VGs directly to do that. But that would require the same list to be provided in all the scripts. The 'vgchange -aay' will check for the activation/auto_activation_volume_list in adition and it will activate only those VGs/LVs that pass this filter (assuming all to be activated if the list is not defined - the same logic we already have for activation/volume_list). Init/boot scripts should use this form of activation primarily (which, anyway, becomes only a fallback now with autoactivation done on PV appearance in tandem with lvmetad in place).	2012-06-28 09:42:48 -04:00
Peter Rajnoha	95ced7a7be	activate: add autoactivation hooks Define an 'activation_handler' that gets called automatically on PV appearance/disappearance while processing the lvmetad_pv_found and lvmetad_pv_gone functions that are supposed to update the lvmetad state based on PV availability state. For now, the actual support is for PV appearance only, leaving room for PV disappearance support as well (which is a more complex problem to solve as this needs to count with possible device stack). Add a new activation change mode - CHANGE_AAY exposed as '--activate ay/-aay' argument ('activate automatically'). Factor out the vgchange activation functionality for use in other tools (like pvscan...).	2012-06-28 09:42:47 -04:00
Peter Rajnoha	2729720fd3	args: add --activate synonym for --available arg We're refererring to 'activation' all over the code and we're talking about 'LVs being activated' all the time so let's use 'activation/activate' everywhere for clarity and consistency (still providing the old 'available' keyword as a synonym for backward compatibility with existing environments).	2012-06-28 09:42:44 -04:00
Alasdair G Kergon	07a25c249b	discards: don't discard reconfigured extents Update release_lv_segment_area not to discard any PV extents, as it also gets used when moving extents between LVs. Instead, call a new function release_and_discard_lv_segment_area() in the two places where data should be discarded - lv_reduce() and remove_mirrors_from_segments().	2012-06-27 22:12:01 +01:00
Alasdair G Kergon	a5ddb347e5	allocation: allow release_lv_segment_area to fail Allow release_lv_segment_area to fail as functions it calls can fail.	2012-06-27 22:11:49 +01:00
Peter Rajnoha	c8591b2ac7	dev-io: open device read-only to obtain readahead value There's no need to have the device open RW while obtaining the readahead value. The RW open used before caused the CHANGE udev event to be generated if the WATCH udev rule was set for the underlying device (and that is normally the case both for non-dm and dm devices by default). This did not cause any problems before since we were not interested in underlying devices. However, with upcoming changes (autoactivation), we're watching for events on underlying devices marked as PVs and such a spurious event could cause the autoactivation code to be triggered. So when trying to deactivate the volume, we could end up with immediate activation just after that because of the CHANGE event originated in the WATCH udev rule since the underlying device was open RW during the deactivation process. Though maybe a better solution would be to completely filter such spurious events out of the autoactivation process somehow, it's still useful if there are as least spurious events generated as possible in the system itself.	2012-06-25 11:55:37 +02:00
Zdenek Kabelac	6bd3864b41	thin: fix lvconvert error path NULL dereference For printing the name, use given command line parameter.	2012-06-22 13:50:21 +02:00
Zdenek Kabelac	e9f9c6be26	lvmetad: check for fid existance Fail to update lvmetad with proper log error message.	2012-06-22 13:50:21 +02:00
Zdenek Kabelac	192fa11dab	fix: use 64bit math for reserved memory If the user specifies number in the range of [4G/1024, 4G>, the used value would wrap around (32bit math). So keep the math 64bit. Note, using such large lvm.conf values is pointless with lvm2.	2012-06-22 13:32:19 +02:00
Zdenek Kabelac	461eb1ac6a	cmirrord: add missing checks for kernel_send Log errors if kernel_send fails.	2012-06-20 14:48:26 +02:00
Zdenek Kabelac	865b9d3701	cmirrord: fix cut&paste	2012-06-20 14:41:57 +02:00
Zdenek Kabelac	fb4584b83d	cmirrord: add test for closedir() and close()	2012-06-20 14:40:39 +02:00
Zdenek Kabelac	2f99e5e35a	Sync filesystem for thin snapshots Add missing lockfs option when suspend origin, before thin volume snapshot is created	2012-06-15 14:43:07 +02:00
Alasdair G Kergon	d41ad502b8	release: post-release update version	2012-06-09 00:44:01 +01:00
Alasdair G Kergon	8dade001b8	release: WHATS_NEW tidy	2012-06-09 00:29:20 +01:00
Alasdair G Kergon	d459f6b32a	Edit WHATS_NEW.	2012-06-07 15:14:19 +01:00
Alasdair G Kergon	64a3ac8f51	Upstream source repo move to fedorahosted.org git. Change version number suffix from -cvs to -git.	2012-06-06 13:26:46 +01:00
Zdenek Kabelac	8cdb78d0dd	Fix error path Do not increase nr_filt in case of NULL ret value, since the error path doesn't handle NULL pointers.	2012-05-23 13:02:36 +00:00
Zdenek Kabelac	ec50952652	ok - that was nice mid-air collision	2012-05-16 13:09:09 +00:00
Zdenek Kabelac	6fa1d69804	update	2012-05-16 13:06:18 +00:00
Alasdair Kergon	56d49cbf13	Re-enable partial activation of non-thin LVs until it can be fixed. (2.02.90) - The test should be checking the LV as a whole, not just individual segments.	2012-05-16 12:50:14 +00:00
Alasdair Kergon	e0ed1b458d	Warn of deadlock risk when using snapshots of mirror segment type.	2012-05-14 16:18:57 +00:00
Alasdair Kergon	8b59522d67	Fix cling policy not to behave like normal policy if no previous LV seg. Fix alloc cling to cling to PVs already found with contiguous policy.	2012-05-11 22:53:13 +00:00
Alasdair Kergon	8a689fd04d	Fix allocation policy loop so it doesn't continue beyond cling using later policies it shouldn't be using when --alloc cling is specified but no tags are defined.	2012-05-11 22:19:12 +00:00
Alasdair Kergon	01cfbe14f1	Append _TO_LVSEG to names of internal A_CONTIGUOUS and A_CLING flags. Remove some unnecesary prev_lvseg checks.	2012-05-11 18:59:01 +00:00
Zdenek Kabelac	767ce95a11	Add missing pkg init	2012-05-10 08:54:33 +00:00
Peter Rajnoha	9c17acdfe8	Fix division by zero if PV with zero PE count is used during vgcfgrestore.	2012-05-09 12:30:56 +00:00
Zdenek Kabelac	0a9f894ff0	Initial support for lvconvert for thin pool volumes. Support has many limitations and lots of FIXMEs inside, however it makes initial task when user creates a separate LV for thin pool data and thin metadata already usable, so let's enable it for testing. Easiest API: lvconvert --chunksize XX --thinpool data_lv metadata_lv More functionality extensions will follow up. TODO: Code needs some rework since a lot of same code is getting copied.	2012-05-09 12:17:06 +00:00
Zdenek Kabelac	98f2e3d974	Fix regression in for_each_sub_lv pool_lv is not a sub lv in terms for this function. It has caused problem with renaming thin_volume, where it has tried to rename pool LV as well.	2012-05-09 12:12:21 +00:00
Jonathan Earl Brassow	eb2d70293d	Fix up-convert when mirror activation is controled by volume_list and tags. When mirrors are up-converted, a transient mirror layer is put in so that only the new devices are sync'ed. That transient layer must carry the tags of the original mirror LV, otherwise it will fail to activate when activation is regulated by lvm.conf:activation/volume_list. The conversion would then fail. The fix is to do exactly the same thing that is being done for linear -> mirror converting (lib/metadata/mirror.c:_init_mirror_log()). We copy the tags temporarily for the new LV and remove them after the activation.	2012-05-05 02:08:46 +00:00
Jonathan Earl Brassow	1e4e9548b1	Disallow snapshots of mirror segment types. Snapshots of RAID logical volumes are allowed (including "raid1"). However, snapshots of "mirror" logical volumes has been disallowed due to unsolvable issues inherent to the design. The fact that mirroring (dm-raid1.c) must stop all I/O as the result of a failure and wait for userspace intervention can lead to a circular dependency if userspace is simultaneously waiting for snapshots (on mirrors) to make an I/O update before proceeding. Various snapshot on mirror tests have been removed as a result.	2012-05-01 19:21:24 +00:00
Jonathan Earl Brassow	e5b9338ada	Fix bug in cmirror that caused incorrect status info to print on some nodes. Looking at the code in cmirrord/local.c, we can see the various different request types handled in different ways. Some information that is non-changing does not need to go around the cluster and can be short-circuited. For example, once the cluster mirror is in-sync, it is pointless to continue sending that query around the cluster. We can save network bandwidth and reply directly back to the kernel. When it comes to status information, there are two types 'TABLE' and 'INFO'. The 'TABLE' information never changes and belongs to the group of requests that can be safely short-circuited. The 'STATUS' information can change - and will change if a device fails. Thus it cannot be short-circuited, but this is exactly what was found. The 'STATUS' information request was being short-circuited and therefore never reporting the failure condition to anyone other than the "server" that experienced it directly.	2012-04-26 17:30:49 +00:00
Alasdair Kergon	34fbbfe34e	Remove statement that snapshots cannot be tagged from lvm man page.	2012-04-26 15:24:46 +00:00
Jonathan Earl Brassow	ac6e1e3e8d	Disallow changing cluster attribute of VG while RAID LVs are active. Mirror and snapshot LVs are already checked for when switching the cluster attribute of a VG. This patch adds RAID.	2012-04-25 13:38:41 +00:00
Peter Rajnoha	c70037445a	WHATS_NEW	2012-04-25 09:32:36 +00:00
Jonathan Earl Brassow	dfd024d3a8	Allow a subset of failed devices to be replaced in RAID LVs. If two devices in an array failed, it was previously impossible to replace just one of them. This patch allows for the replacement of some, but perhaps not all, failed devices.	2012-04-24 20:05:31 +00:00
Jonathan Earl Brassow	2bfb3e519a	Prevent resume from creating error devices that already exist from suspend. Thanks to agk for providing the patch that prevents resume from attempting (and then failing) to create error devices which already exist; having been created by a corresponding suspend operation.	2012-04-24 20:00:03 +00:00
Zdenek Kabelac	8262a3f6ca	Update singlenode locking Support lock conversion Work also with LCK_READ TODO: do more validation.	2012-04-24 12:16:40 +00:00
Zdenek Kabelac	a8f352fd56	Update some lvs column names Fix thin_pool -> pool_lv Add more fields supported by thin provisioning. Keep fields alphabetically sorted for easier lookup.	2012-04-24 12:13:29 +00:00
Alasdair Kergon	067184f32d	Handle replacement of an active device that goes missing with an error device. (E.g. lvchange --refresh --partial on striped LV if a PV disappeared.)	2012-04-24 00:51:26 +00:00
Jonathan Earl Brassow	c62f9f0b2f	Unlike 'mirror' segtype, 'raid1' should perform flush on suspend. The 'mirror' segtype and 'raid1' segtype both set the 'MIRRORED' flag. However, due to differences in the way these device-mapper targets behave 'mirror' must be suspended with the 'noflush' option and 'raid1' does not have to be. This patch ensures that when the 'MIRRORED' flag is checked to see if 'noflush' is needed that it does not also set it for 'raid1' by mistake.	2012-04-20 14:17:44 +00:00
Peter Rajnoha	973cfb19b7	Add udev info and context to lvmdump. --------------------------------------------------------------------	2012-04-18 15:26:02 +00:00
Jonathan Earl Brassow	a7feae8a6e	Fix code that performs RAID device replacement while under snapshot. The code should have been calling [suspend\|resume]_lv_origin() rather than [suspend\|resume]_lv. This addresses bug 807069.	2012-04-12 03:16:37 +00:00
Jonathan Earl Brassow	187486c7bb	Fix inability to split RAID1 image while specifying a particular PV. The logic for resuming the original and newly split LVs was not properly done to handle situations where anything but the last device in the array was split. It did not take into account the possible name collisions that might occur when the original LV undergoes the shifting and renaming of its sub-LVs.	2012-04-11 14:20:19 +00:00
Zdenek Kabelac	c63b155d16	Update man pages Use one style for man pages.	2012-04-11 12:42:10 +00:00
Zdenek Kabelac	5dc27b75eb	Fix lvresize for thin pool When resizing thin pool - we need to use strip info from _tdata volume. In future more generic solution will be necessary once we start to support lvconvert (resize of stacked devices and stay properly aligned). For now we just allow striped or linear LV so this code will work.	2012-04-11 12:40:03 +00:00
Zdenek Kabelac	6fc1f948c2	Lvresize rounds upward When given lvresize new size - round upward for stripes - unless we use % and we are at the border of free extents. This patch is not a complete fix and few more cases will need special care.	2012-04-11 12:36:37 +00:00
Zdenek Kabelac	c6f3701a71	Support rounding downward for lvcreate and % If specifying size with % and we are reaching number of free extents - round downward with stripes.	2012-04-11 12:33:34 +00:00
Peter Rajnoha	30bd294fc6	Change message severity to log_very_verbose for missing dev info in udev db. Libudev does not provide transactions when querying udev database - once we get the list of block devices (devices/obtain_device_list_from_udev=1) and we iterate over the list to get more detailed information about device node and symlink names used etc., the device could be removed just in between we get the list and put a query for more info. In this case, libudev returns NULL value as the device does not exist anymore. Recently, we've added a warning message to reveal such situations. However, this could be misleading if the device is not related to the LVM action we're just processing - the non-related block device could be removed in parallel and this is not an error but a possible and normal operation. (N.B. This "missing info" should not happen when devices are related to the LVM action we're just processing since all such processing should be synchronized with udev and the udev db must always be in consistent state after the sync point. But we can't filter this situation out from others, non-related devices, so we have to lower the message verbosity here for a general solution.)	2012-04-11 09:12:02 +00:00
Jonathan Earl Brassow	c0b5886f18	RAID LVs could not handle a down-convert if a device other than the last one in the array was specified for removal. This change addresses that (bz806111).	2012-04-11 01:23:29 +00:00
Jonathan Earl Brassow	bad8b5848f	Commit ID `46a75dedb4` consolidated code from the various dmeventd plug-ins into a new function called 'dmeventd_lvm2_command', but the new function did not strip off the "_mlog" extentions that the mirror plug-in had been doing. This created bug 794904 - failure to replace devices in a redundant log. The test suite did catch this scenario because it performs repair tests (mainly) through the CLI and not dmeventd. It's also not easy to test because the test itself will hang if the bug is encountered.	2012-04-10 23:34:41 +00:00
Zdenek Kabelac	6e826bb6a4	Fix unlocking in error path of vgreduce When vg_read fails, it internally unlocks VG if it's been locked, so in error path we should skip unlock_vg for this case. (user would see ugly internal warning)	2012-03-30 14:59:35 +00:00
Peter Rajnoha	ebd9225245	WHATS_NEW	2012-03-30 11:39:52 +00:00
Peter Rajnoha	543eaed88c	Detect VG name being part of the LV name in lvconvert --splitmirrors -n. Before: devel/~ # lvconvert --splitmirrors 1 -n vg/splitted_one vg/mirrored_one Internal error: LV name vg/splitted_one has invalid form. Intermediate VG metadata write failed. After: devel/~ # lvconvert --splitmirrors 1 -n vg/splitted_one vg/mirrored_one Logical volume mirrored_one converted. devel/~ # lvconvert --splitmirrors 1 -n abc/splitted_one vg/mirrored_one Please use a single volume group name ("vg" or "abc") Run `lvconvert --help' for more information.	2012-03-30 08:58:02 +00:00
Milan Broz	46e9aac160	Fix exclusive lvmchange -aey to fail if volume is active on different node. Activation on remote node should be tried only if it is masked by tags locally (like when hosttags enabled, IOW activate_lv_excl_local() doesn't return error.) Introduced change caused that lvchange -aey succeeded even if volume was activated exclusively remotely.	2012-03-27 15:53:45 +00:00
Peter Rajnoha	3be9089cd3	Add 'vgscan --cache' functionality for consistency with 'pvscan --cache'. Calling vgscan alone should reuse information from the lvmetad (if running). The --cache option should initiate direct device scan and update lvmetad appropriately (if running). This is mainly for vgscan to behave consistently compared to pvscan.	2012-03-27 11:04:46 +00:00
Milan Broz	ddb31b62e5	Keep exclusive activation in pvmove if LV is already active. Pvmove should never try to downgrade exclusive lock for LVs. This allows pvmove to work again for exclusive activated LVs.	2012-03-26 20:33:40 +00:00
Milan Broz	dcd90bc501	Do not allow pvmove if some affected LVs are activated locally or on more nodes while others are activated exclusively. Current pvmove code can either use local mirror (for exclusive activation) or cmirror (for clustered LVs). Because the whole intenal pvmove LV is just segmented LV containing segments of several top-level LVs, code cannot properly handle situation if some segment need to be activated exclusively. Previously, it wrongly activated exclusive LV on all nodes (locing code allowed it) but now this is no lnger possible. If there is exclusively activated LV, pvmove is only possible if all affected LVs are aslo activated exclusively. (Note that in non-exclusive mode pvmove still activates LVs on other nodes during move.) # lvchange -aly vg_test/lv1 # lvchange -aey vg_test/lv2 # pvmove -i 1 /dev/sdc Error locking on node bar-01: Device or resource busy Error locking on node bar-03: Volume is busy on another node ... Failed to activate lv2	2012-03-26 20:32:58 +00:00
Milan Broz	62a40438ab	Remove unused and wrongly set cluster VG flag from clvmd lock query command.	2012-03-26 20:29:45 +00:00
Milan Broz	7076d1439b	Fix pvmove if LV is activated exclusively but cmirror is not running. In this case we should allow to use local mirror, check for cmirror should apply only for lvconvert/lvcreate. Introduced in 2.02.86 by removing !(lv->status & ACTIVATE_EXCL). (Partially workaround, it is minimalistic patch for now.)	2012-03-23 16:28:40 +00:00
Zdenek Kabelac	0fc9a3dce3	Always free hash table also in error path	2012-03-23 10:33:26 +00:00
Zdenek Kabelac	2caa558e7c	Update and fix monitoring of thin pool devices Code adds better support for monitoring of thin pool devices. update_pool_lv uses DMEVENTD_MONITOR_IGNORE to not manipulate with monitoring. vgchange & lvchange are checking real thin pool device for existance as we are using _tpool real device and visible LV pool device might not be even active (_tpool is activated implicitely for any thin volume). monitor_dev_for_events is another _lv_postorder like code it might be worth to think about reusing it here - for now update the code to properly monitory thin volume deps. For unmonitoring add extra code to check the usage of thin pool - in case it's in use unmonitoring of thin volume is skipped.	2012-03-23 09:58:04 +00:00
Zdenek Kabelac	5da4d94adc	Return mem fail if hash insert fails	2012-03-23 09:48:17 +00:00
Zdenek Kabelac	fbd89d3a1a	Fix typo in config option check	2012-03-23 09:42:36 +00:00
Zdenek Kabelac	1d6a2c7326	Update lcov target	2012-03-23 09:39:03 +00:00
Zdenek Kabelac	0b17a75f13	Fix regression in thin monitoring Patch https://www.redhat.com/archives/lvm-devel/2012-February/msg00118.html removed initilization of thin volume monitoring, leaving it only for thin pool - but missed the code move part for monitoring of thin pools. Effectively making thin pools not monitorable.	2012-03-20 17:42:19 +00:00
Zdenek Kabelac	37672e676d	Support improperly formated device numbers There are kernel drivers (smblk) which set '-1' as their device major number. This number is listed in /proc/devices then - but the kernel itself is using just 12 bits - thus device is accessible via 4095 - there is posted patch for 3.4 to fix this behavior (0 for auto allocation was mean to be used). However to still allow using such devices with older kernels add some code to use same behavior - so cut 12 bits from the major number from /proc/devices. For now use log_warn() - maybe the severity of the message could be lowered to just verbose level.	2012-03-20 10:47:02 +00:00
Zdenek Kabelac	a9382908ae	Fix string parsing Fix propagation of -e option - pass it via internal shell variable. Fix parsing of /proc/mounts files (don't check for substrings). as reported by O.Mangold with suggested patch: https://www.redhat.com/archives/linux-lvm/2012-February/msg00030.html Properly pass arguments with spaces ("$@") Add validation for YES and EXTOFF variable content.	2012-03-16 12:53:05 +00:00
Petr Rockai	f1d117f9f9	It's new.	2012-03-16 10:46:25 +00:00
Jonathan Earl Brassow	dc7b1640ed	Fix name conflicts that prevent down-converting RAID1 when specifying a device When down-converting a RAID1 device, it is the last device that is extracted and removed when the user does not specify a particular device. However, when a device is specified (and it is not the last), the device is removed and the remaining sub-LVs are "shifted down" to fill the hole. This cause problems when resuming the LV because if the shifted devices were resumed (and thus renamed) before the sub-LV being extracted, there would be a name conflict. The solution is to resume the extracted sub-LVs first so that they can be properly renamed preventing a possible conflict. This addresses bug 801967.	2012-03-15 20:00:54 +00:00
Zdenek Kabelac	e866931169	Improve thin_check option passing Update a way we handle option passing - so we now support path and options with space inside. Fix dm name usage for thin pools with '-' in name. Use new lvm.conf option thin_check_options to pass in options as string array.	2012-03-14 17:12:05 +00:00
Zdenek Kabelac	f61cacad16	Add --with-thin-check configure option If specified - use given path without test (Path could be empty) If autodetection is in use - check for command in available PATH.	2012-03-14 17:09:00 +00:00
Peter Rajnoha	88bba90e6e	WHATS_NEW	2012-03-14 12:12:21 +00:00
Alasdair Kergon	bba1e4d11f	Fix error message when pvmove LV activation fails with name already in use.	2012-03-13 20:21:26 +00:00
Zdenek Kabelac	0d3ce181e1	Better structure layout for device_info Save some relocation entries and use directly char[]. Since we do not need yes more then 127 partitions per device, use just int8_t. Move lvm_type_filter_destroy into local static function.	2012-03-12 14:40:41 +00:00
Zdenek Kabelac	aa9ebf4494	Switch to normal log_verbose message Here it's not an error case - so do not push this message to stderr.	2012-03-12 14:18:28 +00:00
Zdenek Kabelac	f6632c1ef4	Fix error path for create_toolcontext Never return unfinished toolcontext - since error path is hit on various stages of initialization we cannot leave it partially uninitialized, since we would need to spread many more test across the code for config_valid. Instead return NULL and properly release udev library resources as well.	2012-03-12 14:15:04 +00:00
Zdenek Kabelac	34a45b0029	Fix warn message and update man page Fix regression in man page. The chunk size is in kilobyte units on command line input though in the source code we work with sector size unit so make it clear in the man page. Update chunksize for thin pool in man page - it's max value is 1024M == 1G. Fix warning range message to show proper max value.	2012-03-06 09:22:02 +00:00
Alasdair Kergon	a17ac481ab	post-release	2012-03-06 04:47:37 +00:00
Alasdair Kergon	ce05af1d32	pre-release	2012-03-06 02:50:40 +00:00
Alasdair Kergon	b343d75a5a	Switch pvscan --cache major:minor to --major --minor.	2012-03-06 02:30:49 +00:00
Zdenek Kabelac	aeaec150c0	Some more missing supposedly 64bit operations. Avoid use 32bit math for extent_size.	2012-03-05 15:05:24 +00:00
Zdenek Kabelac	90423c1200	Fit thin pool metadata into 128MB If the lvcreate may decide some automagical values for a user, try to keep the pool metadata size into 128MB range for optimal perfomance (as suggested by Joe). So if the pool metadata size and chunk_size were not specified, try to select such values they would fit into 128MB size.	2012-03-05 14:19:13 +00:00
Zdenek Kabelac	975b5b42d2	Improve warning Use thin_dump --repair suggestion in log error message and use just warning on deactivation path without repair info (since node has been deactivated). Also check whether there is not 16 args for thin_check configured.	2012-03-05 14:15:50 +00:00
Zdenek Kabelac	20c40a0807	Use 64bit math Prevent 32bit overflow and resulting weird error reports when working with TB sizes..	2012-03-05 14:12:57 +00:00
Zdenek Kabelac	d18c70b4df	Validate udev structures Avoid using NULL pointers from udev. It seems like some older versions of udev were improperly returning NULL in some case, so do not silently break here, and give at least a warning to the user.	2012-03-04 17:40:59 +00:00
Zdenek Kabelac	462de06d96	Return success for deactivation of thin pool if the thin_check fail on thin pool - still return successful deactivation, since lvremove would currently fail. TODO: find some way to not run check with lvremove.	2012-03-04 17:36:23 +00:00
Alasdair Kergon	35216ca66c	Scan all devices for lvmetad if 'pvscan --cache' used without device list.	2012-03-03 18:32:53 +00:00
Alasdair Kergon	37160ef249	post-release	2012-03-03 02:08:37 +00:00
Alasdair Kergon	05babeeef5	.	2012-03-03 01:28:15 +00:00
Alasdair Kergon	02b351ad95	pre-release	2012-03-03 01:00:49 +00:00
Zdenek Kabelac	0438b15353	List _thread_registry missed mutex Operation on _thread_registry needs to be covered by mutex. Cosmetic move a die code after free for valgind short leak list.	2012-03-02 22:57:25 +00:00
Zdenek Kabelac	6c7a6c07ee	Add support for thin check Use libdm callback to execute thin_check before activation thin pool and after deactivation as well. Supporting thin_check_executable which may pass in extra options for the tool.	2012-03-02 21:49:43 +00:00
Zdenek Kabelac	1babf24949	Fix estimation of pool metadata device size If no size was give the later added minimal size check efectively disable this code. Also the argument for size now must be kept in sector_size, so adding division by SECTOR_SIZE (moved into a const expression)	2012-03-02 17:25:21 +00:00
Zdenek Kabelac	52f76a7682	Test alloc fail	2012-03-01 21:49:32 +00:00
Zdenek Kabelac	1281a5e3d5	Check for alloc error Simplify segtype_str usage and check for NULL segtype.	2012-03-01 21:21:54 +00:00
Zdenek Kabelac	c219934a87	Add _rimage as reserved suffix	2012-03-01 10:39:21 +00:00
Zdenek Kabelac	3bd9048854	Improve error logging Log errors instead of plain return 0. Check for f->private strdup result.	2012-03-01 10:30:48 +00:00
Zdenek Kabelac	f9467799c1	Check for allocation error return ENOMEM when malloc fails.	2012-03-01 09:54:23 +00:00
Zdenek Kabelac	24ab6328f7	Wipe initial 4KiB for non-zeroed thin volumes If the thin pool has disabled zeroing (created with -Zn), we at least clear initial 4KiB of such thin volume (provisions 1st block). If lvcreate is executed with '-an' command will abort (same way like we for normal LV - however for normal LV option -Zn may skip clearing completely, for thin volumes this option is not supported (applies only for pools).	2012-02-29 22:08:57 +00:00
Jonathan Earl Brassow	62e38da133	Allow cluster mirrors to handle the absence of the checkpoint lib (libSaCkpt). The OpenAIS checkpoint library is going away; therefore, cmirrord must operate without it. The algorithms the handle the timing of when to send a checkpoint, the determination of what to send, and which ongoing cluster requests are relevent with respect to the checkpoints are unaffected. We need only replace the functions that actually perform the storing/transmitting and retrieving/receiving of the checkpoint data. Rather than store the checkpoint data in an OpenAIS checkpoint file, we simply transmit it along with the message that notifies the incoming node that the checkpoint is ready.	2012-02-29 21:15:34 +00:00
Zdenek Kabelac	54b2aadf40	Revert free of allocated segtype lvm_register_segtype takes ownership of segtype and call destructor for it in error path.	2012-02-28 14:23:41 +00:00
Zdenek Kabelac	0650d875e8	Test dm_hash_insert() failures mem failures	2012-02-28 11:12:58 +00:00
Zdenek Kabelac	bd046f0201	Ensure clvmd message is always \0 terminated Drop whole buffer clearing (most messages at <100 bytes). Just make sure we have always \0 terminated string for strlen() operations. (before for PIPE_BUF sized messages this was not set).	2012-02-28 11:06:56 +00:00
Zdenek Kabelac	c19d86338d	Better detection of missing dmeventd fifo connection	2012-02-28 11:03:24 +00:00
Zdenek Kabelac	6f8bd07b40	Duplicate standard in/out descriptors for daemon Addressing somewhat tricky bug here. Since stdin,stdout,stderr were closed it's been occasionally possible to see some unexpected messages to be flowing into a clvmd and generating some randomly sized allocation of many megabytes. Since the message was not being generated by standard send_message() construction, after some more testing it apperead to be a debug log message - thus something has flown to local socket opened on strandard out descriptor. To fix the issue - use standard file descriptor duplication code for daemons. For making easier debugging of polling daemon - developer might want to recompile without modifition of standard file descriptors.	2012-02-28 10:06:53 +00:00
Zdenek Kabelac	696052b78e	Limit max size of clvmd message This could be seen as some sort of simple validation - it's not easy to recognize a valid message for now - but we definitely do not want to allocate a lot of megabytes in clvmd memory locked daemon when broken message gets in. Size of 8000 is just selected for now - possibly there could be much lower value put in.	2012-02-28 09:58:19 +00:00
Zdenek Kabelac	782a37e411	Do not send uninitilised bytes Use struct initalizers to fill struct members and at the same time have all unspecified members set to 0.	2012-02-28 09:53:55 +00:00
Zdenek Kabelac	f380cd7d98	Use unsigned type for bitmask Using report_type_t for bitmask is not correct, since we have not defined types for all bit combinations - so switching to unsigned type, since values of report_type_t enum are unsigned.	2012-02-27 11:45:05 +00:00
Zdenek Kabelac	75f8f3ce8b	Nicer cleanup of excl_uuid hash Since it on exit path, it's not a big difference, but makes less noise in analyzer and valgrind.	2012-02-27 11:26:25 +00:00
Zdenek Kabelac	530efdb525	Check for vg_name existance Since vg_read() mda ops could be called with NULL vg_name, check it before derefence also for pool and format1.	2012-02-27 11:23:15 +00:00
Zdenek Kabelac	9737943c4c	Fix missing break Bug introduced with addition of internal error default case. Seem like this code is not used. TODO: add coverage test.	2012-02-27 11:13:48 +00:00
Zdenek Kabelac	e1153fd385	Test seg pointer for non-null As the function accepts NULL for 'seg' parameter, check for it before dereference.	2012-02-27 10:15:08 +00:00
Zdenek Kabelac	3af1ebe31e	Test result of _init_tags.	2012-02-27 10:05:35 +00:00
Zdenek Kabelac	24d39aa142	Always check result of _set_vg_name()	2012-02-27 10:00:23 +00:00
Zdenek Kabelac	7e25b8f932	Drop uname call, it's not used from gulm era.	2012-02-27 09:58:18 +00:00
Zdenek Kabelac	93b087da97	Check allocation result	2012-02-27 09:56:27 +00:00
Zdenek Kabelac	71f3bbd53f	Limit sscanf params with size Make sure parsed string fits given char buffer.	2012-02-23 22:50:50 +00:00
Zdenek Kabelac	499a161640	Use const for lv lv_is_active doesn't needs modifiable LV struct so keep it const. Remove lv_send_message() left bits from code - they were never released in 2.02.89.	2012-02-23 22:41:57 +00:00
Zdenek Kabelac	c817e60796	Use same signed numbers Keep unsigned aritmetic. TODO: we should probably switch dm_split_words() to return unsigned numbers. (minor API libdm change mostly compatible)	2012-02-23 22:30:20 +00:00
Alasdair Kergon	1a4b6136be	post-release	2012-02-23 18:26:28 +00:00
Alasdair Kergon	f9fc7d8da4	pre-release	2012-02-23 18:22:09 +00:00
Jonathan Earl Brassow	870762d8e3	Require number of stripes to be greater than parity devices in higher RAID. Also, add some comments to code that I recently added that may be unclear otherwise.	2012-02-23 17:36:35 +00:00
Peter Rajnoha	da532741c9	Add LVMetaD systemd units.	2012-02-23 11:24:07 +00:00
Jonathan Earl Brassow	9bdfb30720	Fix allocation code to allow replacement of single RAID 4/5/6 device. The code fail to account for the case where we just need a single device in a RAID 4/5/6 array. There is no good way to tell the allocation functions that we don't need parity devices when we are allocating just a single device. So, I've used a bit of a hack. If we are allocating an area_count that is <= the parity count, then we can assume we are simply allocating a replacement device (i.e. no need to include parity devices in the calculations). This should make sense in most cases. If we need to allocate replacement devices due to failure (or moving), we will never allocate more than the parity count; or we would cause the array to become unusable. If we are creating a new device, we should always create more stripes than parity devices.	2012-02-23 03:57:23 +00:00
Alasdair Kergon	d860272b00	Check all tags and LV names are in a valid form in vg_validate.	2012-02-23 00:11:01 +00:00
Peter Rajnoha	4417a8bd40	Add configure --with-tmpfilesdir and lvm2 tmpfiles.d configuration file itself. /etc/tmpfiles.d directory holds configuration files for temporary/volatile files and directories that should be automatically managed. For example, if we have some parts of the fs hierarchy on tmpfs, we'd like to recreate some files or directories on every boot so they're always prepared for use. Systemd can read such configuration files. For now, the lock and run directory are the ones that are most probably placed on tmpfs. If this is the case, we can install the configuration by 'make install_tmpfiles_configuration'.	2012-02-22 17:55:10 +00:00
Jonathan Earl Brassow	e8eb64c878	Allow 'lvconvert --repair' to operate on RAID 4/5/6. The higher level RAIDs should be allowed for repair along with 'mirror' and 'raid1' segment types.	2012-02-22 17:18:49 +00:00
Jonathan Earl Brassow	0e92b70f71	* empty log message *	2012-02-22 17:14:38 +00:00
Alasdair Kergon	971248911b	post-release	2012-02-20 21:11:06 +00:00
Alasdair Kergon	815aa3555f	pre-release	2012-02-20 19:38:19 +00:00
Zdenek Kabelac	d81498a824	Initialize dmeventd monitoring for every command Read lvm.conf setting for monitoring for each command. So we should not activate monitoring if the default compilation is set to monitor during lvconvert commnads. Patch also removes check for clustered VG and allows to disable monitoring for clustered VG with the assumption, the problem with monitoring and dmeventd flag passing for INGNORE is already fixed.	2012-02-15 15:18:43 +00:00
Zdenek Kabelac	1fa8ddaf51	Initialize monitoring support only for thin pools	2012-02-15 13:49:51 +00:00
Jonathan Earl Brassow	ad48a46fc9	Make conversion from a synced 'mirror' to 'raid1' not cause a full resync. It was not possible to pass down the DM_[FORCE\|NO]SYNC flags to 'dm_tree_node_add_raid_target'. This meant that converting to 'raid1' from 'mirror' would cause a full resync. (It also meant that '--nosync' was ineffective when creating a 'raid1' LV.) I've taken the 'reserved' parameter in 'dm_tree_node_add_raid_target' and used it for the "flags" parameter. Now it is possible to pass the sync flags and any other flags that may come up.	2012-02-13 20:13:39 +00:00
Zdenek Kabelac	172c87f7ca	Never try to test character past given buffer In case units[0] would be already '\0', do not check units[1].	2012-02-13 14:23:40 +00:00
Peter Rajnoha	e587cb6ac5	Add configure --with-systemdsystemunitdir.	2012-02-13 13:02:47 +00:00
Zdenek Kabelac	3e74542b5d	Add check for allocation failure	2012-02-13 11:16:42 +00:00
Zdenek Kabelac	cbe6bcd593	Add check for rimage name allocation failure	2012-02-13 11:10:37 +00:00
Zdenek Kabelac	bed744c15d	Add check for mda_copy failure	2012-02-13 11:09:25 +00:00
Zdenek Kabelac	fde44d055b	Add check for failure	2012-02-13 11:07:55 +00:00
Zdenek Kabelac	52f2f3eae4	Add free_orphan_vg Move commod code to destroy orphan VG into free_orphan_vg() function. Use orphan vgmem for creation of PV lists. Remove some free_pv_fid() calls (FIXME: check all of them) FIXME: Check whether we could merge release_vg back again for all VGs.	2012-02-13 11:03:59 +00:00
Zdenek Kabelac	65079de265	If the same fid is already same avoid ref_counting	2012-02-13 11:01:34 +00:00
Zdenek Kabelac	960ee343f3	Add missing test for failure of lvmcache_foreach_pv	2012-02-13 10:58:20 +00:00
Zdenek Kabelac	f9411bb2af	Clean error paths for format instance With updated orphan VG code this code needed some updates. Add missing log_error for allocation failures.	2012-02-13 10:56:31 +00:00
Zdenek Kabelac	874a4fd80d	Release_vg instead of plain free in error path	2012-02-13 10:53:31 +00:00
Zdenek Kabelac	bbf98c19a8	Log error reporting for failing _alloc_pv Drop unneeded zeroing of zalloced memory region.	2012-02-13 10:51:52 +00:00
Fabio M. Di Nitto	94424fabd0	In the new corosync world, dlm is a standalone service. Fix clvmd init script to Require dlm service when building for the new corosync or clvmd will fail to start.	2012-02-13 05:24:57 +00:00
Alasdair Kergon	0a182731e4	post-release	2012-02-13 00:23:21 +00:00
Alasdair Kergon	79b3966a34	pre-release	2012-02-12 23:02:52 +00:00
Alasdair Kergon	ba14fff2af	FMT_INSTANCE_PV is no longer used	2012-02-12 22:37:24 +00:00
Petr Rockai	872b97a752	What's new.	2012-02-10 02:56:54 +00:00
Petr Rockai	0fbbc6ce13	What's new: lvmcache.	2012-02-10 01:29:46 +00:00
Peter Rajnoha	5fa417a9c0	Stop processing lvextend if trying to extend a mirror that is being recovered. Missing correct return value in lv_extend fn.	2012-02-09 15:13:42 +00:00
Zdenek Kabelac	a7e2da0585	Thin add pool_below_threshold Test both data and metadata percent usage.	2012-02-08 13:05:38 +00:00
Zdenek Kabelac	94f88a4f14	Fix test for lv_snapshot_percent Do not check for PERCENT_MERGE_FAILED if the lv_snapshot_percent() failed. (test for snap_percent would be testing uninitialized value).	2012-02-08 13:02:07 +00:00
Zdenek Kabelac	9278655de1	Some fixmes 'len' calculation is unused ? Unreachable code could be removed or moved upward ?	2012-02-08 12:57:15 +00:00
Zdenek Kabelac	462835faa0	Switch to return void List delete cannot fail, so there is no reason to test for error.	2012-02-08 12:52:58 +00:00
Zdenek Kabelac	33dea28e23	Use dm_snprintf and improve error handling Add standard error reporting with error logging. Use plain alloc instead of zalloc for string buffer. Use dm_snprintf with valid test for <0.	2012-02-08 12:50:10 +00:00
Zdenek Kabelac	7ffca95bb6	Add range test for device number Check the output of atoi is in valid range.	2012-02-08 12:48:14 +00:00
Zdenek Kabelac	3a8b6a9948	Keep page_size as signed number Since it's return value from sysconf and is checked for <0.	2012-02-08 11:34:46 +00:00
Zdenek Kabelac	84fd8ea4bd	Move done jump lower Since before 'goto done' is bufused zeroed, it would otherwise write 1 byte in front of buffer.	2012-02-08 11:31:29 +00:00
Zdenek Kabelac	0154bcf0a7	Check that whole locking_dir fits _lock_dir buffer	2012-02-08 11:17:34 +00:00
Zdenek Kabelac	02aeb23f1f	Use dm_list_iterate_items_safe And avoid direct access to list member variables. Inline _free_li().	2012-02-08 11:12:18 +00:00
Zdenek Kabelac	5dfd775384	Ensure strncpy() function always ends with '\0' Since last character needs to be \0 for string, pass buffer size smaller by 1 byte.	2012-02-08 11:05:04 +00:00
Zdenek Kabelac	cd4c26a27f	Set status for error path Do not leave status unitialized, since in some cases, it's tested, when the function returns error.	2012-02-08 10:56:17 +00:00
Zdenek Kabelac	f9bd70878b	Add missing deps for lvm2api Hmm, wasted some time because of this missing deps....	2012-02-08 10:52:45 +00:00
Zdenek Kabelac	ee54e43702	Fix resource leaks for failing allocation In case, something would fail during format initialization, return allocated memory.	2012-02-08 10:49:36 +00:00
Zdenek Kabelac	12ac6f9f11	Release allocated resources in error path If composite_filter_create() fails, release filters.	2012-02-08 10:46:24 +00:00
Zdenek Kabelac	7b408a08ef	Check result of lstat If lstat returns errno different from ENOENT, do not use the content of struct stat 'buf'.	2012-02-08 10:43:42 +00:00
Petr Rockai	3959c60250	What's new.	2012-02-01 20:13:44 +00:00
Alasdair Kergon	2a57a934bb	post-release	2012-02-01 18:46:57 +00:00
Alasdair Kergon	c8250560cd	pre-release	2012-02-01 15:17:04 +00:00
Zdenek Kabelac	42b5c54092	Add synchornization point in mirror log init. Put extra sync point when mirror log is deactivated and before it's activated for the second time.	2012-02-01 13:50:36 +00:00
Zdenek Kabelac	ab852ffe66	Disable partial activation for thin LVs and LVs with all missing segments Count number of error and existing areas and if there is no existing area for the LV avoid its activation. Always disable partial activatio for thin volumes. For mirrors currently put in hack to let it pass with a special name since current mirror code needs to activate such LV during some operations.	2012-02-01 13:47:27 +00:00
Zdenek Kabelac	dfb679e5c7	Avoid warning for small pv_min_size Do not print warning for pv_min_size set in range between 512KB and 2MB.	2012-02-01 13:42:18 +00:00
Peter Rajnoha	b627165a75	Clean up systemd unit ordering and requirements.	2012-02-01 13:08:39 +00:00
Zdenek Kabelac	8d2d4f2026	User correct base dir for lcov reports Fix problem when srcdir != builddir.	2012-02-01 10:46:45 +00:00
Alasdair Kergon	72abf1d880	Track unreserved space for all alloc policies and then permit NORMAL to place log and data on same single PV.	2012-02-01 02:10:45 +00:00
Alasdair Kergon	b6d7a48480	Automatically detect whether corosync clvmd needs to use confdb or cmap. (fabio)	2012-01-31 21:21:53 +00:00
Zdenek Kabelac	15fd61e492	Fix data% reporting For reading % of mapped size of thin volume use as origin for old style snapshot '-real' device needs to be queried. Fix log_error report given for lvs -a in this case.	2012-01-28 20:12:26 +00:00
Alasdair Kergon	91c631c558	post-release	2012-01-27 01:23:40 +00:00
Alasdair Kergon	a1991f101d	pre-release	2012-01-26 14:02:42 +00:00
Zdenek Kabelac	b45035ee14	Test for uname result in fail path initialize to 0.	2012-01-25 22:17:57 +00:00
Zdenek Kabelac	209da6efee	Fix missing dmt destructor Also always initialize maj,min,patchlevel when success is returned.	2012-01-25 22:16:04 +00:00
Zdenek Kabelac	1ef10bd81a	Limit alignment to 32bit values to get the same behavior on 32/64 machines.	2012-01-25 21:52:53 +00:00
Zdenek Kabelac	e6771e50a9	Check for correctness of uint64 value if exists	2012-01-25 21:43:51 +00:00
Zdenek Kabelac	e8905d9816	Rename origin_only to more generic use_layer flag Since now we have more layered devices i.e. thin volumes - support selection of layer via flag.	2012-01-25 13:10:26 +00:00
Zdenek Kabelac	10e80a212f	Update verbose lvs to print metadata_percent info Update lvs -o fields in WHATS_NEW.	2012-01-25 11:32:41 +00:00
Zdenek Kabelac	c0663a97a5	Update lv_info whats_new	2012-01-25 09:00:57 +00:00
Zdenek Kabelac	bdba904d7c	Thin add lv_thin_pool_transaction_id Easy function to get transaction_id status value.	2012-01-25 08:48:42 +00:00
Jonathan Earl Brassow	6cf3274732	Use suspend\|resume_origin_only when up-converting RAID LVs, as mirrors do. Failure to do so results in "Performing unsafe table load while X device(s) are known to be suspended" errors. While fixing the problem in this way works and is consistent with the way the mirror segment type does it, it would be nice to find a solution that uses the generic suspend/resume calls. Also included in this check-in are additions to the test suite that perform conversions on RAID LVs under a snapshot. These tests are disabled for the time being due to a kernel bug that is yet to be tracked down.	2012-01-24 14:33:38 +00:00
Jonathan Earl Brassow	d5617bccab	Fix the way RAID meta LVs are added to the dependency tree. Similar to the "mirror" segment type's log device, _add_dev_to_dtree should be called and not _add_lv_to_dtree when adding metadata sub-LVs to the deptree. Since _add_lv_to_dtree was being called, 'origin_only' could be set if a snapshot sits on top of the RAID device. This would cause the actual device that needed to be added to be skipped in favor of the non-existant device, "<foo>-real".	2012-01-23 20:56:42 +00:00
Alasdair Kergon	f5bfc8b10d	Attempt to improve clustered 'lvchange -aey' behaviour to try local node before remote nodes and address some existing anomalies.	2012-01-21 05:29:51 +00:00
Mike Snitzer	fc0f2d5031	Prompt if request is made to remove a snapshot whose "Merge failed".	2012-01-20 22:04:16 +00:00
Mike Snitzer	27e21a4adc	Allow removal of an invalid snapshot that was to be merged on next activation. Don't allow a user to merge an invalid snapshot.	2012-01-20 22:03:48 +00:00
Mike Snitzer	d658922f36	Use m and M lv_attr to indicate that a snapshot merge failed in lvs. snapshot (m)erge failed, suspended snapshot (M)erge failed	2012-01-20 22:03:03 +00:00
Mike Snitzer	23e34c729b	Differentiate between snapshot status of "Invalid" and "Merge failed".	2012-01-20 22:02:04 +00:00
Mike Snitzer	861c624acb	Lookup snapshot usage percent of origin when a snapshot is merging.	2012-01-20 21:56:01 +00:00
Zdenek Kabelac	c54998209d	Update lvdisplay to show more info about thin LVs Reformat name and path how the LV is represented with lvm1 compatible option, to switch to the old way - which had number of problem - i.e. many links do not exist - since for private devices we are not creating them. Add more info about thin pools and volumes.	2012-01-20 16:59:58 +00:00
Zdenek Kabelac	f881095a69	Drop hack in segtype reporting Since striped name function knows when to report 'linear' instead of 'stripe' type name - drop it from this place. This fixes problem when reporting segtype e.g. for thin-pool which is also using area_count=1 to store thin data device reference. It also returns properly strduped memory instead of badly casted const char*.	2012-01-20 10:55:28 +00:00
Alasdair Kergon	b2b316ab51	.	2012-01-20 03:56:18 +00:00
Jonathan Earl Brassow	25d1410592	Preserve exclusive activation of cluster mirror when converting. This patch to the suspend code - like the similar change for resume - queries the lock mode of a cluster volume and records whether it is active exclusively. This is necessary for suspend due to the possibility of preloading targets. Failure to check to exclusivity causes the cluster target of an exclusively activated mirror to be used when converting - rather than the single machine target.	2012-01-20 00:27:18 +00:00
Zdenek Kabelac	2f65269b77	Drop unimplemented	2012-01-19 16:22:42 +00:00
Zdenek Kabelac	53d7985fa1	Add support to keep info about creation time and host for each LV Basic support to keep info when the LV was created. Host and time is stored into LV mda section. FIXME: Current version doesn't support configurable string via lvm.conf and used fixed version strftime "%Y-%m-%d %T %z".	2012-01-19 15:31:45 +00:00
Alasdair Kergon	a7d2f7795a	Make error message hit when preallocated memlock memory exceeded clearer.	2012-01-12 18:29:07 +00:00
Alasdair Kergon	8f95d94b4f	Show read-only activation in display tools.	2012-01-12 16:58:43 +00:00
Alasdair Kergon	a18dcfb533	Add activation/read_only_volume_list to override LV permission in metadata.	2012-01-12 01:51:56 +00:00
Alasdair Kergon	1e482f7ca6	Give priority to emcpower devices with duplicate PVIDs.	2012-01-11 20:38:42 +00:00
Zdenek Kabelac	7afa7b079c	Check for error code in _adjust_policy_params If error is detected in _adjust_policy_params, break further command processing.	2012-01-09 12:31:52 +00:00
Zdenek Kabelac	4fbde0143a	Support rounding of percentage upward We want to keep this logic - when LV is extend - extend the LV by at least given amount, when LV is reduced - reduce the LV by at most given amount. So for this the rounding needs to be used. Current logic which seems to satisfy give rule is to round up all extent values for LV resize upward except for values with '-' sign that are round downward. This patch also fixes the problem when lvextend --use-polices tried to extend LV the by i.e. 20% - but the resulting 20% were smaller the extent size thus before this patch no extension happened.	2012-01-05 15:38:18 +00:00
Zdenek Kabelac	1aae627433	Use new dmeventd_lvm2_command function in dmeventd plugins. For snapshot, prepare whole command in front into private buffer. Add also some missing '\n' for syslog messages. For raid and mirror only convert creation of command line string. This should avoid any unbound growth of mempool for dm_split_names.	2011-12-22 16:37:01 +00:00
Zdenek Kabelac	8527b92738	Add helper function dmeventd_lvm2_command(). Since this code is in all plugins - create a common helper function.	2011-12-22 15:55:21 +00:00
Zdenek Kabelac	59e1bb62de	Updated documentation for dmeventd. Update man page style. Mention raid and thin plugins. Update help text printed by command to match man page.	2011-12-22 15:50:38 +00:00
Zdenek Kabelac	5339307ca7	Drop extra stat before open of device Since the !(dev->flags & DEV_REGULAR) code path just called dev_name_confirmed() which has just called 'stat()' inside, remove duplicate second stat() call here.	2011-12-21 13:24:24 +00:00
Zdenek Kabelac	538d5e81a7	Do not lstat common path prefix When both path have identical prefix i.e. /dev/disk/by-id skip 2 x lstat() for /dev /dev/disk /dev/disk/by-id and directly lstat() only different part of the path. Reduces amount of lstat calls on system with lots of devices.	2011-12-21 13:21:09 +00:00
Zdenek Kabelac	5146908366	Add common initialization code for struct device Avoid duplicate code and add _dev_init() where all common member values are initialized.	2011-12-21 13:17:54 +00:00
Zdenek Kabelac	b062ee2826	Always zalloc device structure Since there is zalloc behind the macro, put 'z' into the name. Make the 'use_malloc' code path also using zalloc() call, so it also give zeroed area.	2011-12-21 13:14:54 +00:00
Zdenek Kabelac	169470b621	Fix missing thread list manipulation For manipulation with thread list to avoid race with timeout thread, take also _timeout_mutex.	2011-12-21 13:03:06 +00:00
Zdenek Kabelac	d3b4a0f322	Check lv pointer for NULL before derefence.	2011-12-21 12:59:22 +00:00
Zdenek Kabelac	61158adbcf	Allow empty strings for description and creation_host config fields	2011-12-21 12:49:00 +00:00
Alasdair Kergon	66e5b7f53c	Reinstate support for format1 snapshots, but issue deprecated warning. I anticipate removing support for snapshots with lvm1-formatted metadata in a future release.	2011-12-20 00:02:18 +00:00
Alasdair Kergon	594753751a	Only use built-in stack size in clvmd - ignore lvm.conf.	2011-12-08 21:24:08 +00:00
Jonathan Earl Brassow	d098140177	Add policy based automated repair of RAID logical volumes The RAID plug-in for dmeventd now calls 'lvconvert --repair' to address failures of devices in a RAID logical volume. The action taken can be either to "warn" or "allocate" a new device from any spares that may be available in the volume group. The action is designated by setting 'raid_fault_policy' in lvm.conf - the default being "warn".	2011-12-06 19:30:15 +00:00
Jonathan Earl Brassow	9711057499	Don't allow two images to be split and tracked from a RAID LV at one time Also, don't allow a splitmirror operation on a RAID LV that is already tracking a split, unless the operation is to stop the tracking and complete the split. Example: ~> lvconvert --splitmirrors 1 --trackchanges vg/lv /dev/sdc1 # Now tracking changes - image can be merged back or split-off for good ~> lvconvert --splitmirrors 1 -n new_name vg/lv /dev/sdc1 # ^ Completes split ^ If a split is performed on a RAID that is tracking an already split image and PVs are provided, we must ensure that 1) the already split LV is represented in the PVs 2) we are careful to split only the tracked image	2011-12-01 00:21:04 +00:00
Jonathan Earl Brassow	d34991ed97	Don't allow size change of RAID LV that is tracking changes for a split image Don't allow size change of RAID sub-LVs independently	2011-12-01 00:13:16 +00:00
Jonathan Earl Brassow	a927e401f1	Do not allow users to change the name of RAID sub-LVs or the name of the RAID LV if it is tracking changes for a split image.	2011-12-01 00:09:34 +00:00
Jonathan Earl Brassow	9981b8be03	WHATS_NEW for previous commit.	2011-12-01 00:05:40 +00:00
Jonathan Earl Brassow	0c506d9a40	Support the ability to replace specific devices in a RAID array. RAID is not like traditional LVM mirroring. LVM mirroring required failed devices to be removed or the logical volume would simply hang. RAID arrays can keep on running with failed devices. In fact, for RAID types other than RAID1, removing a device would mean substituting an error target or converting to a lower level RAID (e.g. RAID6 -> RAID5, or RAID4/5 to RAID0). Therefore, rather than removing a failed device unconditionally and potentially allocating a replacement, RAID allows the user to "replace" a device with a new one. This approach is a 1-step solution vs the current 2-step solution. example> lvconvert --replace <dev_to_remove> vg/lv [possible_replacement_PVs] '--replace' can be specified more than once. example> lvconvert --replace /dev/sdb1 --replace /dev/sdc1 vg/lv	2011-11-30 02:02:10 +00:00
Alasdair Kergon	8dd6036da4	Add activation/use_linear_target enabled by default. (prajnoha) LVM metadata knows only of striped segments - not linear ones. The activation code detects segments with a single stripe and switches them to use the linear target. If the new lvm.conf setting is set to 0 (e.g. in a test script), this 'optimisation' is turned off.	2011-11-28 20:37:51 +00:00
Zdenek Kabelac	4b42d7ae98	Cleanup test makefiles Simplify /api makefile and use SUBDIRS target for test dir. Properly cleanup Makefiles with distclean in /test. Use symbolic links for shell scripts for non-srcdir compilation.	2011-11-23 12:21:41 +00:00
Alasdair Kergon	c122e5e7a3	Move y/n prompts to stderr and repeat if response has both 'n' and 'y'. (Note that in a future release we might make this stricter and insist on exactly 'y' or 'n'.)	2011-11-23 01:34:38 +00:00
Petr Rockai	c2bd285160	More of WHATS_NEW.	2011-11-21 12:44:38 +00:00
Petr Rockai	c6856ef42b	Update WHATS_NEW.	2011-11-21 12:33:56 +00:00
Alasdair Kergon	bf75c30493	Don't ignore configure --mandir and --infodir.	2011-11-20 20:52:09 +00:00
Zdenek Kabelac	647c8edf82	Drop pool memory allocated in lv_has_target_type Remove FIXMES - there should not be any pool free call since the memory pool is from device manager, and pool is detroyed after the operation, so doing extra free here would not help here. However lv_has_target_type() is using cmd mempool so here the extra call for dm_pool_free makes sence.	2011-11-18 19:42:03 +00:00
Zdenek Kabelac	900f5f8187	Replace dynamic buffer allocations for PATH_MAX Use static buffer instead of stack allocated buffer. This reduces stack size usage of lvm tool and the change is very simple. Since the whole library is not thread safe - it should not add any new problems - and if there will be some conversion it's easy to convert this to use some preallocated buffer.	2011-11-18 19:31:09 +00:00
Zdenek Kabelac	8deeeb07ea	Unlock memory for vg_write For write we do not need to hold memory locked. This relaxes many conditions and avoid problems when allocating a lot of memory for writting metadata buffers. (In case of huge MDA size this would lead to mismatch between locked and unlocked memory region size). Add also internal check we are not writing in critical section.	2011-11-18 19:28:00 +00:00
Zdenek Kabelac	37f274ced9	Query before removing inactive snapshots Removal of an inactive origin removes also all related snapshots. When we now support 'old' external snapshots with thin volumes, removal of pool will not only drop all thin volumes, but as a consequence also all snapshots - which might be seen a bit unexpected for the user - so add a query to confirm such action. lvremove -f will skip the prompt.	2011-11-18 19:25:20 +00:00
Zdenek Kabelac	e8a40f6571	Allow to activate snapshot Add extra code to active and deactivate related snapshots and origin when user specifies snapshot logical volume as lvchange parameter. Before patch: $> lvs -a LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lvol0 mvg owi-a-s- 1.00k lvol1 mvg swi-a-s- 16.00k lvol0 0.00 lvol2 mvg swi-a-s- 16.00k lvol0 0.00 $> lvchange -an mvg/lvol2; echo $? Can't change snapshot logical volume "lvol2". 5 After patch: $> lvchange -an mvg/lvol2 Change of snapshot lvol2 will also change its origin lvol0 and 1 other snapshot(s). Proceed? [y/n]: n Logical volume lvol2 not changed. $> lvchange -y -an mvg/lvol2; echo $? 0 $> lvs -a LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lvol0 mvg owi---s- 1.00k lvol1 mvg swi---s- 16.00k lvol0 lvol2 mvg swi---s- 16.00k lvol0	2011-11-18 19:22:49 +00:00
Zdenek Kabelac	e858ac1546	Skip non-virtual snapshots for availability Change the behavior of availability change. With this patch the lvgchange returns success when VG is properly changed. It skips non-virtual origins from being changes when only 'vg' is specified as lvchange -a parameter. Before this change we had this: $> lvs -a LV VG Attr LSize Pool Origin lvol0 mvg owi-a-s- 128.00k lvol1 mvg owi-a-s- 128.00k lvol2 mvg swi-a-s- 1.25m lvol0 lvol3 mvg swi-a-s- 1.25m lvol1 $> lvchange -an mvg ; echo $? Can't change snapshot logical volume "lvol2". Can't change snapshot logical volume "lvol3". 5 $> lvs -a LV VG Attr LSize Pool Origin lvol0 mvg owi---s- 128.00k lvol1 mvg owi---s- 128.00k lvol2 mvg swi---s- 1.25m lvol0 lvol3 mvg swi---s- 1.25m lvol1 $> lvchange -ay mvg ; echo $? Can't change snapshot logical volume "lvol2". Can't change snapshot logical volume "lvol3". 5 $> lvs LV VG Attr LSize Pool Origin lvol0 mvg owi-a-s- 128.00k lvol1 mvg owi-a-s- 128.00k lvol2 mvg swi-a-s- 1.25m lvol0 lvol3 mvg swi-a-s- 1.25m lvol1 After commit: $> lvs -a LV VG Attr LSize Pool Origin lvol0 mvg owi-a-s- 128.00k lvol1 mvg owi-a-s- 128.00k lvol2 mvg swi-a-s- 1.25m lvol0 lvol3 mvg swi-a-s- 1.25m lvol1 $> lvchange -an mvg ; echo $? 0 $> lvs -a LV VG Attr LSize Pool Origin lvol0 mvg owi---s- 128.00k lvol1 mvg owi---s- 128.00k lvol2 mvg swi---s- 1.25m lvol0 lvol3 mvg swi---s- 1.25m lvol1 $> lvchange -ay mvg ; echo $? 0 $> lvs -a LV VG Attr LSize Pool Origin lvol0 mvg owi-a-s- 128.00k lvol1 mvg owi-a-s- 128.00k lvol2 mvg swi-a-s- 1.25m lvol0 lvol3 mvg swi-a-s- 1.25m lvol1	2011-11-18 19:19:22 +00:00
Zdenek Kabelac	91e4512619	Adjusted mirror region size only for mirrors and raids Update region_size only for mirror and raid targets. This fixes warning messages when vg is using small extent size like 1KiB and no mirror/raid is created, but the user still got the message: $> vgcreate -s 1K vg <pvs> $> lvcreate -L10K vg Using reduced mirror region size of 4 sectors	2011-11-15 17:32:12 +00:00
Zdenek Kabelac	8542953f74	Reorder AND test condition Take the easiest condition for checking first since they must apply all together, check local conditions first before doing more expensive tests.	2011-11-15 17:27:41 +00:00
Peter Rajnoha	5680d14ecd	Avoid 'mda inconsistency' by properly registering UNLABELLED_PV flag (2.02.86). When a PV label write is deferred to a vg_write call (as introduced by a patch in 2.02.86), the PV is flagged with the internal UNLABELLED_PV flag. However, when calling vg_archive before vg_write, we still have the PV labelled with the UNLABELLED_PV flag which was not recognised as a proper flag while exporting VG metadata: # vgcreate vg /dev/sda No physical volume label read from /dev/sda Metadata inconsistency: Not all flags successfully exported. Metadata inconsistency: Not all flags successfully exported. Writing physical volume data to disk "/dev/sda" Physical volume "/dev/sda" successfully created Volume group "vg" successfully created	2011-11-15 11:54:15 +00:00
Alasdair Kergon	bf09a32006	Make dmsetup.static and lvm.static build when dmeventd is disabled. udev may also need to be disabled if you didn't build it statically too. dmeventd.static could be fixed with some more work but I don't really see the point: without dlopen() it's useless, and if you have dlopen(), why not support normal shared libraries too?	2011-11-14 21:30:35 +00:00
Alasdair Kergon	630b4c2111	Move gentoo MAKEDEV to /sbin in lvm2create_initrd. (James Le Cuirot)	2011-11-12 17:03:53 +00:00
Milan Broz	07113beea3	Do not scan device if it is part of active multipath. Add filter which tries to check if scanned device is part of active multipath. Firstly, only SCSI major number devices are handled in filter. Then it checks if device has exactly one holder (in sysfs) and if it is device-mapper device and DM-UUID is prefixed by "MPATH-". If so, this device is filtered out. The whole filter can be switched off by setting mpath_component_detection in lvm.conf. https://bugzilla.redhat.com/show_bug.cgi?id=597010 Signed-off-by: Milan Broz <mbroz@redhat.com>	2011-11-11 15:11:08 +00:00
Zdenek Kabelac	65b977f249	Update lvs man page style.	2011-11-08 12:16:53 +00:00
Zdenek Kabelac	e903e37d0a	Add missing default LVM_VG_NAME Add support for exported shell variable LVM_VG_NAME also for thins and snapshots.	2011-11-07 11:01:53 +00:00
Zdenek Kabelac	4079a8f298	Avoid lvextend to overflow Add extra check to extent_count overflow. Use internal define MAX_EXTENT_COUNT instead UINT32_MAX.	2011-11-04 22:49:53 +00:00
Alasdair Kergon	13dc67cda7	Add missing lvrename mirrored log recursion in for_each_sub_lv.	2011-11-04 01:31:23 +00:00
Zdenek Kabelac	2b71bcd0cb	Improve lv_extend stack reporting and some code cleanup with setting return value.	2011-10-28 20:23:24 +00:00
Zdenek Kabelac	2fa836e843	Extend virtual segment instead of adding new one Before adding a new virtual segment to LV, check first whether the last segment isn't already of the same type. In this case extend last segment instead of creating the new one. Thin volumes should have always only 1 virtual segment, but it helps also to virtual snapshot or error segtype..	2011-10-28 20:17:55 +00:00
Zdenek Kabelac	bd4b840879	Add last_seg Implement a function to return the last segment in a LV. Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>	2011-10-28 20:12:54 +00:00
Zdenek Kabelac	7ad1c43b48	Add find_config_tree_str_allow_empty Add function to allow read of empty strings as valid arguments. Add a warning message if string argument has ignored value.	2011-10-28 20:06:49 +00:00
Jonathan Earl Brassow	682309e0b8	Disallow 'mirrored' log for cluster mirrors. Git commit ID `0864378250` was meant to disallow 'mirrored' logs for cluster mirrors. However, when add_mirror_log is used to create the log (as is now the case when using 'lvcreate' or converting only the log) the check is bypassed. This patch adds the check to add_mirror_log.	2011-10-25 13:17:04 +00:00
Zdenek Kabelac	eafbdf3029	Don't print char type[8] as a plain string pvck prints 'extra' character from the label since there is no '\0' after the struct label entry and just uint64_t follows directly. So avoid it by limiting 8 chars to be printed. https://www.redhat.com/archives/lvm-devel/2011-January/msg00109.html Signed-off-by: Paul Bolle <pebolle tiscali nl>	2011-10-24 10:24:39 +00:00
Zdenek Kabelac	f2c56bc3b6	Drop mempool parameter from read functions Use implicit vgmem pool.	2011-10-23 16:05:45 +00:00
Zdenek Kabelac	72ff89d279	Always use vg memory pool for allocated lv segment Remove mem pool parameter from alloc_lv_segment() Since we should always allocate LV segment from the vg mempool.	2011-10-23 16:02:01 +00:00
Zdenek Kabelac	9e453cab1c	Reduce stack size usage in print_log As the buf2[] and locn[] can't be used at the same time, safe 1 page from stack memory.	2011-10-22 16:52:00 +00:00
Zdenek Kabelac	06b8248d63	Make move_lv_segment non-static This function could be useful for other _manip source files. Use dm_list manipulation function for provided functionality, which make the code more readable and avoid touching list internal details here.	2011-10-22 16:42:10 +00:00
Alasdair Kergon	dbd60cf576	Pass exclusive LV locks to all nodes in the cluster. This was the intended behaviour, as described in the lvchange man page, so you have complete control through volume_list in lvm.conf, but the code seems to have been treating -ae as local-only for a very long time.	2011-10-21 15:49:45 +00:00
Zdenek Kabelac	4d83891a67	Make units for chunksize more obvious	2011-10-21 09:53:16 +00:00
Zdenek Kabelac	789fa12d62	Improve lvcreate man page Split syntax for thin-pool since it cannot be fully matched with snapshot. So to avoid more confusion - take thin support into separate line. Though still significant updates are needed for thin provisioning.	2011-10-19 16:49:13 +00:00
Petr Rockai	c266d0611a	New.	2011-10-19 09:01:03 +00:00
Jonathan Earl Brassow	3b032963d5	cmirrord now returns log name to kernel in CTR so it can be registered Version 2 of the userspace log protocol accepts return information during the DM_ULOG_CTR exchange. The return information contains the name of the log device that is being used (if there is one). The kernel can then register the device via 'dm_get_device'. Amoung other things, this allows for userspace to assemble a correct dependency tree of devices - critical for LVM handling of suspend/resume calls. Also, update dm-log-userspace.h to match the kernel header associated with this protocol change. (Includes a version inc.)	2011-10-14 14:18:49 +00:00
Zdenek Kabelac	7f815706ca	Fix lv_info open_count test When verify_udev_operations was disable, code for stacking fs operation for lvm links was completely disable - but this code was also used for collecting information, that a new node is being created. Add a new flag which is set when a creation of lv symlinks is requested which should restore old behaviour of lv_info function, that has called fs_sync() before quere for open count on device.	2011-10-14 13:23:47 +00:00
Zdenek Kabelac	8a706f836d	Simplify worker loop Do not reacquire mutex several times without a real reason. Code readability is also better.	2011-10-11 09:54:39 +00:00
Zdenek Kabelac	96de8adcc9	Use barrier instead of mutex Barrier is supposed to be used in situation like this and replace tricky mutex usage, where mutex has been unlocked by a different thread than the locking thread.	2011-10-11 09:26:04 +00:00
Zdenek Kabelac	da0ec96159	Update	2011-10-11 09:20:17 +00:00
Zdenek Kabelac	dde1ca1ef1	Update whats new	2011-10-11 09:14:51 +00:00
Zdenek Kabelac	d4f134b8f6	Check for refresh_filter failure Properly detect if the filters were refreshed properly. (May needs few more fixes ??) Filter refresh may fail because it may be out of free file descriptors when clvmd gets overloaded.	2011-10-11 09:09:00 +00:00
Zdenek Kabelac	efe62a3411	Use condition instead of sleep Replace usleep with pthread condition to increase speed testing (for simplicity just 1 condition for all locks). Use thread mutex also for unlock resource (so it wakes up awaiting threads) Better check some error states and return error in fail case with unlocked mutex.	2011-10-11 09:05:20 +00:00
Zdenek Kabelac	de75bc6688	Improve backtrace reporting Add <backtrace> so the function appears logged for the fail path.	2011-10-11 08:59:42 +00:00
Zdenek Kabelac	4007ac814f	Change message severity Using log_warn to report missing symlinks as warning, since the command itself returns as successful, we should not produce log_error(). log_warn is better fit here.	2011-10-11 08:57:13 +00:00
Jonathan Earl Brassow	f60175c308	Add the ability to convert LVs of "mirror" segtype to "raid1" segtype. Example: ~> lvconvert --type raid1 vg/mirror_lv Steps to convert "mirror" to "raid1" 1) Allocate a RAID metadata LV for each mirror image from the same PVs on which they are located. 2) Clear the metadata LVs. This involves writing LVM metadata, so we don't change any aspects of the mirror LV before this so that the user can easily remove LVs from the failed convert attempt while retaining the original mirror. 3) Remove the mirror log, if it exists. 4) Add metadata LVs to mirror LV 5) Rename mirror sub-lvs (s/mimage/rimage/) 6) Change flags and segtype from mirror to raid1	2011-10-07 14:56:01 +00:00
Jonathan Earl Brassow	d3582e0252	Add the ability to convert linear LVs to RAID1 Example: ~> lvconvert --type raid1 -m 1 vg/lv The following steps are performed to convert linear to RAID1: 1) Allocate a metadata device from the same PV as the linear device to provide the metadata/data LV pair required for all RAID components. 2) Allocate the required number of metadata/data LV pairs for the remaining additional images. 3) Clear the metadata LVs. This performs a LVM metadata update. 4) Create the top-level RAID LV and add the component devices. We want to make any failure easy to unwind. This is why we don't create the top-level LV and add the components until the last step. Should anything happen before that, the user could simply remove the unnecessary images. Also, we want to ensure that the metadata LVs are cleared before forming the array to prevent stale information from polluting the new array. A new macro 'seg_is_linear' was added to allow us to distinguish linear LVs from striped LVs.	2011-10-07 14:52:26 +00:00
Jonathan Earl Brassow	a80192b6a7	Allow 'nosync' extension of mirrors. This patch allows a mirror to be extended without an initial resync of the extended portion. It compliments the existing '--nosync' option to lvcreate. This action can be done implicitly if the mirror was created with the '--nosync' option, or explicitly if the '--nosync' option is used when extending the device. Here are the operational criteria: 1) A mirror created with '--nosync' should extend with 'nosync' implicitly [EXAMPLE]# lvs vg; lvextend -L +5G vg/lv ; lvs vg LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg Mwi-a-m- 5.00g lv_mlog 100.00 Extending 2 mirror images. Extending logical volume lv to 10.00 GiB Logical volume lv successfully resized LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg Mwi-a-m- 10.00g lv_mlog 100.00 2) The 'M' attribute ('M' signifies a mirror created with '--nosync', while 'm' signifies a mirror created w/o '--nosync') must be preserved when extending a mirror created with '--nosync'. See #1 for example of 'M' attribute. 3) A mirror created without '--nosync' should extend with 'nosync' only when '--nosync' is explicitly used when extending. [EXAMPLE]# lvs vg; lvextend -L +5G vg/lv; lvs vg LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg mwi-a-m- 20.00m lv_mlog 100.00 Extending 2 mirror images. Extending logical volume lv to 5.02 GiB Logical volume lv successfully resized LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg mwi-a-m- 5.02g lv_mlog 0.39 vs. [EXAMPLE]# lvs vg; lvextend -L +5G vg/lv --nosync; lvs vg LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg mwi-a-m- 20.00m lv_mlog 100.00 Extending 2 mirror images. Extending logical volume lv to 5.02 GiB Logical volume lv successfully resized LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg Mwi-a-m- 5.02g lv_mlog 100.00 4) The 'm' attribute must change to 'M' when extending a mirror created without '--nosync' is extended with the '--nosync' option. (See #3 examples above.) 5) An inactive mirror's sync percent cannot be determined definitively, so it must not be allowed to skip resync. Instead, the extend should ask the user if they want to extend while performing a resync. [EXAMPLE]# lvchange -an vg/lv [EXAMPLE]# lvextend -L +5G vg/lv Extending 2 mirror images. Extending logical volume lv to 10.00 GiB vg/lv is not active. Unable to get sync percent. Do full resync of extended portion of vg/lv? [y/n]: y Logical volume lv successfully resized 6) A mirror that is performing recovery (as opposed to an initial sync) - like after a failure - is not allowed to extend with either an implicit or explicit nosync option. [You can simulate this with a 'corelog' mirror because when it is reactivated, it must be recovered every time.] [EXAMPLE]# lvcreate -m1 -L 5G -n lv vg --nosync --corelog WARNING: New mirror won't be synchronised. Don't read what you didn't write! Logical volume "lv" created [EXAMPLE]# lvs vg LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg Mwi-a-m- 5.00g 100.00 [EXAMPLE]# lvchange -an vg/lv; lvchange -ay vg/lv; lvs vg LV VG Attr LSize Pool Origin Snap% Move Log Copy% Convert lv vg Mwi-a-m- 5.00g 0.08 [EXAMPLE]# lvextend -L +5G vg/lv Extending 2 mirror images. Extending logical volume lv to 10.00 GiB vg/lv cannot be extended while it is recovering. 7) If 'no' is selected in #5 or if the condition in #6 is hit, it should not result in the mirror being resized or the 'm/M' attribute being changed. NOTE: A mirror created with '--nosync' behaves differently than one created without it when performing an extension. The former cannot be extended when the mirror is recovering (unless in-active), while the latter can. This is a reasonable thing to do since recovery of a mirror doesn't take long (at least in the case of an on-disk log) and it would cause far more time in degraded mode if the extension w/o '--nosync' was allowed. It might be reasonable to add the ability to force the operation in the future. This should /not/ force a nosync extension, but rather force a sync'ed extension. IOW, the user would be saying, "Yes, yes... I know recovery won't take long and that I'll be adding significantly to the time spent in degraded mode, but I need the extra space right now!".	2011-10-06 15:32:26 +00:00
Jonathan Earl Brassow	b19f01212e	Fix splitmirror in cluster having different DM/LVM views of storage. This patch also does some clean-up of the splitmirrors code. I've attempted to clean-up the splitmirrors code to make it easier to understand with fewer operations. I've tried to reduce the number of metadata operations without compromising the intermediate stages which are necessary for easy clean-up in the even of failure. These changes now correctly handle cluster situations - including exclusive cluster mirrors. Whereas before, a splitmirror operation would result in remote nodes having LVM commands report the newly split LV with a proper name while DM commands would report the old (pre-split) names of the device. IOW, there was a kernel/userspace mismatch.	2011-10-06 14:55:39 +00:00
Jonathan Earl Brassow	6c0b0e5d9a	Revert initial solution to bug 733114 - I/O error message during splitmirror The original commit comments can be located via this git commit ID: `7d8e615c0b` There were three possible solutions to the original problem proposed in the initial check-in. The one chosen was as follows: 2) Do like _remove_mirror_images does and suspend the original, then suspend the sub-lv (the error target), then resume the sub-lv, and finally resume the original LV. This seems like extra pointless operations to me, but it doesn't produce the error message (although, I'm not sure why) and it allows us to leave the visible flag in place. Turns out, the cluster also views the extra suspend/resume operations as pointless too and ignores them. So, this solution doesn't work in a cluster. Further, I've noticed that in addition to the remote cluster nodes still getting I/O errors from scanning the error target, they also have a different LVM and DM views of the same LV. IOW, while the LVM level (gotten from the LVM metadata) sees the correct name for the newly split LV, device-mapper still maintains the old names. Because the original fix failed to completely fix the problem (or work-around it) and because a better solution must be found to address the additional cluster issue of device renaming, I am reverting the above mentioned commit.	2011-10-06 14:49:16 +00:00
Jonathan Earl Brassow	83c606ae30	This patch fixes issues with improper udev flags on sub-LVs. The current code does not always assign proper udev flags to sub-LVs (e.g. mirror images and log LVs). This shows up especially during a splitmirror operation in which an image is split off from a mirror to form a new LV. A mirror with a disk log is actually composed of 4 different LVs: the 2 mirror images, the log, and the top-level LV that "glues" them all together. When a 2-way mirror is split into two linear LVs, two of those LVs must be removed. The segments of the image which is not split off to form the new LV are transferred to the top-level LV. This is done so that the original LV can maintain its major/minor, UUID, and name. The sub-lv from which the segments were transferred gets an error segment as a transitory process before it is eventually removed. (Note that if the error target was not put in place, a resume_lv would result in two LVs pointing to the same segment! If the machine crashes before the eventual removal of the sub-LV, the result would be a residual LV with the same mapping as the original (now linear) LV.) So, the two LVs that need to be removed are now the log device and the sub-LV with the error segment. If udev_flags are not properly set, a resume will cause the error LV to come up and be scanned by udev. This causes I/O errors. Additionally, when udev scans sub-LVs (or former sub-LVs), it can cause races when we are trying to remove those LVs. This is especially bad during failure conditions. When the mirror is suspended, the top-level along with its sub-LVs are suspended. The changes (now 2 linear devices and the yet-to-be-removed log and error LV) are committed. When the resume takes place on the original LV, there are no longer links to the other sub-lvs through the LVM metadata. The links are implicitly handled by querying the kernel for a list of dependencies. This is done in the '_add_dev' function (which is recursively called for each dependency found) - called through the following chain: _add_dev dm_tree_add_dev_with_udev_flags <* DM / LVM divide *> _add_dev_to_dtree _add_lv_to_dtree _create_partial_dtree _tree_action dev_manager_activate _lv_activate_lv _lv_resume lv_resume_if_active When udev flags are calculated by '_get_udev_flags', it is done by referencing the 'logical_volume' structure. Those flags are then passed down into 'dm_tree_add_dev_with_udev_flags', which in turn passes them to '_add_dev'. Unfortunately, when '_add_dev' is finding the dependencies, it has no way to calculate their proper udev_flags. This is because it is below the DM/LVM divide - it doesn't have access to the logical_volume structure. In fact, '_add_dev' simply reuses the udev_flags given for the initial device! This virtually guarentees the udev_flags are wrong for all the dependencies unless they are reset by some other mechanism. The current code provides no such mechanism. Even if '_add_new_lv_to_dtree' were called on the sub-devices - which it isn't - entries already in the tree are simply passed over, failing to reset any udev_flags. The solution must retain its implicit nature of discovering dependencies and be able to go back over the dependencies found to properly set the udev_flags. My solution simply calls a new function before leaving '_add_new_lv_to_dtree' that iterates over the dtree nodes to properly reset the udev_flags of any children. It is important that this function occur after the '_add_dev' has done its job of querying the kernel for a list of dependencies. It is this list of children that we use to look up their respective LVs and properly calculate the udev_flags. This solution has worked for single machine, cluster, and cluster w/ exclusive activation.	2011-10-06 14:45:40 +00:00
Jonathan Earl Brassow	a391248427	Fix vgsplit when there are mirrors that have mirrored logs. The problem as reported by "ben <benscott@nwlink.com>" on lvm-devel: vgsplit fails with mirrored mirror log #lvs --all -o lv_name,lv_attr,devices LV Attr Devices MyMirror mwi-- [MyMirror_mimage_0] Iwi--- /dev/sdq(0) [MyMirror_mimage_1] Iwi--- /dev/sdo(0) [MyMirror_mimage_2] Iwi--- /dev/sdi(0) [MyMirror_mlog] mwi--- [MyMirror_mlog_mimage_0] Iwi--- /dev/sds(0) [MyMirror_mlog_mimage_1] Iwi--- /dev/sde(0) #vgsplit -v "TestA" "TestB" "/dev/sdq" "/dev/sdo" "/dev/sdi" "/dev/sds" "/dev/sde" Checking for volume group "TestA" Checking for new volume group "TestB" Archiving volume group "TestA" metadata (seqno 213). Can't split mirror MyMirror between two Volume Groups AFTER FIX: [root@bp-01 ~]# lvs -a -o name,vg_name,devices vg new Volume group "new" not found Skipping volume group new LV VG Devices lv vg lv_mimage_0(0),lv_mimage_1(0) [lv_mimage_0] vg /dev/sdb1(0) [lv_mimage_1] vg /dev/sdc1(0) [lv_mlog] vg lv_mlog_mimage_0(0),lv_mlog_mimage_1(0) [lv_mlog_mimage_0] vg /dev/sdh1(0) [lv_mlog_mimage_1] vg /dev/sdi1(0) [root@bp-01 ~]# vgsplit vg new /dev/sd[bchi]1 New volume group "new" successfully split from "vg" [root@bp-01 ~]# lvs -a -o name,vg_name,devices vg new LV VG Devices lv new lv_mimage_0(0),lv_mimage_1(0) [lv_mimage_0] new /dev/sdb1(0) [lv_mimage_1] new /dev/sdc1(0) [lv_mlog] new lv_mlog_mimage_0(0),lv_mlog_mimage_1(0) [lv_mlog_mimage_0] new /dev/sdh1(0) [lv_mlog_mimage_1] new /dev/sdi1(0)	2011-10-06 14:17:45 +00:00
Alasdair Kergon	ad9c59e2e9	Clarify multi-name device filter pattern matching explanation in lvm.conf.5.	2011-10-04 20:49:24 +00:00
Zdenek Kabelac	a00cb3a6b0	Add lvm functions for sending messages. Functions are currently only needed for thin provissioning.	2011-10-03 18:37:47 +00:00
Alasdair Kergon	10d0d9c7c4	Introduce revert_lv for better pvmove cleanup. (One further fix needed to remove the stray pvmove LVs left behind.)	2011-09-27 22:43:40 +00:00
Alasdair Kergon	74e72bd75d	Replace incomplete pvmove activation failure recovery code with a message. As it stands, the recovery code can make things worse sometimes so it's better to insist on a proper 'pvmove --abort' cleanup.	2011-09-27 17:29:33 +00:00
Alasdair Kergon	1c26860d82	Abort if _finish_pvmove suspend_lvs fails instead of cleaning up incompletely. Change suspend_lvs to call vg_revert internally. Change vg_revert to void and remove superfluous calls after failed vg_commit.	2011-09-27 17:09:42 +00:00
Zdenek Kabelac	7ae124743e	Use execvp for clvmd restart Since execve passed only NULL as environ, we had lost all environment vars on restart - thus actually running 'different' clvmd then the one at start. Preserving environ allows to restart clvmd with the same settings (i.e. LD_LIBRARY_PATH) Add test for second restart.	2011-09-26 07:51:23 +00:00
Zdenek Kabelac	90d106ef19	Restart CLVMD with same cluster manager Add named cluster_ops to easily learn the name of the active cluster manager, so we are able to restart singlenode manager in testing. Add simple test for clvmd -S (restart) and -R (refresh) (though it needs some extensions).	2011-09-25 19:37:00 +00:00
Zdenek Kabelac	f1ab501a58	Fix log_error() usage Cosmetic - skip <bactrace> when error has been just printed in raid segtype. Add missing log_error if allocation would fail for unknown segtype.	2011-09-24 21:19:30 +00:00
Zdenek Kabelac	a4b6b51757	Improvements Simplify RUN_BASE Put .tests-stamp deps only for check target and fix its cleanup. Fix abs_top_srcdir. vgimportclone needs srcdir. Clean api subdir.	2011-09-24 21:10:19 +00:00
Zdenek Kabelac	00e72fcfee	Fix install_ocf When builddir is different from srcdir install_ocf: has not been able to find files for installation.	2011-09-24 21:05:03 +00:00
Zdenek Kabelac	d2c116058e	CLVMD support for LVM_CLVMD_BINARY and LVM_BINARY Read 2 environmental vars to learn about overide position for CLVMD and LVM binaries. We support LVM_BINARY in other script - and this way we could easily test restart in our test-suite.	2011-09-24 20:50:35 +00:00
Zdenek Kabelac	a039e204e7	CLVMD bugfix support for args -S -E Bugfix: Add (most probably unfinished) support for -E arg with list of exclusive locks. (During clvmd restart all exclusive locks would have been lost and in fact, if there would have been an exclusive lock, usage text would be printed and clvmd exits.) Instead of parsing list options multiple times every time some lock UUID is checked - put them straight into the hash table - make the code easier to understand as well. Remove was_ex_lock() function (replaced with dm_hash_lookup()). Swap return value for get_initial_state() (1 means success). Update man pages and usage info for -E option.	2011-09-24 20:48:34 +00:00
Jonathan Earl Brassow	efa3621a59	Add 'Volume Type' lv_attr characters for RAID and RAID_IMAGE. RAID_META is already handled.	2011-09-23 15:17:54 +00:00
Peter Rajnoha	9fa1d30a1c	Add activation/retry_deactivation to lvm.conf to retry deactivation of an LV.	2011-09-22 17:39:56 +00:00
Peter Rajnoha	125712bea0	Replace open_count check with holders/mounted_fs check on lvremove path. Before, we used to display "Can't remove open logical volume" which was generic. There 3 possibilities of how a device could be opened: - used by another device - having a filesystem on that device which is mounted - opened directly by an application With the help of sysfs info, we can distinguish the first two situations. The third one will be subject to "remove retry" logic - if it's opened quickly (e.g. a parallel scan from within a udev rule run), this will finish quickly and we can remove it once it has finished. If it's a legitimate application that keeps the device opened, we'll do our best to remove the device, but we will fail finally after a few retries.	2011-09-22 17:33:50 +00:00
Jonathan Earl Brassow	f989a55539	Disallow the creation of mirrors (mirror or raid1 segtype) with only one leg. If you specify the segment type (e.g. --type mirror) and the mirrors argument as zero, it would result in a mirrored LV with only one image. While the device may be valid in theory, it should not be allowed in practice. It also makes it difficult on the conversion tools, since they react badly to single-image mirrors.	2011-09-22 15:36:21 +00:00
Zdenek Kabelac	f79f7250ce	Clvmd restart cleanup Patch fixes Clang warnings about possible access via lv_name NULL pointer. Replaces allocation of memory (strdup) with just pointer assignment (since execve is being called anyway). Checks for !*lv_name only when lv_name is defined. (and as I'm not quite sure what state this really is - putting a FIXME around - as this rather looks suspicios ??). Add debug print of passed clvmd args.	2011-09-22 09:47:34 +00:00
Zdenek Kabelac	f1f42ab732	Add all exclusive locks to clvmd restart option args Fix bug when only every even lock has been passed. Warning: currently -E causes clvmd to exit with usage text being printed.	2011-09-22 09:45:24 +00:00
Milan Broz	f5d39ec97a	Always sent the whole command header in restart/reload clvmd commands. (Newly added check catch this as invalid packet.) (N.B. that code is so fragile that it need full rewrite soon:-)	2011-09-21 13:40:46 +00:00
Zdenek Kabelac	d9bba4f16f	Check for failing 'stat' and skip this loop iteration (since data in statbuf are invalid). Check whether sysconf managed to find _SC_PAGESIZE. Report at least debug warning about failing unlink (logging scheme here seems to be a different then in lvm). Duplicate terminal FDs and use similar code as is made in clvmd and cleanup warns about missing open/close tests. FIXME: Looks like we already have 3 instancies of the same code in lvm repo.	2011-09-21 10:42:53 +00:00
Zdenek Kabelac	da1350d420	Add missing log_error() to lvresize command when fsadm tool fails Also add test case	2011-09-21 10:39:47 +00:00
Zdenek Kabelac	8f8c5580fd	Add support for DM_DEV_DIR Follow other commands support this directory setting. Useful for test suite.	2011-09-19 19:36:52 +00:00
Zdenek Kabelac	ce840163c0	Revert patch Caller of exec must report log_error when rstatus is passed.	2011-09-19 18:38:43 +00:00
Zdenek Kabelac	4eeff46bf2	Use log_error instead of log_verbose when executed command fails	2011-09-19 14:54:23 +00:00
Zdenek Kabelac	13e3c25ade	Add support for non /dev devices Since test suite is not using /dev - add support for such dirs into fsadm.	2011-09-19 14:52:33 +00:00
Zdenek Kabelac	53c09bce42	Support different PATH setting When fsadm is test - it needs to execute lvm and fsadm from non-standard path setting. So adding a support in fsadm script when user set LVM_BINARY, then the lvm command invoced from fsadm will have the same PATH setting as before entering fsadm command. Needed for testing.	2011-09-19 13:51:09 +00:00
Zdenek Kabelac	d2010960c9	Surround all executed commands with quotes In case someone would use filename paths with spaces when changing this script surround commands with '"'. With default settings there is no change in behavior.	2011-09-19 13:47:37 +00:00
Zdenek Kabelac	dd96ceda43	Fix missing '$' in test	2011-09-19 13:43:50 +00:00
Zdenek Kabelac	5f3f06db66	Move debug message so it does not look like we are executing command in the middle of critical_section in log trace.	2011-09-19 12:48:02 +00:00

... 6 7 8 9 10 ...

2865 Commits