shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	fecd043cca	raid: split preserves local exlusive activation	2016-12-14 11:40:01 +01:00
Zdenek Kabelac	77e09c3fb4	raid: activation with list Commit `0690392040` revealed a problem in raid metadata manipulation. We do two operations in one table reload: - raid leg/image extraction - rename remaining raid legs This should be made in separate steps. Otherwise we do an uncorrectable table change on error path (leaving tables for admin and dmsetup). As a hotfix - restore the previous logic and use a single new function _lv_update_and_reload_list which activates exclusively extracted LVs on the list before resuming suspended raid LV. This restore 'rename' functionality upon resume. Also still preserve the 'origin_only' logic - although we know it's not working properly for cluster and LV stacking. Further fixes are needed.	2016-12-14 11:37:02 +01:00
Zdenek Kabelac	fce7449d73	cleanup: remove wrapping function backup is not 'tested' for success and also it should actually happen just when command is finished. We do not target to make backups with each inter-step metadata change.	2016-12-13 22:07:52 +01:00
Zdenek Kabelac	c7da16e5f1	cleanup: log message updates	2016-12-13 22:07:52 +01:00
Zdenek Kabelac	a8f5e1f274	cleanup: more lv_is_ usage	2016-12-13 22:07:52 +01:00
Zdenek Kabelac	47b96c3537	cleanup: allocate NAME_LEN size for lv name	2016-12-13 22:07:52 +01:00
Zdenek Kabelac	d0fe3ec0c5	raid: avoid manipulation of segment status RAID is LV property TODO: only 2 flags are seg->status: PVMOVE & MERGING At least the second one should be soon elimanted as again we merge LV not a segment.	2016-12-13 22:07:52 +01:00
Zdenek Kabelac	0690392040	raid: improve table reload sequence This is another place for 'common' use pattern or reload and activation of deleted devices. (Moving the exclusive activation to _deactivate_and_remove_lvs()). TODO: looks like halve of raid function is reloading just 'origin' - and the other full LV.	2016-12-13 22:07:52 +01:00
Zdenek Kabelac	794093722c	debug: missing stack traces	2016-12-11 23:24:19 +01:00
Zdenek Kabelac	67f9e6b175	raid: avoid _ at end of name of extracted metadata LV Do not generate @PREFIX@vg/LV1_rmeta_1_extracted_.	2016-12-11 23:20:51 +01:00
Zdenek Kabelac	55ca8043d4	raid: optimize clearing of lvs Activate whole list of metadata lvs first before clearing them. (Similar to commit `ada5733c56`) TODO: make this clearing in a single common function.	2016-12-11 23:19:41 +01:00
Zdenek Kabelac	8831a541a8	raid: fix delete on clustered vg For clustered VG ensure lock is grabbed first, so later deactivation works. TODO: fix tree to solve device removal automatically.	2016-12-11 23:18:22 +01:00
Zdenek Kabelac	0c8369099b	raid: fix raid1 to mirror conversion Fix order of operation when converting raid1 into old mirror. Before any later metadata modification are initiated prepare mirror_log device with all clearing. Then directly convert raid1 into mirror with mirror_log. This convertion now properly see as precommitted metadata new 'mirror' and committed old 'raid' and is able to preload all LVs.	2016-12-11 23:17:22 +01:00
Zdenek Kabelac	15e4ab3e93	cleanup: messages in raid Use display_lvname and add 'dots'. Add some missing WARNING and log_debug_metadata.	2016-12-09 15:15:02 +01:00
Zdenek Kabelac	b7cf7b1d3a	gcc: quiet warning about unused function Once this function will need to be used, git revert.	2016-11-25 15:02:36 +01:00
Zdenek Kabelac	108d9a63fd	cleanup: indent	2016-11-25 15:01:28 +01:00
Zdenek Kabelac	1a4f13eb6e	cleanup: add some dots and use display_lvname Just some more VG/LV printing.	2016-11-25 15:01:27 +01:00
Zdenek Kabelac	1d58074d9f	debug: more stacktrace corrections Continue previous patch dropping some unneeded stack traces after printed log_error/warn messages.	2016-11-25 14:58:28 +01:00
Zdenek Kabelac	5fd5cfe061	cleanup: use display_lvname and msg cleanup Use display_lvname in tracing messages. Add some missing 'dots' to messages.	2016-11-23 17:48:01 +01:00
Alasdair G Kergon	6de05cf5f5	raid: Remove fixed FIXME	2016-11-15 20:45:55 +00:00
Zdenek Kabelac	6db5b91231	cleanup: avoid using double __ in extracted image name	2016-11-11 16:58:20 +01:00
Heinz Mauelshagen	e611f82a11	lvconvert: fix raid repair regression Limit prevention to raid1 as intended with commit `8270ff5702`. Related to rhbz1311765	2016-10-28 21:45:00 +02:00
Heinz Mauelshagen	8270ff5702	lvconvert: prevent non-synced raid1 primary leg repair (Automatic) repair may not be allowed during the initial sync of an upconverted linear LV, because the data on the failing, primary leg hasn't been completely synchronized to the N-1 other legs of the raid1 LV (replacing failed legs during repair involves discontinuing access to any replaced legs data, thus preventing data recovery on the primary leg e.g. via dd_rescue). Even though repair would not cause data loss when adding legs to a fully synced raid1 LV, we don't have information yet defining this state yet (e.g. a raid1 LV flag telling the fully synchronized status before any legs were added), hence can't automatically decide to allow to repair. If nonetheless a repair on a non-synced raid1 LVs is intended, the "--force" option has to be provided. Resolves: rhbz1311765	2016-10-28 15:55:10 +02:00
Heinz Mauelshagen	0468f5da6d	raid_manip: fix typo Related to rhbz1386184	2016-10-26 17:53:55 +02:00
Heinz Mauelshagen	de78e8eae7	lvconvert: position dedicated parity device in raid4 conversions porperly On conversions between striped/raid0* and raid4, the kernel expects the dedicated raid4 parity SubLVs in the first segment area rather than in the last it's been allocated to, thus the data mapping ain't proper. Enhance lvconvert (lib/metadata/raid_manip.c) to shift the dedicated parity SubLVs on conversions from striped/raid0* to raid4 and vice-versa. In case of raid0_meta -> raid4 where the MD raid0 personality already has stored RAID array device positions in the superblocks, the MetaLVs have to be cleared so that the kernel doesn't fail validating the array positions after lvm has shifted them up by one. Add more tests to lvconvert-raid-takeover.sh including one to check for mapping flaws by converting a created raid4 with filesystem -> striped and fsck it. Whilst on it: - add missing direct striped -> raid4 conversion to the takeover array to avoid an intermim conversion from striped -> raid0* - clean up the takeover array - allow lvconvert to actually call lv_raid_convert() on all takeover requests in order to check parameters and display messages provided by takeover functions rather than just "...not supported" from within lvconvert - fix a typo Resolves: rhbz1386148	2016-10-21 19:00:31 +02:00
Zdenek Kabelac	9fe4f2337b	cleanup: drop assign before use Drop unneeded assigns singe vars are set later in code before their first use (Coverity).	2016-10-03 17:49:55 +02:00
Alasdair G Kergon	1bc546269a	lvconvert: Disable thin pool raid conversion while active. Works if the pool is inactive. Activation code doesn't notice a new raid dependency in on-disk metadata when a thin LV is already active. https://bugzilla.redhat.com/1365286	2016-09-27 18:22:54 +01:00
Heinz Mauelshagen	5d455b28fc	lvconvert: fix (automatic) raid repair regression The dm-raid target now rejects device rebuild requests during ongoing resynchronization thus causing 'lvconvert --repair ...' to fail with a kernel error message. This regresses with respect to failing automatic repair via the dmeventd RAID plugin in case raid_fault_policy="allocate" is configured in lvm.conf as well. Previously allowing such repair request required cancelling the resynchronization of any still accessible DataLVs, hence reasoning potential data loss. Patch allows the resynchronization of still accessible DataLVs to finish up by rejecting any 'lvconvert --repair ...'. It enhances the dmeventd RAID plugin to be able to automatically repair by postponing the repair after synchronization ended. More tests are added to lvconvert-rebuild-raid.sh to cover single and multiple DataLV failure cases for the different RAID levels. - resolves: rhbz1371717	2016-09-21 00:39:29 +02:00
Alasdair G Kergon	2fde4399a0	lvconvert: Fix --splitmirrors segfault with incorrect PV. Commit `9ee071705b` misunderstood integer promotion, but it's simpler to detect -1 more directly.	2016-08-26 01:21:01 +01:00
Heinz Mauelshagen	8e9d5d12ae	pvmove: prohibit non-resilient collocation of RAID SubLVs 'pvmove -n name pv1 pv2' allows to collocate multiple RAID SubLVs on pv2 (e.g. results in collocated raidlv_rimage_0 and raidlv_rimage_1), thus causing loss of resilence and/or performance of the RaidLV. Fix this pvmove flaw leading to potential data loss in case of PV failure by preventing any SubLVs from collocation on any PVs of the RaidLV. Still allow to collocate any DataLVs of a RaidLV with their sibling MetaLVs and vice-versa though (e.g. raidlv_rmeta_0 on pv1 may still be moved to pv2 already holding raidlv_rimage_0). Because access to the top-level RaidLV name is needed, promote local _top_level_lv_name() from raid_manip.c to global top_level_lv_name(). - resolves rhbz1202497	2016-08-15 18:22:32 +02:00
Heinz Mauelshagen	9c9b9b276a	raid_manip: pay attention to PVs listed on command line when allocating MetaLVs Adding MetaLVs to given DataLVs (e.g. raid0 -> raid0_meta takeover) wasn't paying attention to any PVs listed on the lvconvert command line.	2016-08-13 00:20:01 +02:00
Heinz Mauelshagen	6d52b17dfc	raid_manip: add missing code avoiding MetaLV collocation on the same PV Adding MetaLVs to given DataLVs (e.g. raid0 -> raid0_meta takeover), _avoid_pvs_with_other_images_of_lv() was missing code to prohibit allocation when called with a just allocated MetaLV to prohibit collaocation of the next allocated MetaLV on the same PV. - resolves rhbz1366738	2016-08-12 22:46:57 +02:00
Alasdair G Kergon	93b61c07eb	raid: Avoid double suffix on rmeta LV name paired with rimage LV.	2016-08-11 23:31:49 +01:00
Alasdair G Kergon	76ef2d15d8	lvconvert: Support raid0<->raid4 and mirror<->raid1. Only simple takeover cases so far.	2016-08-07 00:56:08 +01:00
Alasdair G Kergon	de7f1deb80	raid: Report supported lvconvert conversions if invalid.	2016-08-07 00:30:26 +01:00
Alasdair G Kergon	6f236c3353	raid: Add workaround to prepare for raid4 conversions.	2016-08-07 00:07:06 +01:00
Alasdair G Kergon	ba0c26a078	raid: Pass list of LVs to remove into more fns.	2016-08-06 23:46:45 +01:00
Alasdair G Kergon	30884208d4	raid: Move two functions.	2016-08-06 23:29:27 +01:00
Heinz Mauelshagen	8f25ad6416	raid_manip: fix log print format from commit `d2c3b23e6d`	2016-08-05 16:29:56 +02:00
Heinz Mauelshagen	d2c3b23e6d	lvchange: Allow device specification when requesting a repair 'lvchange --resync LV' or 'lvchange --syncaction repair LV' request the RAID layout specific parity blocks in raid4/5/6 to be recreated or the mirrored blocks to be copied again from the master leg/copy for raid1/10, thus not allowing a rebuild of a particular PV. Introduce repeatable option '--[raid]rebuild PV' to allow to request rebuilds of specific PVs in a RaidLV which are known to contain corrupt data (e.g. rebuild a raid1 master leg). Add test lvchange-rebuild-raid.sh to test/shell doing rebuild variations on raid1/10 and 5; add aux function check_status_chars to support the new test. - Resolves rhbz1064592	2016-08-05 16:01:46 +02:00
Alasdair G Kergon	91f866f786	raid: Tell lib whether stripesize was specified	2016-08-05 14:28:14 +01:00
Alasdair G Kergon	7482ff93b8	raid: Turn lv_raid_change_image_count into wrapper Eventually the separate entry point will disappear.	2016-08-05 14:17:54 +01:00
Alasdair G Kergon	b1b0b134ec	raid0: Validate presence of raid0 meta_areas more tightly.	2016-08-04 21:15:07 +01:00
Alasdair G Kergon	4a15abe865	striped: Add precise macros for original segtype. The existing striped macros include raid0 segments.	2016-08-04 01:24:39 +01:00
Alasdair G Kergon	fdc3fcbfce	lvconvert: Pass region_size to lv_raid_convert.	2016-08-02 23:51:20 +01:00
Alasdair G Kergon	c490be9134	Revert "thin: when converting a thin pool data or metadata LV from" This reverts commit `237f84e038`. This case failed: lvcreate --type raid1 -m1 -l2 vg99 lvcreate -aey -l2 -s vg99/lvol0 lvconvert -m2 vg99/lvol0	2016-08-01 15:17:44 +01:00
Heinz Mauelshagen	237f84e038	thin: when converting a thin pool data or metadata LV from linear to raid1, the linear wasnt't switched to the raid1 mapping, thus creating the false impression of resilience.	2016-07-29 19:17:12 +02:00
Alasdair G Kergon	3bc1adc404	raid_manip: Some validation functions.	2016-07-24 01:40:24 +01:00
Alasdair G Kergon	ea543b5c6f	raid_manip: Fix stripe_size type to uint32_t.	2016-07-24 01:35:04 +01:00
Alasdair G Kergon	4a544ae272	raid_manip: Reorder some functions.	2016-07-24 01:31:30 +01:00
Heinz Mauelshagen	94207dfd68	lvconvert: raid0 replace attempt segfaults Any failing stripes in raid0/raid0_meta type LVs cause data loss, thus replacement via 'lvconvert --replace...' does not make sense. Patch prohibits replacement on raid0/raid0_meta LVs. - resolves rhbz1356734	2016-07-18 20:16:40 +02:00
Alasdair G Kergon	5af311ddd8	macros: Add lv_is_not_synced.	2016-07-14 14:21:01 +01:00
Heinz Mauelshagen	9c27573493	raid_manip: 'vgreduce --removemissing --force ...' segfaults on raid0 LV An unconditional access to the non-existing MetaLV of a raid0 LV in lv_raid_remove_missing() was causing the segfault. Only call log_debug() on replacements of existing MetaLVs. - resolves rhbz1354646	2016-07-12 17:55:01 +02:00
Zdenek Kabelac	fd53d86eea	cleanup: gcc warns removal Ensure vars have always defined value.	2016-07-12 10:39:33 +02:00
Alasdair G Kergon	c1a66d4fc6	coverity: Fixes for recent changes.	2016-07-06 16:09:32 +01:00
Alasdair G Kergon	d8c2677ab9	raid0: Add raid0_meta segment type.	2016-07-01 22:20:54 +01:00
Alasdair G Kergon	686acce23f	lvconvert: Conversions between striped and raid0.	2016-06-28 23:44:15 +01:00
Alasdair G Kergon	79446ffad7	raid: Infrastructure for raid takeover.	2016-06-28 02:42:30 +01:00
Alasdair G Kergon	dfc516f9bf	lvconvert: Refactor argument handling code. Begin disentangling the different lvconvert modes of operation from each other.	2016-06-22 18:40:22 +01:00
Alasdair G Kergon	b896f7de1e	raid0: Standardise meta_areas checks before access.	2016-05-23 22:55:13 +01:00
Heinz Mauelshagen	18cf5e8e67	raid_manip: allow for raid leg to be replaced when not both data and metadata image are on pvs resolves rhbz#1130329	2016-03-07 15:25:30 +01:00
Peter Rajnoha	fc628e92ba	metadata: also look at historical LVs when checking LV name availability Live LVs and historical LVs are in one namespace and the name needs to be unique in whole VG.	2016-03-03 13:50:59 +01:00
Zdenek Kabelac	e04a0184cb	cleanup: use lv_is_partial Check for PARTIAL_LV flag in standard way.	2016-03-03 10:17:03 +01:00
Zdenek Kabelac	dbc71dc05e	gcc: cleanup some sign warnings When comparing unsigned with int, the comparision is made as 'unsigned' type, so make it rather explicit which type is being compared.	2016-02-23 12:25:25 +01:00
Zdenek Kabelac	fcbef05aae	doc: change fsf address Hmm rpmlint suggest fsf is using a different address these days, so lets keep it up-to-date	2016-01-21 12:11:37 +01:00
Zdenek Kabelac	cad3568def	raid: drop unneeded NULL test Skip testing target_pvs for NULL, we already dereference it in many other places. If check would ever be needed - it needs to be in front of _raid_extract_images().	2015-11-17 19:01:25 +01:00
Zdenek Kabelac	007be91e3d	raid: ensure area_count is at least 2 Enusure we will not divide by 0.	2015-11-13 11:17:06 +01:00
Zdenek Kabelac	57c2a1ae8c	raid: mark intententional copy and paste Coverity: add this extra comment, to let Coverity know this slightly changed copy&paste code is intentional.	2015-11-09 10:22:52 +01:00
Heinz Mauelshagen	b33d7586e7	raid_manip: fix wrong image size allocation on raid10 "lvconvert --replace ..."	2015-10-02 17:09:37 +02:00
Alasdair G Kergon	39a97d86f0	segtypes: Add and use new segtype macros. Includes fixing an inverted raid10 segtype check in _raid_add_target_line.	2015-09-24 14:59:07 +01:00
Alasdair G Kergon	214e2cddf6	segtypes: Use SEG_TYPE_NAME_ string constants.	2015-09-22 19:04:12 +01:00
Zdenek Kabelac	32d6ca9196	cleanup: show error message Add error message on error path.	2015-09-03 23:34:36 +02:00
Zdenek Kabelac	55a9262bdb	cleanup: unused header files (Coverity)	2015-08-18 15:00:08 +02:00
David Teigland	fe70b03de2	Add lvmlockd	2015-07-02 15:42:26 -05:00
Alasdair G Kergon	4c629a5257	locking: Add missing error handling. Add missing error logging and detection to unlock_vg and callers of sync_local_dev_names etc.	2015-06-30 18:54:38 +01:00
Alasdair G Kergon	0300730cc9	pre-release	2015-05-15 23:19:29 +01:00
Zdenek Kabelac	88421c883e	raid: reread status when 0 is reported When kernel target reports sync status as 0% it might as well mean it's 100% in sync, just the target is in some race inconsistent state - so reread once again and take a more optimistic value ;) Patch tries to work around: https://bugzilla.redhat.com/show_bug.cgi?id=1210637	2015-05-04 13:09:05 +02:00
Zdenek Kabelac	434031719e	raid: check lock holding LV Since raid could be used as stacked LV - check lock holding LV for proper locking type for clustered usage.	2015-01-30 14:16:27 +01:00
Zdenek Kabelac	93b9015760	raid: fix raid image splitting When raid leg is extracted, now the preload code handles this state correctly and put proper new table entry into dm tree, so the activation of extracted leg and removed metadata works after commit.	2015-01-28 13:45:18 +01:00
Peter Rajnoha	0fddc5ab5c	coverity: missing return value check Reported by coverity for code added recently - _avoid_pvs_with_other_images_of_lv which calls process_each_sub_lv and not checking return value.	2015-01-22 10:11:19 +01:00
Heinz Mauelshagen	302b6c99a7	raid_manip: v2 fix multi-segment misallocation on 'lvconvert --repair' The previous patch felt short WRT disabling allocation on PVs holding other legs of the RAID LV persistently; this patch introduces an internal, transient PV flag PV_ALLOCATION_PROHIBITED to address this very problem. General problem description for completeness: An 'lvconvert --repair $RAID_LV" to replace a failed leg of a multi-segment RAID10/4/5/6 logical volume can lead to allocation of (parts of) the replacement image component pair on the physical volume of another image component (e.g. image 0 allocated on the same PV as image 1 silently impeding resilience). Patch fixes this severe resilince issue by prohibiting allocation on PVs already holding other legs of the RAID set. It allows to allocate free space on any operational PV already holding parts of the image component pair.	2015-01-16 13:44:16 +01:00
Heinz Mauelshagen	cdd17eee37	raid_manip: fix multi-segment misallocation on 'lvconvert --repair' An 'lvconvert --repair $RAID_LV" to replace a failed leg of a multi-segment RAID10/4/5/6 logical volume can lead to allocation of (parts of) the replacement image component pair on the physical volume of another image component (e.g. image 0 allocated on the same PV as image 1 silently impeding resilience). Patch fixes this severe resilince issue by prohibiting allocation on PVs already holding other legs of the RAID set. It allows to allocate free space on any operational PV already holding parts of the image component pair.	2015-01-14 13:41:55 +01:00
Heinz Mauelshagen	aaecbb1818	raid: fix mirror image naming when converting from mirror to raid1 $ lvcreate -l1 -m1 --type mirror vg Logical volume "lvol0" created. $ lvconvert --type raid1 vg/lvol0 Before: $ lvs -a vg LV VG Active Attr LSize Cpy%Sync Layout Role lvol0 vg active rwi-a-r--- 4.00m 100.00 raid,raid1 public [lvol0_mimage_0_rimage_0] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_mimage_1_rimage_1] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_rmeta_0] vg active ewi-aor--- 4.00m linear private,raid,metadata [lvol0_rmeta_1] vg active ewi-aor--- 4.00m linear private,raid,metadata Incorrect name: lvol0_mimage_0_rimage_0 With this patch applied: $ lvs -a vg LV VG Active Attr LSize Cpy%Sync Layout Role lvol0 vg active rwi-a-r--- 4.00m 100.00 raid,raid1 public [lvol0_rimage_0] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_rimage_1] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_rmeta_0] vg active ewi-aor--- 4.00m linear private,raid,metadata [lvol0_rmeta_1] vg active ewi-aor--- 4.00m linear private,raid,metadata Proper name: lvol0_rimage_0	2015-01-07 13:25:08 +01:00
Zdenek Kabelac	f3bd9a2797	raid: properly rename split image When we split leg from raid - we take a proper new lock for a new LV. However for now activation checks only 'existince' of device UUID, but it's not validating device has a proper name. As a quick fix call suspend()/resume() to rename after split mirror.	2014-12-05 13:39:42 +01:00
Zdenek Kabelac	62c7027a7c	raid: fix activation order Cannot 'activate' volumes in suspend state and we need to use lock holding LV for suspend/resume.	2014-11-10 22:05:48 +01:00
Zdenek Kabelac	1c7aae40a1	raid: query lock holder Ask for lock the proper LV. Use the top-most LV to query for locally exclusive lock. The rest of operations are then using 'lv_info()' TODO: Check all devices are reloaded from proper level. In general any query on lv_is_active is supposed to be running ona lv_lock_holder() volume.	2014-10-24 16:39:31 +02:00
Zdenek Kabelac	9411c19b31	segments: introduce lvseg_name Instead of segtype->ops->name() introduce lvseg_name(). This also allows us to leave name() function 'empty' for default return of segtype->name. TODO: add functions for rest of ops->	2014-10-24 16:39:30 +02:00
Heinz Mauelshagen	45f57477f4	cleanup: Use segtype.h definitions of segment type names wherever possible We are not using already defined segement type names where we could. There is a lot of other places in device-mapper and LVM2 we have those hardcoded so we should better finally have a common interface in libdevmapper to avoid this.	2014-09-24 15:24:41 +02:00
Zdenek Kabelac	84cdf85bd2	cleanup: constify activation usage of lv pointer Let's enforce cheking of write access to LV by compiler. Activation part does never need to write anything to LV so keep LV pointer const.	2014-09-24 10:54:47 +02:00
Alasdair G Kergon	979be63f25	mirrors: Fix checks for mirror/raid/pvmove LVs. Try to enforce consistent macro usage along these lines: lv_is_mirror - mirror that uses the original dm-raid1 implementation (segment type "mirror") lv_is_mirror_type - also includes internal mirror image and log LVs lv_is_raid - raid volume that uses the new dm-raid implementation (segment type "raid") lv_is_raid_type - also includes internal raid image / log / metadata LVs lv_is_mirrored - LV is mirrored using either kernel implementation (excludes non-mirror modes like raid5 etc.) lv_is_pvmove - internal pvmove volume	2014-09-16 00:13:46 +01:00
Zdenek Kabelac	ae08a3a294	cleanup: skip unused assign Reset of tmp_names is only needed in else{} path.	2014-09-12 13:51:31 +02:00
Zdenek Kabelac	07b3e6cd74	cleanup: avoid strlen() we know max size Just use max NAME_LEN size buffer and copy the name.	2014-09-12 13:51:31 +02:00
Zdenek Kabelac	ab7977de7b	cleanup: simplify _extract_image_components Reorder test - first check for writable flag and then allocate.	2014-09-12 13:51:31 +02:00
Zdenek Kabelac	6898131091	cleanup: missing error message	2014-09-12 13:51:31 +02:00
Zdenek Kabelac	3e57143abd	cleanup: better error messages	2014-09-12 13:51:30 +02:00
Zdenek Kabelac	08914ed7c1	raid: destroy allocation handle on error path Don't leak ah memory pool on error path.	2014-09-12 13:51:30 +02:00
Zdenek Kabelac	76c3c94bd2	cleanup: update _alloc_image_component function Return allocated volume directly instead of 1/0.	2014-09-12 13:51:30 +02:00
Zdenek Kabelac	126463ad1f	cleanup: plain code reindent Just simple reindent and brace changes.	2014-09-12 13:51:30 +02:00
Zdenek Kabelac	ad376e9e00	debug: add missing stack trace on error path	2014-09-12 13:51:29 +02:00
Zdenek Kabelac	c10c16cc35	raid: use _generate_raid_name Use new function to get implicit name validation (so we do not exit with internal error on metadata validation).	2014-09-12 13:51:29 +02:00
Zdenek Kabelac	2db0312455	raid: add function for name creation Add name for construction and validation of raid subvolume name with a given suffix. TODO: check if reusable for mirrors as well.	2014-09-12 13:51:29 +02:00
Zdenek Kabelac	40b7b107b1	raid: check result of get_segtype_from_string Error here is rather highly unpexpected for these types, but stay consistent with rest of the code and don't use unchecked value.	2014-09-12 13:45:50 +02:00
Zdenek Kabelac	08bde75093	raid: add missing archive call Before starting to update raid metadata, archive existing unmodified one.	2014-09-12 13:45:49 +02:00
Zdenek Kabelac	569184a3bb	raid: add missing vg_revert After failing vg_write() and suspend_lv() there was missing vg_revert() call.	2014-09-12 13:45:14 +02:00
Zdenek Kabelac	dd1fa0e808	raid: add missing backups Add backup() calls that were missing after successful update of metadata.	2014-09-12 13:42:57 +02:00
Zdenek Kabelac	c710f02e01	lv_update_and_reload: replace code sequence Use lv_update_and_reload() and lv_update_and_reload_origin() to handle write/suspend/commit/resume sequence. In few places this properly handle vg_revert() after suspend failure, and also ensures there is metadata backup after successful vg_commit().	2014-09-09 19:20:09 +02:00
Alasdair G Kergon	99e3c13012	raid: Moved degraded activation code to raid_manip. Adjust some messages & fn names.	2014-07-22 20:50:29 +01:00
Jonathan Brassow	ed3c2537b8	raid: Allow repair to reuse PVs from same image that suffered a PV failure When repairing RAID LVs that have multiple PVs per image, allow replacement images to be reallocated from the PVs that have not failed in the image if there is sufficient space. This allows for scenarios where a 2-way RAID1 is spread across 4 PVs, where each image lives on two PVs but doesn't use the entire space on any of them. If one PV fails and there is sufficient space on the remaining PV in the image, the image can be reallocated on just the remaining PV.	2014-06-25 22:26:06 -05:00
Jonathan Brassow	b35fb0b15a	raid/misc: Allow creation of parallel areas by LV vs segment I've changed build_parallel_areas_from_lv to take a new parameter that allows the caller to build parallel areas by LV vs by segment. Previously, the function created a list of parallel areas for each segment in the given LV. When it came time for allocation, the parallel areas were honored on a segment basis. This was problematic for RAID because any new RAID image must avoid being placed on any PVs used by other images in the RAID. For example, if we have a linear LV that has half its space on one PV and half on another, we do not want an up-convert to use either of those PVs. It should especially not wind up with the following, where the first portion of one LV is paired up with the second portion of the other: ------PV1------- ------PV2------- [ 2of2 image_1 ] [ 1of2 image_1 ] [ 1of2 image_0 ] [ 2of2 image_0 ] ---------------- ---------------- Previously, it was possible for this to happen. The change makes it so that the returned parallel areas list contains one "super" segment (seg_pvs) with a list of all the PVs from every actual segment in the given LV and covering the entire logical extent range. This change allows RAID conversions to function properly when there are existing images that contain multiple segments that span more than one PV.	2014-06-25 21:20:41 -05:00
Peter Rajnoha	3208396ce5	coverity: fix issues reported by coverity	2014-06-24 14:58:53 +02:00
Peter Rajnoha	cfed0d09e8	report: select: refactor: move percent handling code to libdm for reuse	2014-06-17 16:27:21 +02:00
Zdenek Kabelac	9240aca369	raid: cleanup error messages Add log_error messages on error paths.	2014-05-27 17:08:49 +02:00
Jonathan Brassow	6c6468f91d	RAID: Improve an error message When down-converting a RAID1 LV, if the user specifies too few devices, they will get a confusing message. Ex: [root]# lvcreate -m 2 --type raid1 -n raid -L 500M taft Logical volume "raid" created [root]# lvconvert -m 0 taft/raid /dev/sdd1 Unable to extract enough images to satisfy request Failed to extract images from taft/raid This patch makes the error message a bit clearer by telling the user the count they are trying to remove and the number of devices they supplied. [root@bp-01 lvm2]# lvcreate --type raid1 -m 3 -L 200M -n lv vg Logical volume "lv" created [root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sdb1 Unable to remove 3 images: Only 1 device given. Failed to extract images from vg/lv [root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sd[bc]1 Unable to remove 3 images: Only 2 devices given. Failed to extract images from vg/lv [root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sd[bcd]1 [root@bp-01 lvm2]# lvs -a -o name,attr,devices vg LV Attr Devices lv -wi-a----- /dev/sde1(1) This patch doesn't work in all cases. The user can specify the right number of devices, but not a sufficient amount of devices from the LV. This will produce the old error message: [root@bp-01 lvm2]# lvconvert -m -3 vg/lv /dev/sd[bcf]1 Unable to extract enough images to satisfy request Failed to extract images from vg/lv However, I think this error message is sufficient for this case.	2014-04-03 16:57:41 -05:00
Jonathan Brassow	4b6e3b5e5e	allocation: Allow approximate allocation when specifying size in percent Introduce a new parameter called "approx_alloc" that is set when the desired size of a new LV is specified in percentage terms. If set, the allocation code tries to get as much space as it can but does not fail if can at least get some. One of the practical implications is that users can now specify 100%FREE when creating RAID LVs, like this: ~> lvcreate --type raid5 -i 2 -l 100%FREE -n lv vg	2014-02-13 21:10:28 -06:00
Zdenek Kabelac	ef6c5795a0	raid: add temporary activation for raid metadata clear Use LV_TEMPORARY when activating devices for clearing raid metadata.	2014-02-04 14:51:05 +01:00
Zdenek Kabelac	8c96afd361	cleanup: use compound literals for wipe_lv Optimize and cleanup recently introduced new function wipe_lv. Use compound literals to get nicely initialized wipe_params struct. Pass in lv as explicit argument for wipe_lv. Use cmd from lv structure. Initialize only non-null members so it's easy to see what is the special arg.	2013-11-28 12:45:52 +01:00
Peter Rajnoha	b6dab4e059	lv_manip: rename set_lv -> wipe_lv and include signature wiping capability Use common wipe_lv (former set_lv) fn to do zeroing as well as signature wiping if needed. Provide new struct wipe_lv_params to define the functionality. Bind "lvcreate -W/--wipesignatures y" with proper wipe_lv call. Also, add "yes" and "force" to lvcreate_params so it's possible to apply them for the prompt: "WARNING: %s detected on %s. Wipe it? [y/n]".	2013-11-27 15:48:15 +01:00
Jonathan Brassow	2691f1d764	RAID: Make RAID single-machine-exclusive capable in a cluster Creation, deletion, [de]activation, repair, conversion, scrubbing and changing operations are all now available for RAID LVs in a cluster - provided that they are activated exclusively. The code has been changed to ensure that no LV or sub-LV activation is attempted cluster-wide. This includes the often overlooked operations of activating metadata areas for the brief time it takes to clear them. Additionally, some 'resume_lv' operations were replaced with 'activate_lv_excl_local' when sub-LVs were promoted to top-level LVs for removal, clearing or extraction. This was necessary because it forces the appropriate renaming actions the occur via resume in the single-machine case, but won't happen in a cluster due to the necessity of acquiring a lock first. The raid tests have been updated to allow testing in a cluster. For the most part, this meant creating devices with '-aey' if they were to be converted to RAID. (RAID requires the converting LV to be EX because it is a condition of activation for the RAID LV in a cluster.)	2013-09-10 16:33:22 -05:00
Jonathan Brassow	ca51435153	Misc/RAID: Enable resume_lv to handle some renaming conflicts. When images and their associated metadata are removed from a RAID1 LV, the remaining sub-LVs are "shifted" down to fill the gaps. For example, if there is a 3-way mirror: [0][1][2] and we remove device#0, the devices will be shifted down [1][2] and renamed. [0][1] This can create a problem for resume_lv (specifically, dm_tree_activate_children) during the renaming process though. This is because it will attempt to rename the higher indexed sub-LVs first and find that it cannot because there are currently other sub-LVs with that name. The solution is to check for a conflicting name before attempting to rename. If a conflict is found and that conflicting sub-LV is also in the process of renaming, we can defer the current rename until the conflicting sub-LV has renamed and cleared the conflict. Now that resume_lv can handle these types of rename conflicts, we can remove the workaround in RAID that was attempting to resume a RAID1 LV from the bottom-up in order to force a proper rename in assending order before attempting a resume on the top-level LV. This "hack" only worked for single machine use-cases of LVM. Clearing this up paves the way for exclusive activation of RAID LVs in a cluster.	2013-09-09 15:07:28 -05:00
Jonathan Brassow	f1e3640df3	Misc: Make get_pv_list_for_lv() available to more than just RAID The function 'get_pv_list_for_lv' will assemble all the PVs that are used by the specified LV. It uses 'for_each_sub_lv' to traverse all of the sub-lvs which may compose it.	2013-08-23 08:40:13 -05:00
Jonathan Brassow	06ac797f42	Clean-up: Replace 'lv_is_active' with more correct/specific variants There are places where 'lv_is_active' was being used where it was more correct to use 'lv_is_active_locally'. For example, when checking for the existance of a kernel instance before asking for its status. Most of the time these would work correctly. (RAID is only allowed on non-clustered VGs at the moment, which means that 'lv_is_active' and 'lv_is_active_locally' would give the same result.) However, it is more correct to use the proper variant and it helps with future scenarios where targets might be allowed exclusively (or clustered) in a cluster VG.	2013-05-16 10:36:56 -05:00
Zdenek Kabelac	dd4fdce16c	cleanup: drop unused assignment Assigned values are unused.	2013-04-21 23:14:04 +02:00
Zdenek Kabelac	5e7eae59da	lv_manip: check remove_seg_from_segs_using_this_lv() Add missing check for result of remove_seg_from_segs_using_this_lv(). Failure is reported as internal error.	2013-04-21 23:10:43 +02:00
Zdenek Kabelac	24f8daa13d	raid: test for target_pvs If target_pvs is NULL do not call lv_is_on_pvs()	2013-04-21 23:07:00 +02:00
Jonathan Brassow	2e0740f7ef	RAID: Add writemostly/writebehind support for RAID1 'lvchange' is used to alter a RAID 1 logical volume's write-mostly and write-behind characteristics. The '--writemostly' parameter takes a PV as an argument with an optional trailing character to specify whether to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing character is given, it will set the flag. Synopsis: lvchange [--writemostly <PV>:{t\|y\|n}] [--writebehind <count>] vg/lv Example: lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv The last character in the 'lv_attr' field is used to show whether a device has the WriteMostly flag set. It is signified with a 'w'. If the device has failed, the 'p'artial flag has priority. Example ("nosync" raid1 with mismatch_cnt and writemostly): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg Rwi---r-m 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m Example (raid1 with mismatch_cnt, writemostly - but failed drive): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg rwi---r-p 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m A new reportable field has been added for writebehind as well. If write-behind has not been set or the LV is not RAID1, the field will be blank. Example (writebehind is set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- 512 [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor-- Example (writebehind is not set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor--	2013-04-15 13:59:46 -05:00
Zdenek Kabelac	2e39392daf	cleanup: remove unused lvl_idx	2013-04-12 11:26:31 +02:00
Jonathan Brassow	dc2ce71313	clean-up: Remove a FIXME question that has been settled It is ok for us to use the shorthand 'lv_is_virtual' to detect error targets in a RAID LV when searching for candidates for device replacement.	2013-02-20 15:03:58 -06:00
Jonathan Brassow	bd0ee420b5	RAID: Allow remove/replace of sub-LVs composed of error segments. When a device fails, we may wish to replace those segments with an error segment. (Like when a 'vgreduce --removemissing' removes a failed device that happens to be a RAID image/meta.) We are then left with images that we will eventually want to remove or replace. This patch allows us to pull out these virtual "error" sub-LVs. This allows a user to 'lvconvert -m -1 vg/lv' to extract the bad sub-LVs. Sub-LVs with error segments are considered for extraction before other possible devices so that good devices are not accidentally removed. This patch also adds the ability to replace RAID images that contain error segments. The user will still be unable to run 'lvconvert --replace' because there is no way to address the 'error' segment (i.e. no PV that it is associated with). However, 'lvconvert --repair' can be used to replace the image's error segment with a new PV. This is also the most appropriate way to do it, since the LV will continue to be reported as 'partial'.	2013-02-20 14:58:56 -06:00
Jonathan Brassow	845852d6b4	RAID: Make 'vgreduce --removemissing' work with RAID LVs Currently it is impossible to remove a failed PV which has a RAID LV on it. This patch fixes the issue by replacing the failed PV with an 'error' segment within the affected sub-LVs. Once there is no longer a RAID LV using the PV, it can be removed. Most often, it is better to replace a failed RAID device with a spare. (You can use 'lvconvert --repair <vg>/<LV>' to accomplish that.) However, if there are no spares in the volume group and none will be added, it is useful to be able to removed the failed device. Following patches address the ability to perform 'lvconvert' operations on RAID LVs that contain sub-LVs composed of 'error' segments.	2013-02-20 14:52:46 -06:00
Jonathan Brassow	0e4ffd9d3b	clean-up: Rename lvm.conf setting 'mirror_region_size' to 'raid_region_size' We have been using 'mirror_region_size' in lvm.conf as the default region size for RAID logical volumes as well as mirror logical volumes. Since, "raid" is more inclusive and representative than "mirror", I have changed the name of this setting. We must still check for the old setting and warn the user if we are overriding it with the new setting if both happen to be present.	2013-02-20 14:40:17 -06:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read where not in-sync. The kernel enforces this rule unless overridden by not allowing the creation of an array that is not in-sync and includes a devices that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is in-active 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
Jonathan Brassow	fb0cee9a66	RAID: Do not allow --splitmirrors on RAID10 logical volumes. RAID10 does not have the ability to split off images for independent use. So, 'lvconvert --splitmirrors' will not work and must be disallowed.	2012-11-21 18:39:26 -06:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Zdenek Kabelac	9ee071705b	cleanup: fix compiler warnings remove unused vars move var declarations into the front of functions. fix some sign warnings	2012-10-12 10:25:07 +02:00
Jonathan Brassow	886656e4ac	RAID: Fix problems with creating, extending and converting large RAID LVs MD's bitmaps can handle 2^21 regions at most. The RAID code has always used a region_size of 1024 sectors. That means the size of a RAID LV was limited to 1TiB. (The user can adjust the region_size when creating a RAID LV, which can affect the maximum size.) Thus, creating, extending or converting to a RAID LV greater than 1TiB would result in a failure to load the new device-mapper table. Again, the size of the RAID LV is not limited by how much space is allocated for the metadata area, but by the limitations of the MD bitmap. Therefore, we must adjust the 'region_size' to ensure that the number of regions does not exceed the limit. I've added code to do this when extending a RAID LV (which covers 'create' and 'extend' operations) and when up-converting - specifically from linear to RAID1.	2012-09-27 16:51:22 -05:00
Jonathan Brassow	2a6712ddef	RAID1: Clear the LV_NOTSYNCED flag when a RAID1 LV is converted to linear Failing to clear the LV_NOTSYNCED flag when converting a RAID1 LV to linear can result in the flag being present after an upconvert - even if the sync is performed when upconverting.	2012-09-14 16:26:53 -05:00
Jonathan Brassow	116bcb3ea4	RAID1: Like mirrors, do not allow adding images to LV created w/ --nosync Mirrors do not allow upconverting if the LV has been created with --nosync. We will enforce the same rule for RAID1. It isn't hugely critical, since the portions that have been written will be copied over to the new device identically from either of the existing images. However, the unwritten sections may be different, causing the added image to be a hybrid of the existing images. Also, we are disallowing the addition of new images to a RAID1 LV that has not completed the initial sync. This may be different from mirroring, but that is due to the fact that the 'mirror' segment type "stacks" when adding a new image and RAID1 does not. RAID1 will rebuild a newly added image "inline" from the existant images, so they should be in-sync.	2012-09-14 16:12:52 -05:00
Jonathan Brassow	cdb0339319	RAID: Disallow addition of RAID images while array is not in-sync We cannot add images to a RAID array while it is not in-sync. The kernel will simply reject the table, saying: 'rebuild' specified while array is not in-sync Now we check to ensure the LV is in-sync before attempting image additions.	2012-09-10 17:15:20 -05:00
Jonathan Brassow	b49b98d50c	RAID: '--test' should not cause a valid create command to fail It is necessary when creating a RAID LV to clear the new metadata areas. Failure to do so could result in a prepopulated bitmap that would cause the new array to skip syncing portions of the array. It is a requirement that the metadata LVs be activated and cleared in the process of creating. However in test mode, this requirement should be lifted - no new LVs should be created or written to.	2012-09-05 14:32:06 -05:00
Jonathan Brassow	c3eb3a7687	cleanup: Use segtype->ops->name() instead of segtype->name where applicable When printing a message for the user and the lv_segment pointer is available, use segtype->ops->name() instead of segtype->name. This gives a better user-readable name for the segment. This is especially true for the 'striped' segment type, which prints "linear" if there is an area_count of one.	2012-09-05 11:35:54 -05:00
Alasdair G Kergon	438e0050df	config: add silent mode Accept -q as the short form of --quiet. Suppress non-essential standard output if -q is given twice. Treat log/silent in lvm.conf as equivalent to -qq. Review all log_print messages and change some to log_print_unless_silent. When silent, the following commands still produce output: dumpconfig, lvdisplay, lvmdiskscan, lvs, pvck, pvdisplay, pvs, version, vgcfgrestore -l, vgdisplay, vgs. [Needs checking.] Non-essential messages are shifted from log level 4 to log level 5 for syslog and lvm2_log_fn purposes.	2012-08-25 20:35:48 +01:00
Jonathan Brassow	4047e4dfb1	RAID: Add support for RAID10 This patch adds support for RAID10. It is not the default at this stage. The user needs to specify '--type raid10' if they would like RAID10 instead of stacked mirror over stripe.	2012-08-24 15:34:19 -05:00
Zdenek Kabelac	286cd2006b	cleanup: drop unneeded included header files This headers were not resolving anything used for compiled .c files. Remove unused util.c file.	2012-08-23 14:37:20 +02:00
Jonathan Earl Brassow	dfd024d3a8	Allow a subset of failed devices to be replaced in RAID LVs. If two devices in an array failed, it was previously impossible to replace just one of them. This patch allows for the replacement of some, but perhaps not all, failed devices.	2012-04-24 20:05:31 +00:00
Jonathan Earl Brassow	a7feae8a6e	Fix code that performs RAID device replacement while under snapshot. The code should have been calling [suspend\|resume]_lv_origin() rather than [suspend\|resume]_lv. This addresses bug 807069.	2012-04-12 03:16:37 +00:00
Jonathan Earl Brassow	187486c7bb	Fix inability to split RAID1 image while specifying a particular PV. The logic for resuming the original and newly split LVs was not properly done to handle situations where anything but the last device in the array was split. It did not take into account the possible name collisions that might occur when the original LV undergoes the shifting and renaming of its sub-LVs.	2012-04-11 14:20:19 +00:00
Jonathan Earl Brassow	c0b5886f18	RAID LVs could not handle a down-convert if a device other than the last one in the array was specified for removal. This change addresses that (bz806111).	2012-04-11 01:23:29 +00:00
Jonathan Earl Brassow	dc7b1640ed	Fix name conflicts that prevent down-converting RAID1 when specifying a device When down-converting a RAID1 device, it is the last device that is extracted and removed when the user does not specify a particular device. However, when a device is specified (and it is not the last), the device is removed and the remaining sub-LVs are "shifted down" to fill the hole. This cause problems when resuming the LV because if the shifted devices were resumed (and thus renamed) before the sub-LV being extracted, there would be a name conflict. The solution is to resume the extracted sub-LVs first so that they can be properly renamed preventing a possible conflict. This addresses bug 801967.	2012-03-15 20:00:54 +00:00
Jonathan Earl Brassow	870762d8e3	Require number of stripes to be greater than parity devices in higher RAID. Also, add some comments to code that I recently added that may be unclear otherwise.	2012-02-23 17:36:35 +00:00
Jonathan Earl Brassow	9bdfb30720	Fix allocation code to allow replacement of single RAID 4/5/6 device. The code fail to account for the case where we just need a single device in a RAID 4/5/6 array. There is no good way to tell the allocation functions that we don't need parity devices when we are allocating just a single device. So, I've used a bit of a hack. If we are allocating an area_count that is <= the parity count, then we can assume we are simply allocating a replacement device (i.e. no need to include parity devices in the calculations). This should make sense in most cases. If we need to allocate replacement devices due to failure (or moving), we will never allocate more than the parity count; or we would cause the array to become unusable. If we are creating a new device, we should always create more stripes than parity devices.	2012-02-23 03:57:23 +00:00
Zdenek Kabelac	cbe6bcd593	Add check for rimage name allocation failure	2012-02-13 11:10:37 +00:00
Jonathan Earl Brassow	6cf3274732	Use suspend\|resume_origin_only when up-converting RAID LVs, as mirrors do. Failure to do so results in "Performing unsafe table load while X device(s) are known to be suspended" errors. While fixing the problem in this way works and is consistent with the way the mirror segment type does it, it would be nice to find a solution that uses the generic suspend/resume calls. Also included in this check-in are additions to the test suite that perform conversions on RAID LVs under a snapshot. These tests are disabled for the time being due to a kernel bug that is yet to be tracked down.	2012-01-24 14:33:38 +00:00
Jonathan Earl Brassow	9711057499	Don't allow two images to be split and tracked from a RAID LV at one time Also, don't allow a splitmirror operation on a RAID LV that is already tracking a split, unless the operation is to stop the tracking and complete the split. Example: ~> lvconvert --splitmirrors 1 --trackchanges vg/lv /dev/sdc1 # Now tracking changes - image can be merged back or split-off for good ~> lvconvert --splitmirrors 1 -n new_name vg/lv /dev/sdc1 # ^ Completes split ^ If a split is performed on a RAID that is tracking an already split image and PVs are provided, we must ensure that 1) the already split LV is represented in the PVs 2) we are careful to split only the tracked image	2011-12-01 00:21:04 +00:00
Jonathan Earl Brassow	a927e401f1	Do not allow users to change the name of RAID sub-LVs or the name of the RAID LV if it is tracking changes for a split image.	2011-12-01 00:09:34 +00:00
Jonathan Earl Brassow	0c506d9a40	Support the ability to replace specific devices in a RAID array. RAID is not like traditional LVM mirroring. LVM mirroring required failed devices to be removed or the logical volume would simply hang. RAID arrays can keep on running with failed devices. In fact, for RAID types other than RAID1, removing a device would mean substituting an error target or converting to a lower level RAID (e.g. RAID6 -> RAID5, or RAID4/5 to RAID0). Therefore, rather than removing a failed device unconditionally and potentially allocating a replacement, RAID allows the user to "replace" a device with a new one. This approach is a 1-step solution vs the current 2-step solution. example> lvconvert --replace <dev_to_remove> vg/lv [possible_replacement_PVs] '--replace' can be specified more than once. example> lvconvert --replace /dev/sdb1 --replace /dev/sdc1 vg/lv	2011-11-30 02:02:10 +00:00
Jonathan Earl Brassow	f60175c308	Add the ability to convert LVs of "mirror" segtype to "raid1" segtype. Example: ~> lvconvert --type raid1 vg/mirror_lv Steps to convert "mirror" to "raid1" 1) Allocate a RAID metadata LV for each mirror image from the same PVs on which they are located. 2) Clear the metadata LVs. This involves writing LVM metadata, so we don't change any aspects of the mirror LV before this so that the user can easily remove LVs from the failed convert attempt while retaining the original mirror. 3) Remove the mirror log, if it exists. 4) Add metadata LVs to mirror LV 5) Rename mirror sub-lvs (s/mimage/rimage/) 6) Change flags and segtype from mirror to raid1	2011-10-07 14:56:01 +00:00
Jonathan Earl Brassow	d3582e0252	Add the ability to convert linear LVs to RAID1 Example: ~> lvconvert --type raid1 -m 1 vg/lv The following steps are performed to convert linear to RAID1: 1) Allocate a metadata device from the same PV as the linear device to provide the metadata/data LV pair required for all RAID components. 2) Allocate the required number of metadata/data LV pairs for the remaining additional images. 3) Clear the metadata LVs. This performs a LVM metadata update. 4) Create the top-level RAID LV and add the component devices. We want to make any failure easy to unwind. This is why we don't create the top-level LV and add the components until the last step. Should anything happen before that, the user could simply remove the unnecessary images. Also, we want to ensure that the metadata LVs are cleared before forming the array to prevent stale information from polluting the new array. A new macro 'seg_is_linear' was added to allow us to distinguish linear LVs from striped LVs.	2011-10-07 14:52:26 +00:00
Jonathan Earl Brassow	40c85cf1d7	When up-converting a RAID1 array, we need to allocate new larger arrays for seg->areas and seg->meta_areas. We also need to copy the memory from the old arrays to the newly allocated arrays. The amount of memory to copy was determined by seg->area_count. However, seg->area_count was being set to the higher value after copying the 'seg->areas' information, but before copying the 'seg->meta_areas' information. This means we were copying more memory than necessary for 'seg->meta_areas' - something that could lead to a segfault.	2011-09-22 15:33:21 +00:00
Zdenek Kabelac	886d005616	LVM_WRITE and LVM_READ are 64bit constants Revert John patch, which fixed only 1 place where ~LVM_WRITE was in use and convert ommited LVM_READ/WRITE flags to 64bit constants as well. (Since both 'status' flags for LV and VG are 64bit.)	2011-09-14 09:57:35 +00:00
Jonathan Earl Brassow	0c89ef513a	Changing RAID status flags to 64-bit broke some binary flag operations. LVM_WRITE is a 32-bit flag. Now that RAID[_IMAGE\|_META] are 64-bit, and'ing a RAID LV's status against LVM_WRITE can reset the higher order flags. A similar thing will affect thinp flags if not careful.	2011-09-13 16:33:21 +00:00
Alasdair Kergon	2ef5b7cca6	Start using 64-bit status flags - most of the code already handles them. tdata -> tpool remove commented out definitions from metadata.h formatting clean-ups	2011-09-06 18:49:31 +00:00
Alasdair Kergon	1d64dcfbf7	clarify comment	2011-08-19 19:35:50 +00:00
Alasdair Kergon	3250b38583	_ for static fns	2011-08-19 15:59:15 +00:00
Jonathan Earl Brassow	a2facf4ad4	Add ability to merge back a RAID1 image that has been split w/ --trackchanges Argument layout is very similar to the merge command for snapshots.	2011-08-18 19:43:08 +00:00
Jonathan Earl Brassow	f439e65b64	Add support for m-way to n-way up-convert in RAID1 (no linear to n-way yet) This patch adds the ability to upconvert a raid1 array - say from 2-way to 3-way. It does not yet support upconverting linear to n-way. The 'raid' device-mapper target allows for individual components (images) of an array to be specified for rebuild. This mechanism is used when adding new images to the array so that the new images can be resync'ed while the rest of the images in the array can remain 'in-sync'. (There is no mirror-on-mirror layering required.)	2011-08-18 19:41:21 +00:00
Jonathan Earl Brassow	6d04311efa	Add the ability to split an image from the mirror and track changes. ~> lvconvert --splitmirrors 1 --trackchanges vg/lv The '--trackchanges' option allows a user the ability to use an image of a RAID1 array for the purposes of temporary read-only access. The image can be merged back into the array at a later time and only the blocks that have changed in the array since the split will be resync'ed. This operation can be thought of as a partial split. The image is never completely extracted from the array, in that the array reserves the position the device occupied and tracks the differences between the array and the split image via a bitmap. The image itself is rendered read-only and the name (<LV>_rimage_*) cannot be changed. The user can complete the split (permanently splitting the image from the array) by re-issuing the 'lvconvert' command without the '--trackchanges' argument and specifying the '--name' argument. ~> lvconvert --splitmirrors 1 --name my_split vg/lv Merging the tracked image back into the array is done with the '--merge' option (included in a follow-on patch). ~> lvconvert --merge vg/lv_rimage_<n> The internal mechanics of this are relatively simple. The 'raid' device- mapper target allows for the specification of an empty slot in an array via '- -'. This is what will be used if a partial activation of an array is ever required. (It would also be possible to use 'error' targets in place of the '- -'.) If a RAID image is found to be both read-only and visible, then it is considered separate from the array and '- -' is used to hold it's position in the array. So, all that needs to be done to temporarily split an image from the array /and/ cause the kernel target's bitmap to track (aka "mark") changes made is to make the specified image visible and read-only. To merge the device back into the array, the image needs to be returned to the read/write state of the top-level LV and made invisible.	2011-08-18 19:38:26 +00:00
Jonathan Earl Brassow	a324baf6a1	Add --splitmirrors support for RAID1 (1 image only) Users already have the ability to split an image from an LV of "mirror" segtype. This patch extends that ability to LVs of "raid1" segtype. This patch only allows a single image to be split off, however. (The "mirror" segtype allows an arbitrary number of images to be split off. e.g. 4-way => 3-way/linear, 2-way/2-way, linear,3-way)	2011-08-18 19:34:18 +00:00
Jonathan Earl Brassow	63d32fb6a6	When down-converting RAID1, don't activate sub-lvs between suspend/resume of top-level LV. We can't activate sub-lv's that are being removed from a RAID1 LV while it is suspended. However, this is what was being used to have them show-up so we could remove them. 'sync_local_dev_names' is a sufficient and proper replacement and can be done after the top-level LV is resumed.	2011-08-18 19:31:33 +00:00
Jonathan Earl Brassow	4903b85d23	Compiler warning fixes, better error messaging, and cosmetic changes. 1) add new function 'raid_remove_top_layer' which will be useful to other conversion functions later (also cleans up code) 2) Add error messages if raid_[extract\|add]_images fails 3) Add function prototypes to prevent compiler warnings when compiling with '--with-raid=shared'	2011-08-13 04:28:34 +00:00
Jonathan Earl Brassow	a22515c87f	Various code clean-ups (s/malloc/zalloc/, new msgs, etc) Fix a couple more issues that kabi found. - Add some error messages in failure cases - s/malloc/zalloc/ - use vg->vgmem for lv names instead of vg->cmd->mem	2011-08-11 21:32:18 +00:00
Jonathan Earl Brassow	b2fa9b43dc	Add some log_error msg's and fix potential segfault Thanks to kabi for spotting these - especially the possibility for segfault if a loop runs all the way through without finding a match.	2011-08-11 19:17:10 +00:00
Jonathan Earl Brassow	4aebd52c4c	Add ability to down-convert RAID1 arrays. Also, add some simple RAID tests to testsuite.	2011-08-11 18:24:40 +00:00

... 3 4 5 6 7 ...

373 Commits