shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Jonathan Brassow	4daea88516	clean-up: typos s/bellow/below/	2015-07-06 10:15:11 -05:00
Alasdair G Kergon	810ab095e6	macros: Wrap PRI with FMT. Create a set of wrappers with embedded % such as #define FMTu64 "%" PRIu64	2015-07-06 15:09:17 +01:00
Zdenek Kabelac	a900d150e4	thin: move pool messaging from resume to suspend Existing messaging intarface for thin-pool has a few 'weak' points: * Message were posted with each 'resume' operation, thus not allowing activation of thin-pool with the existing state. * Acceleration skipped suspend step has not worked in cluster, since clvmd resumes only nodes which are suspended (have proper lock state). * Resume may fail and code is not really designed to 'fail' in this phase (generic rule here is resume DOES NOT fail unless something serious is wrong and lvm2 tool usually doesn't handle recovery path in this case.) * Full thin-pool suspend happened, when taken a thin-volume snapshot. With this patch the new method relocates message passing into suspend state. This has a few drawbacks with current API, but overal it performs better and gives are more posibilities to deal with errors. Patch introduces a new logic for 'origin-only' suspend of thin-pool and this also relates to thin-volume when taking snapshot. When suspend_origin_only operation is invoked on a pool with queued messages then only those messages are posted to thin-pool and actual suspend of thin pool and data and metadata volume is skipped. This makes taking a snapshot of thin-volume lighter operation and avoids blocking of other unrelated active thin volumes. Also fail now happens in 'suspend' state where the 'Fail' is more expected and it is better handled through error paths. Activation of thin-pool is now not sending any message and leaves upto a tool to decided later how to finish unfinished double-commit transaction. Problem which needs some API improvements relates to the lvm2 tree construction. For the suspend tree we do not add target table line into the tree, but only a device is inserted into a tree. Current mechanism to attach messages for thin-pool requires the libdm to know about thin-pool target, so lvm2 currently takes assumption, node is really a thin-pool and fills in the table line for this node (which should be ensured by the PRELOAD phase, but it's a misuse of internal API) we would possibly need to be able to attach message to 'any' node. Other thing to notice - current messaging interface in thin-pool target requires to suspend thin volume origin first and then send a create message, but this could not have any 'nice' solution on lvm2 side and IMHO we should introduce something like 'create_after_resume' message. Patch also changes the moment, where lvm2 transaction id is increased. Now it happens only after successful finish of kernel transaction id change. This change was needed to handle properly activation of pool, which is in the middle of unfinished transaction, and also this corrects usage of thin-pool by external apps like Docker.	2015-07-03 16:13:14 +02:00
Zdenek Kabelac	622064f00f	thin: check for overprovisioning	2015-07-03 16:13:14 +02:00
David Teigland	fe70b03de2	Add lvmlockd	2015-07-02 15:42:26 -05:00
Alasdair G Kergon	4c629a5257	locking: Add missing error handling. Add missing error logging and detection to unlock_vg and callers of sync_local_dev_names etc.	2015-06-30 18:54:38 +01:00
Peter Rajnoha	621398ebb7	lv: time: increase buffer to 4k in lv_time_dup	2015-06-29 15:24:00 +02:00
Peter Rajnoha	125cd06698	conf: make time format configurable Make it possible to define format for time that is displayed. The way the format is defined is equal to the way that is used for strftime function, although not all formatting options as used in strftime are available for LVM2 - the set is restricted (e.g. we do not allow newline to be printed). The lvm.conf comments contain the whole list that LVM2 accepts for time format together with brief description (copied from strftime man page). For example: (defaults used - the format is the same as used before this patch) $ lvs -o+time vg/lvol0 vg/lvol1 LV VG Attr LSize Time lvol0 vg -wi-a----- 4.00m 2015-06-25 16:18:34 +0200 lvol1 vg -wi-a----- 4.00m 2015-06-29 09:17:11 +0200 (using 'time_format = "@%s"' in lvm.conf - number of seconds since the Epoch) $ lvs -o+time vg/lvol0 vg/lvol1 LV VG Attr LSize Time lvol0 vg -wi-a----- 4.00m @1435241914 lvol1 vg -wi-a----- 4.00m @1435562231	2015-06-29 14:30:35 +02:00
Zdenek Kabelac	e217873ed6	snapshot: add synchronization point Synchronize with udev logic before reusing device as snapshot. This patch tries to fix the problem with udev, where we manage to 'active' LV for clearing, then we deactivate such device and active again as member of 'origin&snapshot' tree all in 1 step. There needs to be a sync point where udev has time to remove all links, otherwise we race with scans and we may end-up with mysterious 'free' links in the system pointing to wrong dm names. This patch tries to fix failing topology cluster tests..	2015-06-24 15:18:49 +02:00
Zdenek Kabelac	a3e0d830bd	thin: support unaligned size of external origin and thin pool With thin-pool kernel target module 1.13 it's now support usage of external origin with sizes which are not 'alligned' with chunk size of thin-pool. Enable lvm2 support for this and also fix reporting of data_percent usage for case sizes are not alligned.	2015-06-18 18:50:36 +02:00
Zdenek Kabelac	6f2a617c31	thin: drop limitation for extension of reduced thin volume Drop check which has prevented resize of reduce thin volume with external origin. User is supposed to use 'zeroing' to get 'clean' chunks.	2015-06-18 18:48:59 +02:00
David Teigland	e043e03cd8	lv_refresh: move the bulk of the function into lib So that it can be used from other lib code.	2015-06-16 13:38:40 -05:00
David Teigland	d5adec1056	Add the 's' activation mode Just as 'e' means activation with an exclusive lock, add an 's' to mean activation with a shared lock. This allows the existing but implicit behavior of '-ay' of clvm LVs to be specified explicitly. For local VGs, asy simply means ay, just like aey means ay. For local VGs, ay == aey == asy For clvm VGs, ay == asy, aey == aey, asy == asy	2015-06-16 10:18:16 -05:00
Petr Rockai	632dde0cbc	metadata: When outdated PVs are wiped, notify lvmetad about the fact.	2015-06-10 16:27:12 +02:00
Petr Rockai	c78b6f18d4	metadata: Reject lvmetad metadata extensions when reading from disk.	2015-06-10 16:25:57 +02:00
Petr Rockai	756d027da5	metadata: Explain the pvs_outdated field in struct volume_group.	2015-06-10 16:17:45 +02:00
Petr Rockai	611c8b6d29	metadata: Add pvs_outdated to struct volume_group. This is a list of PVs that should have their MDAs wiped because they carry outdated metadata (that used to belong to the VG they are attached to).	2015-05-20 19:46:14 +02:00
Petr Rockai	5435346052	metadata: Factor _wipe_outdated_pvs() PVs out of _vg_read().	2015-05-20 19:46:13 +02:00
Alasdair G Kergon	0300730cc9	pre-release	2015-05-15 23:19:29 +01:00
Zdenek Kabelac	9e102ecbd9	mirror: use proper 64bit constants `ed2a08bf25` missed to use 64bit constants.	2015-05-15 22:53:12 +02:00
David Teigland	8e509b5dd5	toollib: avoid repeated lvmetad vg_lookup In process_each_{vg,lv,pv} when no vgname args are given, the first step is to get a list of all vgid/vgname on the system. This is exactly what lvmetad returns from a vg_list request. The current code is doing a vg_lookup on each VG after the vg_list and populating lvmcache with the info for each VG. These preliminary vg_lookup's are unnecessary, because they will be done again when the processing functions call vg_read. This patch eliminates the initial round of vg_lookup's, which can roughly cut in half the number of lvmetad requests and save a lot of extra work.	2015-05-08 11:44:55 -05:00
Zdenek Kabelac	29c709f591	debug: tracing error path	2015-05-08 15:15:10 +02:00
Zdenek Kabelac	ed2a08bf25	cleanup: use 64bit ulongs Use 64bit arithmetics for all numbers (Coverity).	2015-05-08 15:15:10 +02:00
Zdenek Kabelac	3c46428fcd	cleanup: drop unneeded int test Testing int region_size > INT32_MAX is always false so drop the test (Coverity).	2015-05-08 15:15:10 +02:00
Zdenek Kabelac	2cea1c1bd9	pvcreate: fix test for wiping status Commit `ed420fb691` changed paramet wiped to be a pointer, but missed to switch to test pointer dereferenced value and instead always checked 'pointer'.	2015-05-08 13:36:39 +02:00
Zdenek Kabelac	88421c883e	raid: reread status when 0 is reported When kernel target reports sync status as 0% it might as well mean it's 100% in sync, just the target is in some race inconsistent state - so reread once again and take a more optimistic value ;) Patch tries to work around: https://bugzilla.redhat.com/show_bug.cgi?id=1210637	2015-05-04 13:09:05 +02:00
Alasdair G Kergon	cc26085b62	alloc: Respect cling_tag_list in contig alloc. When performing initial allocation (so there is nothing yet to cling to), use the list of tags in allocation/cling_tag_list to partition the PVs. We implement this by maintaining a list of tags that have been "used up" as we proceed and ignoring further devices that have a tag on the list. https://bugzilla.redhat.com/983600	2015-04-11 01:55:24 +01:00
Alasdair G Kergon	2872e8c289	alloc: Add A_PARTITION_BY_TAGS to avoid sharing. Add A_PARTITION_BY_TAGS set when allocated areas should not share tags with each other and allow _match_pv_tags to accept an alternative list of tags. (Not used yet.)	2015-04-10 21:57:52 +01:00
Alasdair G Kergon	f1e3e99169	alloc: Log PV tags when reserving areas.	2015-03-26 21:13:26 +00:00
Alasdair G Kergon	e8fa3354f0	alloc: Pass alloc_handle through to _reserve_area.	2015-03-26 20:32:59 +00:00
Alasdair G Kergon	f9d74ba3d1	alloc: Only report cling tag errors once.	2015-03-26 19:43:51 +00:00
Alasdair G Kergon	4b1219ee87	metadata: Move alloc_handle init/destroy fns.	2015-03-26 18:44:24 +00:00
Peter Rajnoha	8759f7d755	metadata: vg: add removed_lvs field to collect LVs which have been removed Do not keep dangling LVs if they're removed from the vg->lvs list and move them to vg->removed_lvs instead (this is actually similar to already existing vg->removed_pvs list, just it's for LVs now). Once we have this vg->removed_lvs list indexed so it's possible to do lookups for LVs quickly, we can remove the LV_REMOVED flag as that one won't be needed anymore - instead of checking the flag, we can directly check the vg->removed_lvs list if the LV is present there or not and to say if the LV is removed or not then. For now, we don't have this index, but it may be implemented in the future.	2015-03-24 08:43:08 +01:00
Peter Rajnoha	c9f021de0b	metadata: process_each_lv_in_vg: get the list of LVs to process first, then do the processing This avoids a problem in which we're using selection on LV list - we need to do the selection on initial state and not on any intermediary state as we process LVs one by one - some of the relations among LVs can be gone during this processing. For example, processing one LV can cause the other LVs to lose the relation to this LV and hence they're not selectable anymore with the original selection criteria as it would be if we did selection on inital state. A perfect example is with thin snapshots: $ lvs -o lv_name,origin,layout,role vg LV Origin Layout Role lvol1 thin,sparse public,origin,thinorigin,multithinorigin lvol2 lvol1 thin,sparse public,snapshot,thinsnapshot lvol3 lvol1 thin,sparse public,snapshot,thinsnapshot pool thin,pool private $ lvremove -ff -S 'lv_name=lvol1 \|\| origin=lvol1' Logical volume "lvol1" successfully removed The lvremove command above was supposed to remove lvol1 as well as all its snapshots which have origin=lvol1. It failed to do so, because once we removed the origin lvol1, the lvol2 and lvol3 which were snapshots before are not snapshots anymore - the relations change as we're processing these LVs one by one. If we do the selection first and then execute any concrete actions on these LVs (which is what this patch does), the behaviour is correct then - the selection is done on the initial state: $ lvremove -ff -S 'lv_name=lvol1 \|\| origin=lvol1' Logical volume "lvol1" successfully removed Logical volume "lvol2" successfully removed Logical volume "lvol3" successfully removed Similarly for all the other situations in which relations among LVs are being changed by processing the LVs one by one. This patch also introduces LV_REMOVED internal LV status flag to mark removed LVs so they're not processed further when we iterate over collected list of LVs to be processed. Previously, when we iterated directly over vg->lvs list to process the LVs, we relied on the fact that once the LV is removed, it is also removed from the vg->lvs list we're iterating over. But that was incorrect as we shouldn't remove LVs from the list during one iteration while we're iterating over that exact list (dm_list_iterate_items safe can handle only one removal at one iteration anyway, so it can't be used here).	2015-03-24 08:43:07 +01:00
Alasdair G Kergon	6407d184d1	cache: Store metadata size and checksum. Refactor the recent metadata-reading optimisation patches. Remove the recently-added cache fields from struct labeller and struct format_instance. Instead, introduce struct lvmcache_vgsummary to wrap the VG information that lvmcache holds and add the metadata size and checksum to it. Allow this VG summary information to be looked up by metadata size + checksum. Adjust the debug log messages to make it clear when this shortcut has been successful. (This changes the optimisation slightly, and might be extendable further.) Add struct cached_vg_fmtdata to format-specific vg_read calls to preserve state alongside the VG across separate calls and indicate if the details supplied match, avoiding the need to read and process the VG metadata again.	2015-03-18 23:43:02 +00:00
Alasdair G Kergon	95fbbf4f40	metadata: Fix recent vg_validate message text.	2015-03-17 17:48:56 +00:00
Alasdair G Kergon	a854546234	metadata: Detect internal use of LVM_WRITE_LOCKED. Generate internal error if LVM_WRITE_LOCKED ever appears in struct volume_group: it's only used in external metadata.	2015-03-09 18:56:24 +00:00
Alasdair G Kergon	faccdeda83	comments: Use full flag names.	2015-03-09 18:53:22 +00:00
Zdenek Kabelac	04101bc430	lib: drop unneeded vg_read call Since we take a lock inside vg_lock_newname() and we do a full detection of presence of vgname inside all scanned labels, there is no point to do this for second time to be sure there is no such vg. The only side-effect of such call would be a full validation of some already exising VG metadata - but that's not the task for vgcreate when create a new VG. This call noticable reduces number of scans during 'vgcreate'.	2015-03-06 14:05:06 +01:00
Zdenek Kabelac	7e7411966a	lib: avoid reparsing same metadata When reading VG mda from multiple PVs - do all the validation only when mda is seen for the first time and when mda checksum and length is same just return already existing VG pointer. (i.e. using 300PVs for a VG would lead to create and destroy 300 config trees....)	2015-03-06 13:53:12 +01:00
David Teigland	1e65fdd9ba	system_id: make new VGs read-only for old lvm versions Previous versions of lvm will not obey the restrictions imposed by the new system_id, and would allow such a VG to be written. So, a VG with a new system_id is further changed to force previous lvm versions to treat it as read-only. This is done by removing the WRITE flag from the metadata status line of these VGs, and putting a new WRITE_LOCKED flag in the flags line of the metadata. Versions of lvm that recognize WRITE_LOCKED, also obey the new system_id. For these lvm versions, WRITE_LOCKED is identical to WRITE, and the rules associated with matching system_id's are imposed. A new VG lock_type field is also added that causes the same WRITE/WRITE_LOCKED transformation when set. A previous version of lvm will also see a VG with lock_type as read-only. Versions of lvm that recognize WRITE_LOCKED, must also obey the lock_type setting. Until the lock_type feature is added, lvm will fail to read any VG with lock_type set and report an error about an unsupported lock_type. Once the lock_type feature is added, lvm will allow VGs with lock_type to be used according to the rules imposed by the lock_type. When both system_id and lock_type settings are removed, a VG is written with the old WRITE status flag, and without the new WRITE_LOCKED flag. This allows old versions of lvm to use the VG as before.	2015-03-05 09:50:43 -06:00
David Teigland	c6a57dc4f3	Revert "systemid: Add ACCESS_NEEDS_SYSTEM_ID VG flag." This reverts commit `bfbb5d269a`. This will be done differently.	2015-03-05 09:50:43 -06:00
Peter Rajnoha	190d591fbe	report: fix seg_monitor field to display monitoring status for thick snapshots and mirrors The seg_monitor did not display monitored status for thick snapshots and mirrors (with mirror log not mirrored). The seg monitor did work correctly even before for other segtypes - thins and raids. Before (mirrors and snapshots, only mirrors with mirrored log properly displayed monitoring status): [0] f21/~ # lvs -a -o lv_name,lv_layout,lv_role,seg_monitor vg LV Layout Role Monitor mirror mirror public [mirror_mimage_0] linear private,mirror,image [mirror_mimage_1] linear private,mirror,image [mirror_mlog] linear private,mirror,log mirror_with_mirror_log mirror public monitored [mirror_with_mirror_log_mimage_0] linear private,mirror,image [mirror_with_mirror_log_mimage_1] linear private,mirror,image [mirror_with_mirror_log_mlog] mirror private,mirror,log monitored [mirror_with_mirror_log_mlog_mimage_0] linear private,mirror,image [mirror_with_mirror_log_mlog_mimage_1] linear private,mirror,image thick_origin linear public,origin,thickorigin thick_snapshot linear public,snapshot,thicksnapshot With this patch applied (monitoring status displayed for all mirrors and snapshots): [0] f21/~ # lvs -a -o lv_name,lv_layout,lv_role,seg_monitor vg LV Layout Role Monitor mirror mirror public monitored [mirror_mimage_0] linear private,mirror,image [mirror_mimage_1] linear private,mirror,image [mirror_mlog] linear private,mirror,log mirror_with_mirror_log mirror public monitored [mirror_with_mirror_log_mimage_0] linear private,mirror,image [mirror_with_mirror_log_mimage_1] linear private,mirror,image [mirror_with_mirror_log_mlog] mirror private,mirror,log monitored [mirror_with_mirror_log_mlog_mimage_0] linear private,mirror,image [mirror_with_mirror_log_mlog_mimage_1] linear private,mirror,image thick_origin linear public,origin,thickorigin thick_snapshot linear public,snapshot,thicksnapshot monitored	2015-03-05 14:05:34 +01:00
Alasdair G Kergon	bfbb5d269a	systemid: Add ACCESS_NEEDS_SYSTEM_ID VG flag. Set ACCESS_NEEDS_SYSTEM_ID VG status flag whenever there is a non-lvm1 system_id set. Prevents concurrent access from older LVM2 versions. Not set on VGs that bear a system_id only due to conversion from lvm1 metadata.	2015-03-04 01:16:32 +00:00
Alasdair G Kergon	3562b5ab39	systemid: Init and merge lvm2 and lvm1 fields. Use system_id field in preference to lvm1_system_id. Initialise both for now.	2015-03-04 01:00:51 +00:00
Alasdair G Kergon	4e6f3e5162	archives: Preserve format type in file. format_text processes both lvm2 on-disk metadata and metadata read from other sources such as backup files. Add original_fmt field to retain the format type of the original metadata. Before this patch, /etc/lvm/archives would contain backups of lvm1 metadata with format = "lvm2" unless the source was lvm1 on-disk metadata.	2015-03-04 00:30:26 +00:00
Peter Rajnoha	1a41e649a6	metadata: vg: alloc lvm1_system_id in alloc_vg sooner	2015-03-02 13:00:45 +01:00
Peter Rajnoha	eeaf3f2e88	metadata: vg: add missing vg->lvm1_system_id initialization The vg->lvm1_systemd_id needs to be initialized as all the code around counts with that. Just like we initialize lvm1_system_id in vg_create (no matter if it's actually LVM1 or LVM2 format), this patch adds this init in alloc_vg as well so the rest of the code does not segfaul when trying to access vg->lvm1_system_id.	2015-03-02 12:17:27 +01:00
David Teigland	c32efc7f7e	system_id: apply consistent naming In log messages refer to it as system ID (not System ID). Do not put quotes around the system_id string when printing. On the command line use systemid. In code, metadata, and config files use system_id. In lvmsystemid refer to the concept/entity as system_id.	2015-02-27 13:32:00 -06:00
Alasdair G Kergon	a432066c7c	mirror: Explicit cast in region_size_max	2015-02-26 19:49:25 +00:00
Alasdair G Kergon	cb727a1ccc	mirror: Avoid region size compiler warning. format ‘%u’ expects type ‘unsigned int’, but argument 7 has type ‘uint64_t’	2015-02-26 19:45:55 +00:00
David Teigland	dd6a202831	lvchange: deactivate is always possible in foreign vgs The only realistic way for a host to have active LVs in a foreign VG is if the host's system_id (or system_id_source) is changed while LVs are active. In this case, the active LVs produce an warning, and access to the VG is implicitly allowed (without requiring --foreign.) This allows the active LVs to be deactivated. In this case, rescanning PVs for the VG offers no benefit. It is not possible that rescanning would reveal an LV that is active but wasn't previously in the VG metadata.	2015-02-25 14:58:49 -06:00
Jonathan Brassow	dd0ee35378	cmirror: Adjust region size to work around CPG msg limit to avoid hang. cmirror uses the CPG library to pass messages around the cluster and maintain its bitmaps. When a cluster mirror starts-up, it must send the current state to any joining members - a checkpoint. When mirrors are large (or the region size is small), the bitmap size can exceed the message limit of the CPG library. When this happens, the CPG library returns CPG_ERR_TRY_AGAIN. (This is also a bug in CPG, since the message will never be successfully sent.) There is an outstanding bug (bug 682771) that is meant to lift this message length restriction in CPG, but for now we work around the issue by increasing the mirror region size. This limits the size of the bitmap and avoids any issues we would otherwise have around checkpointing. Since this issue only affects cluster mirrors, the region size adjustments are only made on cluster mirrors. This patch handles cluster mirror issues involving pvmove, lvconvert (from linear to mirror), and lvcreate. It also ensures that when users convert a VG from single-machine to clustered, any mirrors with too many regions (i.e. a bitmap that would be too large to properly checkpoint) are trapped.	2015-02-25 14:42:15 -06:00
David Teigland	8668a9e81c	systemid: silently ignore foreign vgs unless named A foreign VG should be silently ignored by a reporting/display command like 'vgs'. If the reporting/display command specifies a foreign VG by name on the command line, it should produce an error message. Scanning commands pvscan/vgscan/lvscan are always allowed to read and update caches from all PVs, including those that belong to foreign VGs. Other non-report/display/scan commands always ignore a foreign VG, or report an error if they attempt to use a foreign VG. vgimport should always invalidate the lvmetad cache because lvmetad likely holds a pre-vgexported copy of the VG. (This is unrelated to using foreign VGs; the pre-vgexported VG may have had no system_id at all.)	2015-02-25 10:53:52 -06:00
Petr Rockai	7d615a3fe5	cache: Fix a segfault when passing --cachepolicy without --cachesettings.	2015-02-24 11:39:35 +01:00
Alasdair G Kergon	b18feb98e5	systemid: Fix access restrictions. When checking whether the system ID permits access to a VG, check for each permitted situation first, and only then issue the appropriate error message. Always issue a message for now. (We'll try to suppress some of those later when the VG concerned wasn't explicitly requested.) Add more messages to try to ensure every return code is checked and every error path (and only an error path) contains a log_error(). Add self-correction to vgchange -c to deal with situations where the cluster state and system ID state are out-of-sync (e.g. if old tools were used).	2015-02-23 23:19:36 +00:00
Alasdair G Kergon	df227be37c	lvm1: Reenable sys ID. Move the lvm1 sys ID into vg->lvm1_system_id and reenable the #if 0 LVM1 code. Still display the new-style system ID in the same reporting field, though, as only one can be set. Add a format feature flag FMT_SYSTEM_ON_PVS for LVM1 and disallow access to LVM1 VGs if a new-style system ID has been set. Treat the new vg->system_id as const.	2015-02-23 23:03:52 +00:00
Alasdair G Kergon	2fc2928978	config: Rename allow_system_id to extra_system_ids. Add warnings to the config file templates and briefly document each value. Configure lvmlocal.conf and install in /etc/lvm.	2015-02-23 22:19:08 +00:00
Zdenek Kabelac	a18d789684	cleanup: simplify error path code Mempool needs to free only with first alllocated element, everything allocated afterwards is released as well.	2015-02-19 14:44:04 +01:00
Zdenek Kabelac	4c184e9d6b	cleanup: drop unused value assign Dop unused value assignments. Unknown is detected via other combination (!linear && !striped). Also change the log_error() message into a warning, since the function is not really returning error, but still keep the INTERNAL_ERROR. Ret value is always set later.	2015-02-19 14:43:25 +01:00
Peter Rajnoha	ed420fb691	pvcreate: switch to "none" dev-ext source during pvcreate The dev ext source must be reset for the dev_cache_get call (which evaluates filters), not lvmcache_label_scan - so fix original commit `727c7ff85d`. Also, add comments in _pvcreate_check fn explaining why refresh filter and rescan is needed and exactly in which situations.	2015-02-19 14:34:55 +01:00
Peter Rajnoha	6b4066585f	filters: no need to refresh filters/rescan if no signature is wiped during pvcreate at all Before, we refreshed filters and we did full rescan of devices if we passed through wiping (wipe_known_signatures fn call). However, this fn returns success even if no signatures were found and so nothing was wiped. In this case, it's not necessary to do the filter refresh/rescan of devices as nothing changed clearly. This patch exports number of wiped signatures from all the wiping functions below. The caller (_pvcreate_check) then checks whether any wiping was done at all and if not, no refresh/rescan is done, saving some time and resources.	2015-02-17 09:46:34 +01:00
Peter Rajnoha	727c7ff85d	pvcreate: switch to "none" dev-ext source during pvcreate pvcreate code path executes signature wiping if there are any signatures found on device to prepare the device for PV. When the signature is wiped, the WATCH udev rule triggers the event which then updates udev database with fresh info, clearing the old record about previous signature. However, when we're using udev db as dev-ext source, we'd need to wait for this WATCH-triggered event. But we can't synchronize against such events (at least not at this moment). Without this sync, if the code continues, the device could still be marked as containing the old signature if reading udev db. This may end up even with the device to be still filtered, though the signature is already wiped. This problem is then exposed as (an example with md components): $ mdadm --create /dev/md0 --level=1 --raid-devices=2 /dev/sda /dev/sdb --run $ mdadm -S /dev/md0 $ pvcreate -y /dev/sda Wiping linux_raid_member signature on /dev/sda. /dev/sda: Couldn't find device. Check your filters? $ echo $? 5 So we need to temporarily switch off "udev" dev-ext source here in this part of pvcreate code until we find a way how to sync with WATCH events. (This problem does not occur with signature wiping which we do on newly created LVs since we already handle this properly with our udev flags - the LV_NOSCAN/LV_TEMPORARY flag. But we can't use this technique for non-dm devices to keep WATCH rule under control.)	2015-02-16 15:07:00 +01:00
David Teigland	8cdec4c434	system_id: use for VG ownership See included lvmsystemid(7) for full description.	2015-02-13 10:10:27 -06:00
Zdenek Kabelac	434031719e	raid: check lock holding LV Since raid could be used as stacked LV - check lock holding LV for proper locking type for clustered usage.	2015-01-30 14:16:27 +01:00
Zdenek Kabelac	2055b04c11	cleanup: indent tabs	2015-01-30 12:33:52 +01:00
Zdenek Kabelac	2e35c68122	lv_manip: add for_each_sub_lv_except_pools() for_each_sub_lv() now scans in depth also pools, however for rename we actually do want to skip pools. So add a new for_each_sub_lv_except_pools() to be used by rename, every other user of for_each_sub_lv() scans every sub LV with pools included. This is i.e. necessary for properly working preload of pools that are using raid arrays.	2015-01-30 12:33:52 +01:00
Peter Rajnoha	531cc58d89	lvm2app: fix lvm_lv_get_attr regression causing unknown values This is a regression from v115 where some of the fields/properties were converted to using the common "struct lvinfo" and "struct lv_seg_status" so we don't need to issue info and status ioctl several times per one reported line. Not all fields are converted yet, but one that is converted is the lv_attr field with the lv_attr_dup counterpart used in lvm_lv_get_attr lvm2app fn. These changes were introduced with `e34b004422` and later - this patch introduced the "info_ok" field in the lv_with_info_and_seg_status structure which encapsulates the lvinfo and lv_seg_status struct. For the lv_attr_dup, the lv_attr_dup code missed the assignment for the "info_ok" flag which saves the result of the lv_info_with_seg_status call. Hence such info was marked as unusable - unknown and it was returned as such via lvm_lv_get_attr lvm2app fn.	2015-01-30 09:53:34 +01:00
Zdenek Kabelac	553f37da71	raid: lock holder will skip visible raid LVs RAID marks legs as VISIBLE with notion it's not longer true raid leg - so skip tree scannig and take this LV as top-level LV.	2015-01-28 13:45:27 +01:00
Zdenek Kabelac	93b9015760	raid: fix raid image splitting When raid leg is extracted, now the preload code handles this state correctly and put proper new table entry into dm tree, so the activation of extracted leg and removed metadata works after commit.	2015-01-28 13:45:18 +01:00
Peter Rajnoha	0fddc5ab5c	coverity: missing return value check Reported by coverity for code added recently - _avoid_pvs_with_other_images_of_lv which calls process_each_sub_lv and not checking return value.	2015-01-22 10:11:19 +01:00
Peter Rajnoha	338d98be97	cleanup: for commit `7bcb3fb02d`	2015-01-21 11:29:12 +01:00
Peter Rajnoha	7bcb3fb02d	report: rename lv_error_when_full field to lv_when_full and display either "error", "queue" or "" Rename original lv_error_when_full field to lv_when_full and also convert it from binary field to string field displaying three possible values: "error", "queueu" or "" (blank for undefined). $ lvs vg/pool vg/pool1 vg/linear_lv -o+lv_when_full LV VG Attr LSize Data% Meta% WhenFull linear_lv vg -wi-a----- 4.00m pool vg twi-aotz-- 4.00m 0.00 0.98 queue pool1 vg twi-a-tz-- 4.00m 0.00 0.88 error For -S\|--select these synonyms are recognized: "error" -> "error when full", "error if no space" "queue" -> "queue when full", "queue if no space" "" -> "undefined"	2015-01-21 10:50:32 +01:00
Zdenek Kabelac	87e80b6aac	report: proper lv_attr_dup emulation We need to create a mempool for proper emulation of lv_attr_dup for lvm2api.	2015-01-20 16:24:45 +01:00
Zdenek Kabelac	d80d832ae9	report: seg_monitor undefined Add 'undefined' value for segment which do not support monitoring. Fixes crash for commands like 'pvs -o+seg_monitor'.	2015-01-20 15:02:10 +01:00
Zdenek Kabelac	b3a348c03c	report: use same info also for lv_attr Recently the single 'status' code has been used for number of cache features. Extend the API a little bit to allow usage also for lv_attr_dup. As the function itself is used in lvm2api - add a new function: lv_attr_dup_with_info_and_seg_status() that is able to use grabbed info & status information. report_init() is now using directly passed lvdm struct pointer which holds the infomation whether lv_info() was correctly obtained or there was some error when trying to read it. Move 'healt' attribute to status. TODO convert raid function to use the already known status.	2015-01-20 14:58:41 +01:00
Zdenek Kabelac	07eb1c7dc8	cleanup: add lv_is_error_when_full() macro Like with other status bits use macro for testing. (in-release update)	2015-01-20 14:52:06 +01:00
Heinz Mauelshagen	302b6c99a7	raid_manip: v2 fix multi-segment misallocation on 'lvconvert --repair' The previous patch felt short WRT disabling allocation on PVs holding other legs of the RAID LV persistently; this patch introduces an internal, transient PV flag PV_ALLOCATION_PROHIBITED to address this very problem. General problem description for completeness: An 'lvconvert --repair $RAID_LV" to replace a failed leg of a multi-segment RAID10/4/5/6 logical volume can lead to allocation of (parts of) the replacement image component pair on the physical volume of another image component (e.g. image 0 allocated on the same PV as image 1 silently impeding resilience). Patch fixes this severe resilince issue by prohibiting allocation on PVs already holding other legs of the RAID set. It allows to allocate free space on any operational PV already holding parts of the image component pair.	2015-01-16 13:44:16 +01:00
Zdenek Kabelac	2908ab3eed	thin: errrorwhenfull support Support error_if_no_space feature for thin pools. Report more info about thinpool status: (out_of_data (D), metadata_read_only (M), failed (F) also as health attribute.)	2015-01-14 14:52:05 +01:00
Heinz Mauelshagen	cdd17eee37	raid_manip: fix multi-segment misallocation on 'lvconvert --repair' An 'lvconvert --repair $RAID_LV" to replace a failed leg of a multi-segment RAID10/4/5/6 logical volume can lead to allocation of (parts of) the replacement image component pair on the physical volume of another image component (e.g. image 0 allocated on the same PV as image 1 silently impeding resilience). Patch fixes this severe resilince issue by prohibiting allocation on PVs already holding other legs of the RAID set. It allows to allocate free space on any operational PV already holding parts of the image component pair.	2015-01-14 13:41:55 +01:00
Peter Rajnoha	fb7e2ff493	metadata: add "Failed to write VG <vg_name>." on failed vg_write and revert previous patch Better than previous patch which changed log_warn to log_error - we can have multiple MDAs and if one of them fails to be written, we can still continue with other MDAs if we're in a mode where we can handle missing PVs - so keep the log_warn for single failed MDA write as it was before. However, add log_error with "Failed to write VG <vg_name>." in case we're not handling missing PVs or no MDA was written at all during VG write process. This also prevents an internal error in which the vg_write fails and we're not issuing any other log_error in vg_write caller or above, so we end up with: "Internal error: Failed command did not use log_error".	2015-01-09 14:04:44 +01:00
Peter Rajnoha	db7351d313	metadata: log_error instead of log_warn on failed mda write	2015-01-09 12:00:03 +01:00
Heinz Mauelshagen	aaecbb1818	raid: fix mirror image naming when converting from mirror to raid1 $ lvcreate -l1 -m1 --type mirror vg Logical volume "lvol0" created. $ lvconvert --type raid1 vg/lvol0 Before: $ lvs -a vg LV VG Active Attr LSize Cpy%Sync Layout Role lvol0 vg active rwi-a-r--- 4.00m 100.00 raid,raid1 public [lvol0_mimage_0_rimage_0] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_mimage_1_rimage_1] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_rmeta_0] vg active ewi-aor--- 4.00m linear private,raid,metadata [lvol0_rmeta_1] vg active ewi-aor--- 4.00m linear private,raid,metadata Incorrect name: lvol0_mimage_0_rimage_0 With this patch applied: $ lvs -a vg LV VG Active Attr LSize Cpy%Sync Layout Role lvol0 vg active rwi-a-r--- 4.00m 100.00 raid,raid1 public [lvol0_rimage_0] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_rimage_1] vg active iwi-aor--- 4.00m linear private,raid,image [lvol0_rmeta_0] vg active ewi-aor--- 4.00m linear private,raid,metadata [lvol0_rmeta_1] vg active ewi-aor--- 4.00m linear private,raid,metadata Proper name: lvol0_rimage_0	2015-01-07 13:25:08 +01:00
Peter Rajnoha	ff1eca3b6f	mirror: do not try to reactivate inactive mirror when removing its LVs which have missing PVs When mirror has missing PVs and there are mirror images on those missing PVs, we delete the images and during this delete operation, we also reactivate the LV. But if we're trying to reactivate the LV in cluster which is not active and at the same time cmirrord is not running (which is OK since we may have created the mirror LV as inactive), we end up with: "Error locking on node <node_name>: Shared cluster mirrors are not available." That is because we're trying to activate the mirror LV without cmirrord. However, there's no need to do this reactivation if the mirror LV (and hence it's sub LVs) were not activated before. This issue caused failure in mirror-vgreduce-removemissing.sh test recently with this sequence (excerpt from the test script): prepare_lvs_ lvcreate -an -Zn -l2 --type mirror -m1 --nosync -n $lv1 $vg "$dev1" $dev2" "$dev3":$BLOCKS mimages_are_on_ $lv1 "$dev1" "$dev2" mirrorlog_is_on_ $lv1 "$dev3" aux disable_dev "$dev2" vgreduce --removemissing --force $vg The important thing about that test is that we're not running cmirrord, we're activating the mirror with "-an" so it's inactive and then vgreduce --removemissing tries to reactivate the mirror images as part of the _delete_lv function call inside and since cmirrord is not running, we end up with the "Shared cluster mirrors are not available." error.	2015-01-07 11:16:19 +01:00
Petr Rockai	e97023804a	pvremove: Avoid metadata re-reads & related error messages.	2015-01-06 14:27:30 +01:00
Peter Rajnoha	509650ec4c	cmirror: do not check for cmirror availability when creating deactivated cluster mirrors When creating cluster mirrors while they're not supposed to be activated immediately after creation, we don't need to check for cmirrord availability. We can just create these mirrors and let the check to be done on activation later on. This is addendum for commit `cba6186325`.	2015-01-06 09:59:04 +01:00
Peter Rajnoha	cba6186325	cmirror: check for cmirror availability during cluster mirror creation and activation When creating/activating clustered mirrors, we should have cmirrord available and running. If it's not, we ended up with rather cryptic errors like: $ lvcreate -l1 -m1 --type mirror vg Error locking on node 1: device-mapper: reload ioctl on failed: Invalid argument Failed to activate new LV. $ vgchange -ay vg Error locking on node node 1: device-mapper: reload ioctl on failed: Invalid argument This patch adds check for cmirror availability and it errors out properly, also giving a more precise error messge so users are able to identify the source of the problem easily: $ lvcreate -l1 -m1 --type mirror vg Shared cluster mirrors are not available. $ vgchange -ay vg Error locking on node 1: Shared cluster mirrors are not available. Exclusively activated cluster mirror LVs are OK even without cmirrord: $ vgchange -aey vg 1 logical volume(s) in volume group "vg" now active	2015-01-05 16:54:07 +01:00
Zdenek Kabelac	f3bd9a2797	raid: properly rename split image When we split leg from raid - we take a proper new lock for a new LV. However for now activation checks only 'existince' of device UUID, but it's not validating device has a proper name. As a quick fix call suspend()/resume() to rename after split mirror.	2014-12-05 13:39:42 +01:00
Peter Rajnoha	a5baf13a06	pool: fix typo in error message: then -> than	2014-12-04 09:18:16 +01:00
Alasdair G Kergon	a057f40155	mirror: Validate raid region size config setting. If necessary, round down to a power of 2 the raid/mirror region size taken from the config files.	2014-12-03 22:47:08 +00:00
Alasdair G Kergon	de53e0955d	mirror: Restrict region size to power of 2.	2014-12-02 14:24:21 +00:00
Petr Rockai	2c3db52356	metadata: Add cache_policy to lvcreate_params and honour it.	2014-11-27 20:20:48 +01:00
Zdenek Kabelac	2de11c9e9e	thin: add missing 64KB rounding When chunk size needs to be estimated, the code missed to round to proper 64kb boundaries (or power of 2 for older thin pool driver). So for some data and metadata size (i.e. 10GB and 4MB) it resulted in incorrect chunk size (not being a multiple of 64KB) Fix it by adding proper rounding and also use 1 routine for 2 places where the same calculation is made. Fix also incorrect printed warning that has used 'ffs()' (which returns first 'least significant' bit in word) and it was not really giving any useful size info and replace it with properly estimated chunk size.	2014-11-26 09:29:25 +01:00
Peter Rajnoha	62f3a4d2d8	pvresize: fix size in 'Resizing to ...' verbose message to show proper result size	2014-11-25 15:19:10 +01:00
Petr Rockai	c75ae0846e	cache: Implement 'default' as a policy settings value to clear the record.	2014-11-20 16:51:07 +01:00
Petr Rockai	d22ffd8c28	cache: Add lv_cache_setpolicy to cache_manip.c.	2014-11-20 16:51:06 +01:00
Zdenek Kabelac	9f2961f259	cache: check for internal error Don't try to duplicate NULL on internal error path.	2014-11-20 16:35:46 +01:00
Zdenek Kabelac	d7985ebead	thin: fix error path Print pool name and not the origin name.	2014-11-19 18:58:30 +01:00
Zdenek Kabelac	38200c2000	cleanup: add '.' to log messages	2014-11-14 18:12:35 +01:00
Zdenek Kabelac	f36080a05d	vg_read: correct warning Use log_warn when we are effectively not creating an error - we 'allowed' inconsistent read for a reason - so it's just warning level we process inconsistent VG - it's upto caller later to decide error level of command return value and in case of error it needs to use log_error then.	2014-11-14 18:12:35 +01:00

1 2 3 4 5 ...

1938 Commits