shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	c0a505b0bb	cleanup: drop unused header files	2016-07-04 17:40:24 +02:00
David Teigland	a7c45ddc59	lvmetad: two phase vg_update Previously, a command sent lvmetad new VG metadata in vg_commit(). In vg_commit(), devices are suspended, so any memory allocation done by the command while sending to lvmetad, or by lvmetad while updating its cache could deadlock if memory reclaim was triggered. Now lvmetad is updated in unlock_vg(), after devices are resumed. The new method for updating VG metadata in lvmetad is in two phases: 1. In vg_write(), before devices are suspended, the command sends lvmetad a short message ("set_vg_info") telling it what the new VG seqno will be. lvmetad sees that the seqno is newer than the seqno of its cached VG, so it sets the INVALID flag for the cached VG. If sending the message to lvmetad fails, the command fails before the metadata is committed and the change is not made. If sending the message succeeds, vg_commit() is called. 2. In unlock_vg(), after devices are resumed, the command sends lvmetad the standard vg_update message with the new metadata. lvmetad sees that the seqno in the new metadata matches the seqno it saved from set_vg_info, and knows it has the latest copy, so it clears the INVALID flag for the cached VG. If a command fails between 1 and 2 (after committing the VG on disk, but before sending lvmetad the new metadata), the cached VG retains the INVALID flag in lvmetad. A subsequent command will read the cached VG from lvmetad, see the INVALID flag, ignore the cached copy, read the VG from disk instead, update the lvmetad copy with the latest copy from disk, (this clears the INVALID flag in lvmetad), and use the correct VG metadata for the command. (This INVALID mechanism already existed for use by lvmlockd.)	2016-06-28 02:30:31 +01:00
David Teigland	f96de67490	vgcfgrestore: check for missing device The missing device will generally be seen earlier and cause the command to not reach this point, but check anyway for completeness.	2016-06-20 16:02:07 -05:00
David Teigland	6ae22125c6	vgcfgrestore: use lvmetad disabled state Previously, vgcfgrestore would attempt to vg_remove the existing VG from lvmetad and then vg_update to add the restored VG. But, if there was a failure in the command or with vg_update, the lvmetad cache would be left incorrect. Now, disable lvmetad before the restore begins, and then rescan to populate lvmetad from disk after restore has written the new VG to disk.	2016-06-20 11:19:49 -05:00
David Teigland	01156de6f7	lvmcache: add optional dev arg to lvmcache_info_from_pvid A number of places are working on a specific dev when they call lvmcache_info_from_pvid() to look up an info struct based on a pvid. In those cases, pass the dev being used to lvmcache_info_from_pvid(). When a dev is specified, lvmcache_info_from_pvid() will verify that the cached info it's using matches the dev being processed before returning the info. Calling code will not mistakenly get info for the wrong dev when duplicate devs exist. This confusion was happening when scanning labels when duplicate devs existed. label_read for the first dev would add an info struct to lvmcache for that dev/pvid. label_read for the second dev would see the pvid in lvmcache from first dev, and mistakenly conclude that the label_read from the second dev can be skipped because it's already been done. By verifying that the dev for the cached pvid matches the dev being read, this mismatch is avoided and the label is actually read from the second duplicate.	2016-06-07 15:15:47 -05:00
Peter Rajnoha	02d67848eb	coverity: fix possible resource leak of descendants_buffer in _print_historical_lv fn	2016-05-31 09:36:58 +02:00
Alasdair G Kergon	bf8d00985a	raid0: Add raid0 segment type. This remains experimental and quite restrictive so should only be used for testing at this stage. (E.g. lvreduce is not supported.)	2016-05-23 16:46:38 +01:00
Zdenek Kabelac	9c083d34af	debug: use display_lvname Add some tracing message	2016-05-19 18:40:14 +02:00
Zdenek Kabelac	509b2e5247	debug: move misplaced log_debug It should log action before taking it instead of only in error path.	2016-04-21 00:34:01 +02:00
David Teigland	5e9e43074a	lvmetad: rework command connection setup and checking The lvmetad connection is created within the init_connections() path during command startup, rather than via the old lvmetad_active() check. The old lvmetad_active() checks are replaced with lvmetad_used() which is a simple check that tests if the command is using/connected to lvmetad. The old lvmetad_set_active(cmd, 0) calls, which stopped the command from using lvmetad (to revert to disk scanning), are replaced with lvmetad_make_unused(cmd).	2016-04-19 14:00:02 -05:00
Zdenek Kabelac	a28c81cbae	debug: unify some tracing messages Introduce FMTVGID - although it might be possibly better to ensure vgid is always \0 ended string. Unify some lvmcache reported messages.	2016-04-12 13:06:16 +02:00
David Teigland	147c9c01a2	rename function read_vgname to read_vgsummary The name did not clearly represent what it does.	2016-04-11 13:07:48 -05:00
Zdenek Kabelac	8e9deb2e70	gcc: cast time_t to 64bit Value is printed as uint64, so make sure right type is passed on all platforms. Fixes gcc warning on some 32bit platforms.	2016-03-10 18:38:54 +01:00
Peter Rajnoha	d03b1779b4	coverity: fix possible resource leak in _print_historical_lv function The code in _print_historical_lv function works with temporary "descendants_buffer" that is allocated and freed within this function. When printing text out, we used "outf" macro which called "out_text" fn and it checked return value and if failed, the macro called "return_0" automatically. But since we use the temporary buffer, if any of the out_text calls fails, we need to deallocate this buffer properly - that's the "goto_out", otherwise we'll be leaking memory. So add new "outfgo" helper macro which does the same as "outf", but it calls "goto_out" instead of "return_0" so we can jump to a cleanup hook at the end.	2016-03-07 10:43:50 +01:00
Peter Rajnoha	f833a6d074	metadata: add historical_glv_remove	2016-03-03 13:50:57 +01:00
Peter Rajnoha	673bc0636c	metadata: format_text: interconnect historical LVs among each other and also with live LVs Interconnect historical LVs in an ancestry chain and also connect the first/last one with its live ancestor/descendant if it exists.	2016-03-03 13:49:13 +01:00
Peter Rajnoha	a0842d1f25	metadata: format_text: import historical LVs Import historical LV list from metadata and add it to struct volume_group's historical_lvs list.	2016-03-03 13:46:39 +01:00
Peter Rajnoha	54d3d976c7	metadata: format_text: reuse _print_timestamp fn	2016-03-03 13:46:39 +01:00
Peter Rajnoha	3a0ef77305	metadata: format_text: also export historical LVs Also export historical LVs when exporting LVM2 metadata. This is list of all historical LVs listed in "historical_logical_volumes" metadata section with all the properties exported for each historical LV. For example, we have this thin snapshot sequence: lvol1 --> lvol2 --> lvol3 \ --> lvol4 We end up with these metadata: logical_volume { ... (lvol1, lvol3 and lvol4 listed here as usual - no change here) ... } historical_logical_volumes { lvol2 { id = "S0Dw1U-v5sF-LwAb-W9SI-pNOF-Madd-5dxSv5" creation_time = 1456919613 # 2016-03-02 12:53:33 +0100 removal_time = 1456919620 # 2016-03-02 12:53:40 +0100 origin = "lvol1" descendants = ["lvol3", "lvol4"] } } By removing lvol1 further, we end up with: historical_logical_volumes { lvol2 { id = "S0Dw1U-v5sF-LwAb-W9SI-pNOF-Madd-5dxSv5" creation_time = 1456919613 # 2016-03-02 12:53:33 +0100 removal_time = 1456919620 # 2016-03-02 12:53:40 +0100 origin = "-lvol1" descendants = ["lvol3", "lvol4"] } lvol1 { id = "me0mes-aYnK-nRfT-vNlV-UiR1-GP7r-ojbROr" creation_time = 1456919608 # 2016-03-02 12:53:28 +0100 removal_time = 1456919767 # 2016-03-02 12:56:07 +0100 } }	2016-03-03 13:46:18 +01:00
David Teigland	4de6caf5b5	redefine pvcreate structs New pv_create_args struct contains all the specific parameters for creating a PV, independent of the command.	2016-02-25 09:14:10 -06:00
David Teigland	ff2267012a	vgconvert: refactor to avoid pvcreate code This uses the vg->pv_write_list in place of the vg->pvs_to_write list, and eliminates the use of pvcreate_params. The label remove and zeroing steps are shifted out of vg_write() to the higher level like pvcreate will do.	2016-02-25 09:14:09 -06:00
Zdenek Kabelac	7d8a67714f	cleanup: drop double ;	2016-02-23 12:25:25 +01:00
Peter Rajnoha	8ad93874d6	tests: fix tests checking pv_attr - there's a new bit now	2016-02-15 12:44:46 +01:00
Peter Rajnoha	9b9f1ae772	format: format_text: add pv_needs_rewrite to format_handler and implemention for format_text	2016-02-15 12:44:46 +01:00
Peter Rajnoha	d84a80afb5	backup: backup_restore_vg: register PVs that need writing via vg->pvs_to_write list The backup_restore_vg is used directly for restoring the VG from backup. It's also used to do the VG conversions from one metadata format to another which means vgconvert calls backup_restore_vg too. When restoring VG from backup, we need to rewrite/write PV headers as PVs may have been orphans before and now they're becoming part of some VG - we need to write the PV_EXT_USED flag at least. When using the backup_restore_vg for vgconvert, we need to write completely new PV header in different format. Avoid the special "pv_write" call and handling that was used before this patch in vgconvert (vgconvert_single function to be more precise) and reuse existing internal interface to register PV header for writing (or rewriting) via vg->pvs_to_write list instead like we do it elsewhere in the code. This patch also resolves a problem in which PV headers with target format were written in the vgconvert_single fn as orphans and VG metadata were added later on - this was a tiny hack actually. We can't do this now - we need to write the PV as belonging to a VG because otherwise the PV_EXT_USED flag won't be written properly (if the PV header is written as orphan, the PV_EXT_USED is set to 0, of course, even though metadata are attached later). So this patch removes this tiny inconsistency which was passing just fine before because we didn't have any relation to the VG in PV header before. Now we have the PV_EXT_USED flag which says the "PV is used in some VG".	2016-02-15 12:44:46 +01:00
Peter Rajnoha	d320d9c52b	pv: format-text: store PV_EXT_USED flag if PV is used and unset it otherwise When adding a PV to VG, set the PV_EXT_USED flag in PV header and vice versa - if the PV is no longer in a VG, unset the flag.	2016-02-15 12:44:46 +01:00
Peter Rajnoha	10128c9bd6	metadata: schedule PV for header rewrite if adding a PV to VG or restoring VG When adding PV to VG, we need to rewrite PV header as there's a flip in PV_EXT_USED flag. The same applies if we're restoring VG from backup.	2016-02-15 12:44:46 +01:00
Peter Rajnoha	71ea2e1602	lvmcache/lvmetad: cache PV extension version Store PV extension version in lvmcache/lvmetad for use throughout the code.	2016-02-15 12:44:46 +01:00
Peter Rajnoha	7593221f94	lvmcache/lvmetad: cache PV extension flags Store PV extension flags in lvmcache/lvmetad for use throughout the code.	2016-02-15 12:44:46 +01:00
Peter Rajnoha	54b41db9a6	metadata: introduce PV_EXT_USED flag and bump PV_HEADER_EXTENSION_VSN	2016-02-15 12:44:46 +01:00
Peter Rajnoha	a522af93b7	format: add FMT_PV_FLAGS to indicate format supports PV flags	2016-02-15 12:44:46 +01:00
Zdenek Kabelac	fcbef05aae	doc: change fsf address Hmm rpmlint suggest fsf is using a different address these days, so lets keep it up-to-date	2016-01-21 12:11:37 +01:00
Alasdair G Kergon	01228b692b	vgcfgrestore: Retain allocatable PV attribute. pvchange -xn was getting lost. All PVs were set to allocatable again after restore. Moved setting ALLOCATABLE_PV outside pv_setup().	2016-01-14 00:46:45 +00:00
David Teigland	796461a912	vgrename: use process_each_vg Use process_each_vg() to lock and read the old VG, and then call the main vgrename code. When real VG names are used (not a UUID in place of the old name), the command still pre-locks the new name (when strcmp wants it locked first), before calling process_each_vg on the old name. In the case where the old name is replaced with a UUID, process_each_vg now translates that UUID into the real VG name, which it locks and reads. In this case, we cannot do pre-locking to maintain lock ordering because the old name is unknown. So, in this case the strcmp based lock ordering is suppressed and the old name is always locked first. This opens a remote chance for lock ordering conflict between racing vgrenames between two names where one or both commands use the UUID.	2015-12-14 14:26:47 -06:00
Zdenek Kabelac	748b8158b5	archiver: fix reporting for check_current_backup It's getting a bit more complex here. Basic idea behind is - check_current_backup() should not log error when a user is using a read-only filesystem, so e.g. vgscan will not report any error when it tries to take missing backup. We still have cases when error could be reported though, e.g. the backup this would be a symbolic link, but these are rather misconfiguration and unexpected case.	2015-12-04 22:10:30 +01:00
Zdenek Kabelac	e7978c5ab6	cleanup: drop log_suppress(2) usage No longer need to use log_suppress(2) instance so dropped.	2015-12-03 18:02:34 +01:00
Zdenek Kabelac	f40b3ba1e9	archiver: inital change toward proper logging We have to modes of 'archive()' usage - 1. compulsory - fail stops command and user may try '-An' option to do a command. 2. non-compulsory - some fails in archiving are ignorable (i.e. read-only filesystem where archive dir is located). Those 2 cases needs to be properly handle - i.e. the non-compulsory logging should not be tampering error logging message production. So more work here is needed	2015-12-03 18:01:45 +01:00
David Teigland	d3ca18e489	lvmcache: include system_id in vginfo cache Save system_id just like creation_host and lock_type strings in vginfo cache.	2015-11-30 11:32:17 -06:00
Zdenek Kabelac	9243877ea1	cleanup: use display_lvname Switch debug msg to use display_lvname. Link to VG early, so we have access to VG from LV.	2015-11-23 23:42:59 +01:00
Zdenek Kabelac	c3b292a4a9	format-text: ensure no division by zero Coverity likes here to be 100% sure no division by zero is possible. Add check for alignment !=0 which is made on other code paths here.	2015-11-16 01:16:11 +01:00
Zdenek Kabelac	2e04eee192	cleanup: do not test alloca for NULL alloca() never returns NULL. In case stack is out-of-range the behaviour is undefined.	2015-11-09 10:22:51 +01:00
Peter Rajnoha	ccfc09f79b	metadata: format_text: also count with calculated mda size of 0 When checking minimum mda size, make sure the mda_size after alignment and calculation is more than 0 - if there's no place for an MDA at the end of the disk, the _text_pv_add_metadata_area does not try to add it there and it returns (because we already have the MDA at the start of the disk at least).	2015-10-30 12:02:34 +01:00
Peter Rajnoha	c2e88d1107	metadata: format_text: better check for metadata overlap Actually, we don't need extra condition as introduced in commit `00348c0a63`. We should fix the last condition: (mdac->rlocn.size >= mdah->size) ...which should be: (MDA_HEADER_SIZE + (rlocn ? rlocn->size : 0) + mdac->rlocn.size >= mdah->size)) Where the "mdac" is new metadata, the "rlocn" is old metadata. So the main problem with the previous condition was that it didn't count in MDA_HEADER_SIZE properly (and possible existing metadata - the "rlocn"). This could have caused the error state where metadata in ring buffer overlap to not be hit. Replace the new condition introduced in `00348c0a63` with the improved one for the condition that existed there already but it was just incomplete.	2015-10-30 08:57:34 +01:00
Peter Rajnoha	00348c0a63	metadata: format_text: check VG metadata do not overlap themselves We're already checking whether old and new meta do not overlap in ring buffer (as we need to keep both old and new meta during vg_write up until vg_commit). We also need to check whether the new metadata do not overlap themselves in case we don't have old metadata yet (...because we're in vgcreate). This could happen if we're creating a VG so that the very first metadata written are long enough that it wraps themselves in metadata ring buffer. Although we limited the minimum metadata area size better with the previous commit `ccb8da404d` which makes the initial VG metadata overlap in ring buffer to be less probable, the risk of hitting this overlap condition is still there if we still manage to generate big enough metadata somehow. For example, users can provide many and/or long VG tags during vgcreate so that the VG metadata is long enough to start to wrap in the ring buffer again...	2015-10-29 16:46:41 +01:00
Peter Rajnoha	ccb8da404d	metadata: format_text: check metadata area size is at least MDA_SIZE_MIN	2015-10-29 16:00:32 +01:00
Peter Rajnoha	b3c81d02c9	revert: `3d03e504cd`: message about VG metadata size vs. PV mda size The message needs refinement - it's not correct in all situations.	2015-10-29 11:10:48 +01:00
Peter Rajnoha	3d03e504cd	metadata: format_text: provide more detailed error message when metadata too large for PV mda Also, leave out the note about "circular buffer" which is an internal imeplementation detail anyway and not quite informational for users: Before this patch: $ vgcreate vg1 /dev/sda VG vg1 metadata too large for circular buffer Failed to write VG vg1. With this patch applied: $ vgcreate vg1 /dev/sda VG vg1 metadata too large: size of metadata to write is 691 bytes while PV metadata area size on /dev/sda is 512 bytes. Failed to write VG vg1.	2015-10-08 16:27:03 +02:00
Alasdair G Kergon	214e2cddf6	segtypes: Use SEG_TYPE_NAME_ string constants.	2015-09-22 19:04:12 +01:00
Peter Rajnoha	fcfca57e2e	format-text: label: fix missing dev assignment for struct label in _text_pv_write When using lvm shell, some structures which are cached in memory may be reused. This happens for the struct label (a part of lvmcache_info structure) when lvmetad is used in which case the PV scan is not done that would normally overwrite these label structures in memory and making them up-to-date. This is all consequence of the fact that struct lvmcache_info and struct label are not always assigned in the same part of the code. For example, if lvmetad is not used, parts of the struct label are reassigned in label_read fn while struct lvmcache_info is created elsewhere. No part of the code reused struct label (and its "dev" field) before calling label_read fn. That's why the real bug is hidden when using lvm shell without lvmetad. However, with lvmetad and lvm shell, the situation is a bit different. The label_read fn is not called if lvmetad is used, hence the struct label may have ended up not initialized properly. There was missing assignment for the dev field in struct label in _text_pv_write fn which caused this problem to appear in lvm shell with lvmetad, for example: Before this patch: lvm> pvcreate /dev/sda Physical volume "/dev/sda" successfully created lvm> pvs /dev/sda PV VG Fmt Attr PSize PFree unknown device lvm2 --- 128.00m 128.00m With this patch applied: lvm> pvcreate /dev/sda Physical volume "/dev/sda" successfully created lvm> pvs /dev/sda PV VG Fmt Attr PSize PFree /dev/sda lvm2 --- 128.00m 128.00m Also, this problem had not appeared before changes introduced by commits `e1a63905d1` through `3a6f91d713` which, among other things, added proper label field type reporting. Before, label reporting was the same as using struct physical_volume which has its own dev field assigned and so this problem was not exposed.	2015-09-15 18:07:32 +02:00
Zdenek Kabelac	28b4fa3e27	Revert "lvmcache: check for too long pvid" This reverts commit `70db1d523d`. Since we use 'strncpy' even for case where it exactly matches the buffer size and \0 is not expected to be added there.	2015-08-18 15:22:13 +02:00
Zdenek Kabelac	a8fd88463e	cleanup: trace error from lvmcache_update_vgname_and_id Check result value from lvmcache_update_vgname_and_id().	2015-08-18 15:00:08 +02:00
Zdenek Kabelac	70db1d523d	lvmcache: check for too long pvid	2015-08-18 14:53:36 +02:00
David Teigland	e593213b87	lvmcache: add lock_type to VG summary and info structs vgsummary information contains provisional VG information that is obtained without holding the VG lock. This info can be used to lock the VG, and then read it with vg_read(). After the VG is read properly, the vgsummary info should be verified. Add the VG lock_type to the vgsummary. It needs to be known before the VG can be locked and read.	2015-07-29 14:27:32 -05:00
David Teigland	cb14bbdbc9	metadata: add comments describing lock_args for lvmlockd	2015-07-09 15:16:28 -05:00
Peter Rajnoha	3b6840e099	config: replace find_config_tree_node with find_config_tree_array where appropriate	2015-07-08 13:03:08 +02:00
Alasdair G Kergon	810ab095e6	macros: Wrap PRI with FMT. Create a set of wrappers with embedded % such as #define FMTu64 "%" PRIu64	2015-07-06 15:09:17 +01:00
David Teigland	fe70b03de2	Add lvmlockd	2015-07-02 15:42:26 -05:00
Peter Rajnoha	7b45a1fc60	refactor: rename _out_tags fn to _out_list and use it for string lists in general	2015-06-29 09:43:55 +02:00
Peter Rajnoha	f143ad3a93	cleanup: remove unused tags.c file	2015-06-29 09:43:47 +02:00
Peter Rajnoha	e29d4773f4	refactor: rename alloc_printed_tags fn to _alloc_printed_str_list and use it for string lists in general	2015-06-29 09:43:41 +02:00
Peter Rajnoha	77c2d11657	refactor: rename read_tags fn to _read_str_list and use it for string lists in general	2015-06-29 09:43:32 +02:00
Petr Rockai	c78b6f18d4	metadata: Reject lvmetad metadata extensions when reading from disk.	2015-06-10 16:25:57 +02:00
Petr Rockai	43224f22e4	format_text: Parse (optional) outdated_pvs section in VG metadata.	2015-05-20 19:46:14 +02:00
Peter Rajnoha	1806694928	metadata: use log_debug_metadata instead of general log_debug for BA debug messages	2015-05-11 11:07:53 +02:00
Zdenek Kabelac	05934d2538	format_text: properly validate PV size for restore Use 64bit arithmentic for PV size calculation (Coverity). Also remove sector shift for compared PV size, since all values are already held in sectors. This fixes validatio of PV size when restoring PV from vg metadata backup file.	2015-05-08 15:12:35 +02:00
Alasdair G Kergon	cc26085b62	alloc: Respect cling_tag_list in contig alloc. When performing initial allocation (so there is nothing yet to cling to), use the list of tags in allocation/cling_tag_list to partition the PVs. We implement this by maintaining a list of tags that have been "used up" as we proceed and ignoring further devices that have a tag on the list. https://bugzilla.redhat.com/983600	2015-04-11 01:55:24 +01:00
Alasdair G Kergon	a9d48bae2f	cache: Set correct vgid when changing PV header. pv_write is called both to write orphans and to rewrite PV headers of PVs in VGs. It needs to select the correct VG id so that the internal cache state gets updated correctly. It only affected commands that involved further steps after the pv_write and was often masked because the metadata would be re-read off disk and correct itself. "Incorrect metadata area header checksum" warnings appeared. Example: Create vg1 containing dev1, dev2 and dev3. Hide dev1 and dev2 from the system. Fix up vg1 with vgreduce --removemissing. Bring back dev1 and dev2. In a single operation reinstate dev1 and dev2 into vg1 (vgextend). Done as separate operations (automatically fix-up dev1 and dev2 as orphans, then vgextend) it worked, but done all in one go the internal cache got corrupted and warnings about checksum errors appeared.	2015-04-09 21:13:55 +01:00
Peter Rajnoha	c9f021de0b	metadata: process_each_lv_in_vg: get the list of LVs to process first, then do the processing This avoids a problem in which we're using selection on LV list - we need to do the selection on initial state and not on any intermediary state as we process LVs one by one - some of the relations among LVs can be gone during this processing. For example, processing one LV can cause the other LVs to lose the relation to this LV and hence they're not selectable anymore with the original selection criteria as it would be if we did selection on inital state. A perfect example is with thin snapshots: $ lvs -o lv_name,origin,layout,role vg LV Origin Layout Role lvol1 thin,sparse public,origin,thinorigin,multithinorigin lvol2 lvol1 thin,sparse public,snapshot,thinsnapshot lvol3 lvol1 thin,sparse public,snapshot,thinsnapshot pool thin,pool private $ lvremove -ff -S 'lv_name=lvol1 \|\| origin=lvol1' Logical volume "lvol1" successfully removed The lvremove command above was supposed to remove lvol1 as well as all its snapshots which have origin=lvol1. It failed to do so, because once we removed the origin lvol1, the lvol2 and lvol3 which were snapshots before are not snapshots anymore - the relations change as we're processing these LVs one by one. If we do the selection first and then execute any concrete actions on these LVs (which is what this patch does), the behaviour is correct then - the selection is done on the initial state: $ lvremove -ff -S 'lv_name=lvol1 \|\| origin=lvol1' Logical volume "lvol1" successfully removed Logical volume "lvol2" successfully removed Logical volume "lvol3" successfully removed Similarly for all the other situations in which relations among LVs are being changed by processing the LVs one by one. This patch also introduces LV_REMOVED internal LV status flag to mark removed LVs so they're not processed further when we iterate over collected list of LVs to be processed. Previously, when we iterated directly over vg->lvs list to process the LVs, we relied on the fact that once the LV is removed, it is also removed from the vg->lvs list we're iterating over. But that was incorrect as we shouldn't remove LVs from the list during one iteration while we're iterating over that exact list (dm_list_iterate_items safe can handle only one removal at one iteration anyway, so it can't be used here).	2015-03-24 08:43:07 +01:00
Alasdair G Kergon	a515a91fcc	format_text: Fix precommitted segfault. The code never mixes reads of committed and precommitted metadata, so there's no need to attempt to set PRECOMMITTED when *use_previous_vg is being set.	2015-03-19 11:14:47 +00:00
Alasdair G Kergon	6407d184d1	cache: Store metadata size and checksum. Refactor the recent metadata-reading optimisation patches. Remove the recently-added cache fields from struct labeller and struct format_instance. Instead, introduce struct lvmcache_vgsummary to wrap the VG information that lvmcache holds and add the metadata size and checksum to it. Allow this VG summary information to be looked up by metadata size + checksum. Adjust the debug log messages to make it clear when this shortcut has been successful. (This changes the optimisation slightly, and might be extendable further.) Add struct cached_vg_fmtdata to format-specific vg_read calls to preserve state alongside the VG across separate calls and indicate if the details supplied match, avoiding the need to read and process the VG metadata again.	2015-03-18 23:43:02 +00:00
Alasdair G Kergon	1d3711c0b2	format_text: Set system id directly. Rearrange _read_vg code to set the appropriate system id field directly.	2015-03-09 19:33:27 +00:00
Alasdair G Kergon	379d9ec8ec	systemid: Use temp status var for LVM_WRITE_LOCKED	2015-03-09 19:18:14 +00:00
Alasdair G Kergon	faccdeda83	comments: Use full flag names.	2015-03-09 18:53:22 +00:00
David Teigland	e9a233ee8e	system_id: detect an lvm1 system id Detect an lvm1 system id by looking at the WRITE_LOCKED flag. Don't copy this lvm1 system id into vg->system_id so that the restrictions associated with the new system id are not applied to the old VG with the inherited lvm1 system id.	2015-03-09 13:27:34 -05:00
Zdenek Kabelac	a9b28a4f21	lib: reduce parsing in vgname_from_mda Use similar logic as with text_vg_import_fd() and avoid repeated parsing of same mda and its config tree for vgname_from_mda(). Remember last parsed vgname, vgid and creation_host in labeller structure and if the metadata have the same size and checksum, return this stored info. TODO: The reuse of labeller struct is not ideal, some lvmcache API for this functionality would be nicer.	2015-03-06 13:53:13 +01:00
Zdenek Kabelac	7e7411966a	lib: avoid reparsing same metadata When reading VG mda from multiple PVs - do all the validation only when mda is seen for the first time and when mda checksum and length is same just return already existing VG pointer. (i.e. using 300PVs for a VG would lead to create and destroy 300 config trees....)	2015-03-06 13:53:12 +01:00
Zdenek Kabelac	6a2ae250ff	cleanup: add stack trace Missed stack in error path.	2015-03-06 13:51:54 +01:00
Zdenek Kabelac	60427d5d42	lib: return value Drop label out: with goto and return NULL directly. Add log_debug() for zero metadata offset.	2015-03-06 13:51:43 +01:00
Zdenek Kabelac	4d16bfaabb	lib: zero returned labeller struct Return zeroed struct. (Structure will be extended, so ensure all members are initilized.)	2015-03-06 13:17:39 +01:00
David Teigland	5e25bca1a9	system_id: avoid munging vg and lv fields Munge the WRITE/WRITE_LOCKED flags in a temp variable instead of in the vg/lv fields.	2015-03-05 10:23:16 -06:00
David Teigland	1e65fdd9ba	system_id: make new VGs read-only for old lvm versions Previous versions of lvm will not obey the restrictions imposed by the new system_id, and would allow such a VG to be written. So, a VG with a new system_id is further changed to force previous lvm versions to treat it as read-only. This is done by removing the WRITE flag from the metadata status line of these VGs, and putting a new WRITE_LOCKED flag in the flags line of the metadata. Versions of lvm that recognize WRITE_LOCKED, also obey the new system_id. For these lvm versions, WRITE_LOCKED is identical to WRITE, and the rules associated with matching system_id's are imposed. A new VG lock_type field is also added that causes the same WRITE/WRITE_LOCKED transformation when set. A previous version of lvm will also see a VG with lock_type as read-only. Versions of lvm that recognize WRITE_LOCKED, must also obey the lock_type setting. Until the lock_type feature is added, lvm will fail to read any VG with lock_type set and report an error about an unsupported lock_type. Once the lock_type feature is added, lvm will allow VGs with lock_type to be used according to the rules imposed by the lock_type. When both system_id and lock_type settings are removed, a VG is written with the old WRITE status flag, and without the new WRITE_LOCKED flag. This allows old versions of lvm to use the VG as before.	2015-03-05 09:50:43 -06:00
David Teigland	c6a57dc4f3	Revert "systemid: Add ACCESS_NEEDS_SYSTEM_ID VG flag." This reverts commit `bfbb5d269a`. This will be done differently.	2015-03-05 09:50:43 -06:00
Alasdair G Kergon	bfbb5d269a	systemid: Add ACCESS_NEEDS_SYSTEM_ID VG flag. Set ACCESS_NEEDS_SYSTEM_ID VG status flag whenever there is a non-lvm1 system_id set. Prevents concurrent access from older LVM2 versions. Not set on VGs that bear a system_id only due to conversion from lvm1 metadata.	2015-03-04 01:16:32 +00:00
Alasdair G Kergon	3562b5ab39	systemid: Init and merge lvm2 and lvm1 fields. Use system_id field in preference to lvm1_system_id. Initialise both for now.	2015-03-04 01:00:51 +00:00
Alasdair G Kergon	4e6f3e5162	archives: Preserve format type in file. format_text processes both lvm2 on-disk metadata and metadata read from other sources such as backup files. Add original_fmt field to retain the format type of the original metadata. Before this patch, /etc/lvm/archives would contain backups of lvm1 metadata with format = "lvm2" unless the source was lvm1 on-disk metadata.	2015-03-04 00:30:26 +00:00
Alasdair G Kergon	b18feb98e5	systemid: Fix access restrictions. When checking whether the system ID permits access to a VG, check for each permitted situation first, and only then issue the appropriate error message. Always issue a message for now. (We'll try to suppress some of those later when the VG concerned wasn't explicitly requested.) Add more messages to try to ensure every return code is checked and every error path (and only an error path) contains a log_error(). Add self-correction to vgchange -c to deal with situations where the cluster state and system ID state are out-of-sync (e.g. if old tools were used).	2015-02-23 23:19:36 +00:00
Alasdair G Kergon	a5df78e0f0	format_text: Fix creation_host_system_id. Don't escape quotes - forbidden characters.	2015-02-23 19:19:48 +00:00
Alasdair G Kergon	cc5e3dbf24	format_text: Store creation_host_system_id. Record the current system ID at the time of writing out VG metadata in the outer section of it alongside the hostname and time.	2015-02-23 17:54:47 +00:00
Zdenek Kabelac	2908ab3eed	thin: errrorwhenfull support Support error_if_no_space feature for thin pools. Report more info about thinpool status: (out_of_data (D), metadata_read_only (M), failed (F) also as health attribute.)	2015-01-14 14:52:05 +01:00
Alasdair G Kergon	9a5910bdf9	pre-release	2014-11-11 14:13:00 +00:00
Zdenek Kabelac	f5e265a07f	cache: use LV_PENDING_DELETE	2014-11-10 22:05:49 +01:00
Zdenek Kabelac	ff2e8b0de6	thin: simplify thin volume creation Move code for creation of thin volume into a single place out of lv_extend(). This allows to drop extra pool arg for alloc_lv_segment() && lv_extend() and makes code more easier to read and follow.	2014-10-26 18:37:13 +01:00
Alasdair G Kergon	5e6e2d6b1b	vgcreate: Permit non-power-of-2 extent sizes. Relax validation to permit extent sizes > 128KB that are not powers of 2 with lvm2 format. Existing code was already capable of handling this.	2014-10-14 18:12:15 +01:00
David Teigland	8dc5f42254	metadata: Use flags to control warnings. The warnings arg was used to enable logging of warnings when reading a PV. This arg is turned into a set of flags with the WARN_PV_READ flag matching the existing behavior. A new flag WARN_INCONSISTENT is added that will cause vg_read_internal() to log the "VG is not consistent" warning so the various callers do not need to log this warning themselves. A new vg_read flag READ_WARN_INCONSISTENT is used from reporting to enable the WARN_INCONSISTENT flag in vg_read_internal. [Committed by agk with cosmetic changes and tweaks.]	2014-10-07 01:15:43 +01:00
Zdenek Kabelac	392bb6f46e	fix: regression for recent persistent commit Do not let fly metadata with just 'minor' set (since they would not be readable on older version) Be permissive with invalid major/minor number and just report them as problem, but allow to use such metadata with default major:minor.	2014-09-19 17:08:41 +02:00
Zdenek Kabelac	e3cbdde070	backup: drops locked memory Since we want to backup metadata, this is the point we no longer want to hold memory locked.	2014-09-19 15:55:46 +02:00
Zdenek Kabelac	73f4fa6bc1	metadata: validate major, minor numbers Validate major, minor numbers after reading them from metadata.	2014-09-19 15:53:27 +02:00
Zdenek Kabelac	1ce21c19d5	va_list: properly pass va_list through functions Code should not just pass va_list arg through the function as args could be passed in many strange ways. Use va_copy(). For details look in i.e.: http://julipedia.meroh.net/2011/09/using-vacopy-to-safely-pass-ap.html	2014-09-16 11:42:40 +02:00
Alasdair G Kergon	979be63f25	mirrors: Fix checks for mirror/raid/pvmove LVs. Try to enforce consistent macro usage along these lines: lv_is_mirror - mirror that uses the original dm-raid1 implementation (segment type "mirror") lv_is_mirror_type - also includes internal mirror image and log LVs lv_is_raid - raid volume that uses the new dm-raid implementation (segment type "raid") lv_is_raid_type - also includes internal raid image / log / metadata LVs lv_is_mirrored - LV is mirrored using either kernel implementation (excludes non-mirror modes like raid5 etc.) lv_is_pvmove - internal pvmove volume	2014-09-16 00:13:46 +01:00
Alasdair G Kergon	2360ce3551	cleanup: Use lv_is_ macros. Use lv_is_* macros throughout the code base, introducing lv_is_pvmove, lv_is_locked, lv_is_converting and lv_is_merging. lv_is_mirror_type no longer includes pvmove.	2014-09-15 21:33:53 +01:00
Alasdair G Kergon	3366baf076	metadata: Reinstate system info in metadata. Revert part of `cac0722cac` This was deliberate and aids the investigation of problems.	2014-07-21 15:54:20 +01:00
Zdenek Kabelac	cac0722cac	metadata: use outfc for comments Few unecessary comments were written to on-disc metadata. Use outfc() to have comments only in archived files. (may also save couple bytes in ringbuffer). TODO: needed validation against newline char...	2014-07-17 16:17:44 +02:00
Peter Rajnoha	5abdb52fdc	report: select: refactor: move str_list to libdm The list of strings is used quite frequently and we'd like to reuse this simple structure for report selection support too. Make it part of libdevmapper for general reuse throughout the code. This also simplifies the LVM code a bit since we don't need to include and manage lvm-types.h anymore (the string list was the only structure defined there).	2014-06-17 16:27:20 +02:00
Peter Rajnoha	9e3e4d6994	config: differentiate command and metadata profiles and consolidate profile handling code - When defining configuration source, the code now uses separate CONFIG_PROFILE_COMMAND and CONFIG_PROFILE_METADATA markers (before, it was just CONFIG_PROFILE that did not make the difference between the two). This helps when checking the configuration if it contains correct set of options which are all in either command-profilable or metadata-profilable group without mixing these groups together - so it's a firm distinction. The "command profile" can't contain "metadata profile" and vice versa! This is strictly checked and if the settings are mixed, such profile is rejected and it's not used. So in the end, the CONFIG_PROFILE_COMMAND set of options and CONFIG_PROFILE_METADATA are mutually exclusive sets. - Marking configuration with one or the other marker will also determine the way these configuration sources are positioned in the configuration cascade which is now: CONFIG_STRING -> CONFIG_PROFILE_COMMAND -> CONFIG_PROFILE_METADATA -> CONFIG_FILE/CONFIG_MERGED_FILES - Marking configuration with one or the other marker will also make it possible to issue a command context refresh (will be probably a part of a future patch) if needed for settings in global profile set. For settings in metadata profile set this is impossible since we can't refresh cmd context in the middle of reading VG/LV metadata and for each VG/LV separately because each VG/LV can have a different metadata profile assinged and it's not possible to change these settings at this level. - When command profile is incorrect, it's rejected and also the command exits immediately - the profile must be correct for the command that was run with a profile to be executed. Before this patch, when the profile was found incorrect, there was just the warning message and the command continued without profile applied. But it's more correct to exit immediately in this case. - When metadata profile is incorrect, we reject it during command runtime (as we know the profile name from metadata and not early from command line as it is in case of command profiles) and we do continue with the command as we're in the middle of operation. Also, the metadata profile is applied directly and on the fly on find_config_tree_* fn call and even if the metadata profile is found incorrect, we still need to return the non-profiled value as found in the other configuration provided or default value. To exit immediately even in this case, we'd need to refactor existing find_config_tree_* fns so they can return error. Currently, these fns return only config values (which end up with default values in the end if the config is not found). - To check the profile validity before use to be sure it's correct, one can use : lvm dumpconfig --commandprofile/--metadataprofile ProfileName --validate (the --commandprofile/--metadataprofile for dumpconfig will come as part of the subsequent patch) - This patch also adds a reference to --commandprofile and --metadataprofile in the cmd help string (which was missing before for the --profile for some commands). We do not mention --profile now as people should use --commandprofile or --metadataprofile directly. However, the --profile is still supported for backward compatibility and it's translated as: --profile == --metadataprofile for lvcreate, vgcreate, lvchange and vgchange (as these commands are able to attach profile to metadata) --profile == --commandprofile for all the other commands (--metadataprofile is not allowed there as it makes no sense) - This patch also contains some cleanups to make the code handling the profiles more readable...	2014-05-20 16:21:48 +02:00
Peter Rajnoha	ff9d27a1c7	config: add CONFIG_FILE_SPECIAL config source id Add CONFIG_FILE_SPECIAL config source id to make a difference between real configuration tree (like lvm.conf and tag configs) and special purpose configuration tree (like LVM metadata, persistent filter). This makes it easier to attach correct customized data to the config tree that is created out of the source then.	2014-05-19 15:37:41 +02:00
Zdenek Kabelac	559c003ee2	cleanup: reduce inclusion of unnecessary headers Remove those file which are not needed by .c files or already include because the headers already needs them.	2014-04-18 16:38:50 +02:00
Peter Rajnoha	05eb6a167e	tests: add separate test file for bootloader area support and enhance tests Enahnce bootloader area test to check whether restoring values from backup works correctly.	2014-04-10 14:18:59 +02:00
Alasdair G Kergon	5d7614fcf9	format_text: Report failed close.	2014-04-04 02:28:10 +01:00
Zdenek Kabelac	0499e87ace	cleanup: simplify pv name size estimation Reuse buffer with size of 2 * PATH_MAX to handle worst case escape and avoid extra calculation of espaced len.	2014-03-26 14:11:37 +01:00
Zdenek Kabelac	65bbfdf74d	lvmetad: add missing dev_close in error path Fixes missing dev_close() in dev_read error path introduced in commit `a368698672` `3e5bec37e9` (in-release fix)	2014-03-25 14:55:58 +01:00
Zdenek Kabelac	89575d6895	cleanup: drop init of already zalloced mem	2014-03-25 11:22:59 +01:00
Zdenek Kabelac	406ec4162f	cleanup: use dm_free without extra test It's ok to free(NULL).	2014-03-25 11:22:59 +01:00
Zdenek Kabelac	08018a5345	archiver: drop unneeded backup check When the backup is disabled, avoid testing backup presence. This only leads to errors being logged in debug trace and the missing backup can't be fixed, since it's disabled.	2014-03-19 00:45:41 +01:00
Petr Rockai	fb003cdfd5	format-text: Fix a warning.	2014-02-28 16:23:16 +01:00
Petr Rockai	3e5bec37e9	format-text: Fix _raw_read_mda_header (missing close, open r/o).	2014-02-28 16:21:09 +01:00
Petr Rockai	a368698672	lvmetad: Hide corrupt MDAs from the cache. This is probably not optimal, but makes the lvmetad case mimic non-lvmetad code more closely. It also fixes vgremove of a partially corrupt VG with lvmetad, as _vg_write_raw (and consequently, entire vg_write) currently panics when it encounters a corrupt MDA. Ideally, we'd be able to explicitly control when it is safe to ignore them.	2014-02-28 11:23:52 +01:00
Peter Rajnoha	08116a4962	cleanup: missing header file	2014-02-20 09:07:38 +01:00
Petr Rockai	b391ae88e5	format-text: Avoid a label_scan while in a critical_section().	2014-02-19 17:43:30 +01:00
Jonathan Brassow	97be8b3482	cache: Code changes to allow creation of cache pools This patch allows the creation and removal of cache pools. Users are not yet able to create cache LVs. They are only able to define the space used for the cache and its characteristics (chunk_size and cache mode ATM) by creating the cache pool.	2014-02-04 11:57:08 -06:00
Alasdair G Kergon	4aa8a14fc2	compilation: Rename tags variables to tagsl.	2014-01-30 21:09:28 +00:00
Alasdair G Kergon	5eee73bd7c	pvresize: Fix orphan PV size calculation. The size of any metadata must be ignored when calculating the size of an orphan PV. Bug introduced by `603b45e0ed` ("pvresize: Do not use pv_read (get the PV from orphan VG).")	2014-01-17 01:12:04 +00:00
Alasdair G Kergon	ebac2ed5be	pvresize: Avoid archiving orphan VG metadata. Block creations of archive and backup files for internal orphan VGs. Bug introduced by `603b45e0ed` ("pvresize: Do not use pv_read (get the PV from orphan VG).")	2014-01-16 23:02:59 +00:00
Peter Rajnoha	d443bfac21	config: fix metadata/disk_areas config setting registration The metadata/disk_areas setting was incorrectly registered as "string" configuration option but it's a section where each area is defined in its own subsection with "start_sector", "size" and "id" setting. This setting is not officialy supported, it's undocumented and it's used solely for debugging. Note: At this moment, it does not seem to be working with lvmetad!	2013-12-13 16:52:51 +01:00
Zdenek Kabelac	30a81e5989	cleanup: self compilable headers	2013-12-12 13:28:19 +01:00
Zdenek Kabelac	01c438a96c	format-text: ensure aligment is not 0 Make sure this path of code is not used for alignment == 0, to prevent division by 0.	2013-11-28 12:42:39 +01:00
Zdenek Kabelac	782a356e7c	archiver: add check for dm_pool_strdup It will likely not fail to duplicate empty string, but just keep the test of result of this function consistent. Also on error path restore extent_size if in some case someone would still use that variable.	2013-11-22 21:00:54 +01:00
Zdenek Kabelac	3d3b8bfd1c	pv_write: check for lvmcache_add_mda failure Add missing test of failing lvmcache_add_mda() call.	2013-11-22 20:55:09 +01:00
Petr Rockai	9b91977f4e	labeller: Make the use of "private" as "fmt" explicit. All labellers always use the "private" (void *) field as the fmt pointer. Making this fact explicit in the type of the labeller simplifies the label reporting code which needs to extract the format. Moreover, it removes a number of error-prone casts from the code.	2013-11-17 21:41:27 +01:00
Peter Rajnoha	039bdad732	activation: flag temporary LVs internally Add LV_TEMPORARY flag for LVs with limited existence during command execution. Such LVs are temporary in way that they need to be activated, some action done and then removed immediately. Such LVs are just like any normal LV - the only difference is that they are removed during LVM command execution. This is also the case for LVs representing future pool metadata spare LVs which we need to initialize by using the usual LV before they are declared as pool metadata spare. We can optimize some other parts like udev to do a better job if it knows that the LV is temporary and any processing on it is just useless. This flag is orthogonal to LV_NOSCAN flag introduced recently as LV_NOSCAN flag is primarily used to mark an LV for the scanning to be avoided before the zeroing of the device happens. The LV_TEMPORARY flag makes a difference between a full-fledged LV visible in the system and the LV just used as a temporary overlay for some action that needs to be done on underlying PVs. For example: lvcreate --thinpool POOL --zero n -L 1G vg - first, the usual LV is created to do a clean up for pool metadata spare. The LV is activated, zeroed, deactivated. - between "activated" and "zeroed" stage, the LV_NOSCAN flag is used to avoid any scanning in udev - betwen "zeroed" and "deactivated" stage, we need to avoid the WATCH udev rule, but since the LV is just a usual LV, we can't make a difference. The LV_TEMPORARY internal LV flag helps here. If we create the LV with this flag, the DM_UDEV_DISABLE_DISK_RULES and DM_UDEV_DISABLE_OTHER_RULES flag are set (just like as it is with "invisible" and non-top-level LVs) - udev is directed to skip WATCH rule use. - if the LV_TEMPORARY flag was not used, there would normally be a WATCH event generated once the LV is closed after "zeroed" stage. This will make problems with immediated deactivation that follows.	2013-10-23 14:09:37 +02:00
Peter Rajnoha	6b35c70e8b	metadata: add INTERNAL_ERROR to "Metadata inconsistency" msg So we can spot it better if it occurs.	2013-10-10 13:34:43 +02:00
Peter Rajnoha	029b8fbe76	metadata: properly register LV_NOSCAN flag Addendum to commit `ce7489e` which introduced a new internal LV_NOSCAN flag and so it needs to be marked that way properly otherwise it ends up unrecognized and improperly handled during metadata export.	2013-10-10 13:24:32 +02:00
Alasdair G Kergon	c8057aec36	release 2.02.102 18 files changed, 137 insertions(+), 203 deletions(-)	2013-09-23 15:43:37 +01:00
Petr Rockai	3df50d822b	vgconvert: Do not call lvmetad_vg_remove (path shared with vgcfgbackup).	2013-09-18 12:53:11 +02:00
Petr Rockai	054cf25b5f	vgcfgrestore: Remove VG rom lvmetad later, to better deal with errors.	2013-09-18 11:24:58 +02:00
Peter Rajnoha	34d207d9b3	lvmetad: fix mda offset/size overflow if >= 4g (32bit) When reading an info about MDAs from lvmetad, we need to use 64 bit int to read the value of the offset/size, otherwise the value is overflows and then it's used throughout! This is dangerous if we're trying to write such metadata area then, mostly visible if we're using 2 mdas where the 2nd one is at the end of the underlying device and hence the value of the mda offset is high enough to cause problems: (the offset trimmed to value of 0 instead of 4096m, so we write at the very start of the disk (or elsewhere if the offset has some other value!) [1] raw/~ # lvcreate -s -l 100%FREE vg --virtualsize 4097m Logical volume "lvol0" created [1] raw/~ # pvcreate --metadatacopies 2 /dev/vg/lvol0 Physical volume "/dev/vg/lvol0" successfully created [1] raw/~ # hexdump -n 512 /dev/vg/lvol0 0000000 0000 0000 0000 0000 0000 0000 0000 0000 * 0000200 [1] raw/~ # pvchange -u /dev/vg/lvol0 Physical volume "/dev/vg/lvol0" changed 1 physical volume changed / 0 physical volumes not changed [1] raw/~ # hexdump -n 512 /dev/vg/lvol0 0000000 d43e d2a5 4c20 4d56 2032 5b78 4135 7225 0000010 4e30 3e2a 0001 0000 0000 0000 0000 0000 0000020 0000 0010 0000 0000 0000 0000 0000 0000 0000030 0000 0000 0000 0000 0000 0000 0000 0000 * 0000200 ======= (the offset overflows to undefined values which is far behind the end of the disk) [1] raw/~ # lvcreate -s -l 100%FREE vg --virtualsize 100g Logical volume "lvol0" created [1] raw/~ # pvcreate --metadatacopies 2 /dev/vg/lvol0 Physical volume "/dev/vg/lvol0" successfully created [1] raw/~ # pvchange -u /dev/vg/lvol0 /dev/vg/lvol0: lseek 18446744073708503040 failed: Invalid argument /dev/vg/lvol0: lseek 18446744073708503040 failed: Invalid argument Failed to store physical volume "/dev/vg/lvol0" 0 physical volumes changed / 1 physical volume not changed	2013-08-06 13:37:42 +02:00
Zdenek Kabelac	460d0254eb	thin: add pool metadata spare lv support Add support for pool's metadata spare volume.	2013-07-18 18:22:43 +02:00
Zdenek Kabelac	20187fc190	cleanup: use dm_list_empty Check for empty list directly.	2013-07-18 18:22:42 +02:00
Peter Rajnoha	7dc8c84b18	activation: add support for skipping activation of selected LVs Also add -k/--setactivationskip y/n and -K/--ignoreactivationskip options to lvcreate. The --setactivationskip y sets the flag in metadata for an LV to skip the LV during activation. Also, the newly created LV is not activated. Thin snapsots have this flag set automatically if not specified directly by the --setactivationskip y/n option. The --ignoreactivationskip overrides the activation skip flag set in metadata for an LV (just for the run of the command - the flag is not changed in metadata!) A few examples for the lvcreate with the new options: (non-thin snap LV => skip flag not set in MDA + LV activated) raw/~ $ lvcreate -l1 vg Logical volume "lvol0" created raw/~ $ lvs -o lv_name,attr vg/lvol0 LV Attr lvol0 -wi-a---- (non-thin snap LV + -ky => skip flag set in MDA + LV not activated) raw/~ $ lvcreate -l1 -ky vg Logical volume "lvol1" created raw/~ $ lvs -o lv_name,attr vg/lvol1 LV Attr lvol1 -wi------ (non-thin snap LV + -ky + -K => skip flag set in MDA + LV activated) raw/~ $ lvcreate -l1 -ky -K vg Logical volume "lvol2" created raw/~ $ lvs -o lv_name,attr vg/lvol2 LV Attr lvol2 -wi-a---- (thin snap LV => skip flag set in MDA (default behaviour) + LV not activated) raw/~ $ lvcreate -L100M -T vg/pool -V 1T -n thin_lv Logical volume "thin_lv" created raw/~ $ lvcreate -s vg/thin_lv -n thin_snap Logical volume "thin_snap" created raw/~ $ lvs -o name,attr vg LV Attr pool twi-a-tz- thin_lv Vwi-a-tz- thin_snap Vwi---tz- (thin snap LV + -K => skip flag set in MDA (default behaviour) + LV activated) raw/~ $ lvcreate -s vg/thin_lv -n thin_snap -K Logical volume "thin_snap" created raw/~ $ lvs -o name,attr vg/thin_lv LV Attr thin_lv Vwi-a-tz- (thins snap LV + -kn => no skip flag in MDA (default behaviour overridden) + LV activated) [0] raw/~ # lvcreate -s vg/thin_lv -n thin_snap -kn Logical volume "thin_snap" created [0] raw/~ # lvs -o name,attr vg/thin_snap LV Attr thin_snap Vwi-a-tz-	2013-07-12 20:39:07 +02:00
Peter Rajnoha	e21e38cf74	metadata: add support for storing profile name in metadata (during vgcreate/lvcreate) If "vgcreate/lvcreate --profile <profile_name>" is used, the profile name is automatically stored in metadata for making it possible to load it automatically next time the VG/LV is used.	2013-07-02 15:19:09 +02:00
Peter Rajnoha	50bf2c0db1	config: add profile arg to find_config_tree_int	2013-07-02 15:19:09 +02:00
Peter Rajnoha	eeb7b0f7fa	config: add profile arg to find_config_tree_node	2013-07-02 15:19:09 +02:00
Peter Rajnoha	c5e6bc393e	metadata: read VG/LV profile name from metadata if it exists and load it This is per VG/LV profile loading on demand. The profile itself is saved in struct volume_group/logical_volume as "profile" field so we can reference it whenever needed.	2013-07-02 15:19:09 +02:00
Peter Rajnoha	da3ea66a96	config: add config_source_t type to identify configuration source A helper type that helps with identification of the configuration source which makes handling the configuration cascade a bit easier, mainly removing and adding configuration trees to cascade dynamically. Currently, the possible types are: CONFIG_UNDEFINED - configuration is not defined yet (not initialized) CONFIG_FILE - one file configuration CONFIG_MERGED_FILES - configuration that is a result of merging more files into one CONFIG_STRING - configuration string typed on cmd line directly CONFIG_PROFILE - profile configuration (the new type of configuration, patches will follow...) Also, generalize existing "remove_overridden_config_tree" to work with configuration type identification in a cascade. Before, it was just the CONFIG_STRING we used. Now, we need some more to add in a cascade (like the CONFIG_PROFILE). So, we have: struct dm_config_tree remove_config_tree_by_source(struct cmd_context cmd, config_source_t source); config_source_t config_get_source_type(struct dm_config_tree *cft); ... for removing the tree by its source type from the cascade and simply getting the source type.	2013-07-02 15:19:08 +02:00
Zdenek Kabelac	b31725d0ae	archive: add missing bit set In the last update not all code paths have set the archived flag. If we run in test mode or without archiving enabled - set the bit as well - so test whether archiving has been called succesfully will be ok. (in relase fix).	2013-07-02 11:07:15 +02:00
Zdenek Kabelac	e30028004b	archiver: do not archive vg more then once Do not keep multiple archives for the executed command. Reuse the ALLOCATABLE_PV from pv status for ARCHIVED_VG vg status. Mark VG with the bit with the first archivation.	2013-07-01 23:09:26 +02:00
Peter Rajnoha	0ca1688134	metadata: log_debug only when BA found in metadata ...not the other way round as it was before. This way it makes more sense as BA use is exceptional and it's useless to contaminate the log with messages about BA not being found in metadata.	2013-06-27 16:03:35 +02:00
Peter Rajnoha	6de45db5b5	cleanup: clear outdated comment (TODO already done)	2013-06-27 15:26:39 +02:00
Zdenek Kabelac	2562968864	vgcfgrestore: fix crash on restore of wrong vgname When vgname has not existed in metadata, it has crashed on double free in format_instance destroy() - since VG was created, used FID and was released - which also released FID, so further use was accessing bad memory. Fix it for this code path before release_vg() so FID will exists when _vg_read_file_name() returns NULL.	2013-06-18 22:11:21 +02:00
Petr Rockai	c1e851e208	Move export_vg_to_config_tree alongside export_vg_to_buffer.	2013-06-10 15:55:55 +02:00
Peter Rajnoha	732859d21f	refactor: rename embedding area -> bootloader area	2013-05-28 12:37:22 +02:00
Jonathan Brassow	2e0740f7ef	RAID: Add writemostly/writebehind support for RAID1 'lvchange' is used to alter a RAID 1 logical volume's write-mostly and write-behind characteristics. The '--writemostly' parameter takes a PV as an argument with an optional trailing character to specify whether to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing character is given, it will set the flag. Synopsis: lvchange [--writemostly <PV>:{t\|y\|n}] [--writebehind <count>] vg/lv Example: lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv The last character in the 'lv_attr' field is used to show whether a device has the WriteMostly flag set. It is signified with a 'w'. If the device has failed, the 'p'artial flag has priority. Example ("nosync" raid1 with mismatch_cnt and writemostly): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg Rwi---r-m 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m Example (raid1 with mismatch_cnt, writemostly - but failed drive): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg rwi---r-p 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m A new reportable field has been added for writebehind as well. If write-behind has not been set or the LV is not RAID1, the field will be blank. Example (writebehind is set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- 512 [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor-- Example (writebehind is not set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor--	2013-04-15 13:59:46 -05:00
Peter Rajnoha	386886f71c	config: refer to config nodes using assigned IDs For example, the old call and reference: find_config_tree_str(cmd, "devices/dir", DEFAULT_DEV_DIR) ...now becomes: find_config_tree_str(cmd, devices_dir_CFG) So we're referring to the named configuration ID instead of passing the configuration path and the default value is taken from central config definition in config_settings.h automatically.	2013-03-06 10:14:33 +01:00
Peter Rajnoha	a9d0e25627	cleanup: remove struct pv_header_extension reference from struct pv_header Just to prevent accidental and improper use when reading the layout from disk because of the already existing disk_areas_xl[0] lists that are variable in size. We can read pv_header_extension only after we know exactly where the lists end...	2013-02-27 10:47:24 +01:00
Peter Rajnoha	b778653f03	pv_header_extension: add support for writing PV header extension (flags & Embedding Area) The PV header extension information (PV header extension version, flags and list of Embedding Area locations) is stored just beyond the PV header base. When calculating the Embedding Area start value (ea_start), the same logic is used as when calculating the pe_start value for Data Area - the value must follow exactly the same alignment restrictions for its start value (the alignment detected automatically or provided via command line using the --dataalignment and --dataalignmentoffset arguments). The Embedding Area is placed at the very start of the PV, starting at ea_start. The Data Area starting at pe_start is placed next. The pe_start is still properly aligned. Due to the pe_start alignment, it's possible that the resulting Embedding Area size (ea_size) ends up bigger in size than requested (but never less than requested).	2013-02-26 11:28:00 +01:00
Peter Rajnoha	9dbe25709e	pv_header_extension: add support for reading PV header extension (flags & Embedding Area) New tools with PV header extension support will read the extension if it exists and it's not an error if it does not exist (so old PVs will still work seamlessly with new tools). Old tools without PV header extension support will just ignore any extension. As for the Embedding Area location information (its start and size), there are actually two places where this is stored: - PV header extension - VG metadata The VG metadata contains a copy of what's written in the PV header extension about the Embedding Area location (NULL value is not copied): physical_volumes { pv0 { id = "AkSSRf-difg-fCCZ-NjAN-qP49-1zzg-S0Fd4T" device = "/dev/sda" # Hint only status = ["ALLOCATABLE"] flags = [] dev_size = 262144 # 128 Megabytes pe_start = 67584 pe_count = 23 # 92 Megabytes ea_start = 2048 ea_size = 65536 # 32 Megabytes } } The new metadata fields are "ea_start" and "ea_size". This is mostly useful when restoring the PV by using existing metadata backups (e.g. pvcreate --restorefile ...). New tools does not require these two fields to exist in VG metadata, they're not compulsory. Therefore, reading old VG metadata which doesn't contain any Embedding Area information will not end up with any kind of error but only a debug message that the ea_start and ea_size values were not found. Old tools just ignore these extra fields in VG metadata.	2013-02-26 11:27:23 +01:00
Peter Rajnoha	60c5d4c42f	pv_header_extension: add supporting infrastructure for PV header extension (flags & Embedding Area) PV header extension comes just beyond the existing PV header base: PV header base (existing): - uuid - device size - null-terminated list of Data Areas - null-terminater list of MetaData Areas PV header extension: - extension version - flags - null-terminated list of Embedding Areas This patch also adds "eas" (Embedding Areas) list to lvmcache (lvmcache_info) and it also adds support for common operations on the list (just like for already existing "das" - Data Areas list): - lvmcache_add_ea - lvmcache_update_eas - lvmcache_foreach_ea - lvmcache_del_eas Also, add ea_start and ea_size to struct physical_volume for processing PV Embedding Area location throughout the code (currently only one Embedding Area is supported, though the definition on disk allows for more if needed in the future...). Also, define FMT_EAS format flag to mark that the format actually supports Embedding Areas (currently format-text only).	2013-02-26 11:25:16 +01:00
Peter Rajnoha	6d8de3638c	cleanup: use struct pvcreate_restorable_params throughout	2013-02-26 11:25:11 +01:00
Zdenek Kabelac	87331dc419	thin: add support for external origin Add internal support for thin volume's external origin.	2013-02-23 10:36:58 +01:00
Peter Rajnoha	303e86adc8	pvcreate: fix alignment to incorporate alignment offset if PV has 0 MDAs If zero metadata copies are used, there's no further recalculation of PV alignment that happens when adding metadata areas to the PV and which actually calculates the alignment correctly as a matter of fact. So fix this for "PV without MDA" case as well. Before this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 8.00m After this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in: if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align) pv->pe_start = pv->pe_align ... This part of the condition is not reachable as with the PV_PE_START_CALC, we always have pv->pe_start set to 0 from the PV struct initialisation (...the pv->pe_start value is just being calculated).	2013-02-21 14:51:19 +01:00
Peter Rajnoha	a7d6a612b8	fix: 'Couldn't read extent size' --> '... extent start'	2013-02-21 13:33:27 +01:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Zdenek Kabelac	ff5612c0c3	format-text: check for _text_create_text_instance Test if 'fid' creation failed and report stack trace, break the loop and do not pass NULL fid further.	2012-12-15 17:23:23 +01:00
Zdenek Kabelac	21f6511bc2	cleanup: reorder code Swap if() test condition and check for failure and use traditional 'stack' trace.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	09b7ceea95	thin: allow restore with --force Allow restoring metadata with thin pool volumes. No validation is done for this case within vgcfgrestore tool - thus incorrect metadata may lead to destruction of pool content.	2012-11-27 14:08:24 +01:00
Petr Rockai	60668f823e	Automatically restore MISSING PVs with no MDAs.	2012-11-25 20:41:56 +01:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Petr Rockai	c9f56d639b	lvmetad: Use "%" PRId64 in place of "%d" for extra clarity.	2012-09-26 17:26:16 +02:00
Petr Rockai	2276379a71	lib/cache/lvmetad: Refactor to use dm_config_tree in requests. We were using daemon_send_simple until now, but it is no longer adequate, since we need to manipulate requests in a generic way (adding a validity token to each request), and the tree-based request interface is much more suitable for this.	2012-09-26 14:49:15 +02:00
Zdenek Kabelac	286cd2006b	cleanup: drop unneeded included header files This headers were not resolving anything used for compiled .c files. Remove unused util.c file.	2012-08-23 14:37:20 +02:00
Zdenek Kabelac	6f3cd63551	cleanup: replace memset with struct initilization Simplifies the code, properly detects too long socket paths, drops unused parameter.	2012-06-22 13:23:03 +02:00
Peter Rajnoha	9c17acdfe8	Fix division by zero if PV with zero PE count is used during vgcfgrestore.	2012-05-09 12:30:56 +00:00
Peter Rajnoha	cb08b8eb7e	Check if info struct returned is not NULL. Just some missing checks revealed by Coverity in recent code.	2012-04-10 12:26:27 +00:00
Alasdair Kergon	9c159ea320	Pass struct device around internally rather than dev_t. Add 3rd daemon return state "unknown" for lookups that are carried out successfully but don't find the item requested. Avoid issuing error messages when it's expected that a device that's being looked up in lvmetad might not be there.	2012-03-02 20:46:36 +00:00
Alasdair Kergon	d742cdf327	Change pvscan --lvmetad to pvscan --cache.	2012-03-02 18:09:46 +00:00
Alasdair Kergon	5b613cff97	Pass 'single_device' parameter down to suppress 'Can't find uuid' messages when reading VG text metadate and called from pvscan --lvmetad. (Longer-term, that check needs moving outside of that code.)	2012-02-29 02:35:35 +00:00
Zdenek Kabelac	a46cc72fd2	Add some stack traces for dev_close error paths	2012-02-28 10:11:35 +00:00
Zdenek Kabelac	d2a3352755	Just code move of hash initialization in front of function Make sure both hash tables are initialized before _read_sections() call. Presents no functional change (since PV scan phase was not adding LV hashes), but makes the code easier to handle mem failing case, and static analyzer is hapier as well.	2012-02-27 11:40:58 +00:00
Zdenek Kabelac	b9141fcefa	Add stack traces for lock_vol failures Adding at least stack traces with some FIXMEs for cases, where we might want to do something cleaver - maybe fail command or give user hints something is not going well ? For remote_backup is stack probably 'good' enough for now.	2012-02-27 11:35:59 +00:00
Zdenek Kabelac	c608e46675	Remove test for pvid Since pvid is char buffer[] and not pointer, there is no point to check it for NULL.	2012-02-27 09:54:25 +00:00
Zdenek Kabelac	b6c5ea358e	Some reformating for lvmetad uddates cleanup gcc warning, use PRIu64 header cleanups const pointer fixes.	2012-02-23 17:59:32 +00:00
Petr Rockai	dae0822698	The lvmetad client-side integration. Only active when use_lvmetad = 1 is set in lvm.conf and lvmetad is running.	2012-02-23 13:11:07 +00:00
Zdenek Kabelac	bed744c15d	Add check for mda_copy failure	2012-02-13 11:09:25 +00:00
Zdenek Kabelac	52f2f3eae4	Add free_orphan_vg Move commod code to destroy orphan VG into free_orphan_vg() function. Use orphan vgmem for creation of PV lists. Remove some free_pv_fid() calls (FIXME: check all of them) FIXME: Check whether we could merge release_vg back again for all VGs.	2012-02-13 11:03:59 +00:00
Zdenek Kabelac	f9411bb2af	Clean error paths for format instance With updated orphan VG code this code needed some updates. Add missing log_error for allocation failures.	2012-02-13 10:56:31 +00:00
Alasdair Kergon	b719e3d323	FMT_INSTANCE_VG is redundant now	2012-02-12 23:01:19 +00:00
Petr Rockai	6e41729eb8	Keep a global (per-format) orphan_vg and keep any and all orphan PVs linked to it. Avoids the need for FMT_INSTANCE_PV and enables further simplifications. No functional change, internal refactor only.	2012-02-10 02:53:03 +00:00
Petr Rockai	8e5f7cf3dc	Move lvmcache data structures behind an API (making the structures private to lvmcache.c). No functional change.	2012-02-10 01:28:27 +00:00
Zdenek Kabelac	33dea28e23	Use dm_snprintf and improve error handling Add standard error reporting with error logging. Use plain alloc instead of zalloc for string buffer. Use dm_snprintf with valid test for <0.	2012-02-08 12:50:10 +00:00
Zdenek Kabelac	ee54e43702	Fix resource leaks for failing allocation In case, something would fail during format initialization, return allocated memory.	2012-02-08 10:49:36 +00:00
Zdenek Kabelac	96bffe6a4a	Instrument code that pointer are already released Set pointers to NULL since on the function exit they are no longer valid.	2012-01-25 22:35:36 +00:00
Zdenek Kabelac	e6771e50a9	Check for correctness of uint64 value if exists	2012-01-25 21:43:51 +00:00
Zdenek Kabelac	18b3d24692	Thin until proper vgcfgrestore for thin is implementad, disable restore. Since it may probably do more harm to leave it enabled - add extra test for presence of thin volumes in VG, and in this case disable restore.	2012-01-20 11:01:13 +00:00
Zdenek Kabelac	53d7985fa1	Add support to keep info about creation time and host for each LV Basic support to keep info when the LV was created. Host and time is stored into LV mda section. FIXME: Current version doesn't support configurable string via lvm.conf and used fixed version strftime "%Y-%m-%d %T %z".	2012-01-19 15:31:45 +00:00
Zdenek Kabelac	2465451549	Rename internal macro to match signess Since _read_int64 called dm_config_get_uint64, rename it to less confusing _read_uint64.	2012-01-19 15:17:46 +00:00
Zdenek Kabelac	61158adbcf	Allow empty strings for description and creation_host config fields	2011-12-21 12:49:00 +00:00
Petr Rockai	845b1df617	Make a cleaner split between config tree and config file functionality. Move the latter out of libdm.	2011-12-18 21:56:03 +00:00
Jonathan Earl Brassow	0c506d9a40	Support the ability to replace specific devices in a RAID array. RAID is not like traditional LVM mirroring. LVM mirroring required failed devices to be removed or the logical volume would simply hang. RAID arrays can keep on running with failed devices. In fact, for RAID types other than RAID1, removing a device would mean substituting an error target or converting to a lower level RAID (e.g. RAID6 -> RAID5, or RAID4/5 to RAID0). Therefore, rather than removing a failed device unconditionally and potentially allocating a replacement, RAID allows the user to "replace" a device with a new one. This approach is a 1-step solution vs the current 2-step solution. example> lvconvert --replace <dev_to_remove> vg/lv [possible_replacement_PVs] '--replace' can be specified more than once. example> lvconvert --replace /dev/sdb1 --replace /dev/sdc1 vg/lv	2011-11-30 02:02:10 +00:00
Zdenek Kabelac	900f5f8187	Replace dynamic buffer allocations for PATH_MAX Use static buffer instead of stack allocated buffer. This reduces stack size usage of lvm tool and the change is very simple. Since the whole library is not thread safe - it should not add any new problems - and if there will be some conversion it's easy to convert this to use some preallocated buffer.	2011-11-18 19:31:09 +00:00
Peter Rajnoha	5680d14ecd	Avoid 'mda inconsistency' by properly registering UNLABELLED_PV flag (2.02.86). When a PV label write is deferred to a vg_write call (as introduced by a patch in 2.02.86), the PV is flagged with the internal UNLABELLED_PV flag. However, when calling vg_archive before vg_write, we still have the PV labelled with the UNLABELLED_PV flag which was not recognised as a proper flag while exporting VG metadata: # vgcreate vg /dev/sda No physical volume label read from /dev/sda Metadata inconsistency: Not all flags successfully exported. Metadata inconsistency: Not all flags successfully exported. Writing physical volume data to disk "/dev/sda" Physical volume "/dev/sda" successfully created Volume group "vg" successfully created	2011-11-15 11:54:15 +00:00
Zdenek Kabelac	f2c56bc3b6	Drop mempool parameter from read functions Use implicit vgmem pool.	2011-10-23 16:05:45 +00:00
Zdenek Kabelac	72ff89d279	Always use vg memory pool for allocated lv segment Remove mem pool parameter from alloc_lv_segment() Since we should always allocate LV segment from the vg mempool.	2011-10-23 16:02:01 +00:00
Alasdair Kergon	ef78ebf35a	lvcreate/remove thin_pool and thin volumes (--driverloaded n only)	2011-09-08 16:41:18 +00:00
Alasdair Kergon	9ac61d2ba2	lvcreate parsing for thin provisioning. The rest is incomplete so this isn't usable yet.	2011-09-06 00:26:42 +00:00
Zdenek Kabelac	3caa77f831	Use size_t return type Since these function returns buffer size - use size_t type for them.	2011-09-01 10:25:22 +00:00
Petr Rockai	97a4b5165e	Replace const usage of dm_config_find_node with more appropriate value-lookup functionality. A number of bugs (copied and pasted all over the code) should disappear: - most string lookup based on dm_config_find_node would segfault when encountering a non-zero integer (the intention there was to print an error message instead) - check for required sections in metadata would have been satisfied by values as well (i.e. not sections) - encountering a section in place of expected flag value would have segfaulted (due to assumed but unchecked cn->v != NULL)	2011-08-31 15:19:19 +00:00
Petr Rockai	e59e2f7c3c	Move the core of the lib/config/config.c functionality into libdevmapper, leaving behind the LVM-specific parts of the code (convenience wrappers that handle `struct device` and `struct cmd_context`, basically). A number of functions have been renamed (in addition to getting a dm_ prefix) -- namely, all of the config interface now has a dm_config_ prefix.	2011-08-30 14:55:15 +00:00
Peter Rajnoha	d35188058b	Directly allocate buffer memory in a pvck scan instead of using a mempool. There's a very high memory usage when calling _pv_analyse_mda_raw (e.g. while executing pvck) that can end up with "out of memory". _pv_analyse_mda_raw scans for metadata in the MDA, iteratively increasing the size to scan with SECTOR_SIZE until we find a probable config section or we're at the edge of the metadata area. However, when using a memory pool, we're also iteratively chasing for bigger and bigger mempool chunk which can't be found and so we're always allocating a new one, consuming more and more memory... This patch just changes the mempool to direct memory allocation in this problematic part of the code.	2011-08-29 13:37:36 +00:00
Zdenek Kabelac	077a6755ff	Replace free_vg with release_vg Move the free_vg() to vg.c and replace free_vg with release_vg and make the _free_vg internal. Patch is needed for sharing VG in vginfo cache so the release_vg function name is a better fit here.	2011-08-10 20:25:29 +00:00
Jonathan Earl Brassow	cac52ca4ce	Add basic RAID segment type(s) support. Implementation described in doc/lvm2-raid.txt. Basic support includes: - ability to create RAID 1/4/5/6 arrays - ability to delete RAID arrays - ability to display RAID arrays Notable missing features (not included in this patch): - ability to clean-up/repair failures - ability to convert RAID segment types - ability to monitor RAID segment types	2011-08-02 22:07:20 +00:00
Zdenek Kabelac	bebe60b70c	Code move of vg_mark_partial() up in stack It's useful to keep the partial flag cached - so just move the call for vg_mark_partil_lvs() into import_vg_from_config_tree() so it gets evaluated before it goes through the lvmcache. This patch should not present any functional change. Note: It is rather temporal solution - proper place is probably inside the 'read' call back - but needs some more discussion. For now using this minor hack.	2011-06-17 14:39:10 +00:00
Zdenek Kabelac	93a98c2672	Remove unused internal flag ACTIVATE_EXCL from the code	2011-06-17 14:30:58 +00:00
Petr Rockai	6d25c0d26f	Fix RHBZ 651590 (failure to lock LV results in failure to repair mirror after transient error), stemming from the following sequence of events: 1) devices fail IO, triggering repair 2) dmeventd starts fixing up the mirror 3) during the downconversion, a new metadata version is written --> the devices come back online here 4) the mirror device suspend/resume is called to update DM tables 5) during the suspend/resume cycle, pre-commit metadata is read; however, since the failed devices are now back online, we get back inconsistent set of precommit metadata and the whole operation fails The patch relaxes the check that fails in step 5 above, namely by ignoring inconsistencies coming from PVs that are marked MISSING.	2011-06-15 17:45:02 +00:00
Alasdair Kergon	3cac20f850	Defer writing PV labels to vg_write. Store label_sector only in struct physical_volume.	2011-06-01 19:29:31 +00:00
Peter Rajnoha	c08c564e21	Use new dev_open_readonly fn to prevent opening devices for read-write when not necessary. Before, we used vg_write_lock_held call to determnine the way a device is opened. Unfortunately, this opened many devices in RW mode when it was not really necessary. With the OPTIONS+="watch" rule used in the udev rules, this could fire numerous events while closing such devices (and it caused useless scans from within udev rules in return). A common bug we hit with this was with the lvremove command which was unable to remove the LV since it was being opened from within the udev rules. This patch should minimize such situations (at least with respect to LVM handling of devices). Though there's still a possibility someone will open a device 'outside' in parallel and fire the event based on the watch rule when closing a device once opened for RW.	2011-05-28 09:48:14 +00:00
Alasdair Kergon	5510b4e7d7	test update without WHATS_NEW to check it gives warning now	2011-04-29 19:06:17 +00:00
Zdenek Kabelac	b680d5bf7b	Fix use of released vgname and vgid Avoid using of already released memory when duplicated MDA is found. As get_pv_from_vg_by_id() may call lvmcache_label_scan() use the local copy of the vgname and vgid on the stack as vginfo may dissapear and code was then accessing garbage in memory. i.e. pvs /dev/loop0 (when /dev/loop0 and /dev/loop1 has same MDA content) Invalid read of size 1 at 0x523C986: dm_hash_lookup (hash.c:325) by 0x440C8C: vginfo_from_vgname (lvmcache.c:399) by 0x4605C0: _create_vg_text_instance (format-text.c:1882) by 0x46140D: _text_create_text_instance (format-text.c:2243) by 0x47EB49: _vg_read (metadata.c:2887) by 0x47FBD8: vg_read_internal (metadata.c:3231) by 0x477594: get_pv_from_vg_by_id (metadata.c:344) by 0x45F07A: _get_pv_if_in_vg (format-text.c:1400) by 0x45F0B9: _populate_pv_fields (format-text.c:1414) by 0x45F40F: _text_pv_read (format-text.c:1493) by 0x480431: _pv_read (metadata.c:3500) by 0x4802B2: pv_read (metadata.c:3462) Address 0x652ab80 is 0 bytes inside a block of size 4 free'd at 0x4C2756E: free (vg_replace_malloc.c:366) by 0x442277: _free_vginfo (lvmcache.c:963) by 0x44235E: _drop_vginfo (lvmcache.c:992) by 0x442B23: _lvmcache_update_vgname (lvmcache.c:1165) by 0x443449: lvmcache_update_vgname_and_id (lvmcache.c:1358) by 0x443C07: lvmcache_add (lvmcache.c:1492) by 0x46588C: _text_read (text_label.c:271) by 0x466A65: label_read (label.c:289) by 0x4413FC: lvmcache_label_scan (lvmcache.c:635) by 0x4605AD: _create_vg_text_instance (format-text.c:1881) by 0x46140D: _text_create_text_instance (format-text.c:2243) by 0x47EB49: _vg_read (metadata.c:2887) Add testing script	2011-04-21 13:13:40 +00:00
Zdenek Kabelac	2c5827076b	Add missing printf attributes These attributes were missing in previous patch, that was adding instrumentation for printf formating string parameter.	2011-04-08 14:21:34 +00:00
Jonathan Earl Brassow	60c10a45ce	s/MIRROR_NOTSYNCED/LV_NOTSYNCED/ - Flag will may refer to more than just mirrors	2011-03-29 12:51:57 +00:00
Alasdair Kergon	9c58641e74	Rename _check_version	2011-03-27 13:44:08 +00:00
Zdenek Kabelac	844b75f4d6	Fix allocation of system_id As code uses strncpy(system_id, NAME_LEN) and doesn't set '\0' Fix it by always allocating NAME_LEN + 1 buffer size and with zalloc we always get '\0' as the last byte. This bug may trigger some unexpected behavior of the string operation code - depends on the pool allocator. FIXME: refactor this code to alloc_vg.	2011-03-13 23:05:48 +00:00
Peter Rajnoha	ff4479414c	Use format instance mempool where possible and adequate.	2011-03-11 15:10:16 +00:00
Peter Rajnoha	e8d4946ec7	Various cleanups for fid mem and ref_count changes. Missing free_vg on error_path in lvmcache_get_vg fn. Call destroy_instance only if the fid is not part of the vg in backup_read_vg fn (otherwise it's part of the VG we're returning and we definitely don't want to destroy it!).	2011-03-11 15:08:31 +00:00
Peter Rajnoha	1307ddf4cf	Use only vg_set_fid and new pv_set_fid fn to assign the format instance. This is essential for proper format instance ref_count support. We must use these functions to set the fid everywhere from now on, even the NULL value!	2011-03-11 14:50:13 +00:00
Peter Rajnoha	293481107f	Make create_text_context fn static and move it inside create_instance fn. We'd like to use the fid mempool for text_context that is stored in the instance (we used cmd mempool before, so the order of initialisation was not a matter, but now it is since we need to create the fid mempool first which happens in create_instance fn). The text_context initialisation is not needed anywhere outside the create_instance fn so move it there.	2011-03-11 14:45:17 +00:00
Peter Rajnoha	a1bec4e685	Add mem and ref_count fields to struct format_instance for own mempool use. Format instances can be created anytime on demand and it contains metadata area information mostly (at least for now, but in the future, we may store more things here to update/edit in a PV/VG). In case we have lots of metadata areas, memory consumption will rise. Using cmd context mempool is not quite optimal here because it is destroyed too late. So let's use a separate mempool for format instances. Reference counting is used because fids could be shared, e.g. each PV has either a PV-based fid or VG-based fid. If it's VG-based, each PV has a shared fid with the VG - a reference to VG's fid.	2011-03-11 14:38:38 +00:00
Peter Rajnoha	56f5b12eed	Use new alloc_fid fn for common format instance initialisation.	2011-03-11 14:30:27 +00:00
Zdenek Kabelac	3019419e95	Refactor vg allocation code Create new function alloc_vg() to allocate VG structure. It takes pool_name (for easier debugging). and also take vg_name to futher simplify code. Move remainder of _build_vg_from_pds to _pool_vg_read and use vg memory pool for import functions. (it's been using smem -> fid mempool -> cmd mempool) (FIXME: remove mempool parameter for import functions and use vg). Move remainder of the _build_vg to _format1_vg_read	2011-03-10 12:43:29 +00:00
Peter Rajnoha	15b9215534	Use a copy if moving an mda from pv fid to vg fid. We'll destroy the pv fid (with all mdas in it) after merging all pv mdas to a vg in _text_pv_setup fn, hence we need to use a copy here!	2011-03-02 10:23:29 +00:00
Peter Rajnoha	0b100565ae	Make add_metadata_area_to_pv/remove_metadata_area_from_pv static. No need to put these in format-text.h, it's not used anywhere else actually.	2011-03-02 10:19:14 +00:00
Milan Broz	0cb777d642	Rephrase backup message.	2011-02-28 20:50:01 +00:00
Peter Rajnoha	150e43a05c	Use pv->vg_name directly instead of pv->vg->name in _text_pv_write. This also prevents a possible segfault during an automatic repair when the PV does not belong to a VG anymore and we call pv_write_orphan.	2011-02-28 17:05:48 +00:00
Peter Rajnoha	3b97e8d643	Allow non-orphan PVs with two metadata areas to be resized. We allow writing non-orphan PVs only for resize now. The "orphan PV" assert in pv_write fn uses the "allow_non_orphan" parameter to control this assert. However, we should find a more elaborate solution so we can remove this restriction altogether (pv_write together with vg_write is not atomic, we need to find a safe mechanism so there's an easy revert possible in case of an error).	2011-02-28 13:19:02 +00:00
Peter Rajnoha	4b8f066c19	vgconvert is fixed now to work with the changes in metadata area handling - enable the tests. Add a small fix that preserves pe_start for lvm1 PVs when being converted. (this fix needs to be replaced with something more clever, but let's have this working now)	2011-02-25 14:12:14 +00:00
Peter Rajnoha	4a304dc1d8	Allow only orphan PVs to be resized even with two metadata areas.	2011-02-25 14:08:54 +00:00
Peter Rajnoha	38b0564cab	Read PV metadata information from cache if pv_setup called with pv->fid == vg->fid. If the PV is already part of the VG (so the pv->fid == vg->fid), it makes no sense to attach the mdas information from PV to a VG. Instead, we read new PV metadata information from cache and attach it to the VG fid.	2011-02-25 13:59:47 +00:00
Peter Rajnoha	ea4a41e961	Fix a bug in metadata location calculation, cleanup pv_add_metadata_area fn. This bug (a missing line) caused the 2nd MDA area location to be calculated incorrectly and it didn't fit the disk size properly. (https://www.redhat.com/archives/lvm-devel/2011-February/msg00127.html)	2011-02-25 13:50:02 +00:00
Peter Rajnoha	51aed1992f	Add old_uuid field to struct physical_volume so we can still reference a PV with its old UUID when we're changig it (the cache as well as metadata area index has the old uuid that we need to use to access the information!)	2011-02-21 12:31:28 +00:00
Peter Rajnoha	cb2396730a	Change pvresize code to work with new metadata handling interface and allow resizing a PV with two metadata areas.	2011-02-21 12:27:26 +00:00
Peter Rajnoha	17ad2b1115	Change pv_write code to work with the changes in metadata handling interface and changes in format_instance.	2011-02-21 12:26:27 +00:00
Peter Rajnoha	903d7db050	Remove unused _mda_setup fn. This functionality is covered by new pv_add_metadata_area fn.	2011-02-21 12:25:16 +00:00
Peter Rajnoha	94d91fdda1	Change the code throughout to use new pv_initialise and modified pv_setup fn. Change pv_create code to work with these changes together with using new pv_add_metadata_area fn to add metadata areas for a PV being created.	2011-02-21 12:24:15 +00:00
Peter Rajnoha	617b900d85	Separate new pv_initialise function out of the original pv_setup code. pv_initiliase initialises a new PV pv_setup sets up an existing PV with a VG	2011-02-21 12:20:18 +00:00
Peter Rajnoha	981895a860	Add new pv_remove_metadata_area interface function.	2011-02-21 12:17:54 +00:00
Peter Rajnoha	8d5d20a526	Add new pv_add_metadata_area interface function.	2011-02-21 12:17:26 +00:00
Peter Rajnoha	305816232d	Remove useless mdas parameter for pv_read (from now on, we store mdas in a format instance)	2011-02-21 12:15:59 +00:00
Peter Rajnoha	f8b78ec613	Add vg_set_fid function to change VG format instance. This function also sets a reference to a new VG format instance for all PVs that are part of the VG so the PV-VG interconnection is consistent after the change.	2011-02-21 12:10:58 +00:00
Peter Rajnoha	c0c21864c6	Change the code throughout for recent changes in format_instance handling.	2011-02-21 12:07:03 +00:00
Peter Rajnoha	88129db5e1	Change create_instance to create PV-based as well as VG-based format instances. Add supporting functions to work with the format instance and metadata area structures stored within the format instance. Add support for simple indexing of metadata areas using PV id and mda order (for on-disk PV only for now, we can extend the indexing even for other mdas if needed - we only need to define a proper key for the index).	2011-02-21 12:05:49 +00:00
Zdenek Kabelac	4ebc6404ee	Void* arithmetic replaced with char*	2011-02-18 14:34:41 +00:00
Zdenek Kabelac	b1bcff7424	Critical section New strategy for memory locking to decrease the number of call to to un/lock memory when processing critical lvm functions. Introducing functions for critical section. Inside the critical section - memory is always locked. When leaving the critical section, the memory stays locked until memlock_unlock() is called - this happens with sync_local_dev_names() and sync_dev_names() function call. memlock_reset() is needed to reset locking numbers after fork (polldaemon). The patch itself is mostly rename: memlock_inc -> critical_section_inc memlock_dec -> critical_section_dec memlock -> critical_section Daemons (clmvd, dmevent) are using memlock_daemon_inc&dec (mlockall()) thus they will never release or relock memory they've already locked memory. Macros sync_local_dev_names() and sync_dev_names() are functions. It's better for debugging - and also we do not need to add memlock.h to locking.h header (for memlock_unlock() prototyp).	2011-02-18 14:16:11 +00:00

... 3 4 5 6 7 ...

785 Commits