No longer use the external 'result' pointer internally to set up the
cached label. The callback _set_label_read_result() is now given the
internal label pointer directly.
Callers that don't need the result are no longer required to pass a
label pointer into label_read().
Callers that read larger amounts of data now get a pointer to read-only
data directly without copying it through an intermediate buffer. This
data is owned by the device layer so the callers no longer free it.
If the device layer obtains the data, it passes it into the supplied
callback function and returns 1. Otherwise the callback receives failed = 1.
Updated config_file_read_fd to use this and similarly return the data
via a callback fn of its own.
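For illustration, the callback shape this describes might look roughly
like this (a minimal sketch with invented names - not the actual lvm2 API):

#include <stddef.h>
#include <stdint.h>

struct device;                  /* stand-in for the lvm2 device struct */

typedef void (*dev_read_callback_fn)(int failed, void *context,
                                     const void *data, size_t len);

/* Stand-in for the device-layer lookup that yields read-only,
 * device-owned data (hypothetical). */
const void *_dev_data(struct device *dev, uint64_t offset, size_t len);

int dev_read_callback(struct device *dev, uint64_t offset, size_t len,
                      dev_read_callback_fn fn, void *context)
{
        const void *data = _dev_data(dev, offset, len);

        if (!data) {
                fn(1, context, NULL, 0);  /* callback receives failed = 1 */
                return 0;
        }

        fn(0, context, data, len);        /* caller must not free the data */
        return 1;                         /* the data was obtained */
}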
Dedicated functions are now used to process each piece of data obtained,
so the refactoring in this file gives us one for the vgsummary and one
for the metadata header. This new type of function takes two parameters
(for now), the obtained data plus a single struct (that must not
reference any data on the stack) that wraps up the entire context needed
to process it.
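The shape of such a processing function, sketched with invented names (the
commit text only fixes the two-parameter convention and the rule that the
context struct must not reference stack data):

struct device;

/* All context needed to process one piece of data lives in a single
 * struct; it must not point at anything on the stack (hypothetical). */
struct process_vgsummary_params {
        struct device *dev;
        int ret;
        /* ... whatever else the vgsummary processing needs ... */
};

static void _process_vgsummary(const void *data,
                               struct process_vgsummary_params *params)
{
        /* parse the vgsummary using only 'data' and '*params' */
}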
Rename dev_read() to dev_read_buf() - the function that reads data
into a supplied buffer.
Introduce a new dev_read() that allocates the buffer it returns and
switch the important users over to this. No caller may change the
returned data. (For now, callers are responsible for freeing it after
use, but later the device layer will take full ownership.)
dev_read_buf() should only be used for tiny buffers or unimportant code
(such as the old disk formats).
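Side by side, the two entry points might be declared roughly like this (a
sketch; the real lvm2 prototypes may carry extra arguments such as the I/O
reason introduced later in this series):

#include <stddef.h>
#include <stdint.h>

struct device;

/* Reads into a caller-supplied buffer: tiny buffers and unimportant
 * code paths (such as the old disk formats) only. */
int dev_read_buf(struct device *dev, uint64_t offset, size_t len, void *buf);

/* Allocates the buffer it returns; callers must treat the data as
 * read-only and, for now, free it after use. */
const char *dev_read(struct device *dev, uint64_t offset, size_t len);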
The creation of wrapped around metadata - where the start of metadata is
written up to the end of the buffer and the remainder follows back at
the start of the buffer - is now restricted to cases where writing the
metadata in one piece wouldn't fit. This shouldn't happen in 'normal'
usage so let's begin treating the code for this as a special case that
can be ignored when optimising 'normal' cases.
If there is sufficient space in the metadata area, align the next
metadata to a disk offset that is a multiple of 4096 bytes and
don't write it circularly. If it doesn't all fit at the end
of the metadata area, go back to the start and write it all there
contiguously.
If there is insufficient space to use the new stricter rules, revert to
the original behaviour, aligning on 512-byte boundaries wrapping around
the circular buffer as required.
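Put together, the placement policy could be sketched like this (simplified;
names are illustrative and the real code also checks for collisions with
the old metadata copy):

#include <stdint.h>

#define MDA_HEADER_SIZE 512     /* the text-format header size in lvm2 */

/* Return the offset (relative to the metadata area start) for the next
 * metadata copy, or 0 to fall back to the legacy 512-byte-aligned
 * circular placement. */
static uint64_t _next_metadata_offset(uint64_t area_start, uint64_t area_size,
                                      uint64_t prev_end, uint64_t new_len)
{
        /* Preferred: align the absolute disk position to 4096 bytes. */
        uint64_t offset = ((area_start + prev_end + 4095) / 4096) * 4096
                          - area_start;

        if (offset + new_len <= area_size)
                return offset;          /* fits contiguously at the end */

        if (MDA_HEADER_SIZE + new_len <= area_size)
                return MDA_HEADER_SIZE; /* restart contiguously at the front */

        return 0;                       /* insufficient space: wrap circularly */
}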
Use new ALIGN_ABSOLUTE macro when calculating the start location
of new metadata and adjust the end of buffer detection so that
there is no longer an imposed gap between old and new metadata.
Currently both start and offset should always be divisible by alignment,
so this should have no effect, but a later patch will increase alignment
so these variables can no longer be optimised out.
Expand out the metadata wrapping calculations to prepare
to support a larger alignment.
The current alignment is 512 bytes so
(mdac_area_start + rlocn->offset) % alignment is zero.
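The macro's job is to round a relative offset up so that the absolute disk
position lands on an alignment boundary; one plausible definition (not
necessarily the exact lvm2 one):

/* Round 'offset' (relative to 'base') up so that (base + offset) is a
 * multiple of 'alignment'. */
#define ALIGN_ABSOLUTE(offset, base, alignment) \
        ((((base) + (offset) + (alignment) - 1) / (alignment)) * (alignment) \
         - (base))

With alignment = 512 and mdac_area_start + rlocn->offset already a 512-byte
multiple, the macro leaves the offset unchanged, matching the 'no effect'
statement above.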
Mark the first metadata area on each text format PV as MDA_PRIMARY.
Pass this information down to the device layer so that when
there are two metadata areas on a block device, we can easily
distinguish two independent streams of I/O.
Introduce enum dev_io_reason to categorise block device I/O
in debug messages so it's obvious what it is for.
DEV_IO_SIGNATURES /* Scanning device signatures */
DEV_IO_LABEL /* LVM PV disk label */
DEV_IO_MDA_HEADER /* Text format metadata area header */
DEV_IO_MDA_CONTENT /* Text format metadata area content */
DEV_IO_FMT1 /* Original LVM1 metadata format */
DEV_IO_POOL /* Pool metadata format */
DEV_IO_LV /* Content written to an LV */
DEV_IO_LOG /* Logging messages */
When an ignored metadata area gets flagged for use again, make sure the
code doesn't try to parse its old metadata. Firstly by trying to detect
this situation and skipping the read (while still remembering the
position reached in the circular buffer), and secondly by clearing the
invalid live metadata location on disk as a precaution when subsequently
writing out the precommitted metadata.
Problems showed up when a metadata area in one VG got moved to
another VG in ignored state (still holding metadata for the original
VG) and then later got brought into use in the new VG - only the header
should be read in this case, not any of the metadata content.
vgmerge suffers from a similar problem to the one fixed in commit
8146548d25 ("vgsplit: Fix intermediate
metadata corruption.")
When merging, splitting or renaming VGs, use a new PV status flag
PV_MOVED_VG to mark the PVs that hold metadata with the old VG name and
use this to provide PV-level granularity instead of incorrectly assuming
all PVs in the VG are the same.
Changing the VG of a PV uses the same on-disk mechanism as vgrename.
This relies on recognising both the old and new VG names. Prior to this
patch the vgsplit code incorrectly provided the new VG name twice
instead of the old and new ones. This led the low-level mechanism not
to recognise the device as already belonging to a VG and so paying no
attention to the location of its existing metadata, sometimes partly
overwriting it and then later trying to read the corrupt metadata and
issuing a checksum error.
lvmcache_foreach_mda() can fail for numerous reasons (out-of-memory...)
and its failure code cannot be ignored.
TODO: might need more error handling tuning.
Fix the code checking that the 2nd mda, which is at the end of the disk,
really fits the available free space and avoid any DA and MDA interleaving
when we already have a DA preallocated. This mainly applies when we're
restoring a PV from a VG backup using pvcreate --restorefile, where we may
already have a DA preallocated - this means the PV was in a VG before with
space already allocated from it (LVs were created). Hence we need to avoid
stepping into the DA - the MDA can never be inside the DA in such a case!
The code responsible for this calculation was already in the
_text_pv_add_metadata_area fn, but it had a bug in the calculation: we
subtracted one sector too many by mistake, so the code could still
incorrectly allocate the MDA inside an existing DA. The patch also renames
the variable in the code so it doesn't confuse us in future.
Also, if the 2nd mda doesn't fit, don't silently continue with just 1
MDA (at the start of the disk). If the 2nd mda was requested and we can't
create it due to unavailable space, error out correctly (the patch
also adds a test to shell/pvcreate-operation.sh for this case).
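The corrected boundary check amounts to something like this (a sketch with
illustrative names; the real code in _text_pv_add_metadata_area works in
sectors and applies alignment):

#include <stdint.h>

/* The 2nd MDA sits at the end of the device and must never step into
 * a preallocated data area (DA). */
static int _mda2_fits(uint64_t dev_size, uint64_t mda2_size, uint64_t da_end)
{
        uint64_t mda2_start = dev_size - mda2_size;

        /* No extra sector subtracted here: the MDA may begin exactly
         * where the DA ends, but never before it. */
        return mda2_start >= da_end;
}

If this check fails and the 2nd mda was explicitly requested, the command
errors out instead of silently continuing with a single MDA.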
In order to reject out-of-place reshaping with the segment data_offset
field on an old runtime, add a respective segment type incompatibility
flag causing "+RESHAPE_DATA_OFFSET" to be suffixed to the segment
type name.
Prohibit activation of reshaping RaidLVs on incompatible
lvm2 runtime by storing e.g. 'raid5+RESHAPE' segment type
strings in the lvm2 metadata. Incompatible runtime not
supporting reshaping won't be able to activate those thus
avoiding potential data corruption.
Any new non-reshaping lvconvert command will reset the
segment type string from 'raid5+RESHAPE' to 'raid5'.
See commits
0299a7af1e and
4141409eb0
for segtype flag support.
Since lvmetad is using 'MISSING' in the status field for another purpose,
for now we also need to support reading the flag from that place.
Until this is fixed properly, we accept both flags - although lvm2 will
only print the flag in the flags field.
Switch METADATA_FORMAT flag usage to be stored via the segtype
instead of the 'status' flag, which appeared to cause major
incompatibility troubles.
For backward compatibility, segtype flags are still also accepted
via 'status' bits, which were used from version 2.02.169, so metadata
saved by that newer lvm2 version should still work nicely, although
metadata saved the new way will no longer work on that older lvm2 version.
Allow storing LV status bits with the segment type name field.
Switching to this since this field has better support for compatibility
with older versions of lvm2 - an unknown segtype will not cause
complete invisibility of the metadata to older lvm2 code - just the
particular LV will become unusable with an unknown type of segment.
This patch fixes lvm2 compilation on the x32 arch
(using 64-bit x86 cpu features but running on a 32-bit address space,
so consuming less memory in a VM).
On the x32 arch, 'time_t' is 64-bit while 'long' is 32-bit.
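Formatting a time_t with "%ld" therefore truncates on x32; the portable
pattern is to cast to a fixed-width type first (a hedged example of the
fix pattern, not the exact lvm2 patch):

#include <stdio.h>
#include <time.h>
#include <inttypes.h>

void print_creation_time(time_t t)
{
        /* time_t may be wider than long (64-bit vs 32-bit on x32),
         * so cast to a known-width type before printing. */
        printf("creation_time = %" PRId64 "\n", (int64_t) t);
}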
Cache pool reads/writes metadata_format within its segment type.
For a CachePoolLV, an unselected metadata format is NOT stored in the metadata.
For a CacheLV, when the metadata format is not present/selected in the lvm2
metadata, it's automatically assumed to be version 1 (backward compatible).
To ensure older lvm2 will not misread metadata with the new version 2,
such an LV is marked with the METADATA_FORMAT status flag (the segment is
specifying the metadata format). So when a cache uses metadata format 2,
it will become inaccessible on an older system without such support
(kernel dm cache < 1.10, lvm2 < 2.02.169).
In order to support striped raid5/6/10 LV reshaping (change
of LV type, stripesize or number of legs), this patch
introduces infrastructure prerequisites to be used
by raid_manip.c extensions in followup patches.
This base is needed for allocation of out-of-place
reshape space required by the MD raid personalities to
avoid writing over data in-place when reading off the
current RAID layout or number of legs and writing out
the new layout or to a different number of legs
(i.e. restripe).
Changes:
- add members reshape_len to 'struct lv_segment' to store
out-of-place reshape length per component rimage
- add member data_copies to struct lv_segment
to support more than 2 raid10 data copies
- make alloc_lv_segment() aware of both reshape_len and data_copies
- adjust all alloc_lv_segment() callers to the new API
- add functions to retrieve the current data offset (needed for
out-of-place reshaping space allocation) and the devices count
from the kernel
- make libdm deptree code aware of reshape_len
- add LV flags for disk add/remove reshaping
- support import/export of the new 'struct lv_segment' members
- enhance lv_extend/_lv_reduce to cope with reshape_len
- add seg_is_*/segtype_is_* macros related to reshaping
- add target version check for reshaping
- grow rebuilds/writemostly bitmaps to 246 bits to support the kernel maximum
- enhance libdm deptree code to support data_offset (out-of-place reshaping)
and delta_disk (legs add/remove reshaping) target arguments
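A minimal sketch of the 'struct lv_segment' additions from the list above
(field names from the list; the surrounding struct is abridged):

#include <stdint.h>

struct lv_segment {
        /* ... existing members ... */
        uint32_t reshape_len;   /* out-of-place reshape space per
                                   component rimage */
        uint32_t data_copies;   /* raid10 may now have more than 2 */
};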
Related: rhbz834579
Related: rhbz1191935
Related: rhbz1191978
When a command calls backup() more than once (which is actually not
wanted), this warning message is shown repeatedly:
"WARNING: This metadata update is NOT backed up."
Now print the message just once and confuse the user less.
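A straightforward way to get the print-once behaviour is a flag in the
command context, roughly (sketch; the actual lvm2 field name may differ):

#include <stdio.h>

struct cmd_context_sketch {
        unsigned backup_warning_shown;   /* hypothetical flag */
};

static void _warn_backup_once(struct cmd_context_sketch *cmd)
{
        if (cmd->backup_warning_shown)
                return;
        fprintf(stderr, "WARNING: This metadata update is NOT backed up.\n");
        cmd->backup_warning_shown = 1;
}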
Previously, a command sent lvmetad new VG metadata in vg_commit().
In vg_commit(), devices are suspended, so any memory allocation
done by the command while sending to lvmetad, or by lvmetad while
updating its cache could deadlock if memory reclaim was triggered.
Now lvmetad is updated in unlock_vg(), after devices are resumed.
The new method for updating VG metadata in lvmetad is in two phases:
1. In vg_write(), before devices are suspended, the command sends
lvmetad a short message ("set_vg_info") telling it what the new
VG seqno will be. lvmetad sees that the seqno is newer than
the seqno of its cached VG, so it sets the INVALID flag for the
cached VG. If sending the message to lvmetad fails, the command
fails before the metadata is committed and the change is not made.
If sending the message succeeds, vg_commit() is called.
2. In unlock_vg(), after devices are resumed, the command sends
lvmetad the standard vg_update message with the new metadata.
lvmetad sees that the seqno in the new metadata matches the
seqno it saved from set_vg_info, and knows it has the latest
copy, so it clears the INVALID flag for the cached VG.
If a command fails between 1 and 2 (after committing the VG on disk,
but before sending lvmetad the new metadata), the cached VG retains
the INVALID flag in lvmetad. A subsequent command will read the
cached VG from lvmetad, see the INVALID flag, ignore the cached
copy, read the VG from disk instead, update the lvmetad copy
with the latest copy from disk, (this clears the INVALID flag
in lvmetad), and use the correct VG metadata for the command.
(This INVALID mechanism already existed for use by lvmlockd.)
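In outline, the command-side sequence is (a compilable sketch; the message
wrapper names are illustrative):

#include <stdint.h>

struct volume_group;    /* opaque stand-ins for the sketch */
int lvmetad_send_set_vg_info(struct volume_group *vg, uint32_t new_seqno);
int vg_commit(struct volume_group *vg);
int lvmetad_send_vg_update(struct volume_group *vg);

int update_vg_two_phase(struct volume_group *vg, uint32_t new_seqno)
{
        /* Phase 1, in vg_write(), before devices are suspended:
         * lvmetad marks its cached copy INVALID for the new seqno. */
        if (!lvmetad_send_set_vg_info(vg, new_seqno))
                return 0;       /* fail before anything is committed */

        if (!vg_commit(vg))
                return 0;

        /* ... devices resumed ... */

        /* Phase 2, in unlock_vg(): the seqno matches the one sent in
         * phase 1, so lvmetad clears the INVALID flag. */
        return lvmetad_send_vg_update(vg);
}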
Previously, vgcfgrestore would attempt to vg_remove the
existing VG from lvmetad and then vg_update to add the
restored VG. But, if there was a failure in the command
or with vg_update, the lvmetad cache would be left incorrect.
Now, disable lvmetad before the restore begins, and then
rescan to populate lvmetad from disk after restore has
written the new VG to disk.
A number of places are working on a specific dev when they
call lvmcache_info_from_pvid() to look up an info struct
based on a pvid. In those cases, pass the dev being used
to lvmcache_info_from_pvid(). When a dev is specified,
lvmcache_info_from_pvid() will verify that the cached
info it's using matches the dev being processed before
returning the info. Calling code will not mistakenly
get info for the wrong dev when duplicate devs exist.
This confusion was happening when scanning labels when
duplicate devs existed. label_read for the first dev
would add an info struct to lvmcache for that dev/pvid.
label_read for the second dev would see the pvid in
lvmcache from first dev, and mistakenly conclude that
the label_read from the second dev can be skipped
because it's already been done. By verifying that the
dev for the cached pvid matches the dev being read,
this mismatch is avoided and the label is actually read
from the second duplicate.
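The verification amounts to a dev parameter plus one extra comparison,
roughly (sketch; the real lvmcache structures are more involved):

struct device;

struct info_sketch {            /* minimal stand-in for lvmcache_info */
        struct device *dev;
};

struct info_sketch *_info_lookup(const char *pvid);   /* existing lookup */

/* When the caller is working on a specific dev, only return cached
 * info that actually belongs to that dev. */
struct info_sketch *info_from_pvid(const char *pvid, struct device *dev)
{
        struct info_sketch *info = _info_lookup(pvid);

        if (!info)
                return NULL;

        if (dev && info->dev != dev)
                return NULL;    /* duplicate PVID on another device */

        return info;
}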
The lvmetad connection is created within the
init_connections() path during command startup,
rather than via the old lvmetad_active() check.
The old lvmetad_active() checks are replaced
with lvmetad_used() which is a simple check that
tests if the command is using/connected to lvmetad.
The old lvmetad_set_active(cmd, 0) calls, which
stopped the command from using lvmetad (to revert to
disk scanning), are replaced with lvmetad_make_unused(cmd).
The code in the _print_historical_lv function works with a temporary
"descendants_buffer" that is allocated and freed within this
function.
When printing text out, we used the "outf" macro, which calls the
"out_text" fn, checks its return value and, on failure, calls
"return_0" automatically. But since we use the temporary buffer,
if any of the out_text calls fails, we need to deallocate this
buffer properly - that's the "goto_out" - otherwise we'd be leaking
memory.
So add a new "outfgo" helper macro which does the same as "outf",
but calls "goto_out" instead of "return_0" so we can jump
to a cleanup hook at the end.
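The two macros plausibly differ only in the failure action (patterned on
the description above; the exact lvm2 definitions may vary):

/* return_0 and goto_out are lvm2's log-and-bail helpers: the first
 * returns 0 from the current function, the second jumps to a local
 * 'out:' cleanup label. */
#define outf(f, str...)   do { if (!out_text(f, str)) return_0; } while (0)
#define outfgo(f, str...) do { if (!out_text(f, str)) goto_out; } while (0)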
Also export historical LVs when exporting LVM2 metadata.
This is a list of all historical LVs recorded in the
"historical_logical_volumes" metadata section, with all
the properties exported for each historical LV.
For example, we have this thin snapshot sequence:
lvol1 --> lvol2 --> lvol3
               \
                --> lvol4
We end up with these metadata:
logical_volume {
    ...
    (lvol1, lvol3 and lvol4 listed here as usual - no change here)
    ...
}
historical_logical_volumes {
    lvol2 {
        id = "S0Dw1U-v5sF-LwAb-W9SI-pNOF-Madd-5dxSv5"
        creation_time = 1456919613 # 2016-03-02 12:53:33 +0100
        removal_time = 1456919620 # 2016-03-02 12:53:40 +0100
        origin = "lvol1"
        descendants = ["lvol3", "lvol4"]
    }
}
By removing lvol1 further, we end up with:
historical_logical_volumes {
    lvol2 {
        id = "S0Dw1U-v5sF-LwAb-W9SI-pNOF-Madd-5dxSv5"
        creation_time = 1456919613 # 2016-03-02 12:53:33 +0100
        removal_time = 1456919620 # 2016-03-02 12:53:40 +0100
        origin = "-lvol1"
        descendants = ["lvol3", "lvol4"]
    }
    lvol1 {
        id = "me0mes-aYnK-nRfT-vNlV-UiR1-GP7r-ojbROr"
        creation_time = 1456919608 # 2016-03-02 12:53:28 +0100
        removal_time = 1456919767 # 2016-03-02 12:56:07 +0100
    }
}
This uses the vg->pv_write_list in place of the
vg->pvs_to_write list, and eliminates the use of
pvcreate_params. The label remove and zeroing
steps are shifted out of vg_write() to the higher
level like pvcreate will do.
The backup_restore_vg is used directly for restoring the VG from backup.
It's also used to do the VG conversions from one metadata format to
another which means vgconvert calls backup_restore_vg too.
When restoring VG from backup, we need to rewrite/write PV headers as
PVs may have been orphans before and now they're becoming part of some
VG - we need to write the PV_EXT_USED flag at least.
When using backup_restore_vg for vgconvert, we need to write
a completely new PV header in the different format.
Avoid the special "pv_write" call and handling that was used before
this patch in vgconvert (the vgconvert_single function to be more precise)
and reuse the existing internal interface to register the PV header for
writing (or rewriting) via the vg->pvs_to_write list instead, as we do
elsewhere in the code.
This patch also resolves a problem in which PV headers in the target
format were written in the vgconvert_single fn as orphans and the VG
metadata were added only later on - this was actually a tiny hack.
We can't do this now - we need to write the PV as belonging
to a VG because otherwise the PV_EXT_USED flag won't be written
properly (if the PV header is written as an orphan, the PV_EXT_USED
flag is set to 0, of course, even though metadata are attached later).
So this patch removes this tiny inconsistency, which passed
just fine before because we didn't have any relation to the VG
in the PV header before. Now we have the PV_EXT_USED flag which says
"this PV is used in some VG".
Use process_each_vg() to lock and read the old VG,
and then call the main vgrename code.
When real VG names are used (not a UUID in place of the
old name), the command still pre-locks the new name
(when strcmp wants it locked first), before calling
process_each_vg on the old name.
In the case where the old name is replaced with a UUID,
process_each_vg now translates that UUID into the real
VG name, which it locks and reads. In this case, we
cannot do pre-locking to maintain lock ordering because
the old name is unknown. So, in this case the strcmp
based lock ordering is suppressed and the old name is
always locked first. This opens a remote chance for
lock ordering conflict between racing vgrenames between
two names where one or both commands use the UUID.
It's getting a bit more complex here.
The basic idea behind this is - check_current_backup() should not
log an error when a user is using a read-only filesystem,
so e.g. vgscan will not report any error when it tries
to take a missing backup.
We still have cases where an error could be reported though,
e.g. when the backup would be a symbolic link, but these
are rather misconfigurations and unexpected cases.
We have two modes of 'archive()' usage -
1. compulsory - a failure stops the command and the user may try
the '-An' option to run the command anyway.
2. non-compulsory - some failures in archiving are ignorable (i.e. a
read-only filesystem where the archive dir is located).
Those 2 cases need to be handled properly - i.e. the non-compulsory
path's logging should not interfere with error message production.
So more work is needed here.
When checking minimum mda size, make sure the mda_size after alignment
and calculation is more than 0 - if there's no place for an MDA at the
end of the disk, the _text_pv_add_metadata_area does not try to add it
there and it returns (because we already have the MDA at the start of
the disk at least).
Actually, we don't need the extra condition introduced in commit
00348c0a63. We should fix the last
condition:
(mdac->rlocn.size >= mdah->size)
...which should be:
(MDA_HEADER_SIZE + (rlocn ? rlocn->size : 0) + mdac->rlocn.size >= mdah->size))
Where the "mdac" is new metadata, the "rlocn" is old metadata.
So the main problem with the previous condition was that it
didn't properly account for MDA_HEADER_SIZE (and for possible
existing metadata - the "rlocn"). This could have caused the error
state - where old and new metadata overlap in the ring buffer -
never to be hit.
Replace the new condition introduced in 00348c0a63
with an improved version of the condition that already existed
there but was incomplete.
We're already checking whether old and new meta do not overlap in
ring buffer (as we need to keep both old and new meta during vg_write
up until vg_commit).
We also need to check that the new metadata does not overlap
itself in case we don't have any old metadata yet (...because
we're in vgcreate). This could happen when creating a VG if the
very first metadata written is long enough that it wraps around
in the metadata ring buffer.
Although we limited the minimum metadata area size better with the
previous commit ccb8da404d, which
makes an initial VG metadata overlap in the ring buffer less
probable, the risk of hitting this overlap condition is still there
if we still manage to generate big enough metadata somehow.
For example, users can provide many and/or long VG tags during vgcreate
so that the VG metadata is long enough to start to wrap in the ring
buffer again...
Also, leave out the note about the "circular buffer", which is
an internal implementation detail anyway and not very
informative for users:
Before this patch:
$ vgcreate vg1 /dev/sda
VG vg1 metadata too large for circular buffer
Failed to write VG vg1.
With this patch applied:
$ vgcreate vg1 /dev/sda
VG vg1 metadata too large: size of metadata to write is 691 bytes while PV metadata area size on /dev/sda is 512 bytes.
Failed to write VG vg1.
When using the lvm shell, some structures which are cached in memory may
be reused. This happens for the struct label (a part of the lvmcache_info
structure) when lvmetad is used, in which case the PV scan that would
normally overwrite these label structures in memory and bring them
up-to-date is not done.
This is all consequence of the fact that struct lvmcache_info and
struct label are not always assigned in the same part of the code.
For example, if lvmetad *is not* used, parts of the struct label are
reassigned in label_read fn while struct lvmcache_info is created
elsewhere. No part of the code reused struct label (and its "dev"
field) before calling label_read fn. That's why the real bug is
hidden when using lvm shell without lvmetad.
However, with lvmetad and lvm shell, the situation is a bit different.
The label_read fn is not called if lvmetad *is* used, hence the
struct label may have ended up not initialized properly.
There was a missing assignment for the dev field in struct label
in the _text_pv_write fn, which caused this problem to appear in
lvm shell with lvmetad, for example:
Before this patch:
lvm> pvcreate /dev/sda
Physical volume "/dev/sda" successfully created
lvm> pvs /dev/sda
PV VG Fmt Attr PSize PFree
unknown device lvm2 --- 128.00m 128.00m
With this patch applied:
lvm> pvcreate /dev/sda
Physical volume "/dev/sda" successfully created
lvm> pvs /dev/sda
PV VG Fmt Attr PSize PFree
/dev/sda lvm2 --- 128.00m 128.00m
Also, this problem had not appeared before changes introduced
by commits e1a63905d1 through
3a6f91d713 which, among other
things, added proper label field type reporting. Before, label
reporting was the same as using struct physical_volume which
has its own dev field assigned and so this problem was not exposed.
This reverts commit 70db1d523d.
Since we use 'strncpy' even for the case where the source exactly
fills the buffer and no terminating \0 is expected to be added.
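For reference, the strncpy() property the revert relies on is standard C
behaviour:

#include <stdio.h>
#include <string.h>

int main(void)
{
        char buf[4];

        /* strncpy copies at most sizeof(buf) bytes and does NOT add a
         * terminating '\0' when the source fills the buffer. */
        strncpy(buf, "ABCD", sizeof(buf));

        /* buf is {'A','B','C','D'} - intentionally unterminated, which
         * is fine when the on-disk field is fixed-width. */
        printf("%.4s\n", buf);
        return 0;
}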
vgsummary information contains provisional VG information
that is obtained without holding the VG lock. This info
can be used to lock the VG, and then read it with vg_read().
After the VG is read properly, the vgsummary info should
be verified.
Add the VG lock_type to the vgsummary. It needs to be
known before the VG can be locked and read.
Use 64-bit arithmetic for the PV size calculation (Coverity).
Also remove the sector shift for the compared PV size, since all
values are already held in sectors.
This fixes validation of the PV size when restoring a PV
from a VG metadata backup file.
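The usual shape of such a fix is to widen one operand before the multiply
(a hedged example; variable names are illustrative):

#include <stdint.h>

/* A 32-bit * 32-bit multiply would overflow before any implicit
 * widening, so cast first and multiply in 64 bits. Both values are
 * already in sectors, so no extra sector shift is applied. */
uint64_t pv_size_sectors(uint32_t pe_count, uint32_t pe_size_sectors)
{
        return (uint64_t) pe_count * pe_size_sectors;
}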
When performing initial allocation (so there is nothing yet to
cling to), use the list of tags in allocation/cling_tag_list to
partition the PVs. We implement this by maintaining a list of
tags that have been "used up" as we proceed and ignoring further
devices that have a tag on the list.
https://bugzilla.redhat.com/983600
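One simplistic reading of the 'used up' bookkeeping (self-contained
illustration only, not the lvm2 data structures):

#include <string.h>

#define MAX_TAGS 64

static const char *_used_tags[MAX_TAGS];   /* tags already consumed */
static int _used_count;

/* Returns 1 if a PV carrying 'tag' may still be used for this
 * allocation; the first PV seen with each tag claims that partition,
 * and later PVs with the same tag are ignored. */
static int _claim_tag(const char *tag)
{
        int i;

        for (i = 0; i < _used_count; i++)
                if (!strcmp(_used_tags[i], tag))
                        return 0;          /* partition already taken */

        if (_used_count < MAX_TAGS)
                _used_tags[_used_count++] = tag;

        return 1;
}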
pv_write is called both to write orphans and to rewrite PV headers
of PVs in VGs. It needs to select the correct VG id so that the
internal cache state gets updated correctly.
It only affected commands that involved further steps after
the pv_write and was often masked because the metadata would
be re-read off disk and correct itself.
"Incorrect metadata area header checksum" warnings appeared.
Example:
Create vg1 containing dev1, dev2 and dev3.
Hide dev1 and dev2 from the system.
Fix up vg1 with vgreduce --removemissing.
Bring back dev1 and dev2.
In a single operation reinstate dev1 and dev2 into vg1 (vgextend).
Done as separate operations (automatically fixing up dev1 and dev2 as
orphans, then vgextend) it worked, but done all in one go the internal
cache got corrupted and warnings about checksum errors appeared.
This avoids a problem in which we're using selection on LV list - we
need to do the selection on initial state and not on any intermediary
state as we process LVs one by one - some of the relations among LVs
can be gone during this processing.
For example, processing one LV can cause other LVs to lose their
relation to this LV and hence they're not selectable anymore with
the original selection criteria, as they would be if we did the
selection on the initial state. A perfect example is with thin snapshots:
$ lvs -o lv_name,origin,layout,role vg
  LV    Origin Layout      Role
  lvol1        thin,sparse public,origin,thinorigin,multithinorigin
  lvol2 lvol1  thin,sparse public,snapshot,thinsnapshot
  lvol3 lvol1  thin,sparse public,snapshot,thinsnapshot
  pool         thin,pool   private
$ lvremove -ff -S 'lv_name=lvol1 || origin=lvol1'
Logical volume "lvol1" successfully removed
The lvremove command above was supposed to remove lvol1 as well as
all its snapshots which have origin=lvol1. It failed to do so, because
once we removed the origin lvol1, the lvol2 and lvol3 which were
snapshots before are not snapshots anymore - the relations change
as we're processing these LVs one by one.
If we do the selection first and then execute any concrete actions on
these LVs (which is what this patch does), the behaviour is correct
then - the selection is done on the *initial state*:
$ lvremove -ff -S 'lv_name=lvol1 || origin=lvol1'
Logical volume "lvol1" successfully removed
Logical volume "lvol2" successfully removed
Logical volume "lvol3" successfully removed
Similarly for all the other situations in which relations among
LVs are being changed by processing the LVs one by one.
This patch also introduces LV_REMOVED internal LV status flag
to mark removed LVs so they're not processed further when we
iterate over collected list of LVs to be processed.
Previously, when we iterated directly over vg->lvs list to
process the LVs, we relied on the fact that once the LV is removed,
it is also removed from the vg->lvs list we're iterating over.
But that was incorrect, as we shouldn't remove LVs from the list
during one iteration while we're iterating over that exact list
(dm_list_iterate_items_safe can handle only one removal per
iteration anyway, so it can't be used here).
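The mark-then-skip pattern avoids mutating the list mid-iteration; in
outline (a generic sketch rather than the lvm2 list API):

enum { LV_REMOVED = 1 };        /* illustrative flag value */

struct lv_sketch {
        unsigned status;
};

void process_lvs(struct lv_sketch **lvs, int count)
{
        int i;

        for (i = 0; i < count; i++) {
                if (lvs[i]->status & LV_REMOVED)
                        continue;   /* removed while processing another LV */

                /* ... process the LV; a removal only does
                 * status |= LV_REMOVED instead of unlinking ... */
        }
}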
The code never mixes reads of committed and precommitted metadata,
so there's no need to attempt to set PRECOMMITTED when
*use_previous_vg is being set.
Refactor the recent metadata-reading optimisation patches.
Remove the recently-added cache fields from struct labeller
and struct format_instance.
Instead, introduce struct lvmcache_vgsummary to wrap the VG information
that lvmcache holds and add the metadata size and checksum to it.
Allow this VG summary information to be looked up by metadata size +
checksum. Adjust the debug log messages to make it clear when this
shortcut has been successful.
(This changes the optimisation slightly, and might be extendable
further.)
Add struct cached_vg_fmtdata to format-specific vg_read calls to
preserve state alongside the VG across separate calls and indicate
if the details supplied match, avoiding the need to read and
process the VG metadata again.
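The summary wrapper plausibly carries at least the lookup keys named above
(a sketch; the real struct in lib/cache/lvmcache.h has more fields):

#include <stddef.h>
#include <stdint.h>

struct lvmcache_vgsummary {
        const char *vgname;
        char vgid[32];            /* illustrative; lvm2 uses struct id */
        uint64_t vgstatus;
        char *creation_host;
        uint32_t mda_checksum;    /* shortcut lookup key ... */
        size_t mda_size;          /* ... together with the size */
};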
Detect an lvm1 system id by looking at the WRITE_LOCKED flag.
Don't copy this lvm1 system id into vg->system_id so that the
restrictions associated with the new system id are not applied
to the old VG with the inherited lvm1 system id.
Use similar logic as with text_vg_import_fd() and avoid repeated
parsing of the same mda and its config tree in vgname_from_mda().
Remember the last parsed vgname, vgid and creation_host in the labeller
structure and, if the metadata have the same size and checksum,
return this stored info.
TODO: The reuse of labeller struct is not ideal, some lvmcache API for
this functionality would be nicer.
When reading a VG's mda from multiple PVs - do all the validation only
when the mda is seen for the first time; when the mda checksum and length
are the same, just return the already-existing VG pointer.
(i.e. using 300 PVs for a VG would otherwise create and destroy 300 config trees....)
Previous versions of lvm will not obey the restrictions
imposed by the new system_id, and would allow such a VG
to be written. So, a VG with a new system_id is further
changed to force previous lvm versions to treat it as
read-only. This is done by removing the WRITE flag from
the metadata status line of these VGs, and putting a new
WRITE_LOCKED flag in the flags line of the metadata.
Versions of lvm that recognize WRITE_LOCKED, also obey the
new system_id. For these lvm versions, WRITE_LOCKED is
identical to WRITE, and the rules associated with matching
system_id's are imposed.
A new VG lock_type field is also added that causes the same
WRITE/WRITE_LOCKED transformation when set. A previous
version of lvm will also see a VG with lock_type as read-only.
Versions of lvm that recognize WRITE_LOCKED, must also obey
the lock_type setting. Until the lock_type feature is added,
lvm will fail to read any VG with lock_type set and report an
error about an unsupported lock_type. Once the lock_type
feature is added, lvm will allow VGs with lock_type to be
used according to the rules imposed by the lock_type.
When both system_id and lock_type settings are removed, a VG
is written with the old WRITE status flag, and without the
new WRITE_LOCKED flag. This allows old versions of lvm to
use the VG as before.
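At export time the transformation is a simple flag swap, along these lines
(a sketch; flag names from the text above, bit values illustrative):

#include <stdint.h>

#define LVM_WRITE        0x00000100ULL   /* illustrative bit values */
#define LVM_WRITE_LOCKED 0x00100000ULL

/* When a VG carries a new system_id or a lock_type, export it without
 * WRITE in the status line and with WRITE_LOCKED in the flags line so
 * that old lvm versions treat it as read-only. */
void export_access_flags(int has_new_system_id, int has_lock_type,
                         uint64_t *status, uint64_t *flags)
{
        if ((has_new_system_id || has_lock_type) && (*status & LVM_WRITE)) {
                *status &= ~LVM_WRITE;
                *flags |= LVM_WRITE_LOCKED;
        }
}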
Set ACCESS_NEEDS_SYSTEM_ID VG status flag whenever there is
a non-lvm1 system_id set. Prevents concurrent access from
older LVM2 versions.
Not set on VGs that bear a system_id only due to conversion
from lvm1 metadata.