shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Dave Wysochanski	7042e06a2a	Make vg->mda_copies persistent in on disk vg metadata. This patch adds the ability to read/write the vg->mda_copies values from/to the vg metadata. If we read the VG metadata and this field does not exist, we set mda_copies to the default value of 0. Later in the code, we use this special '0' value to indicate a disable of metadata balancing. This should preserve existing LVM behavior and ensure metadata balancing can be turned off should the need arise. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:10 +00:00
Dave Wysochanski	69d1732334	Update _vg_read and _text_create_text_instance to use fid_add_mda[s]. When we are constructing the vg, we may need to adjust the list of metadata_areas if there are ignored mdas. At label read time, we do not read the metadata of ignored mdas, and as a result, they do not get placed on vg->fid->metadata_areas inside _text_create_text_instance since lvmcache does not have these areas attached to vginfo->infos. However, when we're checking the pvids inside _vg_read, after having read another metadata area from another PV, we do have the opportunity to update the metadata_area and metadata_areas_ignored lists based on the read metadata_area. We need accurate mda lists for the reporting functions that count the ignored mdas, as well as general correctness of mda balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:17 +00:00
Dave Wysochanski	e6bd367b57	Implement ignore of mda if bit set by skipping r/w of metadata. We implement ignore of an mda at label_read time by checking for the ignore bit, and then skipping the reading of the vgname and other information in the metadata. This will have an effect similar to a PV found with no mdas. Thus, it will look like an orphan in the cache until we scan the rest of the system and find a PV with metadata, and the mda will not be on the vg->fid->metadata_areas list so no read/writes will be done to the metadata area. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:24 +00:00
Dave Wysochanski	9ccac021a7	Add metadata_areas_ignored list and functions to manage ignored mdas. Add a second mda list, metadata_areas_ignored to fid, and a couple functions, fid_add_mda() and fid_add_mdas() to help manage the lists. These functions are needed to properly count the ignored mdas and manage the lists attached to the 'fid' and ultimately the 'vg'. Ensure metadata_areas_ignored is initialized in other formats, even if the list is never used. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:22 +00:00
Dave Wysochanski	f55a20eb36	Rename fid->metadata_areas to fid->metadata_areas_in_use. Rename the metadata_areas list to an 'in_use' list to prepare for future 'ignored' list.	2010-06-28 20:32:44 +00:00
Dave Wysochanski	ef4fa155a5	Add mda location specific mda_copy constructor. Because of the way mdas are handled internally, where a PV in a VG has mdas on both info->mdas and vg->fid->metadata_areas list, we need a location independent copy constructor for struct metadata_area. Break up the existing format-text specific copy constructor into a format independent piece and a format dependent piece. This function is necessary to properly implement pv_set_mda_ignored(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:59 +00:00
Dave Wysochanski	29f24d4634	Add mda_locns_match() internal library function for mapping pv/device to VG mda. A metadata_area is defined independent of the location. One downside is that there is no obvious mapping from a pv to an mda. For a PV in a VG, we need a way to start with a PV and end up with an MDA, if we are to manage mdas starting with a device/pv. This function provides us a way to go down the list of PVs on a VG, and identify which ones match a particular PV. I'm not entirely happy with this approach, but it does fit into the existing structures in a reasonable way. An alternative solution might be to refactor the VG - PV interface such that mdas are a list tied to a PV. However, this seemed a bit tricky since a PV does not come into existence until after the list of mdas is constructed (see _vg_read() - we create a 'fid' and attach mdas to it, then we go through them and attach pvs). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:38 +00:00
Dave Wysochanski	a6b36a5901	Ensure in-memory state matches on-disk state of mda ignore bit. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:18 +00:00
Dave Wysochanski	09e0f43ba0	Allow raw_read_mda_header to be called from text_label.c. We'd like to pass in mda_header to vgname_from_mda(). In order to do this, we need to call raw_read_mda_header() from text_label.c, _text_read(), which gets called from the label_read() path, and peers into the metadata and update vginfo cache. We should check the disable bit here, and if set, not peer into the vg metadata, thus reducing the I/O to disk. In the process, move vgname_from_mda() to layout.h, since the fn only gets called from format_text code, and we need the mda_header definition from the private layout.h. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:01 +00:00
Dave Wysochanski	da0b4d8770	Move dev_open/dev_close outside vgname_from_mda(). Refactor vgname_from_mda() so caller must open/close the device. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:46 +00:00
Dave Wysochanski	96597c2eab	Move dev_open / dev_close outside _vg_read_raw_area(). This refactoring moves the device open/close up one level to the caller of _vg_read_raw_area(). Should be no functional change and facilitate future changes related to metadata balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:30 +00:00
Dave Wysochanski	322c5868b3	Add location independent flag and functions to ignore mdas. First we add a 'flags' field to the location independent metadata_area structure, and a MDA_IGNORE flag. The mda_is_ignored and mda_set_ignored functions are added to manage the flag. Adding the flag and functions gives a library interface to ignore metadata areas independent of the underlying location (disk, file, etc). The location specific read/write functions must then handle the specifics of what this flag means to the location. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:30:14 +00:00
Dave Wysochanski	d144d5eeb7	Add text format specific 'rlocn' ignore flag and access functions. Adding a flag to the 'rlocn' structure in the mda header of the text format allows us to flip a bit to ignore an area on disk that stores the metadata via the text format specific mda_header. This patch defines the flag and access functions to manage the flag. Other patches will manage the ignore on a format-independent basis, by using a flag in the metadata_area structure. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:57 +00:00
Dave Wysochanski	7c604e7649	Change 'filler' to 'flags' in on-disk 'raw_locn' structure. Future patches will make use of a specific flag in the on-disk 'raw_locn' structure to enable/disable metadata areas, and facilitate metadata balancing. Note that 'filler' is always set to '0' (see add_mda() - memset), so use of this area as a non-zero flags field is a safe way to provide future code features. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:42 +00:00
Dave Wysochanski	58f55600d0	Add device name to output of error messages in raw_read_mda_header(). It would be helpful if we had the device name when something like a mda_header checksum error occurs. Before: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0 After: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum on /dev/loop2 at offset 4096 PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0	2010-06-22 19:18:27 +00:00
Peter Rajnoha	03023d3965	Fix incorrect memory pool deallocation while using vg_read for files. We create a separate pool "lvm2 vg_read" for vg_read and we don't use cmd->mem anymore.	2010-06-01 12:08:50 +00:00
Zdenek Kabelac	8fea97b7e7	Replicator: base lvm2 support Adding configure.in support for Replicators. Adding basic lib lvm support for Replicators. Adding flags REPLICATOR and REPLICATOR_LOG. Adding segments SEG_REPLICATOR and SEG_REPLICATOR_DEV. Adding basic methods for handling replicator metadata.	2010-05-21 12:36:30 +00:00
Petr Rockai	9409998d71	Suppress duplicate error messages about read failures and missing devices.	2010-05-05 22:37:52 +00:00
Peter Rajnoha	1e696b0c15	Do not reset position in metadata ring buffer on vgrename and vgcfgrestore. We should write metadata into next position in the ring buffer while calling vgrename and vgcfgrestore. At this code level (_vg_write_raw), we were not able to determine if this is a rename or not. If yes, then accompanying VG structure passed here has a new name set, not the old one. When looking for a location where to put metadata next, we were given a NULL value because of failed VG name comparison (in _find_vg_rlocn) between the name in existing metadata and metadata we're just about to write. This resets the position in the ring buffer, overwriting any existing metadata (and also incorrectly updates the cache to "orphan" afterwards). This patch just adds old_name item in struct volume_group that we can check and use if necessary and detect renames at lower layers as well. The same applies for vgcfgrestore, but here we're using a special value of old_name, an empty string, to disable the check with existing metadata totally.	2010-04-14 13:09:16 +00:00
Alasdair Kergon	aab7a3978b	Fix pvmove allocation to take existing parallel stripes into account. When moving parts of striped LVs, pvmove wouldn't care about leaving you with two stripes on the same disk. Now --alloc anywhere is needed for that. (Tried and gave up on two alternative approaches before the one committed here.)	2010-04-08 00:28:57 +00:00
Dave Wysochanski	9e82787da2	Add add_pvl_to_vgs() - helper function to add a pv to a vg list. Small refactor of main places in the code where a pv is added to a vg into a small function which adds the pv to the list and updates the vg counts. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:54 +00:00
Dave Wysochanski	36e9d03d1b	Refactor _read_pv() code that updates vg->extent_count and vg->free_count. Simple refactor to move code that updates the vg extent counts from a single pv's counts close to the code that adds a pv to vg->pvs and updates vg->pv_count. No functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:03 +00:00
Alasdair Kergon	258db3ad8e	Change most remaining log_error WARNING messages to log_warn.	2010-04-01 10:34:09 +00:00
Milan Broz	6733116a19	Fix all segments memory is allocated from vg private mempool. Physical segments were still allocated from global command context mempool. This leads to very high memory usage when activating large VG (vgchange). (Memory usage was about 2G when >3000LVs). Fix it by properly using vg->vgmem private pool, so all the memory is released early. New memory pool parameter is needed here for pv_split_segment function. Also fix the same problem in some minor allocations (vg description, lv segment split).	2010-03-31 17:23:18 +00:00
Milan Broz	d59a2b6109	Use hash table for quick lv reference when reading metadata. The _read_vg already uses a hash for PVs to optimise reading of large VGs and avoid repeated PV list traversal. Use the same approach to speed up parsing VGs with many LVs.	2010-03-31 17:20:02 +00:00
Alasdair Kergon	0a5182fc97	Suppress repeated errors about the same missing PV uuids. Bypass full device scans when using internally-cached VG metadata.	2010-03-17 02:11:18 +00:00
Alasdair Kergon	b1f9a2f5d1	Only do one full device scan during each read of text format metadata.	2010-03-16 17:30:00 +00:00
Alasdair Kergon	c97cbf8c08	pre-release	2010-02-15 23:53:15 +00:00
Dave Wysochanski	8caf272a93	Add copy constructor for struct metadata_area. Clean up cut&paste code with proper copy constructor.	2010-02-02 16:26:34 +00:00
Mike Snitzer	c52678ee9b	Rename segment and lv status flag from SNAPSHOT_MERGE to MERGING. Eliminate 'merging_snapshot' from 'struct logical_volume' and just use 'snapshot' for origin lv's reference to the merging snapshot; also set MERGING in the origin lv's status.	2010-01-13 01:56:18 +00:00
Mike Snitzer	68e8f5a4a2	Add 'SNAPSHOT_MERGE' lv_segment 'status' flag. Make 'merging_snapshot' pointer that points from the origin to the segment that represents the merging snapshot. Import/export 'merging_store' metadata. Do not allow creating snapshots while another snapshot is merging. Snapshot created in this state would certainly contain invalid data. NOTE: patches at the end of this series will remove 'merging_snapshot' and will introduce helpful wrappers and cleanups.	2010-01-13 01:35:49 +00:00
Zdenek Kabelac	387f47078c	Add few const modifiers.	2010-01-07 14:47:57 +00:00
Zdenek Kabelac	ea8acabe26	Export function out_text_with_comment() and add outfc() macro that checks for error.	2010-01-07 14:45:28 +00:00
Zdenek Kabelac	1e13fa7a6a	Add macros outsize() for out_size() and outhint() for out_hint() that check for errors in a similar way as outf() for out_text().	2010-01-07 14:40:46 +00:00
Petr Rockai	550cae2340	#define an INTERNAL_ERROR macro and use it throughout LVM.	2009-12-16 19:22:11 +00:00
Zdenek Kabelac	b1ebf028de	Cleanup returns for void functions.	2009-12-11 13:16:37 +00:00
Dave Wysochanski	59baeb838c	Update a few more uint64_t's related to the 64-bit status change. At this point they probably do not matter but going forward they may - depends on future patches for replicator, etc. I think these probably got missed because they were 'flags' so I changed the name to 'status' to be consistent. So the on-disk things 'flags' and the in structure 'status' (bits). NOTE: WHATS_NEW already has entry for this in current release. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Acked-by: Mike Snitzer <snitzer@redhat.com>	2009-12-04 17:48:32 +00:00
Milan Broz	fec4de9563	Fix tools to report error when stopped by user. (And do not produce internal error message.)	2009-12-03 19:18:33 +00:00
Mike Snitzer	a2552d4f59	Switch status from 32-bit to 64-bit. The physical_volume, volume_group, logical_volume and lv_segment structures' 'status' member is now uint64_t. The alignment of these structures was also audited to remove holes. The movement of some members in 'volume_group' and 'lv_segment' eliminates holes. The 'physical_volume' structure still has one 4-byte hole after 'pe_size'; the other structures no longer have any holes. Each structure's size has not changed.	2009-11-24 22:55:55 +00:00
Zdenek Kabelac	7fb52b9c39	Export functions out_inc_indent(), out_dec_indent() for creating indented metadata lines. Macro outnl() is using exported out_newline() instead of direct call f->fn(), that required the visibility of the internal struct formatter.	2009-11-03 11:00:46 +00:00
Petr Rockai	b4048242f5	Handle metadata with unknown segment types more gracefully.	2009-10-16 17:41:49 +00:00
Alasdair Kergon	d557773841	Consolidate LV allocation into alloc_lv().	2009-09-28 17:46:15 +00:00
Dave Wysochanski	905240f91d	Use vg_is_exported(vg) macro everywhere. This patch is all just cleanup and no other patch depends on it. Replace explicit dereference and check with vg_is_exported(). Update a few copyrights and remove unnecessary whitespace. Should be no functional change.	2009-09-14 19:44:15 +00:00
Mike Snitzer	57b660356e	Add devices/data_alignment_offset_detection to lvm.conf. If the pvcreate --dataalignmentoffset option is not specified the start of a PV's aligned data area will be shifted by the associated 'alignment_offset' exposed in sysfs (unless devices/data_alignment_offset_detection is disabled in lvm.conf). Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2009-08-01 17:07:36 +00:00
Mike Snitzer	9607eba5c2	Fix compile warnings from recently added log_very_verbose() in _text_pv_write()	2009-07-31 14:23:06 +00:00
Mike Snitzer	377b6a5843	Disable the "new pe_start policy" Documented which use-cases force the reinstatement of the nuanced handling of pe_start. As soon as orphan PVs are eliminated much of this will no longer be a concern ('preserve_pe_start' can be reenabled in .pv_setup). Added defensive 'if (pv->pe_align)' check in _text_pv_write()'s pe_start loop.	2009-07-30 21:15:17 +00:00
Mike Snitzer	733bd656b2	Revert 'preserve_pe_start' related code in _text_pv_setup If pv_setup was given a non-zero pe_start it would short-circuit establishing a default pv->pe_align. pv->pe_align=0 would result in a divide by zero in _mda_setup(). 'vgconvert -M2 $vgname' hit this. .pv_write still properly preserves pe_start if it was supplied.	2009-07-30 18:40:22 +00:00
Mike Snitzer	04b2a4bdcf	Add --dataalignmentoffset to pvcreate to shift start of aligned data area Adds pe_align_offset to 'struct physical_volume'; is initialized with set_pe_align_offset(). After pe_start is established pe_align_offset is added to it. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2009-07-30 17:45:28 +00:00
Mike Snitzer	d01a37f597	Preserve pe_start in .pv_setup and .pv_write if pe_start was supplied. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2009-07-30 17:42:33 +00:00
Mike Snitzer	c8a4e489c1	Remove legacy support for preserving pe_start if a PV already has data areas. This preserved pe_start would quickly be readjusted to follow the first mda anyway. An example use-case that hit this code path is: running pvcreate on an already existing PV _without_ a preceding pvremove. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2009-07-30 17:41:01 +00:00
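
Commits 7c604e7649 and d144d5eeb7 above describe renaming the always-zero 'filler' field of the on-disk 'raw_locn' structure to 'flags' and using one bit of it to mark a metadata area as ignored. The following is a minimal sketch of that idea, not the code from those commits. The struct layout, the `sketch_` names, and the bit value are all assumptions made for illustration. It shows why a field that was always written as zero can be reused safely: every record written before the change reads back as "not ignored".

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for the on-disk 'raw_locn' record.
 * The field layout here is assumed, not taken from the lvm2 source. */
struct sketch_raw_locn {
	uint64_t offset;
	uint64_t size;
	uint32_t checksum;
	uint32_t flags;		/* formerly 'filler', always written as 0 */
};

/* Hypothetical flag bit: the area should be skipped on read/write. */
#define SKETCH_LOCN_IGNORED 0x00000001U

/* Zero the whole record, as the commit message says add_mda() does
 * with memset; this leaves the flags field at 0 (not ignored). */
static void sketch_locn_init(struct sketch_raw_locn *locn)
{
	assert(locn != NULL);
	memset(locn, 0, sizeof(*locn));
}

/* Set or clear the ignore bit without disturbing other flag bits. */
static void sketch_locn_set_ignored(struct sketch_raw_locn *locn, int ignored)
{
	assert(locn != NULL);
	if (ignored)
		locn->flags |= SKETCH_LOCN_IGNORED;
	else
		locn->flags &= ~SKETCH_LOCN_IGNORED;
}

/* Return 1 if the ignore bit is set, 0 otherwise. */
static int sketch_locn_is_ignored(const struct sketch_raw_locn *locn)
{
	assert(locn != NULL);
	return (locn->flags & SKETCH_LOCN_IGNORED) != 0;
}
```

A reader of the label would call something like `sketch_locn_is_ignored()` before parsing the VG name out of the area, which corresponds to the I/O saving that commits 09e0f43ba0 and e6bd367b57 describe.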

294 Commits