shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Petr Rockai	a368698672	lvmetad: Hide corrupt MDAs from the cache. This is probably not optimal, but makes the lvmetad case mimic non-lvmetad code more closely. It also fixes vgremove of a partially corrupt VG with lvmetad, as _vg_write_raw (and consequently, entire vg_write) currently panics when it encounters a corrupt MDA. Ideally, we'd be able to explicitly control when it is safe to ignore them.	2014-02-28 11:23:52 +01:00
Peter Rajnoha	08116a4962	cleanup: missing header file	2014-02-20 09:07:38 +01:00
Petr Rockai	b391ae88e5	format-text: Avoid a label_scan while in a critical_section().	2014-02-19 17:43:30 +01:00
Jonathan Brassow	97be8b3482	cache: Code changes to allow creation of cache pools This patch allows the creation and removal of cache pools. Users are not yet able to create cache LVs. They are only able to define the space used for the cache and its characteristics (chunk_size and cache mode ATM) by creating the cache pool.	2014-02-04 11:57:08 -06:00
Alasdair G Kergon	4aa8a14fc2	compilation: Rename tags variables to tagsl.	2014-01-30 21:09:28 +00:00
Alasdair G Kergon	5eee73bd7c	pvresize: Fix orphan PV size calculation. The size of any metadata must be ignored when calculating the size of an orphan PV. Bug introduced by `603b45e0ed` ("pvresize: Do not use pv_read (get the PV from orphan VG).")	2014-01-17 01:12:04 +00:00
Alasdair G Kergon	ebac2ed5be	pvresize: Avoid archiving orphan VG metadata. Block creations of archive and backup files for internal orphan VGs. Bug introduced by `603b45e0ed` ("pvresize: Do not use pv_read (get the PV from orphan VG).")	2014-01-16 23:02:59 +00:00
Peter Rajnoha	d443bfac21	config: fix metadata/disk_areas config setting registration The metadata/disk_areas setting was incorrectly registered as "string" configuration option but it's a section where each area is defined in its own subsection with "start_sector", "size" and "id" setting. This setting is not officialy supported, it's undocumented and it's used solely for debugging. Note: At this moment, it does not seem to be working with lvmetad!	2013-12-13 16:52:51 +01:00
Zdenek Kabelac	30a81e5989	cleanup: self compilable headers	2013-12-12 13:28:19 +01:00
Zdenek Kabelac	01c438a96c	format-text: ensure aligment is not 0 Make sure this path of code is not used for alignment == 0, to prevent division by 0.	2013-11-28 12:42:39 +01:00
Zdenek Kabelac	782a356e7c	archiver: add check for dm_pool_strdup It will likely not fail to duplicate empty string, but just keep the test of result of this function consistent. Also on error path restore extent_size if in some case someone would still use that variable.	2013-11-22 21:00:54 +01:00
Zdenek Kabelac	3d3b8bfd1c	pv_write: check for lvmcache_add_mda failure Add missing test of failing lvmcache_add_mda() call.	2013-11-22 20:55:09 +01:00
Petr Rockai	9b91977f4e	labeller: Make the use of "private" as "fmt" explicit. All labellers always use the "private" (void *) field as the fmt pointer. Making this fact explicit in the type of the labeller simplifies the label reporting code which needs to extract the format. Moreover, it removes a number of error-prone casts from the code.	2013-11-17 21:41:27 +01:00
Peter Rajnoha	039bdad732	activation: flag temporary LVs internally Add LV_TEMPORARY flag for LVs with limited existence during command execution. Such LVs are temporary in way that they need to be activated, some action done and then removed immediately. Such LVs are just like any normal LV - the only difference is that they are removed during LVM command execution. This is also the case for LVs representing future pool metadata spare LVs which we need to initialize by using the usual LV before they are declared as pool metadata spare. We can optimize some other parts like udev to do a better job if it knows that the LV is temporary and any processing on it is just useless. This flag is orthogonal to LV_NOSCAN flag introduced recently as LV_NOSCAN flag is primarily used to mark an LV for the scanning to be avoided before the zeroing of the device happens. The LV_TEMPORARY flag makes a difference between a full-fledged LV visible in the system and the LV just used as a temporary overlay for some action that needs to be done on underlying PVs. For example: lvcreate --thinpool POOL --zero n -L 1G vg - first, the usual LV is created to do a clean up for pool metadata spare. The LV is activated, zeroed, deactivated. - between "activated" and "zeroed" stage, the LV_NOSCAN flag is used to avoid any scanning in udev - betwen "zeroed" and "deactivated" stage, we need to avoid the WATCH udev rule, but since the LV is just a usual LV, we can't make a difference. The LV_TEMPORARY internal LV flag helps here. If we create the LV with this flag, the DM_UDEV_DISABLE_DISK_RULES and DM_UDEV_DISABLE_OTHER_RULES flag are set (just like as it is with "invisible" and non-top-level LVs) - udev is directed to skip WATCH rule use. - if the LV_TEMPORARY flag was not used, there would normally be a WATCH event generated once the LV is closed after "zeroed" stage. This will make problems with immediated deactivation that follows.	2013-10-23 14:09:37 +02:00
Peter Rajnoha	6b35c70e8b	metadata: add INTERNAL_ERROR to "Metadata inconsistency" msg So we can spot it better if it occurs.	2013-10-10 13:34:43 +02:00
Peter Rajnoha	029b8fbe76	metadata: properly register LV_NOSCAN flag Addendum to commit `ce7489e` which introduced a new internal LV_NOSCAN flag and so it needs to be marked that way properly otherwise it ends up unrecognized and improperly handled during metadata export.	2013-10-10 13:24:32 +02:00
Alasdair G Kergon	c8057aec36	release 2.02.102 18 files changed, 137 insertions(+), 203 deletions(-)	2013-09-23 15:43:37 +01:00
Petr Rockai	3df50d822b	vgconvert: Do not call lvmetad_vg_remove (path shared with vgcfgbackup).	2013-09-18 12:53:11 +02:00
Petr Rockai	054cf25b5f	vgcfgrestore: Remove VG rom lvmetad later, to better deal with errors.	2013-09-18 11:24:58 +02:00
Peter Rajnoha	34d207d9b3	lvmetad: fix mda offset/size overflow if >= 4g (32bit) When reading an info about MDAs from lvmetad, we need to use 64 bit int to read the value of the offset/size, otherwise the value is overflows and then it's used throughout! This is dangerous if we're trying to write such metadata area then, mostly visible if we're using 2 mdas where the 2nd one is at the end of the underlying device and hence the value of the mda offset is high enough to cause problems: (the offset trimmed to value of 0 instead of 4096m, so we write at the very start of the disk (or elsewhere if the offset has some other value!) [1] raw/~ # lvcreate -s -l 100%FREE vg --virtualsize 4097m Logical volume "lvol0" created [1] raw/~ # pvcreate --metadatacopies 2 /dev/vg/lvol0 Physical volume "/dev/vg/lvol0" successfully created [1] raw/~ # hexdump -n 512 /dev/vg/lvol0 0000000 0000 0000 0000 0000 0000 0000 0000 0000 * 0000200 [1] raw/~ # pvchange -u /dev/vg/lvol0 Physical volume "/dev/vg/lvol0" changed 1 physical volume changed / 0 physical volumes not changed [1] raw/~ # hexdump -n 512 /dev/vg/lvol0 0000000 d43e d2a5 4c20 4d56 2032 5b78 4135 7225 0000010 4e30 3e2a 0001 0000 0000 0000 0000 0000 0000020 0000 0010 0000 0000 0000 0000 0000 0000 0000030 0000 0000 0000 0000 0000 0000 0000 0000 * 0000200 ======= (the offset overflows to undefined values which is far behind the end of the disk) [1] raw/~ # lvcreate -s -l 100%FREE vg --virtualsize 100g Logical volume "lvol0" created [1] raw/~ # pvcreate --metadatacopies 2 /dev/vg/lvol0 Physical volume "/dev/vg/lvol0" successfully created [1] raw/~ # pvchange -u /dev/vg/lvol0 /dev/vg/lvol0: lseek 18446744073708503040 failed: Invalid argument /dev/vg/lvol0: lseek 18446744073708503040 failed: Invalid argument Failed to store physical volume "/dev/vg/lvol0" 0 physical volumes changed / 1 physical volume not changed	2013-08-06 13:37:42 +02:00
Zdenek Kabelac	460d0254eb	thin: add pool metadata spare lv support Add support for pool's metadata spare volume.	2013-07-18 18:22:43 +02:00
Zdenek Kabelac	20187fc190	cleanup: use dm_list_empty Check for empty list directly.	2013-07-18 18:22:42 +02:00
Peter Rajnoha	7dc8c84b18	activation: add support for skipping activation of selected LVs Also add -k/--setactivationskip y/n and -K/--ignoreactivationskip options to lvcreate. The --setactivationskip y sets the flag in metadata for an LV to skip the LV during activation. Also, the newly created LV is not activated. Thin snapsots have this flag set automatically if not specified directly by the --setactivationskip y/n option. The --ignoreactivationskip overrides the activation skip flag set in metadata for an LV (just for the run of the command - the flag is not changed in metadata!) A few examples for the lvcreate with the new options: (non-thin snap LV => skip flag not set in MDA + LV activated) raw/~ $ lvcreate -l1 vg Logical volume "lvol0" created raw/~ $ lvs -o lv_name,attr vg/lvol0 LV Attr lvol0 -wi-a---- (non-thin snap LV + -ky => skip flag set in MDA + LV not activated) raw/~ $ lvcreate -l1 -ky vg Logical volume "lvol1" created raw/~ $ lvs -o lv_name,attr vg/lvol1 LV Attr lvol1 -wi------ (non-thin snap LV + -ky + -K => skip flag set in MDA + LV activated) raw/~ $ lvcreate -l1 -ky -K vg Logical volume "lvol2" created raw/~ $ lvs -o lv_name,attr vg/lvol2 LV Attr lvol2 -wi-a---- (thin snap LV => skip flag set in MDA (default behaviour) + LV not activated) raw/~ $ lvcreate -L100M -T vg/pool -V 1T -n thin_lv Logical volume "thin_lv" created raw/~ $ lvcreate -s vg/thin_lv -n thin_snap Logical volume "thin_snap" created raw/~ $ lvs -o name,attr vg LV Attr pool twi-a-tz- thin_lv Vwi-a-tz- thin_snap Vwi---tz- (thin snap LV + -K => skip flag set in MDA (default behaviour) + LV activated) raw/~ $ lvcreate -s vg/thin_lv -n thin_snap -K Logical volume "thin_snap" created raw/~ $ lvs -o name,attr vg/thin_lv LV Attr thin_lv Vwi-a-tz- (thins snap LV + -kn => no skip flag in MDA (default behaviour overridden) + LV activated) [0] raw/~ # lvcreate -s vg/thin_lv -n thin_snap -kn Logical volume "thin_snap" created [0] raw/~ # lvs -o name,attr vg/thin_snap LV Attr thin_snap Vwi-a-tz-	2013-07-12 20:39:07 +02:00
Peter Rajnoha	e21e38cf74	metadata: add support for storing profile name in metadata (during vgcreate/lvcreate) If "vgcreate/lvcreate --profile <profile_name>" is used, the profile name is automatically stored in metadata for making it possible to load it automatically next time the VG/LV is used.	2013-07-02 15:19:09 +02:00
Peter Rajnoha	50bf2c0db1	config: add profile arg to find_config_tree_int	2013-07-02 15:19:09 +02:00
Peter Rajnoha	eeb7b0f7fa	config: add profile arg to find_config_tree_node	2013-07-02 15:19:09 +02:00
Peter Rajnoha	c5e6bc393e	metadata: read VG/LV profile name from metadata if it exists and load it This is per VG/LV profile loading on demand. The profile itself is saved in struct volume_group/logical_volume as "profile" field so we can reference it whenever needed.	2013-07-02 15:19:09 +02:00
Peter Rajnoha	da3ea66a96	config: add config_source_t type to identify configuration source A helper type that helps with identification of the configuration source which makes handling the configuration cascade a bit easier, mainly removing and adding configuration trees to cascade dynamically. Currently, the possible types are: CONFIG_UNDEFINED - configuration is not defined yet (not initialized) CONFIG_FILE - one file configuration CONFIG_MERGED_FILES - configuration that is a result of merging more files into one CONFIG_STRING - configuration string typed on cmd line directly CONFIG_PROFILE - profile configuration (the new type of configuration, patches will follow...) Also, generalize existing "remove_overridden_config_tree" to work with configuration type identification in a cascade. Before, it was just the CONFIG_STRING we used. Now, we need some more to add in a cascade (like the CONFIG_PROFILE). So, we have: struct dm_config_tree remove_config_tree_by_source(struct cmd_context cmd, config_source_t source); config_source_t config_get_source_type(struct dm_config_tree *cft); ... for removing the tree by its source type from the cascade and simply getting the source type.	2013-07-02 15:19:08 +02:00
Zdenek Kabelac	b31725d0ae	archive: add missing bit set In the last update not all code paths have set the archived flag. If we run in test mode or without archiving enabled - set the bit as well - so test whether archiving has been called succesfully will be ok. (in relase fix).	2013-07-02 11:07:15 +02:00
Zdenek Kabelac	e30028004b	archiver: do not archive vg more then once Do not keep multiple archives for the executed command. Reuse the ALLOCATABLE_PV from pv status for ARCHIVED_VG vg status. Mark VG with the bit with the first archivation.	2013-07-01 23:09:26 +02:00
Peter Rajnoha	0ca1688134	metadata: log_debug only when BA found in metadata ...not the other way round as it was before. This way it makes more sense as BA use is exceptional and it's useless to contaminate the log with messages about BA not being found in metadata.	2013-06-27 16:03:35 +02:00
Peter Rajnoha	6de45db5b5	cleanup: clear outdated comment (TODO already done)	2013-06-27 15:26:39 +02:00
Zdenek Kabelac	2562968864	vgcfgrestore: fix crash on restore of wrong vgname When vgname has not existed in metadata, it has crashed on double free in format_instance destroy() - since VG was created, used FID and was released - which also released FID, so further use was accessing bad memory. Fix it for this code path before release_vg() so FID will exists when _vg_read_file_name() returns NULL.	2013-06-18 22:11:21 +02:00
Petr Rockai	c1e851e208	Move export_vg_to_config_tree alongside export_vg_to_buffer.	2013-06-10 15:55:55 +02:00
Peter Rajnoha	732859d21f	refactor: rename embedding area -> bootloader area	2013-05-28 12:37:22 +02:00
Jonathan Brassow	2e0740f7ef	RAID: Add writemostly/writebehind support for RAID1 'lvchange' is used to alter a RAID 1 logical volume's write-mostly and write-behind characteristics. The '--writemostly' parameter takes a PV as an argument with an optional trailing character to specify whether to set ('y'), unset ('n'), or toggle ('t') the value. If no trailing character is given, it will set the flag. Synopsis: lvchange [--writemostly <PV>:{t\|y\|n}] [--writebehind <count>] vg/lv Example: lvchange --writemostly /dev/sdb1:y --writebehind 512 vg/raid1_lv The last character in the 'lv_attr' field is used to show whether a device has the WriteMostly flag set. It is signified with a 'w'. If the device has failed, the 'p'artial flag has priority. Example ("nosync" raid1 with mismatch_cnt and writemostly): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg Rwi---r-m 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-w 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-- 1 linear 4.00m Example (raid1 with mismatch_cnt, writemostly - but failed drive): [~]# lvs -a --segment vg LV VG Attr #Str Type SSize raid1 vg rwi---r-p 2 raid1 500.00m [raid1_rimage_0] vg Iwi---r-- 1 linear 500.00m [raid1_rimage_1] vg Iwi---r-p 1 linear 500.00m [raid1_rmeta_0] vg ewi---r-- 1 linear 4.00m [raid1_rmeta_1] vg ewi---r-p 1 linear 4.00m A new reportable field has been added for writebehind as well. If write-behind has not been set or the LV is not RAID1, the field will be blank. Example (writebehind is set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- 512 [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor-- Example (writebehind is not set): [~]# lvs -a -o name,attr,writebehind vg LV Attr WBehind lv rwi-a-r-- [lv_rimage_0] iwi-aor-w [lv_rimage_1] iwi-aor-- [lv_rmeta_0] ewi-aor-- [lv_rmeta_1] ewi-aor--	2013-04-15 13:59:46 -05:00
Peter Rajnoha	386886f71c	config: refer to config nodes using assigned IDs For example, the old call and reference: find_config_tree_str(cmd, "devices/dir", DEFAULT_DEV_DIR) ...now becomes: find_config_tree_str(cmd, devices_dir_CFG) So we're referring to the named configuration ID instead of passing the configuration path and the default value is taken from central config definition in config_settings.h automatically.	2013-03-06 10:14:33 +01:00
Peter Rajnoha	a9d0e25627	cleanup: remove struct pv_header_extension reference from struct pv_header Just to prevent accidental and improper use when reading the layout from disk because of the already existing disk_areas_xl[0] lists that are variable in size. We can read pv_header_extension only after we know exactly where the lists end...	2013-02-27 10:47:24 +01:00
Peter Rajnoha	b778653f03	pv_header_extension: add support for writing PV header extension (flags & Embedding Area) The PV header extension information (PV header extension version, flags and list of Embedding Area locations) is stored just beyond the PV header base. When calculating the Embedding Area start value (ea_start), the same logic is used as when calculating the pe_start value for Data Area - the value must follow exactly the same alignment restrictions for its start value (the alignment detected automatically or provided via command line using the --dataalignment and --dataalignmentoffset arguments). The Embedding Area is placed at the very start of the PV, starting at ea_start. The Data Area starting at pe_start is placed next. The pe_start is still properly aligned. Due to the pe_start alignment, it's possible that the resulting Embedding Area size (ea_size) ends up bigger in size than requested (but never less than requested).	2013-02-26 11:28:00 +01:00
Peter Rajnoha	9dbe25709e	pv_header_extension: add support for reading PV header extension (flags & Embedding Area) New tools with PV header extension support will read the extension if it exists and it's not an error if it does not exist (so old PVs will still work seamlessly with new tools). Old tools without PV header extension support will just ignore any extension. As for the Embedding Area location information (its start and size), there are actually two places where this is stored: - PV header extension - VG metadata The VG metadata contains a copy of what's written in the PV header extension about the Embedding Area location (NULL value is not copied): physical_volumes { pv0 { id = "AkSSRf-difg-fCCZ-NjAN-qP49-1zzg-S0Fd4T" device = "/dev/sda" # Hint only status = ["ALLOCATABLE"] flags = [] dev_size = 262144 # 128 Megabytes pe_start = 67584 pe_count = 23 # 92 Megabytes ea_start = 2048 ea_size = 65536 # 32 Megabytes } } The new metadata fields are "ea_start" and "ea_size". This is mostly useful when restoring the PV by using existing metadata backups (e.g. pvcreate --restorefile ...). New tools does not require these two fields to exist in VG metadata, they're not compulsory. Therefore, reading old VG metadata which doesn't contain any Embedding Area information will not end up with any kind of error but only a debug message that the ea_start and ea_size values were not found. Old tools just ignore these extra fields in VG metadata.	2013-02-26 11:27:23 +01:00
Peter Rajnoha	60c5d4c42f	pv_header_extension: add supporting infrastructure for PV header extension (flags & Embedding Area) PV header extension comes just beyond the existing PV header base: PV header base (existing): - uuid - device size - null-terminated list of Data Areas - null-terminater list of MetaData Areas PV header extension: - extension version - flags - null-terminated list of Embedding Areas This patch also adds "eas" (Embedding Areas) list to lvmcache (lvmcache_info) and it also adds support for common operations on the list (just like for already existing "das" - Data Areas list): - lvmcache_add_ea - lvmcache_update_eas - lvmcache_foreach_ea - lvmcache_del_eas Also, add ea_start and ea_size to struct physical_volume for processing PV Embedding Area location throughout the code (currently only one Embedding Area is supported, though the definition on disk allows for more if needed in the future...). Also, define FMT_EAS format flag to mark that the format actually supports Embedding Areas (currently format-text only).	2013-02-26 11:25:16 +01:00
Peter Rajnoha	6d8de3638c	cleanup: use struct pvcreate_restorable_params throughout	2013-02-26 11:25:11 +01:00
Zdenek Kabelac	87331dc419	thin: add support for external origin Add internal support for thin volume's external origin.	2013-02-23 10:36:58 +01:00
Peter Rajnoha	303e86adc8	pvcreate: fix alignment to incorporate alignment offset if PV has 0 MDAs If zero metadata copies are used, there's no further recalculation of PV alignment that happens when adding metadata areas to the PV and which actually calculates the alignment correctly as a matter of fact. So fix this for "PV without MDA" case as well. Before this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 8.00m After this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in: if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align) pv->pe_start = pv->pe_align ... This part of the condition is not reachable as with the PV_PE_START_CALC, we always have pv->pe_start set to 0 from the PV struct initialisation (...the pv->pe_start value is just being calculated).	2013-02-21 14:51:19 +01:00
Peter Rajnoha	a7d6a612b8	fix: 'Couldn't read extent size' --> '... extent start'	2013-02-21 13:33:27 +01:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Zdenek Kabelac	ff5612c0c3	format-text: check for _text_create_text_instance Test if 'fid' creation failed and report stack trace, break the loop and do not pass NULL fid further.	2012-12-15 17:23:23 +01:00
Zdenek Kabelac	21f6511bc2	cleanup: reorder code Swap if() test condition and check for failure and use traditional 'stack' trace.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	09b7ceea95	thin: allow restore with --force Allow restoring metadata with thin pool volumes. No validation is done for this case within vgcfgrestore tool - thus incorrect metadata may lead to destruction of pool content.	2012-11-27 14:08:24 +01:00
Petr Rockai	60668f823e	Automatically restore MISSING PVs with no MDAs.	2012-11-25 20:41:56 +01:00
Zdenek Kabelac	f260f99d57	cleanup: switch log_error to log_warn Use log_warn to print non-fatal warning messages. Use of log_error would confuse checker for testing whether proper error has been reported for some real error.	2012-10-17 15:41:35 +02:00
Petr Rockai	c9f56d639b	lvmetad: Use "%" PRId64 in place of "%d" for extra clarity.	2012-09-26 17:26:16 +02:00
Petr Rockai	2276379a71	lib/cache/lvmetad: Refactor to use dm_config_tree in requests. We were using daemon_send_simple until now, but it is no longer adequate, since we need to manipulate requests in a generic way (adding a validity token to each request), and the tree-based request interface is much more suitable for this.	2012-09-26 14:49:15 +02:00
Zdenek Kabelac	286cd2006b	cleanup: drop unneeded included header files This headers were not resolving anything used for compiled .c files. Remove unused util.c file.	2012-08-23 14:37:20 +02:00
Zdenek Kabelac	6f3cd63551	cleanup: replace memset with struct initilization Simplifies the code, properly detects too long socket paths, drops unused parameter.	2012-06-22 13:23:03 +02:00
Peter Rajnoha	9c17acdfe8	Fix division by zero if PV with zero PE count is used during vgcfgrestore.	2012-05-09 12:30:56 +00:00
Peter Rajnoha	cb08b8eb7e	Check if info struct returned is not NULL. Just some missing checks revealed by Coverity in recent code.	2012-04-10 12:26:27 +00:00
Alasdair Kergon	9c159ea320	Pass struct device around internally rather than dev_t. Add 3rd daemon return state "unknown" for lookups that are carried out successfully but don't find the item requested. Avoid issuing error messages when it's expected that a device that's being looked up in lvmetad might not be there.	2012-03-02 20:46:36 +00:00
Alasdair Kergon	d742cdf327	Change pvscan --lvmetad to pvscan --cache.	2012-03-02 18:09:46 +00:00
Alasdair Kergon	5b613cff97	Pass 'single_device' parameter down to suppress 'Can't find uuid' messages when reading VG text metadate and called from pvscan --lvmetad. (Longer-term, that check needs moving outside of that code.)	2012-02-29 02:35:35 +00:00
Zdenek Kabelac	a46cc72fd2	Add some stack traces for dev_close error paths	2012-02-28 10:11:35 +00:00
Zdenek Kabelac	d2a3352755	Just code move of hash initialization in front of function Make sure both hash tables are initialized before _read_sections() call. Presents no functional change (since PV scan phase was not adding LV hashes), but makes the code easier to handle mem failing case, and static analyzer is hapier as well.	2012-02-27 11:40:58 +00:00
Zdenek Kabelac	b9141fcefa	Add stack traces for lock_vol failures Adding at least stack traces with some FIXMEs for cases, where we might want to do something cleaver - maybe fail command or give user hints something is not going well ? For remote_backup is stack probably 'good' enough for now.	2012-02-27 11:35:59 +00:00
Zdenek Kabelac	c608e46675	Remove test for pvid Since pvid is char buffer[] and not pointer, there is no point to check it for NULL.	2012-02-27 09:54:25 +00:00
Zdenek Kabelac	b6c5ea358e	Some reformating for lvmetad uddates cleanup gcc warning, use PRIu64 header cleanups const pointer fixes.	2012-02-23 17:59:32 +00:00
Petr Rockai	dae0822698	The lvmetad client-side integration. Only active when use_lvmetad = 1 is set in lvm.conf and lvmetad is running.	2012-02-23 13:11:07 +00:00
Zdenek Kabelac	bed744c15d	Add check for mda_copy failure	2012-02-13 11:09:25 +00:00
Zdenek Kabelac	52f2f3eae4	Add free_orphan_vg Move commod code to destroy orphan VG into free_orphan_vg() function. Use orphan vgmem for creation of PV lists. Remove some free_pv_fid() calls (FIXME: check all of them) FIXME: Check whether we could merge release_vg back again for all VGs.	2012-02-13 11:03:59 +00:00
Zdenek Kabelac	f9411bb2af	Clean error paths for format instance With updated orphan VG code this code needed some updates. Add missing log_error for allocation failures.	2012-02-13 10:56:31 +00:00
Alasdair Kergon	b719e3d323	FMT_INSTANCE_VG is redundant now	2012-02-12 23:01:19 +00:00
Petr Rockai	6e41729eb8	Keep a global (per-format) orphan_vg and keep any and all orphan PVs linked to it. Avoids the need for FMT_INSTANCE_PV and enables further simplifications. No functional change, internal refactor only.	2012-02-10 02:53:03 +00:00
Petr Rockai	8e5f7cf3dc	Move lvmcache data structures behind an API (making the structures private to lvmcache.c). No functional change.	2012-02-10 01:28:27 +00:00
Zdenek Kabelac	33dea28e23	Use dm_snprintf and improve error handling Add standard error reporting with error logging. Use plain alloc instead of zalloc for string buffer. Use dm_snprintf with valid test for <0.	2012-02-08 12:50:10 +00:00
Zdenek Kabelac	ee54e43702	Fix resource leaks for failing allocation In case, something would fail during format initialization, return allocated memory.	2012-02-08 10:49:36 +00:00
Zdenek Kabelac	96bffe6a4a	Instrument code that pointer are already released Set pointers to NULL since on the function exit they are no longer valid.	2012-01-25 22:35:36 +00:00
Zdenek Kabelac	e6771e50a9	Check for correctness of uint64 value if exists	2012-01-25 21:43:51 +00:00
Zdenek Kabelac	18b3d24692	Thin until proper vgcfgrestore for thin is implementad, disable restore. Since it may probably do more harm to leave it enabled - add extra test for presence of thin volumes in VG, and in this case disable restore.	2012-01-20 11:01:13 +00:00
Zdenek Kabelac	53d7985fa1	Add support to keep info about creation time and host for each LV Basic support to keep info when the LV was created. Host and time is stored into LV mda section. FIXME: Current version doesn't support configurable string via lvm.conf and used fixed version strftime "%Y-%m-%d %T %z".	2012-01-19 15:31:45 +00:00
Zdenek Kabelac	2465451549	Rename internal macro to match signess Since _read_int64 called dm_config_get_uint64, rename it to less confusing _read_uint64.	2012-01-19 15:17:46 +00:00
Zdenek Kabelac	61158adbcf	Allow empty strings for description and creation_host config fields	2011-12-21 12:49:00 +00:00
Petr Rockai	845b1df617	Make a cleaner split between config tree and config file functionality. Move the latter out of libdm.	2011-12-18 21:56:03 +00:00
Jonathan Earl Brassow	0c506d9a40	Support the ability to replace specific devices in a RAID array. RAID is not like traditional LVM mirroring. LVM mirroring required failed devices to be removed or the logical volume would simply hang. RAID arrays can keep on running with failed devices. In fact, for RAID types other than RAID1, removing a device would mean substituting an error target or converting to a lower level RAID (e.g. RAID6 -> RAID5, or RAID4/5 to RAID0). Therefore, rather than removing a failed device unconditionally and potentially allocating a replacement, RAID allows the user to "replace" a device with a new one. This approach is a 1-step solution vs the current 2-step solution. example> lvconvert --replace <dev_to_remove> vg/lv [possible_replacement_PVs] '--replace' can be specified more than once. example> lvconvert --replace /dev/sdb1 --replace /dev/sdc1 vg/lv	2011-11-30 02:02:10 +00:00
Zdenek Kabelac	900f5f8187	Replace dynamic buffer allocations for PATH_MAX Use static buffer instead of stack allocated buffer. This reduces stack size usage of lvm tool and the change is very simple. Since the whole library is not thread safe - it should not add any new problems - and if there will be some conversion it's easy to convert this to use some preallocated buffer.	2011-11-18 19:31:09 +00:00
Peter Rajnoha	5680d14ecd	Avoid 'mda inconsistency' by properly registering UNLABELLED_PV flag (2.02.86). When a PV label write is deferred to a vg_write call (as introduced by a patch in 2.02.86), the PV is flagged with the internal UNLABELLED_PV flag. However, when calling vg_archive before vg_write, we still have the PV labelled with the UNLABELLED_PV flag which was not recognised as a proper flag while exporting VG metadata: # vgcreate vg /dev/sda No physical volume label read from /dev/sda Metadata inconsistency: Not all flags successfully exported. Metadata inconsistency: Not all flags successfully exported. Writing physical volume data to disk "/dev/sda" Physical volume "/dev/sda" successfully created Volume group "vg" successfully created	2011-11-15 11:54:15 +00:00
Zdenek Kabelac	f2c56bc3b6	Drop mempool parameter from read functions Use implicit vgmem pool.	2011-10-23 16:05:45 +00:00
Zdenek Kabelac	72ff89d279	Always use vg memory pool for allocated lv segment Remove mem pool parameter from alloc_lv_segment() Since we should always allocate LV segment from the vg mempool.	2011-10-23 16:02:01 +00:00
Alasdair Kergon	ef78ebf35a	lvcreate/remove thin_pool and thin volumes (--driverloaded n only)	2011-09-08 16:41:18 +00:00
Alasdair Kergon	9ac61d2ba2	lvcreate parsing for thin provisioning. The rest is incomplete so this isn't usable yet.	2011-09-06 00:26:42 +00:00
Zdenek Kabelac	3caa77f831	Use size_t return type Since these function returns buffer size - use size_t type for them.	2011-09-01 10:25:22 +00:00
Petr Rockai	97a4b5165e	Replace const usage of dm_config_find_node with more appropriate value-lookup functionality. A number of bugs (copied and pasted all over the code) should disappear: - most string lookup based on dm_config_find_node would segfault when encountering a non-zero integer (the intention there was to print an error message instead) - check for required sections in metadata would have been satisfied by values as well (i.e. not sections) - encountering a section in place of expected flag value would have segfaulted (due to assumed but unchecked cn->v != NULL)	2011-08-31 15:19:19 +00:00
Petr Rockai	e59e2f7c3c	Move the core of the lib/config/config.c functionality into libdevmapper, leaving behind the LVM-specific parts of the code (convenience wrappers that handle `struct device` and `struct cmd_context`, basically). A number of functions have been renamed (in addition to getting a dm_ prefix) -- namely, all of the config interface now has a dm_config_ prefix.	2011-08-30 14:55:15 +00:00
Peter Rajnoha	d35188058b	Directly allocate buffer memory in a pvck scan instead of using a mempool. There's a very high memory usage when calling _pv_analyse_mda_raw (e.g. while executing pvck) that can end up with "out of memory". _pv_analyse_mda_raw scans for metadata in the MDA, iteratively increasing the size to scan with SECTOR_SIZE until we find a probable config section or we're at the edge of the metadata area. However, when using a memory pool, we're also iteratively chasing for bigger and bigger mempool chunk which can't be found and so we're always allocating a new one, consuming more and more memory... This patch just changes the mempool to direct memory allocation in this problematic part of the code.	2011-08-29 13:37:36 +00:00
Zdenek Kabelac	077a6755ff	Replace free_vg with release_vg Move the free_vg() to vg.c and replace free_vg with release_vg and make the _free_vg internal. Patch is needed for sharing VG in vginfo cache so the release_vg function name is a better fit here.	2011-08-10 20:25:29 +00:00
Jonathan Earl Brassow	cac52ca4ce	Add basic RAID segment type(s) support. Implementation described in doc/lvm2-raid.txt. Basic support includes: - ability to create RAID 1/4/5/6 arrays - ability to delete RAID arrays - ability to display RAID arrays Notable missing features (not included in this patch): - ability to clean-up/repair failures - ability to convert RAID segment types - ability to monitor RAID segment types	2011-08-02 22:07:20 +00:00
Zdenek Kabelac	bebe60b70c	Code move of vg_mark_partial() up in stack It's useful to keep the partial flag cached - so just move the call for vg_mark_partil_lvs() into import_vg_from_config_tree() so it gets evaluated before it goes through the lvmcache. This patch should not present any functional change. Note: It is rather temporal solution - proper place is probably inside the 'read' call back - but needs some more discussion. For now using this minor hack.	2011-06-17 14:39:10 +00:00
Zdenek Kabelac	93a98c2672	Remove unused internal flag ACTIVATE_EXCL from the code	2011-06-17 14:30:58 +00:00
Petr Rockai	6d25c0d26f	Fix RHBZ 651590 (failure to lock LV results in failure to repair mirror after transient error), stemming from the following sequence of events: 1) devices fail IO, triggering repair 2) dmeventd starts fixing up the mirror 3) during the downconversion, a new metadata version is written --> the devices come back online here 4) the mirror device suspend/resume is called to update DM tables 5) during the suspend/resume cycle, pre-commit metadata is read; however, since the failed devices are now back online, we get back inconsistent set of precommit metadata and the whole operation fails The patch relaxes the check that fails in step 5 above, namely by ignoring inconsistencies coming from PVs that are marked MISSING.	2011-06-15 17:45:02 +00:00
Alasdair Kergon	3cac20f850	Defer writing PV labels to vg_write. Store label_sector only in struct physical_volume.	2011-06-01 19:29:31 +00:00
Peter Rajnoha	c08c564e21	Use new dev_open_readonly fn to prevent opening devices for read-write when not necessary. Before, we used vg_write_lock_held call to determnine the way a device is opened. Unfortunately, this opened many devices in RW mode when it was not really necessary. With the OPTIONS+="watch" rule used in the udev rules, this could fire numerous events while closing such devices (and it caused useless scans from within udev rules in return). A common bug we hit with this was with the lvremove command which was unable to remove the LV since it was being opened from within the udev rules. This patch should minimize such situations (at least with respect to LVM handling of devices). Though there's still a possibility someone will open a device 'outside' in parallel and fire the event based on the watch rule when closing a device once opened for RW.	2011-05-28 09:48:14 +00:00
Alasdair Kergon	5510b4e7d7	test update without WHATS_NEW to check it gives warning now	2011-04-29 19:06:17 +00:00
Zdenek Kabelac	b680d5bf7b	Fix use of released vgname and vgid Avoid using of already released memory when duplicated MDA is found. As get_pv_from_vg_by_id() may call lvmcache_label_scan() use the local copy of the vgname and vgid on the stack as vginfo may dissapear and code was then accessing garbage in memory. i.e. pvs /dev/loop0 (when /dev/loop0 and /dev/loop1 has same MDA content) Invalid read of size 1 at 0x523C986: dm_hash_lookup (hash.c:325) by 0x440C8C: vginfo_from_vgname (lvmcache.c:399) by 0x4605C0: _create_vg_text_instance (format-text.c:1882) by 0x46140D: _text_create_text_instance (format-text.c:2243) by 0x47EB49: _vg_read (metadata.c:2887) by 0x47FBD8: vg_read_internal (metadata.c:3231) by 0x477594: get_pv_from_vg_by_id (metadata.c:344) by 0x45F07A: _get_pv_if_in_vg (format-text.c:1400) by 0x45F0B9: _populate_pv_fields (format-text.c:1414) by 0x45F40F: _text_pv_read (format-text.c:1493) by 0x480431: _pv_read (metadata.c:3500) by 0x4802B2: pv_read (metadata.c:3462) Address 0x652ab80 is 0 bytes inside a block of size 4 free'd at 0x4C2756E: free (vg_replace_malloc.c:366) by 0x442277: _free_vginfo (lvmcache.c:963) by 0x44235E: _drop_vginfo (lvmcache.c:992) by 0x442B23: _lvmcache_update_vgname (lvmcache.c:1165) by 0x443449: lvmcache_update_vgname_and_id (lvmcache.c:1358) by 0x443C07: lvmcache_add (lvmcache.c:1492) by 0x46588C: _text_read (text_label.c:271) by 0x466A65: label_read (label.c:289) by 0x4413FC: lvmcache_label_scan (lvmcache.c:635) by 0x4605AD: _create_vg_text_instance (format-text.c:1881) by 0x46140D: _text_create_text_instance (format-text.c:2243) by 0x47EB49: _vg_read (metadata.c:2887) Add testing script	2011-04-21 13:13:40 +00:00
Zdenek Kabelac	2c5827076b	Add missing printf attributes These attributes were missing in previous patch, that was adding instrumentation for printf formating string parameter.	2011-04-08 14:21:34 +00:00
Jonathan Earl Brassow	60c10a45ce	s/MIRROR_NOTSYNCED/LV_NOTSYNCED/ - Flag will may refer to more than just mirrors	2011-03-29 12:51:57 +00:00
Alasdair Kergon	9c58641e74	Rename _check_version	2011-03-27 13:44:08 +00:00
Zdenek Kabelac	844b75f4d6	Fix allocation of system_id As code uses strncpy(system_id, NAME_LEN) and doesn't set '\0' Fix it by always allocating NAME_LEN + 1 buffer size and with zalloc we always get '\0' as the last byte. This bug may trigger some unexpected behavior of the string operation code - depends on the pool allocator. FIXME: refactor this code to alloc_vg.	2011-03-13 23:05:48 +00:00
Peter Rajnoha	ff4479414c	Use format instance mempool where possible and adequate.	2011-03-11 15:10:16 +00:00
Peter Rajnoha	e8d4946ec7	Various cleanups for fid mem and ref_count changes. Missing free_vg on error_path in lvmcache_get_vg fn. Call destroy_instance only if the fid is not part of the vg in backup_read_vg fn (otherwise it's part of the VG we're returning and we definitely don't want to destroy it!).	2011-03-11 15:08:31 +00:00
Peter Rajnoha	1307ddf4cf	Use only vg_set_fid and new pv_set_fid fn to assign the format instance. This is essential for proper format instance ref_count support. We must use these functions to set the fid everywhere from now on, even the NULL value!	2011-03-11 14:50:13 +00:00
Peter Rajnoha	293481107f	Make create_text_context fn static and move it inside create_instance fn. We'd like to use the fid mempool for text_context that is stored in the instance (we used cmd mempool before, so the order of initialisation was not a matter, but now it is since we need to create the fid mempool first which happens in create_instance fn). The text_context initialisation is not needed anywhere outside the create_instance fn so move it there.	2011-03-11 14:45:17 +00:00
Peter Rajnoha	a1bec4e685	Add mem and ref_count fields to struct format_instance for own mempool use. Format instances can be created anytime on demand and it contains metadata area information mostly (at least for now, but in the future, we may store more things here to update/edit in a PV/VG). In case we have lots of metadata areas, memory consumption will rise. Using cmd context mempool is not quite optimal here because it is destroyed too late. So let's use a separate mempool for format instances. Reference counting is used because fids could be shared, e.g. each PV has either a PV-based fid or VG-based fid. If it's VG-based, each PV has a shared fid with the VG - a reference to VG's fid.	2011-03-11 14:38:38 +00:00
Peter Rajnoha	56f5b12eed	Use new alloc_fid fn for common format instance initialisation.	2011-03-11 14:30:27 +00:00
Zdenek Kabelac	3019419e95	Refactor vg allocation code Create new function alloc_vg() to allocate VG structure. It takes pool_name (for easier debugging). and also take vg_name to futher simplify code. Move remainder of _build_vg_from_pds to _pool_vg_read and use vg memory pool for import functions. (it's been using smem -> fid mempool -> cmd mempool) (FIXME: remove mempool parameter for import functions and use vg). Move remainder of the _build_vg to _format1_vg_read	2011-03-10 12:43:29 +00:00
Peter Rajnoha	15b9215534	Use a copy if moving an mda from pv fid to vg fid. We'll destroy the pv fid (with all mdas in it) after merging all pv mdas to a vg in _text_pv_setup fn, hence we need to use a copy here!	2011-03-02 10:23:29 +00:00
Peter Rajnoha	0b100565ae	Make add_metadata_area_to_pv/remove_metadata_area_from_pv static. No need to put these in format-text.h, it's not used anywhere else actually.	2011-03-02 10:19:14 +00:00
Milan Broz	0cb777d642	Rephrase backup message.	2011-02-28 20:50:01 +00:00
Peter Rajnoha	150e43a05c	Use pv->vg_name directly instead of pv->vg->name in _text_pv_write. This also prevents a possible segfault during an automatic repair when the PV does not belong to a VG anymore and we call pv_write_orphan.	2011-02-28 17:05:48 +00:00
Peter Rajnoha	3b97e8d643	Allow non-orphan PVs with two metadata areas to be resized. We allow writing non-orphan PVs only for resize now. The "orphan PV" assert in pv_write fn uses the "allow_non_orphan" parameter to control this assert. However, we should find a more elaborate solution so we can remove this restriction altogether (pv_write together with vg_write is not atomic, we need to find a safe mechanism so there's an easy revert possible in case of an error).	2011-02-28 13:19:02 +00:00
Peter Rajnoha	4b8f066c19	vgconvert is fixed now to work with the changes in metadata area handling - enable the tests. Add a small fix that preserves pe_start for lvm1 PVs when being converted. (this fix needs to be replaced with something more clever, but let's have this working now)	2011-02-25 14:12:14 +00:00
Peter Rajnoha	4a304dc1d8	Allow only orphan PVs to be resized even with two metadata areas.	2011-02-25 14:08:54 +00:00
Peter Rajnoha	38b0564cab	Read PV metadata information from cache if pv_setup called with pv->fid == vg->fid. If the PV is already part of the VG (so the pv->fid == vg->fid), it makes no sense to attach the mdas information from PV to a VG. Instead, we read new PV metadata information from cache and attach it to the VG fid.	2011-02-25 13:59:47 +00:00
Peter Rajnoha	ea4a41e961	Fix a bug in metadata location calculation, cleanup pv_add_metadata_area fn. This bug (a missing line) caused the 2nd MDA area location to be calculated incorrectly and it didn't fit the disk size properly. (https://www.redhat.com/archives/lvm-devel/2011-February/msg00127.html)	2011-02-25 13:50:02 +00:00
Peter Rajnoha	51aed1992f	Add old_uuid field to struct physical_volume so we can still reference a PV with its old UUID when we're changig it (the cache as well as metadata area index has the old uuid that we need to use to access the information!)	2011-02-21 12:31:28 +00:00
Peter Rajnoha	cb2396730a	Change pvresize code to work with new metadata handling interface and allow resizing a PV with two metadata areas.	2011-02-21 12:27:26 +00:00
Peter Rajnoha	17ad2b1115	Change pv_write code to work with the changes in metadata handling interface and changes in format_instance.	2011-02-21 12:26:27 +00:00
Peter Rajnoha	903d7db050	Remove unused _mda_setup fn. This functionality is covered by new pv_add_metadata_area fn.	2011-02-21 12:25:16 +00:00
Peter Rajnoha	94d91fdda1	Change the code throughout to use new pv_initialise and modified pv_setup fn. Change pv_create code to work with these changes together with using new pv_add_metadata_area fn to add metadata areas for a PV being created.	2011-02-21 12:24:15 +00:00
Peter Rajnoha	617b900d85	Separate new pv_initialise function out of the original pv_setup code. pv_initiliase initialises a new PV pv_setup sets up an existing PV with a VG	2011-02-21 12:20:18 +00:00
Peter Rajnoha	981895a860	Add new pv_remove_metadata_area interface function.	2011-02-21 12:17:54 +00:00
Peter Rajnoha	8d5d20a526	Add new pv_add_metadata_area interface function.	2011-02-21 12:17:26 +00:00
Peter Rajnoha	305816232d	Remove useless mdas parameter for pv_read (from now on, we store mdas in a format instance)	2011-02-21 12:15:59 +00:00
Peter Rajnoha	f8b78ec613	Add vg_set_fid function to change VG format instance. This function also sets a reference to a new VG format instance for all PVs that are part of the VG so the PV-VG interconnection is consistent after the change.	2011-02-21 12:10:58 +00:00
Peter Rajnoha	c0c21864c6	Change the code throughout for recent changes in format_instance handling.	2011-02-21 12:07:03 +00:00
Peter Rajnoha	88129db5e1	Change create_instance to create PV-based as well as VG-based format instances. Add supporting functions to work with the format instance and metadata area structures stored within the format instance. Add support for simple indexing of metadata areas using PV id and mda order (for on-disk PV only for now, we can extend the indexing even for other mdas if needed - we only need to define a proper key for the index).	2011-02-21 12:05:49 +00:00
Zdenek Kabelac	4ebc6404ee	Void* arithmetic replaced with char*	2011-02-18 14:34:41 +00:00
Zdenek Kabelac	b1bcff7424	Critical section New strategy for memory locking to decrease the number of call to to un/lock memory when processing critical lvm functions. Introducing functions for critical section. Inside the critical section - memory is always locked. When leaving the critical section, the memory stays locked until memlock_unlock() is called - this happens with sync_local_dev_names() and sync_dev_names() function call. memlock_reset() is needed to reset locking numbers after fork (polldaemon). The patch itself is mostly rename: memlock_inc -> critical_section_inc memlock_dec -> critical_section_dec memlock -> critical_section Daemons (clmvd, dmevent) are using memlock_daemon_inc&dec (mlockall()) thus they will never release or relock memory they've already locked memory. Macros sync_local_dev_names() and sync_dev_names() are functions. It's better for debugging - and also we do not need to add memlock.h to locking.h header (for memlock_unlock() prototyp).	2011-02-18 14:16:11 +00:00
Zdenek Kabelac	135af49da5	Increase hash table size to 1024 lv names and 64 pv uuids	2011-02-03 16:03:13 +00:00
Zdenek Kabelac	16f000bcb4	Fix wipe size when seting up mda.	2011-02-03 01:41:03 +00:00
Zdenek Kabelac	a5c6acf22a	Skip NULL check before dm_free dm_free checks for NULL itself.	2011-01-28 10:16:04 +00:00
Zdenek Kabelac	6feecf76d4	Change import_vg_from_buffer to use config_tree Change function import_vg_from_buffer() to import_vg_from_config_tree(). Instead of creating config tree inside the function allow config tree to be passed as parameter - usable later for caching.	2011-01-10 13:13:42 +00:00
Zdenek Kabelac	ff4a77c5ca	Intentionaly ignore result from get_config_uint32	2011-01-06 15:25:07 +00:00
Zdenek Kabelac	5fc79ef6dc	Add sys_debug loging for unlink This unlink intentionally silently ignores any errors. It's still worth to trace its error status in debug mode.	2011-01-05 15:06:10 +00:00
Zdenek Kabelac	0ddb15964a	Remove check for existance of vg pointer Checking for vg being != NULL in this place is not needed. Pointer vg is already dereferced in this function above this code line. Also this internal function _read_pv is always called with valid 'vg' pointer.	2010-12-22 15:44:09 +00:00
Zdenek Kabelac	1102378e1c	Add backtraces for archive and backup_locally If archive or back_locally fails - add stack trace.	2010-12-22 13:45:33 +00:00
Zdenek Kabelac	9d9de35dca	Remove const usage from destroy callbacks As const segment_type or const format_type are never released use their non-const version and remove const downcast from dm_free calls. This change fixes many gcc warnings we were getting from them.	2010-12-20 13:32:49 +00:00
Zdenek Kabelac	ba96eb24fa	Some const cleanups Minor const warning fixes and internal API updates.	2010-12-20 13:19:13 +00:00
Zdenek Kabelac	760d1fac55	Add more strict const pointers around config tree To have better control were the config tree could be modified use more const pointers and very carefully downcast them back to non-const (for config tree merge).	2010-12-20 13:12:55 +00:00
Alasdair Kergon	acb037657c	Fix scanning of VGs without in-PV mdas. Set cmd->independent_metadata_areas if metadata/dirs or disk_areas in use. - Identify and record this state. Don't skip full scan when independent mdas are present even if memlock is set. - Clusters and OOM aren't supported, so no problem doing the proper scans. Avoid revalidating the label cache immediately after scanning. - A simple optimisation. Support scanning for a single VG in independent mdas. - Not used by the fix but I left it in anyway as later patches might use it.	2010-12-10 22:39:52 +00:00
Alasdair Kergon	2b82bd79f5	Rename vg_release to free_vg.	2010-12-08 20:50:48 +00:00
Alasdair Kergon	1415afcdba	Fix memory leak when VG allocation policy in metadata is invalid. Ignore unrecognised allocation policy found in metadata instead of aborting. Fix another missing vg_release() in _vg_read_by_vgid.	2010-11-29 18:35:37 +00:00
Zdenek Kabelac	21ba805499	Fix memory leak in error path Nicely hidden memory leak in outf macro error path. This macro is using out_text() and does automagical return_0. That would leak tag_buffer allocated memory. As there was same code for tags output - create _out_tags() function.	2010-11-29 12:19:58 +00:00
Zdenek Kabelac	dce59eb407	Remove unused 'i' in _pv_analyze_mda_raw 'i' is unused in the function - remove it.	2010-11-29 11:16:58 +00:00
Zdenek Kabelac	419d5219cb	Fix NULL pointer dereference for too large MDA error path Replace dereference of NULL vg with passed vgname to the function _vg_read_raw_area() in the error path for too large MDA.	2010-10-26 09:13:13 +00:00
Dave Wysochanski	637ac19e60	Rename 'flags' to 'status' for struct metadata_area. In other LVM memory structures such as volume_group, the field used to store flags is called "status", and on-disk fields are called 'flags', so rename the one inside metadata_area to be consistent. Not only is it more consistent with existing code but is cleaner to say "the status of this mda is ignored". Background for this patch - prajnoha pinged me on IRC this morning about a fix he was working on related to metadataignore when metadata/dirs was set. I was reviewing my patches from this year and realized the 'flags' field was probably not the best choice when I originally did the metadataignore patches.	2010-10-05 17:34:05 +00:00
Alasdair Kergon	ac0252ca07	Add dm_zalloc and use it and dm_pool_zalloc throughout.	2010-09-30 21:06:50 +00:00
Peter Rajnoha	5936dd0381	Fix memory leak of vg_read while using live copies of metadata in directories.	2010-09-30 14:12:14 +00:00
Alasdair Kergon	44a31a9c2f	Speed up CRC32 calculations by using a larger lookup table. Use -DDEBUG_CRC32 to revert to old function and check new one gives same result.	2010-09-27 19:09:34 +00:00
Peter Rajnoha	064ed484b4	"goto_bad" should be used in alloc_printed_tags function, not "goto bad".	2010-09-21 10:42:02 +00:00
Peter Rajnoha	48ae64529a	Use dynamic allocation for metadata's tag buffer (removes 4096 char. limit).	2010-09-20 14:23:20 +00:00
Peter Rajnoha	d20ce59b80	Add random suffix to archive file names to prevent races when being created. In certain configurations, we're not under a VG rw lock while trying to write a new archive file with VG metadata. A common example is using "vgs" while having the content of backup and archive directories empty. The code scans the content of these directories and tries to determine the final index that should be used in archive name. Since we're not under a lock, we can get into a race while choosing the index which could end up showing errors about not being able to rename to final archive name. Let's add random number suffix to these archive file names so we can avoid the race.	2010-09-09 13:13:12 +00:00
Peter Rajnoha	dc8478458e	Reinitialize archive and backup handling on toolcontext refresh. For example, when using '--config "backup { ... }"' line, the values from lvm.conf (or default values) should be overridden. This patch adds reinitialisation of archive and backup handling on toolcontext refresh which makes these settings to be applied.	2010-09-09 13:07:13 +00:00
Milan Broz	fc86426b56	Fix previous const removal.	2010-08-26 12:22:05 +00:00
Mike Snitzer	4efb1d9cbb	Update heuristic used for default and detected data alignment. Add "devices/default_data_alignment" to lvm.conf to control the internal default that LVM2 uses: 0==64k, 1==1MB, 2==2MB, etc. If --dataalignment (or lvm.conf's "devices/data_alignment") is specified then it is always used to align the start of the data area. This means the md_chunk_alignment and data_alignment_detection are disabled if set. (Same now applies to pvcreate --dataalignmentoffset, the specified value will be used instead of the result from data_alignment_offset_detection) set_pe_align() still looks to use the determined default alignment (based on lvm.conf's default_data_alignment) if the default is a multiple of the MD or topology detected values.	2010-08-20 20:59:05 +00:00
Mike Snitzer	14a9722185	Avoid changing aligned pe_start as a side-effect of very verbose logging.	2010-08-03 18:19:42 +00:00
Zdenek Kabelac	c10f7fd039	Fix constness warning in archive_file structure from archive.c.	2010-08-03 13:09:21 +00:00
Zdenek Kabelac	9f926fd060	Use void parameter for function definition.	2010-08-03 13:06:35 +00:00
Alasdair Kergon	08f1ddea6c	Use __attribute__ consistently throughout.	2010-07-09 15:34:40 +00:00
Dave Wysochanski	a5fb2bbff3	Pass metadataignore to pv_create, pv_setup, _mda_setup, and add_mda. Pass metadataignore through PV creation / setup paths. As a result of this cleanup, we can remove the unnecessary setting of mda_ignore bits inside pvcreate_single(), after call to pv_create. For now, just set metadataignore to '0' in some places. This is equivalent to the prior functionality, although the 0 is given by the caller not hardcoded in _mda_setup() call. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-08 18:24:29 +00:00
Alasdair Kergon	d8886386bd	more mda ignore cleanups	2010-06-30 19:28:35 +00:00
Alasdair Kergon	23177eda88	more metadataignore message/code cleanup	2010-06-30 17:13:05 +00:00
Alasdair Kergon	647c64c796	Improve various log messages.	2010-06-30 13:51:11 +00:00
Dave Wysochanski	a5bf70018b	Add --metadataignore to pvcreate. Allow metadataignore flag to be passed in to pvcreate. Ideally, more refactoring of the mda allocation / initialization is warranted, but for now, we just add another parameter to 'add_mda' to take an existing mda ignored flag. We need to do this or pv_write loses the state of the mda 'ignored' flag before copying and writing to disk.	2010-06-30 12:17:24 +00:00
Dave Wysochanski	d37dd5b2d3	Improve logging for metadata ignore by printing device name. Print device name when setting or clearing metadata ignore bit. Example: label/label.c:160 /dev/loop2: lvm2 label detected cache/lvmcache.c:1136 lvmcache: /dev/loop2: now in VG #orphans_lvm2 (#orphans_lvm2) metadata/metadata.c:4142 Setting mda ignored flag for metadata_locn /dev/loop2. format_text/text_label.c:318 Skipping mda with ignored flag on device /dev/loop2 at offset 4096	2010-06-29 22:37:32 +00:00
Dave Wysochanski	710c9373bf	Add some log_verbose debug statements related to metadataignore. Logging isn't ideal, especially for mda_set_ignore. Ideally we'd like to display the device name and offset in this case but this requires a bit more work and a per-format 'mda_description' function pointer definition (we don't have access to mda_context in metadata.c).	2010-06-29 22:25:58 +00:00
Dave Wysochanski	559aee44ab	Add error message if backup_to_file fails because of empty in_use mdas list.	2010-06-29 15:03:59 +00:00
Dave Wysochanski	5778fdeeb8	Add more initializations of 'mda->flags' field. Mda allocation needs refactored into a single function but as an interim step, ensure mda->flags is initialized properly.	2010-06-29 14:52:56 +00:00
Dave Wysochanski	fa832e3a55	Attempt to fix intermittent failure with non-debug configured vgcfgbackup. There's an intermittent failure with vgcfgbackup that seems to have been introduced with the metadataignore / vgmetadatacopies patchset. Intermittent failures are often the result of uninitialized data, so this patch calls zalloc in a few places it might matter.	2010-06-29 13:29:53 +00:00
Dave Wysochanski	7042e06a2a	Make vg->mda_copies persistent in on disk vg metadata. This patch adds the ability to read/write the vg->mda_copies values from/to the vg metadata. If we read the VG metadata and this field does not exist, we set mda_copies to the default value of 0. Later in the code, we use this special '0' value to indicate a disable of metadata balancing. This should preserve existing LVM behavior and ensure metadata balancing can be turned off should the need arise. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:10 +00:00
Dave Wysochanski	69d1732334	Update _vg_read and _text_create_text_instance to use fid_add_mda[s]. When we are constructing the vg, we may need to adjust the list of metadata_areas if there are ignored mdas. At label read time, we do not read the metadata of ignored mdas, and as a result, they do not get placed on vg->fid->metadata_areas inside _text_create_text_instance since lvmcache does not have these areas attached to vginfo->infos. However, when we're checking the pvids inside _vg_read, after having read another metadata area from another PV, we do have the opportunity to update the metadata_area and metadata_areas_ignored lists based on the read metadata_area. We need accurate mda lists for the reporting functions that count the ignored mdas, as well as general correctness of mda balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:17 +00:00
Dave Wysochanski	e6bd367b57	Implement ignore of mda if bit set by skipping r/w of metadata. We implement ignore of an mda at label_read time by checking for the ignore bit, and then skipping the reading of the vgname and other information in the metadata. This will have an effect similar to a PV found with no mdas. Thus, it will look like an orphan in the cache until we scan the rest of the system and find a PV with metadata, and the mda will not be on the vg->fid->metadata_areas list so no read/writes will be done to the metadata area. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:24 +00:00
Dave Wysochanski	9ccac021a7	Add metadata_areas_ignored list and functions to manage ignored mdas. Add a second mda list, metadata_areas_ignored to fid, and a couple functions, fid_add_mda() and fid_add_mdas() to help manage the lists. These functions are needed to properly count the ignored mdas and manage the lists attached to the 'fid' and ultimately the 'vg'. Ensure metadata_areas_ignored is initialized in other formats, even if the list is never used. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:22 +00:00
Dave Wysochanski	f55a20eb36	Rename fid->metadata_areas to fid->metadata_areas_in_use. Rename the metadata_areas list to an 'in_use' list to prepare for future 'ignored' list.	2010-06-28 20:32:44 +00:00
Dave Wysochanski	ef4fa155a5	Add mda location specific mda_copy constructor. Because of the way mdas are handled internally, where a PV in a VG has mdas on both info->mdas and vg->fid->metadata_areas list, we need a location independent copy constructor for struct metadata_area. Break up the existing format-text specific copy constructor into a format independent piece and a format dependent piece. This function is necessary to properly implement pv_set_mda_ignored(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:59 +00:00
Dave Wysochanski	29f24d4634	Add mda_locns_match() internal library function for mapping pv/device to VG mda. A metadata_area is defined independent of the location. One downside is that there is no obvious mapping from a pv to an mda. For a PV in a VG, we need a way to start with a PV and end up with an MDA, if we are to manage mdas starting with a device/pv. This function provides us a way to go down the list of PVs on a VG, and identify which ones match a particular PV. I'm not entirely happy with this approach, but it does fit into the existing structures in a reasonable way. An alternative solution might be to refactor the VG - PV interface such that mdas are a list tied to a PV. However, this seemed a bit tricky since a PV does not come into existence until after the list of mdas is constructed (see _vg_read() - we create a 'fid' and attach mdas to it, then we go through them and attach pvs). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:38 +00:00
Dave Wysochanski	a6b36a5901	Ensure in-memory state matches on-disk state of mda ignore bit. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:18 +00:00
Dave Wysochanski	09e0f43ba0	Allow raw_read_mda_header to be called from text_label.c. We'd like to pass in mda_header to vgname_from_mda(). In order to do this, we need to call raw_read_mda_header() from text_label.c, _text_read(), which gets called from the label_read() path, and peers into the metadata and update vginfo cache. We should check the disable bit here, and if set, not peer into the vg metadata, thus reducing the I/O to disk. In the process, move vgname_from_mda() to layout.h, since the fn only gets called from format_text code, and we need the mda_header definition from the private layout.h. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:01 +00:00
Dave Wysochanski	da0b4d8770	Move dev_open/dev_close outside vgname_from_mda(). Refactor vgname_from_mda() so caller must open/close the device. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:46 +00:00
Dave Wysochanski	96597c2eab	Move dev_open / dev_close outside _vg_read_raw_area(). This refactoring moves the device open/close up one level to the caller of _vg_read_raw_area(). Should be no functional change and facilitate future changes related to metadata balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:30 +00:00
Dave Wysochanski	322c5868b3	Add location independent flag and functions to ignore mdas. First we add a 'flags' field to the location independent metadata_area structure, and a MDA_IGNORE flag. The mda_is_ignored and mda_set_ignored functions are added to manage the flag. Adding the flag and functions gives a library interface to ignore metadata areas independent of the underlying location (disk, file, etc). The location specific read/write functions must then handle the specifics of what this flag means to the location. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:30:14 +00:00
Dave Wysochanski	d144d5eeb7	Add text format specific 'rlocn' ignore flag and access functions. Adding a flag to the 'rlocn' structure in the mda header of the text format allows us to flip a bit to ignore an area on disk that stores the metadata via the text format specific mda_header. This patch defines the flag and access functions to manage the flag. Other patches will manage the ignore on a format-independent basis, by using a flag in the metadata_area structure. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:57 +00:00
Dave Wysochanski	7c604e7649	Change 'filler' to 'flags' in on-disk 'raw_locn' structure. Future patches will make use of a specific flag in the on-disk 'raw_locn' structure to enable/disable metadata areas, and facilitate metadata balancing. Note that 'filler' is always set to '0' (see add_mda() - memset), so use of this area as a non-zero flags field is a safe way to provide future code features. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:42 +00:00
Dave Wysochanski	58f55600d0	Add device name to output of error messages in raw_read_mda_header(). It would be helpful if we had the device name when something like a mda_header checksum error occurs. Before: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0 After: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum on /dev/loop2 at offset 4096 PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0	2010-06-22 19:18:27 +00:00
Peter Rajnoha	03023d3965	Fix incorrect memory pool deallocation while using vg_read for files. We create a separate pool "lvm2 vg_read" for vg_read and we don't use cmd->mem anymore.	2010-06-01 12:08:50 +00:00
Zdenek Kabelac	8fea97b7e7	Replicator: base lvm2 support Adding configure.in support for Replicators. Adding basic lib lvm support for Replicators. Adding flags REPLICATOR and REPLICATOR_LOG. Adding segments SEG_REPLICATOR and SEG_REPLICATOR_DEV. Adding basic methods for handling replicator metadata.	2010-05-21 12:36:30 +00:00
Petr Rockai	9409998d71	Suppress duplicate error messages about read failures and missing devices.	2010-05-05 22:37:52 +00:00
Peter Rajnoha	1e696b0c15	Do not reset position in metadata ring buffer on vgrename and vgcfgrestore. We should write metadata into next position in the ring buffer while calling vgrename and vgcfgrestore. At this code level (_vg_write_raw), we were not able to determine if this is a rename or not. If yes, then accompanying VG structure passed here has a new name set, not the old one. When looking for a location where to put metadata next, we were given a NULL value because of failed VG name comparison (in _find_vg_rlocn) between the name in existing metadata and metadata we're just about to write. This resets the position in the ring buffer, overwriting any existing metadata (and also incorrectly updates the cache to "orphan" afterwards). This patch just adds old_name item in struct volume_group that we can check and use if necessary and detect renames at lower layers as well. The same applies for vgcfgrestore, but here we're using a special value of old_name, an empty string, to disable the check with existing metadata totally.	2010-04-14 13:09:16 +00:00
Alasdair Kergon	aab7a3978b	Fix pvmove allocation to take existing parallel stripes into account. When moving parts of striped LVs, pvmove wouldn't care about leaving you with two stripes on the same disk. Now --alloc anywhere is needed for that. (Tried and gave up on two alternative approaches before the one committed here.)	2010-04-08 00:28:57 +00:00
Dave Wysochanski	9e82787da2	Add add_pvl_to_vgs() - helper function to add a pv to a vg list. Small refactor of main places in the code where a pv is added to a vg into a small function which adds the pv to the list and updates the vg counts. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:54 +00:00
Dave Wysochanski	36e9d03d1b	Refactor _read_pv() code that updates vg->extent_count and vg->free_count. Simple refactor to mov code that updates the vg extent counts from a single pv's counts close to the code that adds a pv to vg->pvs and updates vg->pv_count. No functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:03 +00:00
Alasdair Kergon	258db3ad8e	Change most remaining log_error WARNING messages to log_warn.	2010-04-01 10:34:09 +00:00
Milan Broz	6733116a19	Fix all segments memory is allocated from vg private mempool. Physical segments were still allocated from global command context mempool. This leads to very high memory usage when activating large VG (vgchange). (Memory usage was about 2G when >3000LVs). Fix it by properly using vg->vgmem private pool, so all the memory is released early. New memory pool parameter is needed here for pv_split_segment function. Also fix the same problem in some minor allocations (vg description, lv segment split).	2010-03-31 17:23:18 +00:00

... 2 3 4 5 6 ...

620 Commits