shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
Peter Rajnoha	303e86adc8	pvcreate: fix alignment to incorporate alignment offset if PV has 0 MDAs If zero metadata copies are used, there's no further recalculation of PV alignment that happens when adding metadata areas to the PV and which actually calculates the alignment correctly as a matter of fact. So fix this for "PV without MDA" case as well. Before this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 8.00m After this patch: [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 1 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m [1] raw/~ # pvcreate --dataalignment 8m --dataalignmentoffset 4m --metadatacopies 0 /dev/sda Physical volume "/dev/sda" successfully created [1] raw/~ # pvs -o pv_name,pe_start PV 1st PE /dev/sda 12.00m Also, remove a superfluous condition "pv->pe_start < pv->pe_align" in: if (pe_start == PV_PE_START_CALC && pv->pe_start < pv->pe_align) pv->pe_start = pv->pe_align ... This part of the condition is not reachable as with the PV_PE_START_CALC, we always have pv->pe_start set to 0 from the PV struct initialisation (...the pv->pe_start value is just being calculated).	2013-02-21 14:51:19 +01:00
Jonathan Brassow	70f57996b3	RAID: Add new 'raid10_segtype_default' setting in lvm.conf If '--mirrors/-m' and '--stripes/-i' are used together when creating a logical volume, mirrors-over-stripes is currently chosen. The user can override this by using the '--type raid10' option on creation. However, we want a place where we can set the default behavior to 'raid10' explicitly - similar to the "mirror" and "raid1" tunable, mirror_segtype_default. A follow-on patch should use this new setting to change the default from "mirror" to "raid10", as this is the preferred segment type.	2013-02-20 15:10:04 -06:00
Jonathan Brassow	dc2ce71313	clean-up: Remove a FIXME question that has been settled It is ok for us to use the shorthand 'lv_is_virtual' to detect error targets in a RAID LV when searching for candidates for device replacement.	2013-02-20 15:03:58 -06:00
Jonathan Brassow	bd0ee420b5	RAID: Allow remove/replace of sub-LVs composed of error segments. When a device fails, we may wish to replace those segments with an error segment. (Like when a 'vgreduce --removemissing' removes a failed device that happens to be a RAID image/meta.) We are then left with images that we will eventually want to remove or replace. This patch allows us to pull out these virtual "error" sub-LVs. This allows a user to 'lvconvert -m -1 vg/lv' to extract the bad sub-LVs. Sub-LVs with error segments are considered for extraction before other possible devices so that good devices are not accidentally removed. This patch also adds the ability to replace RAID images that contain error segments. The user will still be unable to run 'lvconvert --replace' because there is no way to address the 'error' segment (i.e. no PV that it is associated with). However, 'lvconvert --repair' can be used to replace the image's error segment with a new PV. This is also the most appropriate way to do it, since the LV will continue to be reported as 'partial'.	2013-02-20 14:58:56 -06:00
Jonathan Brassow	845852d6b4	RAID: Make 'vgreduce --removemissing' work with RAID LVs Currently it is impossible to remove a failed PV which has a RAID LV on it. This patch fixes the issue by replacing the failed PV with an 'error' segment within the affected sub-LVs. Once there is no longer a RAID LV using the PV, it can be removed. Most often, it is better to replace a failed RAID device with a spare. (You can use 'lvconvert --repair <vg>/<LV>' to accomplish that.) However, if there are no spares in the volume group and none will be added, it is useful to be able to removed the failed device. Following patches address the ability to perform 'lvconvert' operations on RAID LVs that contain sub-LVs composed of 'error' segments.	2013-02-20 14:52:46 -06:00
Jonathan Brassow	0e4ffd9d3b	clean-up: Rename lvm.conf setting 'mirror_region_size' to 'raid_region_size' We have been using 'mirror_region_size' in lvm.conf as the default region size for RAID logical volumes as well as mirror logical volumes. Since, "raid" is more inclusive and representative than "mirror", I have changed the name of this setting. We must still check for the old setting and warn the user if we are overriding it with the new setting if both happen to be present.	2013-02-20 14:40:17 -06:00
Peter Rajnoha	a7d6a612b8	fix: 'Couldn't read extent size' --> '... extent start'	2013-02-21 13:33:27 +01:00
Peter Rajnoha	722ca363f0	report: fix pvs -o pv_free reporting for PVs with 0 PEs [0] raw/~ # lsblk -o NAME,SIZE /dev/sda NAME SIZE sda 128M [0] raw/~ # pvcreate --dataalignment 128m /dev/sda Physical volume "/dev/sda" successfully created [0] raw/~ # vgcreate vg /dev/sda Volume group "vg" successfully created [0] raw/~ # lvcreate -l1 vg Volume group "vg" has insufficient free space (0 extents): 1 required. Before this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 128.00m After this patch: [0] raw/~ # pvs -o pv_name,pv_free PV PFree /dev/sda 0	2013-02-21 13:28:07 +01:00
Zdenek Kabelac	e566faaae6	cleanup: old style gcc	2013-02-05 16:54:12 +01:00
Zdenek Kabelac	d97605beaf	cleanup: preserve signesss and type size on return values	2013-02-05 16:54:11 +01:00
Zdenek Kabelac	7910b6c0ba	thin: update pool_is_active Change it to take LV and move it to exported header - seems to be a better fit for usability from tools/ directory.	2013-02-05 16:54:11 +01:00
Zdenek Kabelac	c984d8fbab	thin: properly unmark volume after detach When the volume is detached form thin pool, unmask THIN_VOLUME flag and reset related pointers.	2013-02-05 14:40:37 +01:00
Zdenek Kabelac	11eaf1c98c	thin: add function pool_is_active This internal function check for active pool device. For cluster it checks every thin volume, On the non-clustered VG we need to check just for presence of -tpool device.	2013-02-05 14:35:44 +01:00
Zdenek Kabelac	9d445f371c	report: leave empty report field for 0 Since we do not support LVs with 0 size, use this value as 'error' value for devices without origin, and leave this field blank as in other cases.	2013-02-05 14:32:37 +01:00
Zdenek Kabelac	ddeb37f282	cleanup: add internal error check Check if 'is_removable' is defined and report internal error, if it's missing.	2013-02-05 14:27:24 +01:00
Jonathan Brassow	f5cd9c3563	clean-up: Another functiont that can use 'lv_layer' lib/activate/dev_manager.c:dev_manager_raid_status() can also use the new 'lv_layer' function.	2013-02-04 17:10:16 -06:00
Zdenek Kabelac	a4870c79ca	thin: use noflush for obtaining transaction_id Do not flush thin pool data, when reading transation_id status.	2013-02-04 19:05:56 +01:00
Zdenek Kabelac	153ce89af3	cleanup: comment update Just update code comment and use single line if().	2013-02-04 19:05:43 +01:00
Zdenek Kabelac	b37a0a39e3	cleanup: indent line	2013-02-04 19:01:11 +01:00
Zdenek Kabelac	8ed0b6f312	thin: replace is_active with send_messages Since is_active is only used for thinp replace struct member with more meaningful send_messages flag	2013-02-04 19:01:10 +01:00
Zdenek Kabelac	4af4241ba4	use lv_layer	2013-02-04 19:01:10 +01:00
Zdenek Kabelac	ca7abbce8a	activate: add lv_layer function Add function to return layer name for LV.	2013-02-04 19:01:10 +01:00
Zdenek Kabelac	9f433e6ee3	cleanup: postpone lv_is_thin_volume check Code move to make it easier to follow and call _add_dev_to_dtree() in the separate if() branch for thin volumes.	2013-02-04 19:00:19 +01:00
Jonathan Brassow	801d4f96a8	RAID: Improve 'lvs' attribute reporting of RAID LVs and sub-LVs There are currently a few issues with the reporting done on RAID LVs and sub-LVs. The most concerning is that 'lvs' does not always report the correct failure status of individual RAID sub-LVs (devices). This can occur when a device fails and is restored after the failure has been detected by the kernel. In this case, 'lvs' would report all devices are fine because it can read the labels on each device just fine. Example: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) However, 'dmsetup status' on the device tells us a different story: [root@bp-01 lvm2]# dmsetup status vg-lv 0 1024000 raid raid1 2 DA 1024000/1024000 In this case, we must also be sure to check the RAID LVs kernel status in order to get the proper information. Here is an example of the correct output that is displayed after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-p 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-p /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-p /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) The other case where 'lvs' gives incomplete or improper output is when a device is replaced or added to a RAID LV. It should display that the RAID LV is in the process of sync'ing and that the new device is the only one that is not-in-sync - as indicated by a leading 'I' in the Attr column. (Remember that 'i' indicates an (i)mage that is in-sync and 'I' indicates an (I)mage that is not in sync.) Here's an example of the old incorrect behaviour: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg Iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note that all the images currently are marked as 'I' even though it is only the last device that has been added that should be marked. Here is an example of the correct output after this patch is applied: [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [root@bp-01 lvm2]# lvconvert -m +1 vg/lv; lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg rwi-a-r-- 0.00 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg iwi-aor-- /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-- /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) Note only the last image is marked with an 'I'. This is correct and we can tell that it isn't the whole array that is sync'ing, but just the new device. It also works under snapshots... [root@bp-01 lvm2]# lvs -a -o name,vg_name,attr,copy_percent,devices vg LV VG Attr Cpy%Sync Devices lv vg owi-a-r-p 33.47 lv_rimage_0(0),lv_rimage_1(0),lv_rimage_2(0) [lv_rimage_0] vg iwi-aor-- /dev/sda1(1) [lv_rimage_1] vg Iwi-aor-p /dev/sdb1(1) [lv_rimage_2] vg Iwi-aor-- /dev/sdc1(1) [lv_rmeta_0] vg ewi-aor-- /dev/sda1(0) [lv_rmeta_1] vg ewi-aor-p /dev/sdb1(0) [lv_rmeta_2] vg ewi-aor-- /dev/sdc1(0) snap vg swi-a-s-- /dev/sda1(51201)	2013-02-01 11:33:54 -06:00
Jonathan Brassow	37ffe6a13a	RAID: Cache previous results of lv_raid_dev_health for future use We can avoid many dev_manager (ioctl) calls by caching the results of previous calls to lv_raid_dev_health. Just considering the case where 'lvs -a' is called to get the attributes of a RAID LV and its sub-lvs, this function would be called many times. (It would be called at least 7 times for a 3-way RAID1 - once for the health of each sub-LV and once for the health of the top-level LV.) This is a good idea because the sub-LVs are processed in groups along with their parent RAID LV and in each case, it is the parent LV whose status will be queried. Therefore, there only needs to be one trip through dev_manager for each time the group is processed.	2013-02-01 11:32:18 -06:00
Jonathan Brassow	c8242e5cf4	RAID: Add RAID status accessibility functions Similar to the way thin* accesses its kernel status, we add a method for RAID to grab the various values in its status output without the higher levels (LVM) having to understand how to parse the output. Added functions include: - lib/activate/dev_manager.c:dev_manager_raid_status() Pulls the status line from the kernel - libdm/libdm-deptree.c:dm_get_status_raid() Parses status line and puts components into dm_status_raid struct - lib/activate/activate.c:lv_raid_dev_health() Accesses dm_status_raid to deliver raid dev_health string The new structure and functions can provide a more unified way to access status information. ('lv_raid_percent' could switch to using these functions, for example.)	2013-02-01 11:31:47 -06:00
Petr Rockai	1e4a9534f4	lvmetad: Call _lvmetad_handle_reply in lvmetad_vg_lookup.	2013-01-16 11:19:33 +01:00
Sebastian Ott	9602e68577	filters: add scm devices Fix this: pvcreate /dev/scma Device /dev/scma not found (or ignored by filtering). Reported-by: Peter Oberparleiter <peter.oberparleiter@de.ibm.com> Signed-off-by: Sebastian Ott <sebott@linux.vnet.ibm.com>	2013-01-11 09:24:07 +01:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Alasdair G Kergon	7f747a0d73	logging: add debug classes Add log/debug_classes to lvm.conf to allow debug messages to be classified and filtered at runtime. The dm_errno field is only used by log_error(), so I've redefined it for log_debug() messages to hold the message class. By default, all existing messages appear, but we can add categories that generate high volumes of data, such as logging all traffic to/from lvmetad.	2013-01-07 22:25:19 +00:00
Alasdair G Kergon	b617109fff	lvmetad: fix format1 updates fmt1 doesn't have a separate commit function: updates take effect immediately vg_write is called, so we must update lvmetad at this point if we're going to go on and ask lvmetad for the VG metadata again before calling the commit function (though that's probably an unsupported and pointless thing to do anyway as the client must already have that data and it cannot have changed because it's locked and with devs suspended we shouldn't be communicating with lvmetad; so when that's fixed properly, this fix here can be reverted). This problem showed up as an internal error when lvremoving an LVM1 snapshot. > Internal error: LV snap1 (00000000000000000000000000000001) missing from preload metadata https://bugzilla.redhat.com/891855	2013-01-05 03:17:35 +00:00
Alasdair G Kergon	48e1ae7f6a	lvmetad: add basic client-side debug logging First attempt at showing precisely what use any command is making of lvmetad in the -vvvv trace information.	2013-01-05 00:35:50 +00:00
Alasdair G Kergon	41e7f45258	lvmetad: rename device vars and move _token_update Move _token_update() to avoid the need for _lvmetad_send prototype. Use 'dev' consistently for a struct device * variable. Use 'devno' for a dev_t.	2013-01-04 23:45:22 +00:00
Alasdair G Kergon	6d760b2c63	lvmetad: improve client logging when connecting Rename lvmetad_warning() to lvmetad_connect_or_warn(). Log all connection attempts on the client side, whether successful or not. Reduce some nesting and remove a redundant assertion.	2013-01-04 23:22:30 +00:00
Jonathan Brassow	970dfbcd69	RAID: Limit replacement of devices when array is not in-sync. If a RAID array is not in-sync, replacing devices should not be allowed as a general rule. This is because the contents used to populate the incoming device may be undefined because the devices being read where not in-sync. The kernel enforces this rule unless overridden by not allowing the creation of an array that is not in-sync and includes a devices that needs to be rebuilt. Since we cannot know the sync state of an LV if it is inactive, we must also enforce the rule that an array must be active to replace devices. That leaves us with the following conditions: 1) never allow replacement or repair of devices if the LV is in-active 2) never allow replacement if the LV is not in-sync 3) allow repair if the LV is not in-sync, but warn that contents may not be recoverable. In the case where a user is performing the repair on the command line via 'lvconvert --repair', the warning is printed before the user is prompted if they would like to replace the device(s). If the repair is automated (i.e. via dmeventd and policy is "allocate"), then the device is replaced if possible and the warning is printed.	2012-12-18 14:40:42 -06:00
Zdenek Kabelac	401c9aba4a	pv_read: add missing check for valid info If the lvmcache_info_from_pvid() fails to find valid info, invoke the lookup by dev, and only in this case call lvmcache_info_from_pvid() again. Also check for the result of info and return error directly, so the NULL is not passed to lvmcache_get_label().	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	e012d0635d	lvmetad: check id_read_format error status Detect error from id_read_format() function.	2012-12-15 17:23:27 +01:00
Zdenek Kabelac	ff5612c0c3	format-text: check for _text_create_text_instance Test if 'fid' creation failed and report stack trace, break the loop and do not pass NULL fid further.	2012-12-15 17:23:23 +01:00
Zdenek Kabelac	740ab81d03	log: move abort past syslog When the abort_on_internal_errors is enabled, we aborted prior the syslog logging output. Since such fatal error gets level _LOG_FATAL it should not be blocked by debug_level() check so lets move it further, to get abort error logged also via syslog.	2012-12-15 17:22:48 +01:00
Zdenek Kabelac	575c4ed964	cleanup: use proper const in apply_lvname_restrictions Better constness used for reserved prefixes and strings. Also simplify a bit validate_name and use direct char checks isntead of 2 strcmp() calls.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	21f6511bc2	cleanup: reorder code Swap if() test condition and check for failure and use traditional 'stack' trace.	2012-12-15 14:57:40 +01:00
Zdenek Kabelac	8ab4334505	cleanup: ignore return values These dm_snprintfs should not fail, since enough space is reserved. So return intentionaly ignored.	2012-12-15 14:57:40 +01:00
Petr Rockai	f14f2d4378	lvmetad: Fix autoactivation for MDA-less PVs. Calling pvscan --cache with -aay on a PV without an MDA would spuriously fail with an internal error, because of an incorrect assumption that a parsed VG structure was always available. This is not true and the autoactivation handler needs to call vg_read to obtain metadata in cases where the PV had no MDAs to parse. Therefore, we pass vgid into the handler instead of the (possibly NULL) VG coming from the PV's MDA.	2012-12-12 13:19:04 +01:00
Marian Csontos	ff5c1c576c	lvmetad: use dm_config_destroy to free pvmeta Release pvmeta handler with proper dm_config_destroy() function. TODO: Fix primary fault for this internal error. Signed-off-by: mcsontos@redhat.com	2012-12-11 11:55:12 +01:00
Zdenek Kabelac	17be6d5210	thin: fix test for dicards ignore settings Arghh, this was bad last-minute shortening of if() expression in the commit `1ef9831018`. dm_tree_node_set_thin_pool_discard() must not run in the same expression as check for non-power-2 discard, otherwise there are 2 calls for dm_tree_node_set_thin_pool_discard and whole setting of discards is missinterpretted. In-relase fix it by using proper parentheses {}.	2012-12-11 11:26:19 +01:00
Zdenek Kabelac	ec49f07b0d	mirrors: fix leak in device_is_usable mirror check Function _ignore_blocked_mirror_devices was not release allocated strings images_health and log_health. In error paths it was also not releasing dm_task structure. Swaped return code of _ignore_blocked_mirror_devices and use 1 as success. In _parse_mirror_status use log_error if memory allocation fails and few more errors so they are no going unnoticed as debug messages. On error path always clear return values and free strings. For dev_create_file use cache mem pool to avoid memleak.	2012-12-11 11:15:22 +01:00
Peter Rajnoha	35a4d70aad	activation: don't miss the log on empty {auto_activation\|read_only\|}_volume_list Addendum to previous commit...	2012-12-04 14:12:36 +01:00
Peter Rajnoha	e2be2652ad	Allow empty activation/{auto_activation\|read_only\|}_volume_list config option. In case we don't want to activate, autoactivate or have the VG/LV read-only. Primarily targeted for the auto_activation_volume_list, but it makes no harm for other settings (the part of the code that reads these three settings is shared, but there's no reason to separate it only for this change).	2012-12-04 10:33:54 +01:00
Zdenek Kabelac	a530c70b21	thin: update thin feature detection Safe 1 static var and keep whole detection within one function.	2012-12-03 13:03:41 +01:00
Zdenek Kabelac	5ec20e267f	thin: reworked thin feature detection Rework thin feature detection to support runtime section to allow to disable them selectively. New lvm.conf option is born: global/thin_disabled_features	2012-12-03 11:57:40 +01:00

1 2 3 4 5 ...

3090 Commits