The current lvm1 allocation code does not seem to properly
map segments on missing PVs.
For now, disable this functionality.
(It never worked and the previous commit just introduced a segfault here.)
So partial mode in lvm1 can only process missing PVs
that have no LV segments.
Also, do not use a random PV UUID for the missing part but a fixed
string derived from the VG UUID (to avoid confusing clvmd tests).
Read the complete content of /proc/self/maps into one buffer, without
reallocation in the middle of reading and before doing any mlock/munlock
operation on these lines - as some of them get changed.
With the previous implementation we read some mappings twice ([stack]).
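A minimal self-contained sketch of the approach (illustrative, not the actual
LVM memlock code): if the buffer fills up, it is enlarged and the file is
re-read from offset 0, so no reallocation happens in the middle of a read pass.
  #include <fcntl.h>
  #include <stdlib.h>
  #include <unistd.h>

  /* Return a malloc'd buffer holding all of /proc/self/maps, or NULL. */
  static char *read_self_maps(size_t *len)
  {
      size_t cap = 65536, used = 0;
      char *buf = NULL, *tmp;
      ssize_t n;
      int fd = open("/proc/self/maps", O_RDONLY);

      if (fd < 0 || !(buf = malloc(cap)))
          goto out;

      while ((n = read(fd, buf + used, cap - used)) != 0) {
          if (n < 0)
              goto fail;
          used += (size_t) n;
          if (used == cap) {
              /* Buffer full: enlarge and restart the whole read
               * instead of reallocating mid-read. */
              if (!(tmp = realloc(buf, cap *= 2)))
                  goto fail;
              buf = tmp;
              used = 0;
              if (lseek(fd, 0, SEEK_SET) < 0)
                  goto fail;
          }
      }
      *len = used;
      goto out;
  fail:
      free(buf);
      buf = NULL;
  out:
      if (fd >= 0)
          close(fd);
      return buf;
  }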
If an lvm1 device is missing, lvm fails on all operations:
# vgcfgbackup -f bck -P vg_test
Partial mode. Incomplete volume groups will be activated read-only.
3 PV(s) found for VG vg_test: expected 4
PV segment VG free_count mismatch: 152599 != 228909
PV segment VG extent_count mismatch: 152600 != 228910
Internal error: PV segments corrupted in vg_test.
Volume group "vg_test" not found
Allow loading of an lvm1 partial VG by allocating a "new" missing PV
which covers the lost space. This fake missing PV also informs the code
that it is a partial VG.
https://bugzilla.redhat.com/show_bug.cgi?id=501390
Revert to old glibc behaviour for vsnprintf used in emit_to_buffer fn.
Otherwise, the check that follows would be wrong for new glibc versions.
This caused rh bug #633033 to go undetected and pass through the check,
corrupting the metadata!
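A minimal sketch of the kind of check involved, assuming an emit_to_buffer-style
helper (names and exact checks here are illustrative, not the actual LVM code):
  #include <stdarg.h>
  #include <stdio.h>

  /* Append formatted text; return 0 on truncation/error, 1 on success. */
  static int emit(char **buf, size_t *size, const char *fmt, ...)
  {
      int n;
      va_list ap;

      va_start(ap, fmt);
      n = vsnprintf(*buf, *size, fmt, ap);
      va_end(ap);

      /* Old glibc returned -1 on truncation; C99/new glibc return the
       * length the full string would have had.  Normalise both cases
       * so the caller never advances past the buffer. */
      if (n < 0 || (size_t) n + 1 > *size)
          return 0;

      *buf += n;
      *size -= n;
      return 1;
  }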
In certain configurations, we're not under a VG rw lock while trying to write
a new archive file with VG metadata. A common example is using "vgs" while
having the content of backup and archive directories empty. The code scans the
content of these directories and tries to determine the final index that should
be used in the archive name. Since we're not under a lock, we can get into a race
while choosing the index, which could end up showing errors about not being able
to rename to the final archive name. Let's add a random number suffix to these
archive file names so we can avoid the race.
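For illustration, the final name could be built roughly like this (the format
string and helper name are assumptions, not the exact LVM code):
  #include <stdio.h>
  #include <stdlib.h>

  /* Hypothetical sketch: append a random suffix so two unlocked commands
   * scanning the same archive_dir cannot pick the same file name. */
  static int make_archive_name(char *path, size_t len, const char *dir,
                               const char *vgname, unsigned ix)
  {
      return snprintf(path, len, "%s/%s_%05u-%06u.vg",
                      dir, vgname, ix, (unsigned) rand() % 1000000) < (int) len;
  }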
For example, when using a '--config "backup { ... }"' line, the values from
lvm.conf (or the default values) should be overridden. This patch adds
reinitialisation of archive and backup handling on toolcontext refresh,
which makes these settings take effect.
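For example (an illustrative invocation; archive_dir is a standard lvm.conf
backup-section key):
  # lvcreate -n lv1 -L 1G --config 'backup { archive_dir = "/tmp/lvm-archive" }' vg_test
With the refresh in place, the metadata archive written by this command goes to
the overridden directory rather than the one read from lvm.conf at startup.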
can be opprobriously slow if created with '--nosync'.
One of the ways cluster mirrors coordinate I/O and recovery
among the different machines is by the use of the log
function 'is_remote_recovering()' which lets nodes know if
a region they wish to perform a write on is currently being
recovered on another node. If the region is being recovered,
the I/O is delayed.
The 'is_remote_recovering' routine has been optimized to
avoid the deluge of requests that would be issued to the
userspace log server by maintaining a marker of how far
the recovery has gotten. It can then immediately return
'not recovering' if the region being inquired about is
less than this mark. Additionally, if the region of
concern is greater than the mark, the function will
limit the number of transmissions to userspace by assuming
the region /is/ being recovered when skipping the
transmission. This limits the amount of processing
and updates the mark in 1/4 sec time steps.
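In rough terms (a hypothetical sketch with made-up helper names, not the actual
cmirror code), the fast path described above looks like:
  #include <stdint.h>

  extern int ask_userspace_log(uint64_t region);  /* hypothetical: slow-path round trip */
  extern int throttle_allows_query(void);         /* hypothetical: ~1/4s rate limiter */

  static int is_remote_recovering(uint64_t region, uint64_t sync_mark)
  {
      if (region < sync_mark)
          return 0;        /* below the mark: already recovered */
      if (!throttle_allows_query())
          return 1;        /* assume still recovering; delay the write */
      return ask_userspace_log(region);   /* may also advance the mark */
  }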
This patch fixes a problem where 'the mark' is not being
updated because of faulty logic in the userspace log
daemon. When '--nosync' is used to create a cluster
mirror, the userspace log daemon never has a chance
to update the mark in the normal way. The fix is to set
the mark to "complete" if the mirror was created with
the --nosync flag.
may never complete.
If you convert from a linear to a mirror and then convert that
mirror back to linear /while/ the previous (up)convert is
taking place, the mirror polling process will never complete.
This is because the function that polls the mirror for
completion doesn't check if it is still polling a mirror and
the copy_percent that it gets back from the linear device is
certainly never 100%.
The fix is simply to check if the daemon is still looking at
a mirror device - if not, return PROGRESS_CHECK_FAILED.
The user sees the following output from the first (up)convert
if someone else sneaks in and does a down-convert shortly
after their convert:
[root@bp-01 ~]# lvconvert -m1 vg/lv
vg/lv: Converted: 43.4%
ABORTING: Mirror percentage check failed.
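The check is conceptually a one-liner in the polling path (a sketch assuming
LVM's internal lv->status/MIRRORED flag, not the verbatim patch):
  /* In the mirror progress-polling function: */
  if (!(lv->status & MIRRORED))           /* someone down-converted the LV */
      return PROGRESS_CHECK_FAILED;       /* stop polling instead of spinning */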
to block when a mirror under a snapshot suffers a failure.
The problem has to do with label scanning. When a mirror suffers
a failure, the kernel blocks I/O to prevent corruption. When
LVM attempts to repair the mirror, it scans the devices on the
system for LVM labels. While mirrors are skipped during this
scanning process, snapshot-origins are not. When the origin is
scanned, it kicks up I/O to the mirror (which is blocked)
underneath - causing the label scan (and thus the repair operation)
to hang.
This patch simply bypasses snapshot-origin devices when doing
label scans (while ignore_suspended_devices() is set). This
fixes the issue.
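Conceptually the label-scan filter gains a check like this (a sketch;
dev_is_snapshot_origin is a made-up helper name):
  if (ignore_suspended_devices() && dev_is_snapshot_origin(dev))
      return 0;   /* skip: reading the origin issues I/O to the blocked mirror */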
introduced in commit b16b4d92a7
"Improve various log messages."
fixes a lot of
../include/metadata.h:148: warning: type qualifiers ignored on function return type
set appropriate Required-Start and Required-Stop at configure time.
Reorder the checks for user selected cluster managers to match auto
detected ones, to be consistent in the output.
Add a special case for qdiskd that's started after cman/lock_gulmd for
RHEL-4/RHEL-5.
If pvmove crashed and the metadata contains a pvmove LV
but without mirrored segments, pvmove --abort
will not repair the situation (and finishes with success!).
Fix it by allowing a metadata update when aborting
(thus removing the pvmove LV) even if no moved LVs are detected.
(Tested on real metadata provided by an lvm user :-)
Add "devices/default_data_alignment" to lvm.conf to control the internal
default that LVM2 uses: 0==64k, 1==1MB, 2==2MB, etc.
If --dataalignment (or lvm.conf's "devices/data_alignment") is specified
then it is always used to align the start of the data area. This means
the md_chunk_alignment and data_alignment_detection are disabled if set.
(Same now applies to pvcreate --dataalignmentoffset, the specified value
will be used instead of the result from data_alignment_offset_detection)
set_pe_align() still looks to use the determined default alignment
(based on lvm.conf's default_data_alignment) if the default is a
multiple of the MD or topology detected values.
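For illustration, the internal default could be kept at the old 64k by setting
(lvm.conf snippet, using the values described above):
  devices {
      # 0 selects the old 64k default; 1 = 1MB, 2 = 2MB, ...
      default_data_alignment = 0
  }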
In all top-level vg read functions, only LCK_VG_READ/WRITE can be used.
All other vg lock definitions are low-level backend machinery.
Moreover, LCK_WRITE cannot be tested through a bitmask.
This patch fixes these mistakes.
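For context, the lock type is a small enumeration packed into the low bits
rather than independent flag bits (values as recalled from
lib/locking/locking.h; treat them as an assumption), so it has to be compared
after masking:
  /* Wrong: LCK_WRITE (0x4) is not a standalone bit - this test also
   * matches LCK_EXCL (0x5) and LCK_UNLOCK (0x6): */
  if (lock_flags & LCK_WRITE)
          ...
  /* Right: compare the masked lock type: */
  if ((lock_flags & LCK_TYPE_MASK) == LCK_WRITE)
          ...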
For _recover_vg() we do not need lock_flags; it can only be one of the
two above and we always upgrade to an LCK_VG_WRITE lock there.
(N.B. that code is racy.)
There is no functional change in the code (despite the wrong masking
it produces correct bits :-)
One shiny day we should use libblkid here. But right now, using LUKS
together with LVM is very common and pvcreate destroys LUKS completely.
So, for the user's convenience, try to detect a LUKS signature and allow
the user to abort.
This is not only undocumented but is also in violation of the --help
documentation.
Using --yes without --force is useful in pvcreate when it detects an
old signature.
pvcreate detects MD and swap signatures.
The logic hidden there is not only undocumented but it is also
user unfriendly. Whoever invented this logic should run pvcreate
on their own critical MD device to see why ;-)
This patch
- creates one function instead of duplicating code
- asks whether the user wants to overwrite the signature
- allows aborting (!)
(Please note that writing an LVM signature without wiping the old one
is wrong: it confuses blkid, MD will not work anyway, and
swap and LUKS are broken too.)
The lvm repair issues I believe are the superficial symptoms of this
bug - there are worse issues that are not as clearly seen. From my
inline comments:
* If the mirror was successfully recovered, we want to always
* force every machine to write to all devices - otherwise,
* corruption will occur. Here's how:
* Node1 suffers a failure and marks a region out-of-sync
* Node2 attempts a write, gets by is_remote_recovering,
* and queries the sync status of the region - finding
* it out-of-sync.
* Node2 thinks the write should be a nosync write, but it
* hasn't suffered the drive failure that Node1 has yet.
* It then issues a generic_make_request directly to
* the primary image only - which is exactly the device
* that has suffered the failure.
* Node2 suffers a lost write - which completely bypasses the
* mirror layer because it had gone through generic_m_r.
* The file system will likely explode at this point due to
* I/O errors. If it wasn't the primary that failed, it is
* easily possible in this case to issue writes to just one
* of the remaining images - also leaving the mirror inconsistent.
*
* We let in_sync() return 1 in a cluster regardless of what is
* in the bitmap once recovery has successfully completed on a
* mirror. This ensures the mirroring code will continue to
* attempt to write to all mirror images. The worst that can
* happen for reads is that additional read attempts may be
* taken.
Ignore snapshots when performing mirror recovery beneath an origin.
Pass LCK_ORIGIN_ONLY flag around cluster.
Add suspend_lv_origin and resume_lv_origin using LCK_ORIGIN_ONLY.
DM devices were not handled properly on nodes in a cluster that were not
where the splitmirrors command was issued. This was happening because
suspend_lv/resume_lv were being used in a place where activate_lv should
have been used.
When the suspend/resume are issued on (effectively) new LVs, their
'resource' (UUID) is not located in the lv_hash. Thus, both operations
turn into no-ops. You can see this from the output of clvmd from one
of the remote nodes:
<snip>
do_suspend_lv, lock not already held
<snip>
do_resume_lv, lock not already held
'activate_lv' enjoins the other nodes in the cluster to process the lock
and activate the new LV. clvmd output from remote node as follows:
do_lock_lv: resource 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S', cmd = 0x19 LCK_LV_ACTIVATE (READ|LV|NONBLOCK), flags = 0x84 (DMEVENTD_MONITOR ), memlock = 1
sync_lock: 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S' mode:1 flags=1
sync_lock: returning lkid 27b0001
Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>
Reviewed-by: Petr Rockai <prockai@redhat.com>
The clvmd daemon itself does the right thing when invoked as non-root, by
returning 4.
The patch removes the use of the daemon function from
/etc/rc.d/init.d/functions, which is unnecessary and has the bad habit of
masking the return codes from the real daemon.
Add a simple and generic check to see if clvmd is executed by root or not.
Our stop/reload/restart paths in the init script are complex and not all
the tools involved in the process are guaranteed to return 4 if executed
by non-root against a process that's running as root (for example kill
-TERM will return -1 and parsing the output to catch the error is
suboptimal at best).
https://bugzilla.redhat.com/show_bug.cgi?id=553381
The new standard in the storage industry is to default alignment of data
areas to 1MB. fdisk, parted, and mdadm have all been updated to this
default.
Update LVM to align the PV's data area start (pe_start) to 1MB. This
provides a more useful default than the previous default of 64K (which
generally ended up being a 192K pe_start once the first metadata area
was created).
Before this patch:
# pvs -o name,vg_mda_size,pe_start
PV VMdaSize 1st PE
/dev/sdd 188.00k 192.00k
After this patch:
# pvs -o name,vg_mda_size,pe_start
PV VMdaSize 1st PE
/dev/sdd 1020.00k 1.00m
The heuristic for setting the default alignment for LVM data areas is:
- If the default value (1MB) is a multiple of the detected alignment
then just use the default.
- Otherwise, use the detected value.
In practice this means we'll almost always use 1MB -- that is unless:
- the alignment was explicitly specified with --dataalignment
- or MD's full stripe width, or the {minimum,optimal}_io_size exceeds
1MB
- or the specified/detected value is not a power-of-2
Introduce --norestorefile to allow the user to override the new requirement.
This can also be overridden with "devices/require_restorefile_with_uuid"
in lvm.conf -- however the default is 1.
Signed-off-by: Mike Snitzer <snitzer@redhat.com>
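Example invocation (the UUID below is just a placeholder):
  # pvcreate --uuid "56ogEk-OzLS-cKBc-z9vJ-PP76-HZK8-42WTLt" --norestorefile /dev/sdd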
We can already detect MD devices internally. But when using MD partitions,
these have the "block extended major" (blkext, 259) assigned. The blkext
major is also used by other devices, so we need to check whether the
original device is actually an MD device.
An incorrect fix on July 13, 2010 for an annoyance has caused a regression.
The offending check-in was part of the 2.02.71 release of LVM. That
check-in caused any PVs specified on the command line to be ignored when
performing a mirror split.
This patch reverses the aforementioned check-in (solving the regressions)
and posits a new solution to the list reversal problem. The original
problem was that we would always take the lowest mimage LVs from a mirror
when performing a split, but what we really want is to take the highest
mimage LVs. This patch accomplishes that by working through the list in
reverse order - choosing the higher numbered mimages first. (This also
reduces the amount of processing necessary.)
Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>
Reviewed-by: Takahiro Yasui <takahiro.yasui@hds.com>
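The reverse walk can be done with libdevmapper's back-iteration macro; a
minimal sketch with a hypothetical item type (not the actual lvconvert code):
  #include <stdio.h>
  #include <libdevmapper.h>

  struct mimage_item {
      struct dm_list list;
      int mimage;                 /* mimage number, e.g. for lv_mimage_2 */
  };

  /* Walk from the tail so the highest-numbered mimages are chosen first. */
  static void pick_highest_first(struct dm_list *head)
  {
      struct mimage_item *mi;

      dm_list_iterate_back_items(mi, head)
          printf("considering mimage %d\n", mi->mimage);
  }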
corruption bug in cmirror. 'dm_bit' is only ever used as a boolean operation
within LVM, but it can return a range of values. If the bit is set, a power of
2 is returned. If the bit is unset, 0 is returned.
'log_test_bit' (a function in the cluster mirror log daemon code) has switched
to using the dm bit operations in rhel6. There are two places in the daemon
code where 'log_test_bit' is not used merely as a boolean, but rather the
return value is used as the return value for the log functions 'is_clean' and
'in_sync' - having assumed that 'dm_bit' was returning 0 or 1 only.
One place the 'in_sync' function is utilized is in 'dm_rh_get_state' - a
function that informs the mirroring code how to treat I/O and which devices to
read/write from. 'dm_rh_get_state' was checking if the return value of
'in_sync' was 1 to determine if the region was DM_RH_CLEAN. Since 'dm_bit'
(and by extension 'log_test_bit' and 'in_sync') was returning powers of 2,
DM_RH_CLEAN was rarely being reported as it should have been. Thinking the
region was out-of-sync, the mirroring code would write only to the primary
device. When the primary device was failed, all of those writes were lost -
leaving the entire mirror corrupted.
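A minimal sketch of the kind of fix this implies (illustrative, not the exact
daemon patch): collapse dm_bit()'s power-of-two result to a strict 0/1 before
it is used as the 'is_clean'/'in_sync' return value:
  #include <libdevmapper.h>

  /* dm_bit() returns 0 or a power of two; callers such as dm_rh_get_state()
   * compare the result against 1, so normalise it here. */
  static int log_test_bit(dm_bitset_t bits, int bit)
  {
      return dm_bit(bits, bit) ? 1 : 0;
  }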