shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Jonathan Earl Brassow	53670b18f5	Fix for bug 596453: multiple mirror image failures cause lvm repair... The lvm repair issues I believe are the superficial symptoms of this bug - there are worse issues that are not as clearly seen. From my inline comments: * If the mirror was successfully recovered, we want to always * force every machine to write to all devices - otherwise, * corruption will occur. Here's how: * Node1 suffers a failure and marks a region out-of-sync * Node2 attempts a write, gets by is_remote_recovering, * and queries the sync status of the region - finding * it out-of-sync. * Node2 thinks the write should be a nosync write, but it * hasn't suffered the drive failure that Node1 has yet. * It then issues a generic_make_request directly to * the primary image only - which is exactly the device * that has suffered the failure. * Node2 suffers a lost write - which completely bypasses the * mirror layer because it had gone through generic_m_r. * The file system will likely explode at this point due to * I/O errors. If it wasn't the primary that failed, it is * easily possible in this case to issue writes to just one * of the remaining images - also leaving the mirror inconsistent. * * We let in_sync() return 1 in a cluster regardless of what is * in the bitmap once recovery has successfully completed on a * mirror. This ensures the mirroring code will continue to * attempt to write to all mirror images. The worst that can * happen for reads is that additional read attempts may be * taken.	2010-08-17 23:56:23 +00:00
Alasdair Kergon	c496bc13e2	Attempt to fix buildbot failure in t-lvconvert-mirror.sh due to failing to wait for mirror to get into sync before running subsequent command.	2010-08-17 22:01:41 +00:00
Alasdair Kergon	22149572e8	Use 'SINGLENODE' instead of 'dead' in clvmd singlenode messages. Ignore snapshots when performing mirror recovery beneath an origin. Pass LCK_ORIGIN_ONLY flag around cluster. Add suspend_lv_origin and resume_lv_origin using LCK_ORIGIN_ONLY.	2010-08-17 19:25:05 +00:00
Alasdair Kergon	2d6fcbf67d	Allow internal suspend and resume of origin without its snapshots.	2010-08-17 16:25:32 +00:00
Alasdair Kergon	85ed403002	Fix dev_manager_transient to access -real device not snapshot-origin. (brassow) Another reminder why cloning functions impedes maintenance.	2010-08-17 01:51:12 +00:00
Alasdair Kergon	f92b4f9482	Monitor origin -real device below snapshot instead of overlay device. (brassow)	2010-08-17 01:16:41 +00:00
Alasdair Kergon	85a80e0505	Don't really change monitoring status when in test mode.	2010-08-16 23:29:09 +00:00
Mike Snitzer	ed878c8a9a	Add some v1 to v2 metadata upgrade testing.	2010-08-16 23:21:20 +00:00
Alasdair Kergon	d1e8046f56	Various small cleanups and fixes related to monitoring.	2010-08-16 22:54:35 +00:00
Alasdair Kergon	470c18847d	Remove superfluous NULL pointer tests before dm_free from dmeventd.	2010-08-16 18:19:46 +00:00
Jonathan Earl Brassow	d0191bf9f4	Fix for bug 612291: dm devices of split off mirror images are not removed DM devices were not handled properly on nodes in a cluster that were not where the splitmirrors command was issued. This was happening because suspend_lv/resume_lv were being used in a place where activate_lv should have been used. When the suspend/resume are issued on (effectively) new LVs, their 'resource' (UUID) is not located in the lv_hash. Thus, both operations turn into no-ops. You can see this from the output of clvmd from one of the remote nodes: <snip> do_suspend_lv, lock not already held <snip> do_resume_lv, lock not already held 'activate_lv' enjoins the other nodes in the cluster to process the lock and activate the new LV. clvmd output from remote node as follows: do_lock_lv: resource 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S', cmd = 0x19 LCK_LV_ACTIVATE (READ\|LV\|NONBLOCK), flags = 0x84 (DMEVENTD_MONITOR ), memlock = 1 sync_lock: 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S' mode:1 flags=1 sync_lock: returning lkid 27b0001 Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-08-16 18:02:14 +00:00
Alasdair Kergon	1fad9a68ae	set DEFAULT_RUN_DIR (missed from earlier checkin?)	2010-08-16 17:49:26 +00:00
Peter Rajnoha	f069a14ca7	WHATS_NEW_DM	2010-08-16 11:22:08 +00:00
Peter Rajnoha	2d7614322e	dm-mod autoloading support is in kernel 2.6.36 actually.	2010-08-16 11:13:18 +00:00
Peter Rajnoha	a6a66e7c83	Fix udev rules to support udev database content generated by older rules. This can happen with older rules (without support for synthesized events) that are still part of initrd while using new udev rules in the system itself. The consequence was that new udev rules incorrectly assumed that not having DM_UDEV_PRIMARY_SOURCE_FLAG set always means the uevent is synthesized and inappropriate (device is still not properly activated) and so it should be ignored. However, initrd is not updated automatically while updating the libdevmapper/udev rules in the system and so we end up with the rules not detecting and setting crucial parts in the initrd environment and the rules in the system that rely on the information that should have been stored in udev db (which is incorrect in this configuration, of course). The overall consequence is that the update of libdevmapper/lvm2 without regenerating the initrd could end up with a boot failure! Ignoring the event means removing any existing symlinks in /dev! To fix this, increase udev rules version to make a difference. So from now on, mark rules without proper support for synthesized events as DM_UDEV_RULES_VSN="1" and 2 (or higher) if that support is included.	2010-08-12 13:41:18 +00:00
Peter Rajnoha	b045f4fe9a	Reinstate detection of inappropriate uevent with DISK_RO set and suppress it. We still need to detect this one! We're not so strict with CHANGE events as with the ADD events while applying filters in the rules so this one would pass and it would process the rules prematurely (because it appears before the actual CHANGE event used when resuming a DM device while setting read-only state at the same time).	2010-08-12 13:07:08 +00:00
Fabio M. Di Nitto	2565ffad25	Fix clvmd init script return code when executed as non-root user. clvmd daemon itself does the right thing when invoked as non-root, by returning 4. The patch removes the use daemon function from /etc/rc.d/init.d/functions that´s unnecessary and has th bad habit to mask the return codes from the real daemon. Add a simple and generic check to see if clvmd is executed by root or not. Our stop/reload/restart paths in the init script are complex and not all the tools involved in the process are guaranteed to return 4 if executed by non-root against a process that´s running as root (for example kill -TERM will return -1 and parsing the output to catch the error is suboptimal at best). https://bugzilla.redhat.com/show_bug.cgi?id=553381	2010-08-12 09:14:59 +00:00
Mike Snitzer	9ac9e06b5c	fix t-pvcreate-operation-md.sh to require kernel.org Linux >= 2.6.33 for the final alignment_offset check. In the future, might look to check for the RHEL6 kernel too.	2010-08-12 04:56:05 +00:00
Mike Snitzer	b123a82d73	Change default alignment of pe_start to 1MB. The new standard in the storage industry is to default alignment of data areas to 1MB. fdisk, parted, and mdadm have all been updated to this default. Update LVM to align the PV's data area start (pe_start) to 1MB. This provides a more useful default than the previous default of 64K (which generally ended up being a 192K pe_start once the first metadata area was created). Before this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 188.00k 192.00k After this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 1020.00k 1.00m The heuristic for setting the default alignment for LVM data areas is: - If the default value (1MB) is a multiple of the detected alignment then just use the default. - Otherwise, use the detected value. In practice this means we'll almost always use 1MB -- that is unless: - the alignment was explicitly specified with --dataalignment - or MD's full stripe width, or the {minimum,optimal}_io_size exceeds 1MB - or the specified/detected value is not a power-of-2	2010-08-12 04:11:48 +00:00
Mike Snitzer	dff224669d	Require --restorefile when using pvcreate --uuid. Introduce --norestorefile to allow user to override the new requirement. This can also be overridden with "devices/require_restorefile_with_uuid" in lvm.conf -- however the default is 1. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-08-12 04:08:59 +00:00
Peter Rajnoha	dc39d2a66e	WHATS_NEW_DM	2010-08-11 13:12:31 +00:00
Peter Rajnoha	626242c1bd	Recognise and give preference to md device partitions (blkext major). We can already detect MD devices internally. But when using MD partitions, these have "block extended major" (blkext) assigned (259). Blkext major is also used in general, so we need to check whether the original device is an MD device actually.	2010-08-11 12:14:23 +00:00
Petr Rockai	f21beaf20c	Update WHATS_NEW.	2010-08-09 14:06:03 +00:00
Petr Rockai	f3ad0dcfde	Never scan internal LVM devices.	2010-08-09 14:05:16 +00:00
Joe Thornber	0ae41c4412	[REGEX] fix a long standing off-by-one error (found by valgrind-pool)	2010-08-09 10:58:27 +00:00
Joe Thornber	52e1564fdd	[MM] Make valgrind aware of the pool allocators ./configure with --enable-valgrind-pool to enable this.	2010-08-09 10:56:01 +00:00
Joe Thornber	fae2c49259	[REGEX] fix bug in matcher that was causing segfault with chars of 0x80 and over.	2010-08-09 10:30:52 +00:00
Joe Thornber	7def56f1e4	[REGEX] Parse regexes that contain chars with value > 0x80 This is a long standing issue. Fixed by casting a char value to unsigned char before using it as an index into a bitset.	2010-08-09 10:29:42 +00:00
Joe Thornber	95aa395ddd	[REGEX] add a unit test for regexes containing chars with value over x80	2010-08-09 10:27:31 +00:00
Joe Thornber	45035ffa1a	[REGEX] matcher_t unit test now takes a flag to turn on fingerprinting	2010-08-09 10:23:54 +00:00
Jonathan Earl Brassow	8d2d4f1fa0	Fix for bug 619221 - log device splitting regression An incorrect fix on July 13, 2010 for an annoyance has caused a regression. The offending check-in was part of the 2.02.71 release of LVM. That check-in caused any PVs specified on the command line to be ignored when performing a mirror split. This patch reverses the aforementioned check-in (solving the regressions) and posits a new solution to the list reversal problem. The original problem was that we would always take the lowest mimage LVs from a mirror when performing a split, but what we really want is to take the highest mimage LVs. This patch accomplishes that by working through the list in reverse order - choosing the higher numbered mimages first. (This also reduces the amount of processing necessary.) Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Takahiro Yasui <takahiro.yasui@hds.com>	2010-08-06 15:38:32 +00:00
Jonathan Earl Brassow	498747d792	A misunderstanding of the return value of 'dm_bit' has been causing a data corruption bug in cmirror. 'dm_bit' is only ever used as a boolean operation within LVM, but it can return a range of values. If the bit is set, a power of 2 is returned. If the bit is unset, 0 is returned. 'log_test_bit' (a function in the cluster mirror log daemon code) has switched to using the dm bit operations in rhel6. There are two places in the daemon code where 'log_test_bit' is not used merely as a boolean, but rather the return value is used as the return value for the log functions 'is_clean' and 'in_sync' - having assumed that 'dm_bit' was returning 0 or 1 only. One place the 'in_sync' function is utilized is in 'dm_rh_get_state' - a function that informs the mirroring code how to treat I/O and which devices to read/write from. 'dm_rh_get_state' was checking if the return value of 'in_sync' was 1 to determine if the region was DM_RH_CLEAN. Since 'dm_bit' (and by extension 'log_test_bit' and 'in_sync') was returning powers of 2, DM_RH_CLEAN was rarely being reported as it should have been. Thinking the region was out-of-sync, the mirroring code would write only to the primary device. When the primary device was failed, all of those writes were lost - leaving the entire mirror corrupted.	2010-08-04 18:18:18 +00:00
Petr Rockai	851aaf4ecc	Reduce severity of the "mirror transient status" log message (this was never intended to be a log_error).	2010-08-04 15:55:03 +00:00
Mike Snitzer	57d31c99cc	Require logical volume(s) to be explicitly named for lvconvert --merge.	2010-08-03 20:22:31 +00:00
Mike Snitzer	14a9722185	Avoid changing aligned pe_start as a side-effect of very verbose logging.	2010-08-03 18:19:42 +00:00
Peter Rajnoha	97df4e4675	Use built-in rules for device aliases: block/ < dm- < disk/ < mapper/ < other.	2010-08-03 13:39:27 +00:00
Zdenek Kabelac	59f8043d7c	Fix 'void' arithmetic warnings in dbg_malloc.c. Use more readable char[idx] access instead of char+idx access.	2010-08-03 13:24:07 +00:00
Zdenek Kabelac	30168792f4	Fix 'void*' arithmetic warning in some functions from libdm-iface.c.	2010-08-03 13:16:21 +00:00
Zdenek Kabelac	3eadbbeb12	Fix const warning in dev_manager_info() and _dev_manager_lv_rmnodes().	2010-08-03 13:13:01 +00:00
Zdenek Kabelac	c10f7fd039	Fix constness warning in archive_file structure from archive.c.	2010-08-03 13:09:21 +00:00
Zdenek Kabelac	9f926fd060	Use void parameter for function definition.	2010-08-03 13:06:35 +00:00
Zdenek Kabelac	0a89c2307e	Wait for node creation before displaying debug info in dmsetup. Readahead check needs to see created node - so wait till udev gets in sync.	2010-08-03 13:04:32 +00:00
Zdenek Kabelac	56081fe97b	Clean generated files .exported_symbols_generated, example.conf for distclean.	2010-08-03 13:00:45 +00:00
Zdenek Kabelac	c8675c4933	Fix return status 0 for "dmsetup info -c -o help". Solution returns success for _report_init when help is passed, and caller needs to check for _report existance.	2010-08-03 12:56:00 +00:00
Peter Rajnoha	d87427446c	Add check for kernel semaphore support and disable udev_sync if not available. udev_sync feature requires semaphores (part of System V IPC) to be configured in kernel (CONFIG_SYSVIPC). Check whether it is supported and if not, give a warning message and disable udev synchronisation code automatically to avoid any further error states and associated problems. One should use the kernel with System V IPC support enabled or libdevmapper with udev_sync feature disabled.	2010-08-03 07:56:03 +00:00
Jonathan Earl Brassow	cbd41292a4	Taka's fix for handling failure of all mirrored log devices and all but one mirror leg. <patch header> To handle a double failure of a mirrored log, Jon's two patches are commited, however, lvconvert command can't still handle an error when mirror leg and mirrored log got failure at the same time. [Patch]: Handle both devices of a mirrored log failing (bug 607347) posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00009.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00027.html [Patch]: Handle both devices of a mirrored log failing (bug 607347) - additional fix posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00093.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00101.html In the second patch, the target type of mirrored log is replaced with error target when remove_log is set to 1, but this procedure should be also used in other cases such as the number of mirror leg is 1. This patch relocates the procedure to the main path. In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). - With Jon's second patch, we think that we don't have to call remove_mirror_log() in _lv_update_mirrored_log() because will be handled remove_mirror_images() in _lvconvert_mirrors_repaire(). </patch header> Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Reviewed-by: Petr Rockai <prockai@redhat.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-08-02 21:07:40 +00:00
Jonathan Earl Brassow	efaaf3146d	Disallow mirrored logs in cluster mirrors. The cluster log daemon (cmirrord) is not multi-threaded and can handle only one request at a time. When a log is stacked on top of a mirror (which itself contains a 'core' log), it creates a situation that cannot be solved without threading. When the top level mirror issues a "resume", the log daemon attempts to read from the log device to retrieve the log state. However, the log is a mirror which, before issuing the read, attempts to determine the 'sync' status of the region of the mirror which is to be read. This sync status request cannot be completed by the daemon because it is blocked on a read I/O to the very mirror requesting the sync status.	2010-08-02 19:03:45 +00:00
Alasdair Kergon	0dfae7e47a	Fix lib.device-mapper to wait for include too	2010-08-02 13:56:34 +00:00
Zdenek Kabelac	1342414a8d	Minor speedup of lock test - Instead of waiting for flock process finish kill the flock process.	2010-08-02 13:23:01 +00:00
Zdenek Kabelac	3015a9ab9c	Add shell function to trim spaces. Test values as "$val" to avoid weird results when spaces are in output.	2010-08-02 13:20:50 +00:00

1 2 3 4 5 ...

4923 Commits