shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Dave Wysochanski	81f0124a58	Refactor metadata.[ch] into vg.[ch] for vg functions. The metadata.[ch] files are very large. This patch makes a first attempt at separating out vg functions and data, particularly related to the reporting fields calculations.	2010-09-30 13:04:55 +00:00
Zdenek Kabelac	c631be7684	Maps fix Read complete content of /proc/self/maps into one buffer without realocation in the middle of reading and before doing any m/unlock operation with these lines - as some of them gets change. With previous implementation we've read some mappings twice ([stack])	2010-09-30 11:32:40 +00:00
Alasdair Kergon	f6b1c45bf1	Speed up unquoting of quoted double quotes and backslashes.	2010-09-28 01:29:06 +00:00
Alasdair Kergon	8a075c6123	drop an unnecessary 'stack'	2010-09-27 19:15:13 +00:00
Alasdair Kergon	46d4a6acf8	was renamed	2010-09-27 19:10:46 +00:00
Alasdair Kergon	44a31a9c2f	Speed up CRC32 calculations by using a larger lookup table. Use -DDEBUG_CRC32 to revert to old function and check new one gives same result.	2010-09-27 19:09:34 +00:00
Peter Rajnoha	bad35c6554	Add escape sequence for ':' and '@' found in device names used as PVs.	2010-09-23 12:02:33 +00:00
Alasdair Kergon	0cb07b65f3	Replace alloca with dm_malloc in _aligned_io. (This section of code dates from 2.4 and could be written more efficiently nowadays.)	2010-09-22 22:31:45 +00:00
Milan Broz	980d2d8683	Fix handling of partial VG for lvm1 format metadata If some lvm1 device is missing, lvm fails on all operations # vgcfgbackup -f bck -P vg_test Partial mode. Incomplete volume groups will be activated read-only. 3 PV(s) found for VG vg_test: expected 4 PV segment VG free_count mismatch: 152599 != 228909 PV segment VG extent_count mismatch: 152600 != 228910 Internal error: PV segments corrupted in vg_test. Volume group "vg_test" not found Allow loading of lvm1 partial VG by allocating "new" missing PV, which covers lost space. Also this fake mising PV inform code that it is partial VG. https://bugzilla.redhat.com/show_bug.cgi?id=501390	2010-09-22 13:45:21 +00:00
Alasdair Kergon	ec8a4dac46	Fix name in msg in last checkin. (The problem the last checkin addressed was a segfault in 'pvs -a' if .cache didn't contain every PV in a VG.)	2010-09-22 01:50:38 +00:00
Alasdair Kergon	a171bb6e85	Track recursive filter iteration to avoid refreshing while in use. (2.02.56)	2010-09-22 01:36:13 +00:00
Peter Rajnoha	064ed484b4	"goto_bad" should be used in alloc_printed_tags function, not "goto bad".	2010-09-21 10:42:02 +00:00
Peter Rajnoha	70431c8146	Revert to old glibc behaviour for vsnprintf used in emit_to_buffer function. Revert to old glibc behaviour for vsnprintf used in emit_to_buffer fn. Otherwise, the check that follows would be wrong for new glibc versions. This caused the rh bug #633033 to be undetected and pass throught the check, corrupting the metadata!	2010-09-20 14:25:27 +00:00
Peter Rajnoha	48ae64529a	Use dynamic allocation for metadata's tag buffer (removes 4096 char. limit).	2010-09-20 14:23:20 +00:00
Dave Wysochanski	97709450ca	Update vg_mda_free 'get' function to multiply by SECTOR_SIZE.	2010-09-09 19:38:03 +00:00
Peter Rajnoha	d20ce59b80	Add random suffix to archive file names to prevent races when being created. In certain configurations, we're not under a VG rw lock while trying to write a new archive file with VG metadata. A common example is using "vgs" while having the content of backup and archive directories empty. The code scans the content of these directories and tries to determine the final index that should be used in archive name. Since we're not under a lock, we can get into a race while choosing the index which could end up showing errors about not being able to rename to final archive name. Let's add random number suffix to these archive file names so we can avoid the race.	2010-09-09 13:13:12 +00:00
Peter Rajnoha	dc8478458e	Reinitialize archive and backup handling on toolcontext refresh. For example, when using '--config "backup { ... }"' line, the values from lvm.conf (or default values) should be overridden. This patch adds reinitialisation of archive and backup handling on toolcontext refresh which makes these settings to be applied.	2010-09-09 13:07:13 +00:00
Jonathan Earl Brassow	a71d6051ed	This patch fixes a potential for I/O to hang and LVM commands to block when a mirror under a snapshot suffers a failure. The problem has to do with label scanning. When a mirror suffers a failure, the kernel blocks I/O to prevent corruption. When LVM attempts to repair the mirror, it scans the devices on the system for LVM labels. While mirrors are skipped during this scanning process, snapshot-origins are not. When the origin is scanned, it kicks up I/O to the mirror (which is blocked) underneath - causing the label scan (an thus the repair operation) to hang. This patch simply bypasses snapshot-origin devices when doing labels scans (while ignore_suspended_devices() is set). This fixes the issue.	2010-08-26 14:21:50 +00:00
Milan Broz	fc86426b56	Fix previous const removal.	2010-08-26 12:22:05 +00:00
Milan Broz	c7af31dbd7	Fix return type qualifier to avoid compiler warning. introduced in commit `b16b4d92a7` "Improve various log messages." fixes a lot of ../include/metadata.h:148: warning: type qualifiers ignored on function return type	2010-08-26 12:08:19 +00:00
Alasdair Kergon	4e19541b8d	autoreconf also updates configure.h.in	2010-08-21 00:16:37 +00:00
Mike Snitzer	7063efe1bd	Switch to using configure --with-default-data-alignment=<NUM> to establish DEFAULT_DATA_ALIGNMENT. Again, 0=64KiB, 1=1MiB, 2=2MiB Default is 1.	2010-08-20 22:24:58 +00:00
Mike Snitzer	4efb1d9cbb	Update heuristic used for default and detected data alignment. Add "devices/default_data_alignment" to lvm.conf to control the internal default that LVM2 uses: 0==64k, 1==1MB, 2==2MB, etc. If --dataalignment (or lvm.conf's "devices/data_alignment") is specified then it is always used to align the start of the data area. This means the md_chunk_alignment and data_alignment_detection are disabled if set. (Same now applies to pvcreate --dataalignmentoffset, the specified value will be used instead of the result from data_alignment_offset_detection) set_pe_align() still looks to use the determined default alignment (based on lvm.conf's default_data_alignment) if the default is a multiple of the MD or topology detected values.	2010-08-20 20:59:05 +00:00
Dave Wysochanski	614469b544	Define GET_NUM_PROPERTY_FN macro to simplify numeric property 'get' functions.	2010-08-20 13:02:39 +00:00
Dave Wysochanski	cc171eb8ee	Add implmentation for simple numeric 'get' property functions. Add 'get' functions based on the simple macro function definition for a numeric property. Add 'get' functions for the following: _vg_extent_count_get, _vg_free_count_get, _max_lv_get, _max_pv_get, _pv_count_get, _lv_count_get, _snap_count_get, _vg_seqno_get, _vg_size_get, _vg_free_get, vg_mda_*. For size functions, multiply by SECTOR_SIZE to return the value in bytes.	2010-08-20 12:45:09 +00:00
Dave Wysochanski	1af822bff0	Define GET_NUM_PROPERTY_FN macro to simplify numeric property 'get' functions.	2010-08-20 12:44:58 +00:00
Dave Wysochanski	fc65b9038e	Add properties.[ch] to lib/report, defined based on columns.h. Extend the existing reporting infrastructure definitions and structures to include a 'get' and 'set' function for each field. We will provide a 'get' and 'set' function for each of these fields, which will be utilized by exported lvm2app functions. Define a default _not_implemented 'get' and 'set' function that just sets an errno and returns 0. Future patches will actually implement the specific 'get' and 'set' functions for each property. For read-only properties, only the 'get' function will be implemented. Define vg_get_property() function to query a property. We will call this from a lvm2app function.	2010-08-20 12:44:47 +00:00
Dave Wysochanski	7bdc15c8bb	Remove explicit double quotes from columns.h 'id' entries. The 'id' entries in columns.h are the report field names. Since these are unique, we'd like to use them in generation of 'get' / 'set' functions. As a step towards using them for this purpose, remove the explicit double quotes and use the macro '#' character to add the double quotes back when placing them into the '_fields' array 'id' member.	2010-08-20 12:44:17 +00:00
Dave Wysochanski	d5722ebb21	Add 'flags' field to columns.h and define FIELD_MODIFIABLE. Add a 'flags' field to columns.h, and set it to 0 by default. Define FIELD_MODIFIABLE flag to indicate whether a 'set' function exists to change the field's value.	2010-08-20 12:44:03 +00:00
Dave Wysochanski	69d67dc2ca	Add vg_mda_size and vg_mda_free functions. Add supporting functions to get vg_mda_size and vg_mda_free fields. Should be no functional change.	2010-08-20 12:43:49 +00:00
Milan Broz	586b56b18c	Fix wrong use of LCK_WRITE In all top vg read functions only LCK_VG_READ/WRITE can be used. All other vg lock definitions are low-level backend machinery. Moreover, LCK_WRITE cannot be tested through bitmask. This patch fixes these mistakes. For _recover_vg() we do not need lock_flags, it can be only two of above and we always upgrading to LCK_VG_WRITE lock there. (N.B. that code is racy) There is no functional change in code (despite wrong masking it produces correct bits:-)	2010-08-19 23:26:31 +00:00
Milan Broz	727f7bfa49	Detect LUKS signature in pvcreate One shiny day we should use libblkid here. But now using LUKS is very common together with LVM and pvcreate destroys LUKS completely. So for user's convenience, try to detect LUKS signature and allow abort.	2010-08-19 23:08:18 +00:00
Milan Broz	c37a14506a	Fix file descriptor leak in swap signature detection	2010-08-19 23:05:45 +00:00
Milan Broz	2d5e2b52ca	Change the pvcreate swap/md logic pvcreate detects MD and swap signature. The logic hidden there is not only documented but it is also user unfriendly. Who invented this logic should run pvcreate on its own critical MD device to see why;-) This patch - creates one function instead of duplication code - asks if user want to overwrite signature - allows aborting (!) (Please note that writing LVM signatute without wiping old is wrong, it confuses blkid, MD will not work anyway and swap and LUKS is broken too.)	2010-08-19 23:03:34 +00:00
Alasdair Kergon	22149572e8	Use 'SINGLENODE' instead of 'dead' in clvmd singlenode messages. Ignore snapshots when performing mirror recovery beneath an origin. Pass LCK_ORIGIN_ONLY flag around cluster. Add suspend_lv_origin and resume_lv_origin using LCK_ORIGIN_ONLY.	2010-08-17 19:25:05 +00:00
Alasdair Kergon	2d6fcbf67d	Allow internal suspend and resume of origin without its snapshots.	2010-08-17 16:25:32 +00:00
Alasdair Kergon	85ed403002	Fix dev_manager_transient to access -real device not snapshot-origin. (brassow) Another reminder why cloning functions impedes maintenance.	2010-08-17 01:51:12 +00:00
Alasdair Kergon	f92b4f9482	Monitor origin -real device below snapshot instead of overlay device. (brassow)	2010-08-17 01:16:41 +00:00
Alasdair Kergon	85a80e0505	Don't really change monitoring status when in test mode.	2010-08-16 23:29:09 +00:00
Alasdair Kergon	d1e8046f56	Various small cleanups and fixes related to monitoring.	2010-08-16 22:54:35 +00:00
Jonathan Earl Brassow	d0191bf9f4	Fix for bug 612291: dm devices of split off mirror images are not removed DM devices were not handled properly on nodes in a cluster that were not where the splitmirrors command was issued. This was happening because suspend_lv/resume_lv were being used in a place where activate_lv should have been used. When the suspend/resume are issued on (effectively) new LVs, their 'resource' (UUID) is not located in the lv_hash. Thus, both operations turn into no-ops. You can see this from the output of clvmd from one of the remote nodes: <snip> do_suspend_lv, lock not already held <snip> do_resume_lv, lock not already held 'activate_lv' enjoins the other nodes in the cluster to process the lock and activate the new LV. clvmd output from remote node as follows: do_lock_lv: resource 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S', cmd = 0x19 LCK_LV_ACTIVATE (READ\|LV\|NONBLOCK), flags = 0x84 (DMEVENTD_MONITOR ), memlock = 1 sync_lock: 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S' mode:1 flags=1 sync_lock: returning lkid 27b0001 Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-08-16 18:02:14 +00:00
Mike Snitzer	b123a82d73	Change default alignment of pe_start to 1MB. The new standard in the storage industry is to default alignment of data areas to 1MB. fdisk, parted, and mdadm have all been updated to this default. Update LVM to align the PV's data area start (pe_start) to 1MB. This provides a more useful default than the previous default of 64K (which generally ended up being a 192K pe_start once the first metadata area was created). Before this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 188.00k 192.00k After this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 1020.00k 1.00m The heuristic for setting the default alignment for LVM data areas is: - If the default value (1MB) is a multiple of the detected alignment then just use the default. - Otherwise, use the detected value. In practice this means we'll almost always use 1MB -- that is unless: - the alignment was explicitly specified with --dataalignment - or MD's full stripe width, or the {minimum,optimal}_io_size exceeds 1MB - or the specified/detected value is not a power-of-2	2010-08-12 04:11:48 +00:00
Mike Snitzer	dff224669d	Require --restorefile when using pvcreate --uuid. Introduce --norestorefile to allow user to override the new requirement. This can also be overridden with "devices/require_restorefile_with_uuid" in lvm.conf -- however the default is 1. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-08-12 04:08:59 +00:00
Peter Rajnoha	626242c1bd	Recognise and give preference to md device partitions (blkext major). We can already detect MD devices internally. But when using MD partitions, these have "block extended major" (blkext) assigned (259). Blkext major is also used in general, so we need to check whether the original device is an MD device actually.	2010-08-11 12:14:23 +00:00
Petr Rockai	f3ad0dcfde	Never scan internal LVM devices.	2010-08-09 14:05:16 +00:00
Jonathan Earl Brassow	8d2d4f1fa0	Fix for bug 619221 - log device splitting regression An incorrect fix on July 13, 2010 for an annoyance has caused a regression. The offending check-in was part of the 2.02.71 release of LVM. That check-in caused any PVs specified on the command line to be ignored when performing a mirror split. This patch reverses the aforementioned check-in (solving the regressions) and posits a new solution to the list reversal problem. The original problem was that we would always take the lowest mimage LVs from a mirror when performing a split, but what we really want is to take the highest mimage LVs. This patch accomplishes that by working through the list in reverse order - choosing the higher numbered mimages first. (This also reduces the amount of processing necessary.) Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Takahiro Yasui <takahiro.yasui@hds.com>	2010-08-06 15:38:32 +00:00
Petr Rockai	851aaf4ecc	Reduce severity of the "mirror transient status" log message (this was never intended to be a log_error).	2010-08-04 15:55:03 +00:00
Mike Snitzer	14a9722185	Avoid changing aligned pe_start as a side-effect of very verbose logging.	2010-08-03 18:19:42 +00:00
Peter Rajnoha	97df4e4675	Use built-in rules for device aliases: block/ < dm- < disk/ < mapper/ < other.	2010-08-03 13:39:27 +00:00
Zdenek Kabelac	3eadbbeb12	Fix const warning in dev_manager_info() and _dev_manager_lv_rmnodes().	2010-08-03 13:13:01 +00:00
Zdenek Kabelac	c10f7fd039	Fix constness warning in archive_file structure from archive.c.	2010-08-03 13:09:21 +00:00
Zdenek Kabelac	9f926fd060	Use void parameter for function definition.	2010-08-03 13:06:35 +00:00
Jonathan Earl Brassow	cbd41292a4	Taka's fix for handling failure of all mirrored log devices and all but one mirror leg. <patch header> To handle a double failure of a mirrored log, Jon's two patches are commited, however, lvconvert command can't still handle an error when mirror leg and mirrored log got failure at the same time. [Patch]: Handle both devices of a mirrored log failing (bug 607347) posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00009.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00027.html [Patch]: Handle both devices of a mirrored log failing (bug 607347) - additional fix posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00093.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00101.html In the second patch, the target type of mirrored log is replaced with error target when remove_log is set to 1, but this procedure should be also used in other cases such as the number of mirror leg is 1. This patch relocates the procedure to the main path. In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). - With Jon's second patch, we think that we don't have to call remove_mirror_log() in _lv_update_mirrored_log() because will be handled remove_mirror_images() in _lvconvert_mirrors_repaire(). </patch header> Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Reviewed-by: Petr Rockai <prockai@redhat.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-08-02 21:07:40 +00:00
Jonathan Earl Brassow	efaaf3146d	Disallow mirrored logs in cluster mirrors. The cluster log daemon (cmirrord) is not multi-threaded and can handle only one request at a time. When a log is stacked on top of a mirror (which itself contains a 'core' log), it creates a situation that cannot be solved without threading. When the top level mirror issues a "resume", the log daemon attempts to read from the log device to retrieve the log state. However, the log is a mirror which, before issuing the read, attempts to determine the 'sync' status of the region of the mirror which is to be read. This sync status request cannot be completed by the daemon because it is blocked on a read I/O to the very mirror requesting the sync status.	2010-08-02 19:03:45 +00:00
Dave Wysochanski	936541ec56	Remove irrelevant comments relating to vg_mda_copies.	2010-07-30 16:47:27 +00:00
Alasdair Kergon	8bae0a1ecf	Change clvmd to communicate with lvm via a socket in /var/run/lvm. (mbroz) https://bugzilla.redhat.com/show_bug.cgi?id=614248 [CVE-2010-2526]	2010-07-28 13:55:42 +00:00
Dave Wysochanski	81bf06ea38	Clarify help text for vg_mda_count.	2010-07-21 19:44:25 +00:00
Jonathan Earl Brassow	9baacefc77	Building without the '--enable-cmirrord' option means that CMIRRORD_PIDFILE is not defined. This makes the build fail. Therefore, we need to conditionalize the check for cmirrord based on if CMIRRORD_PIDFILE is defined.	2010-07-21 15:21:24 +00:00
Jonathan Earl Brassow	405c4a45d8	It's not enough to check for the kernel module in the case of cluster mirrors, we must also check that the log daemon (cmirrord) is running. The log module can be auto-loaded, but the daemon cannot be "auto-started". Failing to check for the daemon produces cryptic messages that customers have a hard time deciphering. (The system messages do report that the log daemon is not running, but people don't seem to find this message easily.) Here are examples of what is printed when the module is available, but the log daemon has not been started. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg Shared cluster mirrors are not available. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg -v Setting logging type to disk Finding volume group "vg" Archiving volume group "vg" metadata (seqno 3). Creating logical volume lv Executing: /sbin/modprobe dm-log-userspace Cluster mirror log daemon is not running Shared cluster mirrors are not available. Creating volume group backup "/etc/lvm/backup/vg" (seqno 4).	2010-07-21 13:40:21 +00:00
Jonathan Earl Brassow	60f425d1b3	Fix for bug 614164: No check for existing name when splitting mirror The user could use the same name as an existing LV when specifying a name for an LV split off from a mirror. This causes all sorts of issues.	2010-07-13 22:24:39 +00:00
Jonathan Earl Brassow	c42b084793	Fix for bugs: 612248 & 612291 Split mirror issues The main problem with these bugs was that the newly split off LV was not being suspended properly. This meant that the memlock count was not being balanced, the DM devices were not being renamed, and some DM devices which should have been removed were not. I've also renamed some of the variables and added comments to make things clearer as to what is going on. (I can break this patch in two if it means easier review.)	2010-07-13 21:48:16 +00:00
Fabio M. Di Nitto	8c4e8a185a	Add dm_create_lockfile to libdm to handle pidfiles for all daemons. Switch dmeventd to use dm_create_lockfile and drop duplicate code. Allow clvmd pidfile to be configurable. Switch cmirrord and clvmd to use dm_create_lockfile.	2010-07-13 13:51:01 +00:00
Peter Rajnoha	3122f963b0	Addendum for previous patch - show VG/LV name everywhere so the messages are consistent.	2010-07-12 12:38:35 +00:00
Peter Rajnoha	fefa43235f	Add more verbose messages while checking volume_list and hosttags settings. This should bring less confusion when there are some settings left and people just forgot about it and then they run into problems. These messages should give them a hint of what's really going on.	2010-07-12 11:37:49 +00:00
Jonathan Earl Brassow	a93fb6299f	Failed to test for the case where a log was requested to be removed even though there was no log. A simple run through the in-tree test suite would have caught this. :( - if (lv_is_mirrored(detached_log_lv) && + if (detached_log_lv && lv_is_mirrored(detached_log_lv) && Also, made some cosmetic changes suggested by kabi after my last check-in (e.g. s/return 0/return_0/ and adding an error message).	2010-07-09 17:57:51 +00:00
Dave Wysochanski	f77fb62b2a	Add log_error when strdup fails in {vg\|lv}_change_tag(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-09 16:57:44 +00:00
Alasdair Kergon	08f1ddea6c	Use __attribute__ consistently throughout.	2010-07-09 15:34:40 +00:00
Alasdair Kergon	80e569104b	Remove superfluous fn prototypes.	2010-07-09 15:21:10 +00:00
Jonathan Earl Brassow	aa5734f2a3	Finish fix for bug 607347: failing both redundant mirror log legs... A previous check-in added logic to handle the case where both images of a mirrored log failed. It solved the problem by simply removing the log entirely - leaving the parent mirror with a 'core' log. This worked for most cases. However, if there was a small delay between the failures of the two mirrored log devices, the mirror would hang, LVM would hang, and no additional LVM commands could be issued. When the first leg of the log fails, it signals the need for repair. Before 'lvconvert --repair' is run by dmeventd, the second leg fails. 'lvconvert' would see both devices as failed and try to remove the log entirely. When it came time to suspend the parent mirror to update the configuration, the suspend would hang because it couldn't get any I/O through the mirrored log, which was plugged waiting for corrective action. The solution is to replace the log with an error target to clear any pending writes before removing it. This allows the parent mirror to suspend and make the proper changes.	2010-07-09 15:08:12 +00:00
Dave Wysochanski	a5fb2bbff3	Pass metadataignore to pv_create, pv_setup, _mda_setup, and add_mda. Pass metadataignore through PV creation / setup paths. As a result of this cleanup, we can remove the unnecessary setting of mda_ignore bits inside pvcreate_single(), after call to pv_create. For now, just set metadataignore to '0' in some places. This is equivalent to the prior functionality, although the 0 is given by the caller not hardcoded in _mda_setup() call. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-08 18:24:29 +00:00
Dave Wysochanski	dce204cec5	Init mda->list in mda_copy. This patch should be no functional change as all callers initialize mda->list.	2010-07-08 17:41:46 +00:00
Zdenek Kabelac	764eb41086	Fix format string from patch apply mistake	2010-07-08 14:47:46 +00:00
Zdenek Kabelac	37036b0215	Small update of memlock debug messages. Gives slightly better alligned lines for reading.	2010-07-08 13:05:27 +00:00
Zdenek Kabelac	4ec2ae8632	Do not log backtrace in valid _lv_resume() code path	2010-07-08 12:24:04 +00:00
Dave Wysochanski	7041b476ac	Add warning to vgextend and pvchange if metadataignore given on cmdline. Warn the user then change the value of vg_mda_copies. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-07 18:59:45 +00:00
Alasdair Kergon	7f7af46862	Adjust auto-metadata repair and caching logic to try to cope with empty mdas. - If a PV contained empty mdas, the auto-recovery code was not kicking in. - The 'inconsistent' state was getting lost when metadata was cached so recovery didn't kick in. But leave the behaviour alone when using precommitted metadata because of a warning in a confusing FIXME. In my testing, pvs and vgs didn't repair inconsistent metadata like they used to do. (How many other tools fail similarly now?) And there should be no need to cache inconsistent metadata because it is supposed to get repaired under the protection of a write lock immediately it is discovered. This code is in need of a redesign based on first principles. I still see bugs in this code and this commit is risky.	2010-07-07 02:53:16 +00:00
Alasdair Kergon	6c8655ce9b	fix code in 2nd mda unignore loop to match 1st loop	2010-07-06 20:09:38 +00:00
Alasdair Kergon	68f4e0c734	s/flags/mda/	2010-07-06 17:29:50 +00:00
Alasdair Kergon	0db1bbc3c3	shorten mesg	2010-07-06 17:27:32 +00:00
Alasdair Kergon	643f234119	fix jumbled args in 'Adjusting' message	2010-07-06 17:26:08 +00:00
Alasdair Kergon	d911ec67a9	Randomly select which mdas to use or ignore. Add some missing standard configure.in checks.	2010-07-05 22:23:15 +00:00
Alasdair Kergon	db3c1ac1c8	Add printf format attributes to yes_no_prompt & dm_{sn,as}printf and fix a calle	2010-07-02 21:16:50 +00:00
Alasdair Kergon	d0709eed62	remove unneeded header	2010-07-02 10:25:16 +00:00
Alasdair Kergon	9b95a5a939	Always pass unsuspended dm devices through persistent filter to other filters. Move test for suspended dm devices ahead of other filters.	2010-07-02 02:09:57 +00:00
Alasdair Kergon	12eadbabdd	improve vgmetadatacopies unmanaged message	2010-06-30 20:03:52 +00:00
Dave Wysochanski	3b9d1b1a96	Check for missing_pv in vg_remove loop. If a pv is missing, we should just skip it rather than checking the device size and failing the vgremove.	2010-06-30 19:55:43 +00:00
Alasdair Kergon	d8886386bd	more mda ignore cleanups	2010-06-30 19:28:35 +00:00
Dave Wysochanski	40b4d1c3ae	Refactor vg_remove_check to place pv removal into separate function.	2010-06-30 18:03:52 +00:00
Alasdair Kergon	23177eda88	more metadataignore message/code cleanup	2010-06-30 17:13:05 +00:00
Alasdair Kergon	efe75fd705	revert that	2010-06-30 14:54:29 +00:00
Alasdair Kergon	a6c4427188	suppress useless compiler warning	2010-06-30 14:52:29 +00:00
Dave Wysochanski	ef7b409966	Only attempt to guarantee 1 mda ignored if there's at least one mda in the vg.	2010-06-30 14:48:07 +00:00
Alasdair Kergon	67b91d0848	Only attempt to guarantee 1 mda ignored if there's at least one mda in the vg.	2010-06-30 14:27:40 +00:00
Alasdair Kergon	647c64c796	Improve various log messages.	2010-06-30 13:51:11 +00:00
Dave Wysochanski	7985f80c63	Add pvmetadatacopies to lvm.conf and pvcreate man pages.	2010-06-30 12:49:28 +00:00
Dave Wysochanski	a5bf70018b	Add --metadataignore to pvcreate. Allow metadataignore flag to be passed in to pvcreate. Ideally, more refactoring of the mda allocation / initialization is warranted, but for now, we just add another parameter to 'add_mda' to take an existing mda ignored flag. We need to do this or pv_write loses the state of the mda 'ignored' flag before copying and writing to disk.	2010-06-30 12:17:24 +00:00
Dave Wysochanski	6af5155529	Improve logging for setting --vgmetadatacopies. Example of logging: metadata/metadata.c:1127 Setting mda_copies = 3 on vg vgtest metadata/pv_manip.c:296 /dev/loop2 0: 0 25: NULL(0:0) metadata/pv_manip.c:296 /dev/loop3 0: 0 25: NULL(0:0) metadata/pv_manip.c:296 /dev/loop4 0: 0 25: NULL(0:0) metadata/metadata.c:1072 Adjusting ignored mdas on vg vgtest, vg_mda_used_count=5, vg_mda_copies=3 metadata/metadata.c:1015 Setting ignore flag for 2 mdas on vg vgtest metadata/metadata.c:4151 Setting mda ignored flag for metadata_locn /dev/loop2. metadata/metadata.c:4151 Setting mda ignored flag for metadata_locn /dev/loop3.	2010-06-29 22:41:28 +00:00
Dave Wysochanski	d37dd5b2d3	Improve logging for metadata ignore by printing device name. Print device name when setting or clearing metadata ignore bit. Example: label/label.c:160 /dev/loop2: lvm2 label detected cache/lvmcache.c:1136 lvmcache: /dev/loop2: now in VG #orphans_lvm2 (#orphans_lvm2) metadata/metadata.c:4142 Setting mda ignored flag for metadata_locn /dev/loop2. format_text/text_label.c:318 Skipping mda with ignored flag on device /dev/loop2 at offset 4096	2010-06-29 22:37:32 +00:00
Dave Wysochanski	710c9373bf	Add some log_verbose debug statements related to metadataignore. Logging isn't ideal, especially for mda_set_ignore. Ideally we'd like to display the device name and offset in this case but this requires a bit more work and a per-format 'mda_description' function pointer definition (we don't have access to mda_context in metadata.c).	2010-06-29 22:25:58 +00:00
Dave Wysochanski	a375ced300	Move code into pv_change_metadataignore library function. In preparation to call this from both pvcreate as well as pvchange, move the guts of metadataignore into a library function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-29 21:32:44 +00:00
Dave Wysochanski	559aee44ab	Add error message if backup_to_file fails because of empty in_use mdas list.	2010-06-29 15:03:59 +00:00
Dave Wysochanski	5778fdeeb8	Add more initializations of 'mda->flags' field. Mda allocation needs refactored into a single function but as an interim step, ensure mda->flags is initialized properly.	2010-06-29 14:52:56 +00:00
Dave Wysochanski	fa832e3a55	Attempt to fix intermittent failure with non-debug configured vgcfgbackup. There's an intermittent failure with vgcfgbackup that seems to have been introduced with the metadataignore / vgmetadatacopies patchset. Intermittent failures are often the result of uninitialized data, so this patch calls zalloc in a few places it might matter.	2010-06-29 13:29:53 +00:00
Dave Wysochanski	a9d8bf269a	Allow 'all' and 'unmanaged' values for --vgmetadatacopies. Allowing an 'all' and 'unmanaged' value is more intuitive, and provides a simple way for users to get back to original LVM behavior of metadata written to all PVs in the volume group. If the user requests "--vgmetadatacopies unmanaged", this instructs LVM not to manage the ignore bits to achieve a specific number of metadata copies in the volume group. The user is free to use "pvchange --metadataignore" to control the mdas on a per-PV basis. If the user requests "--vgmetadatacopies all", this instructs LVM to do 2 things: 1) clear all ignore bits, and 2) set the "unmanaged" policy going forward. Internally, we use the special MAX_UINT32 value to indicate 'all'. This 'just' works since it's the largest value possible for the field and so all 'ignore' bits on all mdas in the VG will get cleared inside _vg_metadata_balance(). However, after we've called the _vg_metadata_balance function, we check for the special 'all' value, and if set, we write the "unmanaged" value into the metadata. As such, the 'all' value is never written to disk. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:40:01 +00:00
Dave Wysochanski	a09a8efb66	Update check in vg_split_mdas to account for ignored mdas list. The check in vg_split_mdas will trigger an error if the 'from' vg list is empty. However, this might be ok in some instances now that we have ignored mdas. Relax this check so an error is triggered only in the case where there's truly no more mdas in the 'from' vg. One example of where this makes a difference is with vgreduce. If we try to vgreduce a PV with un-ignored mdas, this should trigger the balancing function to un-ignore mdas on another PV in the VG. However, we don't get to vg_write() before we fail because this list size check fails, and we see an error message indicating: "Cannot remove final metadata area ..." Another example is with vgsplit into a new VG, where the PVs being moved contain all ignored mdas. We must move the mdas on fid->metadata_areas_ignored from 'vg_from' to 'vg_to'. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:38:56 +00:00
Dave Wysochanski	f61cd7b249	Ensure fid mda lists are populated correctly during vgextend. The vgextend path calls add_pv_to_vg(). Inside add_pv_to_vg(), we must ensure we pass the correct mdas list into pv_setup(), as copies of mdas are placed on the vg->fid list. If we don't place the mdas on the correct vg->fid list, the various counts may be incorrect and the metadata balance algorithm will not work when called from vg_write() path. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:38:39 +00:00
Dave Wysochanski	1b54343328	Implement _vg_adjust_ignored_mdas and call from vg_write() path. Compare the value of the newly added vg_mda_copies field (--vgmetadatacopies parameter) with the current count of in-use mdas and ignoring or unignoring mdas as necessary to get to the target count. Also, as a safety check before returning, ensure we have at least one mda enabled. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:54 +00:00
Dave Wysochanski	3534fb40df	Add vg_mda_copies display field to 'vgs' command. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:23 +00:00
Dave Wysochanski	7042e06a2a	Make vg->mda_copies persistent in on disk vg metadata. This patch adds the ability to read/write the vg->mda_copies values from/to the vg metadata. If we read the VG metadata and this field does not exist, we set mda_copies to the default value of 0. Later in the code, we use this special '0' value to indicate a disable of metadata balancing. This should preserve existing LVM behavior and ensure metadata balancing can be turned off should the need arise. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:10 +00:00
Dave Wysochanski	821f0cc5ea	Add vg get/set methods for VG metadata copies. This patch adds the get and partially implemented set function. The 'set' function should probably ignore or un-ignore metadata areas based on new values. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:36:56 +00:00
Dave Wysochanski	88d7dc1af8	Add mda_copies to VG structures and initialization. Add a field to struct volume_group to later implement metadata balancing: - mda_copies: target # of non-ignored mdas in the VG; default 0 (do not control pv 'ignore mdas' bit. This patch just adds the parameter to the structures with the default values but does not modify any commands. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:36:37 +00:00
Dave Wysochanski	0f2f8a5c3a	Before committing each mda, arrange mdas so ignored mdas get committed first. Arrange mdas so mdas that are to be ignored come first. This is an optimization that ensures consistency on disk for the longest period of time. This was noted by agk in review of the v4 patchset of pvchange-based mda balance. Note the following example for an explanation of the background: Assume the initial state on disk is as follows: PV0 (v1, non-ignored) PV1 (v1, non-ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) If we did not sort the list, we would have a commit sequence something like this: PV0 (v2, non-ignored) PV1 (v2, ignored) PV2 (v2, ignored) PV3 (v2, non-ignored) After the commit of PV0's mdas, we'd have an on-disk state like this: PV0 (v2, non-ignored) PV1 (v1, non-ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) This is an inconsistent state of the disk. If the machine fails, the next time it was brought back up, the auto-correct mechanism in vg_read would update the metadata on PV1-PV3. However, if possible we try to avoid inconsistent on-disk states. Clearly, because we did not sort, we have a greater chance of on-disk inconsistency - from the time the commit of PV0 is complete until the time PV3 is complete. We could improve the amount of time the on-disk state is consistent by simply sorting the commit order as follows: PV1 (v2, ignored) PV2 (v2, ignored) PV0 (v2, non-ignored) PV3 (v2, non-ignored) Thus, after the first PV is committed (in this case PV1), on-disk we would have: PV0 (v1, non-ignored) PV1 (v2, ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) This is clearly a consistent state. PV1 will be read but the mda will be ignored. All other PVs contain v1 metadata, and no auto-correct will be required. In fact, if we commit all PVs with ignored mdas first, we'll only have an inconsistent state when we start writing non-ignored PVs, and thus the chances we'll get an inconsistent state on disk is much less with the sorted method. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:49 +00:00
Dave Wysochanski	77e0ed4be7	Refactor vg_commit() to add _vg_commit_mdas(). Factor out calling mda->ops->vg_commit() for each mda. No functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:33 +00:00
Dave Wysochanski	69d1732334	Update _vg_read and _text_create_text_instance to use fid_add_mda[s]. When we are constructing the vg, we may need to adjust the list of metadata_areas if there are ignored mdas. At label read time, we do not read the metadata of ignored mdas, and as a result, they do not get placed on vg->fid->metadata_areas inside _text_create_text_instance since lvmcache does not have these areas attached to vginfo->infos. However, when we're checking the pvids inside _vg_read, after having read another metadata area from another PV, we do have the opportunity to update the metadata_area and metadata_areas_ignored lists based on the read metadata_area. We need accurate mda lists for the reporting functions that count the ignored mdas, as well as general correctness of mda balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:17 +00:00
Dave Wysochanski	bb723d7897	Use mdas_empty_or_ignored() in place of checks for empty mda list. With the addition of ignored mdas, we replace all checks for an empty mda list with a new function to look for either an empty mda list or ignored mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:58 +00:00
Dave Wysochanski	f9c307cd07	Add mdas_empty_or_ignored() helper function. Add a helper function to consolidate checking for an empty mdas list or ignored mdas. Ignored mdas should behave almost identically to an empty mda list - the metadata areas should not be read or written to. This function will make it easier to implement metadata balancing and easier to track pvs with an empty mda list or ignored mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:40 +00:00
Dave Wysochanski	e6bd367b57	Implement ignore of mda if bit set by skipping r/w of metadata. We implement ignore of an mda at label_read time by checking for the ignore bit, and then skipping the reading of the vgname and other information in the metadata. This will have an effect similar to a PV found with no mdas. Thus, it will look like an orphan in the cache until we scan the rest of the system and find a PV with metadata, and the mda will not be on the vg->fid->metadata_areas list so no read/writes will be done to the metadata area. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:24 +00:00
Dave Wysochanski	cdbe475fe3	Define new functions and vgs/pvs fields related to mda ignore. Define a new pvs field, pv_mda_used_count, and a new vgs field, vg_mda_used_count to match the existing pv_mda_count and vg_mda_count. These new fields count the number of mdas that have the 'ignored' bit clear (they are in use on the PV / VG). Also define various supporting functions to implement the counting as well as setting the ignored flag and determining if an mda is ignored. These high level functions call into the lower level location independent mda ignore functions defined by earlier patches. Note that counting ignored mdas in a vg requires traversing both lists and checking for the ignored bit on the mda. The count of 'ignored' mdas then is defined by having the bit set, not by which list the mda is on. The list does determine whether LVM actually does read/write to the mda, though we must count the bits in order to return accurate numbers for the various counts. Also, pv_mda_set_ignored must search both vg lists for ignored mda. If the state changes and needs to be committed to disk, the ignored mda will be on the non-ignored list. Note also in pv_mda_set_ignored(), we must properly manage the mda lists. If we change the ignored state of an mda, we must change any mdas on vg->fid->metadata_areas that correspond to this pv. Also, we may need to allocate a copy of the mda, as is done when fid->metadata_areas is populated from _vg_read(), if we are un-ignoring an ignored mda. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:44 +00:00
Dave Wysochanski	9ccac021a7	Add metadata_areas_ignored list and functions to manage ignored mdas. Add a second mda list, metadata_areas_ignored to fid, and a couple functions, fid_add_mda() and fid_add_mdas() to help manage the lists. These functions are needed to properly count the ignored mdas and manage the lists attached to the 'fid' and ultimately the 'vg'. Ensure metadata_areas_ignored is initialized in other formats, even if the list is never used. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:22 +00:00
Dave Wysochanski	f55a20eb36	Rename fid->metadata_areas to fid->metadata_areas_in_use. Rename the metadata_areas list to an 'in_use' list to prepare for future 'ignored' list.	2010-06-28 20:32:44 +00:00
Dave Wysochanski	6b596f685f	Use vg_mda_count() in vgdisplay. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:32:21 +00:00
Dave Wysochanski	ef4fa155a5	Add mda location specific mda_copy constructor. Because of the way mdas are handled internally, where a PV in a VG has mdas on both info->mdas and vg->fid->metadata_areas list, we need a location independent copy constructor for struct metadata_area. Break up the existing format-text specific copy constructor into a format independent piece and a format dependent piece. This function is necessary to properly implement pv_set_mda_ignored(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:59 +00:00
Dave Wysochanski	29f24d4634	Add mda_locns_match() internal library function for mapping pv/device to VG mda. A metadata_area is defined independent of the location. One downside is that there is no obvious mapping from a pv to an mda. For a PV in a VG, we need a way to start with a PV and end up with an MDA, if we are to manage mdas starting with a device/pv. This function provides us a way to go down the list of PVs on a VG, and identify which ones match a particular PV. I'm not entirely happy with this approach, but it does fit into the existing structures in a reasonable way. An alternative solution might be to refactor the VG - PV interface such that mdas are a list tied to a PV. However, this seemed a bit tricky since a PV does not come into existence until after the list of mdas is constructed (see _vg_read() - we create a 'fid' and attach mdas to it, then we go through them and attach pvs). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:38 +00:00
Dave Wysochanski	a6b36a5901	Ensure in-memory state matches on-disk state of mda ignore bit. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:18 +00:00
Dave Wysochanski	09e0f43ba0	Allow raw_read_mda_header to be called from text_label.c. We'd like to pass in mda_header to vgname_from_mda(). In order to do this, we need to call raw_read_mda_header() from text_label.c, _text_read(), which gets called from the label_read() path, and peers into the metadata and update vginfo cache. We should check the disable bit here, and if set, not peer into the vg metadata, thus reducing the I/O to disk. In the process, move vgname_from_mda() to layout.h, since the fn only gets called from format_text code, and we need the mda_header definition from the private layout.h. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:01 +00:00
Dave Wysochanski	da0b4d8770	Move dev_open/dev_close outside vgname_from_mda(). Refactor vgname_from_mda() so caller must open/close the device. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:46 +00:00
Dave Wysochanski	96597c2eab	Move dev_open / dev_close outside _vg_read_raw_area(). This refactoring moves the device open/close up one level to the caller of _vg_read_raw_area(). Should be no functional change and facilitate future changes related to metadata balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:30 +00:00
Dave Wysochanski	322c5868b3	Add location independent flag and functions to ignore mdas. First we add a 'flags' field to the location independent metadata_area structure, and a MDA_IGNORE flag. The mda_is_ignored and mda_set_ignored functions are added to manage the flag. Adding the flag and functions gives a library interface to ignore metadata areas independent of the underlying location (disk, file, etc). The location specific read/write functions must then handle the specifics of what this flag means to the location. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:30:14 +00:00
Dave Wysochanski	d144d5eeb7	Add text format specific 'rlocn' ignore flag and access functions. Adding a flag to the 'rlocn' structure in the mda header of the text format allows us to flip a bit to ignore an area on disk that stores the metadata via the text format specific mda_header. This patch defines the flag and access functions to manage the flag. Other patches will manage the ignore on a format-independent basis, by using a flag in the metadata_area structure. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:57 +00:00
Dave Wysochanski	7c604e7649	Change 'filler' to 'flags' in on-disk 'raw_locn' structure. Future patches will make use of a specific flag in the on-disk 'raw_locn' structure to enable/disable metadata areas, and facilitate metadata balancing. Note that 'filler' is always set to '0' (see add_mda() - memset), so use of this area as a non-zero flags field is a safe way to provide future code features. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:42 +00:00
Jonathan Earl Brassow	68c31a2a36	Fix for bz608048 from Taka... The same region size is used for both mirror volume and mirrored log volume, but when the physical extent size is bigger than region size, the size of mirror leg for mirrored log is smaller than the region size and lvcreate command fails. This patch adjusts a region size of mirrored log to a smaller value of region size or physical extent size. [This patch ensures that the region_size of the mirrored log does not exceed the size of the mirrored log itself, which would violate the kernel constraint: (region_size <= ti->len).] Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-06-28 14:19:41 +00:00
Zdenek Kabelac	d301e5917f	Preload libc locale messages. Preload libc.mo file for localized lvm before taking memory lock - this way we prevent disk access for some error paths in libdm, that prints localized errno messages while they are still in memory locked state.	2010-06-24 08:29:30 +00:00
Jonathan Earl Brassow	42f7fd0590	The function that runs to compress a stacked mirror after converting from 2-way to 3-way mirror (collapse_mirrored_lv) was calling '_remove_mirror_images' with the 'remove_log' parameter set. When the code was put in to fix 599898 to honor log parameters during conversion, this argument was suddenly being honored. Thus, when someone would convert from a 2-way to 3-way mirror, the log would get removed. 'collapse_mirrored_lv' should not be calling '_remove_mirror_images' with 'remove_log' set.	2010-06-23 13:57:26 +00:00
Alasdair Kergon	07ae1d4943	Add lv_path to reports to offer full /dev pathname.	2010-06-23 12:32:08 +00:00
Milan Broz	f9e177d281	Fix "allocated" warning typo.	2010-06-22 21:10:53 +00:00
Dave Wysochanski	58f55600d0	Add device name to output of error messages in raw_read_mda_header(). It would be helpful if we had the device name when something like a mda_header checksum error occurs. Before: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0 After: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum on /dev/loop2 at offset 4096 PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0	2010-06-22 19:18:27 +00:00
Jonathan Earl Brassow	a7d355a28c	Mirrors can be layered - as in the case of an converting 2-way to 3-way mirror. When conversion operations are performed on these types of mirrors, log options can be confused/ignored. In the case of a converting 3-way mirror, we have a top-level 2-way corelog mirror whose legs are 1) a 2-way disk-log mirror and 2) a linear device. If we wish to convert this 3-way mirror to a 2-way mirror, the linear device is removed and the extra top layer is eliminated. If we also wished to convert the disk log to a core log in the same step, ambiguity creeps in. It is somewhat obvious what the user wants - a 2-way mirror with a corelog. However, looking at the top level mirror before compression, it seems that the mirror already has a core log. This is why the operation seemed to fail. This patch simply re-evaluates what mirrored_seg points to after a compression and then considers the log argument. This is a fix for bug 599898.	2010-06-21 16:12:33 +00:00
Alasdair Kergon	b4ee00356b	Various cleanups following recent commits.	2010-06-21 15:56:57 +00:00
Milan Broz	d2031f6a16	Clean up cluster lock mode and flags definition. Code is mixing up internal DLM and LVM definitions of lock modes and flags. OpenAIS and singlenode locking do not depend on DLM but code currently cannot be compiled without libdlm.h! LCK_* flags is LVM abstraction, used through all the code. Only low-level backend (clvmd-cman etc) should use DLM definitions, also this code should do all needed conversions. Because there are two DLM flags used in generic code (NOQUEUE, CONVERT) we define it similar way like lock modes. (So all needed binary-compatible flags are on one place in locking.h) (Further code cleaning still needed, though:-)	2010-06-17 12:48:54 +00:00
Zdenek Kabelac	b5c2529bf8	Use absolute paths in commands clvmd restart does not work at all if clvmd binary is not in current dir.	2010-06-03 13:50:26 +00:00
Dave Wysochanski	e9a87f2e16	Revert _init_rand() to reset errno - restores original init behavior.	2010-06-01 21:47:57 +00:00
Dave Wysochanski	e3a1fc8af3	Do not fail lvm_init() if init_logging() generates an errno. Revert to original behavior of lvm commands if init_logging() generated an errno. Fixes rhbz 592967.	2010-06-01 21:46:29 +00:00
Alasdair Kergon	d01888706d	Don't merge unchanged persistent cache file before dumping if tool scanned.	2010-06-01 19:02:12 +00:00
Peter Rajnoha	03023d3965	Fix incorrect memory pool deallocation while using vg_read for files. We create a separate pool "lvm2 vg_read" for vg_read and we don't use cmd->mem anymore.	2010-06-01 12:08:50 +00:00
Alasdair Kergon	ba61f84874	Replace strncmp kernel version number checks with proper ones	2010-05-24 23:11:34 +00:00
Alasdair Kergon	ed122a962d	Avoid selecting names under /dev/block if there is an alternative.	2010-05-24 22:53:48 +00:00
Alasdair Kergon	5782a38543	Choose between clustered log versions based on kernel version. Add fixmes for broken strcmp.	2010-05-24 17:46:47 +00:00
Alasdair Kergon	1493f24e3e	Update clustered log kernel module name to log-userspace.	2010-05-24 16:30:15 +00:00
Petr Rockai	d345bf2cd3	Account for mirror transient status when doing lvconvert --repair.	2010-05-24 15:32:20 +00:00
Zdenek Kabelac	4ef2bf27a7	Update Copyright date for resently modifed files	2010-05-24 09:04:27 +00:00
Zdenek Kabelac	591fc4d2d1	Replicator: add replicator to dtree Adding all replicator related LVs to dtree. Start of one replicator_dev initiate start of all other related.	2010-05-24 09:01:05 +00:00
Zdenek Kabelac	dc7d777606	Replicator: update _create_partial_dtree() Adding function _add_partial_replicator_to_dtree() to create partial tree for Replicator target. Using dm_tree_node_set_presuspend_node() for Replicator.	2010-05-21 14:47:58 +00:00
Zdenek Kabelac	9249fb12b8	Replicator: activate checks for missing vgs Activation needs to have all remote VGs locked to pass for replicator. So if activated LV is replicator-dev LV - read all remote VGs.	2010-05-21 14:34:01 +00:00
Zdenek Kabelac	b5f33faf9f	Replicator: lock_lv_vol() finds missing VGs Find and check for all needed VGs before calling lock_vol().	2010-05-21 14:29:49 +00:00
Zdenek Kabelac	65928349e7	Replicator: add read and release VGs for rsites Add functions to read and release remote VGs for replicator sites in activation context.	2010-05-21 14:07:16 +00:00
Zdenek Kabelac	f6d7e637c3	Add toolcontext.h header file.	2010-05-21 13:34:09 +00:00
Zdenek Kabelac	49e7eb3570	Remove files from wrong directory	2010-05-21 13:18:25 +00:00
Zdenek Kabelac	4c77dcc4ea	Right directory name for replicator files. My local CVS was placing file in slightly different directory by using obsolete files.	2010-05-21 13:17:20 +00:00
Zdenek Kabelac	0e6270e308	Hmm - fixing cvs import mistake	2010-05-21 13:14:02 +00:00
Zdenek Kabelac	6222635b38	Replicator: add find_replicator_vgs Adding find_replicator_vgs() function to find all needed VGs for replicator-dev LV. This function is later called before taking lock_vol().	2010-05-21 12:55:25 +00:00
Zdenek Kabelac	12569ccb03	Replicator: add sorted cmd_vg list Introduce struct cmd_vg to store information about needed volume group name, vgid, flags and the pointer to opened VG. Keep VGs list in alphabetical order for locking order. Introduce functions: cmd_vg_add() add new cmd_vg entry. cmd_vg_lookup() search cmd_vgs for vg_name. cmd_vg_read() open VGs in cmd_vgs list. cmd_vg_release() close VGs in reversed order.	2010-05-21 12:52:01 +00:00
Zdenek Kabelac	0a02d30ea4	Replicator: extend volume_group with list of VGs and flag Add pointer to linked list of opened VGs. List temporarily keeps the information about needed or locked and opened VGs for replicator target. Also add cmd_missing_vgs flag information for quick check and also for possible continuos process_each_lv() usage where we need to detect whether failure has been caused by missing VG or some other reason.	2010-05-21 12:47:46 +00:00
Zdenek Kabelac	e86e45f7ea	Replicator: extend _lv_each_dependency() with dependencies for Replicator devices	2010-05-21 12:45:18 +00:00
Zdenek Kabelac	651cae3c5c	Replicator: check replicator segment Check for possible problems within replicator structures. Used also by vg_validate.	2010-05-21 12:43:02 +00:00
Zdenek Kabelac	1207106fbc	Replicator: new files for Replicator target	2010-05-21 12:40:05 +00:00
Zdenek Kabelac	8fea97b7e7	Replicator: base lvm2 support Adding configure.in support for Replicators. Adding basic lib lvm support for Replicators. Adding flags REPLICATOR and REPLICATOR_LOG. Adding segments SEG_REPLICATOR and SEG_REPLICATOR_DEV. Adding basic methods for handling replicator metadata.	2010-05-21 12:36:30 +00:00
Alasdair Kergon	21c56e377b	backup->cache	2010-05-20 22:32:44 +00:00
Alasdair Kergon	ae6164cee3	If unable to obtain snapshot percentage leave value blank on reports.	2010-05-20 22:24:33 +00:00
Alasdair Kergon	3399ae320a	Add install_system_dirs makefile target. Add configure options for system and locking directories.	2010-05-20 13:47:21 +00:00
Dave Wysochanski	dd2a0e940d	Add find_vgname_from_{pvname\|pvid} functions. Some commands start with a pvname, but we'd like to force users to start with a vg handle to obtain a pv handle. Our best option seems to be providing a way to look up the vgname from the pvname, and then require them to use vg_read/vg_open. In addition to the pvname lookup function, this patch also provides a lookup by pvid. The lookup by pvid can be used in conjunction with lvmcache_get_pvids to process all pvs in the system. The pvid find function first calls lvmcache_vgname_from_pvid, which may cause the label to be read if it is not in the cache. If the vgname is returned is an orphan, we then check to see if there are metadata areas, and if not, we scan every PV on the system by calling scan_vgs_for_pvs(). In most cases we should not need to do this, and by using the info->mdas count, we avoid calling pv_read() as prior code did. So this patch is a bit cleaner and should allow us to refactor more of the pv code. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-05-19 11:52:37 +00:00
Dave Wysochanski	cfb203e14d	Add lvmcache_vgname_from_pvid(). Add lvmcache function to lookup a vgname from a pvid.	2010-05-19 11:52:21 +00:00
Dave Wysochanski	dd4097a6a4	Add pvid_from_devname() lvmcache function. Add supporting function for mappings from devname -> pvid -> vgname.	2010-05-19 11:52:07 +00:00
Alasdair Kergon	1d837442bf	Add is_global_vg and split out from is_orphan_vg.	2010-05-19 02:36:33 +00:00
Alasdair Kergon	34220fe292	Validate orphan and VG_GLOBAL lock order too.	2010-05-19 02:08:50 +00:00
Alasdair Kergon	8c965cafb7	Note that orphan lock is always obtained last	2010-05-19 01:49:08 +00:00
Alasdair Kergon	fa305e2ec6	Accept orphan VG names as parameters to lock_vol() and related functions.	2010-05-19 01:16:40 +00:00
Alasdair Kergon	24d21cfcee	Use is_orphan_vg in place of hard-coded prefix tests.	2010-05-19 00:52:55 +00:00
Alasdair Kergon	a988efda46	pre-release	2010-05-17 18:39:01 +00:00
Jonathan Earl Brassow	a932c2b61f	Disallow toggling the cluster attribute of a volume group if there are active mirrors or snapshots. We don't have the mechanisms in place to change the device-mapper tables for those targets that have behavioral differences between cluster and single machine instances. Allowing users to change the attribute but not changing the target's behavior can lead to data corruption. The following bugs are fixed/avoided by this patch: 235123 - vgchange -c [ny] do not change target types when necessary 289331 - RFE: switching from cluster domain to local domain needs to deactivate volume somehow 289541 - when changing from local to cluster, volumes can not appear to be deactivated	2010-05-14 15:19:42 +00:00
Milan Broz	ea9e387f87	Another one internal device layer fix...	2010-05-14 12:39:52 +00:00
Milan Broz	b8fb120ee0	Fix empty layer detection is scan devices.	2010-05-14 12:30:43 +00:00
Milan Broz	b449f0a188	Fix device_is_usable to properly detect only internal LV names.	2010-05-14 12:03:32 +00:00
Milan Broz	28e2b5b213	Skip also special lvm devices in scan (if ignore suspended is used). This should avoid various races between dmeventd on multiple nodes in cluster where one node already repairing device and another run full scan and locks the device.	2010-05-13 18:38:38 +00:00
Milan Broz	9ad39e546b	Currently if clvmd is running and user issues vgscan, the device cache file is dumped both in vgscan and clvmd process. Unfortunately, clvmd calls lvmcache_label_scan, it properly destroys persistent filter, but during persistent_filter_dump it merges old cache content back! This causes that change in filters is not properly propagated into device cache after vgscan on cluster. (Only new devices are added.) https://bugzilla.redhat.com/show_bug.cgi?id=591861	2010-05-13 13:04:03 +00:00
Peter Rajnoha	d50c6d4bf5	Add new --sysinit option for vgchange and lvchange. A shortcut for --ignorelockingfailure, --ignoremonitoring, --poll n options and LVM_SUPPRESS_LOCKING_FAILURE_MESSAGES environment variable used all at once in initialisation scripts (e.g. rc.sysinit or initrd).	2010-05-06 11:15:55 +00:00
Zdenek Kabelac	180b3dea11	Install plugins to subdirs Target install_dm_plugin installs files to libdir/device-mapper. Target install_lvm2_plugin installs files to libdir/lvm2. Both targets creates relative links to libdir to keep the code compatible with current dlopen handling. Once we will be able to read plugins from subdir, links could be removed.	2010-05-06 10:07:46 +00:00
Petr Rockai	9409998d71	Suppress duplicate error messages about read failures and missing devices.	2010-05-05 22:37:52 +00:00
Zdenek Kabelac	244646902c	Initialise _vginfos list staticaly so there is no problem with using uninitialised variables in case, lvmcache_destory() is called without lvmcache_init().	2010-04-30 12:54:31 +00:00
Zdenek Kabelac	7147cd9fe5	Release pools for regex if there is error during processing (fixes error messages about unreleased pools).	2010-04-30 12:37:04 +00:00
Zdenek Kabelac	8889fda53d	Show string with invalid pattern and start error message with capital letter.	2010-04-30 12:31:32 +00:00
Alasdair Kergon	54d7741a3e	Remove no-longer-used arg_ptr_value. Fix -M and --type to use strings not pointers that change on config refresh.	2010-04-29 01:38:12 +00:00
Jonathan Earl Brassow	56a5925aed	Fix comment from last commit. Additionally, there is no need to put a comment into the WHATS_NEW file if it is a regression that was created and fixed inside the same release window.	2010-04-27 15:26:58 +00:00
Jonathan Earl Brassow	d7c9d72390	Patch to fix bug 586021 and mantain historical behavior of being able to remove more images from a mirror than the number of PVs directly specified for removal. The effort to fix bug 581611 corrected a bug that was unnoticed at the time. The loop in _remove_mirror_images that looks over the specified PVs was allowing devices that were previously counted and moved to the end of the list to be double-counted. This resulted in the number of devices needed for removal always being satisfied - even if the user did not specify enough PVs for removal to satisfy the request. When 581611 was fixed, this double-counting no longer took place and the result was to remove only the minimum of the number of PVs specified or the number that was asked to be removed. By simply always setting 'new_area_count' (as used to be done only in the else statement), we return to the previous behavior. Indeed, this is exactly what the double-counting was allowing to happen before the fix of 581611.	2010-04-27 14:57:49 +00:00
Mike Snitzer	60267bdce8	Disallow the direct removal of a merging snapshot. Allow lv_remove_with_dependencies() to know the top-level LV that was requested to be removed (otherwise it recurses and we lose context). A merging snapshot cannot be removed directly but the associated origin can be. Disallow removal of a merging snapshot unless the associated origin is also being removed.	2010-04-23 19:27:10 +00:00
Peter Rajnoha	6ddb5ecd72	Set appropriate udev flags for reserved LVs. There's no need for foreign udev rules to touch LVM reserved devices (snapshot, pvmove, _mlog, _mimage, _vorigin) even if they happen to be visible. The same applies for /dev/disk content - no need to create any content for these devices (and so no need to run any "blkid" etc.). This also prevents setting any inotify "watch" from udev rules on such devices that is a source of race conditions (the rules need to honor DM_UDEV_DISABLE_OTHER_RULES_FLAG for this to work though).	2010-04-23 14:16:32 +00:00
Mike Snitzer	1f661c5dd8	When removing a snapshot avoid preloading the origin if the snapshot-merge target is not active.	2010-04-23 02:57:39 +00:00
Jonathan Earl Brassow	66f79d05eb	Disallow the primary mirror image from being removed when the mirror is not in-sync. This restriction is not extended to repair operations (i.e. it will not limit what 'lvconvert --repair' can do).	2010-04-21 13:55:08 +00:00
Alasdair Kergon	ee90b8197f	Move function up file	2010-04-20 12:14:28 +00:00
Peter Rajnoha	1e696b0c15	Do not reset position in metadata ring buffer on vgrename and vgcfgrestore. We should write metadata into next position in the ring buffer while calling vgrename and vgcfgrestore. At this code level (_vg_write_raw), we were not able to determine if this is a rename or not. If yes, then accompanying VG structure passed here has a new name set, not the old one. When looking for a location where to put metadata next, we were given a NULL value because of failed VG name comparison (in _find_vg_rlocn) between the name in existing metadata and metadata we're just about to write. This resets the position in the ring buffer, overwriting any existing metadata (and also incorrectly updates the cache to "orphan" afterwards). This patch just adds old_name item in struct volume_group that we can check and use if necessary and detect renames at lower layers as well. The same applies for vgcfgrestore, but here we're using a special value of old_name, an empty string, to disable the check with existing metadata totally.	2010-04-14 13:09:16 +00:00
Peter Rajnoha	bda3982016	Use UUIDs instead of names while processing event handlers. Internally, we used DM names instead of UUIDs while processing event handlers. This caused problems while trying to vgrename a VG with active LVs where the names are being changed and so the devices were not found then. The patch also contains a little bit of refactoring, moving "build_dlid" code found in dev_manager.c to "build_dm_uuid", now in lvm-string.c (so we have build_dm_uuid and build_dm_name at one place).	2010-04-14 13:01:38 +00:00

... 2 3 4 5 6 ...

2222 Commits