shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	c10f7fd039	Fix constness warning in archive_file structure from archive.c.	2010-08-03 13:09:21 +00:00
Zdenek Kabelac	9f926fd060	Use void parameter for function definition.	2010-08-03 13:06:35 +00:00
Jonathan Earl Brassow	cbd41292a4	Taka's fix for handling failure of all mirrored log devices and all but one mirror leg. <patch header> To handle a double failure of a mirrored log, Jon's two patches are commited, however, lvconvert command can't still handle an error when mirror leg and mirrored log got failure at the same time. [Patch]: Handle both devices of a mirrored log failing (bug 607347) posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00009.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00027.html [Patch]: Handle both devices of a mirrored log failing (bug 607347) - additional fix posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00093.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00101.html In the second patch, the target type of mirrored log is replaced with error target when remove_log is set to 1, but this procedure should be also used in other cases such as the number of mirror leg is 1. This patch relocates the procedure to the main path. In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). - With Jon's second patch, we think that we don't have to call remove_mirror_log() in _lv_update_mirrored_log() because will be handled remove_mirror_images() in _lvconvert_mirrors_repaire(). </patch header> Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Reviewed-by: Petr Rockai <prockai@redhat.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-08-02 21:07:40 +00:00
Jonathan Earl Brassow	efaaf3146d	Disallow mirrored logs in cluster mirrors. The cluster log daemon (cmirrord) is not multi-threaded and can handle only one request at a time. When a log is stacked on top of a mirror (which itself contains a 'core' log), it creates a situation that cannot be solved without threading. When the top level mirror issues a "resume", the log daemon attempts to read from the log device to retrieve the log state. However, the log is a mirror which, before issuing the read, attempts to determine the 'sync' status of the region of the mirror which is to be read. This sync status request cannot be completed by the daemon because it is blocked on a read I/O to the very mirror requesting the sync status.	2010-08-02 19:03:45 +00:00
Dave Wysochanski	936541ec56	Remove irrelevant comments relating to vg_mda_copies.	2010-07-30 16:47:27 +00:00
Alasdair Kergon	8bae0a1ecf	Change clvmd to communicate with lvm via a socket in /var/run/lvm. (mbroz) https://bugzilla.redhat.com/show_bug.cgi?id=614248 [CVE-2010-2526]	2010-07-28 13:55:42 +00:00
Dave Wysochanski	81bf06ea38	Clarify help text for vg_mda_count.	2010-07-21 19:44:25 +00:00
Jonathan Earl Brassow	9baacefc77	Building without the '--enable-cmirrord' option means that CMIRRORD_PIDFILE is not defined. This makes the build fail. Therefore, we need to conditionalize the check for cmirrord based on if CMIRRORD_PIDFILE is defined.	2010-07-21 15:21:24 +00:00
Jonathan Earl Brassow	405c4a45d8	It's not enough to check for the kernel module in the case of cluster mirrors, we must also check that the log daemon (cmirrord) is running. The log module can be auto-loaded, but the daemon cannot be "auto-started". Failing to check for the daemon produces cryptic messages that customers have a hard time deciphering. (The system messages do report that the log daemon is not running, but people don't seem to find this message easily.) Here are examples of what is printed when the module is available, but the log daemon has not been started. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg Shared cluster mirrors are not available. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg -v Setting logging type to disk Finding volume group "vg" Archiving volume group "vg" metadata (seqno 3). Creating logical volume lv Executing: /sbin/modprobe dm-log-userspace Cluster mirror log daemon is not running Shared cluster mirrors are not available. Creating volume group backup "/etc/lvm/backup/vg" (seqno 4).	2010-07-21 13:40:21 +00:00
Jonathan Earl Brassow	60f425d1b3	Fix for bug 614164: No check for existing name when splitting mirror The user could use the same name as an existing LV when specifying a name for an LV split off from a mirror. This causes all sorts of issues.	2010-07-13 22:24:39 +00:00
Jonathan Earl Brassow	c42b084793	Fix for bugs: 612248 & 612291 Split mirror issues The main problem with these bugs was that the newly split off LV was not being suspended properly. This meant that the memlock count was not being balanced, the DM devices were not being renamed, and some DM devices which should have been removed were not. I've also renamed some of the variables and added comments to make things clearer as to what is going on. (I can break this patch in two if it means easier review.)	2010-07-13 21:48:16 +00:00
Fabio M. Di Nitto	8c4e8a185a	Add dm_create_lockfile to libdm to handle pidfiles for all daemons. Switch dmeventd to use dm_create_lockfile and drop duplicate code. Allow clvmd pidfile to be configurable. Switch cmirrord and clvmd to use dm_create_lockfile.	2010-07-13 13:51:01 +00:00
Peter Rajnoha	3122f963b0	Addendum for previous patch - show VG/LV name everywhere so the messages are consistent.	2010-07-12 12:38:35 +00:00
Peter Rajnoha	fefa43235f	Add more verbose messages while checking volume_list and hosttags settings. This should bring less confusion when there are some settings left and people just forgot about it and then they run into problems. These messages should give them a hint of what's really going on.	2010-07-12 11:37:49 +00:00
Jonathan Earl Brassow	a93fb6299f	Failed to test for the case where a log was requested to be removed even though there was no log. A simple run through the in-tree test suite would have caught this. :( - if (lv_is_mirrored(detached_log_lv) && + if (detached_log_lv && lv_is_mirrored(detached_log_lv) && Also, made some cosmetic changes suggested by kabi after my last check-in (e.g. s/return 0/return_0/ and adding an error message).	2010-07-09 17:57:51 +00:00
Dave Wysochanski	f77fb62b2a	Add log_error when strdup fails in {vg\|lv}_change_tag(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-09 16:57:44 +00:00
Alasdair Kergon	08f1ddea6c	Use __attribute__ consistently throughout.	2010-07-09 15:34:40 +00:00
Alasdair Kergon	80e569104b	Remove superfluous fn prototypes.	2010-07-09 15:21:10 +00:00
Jonathan Earl Brassow	aa5734f2a3	Finish fix for bug 607347: failing both redundant mirror log legs... A previous check-in added logic to handle the case where both images of a mirrored log failed. It solved the problem by simply removing the log entirely - leaving the parent mirror with a 'core' log. This worked for most cases. However, if there was a small delay between the failures of the two mirrored log devices, the mirror would hang, LVM would hang, and no additional LVM commands could be issued. When the first leg of the log fails, it signals the need for repair. Before 'lvconvert --repair' is run by dmeventd, the second leg fails. 'lvconvert' would see both devices as failed and try to remove the log entirely. When it came time to suspend the parent mirror to update the configuration, the suspend would hang because it couldn't get any I/O through the mirrored log, which was plugged waiting for corrective action. The solution is to replace the log with an error target to clear any pending writes before removing it. This allows the parent mirror to suspend and make the proper changes.	2010-07-09 15:08:12 +00:00
Dave Wysochanski	a5fb2bbff3	Pass metadataignore to pv_create, pv_setup, _mda_setup, and add_mda. Pass metadataignore through PV creation / setup paths. As a result of this cleanup, we can remove the unnecessary setting of mda_ignore bits inside pvcreate_single(), after call to pv_create. For now, just set metadataignore to '0' in some places. This is equivalent to the prior functionality, although the 0 is given by the caller not hardcoded in _mda_setup() call. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-08 18:24:29 +00:00
Dave Wysochanski	dce204cec5	Init mda->list in mda_copy. This patch should be no functional change as all callers initialize mda->list.	2010-07-08 17:41:46 +00:00
Zdenek Kabelac	764eb41086	Fix format string from patch apply mistake	2010-07-08 14:47:46 +00:00
Zdenek Kabelac	37036b0215	Small update of memlock debug messages. Gives slightly better alligned lines for reading.	2010-07-08 13:05:27 +00:00
Zdenek Kabelac	4ec2ae8632	Do not log backtrace in valid _lv_resume() code path	2010-07-08 12:24:04 +00:00
Dave Wysochanski	7041b476ac	Add warning to vgextend and pvchange if metadataignore given on cmdline. Warn the user then change the value of vg_mda_copies. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-07 18:59:45 +00:00
Alasdair Kergon	7f7af46862	Adjust auto-metadata repair and caching logic to try to cope with empty mdas. - If a PV contained empty mdas, the auto-recovery code was not kicking in. - The 'inconsistent' state was getting lost when metadata was cached so recovery didn't kick in. But leave the behaviour alone when using precommitted metadata because of a warning in a confusing FIXME. In my testing, pvs and vgs didn't repair inconsistent metadata like they used to do. (How many other tools fail similarly now?) And there should be no need to cache inconsistent metadata because it is supposed to get repaired under the protection of a write lock immediately it is discovered. This code is in need of a redesign based on first principles. I still see bugs in this code and this commit is risky.	2010-07-07 02:53:16 +00:00
Alasdair Kergon	6c8655ce9b	fix code in 2nd mda unignore loop to match 1st loop	2010-07-06 20:09:38 +00:00
Alasdair Kergon	68f4e0c734	s/flags/mda/	2010-07-06 17:29:50 +00:00
Alasdair Kergon	0db1bbc3c3	shorten mesg	2010-07-06 17:27:32 +00:00
Alasdair Kergon	643f234119	fix jumbled args in 'Adjusting' message	2010-07-06 17:26:08 +00:00
Alasdair Kergon	d911ec67a9	Randomly select which mdas to use or ignore. Add some missing standard configure.in checks.	2010-07-05 22:23:15 +00:00
Alasdair Kergon	db3c1ac1c8	Add printf format attributes to yes_no_prompt & dm_{sn,as}printf and fix a calle	2010-07-02 21:16:50 +00:00
Alasdair Kergon	d0709eed62	remove unneeded header	2010-07-02 10:25:16 +00:00
Alasdair Kergon	9b95a5a939	Always pass unsuspended dm devices through persistent filter to other filters. Move test for suspended dm devices ahead of other filters.	2010-07-02 02:09:57 +00:00
Alasdair Kergon	12eadbabdd	improve vgmetadatacopies unmanaged message	2010-06-30 20:03:52 +00:00
Dave Wysochanski	3b9d1b1a96	Check for missing_pv in vg_remove loop. If a pv is missing, we should just skip it rather than checking the device size and failing the vgremove.	2010-06-30 19:55:43 +00:00
Alasdair Kergon	d8886386bd	more mda ignore cleanups	2010-06-30 19:28:35 +00:00
Dave Wysochanski	40b4d1c3ae	Refactor vg_remove_check to place pv removal into separate function.	2010-06-30 18:03:52 +00:00
Alasdair Kergon	23177eda88	more metadataignore message/code cleanup	2010-06-30 17:13:05 +00:00
Alasdair Kergon	efe75fd705	revert that	2010-06-30 14:54:29 +00:00
Alasdair Kergon	a6c4427188	suppress useless compiler warning	2010-06-30 14:52:29 +00:00
Dave Wysochanski	ef7b409966	Only attempt to guarantee 1 mda ignored if there's at least one mda in the vg.	2010-06-30 14:48:07 +00:00
Alasdair Kergon	67b91d0848	Only attempt to guarantee 1 mda ignored if there's at least one mda in the vg.	2010-06-30 14:27:40 +00:00
Alasdair Kergon	647c64c796	Improve various log messages.	2010-06-30 13:51:11 +00:00
Dave Wysochanski	7985f80c63	Add pvmetadatacopies to lvm.conf and pvcreate man pages.	2010-06-30 12:49:28 +00:00
Dave Wysochanski	a5bf70018b	Add --metadataignore to pvcreate. Allow metadataignore flag to be passed in to pvcreate. Ideally, more refactoring of the mda allocation / initialization is warranted, but for now, we just add another parameter to 'add_mda' to take an existing mda ignored flag. We need to do this or pv_write loses the state of the mda 'ignored' flag before copying and writing to disk.	2010-06-30 12:17:24 +00:00
Dave Wysochanski	6af5155529	Improve logging for setting --vgmetadatacopies. Example of logging: metadata/metadata.c:1127 Setting mda_copies = 3 on vg vgtest metadata/pv_manip.c:296 /dev/loop2 0: 0 25: NULL(0:0) metadata/pv_manip.c:296 /dev/loop3 0: 0 25: NULL(0:0) metadata/pv_manip.c:296 /dev/loop4 0: 0 25: NULL(0:0) metadata/metadata.c:1072 Adjusting ignored mdas on vg vgtest, vg_mda_used_count=5, vg_mda_copies=3 metadata/metadata.c:1015 Setting ignore flag for 2 mdas on vg vgtest metadata/metadata.c:4151 Setting mda ignored flag for metadata_locn /dev/loop2. metadata/metadata.c:4151 Setting mda ignored flag for metadata_locn /dev/loop3.	2010-06-29 22:41:28 +00:00
Dave Wysochanski	d37dd5b2d3	Improve logging for metadata ignore by printing device name. Print device name when setting or clearing metadata ignore bit. Example: label/label.c:160 /dev/loop2: lvm2 label detected cache/lvmcache.c:1136 lvmcache: /dev/loop2: now in VG #orphans_lvm2 (#orphans_lvm2) metadata/metadata.c:4142 Setting mda ignored flag for metadata_locn /dev/loop2. format_text/text_label.c:318 Skipping mda with ignored flag on device /dev/loop2 at offset 4096	2010-06-29 22:37:32 +00:00
Dave Wysochanski	710c9373bf	Add some log_verbose debug statements related to metadataignore. Logging isn't ideal, especially for mda_set_ignore. Ideally we'd like to display the device name and offset in this case but this requires a bit more work and a per-format 'mda_description' function pointer definition (we don't have access to mda_context in metadata.c).	2010-06-29 22:25:58 +00:00
Dave Wysochanski	a375ced300	Move code into pv_change_metadataignore library function. In preparation to call this from both pvcreate as well as pvchange, move the guts of metadataignore into a library function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-29 21:32:44 +00:00
Dave Wysochanski	559aee44ab	Add error message if backup_to_file fails because of empty in_use mdas list.	2010-06-29 15:03:59 +00:00
Dave Wysochanski	5778fdeeb8	Add more initializations of 'mda->flags' field. Mda allocation needs refactored into a single function but as an interim step, ensure mda->flags is initialized properly.	2010-06-29 14:52:56 +00:00
Dave Wysochanski	fa832e3a55	Attempt to fix intermittent failure with non-debug configured vgcfgbackup. There's an intermittent failure with vgcfgbackup that seems to have been introduced with the metadataignore / vgmetadatacopies patchset. Intermittent failures are often the result of uninitialized data, so this patch calls zalloc in a few places it might matter.	2010-06-29 13:29:53 +00:00
Dave Wysochanski	a9d8bf269a	Allow 'all' and 'unmanaged' values for --vgmetadatacopies. Allowing an 'all' and 'unmanaged' value is more intuitive, and provides a simple way for users to get back to original LVM behavior of metadata written to all PVs in the volume group. If the user requests "--vgmetadatacopies unmanaged", this instructs LVM not to manage the ignore bits to achieve a specific number of metadata copies in the volume group. The user is free to use "pvchange --metadataignore" to control the mdas on a per-PV basis. If the user requests "--vgmetadatacopies all", this instructs LVM to do 2 things: 1) clear all ignore bits, and 2) set the "unmanaged" policy going forward. Internally, we use the special MAX_UINT32 value to indicate 'all'. This 'just' works since it's the largest value possible for the field and so all 'ignore' bits on all mdas in the VG will get cleared inside _vg_metadata_balance(). However, after we've called the _vg_metadata_balance function, we check for the special 'all' value, and if set, we write the "unmanaged" value into the metadata. As such, the 'all' value is never written to disk. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:40:01 +00:00
Dave Wysochanski	a09a8efb66	Update check in vg_split_mdas to account for ignored mdas list. The check in vg_split_mdas will trigger an error if the 'from' vg list is empty. However, this might be ok in some instances now that we have ignored mdas. Relax this check so an error is triggered only in the case where there's truly no more mdas in the 'from' vg. One example of where this makes a difference is with vgreduce. If we try to vgreduce a PV with un-ignored mdas, this should trigger the balancing function to un-ignore mdas on another PV in the VG. However, we don't get to vg_write() before we fail because this list size check fails, and we see an error message indicating: "Cannot remove final metadata area ..." Another example is with vgsplit into a new VG, where the PVs being moved contain all ignored mdas. We must move the mdas on fid->metadata_areas_ignored from 'vg_from' to 'vg_to'. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:38:56 +00:00
Dave Wysochanski	f61cd7b249	Ensure fid mda lists are populated correctly during vgextend. The vgextend path calls add_pv_to_vg(). Inside add_pv_to_vg(), we must ensure we pass the correct mdas list into pv_setup(), as copies of mdas are placed on the vg->fid list. If we don't place the mdas on the correct vg->fid list, the various counts may be incorrect and the metadata balance algorithm will not work when called from vg_write() path. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:38:39 +00:00
Dave Wysochanski	1b54343328	Implement _vg_adjust_ignored_mdas and call from vg_write() path. Compare the value of the newly added vg_mda_copies field (--vgmetadatacopies parameter) with the current count of in-use mdas and ignoring or unignoring mdas as necessary to get to the target count. Also, as a safety check before returning, ensure we have at least one mda enabled. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:54 +00:00
Dave Wysochanski	3534fb40df	Add vg_mda_copies display field to 'vgs' command. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:23 +00:00
Dave Wysochanski	7042e06a2a	Make vg->mda_copies persistent in on disk vg metadata. This patch adds the ability to read/write the vg->mda_copies values from/to the vg metadata. If we read the VG metadata and this field does not exist, we set mda_copies to the default value of 0. Later in the code, we use this special '0' value to indicate a disable of metadata balancing. This should preserve existing LVM behavior and ensure metadata balancing can be turned off should the need arise. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:10 +00:00
Dave Wysochanski	821f0cc5ea	Add vg get/set methods for VG metadata copies. This patch adds the get and partially implemented set function. The 'set' function should probably ignore or un-ignore metadata areas based on new values. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:36:56 +00:00
Dave Wysochanski	88d7dc1af8	Add mda_copies to VG structures and initialization. Add a field to struct volume_group to later implement metadata balancing: - mda_copies: target # of non-ignored mdas in the VG; default 0 (do not control pv 'ignore mdas' bit. This patch just adds the parameter to the structures with the default values but does not modify any commands. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:36:37 +00:00
Dave Wysochanski	0f2f8a5c3a	Before committing each mda, arrange mdas so ignored mdas get committed first. Arrange mdas so mdas that are to be ignored come first. This is an optimization that ensures consistency on disk for the longest period of time. This was noted by agk in review of the v4 patchset of pvchange-based mda balance. Note the following example for an explanation of the background: Assume the initial state on disk is as follows: PV0 (v1, non-ignored) PV1 (v1, non-ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) If we did not sort the list, we would have a commit sequence something like this: PV0 (v2, non-ignored) PV1 (v2, ignored) PV2 (v2, ignored) PV3 (v2, non-ignored) After the commit of PV0's mdas, we'd have an on-disk state like this: PV0 (v2, non-ignored) PV1 (v1, non-ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) This is an inconsistent state of the disk. If the machine fails, the next time it was brought back up, the auto-correct mechanism in vg_read would update the metadata on PV1-PV3. However, if possible we try to avoid inconsistent on-disk states. Clearly, because we did not sort, we have a greater chance of on-disk inconsistency - from the time the commit of PV0 is complete until the time PV3 is complete. We could improve the amount of time the on-disk state is consistent by simply sorting the commit order as follows: PV1 (v2, ignored) PV2 (v2, ignored) PV0 (v2, non-ignored) PV3 (v2, non-ignored) Thus, after the first PV is committed (in this case PV1), on-disk we would have: PV0 (v1, non-ignored) PV1 (v2, ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) This is clearly a consistent state. PV1 will be read but the mda will be ignored. All other PVs contain v1 metadata, and no auto-correct will be required. In fact, if we commit all PVs with ignored mdas first, we'll only have an inconsistent state when we start writing non-ignored PVs, and thus the chances we'll get an inconsistent state on disk is much less with the sorted method. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:49 +00:00
Dave Wysochanski	77e0ed4be7	Refactor vg_commit() to add _vg_commit_mdas(). Factor out calling mda->ops->vg_commit() for each mda. No functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:33 +00:00
Dave Wysochanski	69d1732334	Update _vg_read and _text_create_text_instance to use fid_add_mda[s]. When we are constructing the vg, we may need to adjust the list of metadata_areas if there are ignored mdas. At label read time, we do not read the metadata of ignored mdas, and as a result, they do not get placed on vg->fid->metadata_areas inside _text_create_text_instance since lvmcache does not have these areas attached to vginfo->infos. However, when we're checking the pvids inside _vg_read, after having read another metadata area from another PV, we do have the opportunity to update the metadata_area and metadata_areas_ignored lists based on the read metadata_area. We need accurate mda lists for the reporting functions that count the ignored mdas, as well as general correctness of mda balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:17 +00:00
Dave Wysochanski	bb723d7897	Use mdas_empty_or_ignored() in place of checks for empty mda list. With the addition of ignored mdas, we replace all checks for an empty mda list with a new function to look for either an empty mda list or ignored mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:58 +00:00
Dave Wysochanski	f9c307cd07	Add mdas_empty_or_ignored() helper function. Add a helper function to consolidate checking for an empty mdas list or ignored mdas. Ignored mdas should behave almost identically to an empty mda list - the metadata areas should not be read or written to. This function will make it easier to implement metadata balancing and easier to track pvs with an empty mda list or ignored mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:40 +00:00
Dave Wysochanski	e6bd367b57	Implement ignore of mda if bit set by skipping r/w of metadata. We implement ignore of an mda at label_read time by checking for the ignore bit, and then skipping the reading of the vgname and other information in the metadata. This will have an effect similar to a PV found with no mdas. Thus, it will look like an orphan in the cache until we scan the rest of the system and find a PV with metadata, and the mda will not be on the vg->fid->metadata_areas list so no read/writes will be done to the metadata area. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:24 +00:00
Dave Wysochanski	cdbe475fe3	Define new functions and vgs/pvs fields related to mda ignore. Define a new pvs field, pv_mda_used_count, and a new vgs field, vg_mda_used_count to match the existing pv_mda_count and vg_mda_count. These new fields count the number of mdas that have the 'ignored' bit clear (they are in use on the PV / VG). Also define various supporting functions to implement the counting as well as setting the ignored flag and determining if an mda is ignored. These high level functions call into the lower level location independent mda ignore functions defined by earlier patches. Note that counting ignored mdas in a vg requires traversing both lists and checking for the ignored bit on the mda. The count of 'ignored' mdas then is defined by having the bit set, not by which list the mda is on. The list does determine whether LVM actually does read/write to the mda, though we must count the bits in order to return accurate numbers for the various counts. Also, pv_mda_set_ignored must search both vg lists for ignored mda. If the state changes and needs to be committed to disk, the ignored mda will be on the non-ignored list. Note also in pv_mda_set_ignored(), we must properly manage the mda lists. If we change the ignored state of an mda, we must change any mdas on vg->fid->metadata_areas that correspond to this pv. Also, we may need to allocate a copy of the mda, as is done when fid->metadata_areas is populated from _vg_read(), if we are un-ignoring an ignored mda. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:44 +00:00
Dave Wysochanski	9ccac021a7	Add metadata_areas_ignored list and functions to manage ignored mdas. Add a second mda list, metadata_areas_ignored to fid, and a couple functions, fid_add_mda() and fid_add_mdas() to help manage the lists. These functions are needed to properly count the ignored mdas and manage the lists attached to the 'fid' and ultimately the 'vg'. Ensure metadata_areas_ignored is initialized in other formats, even if the list is never used. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:22 +00:00
Dave Wysochanski	f55a20eb36	Rename fid->metadata_areas to fid->metadata_areas_in_use. Rename the metadata_areas list to an 'in_use' list to prepare for future 'ignored' list.	2010-06-28 20:32:44 +00:00
Dave Wysochanski	6b596f685f	Use vg_mda_count() in vgdisplay. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:32:21 +00:00
Dave Wysochanski	ef4fa155a5	Add mda location specific mda_copy constructor. Because of the way mdas are handled internally, where a PV in a VG has mdas on both info->mdas and vg->fid->metadata_areas list, we need a location independent copy constructor for struct metadata_area. Break up the existing format-text specific copy constructor into a format independent piece and a format dependent piece. This function is necessary to properly implement pv_set_mda_ignored(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:59 +00:00
Dave Wysochanski	29f24d4634	Add mda_locns_match() internal library function for mapping pv/device to VG mda. A metadata_area is defined independent of the location. One downside is that there is no obvious mapping from a pv to an mda. For a PV in a VG, we need a way to start with a PV and end up with an MDA, if we are to manage mdas starting with a device/pv. This function provides us a way to go down the list of PVs on a VG, and identify which ones match a particular PV. I'm not entirely happy with this approach, but it does fit into the existing structures in a reasonable way. An alternative solution might be to refactor the VG - PV interface such that mdas are a list tied to a PV. However, this seemed a bit tricky since a PV does not come into existence until after the list of mdas is constructed (see _vg_read() - we create a 'fid' and attach mdas to it, then we go through them and attach pvs). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:38 +00:00
Dave Wysochanski	a6b36a5901	Ensure in-memory state matches on-disk state of mda ignore bit. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:18 +00:00
Dave Wysochanski	09e0f43ba0	Allow raw_read_mda_header to be called from text_label.c. We'd like to pass in mda_header to vgname_from_mda(). In order to do this, we need to call raw_read_mda_header() from text_label.c, _text_read(), which gets called from the label_read() path, and peers into the metadata and update vginfo cache. We should check the disable bit here, and if set, not peer into the vg metadata, thus reducing the I/O to disk. In the process, move vgname_from_mda() to layout.h, since the fn only gets called from format_text code, and we need the mda_header definition from the private layout.h. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:01 +00:00
Dave Wysochanski	da0b4d8770	Move dev_open/dev_close outside vgname_from_mda(). Refactor vgname_from_mda() so caller must open/close the device. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:46 +00:00
Dave Wysochanski	96597c2eab	Move dev_open / dev_close outside _vg_read_raw_area(). This refactoring moves the device open/close up one level to the caller of _vg_read_raw_area(). Should be no functional change and facilitate future changes related to metadata balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:30 +00:00
Dave Wysochanski	322c5868b3	Add location independent flag and functions to ignore mdas. First we add a 'flags' field to the location independent metadata_area structure, and a MDA_IGNORE flag. The mda_is_ignored and mda_set_ignored functions are added to manage the flag. Adding the flag and functions gives a library interface to ignore metadata areas independent of the underlying location (disk, file, etc). The location specific read/write functions must then handle the specifics of what this flag means to the location. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:30:14 +00:00
Dave Wysochanski	d144d5eeb7	Add text format specific 'rlocn' ignore flag and access functions. Adding a flag to the 'rlocn' structure in the mda header of the text format allows us to flip a bit to ignore an area on disk that stores the metadata via the text format specific mda_header. This patch defines the flag and access functions to manage the flag. Other patches will manage the ignore on a format-independent basis, by using a flag in the metadata_area structure. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:57 +00:00
Dave Wysochanski	7c604e7649	Change 'filler' to 'flags' in on-disk 'raw_locn' structure. Future patches will make use of a specific flag in the on-disk 'raw_locn' structure to enable/disable metadata areas, and facilitate metadata balancing. Note that 'filler' is always set to '0' (see add_mda() - memset), so use of this area as a non-zero flags field is a safe way to provide future code features. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:42 +00:00
Jonathan Earl Brassow	68c31a2a36	Fix for bz608048 from Taka... The same region size is used for both mirror volume and mirrored log volume, but when the physical extent size is bigger than region size, the size of mirror leg for mirrored log is smaller than the region size and lvcreate command fails. This patch adjusts a region size of mirrored log to a smaller value of region size or physical extent size. [This patch ensures that the region_size of the mirrored log does not exceed the size of the mirrored log itself, which would violate the kernel constraint: (region_size <= ti->len).] Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-06-28 14:19:41 +00:00
Zdenek Kabelac	d301e5917f	Preload libc locale messages. Preload libc.mo file for localized lvm before taking memory lock - this way we prevent disk access for some error paths in libdm, that prints localized errno messages while they are still in memory locked state.	2010-06-24 08:29:30 +00:00
Jonathan Earl Brassow	42f7fd0590	The function that runs to compress a stacked mirror after converting from 2-way to 3-way mirror (collapse_mirrored_lv) was calling '_remove_mirror_images' with the 'remove_log' parameter set. When the code was put in to fix 599898 to honor log parameters during conversion, this argument was suddenly being honored. Thus, when someone would convert from a 2-way to 3-way mirror, the log would get removed. 'collapse_mirrored_lv' should not be calling '_remove_mirror_images' with 'remove_log' set.	2010-06-23 13:57:26 +00:00
Alasdair Kergon	07ae1d4943	Add lv_path to reports to offer full /dev pathname.	2010-06-23 12:32:08 +00:00
Milan Broz	f9e177d281	Fix "allocated" warning typo.	2010-06-22 21:10:53 +00:00
Dave Wysochanski	58f55600d0	Add device name to output of error messages in raw_read_mda_header(). It would be helpful if we had the device name when something like a mda_header checksum error occurs. Before: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0 After: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum on /dev/loop2 at offset 4096 PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0	2010-06-22 19:18:27 +00:00
Jonathan Earl Brassow	a7d355a28c	Mirrors can be layered - as in the case of an converting 2-way to 3-way mirror. When conversion operations are performed on these types of mirrors, log options can be confused/ignored. In the case of a converting 3-way mirror, we have a top-level 2-way corelog mirror whose legs are 1) a 2-way disk-log mirror and 2) a linear device. If we wish to convert this 3-way mirror to a 2-way mirror, the linear device is removed and the extra top layer is eliminated. If we also wished to convert the disk log to a core log in the same step, ambiguity creeps in. It is somewhat obvious what the user wants - a 2-way mirror with a corelog. However, looking at the top level mirror before compression, it seems that the mirror already has a core log. This is why the operation seemed to fail. This patch simply re-evaluates what mirrored_seg points to after a compression and then considers the log argument. This is a fix for bug 599898.	2010-06-21 16:12:33 +00:00
Alasdair Kergon	b4ee00356b	Various cleanups following recent commits.	2010-06-21 15:56:57 +00:00
Milan Broz	d2031f6a16	Clean up cluster lock mode and flags definition. Code is mixing up internal DLM and LVM definitions of lock modes and flags. OpenAIS and singlenode locking do not depend on DLM but code currently cannot be compiled without libdlm.h! LCK_* flags is LVM abstraction, used through all the code. Only low-level backend (clvmd-cman etc) should use DLM definitions, also this code should do all needed conversions. Because there are two DLM flags used in generic code (NOQUEUE, CONVERT) we define it similar way like lock modes. (So all needed binary-compatible flags are on one place in locking.h) (Further code cleaning still needed, though:-)	2010-06-17 12:48:54 +00:00
Zdenek Kabelac	b5c2529bf8	Use absolute paths in commands clvmd restart does not work at all if clvmd binary is not in current dir.	2010-06-03 13:50:26 +00:00
Dave Wysochanski	e9a87f2e16	Revert _init_rand() to reset errno - restores original init behavior.	2010-06-01 21:47:57 +00:00
Dave Wysochanski	e3a1fc8af3	Do not fail lvm_init() if init_logging() generates an errno. Revert to original behavior of lvm commands if init_logging() generated an errno. Fixes rhbz 592967.	2010-06-01 21:46:29 +00:00
Alasdair Kergon	d01888706d	Don't merge unchanged persistent cache file before dumping if tool scanned.	2010-06-01 19:02:12 +00:00
Peter Rajnoha	03023d3965	Fix incorrect memory pool deallocation while using vg_read for files. We create a separate pool "lvm2 vg_read" for vg_read and we don't use cmd->mem anymore.	2010-06-01 12:08:50 +00:00
Alasdair Kergon	ba61f84874	Replace strncmp kernel version number checks with proper ones	2010-05-24 23:11:34 +00:00
Alasdair Kergon	ed122a962d	Avoid selecting names under /dev/block if there is an alternative.	2010-05-24 22:53:48 +00:00
Alasdair Kergon	5782a38543	Choose between clustered log versions based on kernel version. Add fixmes for broken strcmp.	2010-05-24 17:46:47 +00:00
Alasdair Kergon	1493f24e3e	Update clustered log kernel module name to log-userspace.	2010-05-24 16:30:15 +00:00
Petr Rockai	d345bf2cd3	Account for mirror transient status when doing lvconvert --repair.	2010-05-24 15:32:20 +00:00
Zdenek Kabelac	4ef2bf27a7	Update Copyright date for resently modifed files	2010-05-24 09:04:27 +00:00
Zdenek Kabelac	591fc4d2d1	Replicator: add replicator to dtree Adding all replicator related LVs to dtree. Start of one replicator_dev initiate start of all other related.	2010-05-24 09:01:05 +00:00
Zdenek Kabelac	dc7d777606	Replicator: update _create_partial_dtree() Adding function _add_partial_replicator_to_dtree() to create partial tree for Replicator target. Using dm_tree_node_set_presuspend_node() for Replicator.	2010-05-21 14:47:58 +00:00
Zdenek Kabelac	9249fb12b8	Replicator: activate checks for missing vgs Activation needs to have all remote VGs locked to pass for replicator. So if activated LV is replicator-dev LV - read all remote VGs.	2010-05-21 14:34:01 +00:00
Zdenek Kabelac	b5f33faf9f	Replicator: lock_lv_vol() finds missing VGs Find and check for all needed VGs before calling lock_vol().	2010-05-21 14:29:49 +00:00
Zdenek Kabelac	65928349e7	Replicator: add read and release VGs for rsites Add functions to read and release remote VGs for replicator sites in activation context.	2010-05-21 14:07:16 +00:00
Zdenek Kabelac	f6d7e637c3	Add toolcontext.h header file.	2010-05-21 13:34:09 +00:00
Zdenek Kabelac	49e7eb3570	Remove files from wrong directory	2010-05-21 13:18:25 +00:00
Zdenek Kabelac	4c77dcc4ea	Right directory name for replicator files. My local CVS was placing file in slightly different directory by using obsolete files.	2010-05-21 13:17:20 +00:00
Zdenek Kabelac	0e6270e308	Hmm - fixing cvs import mistake	2010-05-21 13:14:02 +00:00
Zdenek Kabelac	6222635b38	Replicator: add find_replicator_vgs Adding find_replicator_vgs() function to find all needed VGs for replicator-dev LV. This function is later called before taking lock_vol().	2010-05-21 12:55:25 +00:00
Zdenek Kabelac	12569ccb03	Replicator: add sorted cmd_vg list Introduce struct cmd_vg to store information about needed volume group name, vgid, flags and the pointer to opened VG. Keep VGs list in alphabetical order for locking order. Introduce functions: cmd_vg_add() add new cmd_vg entry. cmd_vg_lookup() search cmd_vgs for vg_name. cmd_vg_read() open VGs in cmd_vgs list. cmd_vg_release() close VGs in reversed order.	2010-05-21 12:52:01 +00:00
Zdenek Kabelac	0a02d30ea4	Replicator: extend volume_group with list of VGs and flag Add pointer to linked list of opened VGs. List temporarily keeps the information about needed or locked and opened VGs for replicator target. Also add cmd_missing_vgs flag information for quick check and also for possible continuos process_each_lv() usage where we need to detect whether failure has been caused by missing VG or some other reason.	2010-05-21 12:47:46 +00:00
Zdenek Kabelac	e86e45f7ea	Replicator: extend _lv_each_dependency() with dependencies for Replicator devices	2010-05-21 12:45:18 +00:00
Zdenek Kabelac	651cae3c5c	Replicator: check replicator segment Check for possible problems within replicator structures. Used also by vg_validate.	2010-05-21 12:43:02 +00:00
Zdenek Kabelac	1207106fbc	Replicator: new files for Replicator target	2010-05-21 12:40:05 +00:00
Zdenek Kabelac	8fea97b7e7	Replicator: base lvm2 support Adding configure.in support for Replicators. Adding basic lib lvm support for Replicators. Adding flags REPLICATOR and REPLICATOR_LOG. Adding segments SEG_REPLICATOR and SEG_REPLICATOR_DEV. Adding basic methods for handling replicator metadata.	2010-05-21 12:36:30 +00:00
Alasdair Kergon	21c56e377b	backup->cache	2010-05-20 22:32:44 +00:00
Alasdair Kergon	ae6164cee3	If unable to obtain snapshot percentage leave value blank on reports.	2010-05-20 22:24:33 +00:00
Alasdair Kergon	3399ae320a	Add install_system_dirs makefile target. Add configure options for system and locking directories.	2010-05-20 13:47:21 +00:00
Dave Wysochanski	dd2a0e940d	Add find_vgname_from_{pvname\|pvid} functions. Some commands start with a pvname, but we'd like to force users to start with a vg handle to obtain a pv handle. Our best option seems to be providing a way to look up the vgname from the pvname, and then require them to use vg_read/vg_open. In addition to the pvname lookup function, this patch also provides a lookup by pvid. The lookup by pvid can be used in conjunction with lvmcache_get_pvids to process all pvs in the system. The pvid find function first calls lvmcache_vgname_from_pvid, which may cause the label to be read if it is not in the cache. If the vgname is returned is an orphan, we then check to see if there are metadata areas, and if not, we scan every PV on the system by calling scan_vgs_for_pvs(). In most cases we should not need to do this, and by using the info->mdas count, we avoid calling pv_read() as prior code did. So this patch is a bit cleaner and should allow us to refactor more of the pv code. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-05-19 11:52:37 +00:00
Dave Wysochanski	cfb203e14d	Add lvmcache_vgname_from_pvid(). Add lvmcache function to lookup a vgname from a pvid.	2010-05-19 11:52:21 +00:00
Dave Wysochanski	dd4097a6a4	Add pvid_from_devname() lvmcache function. Add supporting function for mappings from devname -> pvid -> vgname.	2010-05-19 11:52:07 +00:00
Alasdair Kergon	1d837442bf	Add is_global_vg and split out from is_orphan_vg.	2010-05-19 02:36:33 +00:00
Alasdair Kergon	34220fe292	Validate orphan and VG_GLOBAL lock order too.	2010-05-19 02:08:50 +00:00
Alasdair Kergon	8c965cafb7	Note that orphan lock is always obtained last	2010-05-19 01:49:08 +00:00
Alasdair Kergon	fa305e2ec6	Accept orphan VG names as parameters to lock_vol() and related functions.	2010-05-19 01:16:40 +00:00
Alasdair Kergon	24d21cfcee	Use is_orphan_vg in place of hard-coded prefix tests.	2010-05-19 00:52:55 +00:00
Alasdair Kergon	a988efda46	pre-release	2010-05-17 18:39:01 +00:00
Jonathan Earl Brassow	a932c2b61f	Disallow toggling the cluster attribute of a volume group if there are active mirrors or snapshots. We don't have the mechanisms in place to change the device-mapper tables for those targets that have behavioral differences between cluster and single machine instances. Allowing users to change the attribute but not changing the target's behavior can lead to data corruption. The following bugs are fixed/avoided by this patch: 235123 - vgchange -c [ny] do not change target types when necessary 289331 - RFE: switching from cluster domain to local domain needs to deactivate volume somehow 289541 - when changing from local to cluster, volumes can not appear to be deactivated	2010-05-14 15:19:42 +00:00
Milan Broz	ea9e387f87	Another one internal device layer fix...	2010-05-14 12:39:52 +00:00
Milan Broz	b8fb120ee0	Fix empty layer detection is scan devices.	2010-05-14 12:30:43 +00:00
Milan Broz	b449f0a188	Fix device_is_usable to properly detect only internal LV names.	2010-05-14 12:03:32 +00:00
Milan Broz	28e2b5b213	Skip also special lvm devices in scan (if ignore suspended is used). This should avoid various races between dmeventd on multiple nodes in cluster where one node already repairing device and another run full scan and locks the device.	2010-05-13 18:38:38 +00:00
Milan Broz	9ad39e546b	Currently if clvmd is running and user issues vgscan, the device cache file is dumped both in vgscan and clvmd process. Unfortunately, clvmd calls lvmcache_label_scan, it properly destroys persistent filter, but during persistent_filter_dump it merges old cache content back! This causes that change in filters is not properly propagated into device cache after vgscan on cluster. (Only new devices are added.) https://bugzilla.redhat.com/show_bug.cgi?id=591861	2010-05-13 13:04:03 +00:00
Peter Rajnoha	d50c6d4bf5	Add new --sysinit option for vgchange and lvchange. A shortcut for --ignorelockingfailure, --ignoremonitoring, --poll n options and LVM_SUPPRESS_LOCKING_FAILURE_MESSAGES environment variable used all at once in initialisation scripts (e.g. rc.sysinit or initrd).	2010-05-06 11:15:55 +00:00
Zdenek Kabelac	180b3dea11	Install plugins to subdirs Target install_dm_plugin installs files to libdir/device-mapper. Target install_lvm2_plugin installs files to libdir/lvm2. Both targets creates relative links to libdir to keep the code compatible with current dlopen handling. Once we will be able to read plugins from subdir, links could be removed.	2010-05-06 10:07:46 +00:00
Petr Rockai	9409998d71	Suppress duplicate error messages about read failures and missing devices.	2010-05-05 22:37:52 +00:00
Zdenek Kabelac	244646902c	Initialise _vginfos list staticaly so there is no problem with using uninitialised variables in case, lvmcache_destory() is called without lvmcache_init().	2010-04-30 12:54:31 +00:00
Zdenek Kabelac	7147cd9fe5	Release pools for regex if there is error during processing (fixes error messages about unreleased pools).	2010-04-30 12:37:04 +00:00
Zdenek Kabelac	8889fda53d	Show string with invalid pattern and start error message with capital letter.	2010-04-30 12:31:32 +00:00
Alasdair Kergon	54d7741a3e	Remove no-longer-used arg_ptr_value. Fix -M and --type to use strings not pointers that change on config refresh.	2010-04-29 01:38:12 +00:00
Jonathan Earl Brassow	56a5925aed	Fix comment from last commit. Additionally, there is no need to put a comment into the WHATS_NEW file if it is a regression that was created and fixed inside the same release window.	2010-04-27 15:26:58 +00:00
Jonathan Earl Brassow	d7c9d72390	Patch to fix bug 586021 and mantain historical behavior of being able to remove more images from a mirror than the number of PVs directly specified for removal. The effort to fix bug 581611 corrected a bug that was unnoticed at the time. The loop in _remove_mirror_images that looks over the specified PVs was allowing devices that were previously counted and moved to the end of the list to be double-counted. This resulted in the number of devices needed for removal always being satisfied - even if the user did not specify enough PVs for removal to satisfy the request. When 581611 was fixed, this double-counting no longer took place and the result was to remove only the minimum of the number of PVs specified or the number that was asked to be removed. By simply always setting 'new_area_count' (as used to be done only in the else statement), we return to the previous behavior. Indeed, this is exactly what the double-counting was allowing to happen before the fix of 581611.	2010-04-27 14:57:49 +00:00
Mike Snitzer	60267bdce8	Disallow the direct removal of a merging snapshot. Allow lv_remove_with_dependencies() to know the top-level LV that was requested to be removed (otherwise it recurses and we lose context). A merging snapshot cannot be removed directly but the associated origin can be. Disallow removal of a merging snapshot unless the associated origin is also being removed.	2010-04-23 19:27:10 +00:00
Peter Rajnoha	6ddb5ecd72	Set appropriate udev flags for reserved LVs. There's no need for foreign udev rules to touch LVM reserved devices (snapshot, pvmove, _mlog, _mimage, _vorigin) even if they happen to be visible. The same applies for /dev/disk content - no need to create any content for these devices (and so no need to run any "blkid" etc.). This also prevents setting any inotify "watch" from udev rules on such devices that is a source of race conditions (the rules need to honor DM_UDEV_DISABLE_OTHER_RULES_FLAG for this to work though).	2010-04-23 14:16:32 +00:00
Mike Snitzer	1f661c5dd8	When removing a snapshot avoid preloading the origin if the snapshot-merge target is not active.	2010-04-23 02:57:39 +00:00
Jonathan Earl Brassow	66f79d05eb	Disallow the primary mirror image from being removed when the mirror is not in-sync. This restriction is not extended to repair operations (i.e. it will not limit what 'lvconvert --repair' can do).	2010-04-21 13:55:08 +00:00
Alasdair Kergon	ee90b8197f	Move function up file	2010-04-20 12:14:28 +00:00
Peter Rajnoha	1e696b0c15	Do not reset position in metadata ring buffer on vgrename and vgcfgrestore. We should write metadata into next position in the ring buffer while calling vgrename and vgcfgrestore. At this code level (_vg_write_raw), we were not able to determine if this is a rename or not. If yes, then accompanying VG structure passed here has a new name set, not the old one. When looking for a location where to put metadata next, we were given a NULL value because of failed VG name comparison (in _find_vg_rlocn) between the name in existing metadata and metadata we're just about to write. This resets the position in the ring buffer, overwriting any existing metadata (and also incorrectly updates the cache to "orphan" afterwards). This patch just adds old_name item in struct volume_group that we can check and use if necessary and detect renames at lower layers as well. The same applies for vgcfgrestore, but here we're using a special value of old_name, an empty string, to disable the check with existing metadata totally.	2010-04-14 13:09:16 +00:00
Peter Rajnoha	bda3982016	Use UUIDs instead of names while processing event handlers. Internally, we used DM names instead of UUIDs while processing event handlers. This caused problems while trying to vgrename a VG with active LVs where the names are being changed and so the devices were not found then. The patch also contains a little bit of refactoring, moving "build_dlid" code found in dev_manager.c to "build_dm_uuid", now in lvm-string.c (so we have build_dm_uuid and build_dm_name at one place).	2010-04-14 13:01:38 +00:00
Dave Wysochanski	af46c894d0	Add pv->vg to solidify link between a pv and a vg. lvm2app needs a link back to the vg in order to use the vg handle for memory allocations as well as other things. This patch adds the field to struct physical_volume, and sets pv->vg when reading a vg from disk or extending a vg by using the helper function previously added, add_pvl_to_vgs(). Moves and renames are handled with separate code inside move_pv() and vgmerge(). Add pv->vg check to vg_validate(). A NULL value in pv->vg signifies membership in the orphan VG. Note though in the case of pv_read() on a device with metadatacopies == 0, more devices may need to be read for an authoritative answer. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-13 17:26:36 +00:00
Dave Wysochanski	11647ad01c	Use del_pvl_from_vgs() in vgreduce paths. Somehow these got missed in earlier patches. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-13 17:26:20 +00:00
Dave Wysochanski	0adfbfd5ea	Call add_pvl_to_vgs() and del_pvl_from_vgs() from more places. Now that we have library functions to add/delete a pv from the vg->pvs list, call them from everywhere. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-13 17:26:03 +00:00
Dave Wysochanski	8cfd64de78	Add del_pvl_from_vgs() and move prototypes into metadata-exported.h Add a delete function to manage the vg->pvs list. NOTE: It may be possible to do further cleanup to these add/del functions by passing a 'pv' as input instead of 'pv_list'. The pv_list is used for functions which do allocations (lvcreate) while other places in the code just manage a list of 'pv' (e.g. import functions, vgextend, etc). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-13 17:25:44 +00:00
Dave Wysochanski	a6a4941504	Move increment of vg->pv_count from import_pool_vg() to import_pool_pvs(). Move the increment of vg->pv_count next to the place where we add to vg->pvs. It looks safe to do this since the only caller of import_pool_vg() calls import_pool_pvs() immediately afterward, and there is no way import_pool_vg() can fail (always returns 1). However, if there's a memory allocation failure inside import_pool_pvs(), we will end up with a different count in vg->pv_count that with the original code. In any case, vg->pv_count should be as close to dm_list_size(&vg->pvs) as possible, as is the case everywhere else in the code. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-13 17:25:26 +00:00
Dave Wysochanski	f6644ff8fe	Remove unnecessary parameter from import_pool_pvs(). The dm_list * parameter is unnecessary since we are passing in 'vg' and the only caller of import_pool_pvs() passes '&vg->pvs' in the dm_list * parameter. Just use vg->pvs directly in the function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-13 17:25:13 +00:00
Milan Broz	e4594d9c23	Mask LCK_HOLD flag in cluster VG locks for compatibility reasons.	2010-04-13 14:36:24 +00:00
Alasdair Kergon	68176be18e	Add --stripes to lvconvert too.	2010-04-13 01:54:32 +00:00
Alasdair Kergon	0693099223	Don't ignore error if resuming any LV fails in resume_lvs. Skip closing persistent filter cache file if open failed.	2010-04-12 11:52:53 +00:00
Zdenek Kabelac	23b059e7b7	INSTALL rules updates Patch is inspired by Debian's extra patch. - removes OWNER & GROUP make vars they are parts of INSTALL command. - adds INSTALL_PROGRAM for executable, uses $(INSTALL) - adds INSTALL_DATA for non-executable data, uses ($INSTALL) - adds INSTALL_WDATA for writable non-executable data, uses ($INSTALL) - adds configure option --enable-write_install - to support installatin of writable files used by distribution - replaces usage of ifeq @LIB_SUFFIX@ with $(LIB_SUFFIX) - installs .a files from static builds without executable flag - installs .a files to $(usrlibdir) instead of $(libdir) - installs all static binaries to $(staticdir) - create .so links for devel package in $(usrlibdir) instead of $(libdir) - makes .so and .so.LIB_VERSION files within builddir - removes VERSIONED_SHLIB and created versioned LIB_SHARED automagicaly - install LIB_SHARED via install_lib_shared target - install plugins via install_lib_shared_plugin target - prints whole 'install' command during installation instead of less informative "Installing $(something) $(somewhere)" - install multiple man pages with one INSTALL command - use DISTCLEAN_TARGETS instead of creating multiple distclean targets	2010-04-09 21:42:48 +00:00
Zdenek Kabelac	c737d34804	Use vpath instead of VPATH. Usage of VPATH makes troubles when used within $(builddir). Not only source files are being found through VPATH, but targets as well. (make --debug=v) Thus if user builds the code in $(srcdir) and also in some $(builddir) he gets mangled results as some generated files (i.e. .export.sym) are 'reused' from $(srcdir) instead of $(builddir). This patch switches to use vpath were we could explicitly name suffixes that should be looked via vpath - we must take care, we do not generate files with these suffixes: .c, .in, .po, .exported_symbols	2010-04-09 21:34:25 +00:00
Alasdair Kergon	1485ce69c4	Permit mimage LVs to be striped in lvcreate and lvresize.	2010-04-09 01:00:10 +00:00
Dave Wysochanski	fddc256a02	Check for duplicate paths (pvids) on the commandline of vgcreate. A user specifying duplicate paths on the cmdline of vgcreate will get a message similar to the following: vgcreate vgtest2 /dev/loop3 /dev/loop5 Found duplicate PV jk1lXsKzwyOKlXq6bhaFFKMQQ06oPgu8: using /dev/loop5 not /dev/loop3 Found duplicate PV jk1lXsKzwyOKlXq6bhaFFKMQQ06oPgu8: using /dev/loop3 not /dev/loop5 Internal error: Duplicate PV id jk1lXs-Kzwy-OKlX-q6bh-aFFK-MQQ0-6oPgu8 detected for /dev/loop3 in vgtest2. This is caught by vg_validate(), but it would be good to find this condition earlier in the vgcreate code. add_pv_to_vg() currently checks by pvname, but does not look for duplcate pvids. This patch adds the check for duplicate pvids and results in new error output as follows: vgcreate vgtest2 /dev/loop3 /dev/loop5 Found duplicate PV jk1lXsKzwyOKlXq6bhaFFKMQQ06oPgu8: using /dev/loop5 not /dev/loop3 Found duplicate PV jk1lXsKzwyOKlXq6bhaFFKMQQ06oPgu8: using /dev/loop3 not /dev/loop5 Physical volume '/dev/loop5 (jk1lXs-Kzwy-OKlX-q6bh-aFFK-MQQ0-6oPgu8)' listed more than once. Unable to add physical volume '/dev/loop5' to volume group 'vgtest2'. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-08 15:18:35 +00:00
Alasdair Kergon	4d0e07a799	missing ?:	2010-04-08 00:56:26 +00:00
Alasdair Kergon	b3302a0c3c	suppress bogus compiler warning	2010-04-08 00:52:41 +00:00
Alasdair Kergon	aab7a3978b	Fix pvmove allocation to take existing parallel stripes into account. When moving parts of striped LVs, pvmove wouldn't care about leaving you with two stripes on the same disk. Now --alloc anywhere is needed for that. (Tried and gave up on two alternative approaches before the one committed here.)	2010-04-08 00:28:57 +00:00
Alasdair Kergon	f55021f487	Fix incorrect removal of symlinks after LV deactivation fails.	2010-04-07 20:04:41 +00:00
Alasdair Kergon	6d25082d65	Fix is_partitioned_dev not to attempt to reopen device.	2010-04-06 17:36:41 +00:00
Dave Wysochanski	9e82787da2	Add add_pvl_to_vgs() - helper function to add a pv to a vg list. Small refactor of main places in the code where a pv is added to a vg into a small function which adds the pv to the list and updates the vg counts. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:54 +00:00
Dave Wysochanski	b4a9a2d313	Refactor format1 vg->pvs list add and vg->pv_count. Refactor adding to the vg->pvs list and incrementing the count, which will allow further refactoring. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:20 +00:00
Dave Wysochanski	36e9d03d1b	Refactor _read_pv() code that updates vg->extent_count and vg->free_count. Simple refactor to mov code that updates the vg extent counts from a single pv's counts close to the code that adds a pv to vg->pvs and updates vg->pv_count. No functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:04:03 +00:00
Dave Wysochanski	53ad3cad14	Add pv to vg->pvs after check for maximum value of vg->extent_count. In add_pv_to_vg(), we should only add the pv to vg->pvs after all internal checks have passed. The check for vg->extent_count exeeding maximum was after we added the pv to the list, so this function could return a state of vg->pvs that did not reflect other parameters such as vg->pv_count. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-04-06 14:03:43 +00:00
Alasdair Kergon	d27c8b5660	remove compiler warning	2010-04-02 01:35:34 +00:00
Alasdair Kergon	6ec52a01ee	A few more log_error to log_warn changes for mirrors.	2010-04-01 14:54:37 +00:00
Zdenek Kabelac	ddf19eae0f	Better debug message for Un/Locked memory size.	2010-04-01 14:53:47 +00:00
Alasdair Kergon	46bd6d7a2f	temporarily downgrade the 'open while suspended' error till we fix it properly	2010-04-01 14:30:51 +00:00
Alasdair Kergon	abb9fb8370	Try to fix tracking of whether or not log extents need allocating.	2010-04-01 13:58:13 +00:00
Zdenek Kabelac	39b6ac476d	Set ret value to success initially.	2010-04-01 13:43:12 +00:00
Alasdair Kergon	0c67893ce9	Avoid endless loop if lv->segments list is corrupted	2010-04-01 13:08:06 +00:00
Alasdair Kergon	e7159c828b	initialise log_allocated to 0	2010-04-01 12:29:07 +00:00
Alasdair Kergon	d723636d52	Limit number of error messages when checking LV segments.	2010-04-01 12:14:20 +00:00
Alasdair Kergon	a1192f17ba	Improve vg_validate to detect some loops in lists.	2010-04-01 11:45:36 +00:00
Alasdair Kergon	0640232acd	Improve vg_validate to detect some loops in lists.	2010-04-01 11:43:24 +00:00
Alasdair Kergon	258db3ad8e	Change most remaining log_error WARNING messages to log_warn.	2010-04-01 10:34:09 +00:00
Alasdair Kergon	bce2869d92	Attempt to fix non-ALLOC_ANYWHERE allocation code after recent changes broke The preference given to the PVs with the largest free areas.	2010-03-31 20:26:04 +00:00
Milan Broz	d7cbaae1fd	Always use blocking lock for VGs and orphan locks. Because we have now strong rule for lock ordering: - VG locks must be taken in alphabetical order - ORPHAN locks must be the last vgs_locked() is now not needed. This fixes problem with orphan locking, e.g. vgremove VG1 \| vgremove VG2 lock(VG1) \| lock(VG2) lock(ORPHAN) \| lock(ORPHAN) -> fail, non-blocking https://bugzilla.redhat.com/show_bug.cgi?id=578413 (More similar places in code.)	2010-03-31 17:23:56 +00:00
Milan Broz	6733116a19	Fix all segments memory is allocated from vg private mempool. Physical segments were still allocated from global command context mempool. This leads to very high memory usage when activating large VG (vgchange). (Memory usage was about 2G when >3000LVs). Fix it by properly using vg->vgmem private pool, so all the memory is released early. New memory pool parameter is needed here for pv_split_segment function. Also fix the same problem in some minor allocations (vg description, lv segment split).	2010-03-31 17:23:18 +00:00
Milan Broz	0423887528	Do not traverse PV segment list twice. In addition to previous patch, we really do not need to search for segment which was just allocated in split request. Make pv_split_segment function return newly allocated (split) segment also. (So after this patch, there is only one user of slow find_peg_by_pe).	2010-03-31 17:22:26 +00:00
Milan Broz	80b96a8974	Optimise PV segments search. The function find_peg_by_pe is incredibly inefficient for Pvs with many segments. In shiny future there should be binary (or interval) tree instead of sorted linked list (volunteers?). Anyway, for now, we can use dirty trick here to optimise this case: - Allocations are usually applied from the beginning of PV (we have no alloocation policy which allocates areas "backwards") - The only user of find_peg_by_pe is pv_split_segment() call. In most cases it need to split last PV segment. So if we search sorted pv segment list backwards, we hit the requested segment immediatelly. This patch applies this tiny change. (and saves >30% of processing time when >3000LVs segments are on one PV!) To discourage using this inefficient function from other code, it is moved to pv_manip.c and used static for now:-)	2010-03-31 17:21:40 +00:00
Milan Broz	c8b0988586	Remove vg_validate call when parsing cached metadata. vg_validate call is an adept to optimisation, it is very ineeficient and slow. Anyway, we should call it only before writing data to disk. The call in lvmcache was just temporary validation, we realy do not need to revalidate cached metadata every time. (Actually, I added that there just to prove that cache works properly and forgot to remove it.) Patch removes it from lvmcache completely, this can hit only internal bug in export function (and this bug must be detected in any vg_write call anyway before).	2010-03-31 17:20:44 +00:00
Milan Broz	d59a2b6109	Use hash table for quick lv reference when reading metadata. The _read_vg uses already hash for PVs to optimise reading of large VGs and avoiding repeated PV list traversing. Use the same aproach to speed up parsing VG with many LVs.	2010-03-31 17:20:02 +00:00
Mikulas Patocka	655849fb14	A missing space in the error message. Add missing parentheses to an error message Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>	2010-03-31 12:06:30 +00:00
Zdenek Kabelac	5de52e34f0	Count only readable size for memlock stats. As we mlock() only readable pages, makes statistics only for readable bytes.	2010-03-30 14:41:58 +00:00
Zdenek Kabelac	3f67b36b35	Update memlock Code moves initilization of stats values to _memlock_maps(). For dmeventd we need to use mlockall() - so avoid reading config value and go with _use_mlockall code path. Patch assumes dmeventd uses C locales! Patch needs the call or memlock_inc_daemon() before memlock_inc() (which is our common use case). Some minor code cleanup patch for _un/_lock_mem_if_needed().	2010-03-30 14:41:23 +00:00
Alasdair Kergon	1dee5eb625	Fix --alloc contiguous policy only to allocate one set of parallel areas.	2010-03-29 17:59:46 +00:00
Zdenek Kabelac	b7be589ed0	Fixing another set of distclean problems where we left some generated files in clvmd, dmevend, man, tests. Don't include dependency files for clow and cscope.out targets Improve dependency tracking for dmeventd and liblvm2cmd sources.	2010-03-29 14:17:59 +00:00
Zdenek Kabelac	b41f5924bf	Update cflow file generation - support build dir and use $(top_srcdir) to obtain sources. Create make.tmpl target for simplier generation of cflow files with the help of CFLOW_LIST, CFLOW_LIST_TARGET, CFLOW_TARGET. Still cflow usage is not perfect.	2010-03-29 14:11:17 +00:00
Zdenek Kabelac	1a91d0914e	distclean fixes Move daemons/ and lib/ subtargets to their Makefiles so we don't get double cleanup error during execution of distclean target. Instead of duplicating clean target inside distclean target, just use it as a subtarget and avoid add duplicating code.	2010-03-29 14:09:25 +00:00
Jonathan Earl Brassow	7a369d3704	Add ability to create mirrored logs for mirror LVs. This check-in enables the 'mirrored' log type. It can be specified by using the '--mirrorlog' option as follows: #> lvcreate -m1 --mirrorlog mirrored -L 5G -n lv vg I've also included a couple updates to the testsuite. These updates include tests for the new log type, and some fixes to some of the lvconvert tests.	2010-03-26 22:15:43 +00:00
Mike Snitzer	7b0f529d3e	Fix clvmd cluster propagation of dmeventd monitoring mode. clvmd's do_lock_lv() already properly controls dmeventd monitoring based on LCK_DMEVENTD_MONITOR_MODE in lock_flags -- though one small fix was needed for this to work: _lock_for_cluster() must treat dmeventd_monitor_mode()'s return as a tri-state value. Also cleanup do_lock_lv() to: - explicitly init_dmeventd_monitor() based on LCK_DMEVENTD_MONITOR_MODE - no longer reset init_dmeventd_monitor() to default at the end of do_lock_lv() -- it is unnecessary	2010-03-26 15:40:13 +00:00
Alasdair Kergon	2abbc07f3c	Allow ALLOC_ANYWHERE to split contiguous areas.	2010-03-25 21:19:26 +00:00
Alasdair Kergon	a7ca334681	Add some assertions to allocation code.	2010-03-25 18:16:54 +00:00
Alasdair Kergon	f4cea344b1	improve a few comments in last check-in	2010-03-25 02:40:09 +00:00
Alasdair Kergon	8d6722c8ad	Introduce pv_area_used into allocation algorithm and add debug messages. This is the next preparatory step towards better --alloc anywhere support and is not intended to break anything that currently works so please report any problems - segfaults, bogus data in the new debug messages, or if the code now chooses bizarre allocation layouts.	2010-03-25 02:31:48 +00:00
Mike Snitzer	a6bc975a24	Improve activation monitoring option processing . Add "monitoring" option to "activation" section of lvm.conf . Have clvmd consult the lvm.conf "activation/monitoring" too. . Introduce toollib.c:get_activation_monitoring_mode(). . Error out when both --monitor and --ignoremonitoring are provided. . Add --monitor and --ignoremonitoring support to lvcreate. Update lvcreate man page accordingly. . Clarify that '--monitor' controls the start and stop of monitoring in the {vg,lv}change man pages. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-03-23 22:30:18 +00:00
Petr Rockai	a2b6bbdfb2	Also honour abort_on_internal_errors when log_fn is set.	2010-03-23 18:18:49 +00:00
Alasdair Kergon	36f9d53b60	Allow dynamic extension of array of areas selected as allocation candidates.	2010-03-23 15:07:55 +00:00
Peter Rajnoha	5161ecb98d	Autoreconf. (Strictly require libudev if udev_sync is used)	2010-03-23 14:44:42 +00:00
Dave Wysochanski	15fdc8d3ee	Avoid scanning all pvs in the system if operating on a device with mdas. When we pv_read() a device that has an orphan vgname, we might need to scan the system to be sure this is true. However, if the PV has mdas, there's no way possible for it to have an orphan vgname unless it is a true orphan. Some areas of the code were optimized to take advantage of this fact, while others were not (we would still do the expensive scan if a device had mdas but had an orphan VG). This patch unifies the code so that every place we are operating on such a PV, we skip the expensive scan if there are mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Acked-by: Petr Rockai <prockai@redhat.com> Acked-by: Alasdair G Kergon <agk@redhat.com>	2010-03-18 17:29:12 +00:00
Alasdair Kergon	1091650a79	autoreconf & add missing WHATS_NEW entry	2010-03-18 13:24:35 +00:00
Petr Rockai	649c45078f	Add infrastructure for running the functional testsuite with locking_type set to 3, using a local (singlenode) clvmd.	2010-03-18 09:19:30 +00:00
Milan Broz	acb4b5e4de	Fix pvcreate device check. If user try to vgcreate or vgextend non-existent VG, these messages appears: # vgcreate xxx /dev/xxx Internal error: Volume Group xxx was not unlocked Device /dev/xxx not found (or ignored by filtering). Unable to add physical volume '/dev/xxx' to volume group 'xxx'. Internal error: Attempt to unlock unlocked VG xxx. (the same with existing VG and non-existing PV & vgextend) # vgextend vg_test /dev/xxx ... It is caused because code tries to "refresh" cache if md filter is switched on using cache destroy. But we can change filters and rescan even without this machinery now, just use refresh_filters (and reset md filter afterwards). (Patch also discovers cache alias bug in vgsplit test, fix it by using better filter line.)	2010-03-17 14:44:18 +00:00
Alasdair Kergon	0a5182fc97	Suppress repeated errors about the same missing PV uuids. Bypass full device scans when using internally-cached VG metadata.	2010-03-17 02:11:18 +00:00
Alasdair Kergon	f168869702	fix last checkin	2010-03-16 19:06:57 +00:00
Alasdair Kergon	b1f9a2f5d1	Only do one full device scan during each read of text format metadata.	2010-03-16 17:30:00 +00:00
Alasdair Kergon	38220f9fe9	Remove unnecessary full_scan parameter from get_vgids and get_vgnames calls.	2010-03-16 16:57:03 +00:00
Alasdair Kergon	cccae7e633	Look up missing PVs by uuid not dev_name in _pvs_single to avoid invalid stat. Make find_pv_in_vg_by_uuid() return same type as related functions.	2010-03-16 15:30:48 +00:00
Alasdair Kergon	770dc81b8e	Introduce is_missing_pv().	2010-03-16 14:37:38 +00:00
Alasdair Kergon	415feb2f44	some missing debug messages	2010-03-09 12:31:51 +00:00
Zdenek Kabelac	d0c3da55a0	Update comments for selecting maps Use dm_snprintf and check result whether we create correct /proc path name	2010-03-09 10:25:50 +00:00
Alasdair Kergon	f0f43bc093	Misc cleanups in the new mlock code, incl. improving some variable names & messages; using more statics (for now) to avoid redundant recalculation; validating config file just once on loading; keeping maps file open.	2010-03-09 03:16:11 +00:00
Zdenek Kabelac	4e27a85a2b	Use mlock() only on 'r' memory maps	2010-03-08 17:14:21 +00:00
Zdenek Kabelac	26ade5f27e	Unconditionaly ignore also Virtual Dynamically-linked Shared Object (VDSO on 32bit is VSyscall on 64bit) It seems it could be locked on 64bit kernels running 32bit binaries, but it makes troubles on real 32bit machines where mlock() returns error when trying to lock such map area. (0xffffe000) Behavior of mlockall() seems to be similar.	2010-03-08 15:55:52 +00:00
Zdenek Kabelac	c900819b4e	Use '_' prefix for local static variable.	2010-03-05 15:14:03 +00:00
Zdenek Kabelac	18b82048e4	mlockall() -> mlock() This patch adds a new implementation of locking function instead of mlockall() that may lock way too much memory (>100MB). New function instead uses mlock() system call and selectively locks memory areas from /proc/self/maps trying to avoid locking areas unused during lock-ed state. Patch also adds struct cmd_context to all memlock() calls to have access to configuration. For backward compatibility functionality of mlockall() is preserved with "activation/use_mlockall" flag. As a simple check, locking and unlocking counts the amount of memory and compares whether values are matching.	2010-03-05 14:48:33 +00:00
Zdenek Kabelac	539f4a7728	Readline linking update Modify linking of readline library. Create new substituted varible READLINE_LIBS - readline library is linked ONLY with tools that really use it - i.e. lvm. (Static lvm does not use readlin). Previous behaviour put this library into the variable LIBS and thus linked it with all created object files of lvm project (i.e. plugins...). READLINE detection is simplified. Termcap library is linked in only if readline library doesn't have its own dependency (i.e. old distributions).	2010-03-04 11:19:15 +00:00
Zdenek Kabelac	814aebc4e9	Use $(top_builddir) for inclusion of make.tmpl in Makefiles.	2010-03-04 09:51:37 +00:00
Mike Snitzer	c485fe183e	Handle a misaligned device that reports a -1 alignment_offset. The kernel's blk_stack_limits() function may flag a device as 'misaligned'. If it does the alignment_offset will be -1. Update set_pe_align_offset() to accommodate this corner case.	2010-03-02 21:56:14 +00:00
Alasdair Kergon	16d9293bd7	Extend core allocation code in preparation for mirrored log areas.	2010-03-01 20:00:20 +00:00
Milan Broz	65752052e1	Remove lvs_in_vg_activated_by_uuid_only call. There is no difference from lvs_in_vg_activated now, convert all users to this call.	2010-02-24 20:01:40 +00:00
Milan Broz	ab9663f394	Always query device by uuid only. lvm2 devices have always UUID set even if imported from lvm1 metadata. Patch removes name argument from dev_manager_info call and converts all activation related calls to use query by UUID. Also it simplifies mknode call (which is the only user on mknodes parameter).	2010-02-24 20:00:56 +00:00
Dave Wysochanski	3c23ff0f2e	Add dm_pool_strdup to allocate memory and copy a tag in {lv\|vg}_change_tag() We need to allocate memory for the tag and copy the tag value before we add it to the list of tags. We could put this inside lvm2app since the tools keep their memory around until vg_write/vg_commit is called, but we put it inside the internal library to minimize code in lvm2app. We need to copy the tag passed in by the caller to ensure the lifetime of the memory until the {vg\|lv} handle is released. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-02-24 18:15:57 +00:00
Dave Wysochanski	cd69ee7453	Refactor lvchange_tag() to call lv_change_tag() library function. Similar refactoring to vgchange - pull out common parts and put into library function for reuse. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-02-24 18:15:49 +00:00
Dave Wysochanski	e17bcc7432	Refactor _vgchange_tag() to vg_change_tag() library function. Pull out common code to be called from tools as well as lvm2app. Leave archive() at tool level so we can use from vgcreate as well as vgchange. Should be no functional change. - add stack macro in vgchange Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-02-24 18:15:05 +00:00
Petr Rockai	f59b0f3e07	Add _mlog devices to dependency trees using UUID, not name, in activation.	2010-02-23 15:49:52 +00:00
Mike Snitzer	4bdebfd151	Do not reload origin again in lv_remove_single() if it had a merging snapshot. vg_remove_snapshot() will have already performed the required reload.	2010-02-17 23:36:45 +00:00
Mike Snitzer	a5ec3e3827	Refactor snapshot-merge deptree and device removal to support info-by-uuid Add a merging snapshot to the deptree, using the "error" target, rather than avoid adding it entirely. This allows proper cleanup of the -cow device without having to rename the -cow to use the origin's name as a prefix. Move the preloading of the origin LV, after a merge, from lv_remove_single() to vg_remove_snapshot(). Having vg_remove_snapshot() preload the origin allows the -cow device to be released so that it can be removed via deactivate_lv(). lv_remove_single()'s deactivate_lv() reliably removes the -cow device because the associated snapshot LV, that is to be removed when a snapshot-merge completes, is always added to the deptree (and kernel -- via "error" target). Now when the snapshot LV is removed both the -cow and -real devices get removed using uuid rather than device name. This paves the way for us to switch over to info-by-uuid queries. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-02-17 22:59:46 +00:00
Alasdair Kergon	c97cbf8c08	pre-release	2010-02-15 23:53:15 +00:00
Zdenek Kabelac	d251e86856	Use dm_report_field_int32 instead of dm_report_field_uint64 for printing '-1'	2010-02-15 20:27:33 +00:00
Zdenek Kabelac	810349313a	Cleanup float arithmetic gcc warning.	2010-02-15 18:35:06 +00:00
Zdenek Kabelac	88c4995d1d	* add more 'const' - fixes gcc constness warning	2010-02-15 18:34:00 +00:00
Peter Rajnoha	b9259173d0	Add LVM_SUPPRESS_LOCKING_FAILURE_MESSAGES environment variable to suppress error and warning mesages while --ignorelockingfailure is used.	2010-02-15 16:46:56 +00:00
Dave Wysochanski	629efc6a89	Export lvm_pv_get_size(), lvm_pv_get_free(), lvm_pv_get_dev_size in lvm2app. We add these exports to show the pv_size and pv_free and dev_size fields. Fixes rhbz561423. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-02-14 03:21:37 +00:00
Dave Wysochanski	ed3329eb45	Fix off by 512 sizes for lvm2app. Internally we store sizes in sectors, but lvm2app exports sizes in bytes. We could get fancier and allow units configuration but this fix should do for now. Fixes rhbz561422. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-02-14 03:21:06 +00:00
Mike Snitzer	15816a3b8c	Add 'fail_if_percent_unsupported' arg to _percent() and _percent_run(). We unfortunately don't yet _know_, in dev_manager_snapshot_percent(), if a snapshot-merge target is active (activation is deferred if dev is open); so we can't short-circuit origin devices based purely on existing LVM LV attributes. Set 'fail_if_percent_unsupported' in dev_manager_snapshot_percent() for a merging origin LV, otherwise passing unsupported LV types to _percent will lead to a default successful return with percent_range as PERCENT_100. For a merging origin, PERCENT_100 will result in a polldaemon that runs infinitely (because completion is PERCENT_0).	2010-02-10 14:38:24 +00:00
Mike Snitzer	d6bf26af8e	Remove false "failed to find tree node for <lv>" error from _cached_info(). When activating a merging origin it is valid, and expected, to not have a node in the deptree for both the origin and its merging snapshot. The _cached_info() caller is only concerned with whether a device is open. If there isn't a node in the tree the associated device is definitely not open.	2010-02-08 23:28:06 +00:00
Peter Rajnoha	04fa77c3be	This is related to liblvm and its lvm_list_vg_names() and lvm_list_vg_uuids() functions where we should not expose internal VG names/uuids (the ones with "#" prefix )through the interface. Otherwise, we could end up with library users opening internal VGs which will initiate locking mechanism that won't be cleaned up properly. "#orphans_{lvm1, lvm2, pool}" names are treated in a special way, they are truncated first to "orphans" and this is used as a part of the lock name then (e.g. while calling lvm_vg_open()). When library user calls lvm_vg_close(), the original name "orphans_{lvm1, lvm2, pool}" is used directly and therefore no unlock occurs. We should exclude internal VG names and uuids in the lists provided by lvmcache: lvmcache_get_vgids() and lvmcache_get_vgnames().	2010-02-03 14:08:39 +00:00
Dave Wysochanski	8caf272a93	Add copy constructor for struct metadata_area. Clean up cut&paste code with proper copy constructor.	2010-02-02 16:26:34 +00:00
Milan Broz	4b373907f6	Always query device by using uuid only and not name in clvmd. Otherwise confusion with the device of the same name (but different UUID, e.g. non-lvm device) can happen.	2010-01-27 13:23:57 +00:00
Milan Broz	b1ef9cd0ad	Fix return code of info callbacks. In dev_manager_info 0 means error and 1 info is returned, not that device exists (that value is part of info struct). Fix query by uuid only (no name) which returns 0 when device does not exist.	2010-01-26 07:58:23 +00:00
Mike Snitzer	537f7456e2	Eliminate extra ioctls just to check open_count in _add_new_lv_to_dtree. DM >= 4.7.0 always returns open_count so just use the associated nodes' existing info. Introduce _cached_info() to get an LV's cached info.	2010-01-22 15:40:31 +00:00
Mike Snitzer	5cb0d45d18	Removed inactive_table check from _lv_has_target_type. This check doesn't offer any benefit (that I can recall) and testing validates that.	2010-01-22 13:28:54 +00:00
Milan Broz	98d85f25be	Move error message to locking constructor and print more descriptive message if locking fails instead of "Locking type -1 initialisation failed." Use read-only locking instead of misleading ignorelocking option in message.	2010-01-22 09:45:29 +00:00
Dave Wysochanski	a7ca101517	Call _alloc_pv() inside _pv_read() and clean up error paths. We should be consistent with pv constructors so call _alloc_pv() here as we do from pv_create().	2010-01-21 21:09:23 +00:00
Dave Wysochanski	1d749d01fb	Remove useless memory allocation for pv->vg_name in _alloc_pv(). All this seems to do is provide a memory leak so remove it. The only caller of _alloc_pv() later explicitly sets pv->vg_name = fmt->orphan_vg_name so clearly this allocation should be removed. I also saw no where in the code where strncpy was used to assign pv->vg_name - only direct assignments and strdup's.	2010-01-21 21:04:44 +00:00
Dave Wysochanski	2b1446c7d6	Correct 'void *' usage in pvcreate_single. Remove needless cast.	2010-01-21 21:04:20 +00:00
Mike Snitzer	dfcb905db0	Preload the origin prior to suspend IFF snapshot(s) still exist after a merge completes. This narrows the scope of this "hack" (which still needs a proper fix within the deptree). This stops dmeventd from trying to access snapshot devices that were already removed.	2010-01-20 21:53:10 +00:00
Milan Broz	7a96516819	Never scan suspended devices in clvmd. For mirror repair (and similar tasks) it can happen that full device rescan is issued from clvmd. Because code can be in the middle of repair (calling suspend) clvmd should never try to scan suspended devices (otherwise it causes deadlock). Also code must not change ignore_suspended_device flag when doing refresh_filters (called from lvmcache scan code).	2010-01-19 13:25:00 +00:00
Mike Snitzer	6c1c02dc52	Change dev_manager_mirror_percent()'s 'struct logical_volume *' to be 'const'. Be consistent with its use (and dev_manager_snapshot_percent()). Pass 'lv' from dev_manager_snapshot_percent() to _percent() to _percent_run(). _percent_run() always dereferenced 'lv' (when initializing segh) even though it may have been NULL (as was the case until now for dev_manager_snapshot_percent()). If a "snapshot-origin" LV (snapshot-merge whose merge was deferred becuase it was open) was passed to _percent_run() it would always return 100%. Update _percent_run() to NOT return PERCENT_100 et. al. if ->target_percent() wasn't ever called and supplied 'lv' is a merging origin. A default return of 100% does not work for snapshot-merge. Also tweak a related lvconvert log_error() to include "Aborting merge."	2010-01-15 22:58:25 +00:00
Mike Snitzer	eb9454ab68	Detect case of both merging_store and cow_store supplied in _snap_text_import().	2010-01-15 17:46:08 +00:00
Mike Snitzer	e47a591d76	Improve target type compatibility checking in _percent_run(). Add 'target_status_compatible' method to 'struct segtype_handler'.	2010-01-15 16:35:26 +00:00
Alasdair Kergon	8dc351e8d4	Note some problems still to be addressed.	2010-01-14 14:39:57 +00:00
Zdenek Kabelac	fc28b13c7d	Cleanup const compiler warning	2010-01-14 10:17:12 +00:00
Zdenek Kabelac	4269e36315	Move initialization of the 'cmd' member of the struct alloc_handle before the first potentional return.	2010-01-14 10:09:42 +00:00
Zdenek Kabelac	5f31bc7926	lvol%d is generated for NULL name in lv_create_empty(). So just avoid code duplication.	2010-01-14 10:08:03 +00:00
Milan Broz	43b24f7c61	Fix clvmd automatic target module loading crash.	2010-01-13 17:40:17 +00:00
Mike Snitzer	c52678ee9b	Rename segment and lv status flag from SNAPSHOT_MERGE to MERGING. Eliminate 'merging_snapshot' from 'struct logical_volume' and just use 'snapshot' for origin lv's reference to the merging snapshot; also set MERGING in the origin lv's status.	2010-01-13 01:56:18 +00:00
Mike Snitzer	c79b425135	Add snapshot merge wrappers to abstract the associations and flags used to represent merging origin and snapshot volumes.	2010-01-13 01:55:43 +00:00
Mike Snitzer	c582e3c039	Merge on activate support. If either the origin or snapshot that is to be merged is open the merge will not start; only the merge metadata will be written. The merge will start on the next activation of the origin (or via lvchange --refresh) IFF both the origin and snapshot are closed. Merge on activate is particularly important if we want to merge over a mounted filesystem that cannot be unmounted (until next boot) --- for example root.	2010-01-13 01:54:34 +00:00
Mike Snitzer	28c3f0354a	When turning merging origin into non-merging origin, there is bad sequence: snapshots are suspended, new origin is created, snapshots are resumed, new origin is resumed. So it allocates memory while suspended. To fix it, move vg_commit after suspend_lv, so that the suspend code will treat it as precommitted vg and will preload new origin prior to suspend. NOTE: agk doesn't like this "hack"; need to revisit and fix	2010-01-13 01:52:58 +00:00
Mike Snitzer	3a8d01b6e1	Reload origin if merging has stopped.	2010-01-13 01:51:45 +00:00
Mike Snitzer	c45af88a83	When there is merging snapshot, report percentage on the origin LV. Because the snapshot LV will be hidden this is needed so the user can see merging progress with "lvs" command.	2010-01-13 01:49:22 +00:00
Mike Snitzer	3b6a076e1e	Report merging snapshot as 'S' instead of 's': This is useful for when the snapshot is still active and merging hasn't started yet; it shows a merge is pending. Once merging starts the merging snapshot will be hidden but can still be displayed with 'lvs -a' Report snapshot origin with merging snapshot as 'O' instead of 'o': Before merge starts this shows that a merge is pending. While merging the snapshot will be hidden, 'O' enables a user to see that there is a snapshot merging.	2010-01-13 01:48:38 +00:00
Mike Snitzer	0b2d216f8e	Merging device is loaded with "-cow" suffix and with base name of the origin. This is needed so that "-cow" device can be found and removed when lvremove is performed.	2010-01-13 01:44:37 +00:00
Mike Snitzer	c21b944a35	Conditionally push down either the "snapshot-origin" or "snapshot-merge" target based on whether the LV is a merging snapshot. When activating a snapshot-merge target do not attempt to monitor the LV for events; the polldaemon will monitor the snapshot as it is merged. Allow "snapshot-merge" target's usage to be parsed via standard "snapshot" methods. NOTE: follow on fixes to the _percent_run change are still needed	2010-01-13 01:43:32 +00:00
Mike Snitzer	aa6f4e51a7	Add support for "snapshot-merge" target. Introduces new libdevmapper function dm_tree_node_add_snapshot_merge_target Verifies that the kernel (dm-snapshot) provides the 'snapshot-merge' target. Activate origin LV as snapshot-merge target. Using snapshot-origin target would be pointless because the origin contains volatile data while a merge is in progress. Because snapshot-merge target is activated in place of the snapshot-origin target it must be resumed after all other snapshots (just like snapshot-origin does) --- otherwise small window for data corruption would exist. Ideally the merging snapshot would not be activated at all but if it is to be activated (because snapshot was already active) it _must_ be done after the snapshot-merge. This insures that DM's snapshot-merge target will perform exception handover in the proper order (new->resume before old->resume). DM's snapshot-merge does support handover if the reverse sequence is used (old->resume before new->resume) but DM will fail to resume the old snapshot; leaving it suspended. To insure the proper activation sequence dm_tree_activate_children() was updated to accommodate an additional 'activation_priority' level. All regular snapshots are 0, snapshot-merge is 1, and merging snapshot is 2.	2010-01-13 01:39:44 +00:00
Mike Snitzer	68e8f5a4a2	Add 'SNAPSHOT_MERGE' lv_segment 'status' flag. Make 'merging_snapshot' pointer that points from the origin to the segment that represents the merging snapshot. Import/export 'merging_store' metadata. Do not allow creating snapshots while another snapshot is merging. Snapshot created in this state would certainly contain invalid data. NOTE: patches at the end of this series will remove 'merging_snapshot' and will introduce helpful wrappers and cleanups.	2010-01-13 01:35:49 +00:00
Alasdair Kergon	109e6334b0	Fix allocation code not to stop at the first area of a PV that fits. This spurious 'break' has been here since this code was first committed in June 2005 and stopped the algorithm behaving as described in the comment above it and rendered the variable 'already_found_one' useless.	2010-01-12 20:53:20 +00:00
Alasdair Kergon	f3ac7d1b82	Revert so-called "redundant" log until after next release.	2010-01-12 14:00:51 +00:00
Jonathan Earl Brassow	673421ffc2	Testsuite updates and fixes for recently added features. 1. Found bug in 'redundant log' implementation that caused problems when converting a linear that spanned multiple devices to a mirror (wasn't checking for NULL value of provided parameter in _alloc_parallel_area) 2. Testsuite was failing to perform tests when 'not' modifier was used. This allowed a couple issues to slip through. Added a 'not_sh' modifier that negates tests performed by functions defined in the shell source file. 3. Was initializing a variable to far down, which cause previously set value to be overridden. (This was the result of the collision of the "redundant log" and lvconvert fix patches.)	2010-01-11 21:20:19 +00:00
Alasdair Kergon	e55df6fee5	Use _LOG_FATAL when aborting on an internal error.	2010-01-11 20:41:39 +00:00
Alasdair Kergon	e995e713f9	Internal errors triggering abort cannot be suppressed. (kabi)	2010-01-11 20:30:32 +00:00
Mike Snitzer	ccd6d287eb	remove errant comment fragment	2010-01-11 19:12:25 +00:00
Mike Snitzer	beacd0b303	Reset _vgs_locked in lvmcache_init() Upon successful fork(), _become_daemon() must assert that the locks that are currently held belong to the parent, not the child. All of the child's internal state saying 'this process holds a lock' has to be reset. A proper lvmcache_locking_reset() should follow later.	2010-01-11 19:08:18 +00:00
Peter Rajnoha	a750353641	Add support to disable udev checking: LVM_UDEV_DISABLE_CHECKING=1 env. var. LVM_UDEV_DISABLE_CHECKING=1 applies for /dev/<vgname> content only. We still need to define DM_UDEV_DISABLE_CHECKING=1 for /dev/mapper content.	2010-01-11 15:40:03 +00:00
Mike Snitzer	b422bb2187	remove unused variable 'i' that was recently introduced in lv_add_segment	2010-01-10 20:44:09 +00:00
Jonathan Earl Brassow	23f4aabd69	update comment	2010-01-08 23:06:36 +00:00
Jonathan Earl Brassow	77dd1c0e5f	Add the new mirror log type "redundant". The options are now: --mirrorlog core: in-memory log --mirrorlog disk: persistent log --mirrorlog redundant: redundant persistent log Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-01-08 22:32:35 +00:00
Jonathan Earl Brassow	72e0743621	This patch adds the capability to split off a mirror legs. It is pretty much the same as reducing the number of mirror legs, but we just don't delete them afterwards. The following command line interface is enforced: prompt> lvconvert --splitmirror <n> -n <name> <VG>/<LV> where 'n' is the number of images to split off, and where 'name' is the name of the newly split off logical volume. If more than one leg is split off, a new mirror will be the result. The newly split off mirror will have a 'core' log. Example: [root@bp-01 LVM2]# !lvs lvs -a -o name,copy_percent,devices LV Copy% Devices lv 100.00 lv_mimage_0(0),lv_mimage_1(0),lv_mimage_2(0),lv_mimage_3(0) [lv_mimage_0] /dev/sdb1(0) [lv_mimage_1] /dev/sdc1(0) [lv_mimage_2] /dev/sdd1(0) [lv_mimage_3] /dev/sde1(0) [lv_mlog] /dev/sdi1(0) [root@bp-01 LVM2]# lvconvert --splitmirrors 2 --name split vg/lv /dev/sd[ce]1 Logical volume lv converted. [root@bp-01 LVM2]# !lvs lvs -a -o name,copy_percent,devices LV Copy% Devices lv 100.00 lv_mimage_0(0),lv_mimage_2(0) [lv_mimage_0] /dev/sdb1(0) [lv_mimage_2] /dev/sdd1(0) [lv_mlog] /dev/sdi1(0) split 100.00 split_mimage_0(0),split_mimage_1(0) [split_mimage_0] /dev/sde1(0) [split_mimage_1] /dev/sdc1(0) It can be seen that '--splitmirror <n>' is exactly the same as '--mirrors -<n>' (note the minus sign), except there is the additional notion to keep the image being detached from the mirror instead of just throwing it away. Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-01-08 22:00:31 +00:00
Zdenek Kabelac	56bb994166	orig_status preserves 64bit status.	2010-01-08 10:50:11 +00:00
Peter Rajnoha	cda69e17ef	Add activation/udev_rules config option in lvm.conf. Add dm_tree_add_dev_with_udev_flags to provide wider support for udev flags.	2010-01-07 19:54:21 +00:00
Zdenek Kabelac	387f47078c	Add few const modifiers.	2010-01-07 14:47:57 +00:00
Zdenek Kabelac	a362bb23c0	Use macro outsize() with check for error return value.	2010-01-07 14:46:51 +00:00
Zdenek Kabelac	ea8acabe26	Export function out_text_with_comment() and add outfc() macro that checks for error.	2010-01-07 14:45:28 +00:00
Zdenek Kabelac	1e13fa7a6a	Add macros outsize() for out_size() and outhint() for out_hint() that check for errors in a similar way as outf() for out_text().	2010-01-07 14:40:46 +00:00
Zdenek Kabelac	c75550f5ba	Use offsetof() macro and avoid defining dummy static union for FIELD() macro. Makes it compilable by clang compiler.	2010-01-07 14:37:11 +00:00
Zdenek Kabelac	f760f97a1f	Just add '.' at the end of error message.	2010-01-07 14:29:53 +00:00
Milan Broz	03984e05a3	Rename mirror_device_fault_policy to mirror_image_fault policy	2010-01-06 13:27:06 +00:00
Milan Broz	5d196aa430	Use fixed buffer to prevent stack overflow in persistent filter dump.	2010-01-06 13:25:36 +00:00
Mike Snitzer	5b7f6ad698	Use snapshot metadata usage to determine if snapshot is empty Version >= 1.8.0 of the DM snapshot target appends metadata sectors used to a snapshot's status. This patch allows LVM2 to accurately determine if the snapshot store is empty. Knowing when a snapshot store is empty is important in the context of snapshot-merge (means merge is complete). Also update LVM2 to be aware of the possibility for "Merge failed" in the snapshot-merge target's status. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-01-05 21:14:04 +00:00
Mike Snitzer	56b3d20462	Add missing 'stack;' for all activate_lv and deactivate_lv callers. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-01-05 21:08:34 +00:00
Mike Snitzer	df13cf08d5	Add missing 'stack;' for all suspend_lv and resume_lv callers. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-01-05 21:07:31 +00:00
Mike Snitzer	7a8fa6aaab	Add a [--poll {y\|n}] flag to vgchange and lvchange to control whether the background polldaemon is allowed to start. It can be used standalone or in conjunction with --refresh or --available y. Control over when the background polldaemon starts will be particularly important for snapshot-merge of a root filesystem. Dracut will be updated to activate all LVs with: --poll n The lvm2-monitor initscript will start polling with: --poll y NOTE: Because we currently have no way of knowing if a background polldaemon is active for a given LV the following limitations exist and have been deemed acceptable: 1) it is not possible to stop an active polldaemon; so the lvm2-monitor initscript doesn't stop running polldaemon(s) 2) redundant polldaemon instances will be started for all specified LVs if vgchange or lvchange are repeatedly used with '--poll y' Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-01-05 20:56:51 +00:00
Milan Broz	0e06c92fdf	Propagate commit and revert metadata event to other nodes in cluster. This patch tries to correctly track changes in lvmcache related to commit/revert. For vg_commit: if there is cached precommitted metadata, after successfull commit these metadata must be tracked as committed. For vg_revert: remote nodes must drop precommitted metadata and its flag in lvmcache. (N.B. Patch do not touch LV locks here in any way.) All this machinery is needed to properly solve remote node cache invalidaton which cause several problems recently observed.	2010-01-05 16:09:33 +00:00
Milan Broz	c9118a1d20	Proper mask lock mode for vg lock. Lock mode is int masked by LCK_TYPE_MASK, always. Patch also remove uneccessary masking lock flag on sender side, if masking is needed, it is don on client side already.	2010-01-05 16:07:56 +00:00
Milan Broz	d37be0b865	Add possibility to handle precommitted metadata in lvmcache. - Add drop_precommitted flag to force drop precommitted metadata - add lvmcache_commit_metadata() which upgrades precommitted metadata in cache No functional change in this patch - just preparation for following change.	2010-01-05 16:06:42 +00:00
Milan Broz	d7f44761ab	Properly decode flags even for VG locks. And decode flags in humar readable form in client. And clean some trailing whitespaces. No functional change in this patch (only debugging messages changed).	2010-01-05 16:03:37 +00:00
Milan Broz	4b1687fb74	Do not set precommitted flag in cache when precommitted metadata does not exist. The use_precommitted flag indicates, that we want to use precommitted metadata (used in suspend call to preload table with precommitted data). But if there are no such data, committed metadata are read but the cache still contains that precommitted flag. (The problem is that later possible drop_metadata call will not invalidate device in cache.) The wrong precommitted state is stored in on remote nodes during normal suspend/resume cycle _without_ vg_write/commit. Use the PRECOMMITTED status flag here instead (which is always set if using precommited metadata here).	2010-01-05 16:01:22 +00:00
Milan Broz	60494fe74b	Resume volumes in reverse order to preserve memlock pairing. If renaming snapshot with virtual origin, the origin is renamed too. But the code must resume LVs in reverse order to properly pair memlock (in cluster locking). (The resume of snapshot resumes origin too and later resume is ignored otherwise.)	2010-01-05 15:58:11 +00:00
Milan Broz	cfe30f1df3	Drop metadata cache after device was autorepaired and removed from VG. All long running processes must reload metadata when some device becomes orphan after repair.	2009-12-18 12:45:41 +00:00
Milan Broz	aa02928ff7	Remove missing flag if PV reappeared and is empty. When PV device reappears with old metadata, it is always updated to new version byt atutomatic metadata repair. Remove missing flag if device is empty. If device contains allocated extents, issue warning that user must remove volumes and re-add this PV before manipulating with this volume. This partially solves bug 547842 when one PV (log) is failed, dmeventd removes that device and later this device reappears and is wrongly added into VG marked missing.	2009-12-18 12:44:20 +00:00
Petr Rockai	fbcb06145b	Revert another unintended change that snuck in.	2009-12-17 15:59:53 +00:00
Petr Rockai	dff5da2d64	Fix removal of multiple devices from a mirror (+ regression test).	2009-12-17 15:38:29 +00:00
Petr Rockai	207542b40e	Revert unintended change that slipped in with last checkin.	2009-12-16 19:26:20 +00:00
Petr Rockai	550cae2340	#define an INTERNAL_ERROR macro and use it throughout LVM.	2009-12-16 19:22:11 +00:00
Zdenek Kabelac	b1ebf028de	Cleanup returns for void functions.	2009-12-11 13:16:37 +00:00
Zdenek Kabelac	735308699c	Destroy allocated mempool in _vg_read_orphans() error path.	2009-12-11 13:14:44 +00:00
Milan Broz	34de60e4d4	Call explicitly suspend for temporary mirror layer. The memlock_inc() fix is wrong, memlock count is not propagated to long living process (clvmd) and just it underflow there. Also suspend is needed to pre-load precommited metadata on other nodes (remapping to error taget in this case). With explicit suspend we generate lock request and code can update memlock count. (Infinitely "locked" memory caused that fs_unlock() was not called properly and on cluster nodes remains old links in /dev/mapper for not active devices.) (N.B. failing of suspend call here is not handled as fatal error - the LV is going to be removed later anyway.)	2009-12-09 19:53:39 +00:00
Milan Broz	adee669441	Use more descriptive variable name for temporary layer lv.	2009-12-09 19:43:39 +00:00
Milan Broz	0fa0e6addf	Allow manipulation with precommited metadata even when a PV is missing. The new recovery code first tries to repair LV and then removes failed PV from VG. It means that during operation there can be VG with PV missing, and vg_read code handles it like not consistent VG. We already allows returning "inconsistent" commited metadata, for mirror repair we need this for precommited too. (The suspend call prepares precommited metadata to inactive table on other cluster nodes.) "Inconsistent" here means - correct metadata, just with some metadata areas not found (obviously on missing or failed PVs).	2009-12-09 19:29:04 +00:00
Milan Broz	7251c8f3a5	Get rid of magic masks in cluster locking code. Patch should not cause any problems, only real change is removing LCK_LOCAL bit from lock type flag, it is never used there. (LCK_LOCAL is part arg[1] bits anyway.)	2009-12-09 18:28:27 +00:00
Milan Broz	f9f8cd08b3	Get rid of hardcoded 0xffdf cluster lock flag. There is hidded change - the upper flags (0xffff0000) and now not cleared, but there are unused anyway.	2009-12-09 18:16:38 +00:00
Milan Broz	f72a06ccf7	Remove newly created log volume if initial deactivation fails. If there is problem deactivate LV and _init_mirror_log is called with remove_on_failure = 1, remove the newly created log LV from metadata. (This can happen if there is active device with the same name but different UUID.) The main reason for this "workaround" patch is to - do not keep _mlog volume in metadata, so user can repeat the action - print better error message describing the real problem # lvcreate -m 2 -n lv1 -l 1 --nosync vg_bar WARNING: New mirror won't be synchronised. Don't read what you didn't write! /dev/vg_bar/lv1_mlog: not found: device not cleared Aborting. Failed to wipe mirror log. Error locking on node bar-01: Input/output error Unable to deactivate mirror log LV. Manual intervention required. Failed to create mirror log. # lvcreate -m 2 -n lv1 -l 1 --nosync vg_bar WARNING: New mirror won't be synchronised. Don't read what you didn't write! Aborting. Unable to deactivate mirror log. Failed to initialise mirror log.	2009-12-09 18:09:52 +00:00
Dave Wysochanski	59baeb838c	Update a few more uint64_t's related to the 64-bit status change. At this point they probably do not matter but going forward they may - depends on future patches for replicator, etc. I think these probably got missed because they were 'flags' so I changed the name to 'status' to be consistent. So the on-disk things 'flags' and the in structure 'status' (bits). NOTE: WHATS_NEW already has entry for this in current release. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Acked-by: Mike Snitzer <snitzer@redhat.com>	2009-12-04 17:48:32 +00:00
Milan Broz	63ae0d1464	Fix memory lock imbalance in lv_suspend if already suspended. pvmove suspends all moved LVs + pvmoveX mirrored LV itself. This suspends even underlying pvmoveX and following explicit suspend call is just noop. But in resume the pvmoveX volume is no longer underlying device for moved LVs, so it performs full resume with memlock decrease. Code must call memlock_inc() if suspend is requested, volume is already suspended and error is not requested.	2009-12-03 19:23:40 +00:00
Milan Broz	fec4de9563	Fix tools to report error when stopped by user. (And do not produce internal error message.)	2009-12-03 19:18:33 +00:00
Zdenek Kabelac	60184834b5	minor indent change	2009-12-03 10:01:30 +00:00
Zdenek Kabelac	6c8ffd0362	skip cast from (void*)	2009-12-03 09:59:54 +00:00
Zdenek Kabelac	08e64ce5e9	minor whitespace indentation	2009-12-03 09:58:30 +00:00
Dave Wysochanski	c053fb62bc	Fix setting of readahead in lvcreate. The default comes from the configuration settings, with possible commandline override.	2009-12-03 01:47:33 +00:00
Milan Broz	0548bcc2dc	Fix memory leak in lv_info_by_lvid The lv_from_lvid calls internally vg_read(), we must release vg structure afterwards. Code is called only from clvmd.	2009-12-01 19:10:23 +00:00
Petr Rockai	f1a4ec40fc	If aborting due to an internal error, always print the message causing this.	2009-12-01 13:54:27 +00:00
Petr Rockai	99a304bc17	Optionally abort on internal errors (and leverage this option in the testsuite). (This is showing a problem in the pvmove test for me, so I expect the tests to start failing -- this needs to be fixed separately though.)	2009-11-30 17:17:11 +00:00
Milan Broz	5800aa5c07	Do not allow creating mirrors of more than 8 images. This is kernel limitation in all kernel versions, so better detect this early.	2009-11-27 14:35:38 +00:00
Dave Wysochanski	ccb601a3cb	Remove unnecessary / duplicate dm_list macros and functions. These are no longer used by anyone. The dm_list defines are all in libdevmapper.h and libdm/datastruct/list.c contains any function definitions. There is some code in "old-tests" that still use this but this code is not being maintained. Thanks to Zdenek for spotting this.	2009-11-25 20:44:07 +00:00
Mike Snitzer	a2552d4f59	Switch status from 32-bit to 64-bit The physical_volume, volume_group, logical_volume and lv_segment structures' 'status' member is now uint64_t. The alignment of these structures was also audited to remove holes. The movement of some members in 'volume_group' and 'lv_segment' eliminates holes. The 'physical_volume' structure still has one 4-byte hole after 'pe_size'; the other structures no longer have any holes. Each structures' size has not changed.	2009-11-24 22:55:55 +00:00
Milan Broz	cd501dd440	Move persistent filter dump to more appropriate place. After context_refresh is cache empty, the cache flush does nothing. Call it after lvmcache full rescan if running from log lived process.	2009-11-24 16:11:37 +00:00
Milan Broz	e1ab01e3ad	Refresh device filters before full device rescan in lvmcache. The sysfs filter initialise hash of available devices using scan of /sys/block. We need to refresh even this hash when performing full scan otherwise the newly appeared device could be rejected, because there is no entry in sysfs filter. This easily could happen when attaching new device to cluster node. (Only force refresh of context in clvmd -R works here now). Unfortunately consequences of this are much worse, missing device part on that node is replaced with missing segment (even when no partial arg is selected) and this directly lead to data corruption. See https://bugzilla.redhat.com/show_bug.cgi?id=538515 Simply fix it by refreshing device filters in lvmcache before performing the full device scan.	2009-11-24 16:10:25 +00:00
Milan Broz	6b8304ab43	Fix memory lock imbalance in locking code. (This affects only cluster locking because only cluster locking module set LCK_PRE_MEMLOCK.) With currect code you get # vgchange -a n Internal error: _memlock_count has dropped below 0. when using cluster locking. It is caused by _unlock_memory calls here if ((flags & (LCK_SCOPE_MASK \| LCK_TYPE_MASK)) == LCK_LV_RESUME) memlock_dec(); Unfortunately it is also (wrongly) called in immediate unlock (when LCK_HOLD is not set) from lock_vol (LCK_UNLOCK is misinterpreted as LCK_LV_RESUME). Avoid this by comparing original flags and provide memlock code type of operation (suspend/resume).	2009-11-23 10:55:14 +00:00
Milan Broz	a4893bc377	Revert vg_read_internal change, clvmd cannot use vg_read now. (2.02.55)	2009-11-23 10:44:50 +00:00
Petr Rockai	4e11dfe3ca	In case we refuse to continue due to missing PVs, print a hint about using vgreduce --removemissing to remedy the situation.	2009-11-19 13:44:37 +00:00
Petr Rockai	e2683aafe6	The double resume in remove_mirror_images does not happen always. Only call memlock_inc() when it actually does happen.	2009-11-19 13:42:38 +00:00
Petr Rockai	090585a8f4	Un-export vg_read_internal.	2009-11-19 12:13:37 +00:00
Petr Rockai	2f1d6f7f0c	Add a missing #include (fix compiler warning).	2009-11-19 12:09:53 +00:00
Petr Rockai	4514bdcfa2	Fix another bug in memlock handling, this time the "global" dmeventd memlock was preventing device scans in lvconvert --repair running from inside dmeventd.	2009-11-19 01:11:57 +00:00
Petr Rockai	c85222c461	Add an extra memlock_inc() to _remove_mirror_images to properly balance reference counting (see code comment for details).	2009-11-18 18:23:46 +00:00
Petr Rockai	39b1e58979	Issue an Internal error message whenever _memlock_count drops below 0.	2009-11-18 18:22:32 +00:00
Milan Broz	20babc7b4f	Fix pvmove region_size overflow for very large PVs. Fixes problem reported in https://www.redhat.com/archives/dm-devel/2009-November/msg00104.html The region size multiplication can overflow when using 32bit integer.	2009-11-18 16:48:10 +00:00
Milan Broz	6ddadf4023	Fix compilation warning: activate/dev_manager.c:362: warning: combined_percent_range may be used uninitialized in this function	2009-11-04 14:56:35 +00:00
Zdenek Kabelac	7fb52b9c39	Export functions out_inc_indent(), out_dec_indent() for creating indented metadata lines. Macro outnl() is using exported out_newline() instead of direct call f->fn(), that required the visibility of the internal struct formatter.	2009-11-03 11:00:46 +00:00

... 5 6 7 8 9 ...

2322 Commits