shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2025-01-07 21:18:59 +03:00

Author	SHA1	Message	Date
Alasdair Kergon	1d7649f36b	Reinstate correct permissions when creating mirrors.	2011-06-29 17:05:53 +00:00
Alasdair Kergon	e189a84f57	Append 'm' attribute to pv_attr for missing PVs.	2011-06-29 14:56:33 +00:00
Alasdair Kergon	4d40a6f53c	Remove enforcement of udev verification when using non-standard /dev location. If you change the dev dir, it's your responsibility to adjust udev rules or tell lvm not to use udev too.	2011-06-28 00:23:06 +00:00
Alasdair Kergon	55f83c4399	Move _set_lvm_fallback into toolcontext, fix string comparison (/devtest matched /dev) and note that function should go anyway as it can be overriding a valid config.	2011-06-27 23:43:04 +00:00
Alasdair Kergon	0437bccc3c	Move udev_only logic inside stacked node op code. (We still need to treat add+readhead+del as a no-op.) Rename udev_fallback to verify_udev_operations. Rename --udevfallback to --verifyudev	2011-06-27 21:43:58 +00:00
Alasdair Kergon	140615dafb	remove unused var after recent patch	2011-06-24 23:39:09 +00:00
Jonathan Earl Brassow	9e0edb7ee5	Fix to preserve exclusive activation of mirror while up-converting. When an LVM mirror is up-converted (an additional image added), it creates a temporary mirror stack. The lower-level mirror in the stack that is created was not being activated exclusively - violating the exclusive nature of the original mirror. We now check for exclusive activation of a mirror before converting it, and if found, we ensure that the temporary mirror is also exclusively activated.	2011-06-23 14:00:58 +00:00
Milan Broz	6adbb95b82	Fail allocation if number of extents not divisible by area count Allocation should fail early if this condition is not met. Quick fix for https://bugzilla.redhat.com/show_bug.cgi?id=707779	2011-06-23 10:53:24 +00:00
Jonathan Earl Brassow	9e277b9e2c	Fix issue preventing cluster mirror creation. Mirrors used to be created by first creating a linear device and then adding the other images plus the log. Now mirrors are created by creating all the images in one go and then adding the log separately. The new way ran into the condition that cluster mirrors cannot change the log type (in the case of creation, from core -> disk) while the mirror is not active. (It isn't active because it is in the process of being created.) The reason this condition is in place is because a remote node may have the mirror active, and we don't want to alter the log underneath it. What we really needed was a way of checking if the mirror was active remotely but not locally, and in that case do not allow a change of the log. I've added this check, and cluster mirrors can now be created again.	2011-06-22 21:31:21 +00:00
Peter Rajnoha	418663b61c	Disable udev fallback by default and add activation/udev_fallback to lvm.conf. We've used udev fallback code till now to check whether udev created/removed the entries in /dev correctly and if not, a repair was done (giving a warning messagea about that). This patch adds a possibility to enable this additional check and subsequent fallback only when required (debugging purposes mostly) and trust udev completely. So let's disable the fallback code by default and add a new configuration option "activation/udev_fallback". (The original code for creating the nodes will still be used in case the device directory that is set in lvm.conf differs from the one that udev uses and also when activation/udev_rules is set to 0 - otherwise we would end up with no nodes/symlinks at all)	2011-06-17 14:50:53 +00:00
Zdenek Kabelac	bebe60b70c	Code move of vg_mark_partial() up in stack It's useful to keep the partial flag cached - so just move the call for vg_mark_partil_lvs() into import_vg_from_config_tree() so it gets evaluated before it goes through the lvmcache. This patch should not present any functional change. Note: It is rather temporal solution - proper place is probably inside the 'read' call back - but needs some more discussion. For now using this minor hack.	2011-06-17 14:39:10 +00:00
Zdenek Kabelac	93a98c2672	Remove unused internal flag ACTIVATE_EXCL from the code	2011-06-17 14:30:58 +00:00
Zdenek Kabelac	f50a76379a	Remove test for status flag As the ACTIVATE_EXCL could be set only in clvmd code - there is no use for this test in lv_add_mirrors() function only called from tools context. FIXME: Add cluster test case for this.	2011-06-17 14:27:34 +00:00
Zdenek Kabelac	f3d8974dc9	Add couple FIXMEs around suspicious code	2011-06-17 14:24:18 +00:00
Zdenek Kabelac	c6168a14c9	Use lv_activate_opts struct instead of ACTIVATE_EXCL status flag Let's hope all conditions has been properly converted.	2011-06-17 14:22:48 +00:00
Zdenek Kabelac	3c9ff9e142	Use lv_activate_opts struct instead of ACTIVATE_EXCL status flag.	2011-06-17 14:17:16 +00:00
Zdenek Kabelac	81beded3af	Add lv_activate_opts structure To avoid modification of 'read-only' volume group structure add a new structure to pass local data around the code for LV activation. As origin_only is one such flag - replace this parameter with new struct lv_activate_opts. More parameters might eventually become part of lv_activate_opts.	2011-06-17 14:14:19 +00:00
Petr Rockai	6d25c0d26f	Fix RHBZ 651590 (failure to lock LV results in failure to repair mirror after transient error), stemming from the following sequence of events: 1) devices fail IO, triggering repair 2) dmeventd starts fixing up the mirror 3) during the downconversion, a new metadata version is written --> the devices come back online here 4) the mirror device suspend/resume is called to update DM tables 5) during the suspend/resume cycle, pre-commit metadata is read; however, since the failed devices are now back online, we get back inconsistent set of precommit metadata and the whole operation fails The patch relaxes the check that fails in step 5 above, namely by ignoring inconsistencies coming from PVs that are marked MISSING.	2011-06-15 17:45:02 +00:00
Alasdair Kergon	7df72b3c88	Fix last snapshot removal to avoid table reload while a device is suspended.	2011-06-13 22:28:04 +00:00
Alasdair Kergon	1840aa0974	Maintain a count of the number of suspended devices in libdevmapper and use this for the LVM critical section logic. Also report an error if code tries to load a table while any device is known to be in the suspended state. (If the variety of problems these changes are showing up can't be fixed before the next release, the error messages can be reduced to debug level.)	2011-06-13 03:32:45 +00:00
Alasdair Kergon	29f2c5ada6	Disable critical section internal errors until this can be fixed properly in libdevmapper.	2011-06-12 00:23:50 +00:00
Alasdair Kergon	df390f1799	Major pvmove fix to issue ioctls in the correct order when multiple LVs are affected by the move. (Currently it's possible for I/O to become trapped between suspended devices amongst other problems. The current fix was selected so as to minimise the testing surface. I hope eventually to replace it with a cleaner one that extends the deptree code. Some lvconvert scenarios still suffer from related problems.	2011-06-11 00:03:06 +00:00
Milan Broz	4fb39ae074	Validate mirror segments size Currently some operation with striped mirrors lead to corrupted metadata, this patch just add detection of such situation. Example: # lvcreate -i2 -l10 -n lvs vg_test # lvconvert -m1 vg_test/lvs # lvreduce -f -l1 vg_test/lvs Reducing logical volume lvs to 4.00 MiB Segment extent reduction 9not divisible by #stripes 2 Logical volume lvs successfully resized # lvremove vg_test/lvs Segment extent reduction 1not divisible by #stripes 2 LV segment lvs:0-4294967295 is incorrectly listed as being used by LV lvs_mimage_0 Internal error: LV segments corrupted in lvs_mimage_0.	2011-06-09 19:36:16 +00:00
Peter Rajnoha	afc8a3b104	Fix create_temp_name to replace any '/' found in the hostname with '?'. There's a possibility someone will use the '/' in the hostname. Since we generate a temporary file name (path) including the hostname, any '/' would be ambiguous. We can always set such hostname using 'sethostname' from unistd.h. But the 'hostname' command already includes the check and removes the '/' char. However, some old versions still allow that. See: https://bugzilla.redhat.com/show_bug.cgi?id=711445. Since this is only a temporary name and the possibility of this error is quite negligible, we don't need any complex escape sequence here, just a simple char replace.	2011-06-08 08:49:53 +00:00
Alasdair Kergon	bb056af3c9	missing space in mesg	2011-06-06 12:08:42 +00:00
Alasdair Kergon	0ebd0960b4	Propagate test mode to clvmd to skip activation and changes to held locks.	2011-06-01 21:16:55 +00:00
Alasdair Kergon	3cac20f850	Defer writing PV labels to vg_write. Store label_sector only in struct physical_volume.	2011-06-01 19:29:31 +00:00
Alasdair Kergon	2aa785c85f	Report sector containing label in verbose message.	2011-06-01 19:26:38 +00:00
Alasdair Kergon	453cdee51c	Permit --available with lvcreate so non-snapshot LVs need not be activated.	2011-06-01 19:21:03 +00:00
Alasdair Kergon	677ec408bf	Report sector containing label in verbose message.	2011-06-01 15:30:36 +00:00
Peter Rajnoha	c08c564e21	Use new dev_open_readonly fn to prevent opening devices for read-write when not necessary. Before, we used vg_write_lock_held call to determnine the way a device is opened. Unfortunately, this opened many devices in RW mode when it was not really necessary. With the OPTIONS+="watch" rule used in the udev rules, this could fire numerous events while closing such devices (and it caused useless scans from within udev rules in return). A common bug we hit with this was with the lvremove command which was unable to remove the LV since it was being opened from within the udev rules. This patch should minimize such situations (at least with respect to LVM handling of devices). Though there's still a possibility someone will open a device 'outside' in parallel and fire the event based on the watch rule when closing a device once opened for RW.	2011-05-28 09:48:14 +00:00
Alasdair Kergon	564360c195	test	2011-05-24 14:10:55 +00:00
Alasdair Kergon	2d56f030f0	test	2011-05-24 14:09:41 +00:00
Alasdair Kergon	972d37f570	test	2011-05-24 14:00:57 +00:00
Alasdair Kergon	33bd9138fa	test	2011-05-24 13:53:26 +00:00
Alasdair Kergon	0b70507434	Add and use dev_open_readonly and variations.	2011-05-24 13:36:57 +00:00
Peter Rajnoha	5fb0c20297	Do not issue an error message when unable to remove .cache on read-only fs.	2011-05-12 12:42:47 +00:00
Petr Rockai	eee66d2a80	When glibc needs buffers for line buffering of input and output buffers, it allocates these buffers in such way it adds memory page for each such buffer and size of unlock memory check will mismatch by 1 or 2 pages. This happens when we print or read lines without '\n' so these buffers are used. To avoid this extra allocation, use setvbuf to set these bufffers ahead. Signed-off-by: Zdenek Kabelac <zkabelac@redhat.com> Reviewed-by: Milan Broz <mbroz@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2011-05-07 13:50:11 +00:00
Petr Rockai	833a287337	Make vg_mark_partial_lvs also clear existing PARTIAL_LV flags, so it can be issued repeatedly on the same VG, keeping the PARTIAL_LV flags up to date.	2011-05-07 13:32:05 +00:00
Alasdair Kergon	5510b4e7d7	test update without WHATS_NEW to check it gives warning now	2011-04-29 19:06:17 +00:00
Alasdair Kergon	4cda44bfb9	commands/toolcontext.c:578: warning: â€˜udev_dirâ€™ may be used uninitialized in this function commands/toolcontext.c:576: warning: â€˜udev_dir_lenâ€™ may be used uninitialized in this function Bogus - suppress them.	2011-04-29 16:23:39 +00:00
Alasdair Kergon	919ab56b6d	pre-release clean-ups	2011-04-29 00:21:13 +00:00
Alasdair Kergon	9cda028a96	clean up critical section patch	2011-04-28 20:29:59 +00:00
Zdenek Kabelac	96c4abee62	Missing space in debug message	2011-04-28 19:59:17 +00:00
Alasdair Kergon	2c56f60db4	Set pv_min_size to 2048KB to exclude floppy drives. Previously was 512.	2011-04-28 17:33:34 +00:00
Peter Rajnoha	edcda01a1e	Obtain device list from udev by default if LVM2 is compiled with udev support. Also, add a new 'obtain_device_list_from_udev' setting to lvm.conf with which we can turn this feature on or off if needed. If set, the cache of block device nodes with all associated symlinks will be constructed out of the existing udev database content. This avoids using and opening any inapplicable non-block devices or subdirectories found in the device directory. This setting is applied to udev-managed device directory only, other directories will be scanned fully. LVM2 needs to be compiled with udev support for this setting to take effect. N.B. Any device node or symlink not managed by udev in udev directory will be ignored with this setting on.	2011-04-22 12:05:32 +00:00
Peter Rajnoha	40059f18d9	Move common libudev code to lvm-wrappers.[ch]. ...so we can use it throughout.	2011-04-22 11:59:59 +00:00
Zdenek Kabelac	5c246f4876	Skip check for NULL before dm_free dm_free makes this test itself.	2011-04-21 13:15:26 +00:00
Zdenek Kabelac	b680d5bf7b	Fix use of released vgname and vgid Avoid using of already released memory when duplicated MDA is found. As get_pv_from_vg_by_id() may call lvmcache_label_scan() use the local copy of the vgname and vgid on the stack as vginfo may dissapear and code was then accessing garbage in memory. i.e. pvs /dev/loop0 (when /dev/loop0 and /dev/loop1 has same MDA content) Invalid read of size 1 at 0x523C986: dm_hash_lookup (hash.c:325) by 0x440C8C: vginfo_from_vgname (lvmcache.c:399) by 0x4605C0: _create_vg_text_instance (format-text.c:1882) by 0x46140D: _text_create_text_instance (format-text.c:2243) by 0x47EB49: _vg_read (metadata.c:2887) by 0x47FBD8: vg_read_internal (metadata.c:3231) by 0x477594: get_pv_from_vg_by_id (metadata.c:344) by 0x45F07A: _get_pv_if_in_vg (format-text.c:1400) by 0x45F0B9: _populate_pv_fields (format-text.c:1414) by 0x45F40F: _text_pv_read (format-text.c:1493) by 0x480431: _pv_read (metadata.c:3500) by 0x4802B2: pv_read (metadata.c:3462) Address 0x652ab80 is 0 bytes inside a block of size 4 free'd at 0x4C2756E: free (vg_replace_malloc.c:366) by 0x442277: _free_vginfo (lvmcache.c:963) by 0x44235E: _drop_vginfo (lvmcache.c:992) by 0x442B23: _lvmcache_update_vgname (lvmcache.c:1165) by 0x443449: lvmcache_update_vgname_and_id (lvmcache.c:1358) by 0x443C07: lvmcache_add (lvmcache.c:1492) by 0x46588C: _text_read (text_label.c:271) by 0x466A65: label_read (label.c:289) by 0x4413FC: lvmcache_label_scan (lvmcache.c:635) by 0x4605AD: _create_vg_text_instance (format-text.c:1881) by 0x46140D: _text_create_text_instance (format-text.c:2243) by 0x47EB49: _vg_read (metadata.c:2887) Add testing script	2011-04-21 13:13:40 +00:00
Mike Snitzer	ffcb1b9c2c	Improve the discard documentation. Also improve discard code in pv_manip.c to properly account for case when pe_start=0 and the first physical extent is to be released (currently skip the first extent to avoid discarding the PV label).	2011-04-13 18:26:39 +00:00
Mike Snitzer	727373c176	Use uint32_t rather than uint64_t.	2011-04-12 22:04:04 +00:00
Mike Snitzer	fdc8670327	Add "devices/issue_discards" to lvm.conf. Issue discards on lvremove if enabled and both storage and kernel have support.	2011-04-12 21:59:01 +00:00
Zdenek Kabelac	96077265c4	Replace dm_snprintf with strncpy My previous patch fixed incorrect error check for dm_snprintf. However in this particular case - dm_snprintf has been used differently - just like strncpy + setting last char with '\0' - so the code had to return error - because the buffer was to short for whole string. Patch replaces it with real strncpy. Also test for alloca() failure is removed - as the program behaviour is rather undefined in this case - it never returns NULL.	2011-04-12 14:13:17 +00:00
Petr Rockai	db22d9b978	This patchset refactors some reporting code and completes the remaining lvseg properties for lvm2app, 'devices' and 'seg_pe_ranges'. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2011-04-12 12:24:29 +00:00
Zdenek Kabelac	c67d2b4dd4	Fix incorrect tests for dm_snprintf() failure As the memory is preallocated based on arg size in these cases, the error would be quite hard to trigger here anyway.	2011-04-09 19:05:23 +00:00
Zdenek Kabelac	a1eba521e3	Fix some unmatching sign comparation gcc warnings Simple replacement for unsigned type - usually in for() loops.	2011-04-08 14:40:18 +00:00
Zdenek Kabelac	2c5827076b	Add missing printf attributes These attributes were missing in previous patch, that was adding instrumentation for printf formating string parameter.	2011-04-08 14:21:34 +00:00
Zdenek Kabelac	040cdff1d4	Better const cast logic (although still gcc gives const violation warning)	2011-04-08 14:14:57 +00:00
Zdenek Kabelac	45109d497c	Fix some forgotten -Wold-style-definition gcc warnings	2011-04-08 14:13:08 +00:00
Zdenek Kabelac	e42cb2f892	Newer gcc doesn't need this trick In fact it now generates an opposite warning about using undefined variable.	2011-04-08 14:11:40 +00:00
Jonathan Earl Brassow	532e6c8ae3	Thanks to Zdenek Kabelac (kabi) for pointing out that I was using dm_pool_free incorrectly. This check-in fixes that incorrect usage. I've also added a WHATS_NEW line to reflect the changes I made to allow lv_extend to operate on 0 length intrinsically layered LVs (i.e mirrors and RAID). I forgot that in the last commit.	2011-04-07 21:49:29 +00:00
Jonathan Earl Brassow	fe93c99ad9	This patch adds the ability to extend 0 length layered LVs. This allows us to allocate all images of a mirror (or RAID array) at one time during create. The current mirror implementation still requires a separate allocation for the log, however.	2011-04-06 21:32:20 +00:00
Peter Rajnoha	29684f590c	Cleanup fid finalization code in free_vg and allow exactly the same fid to be set again for a PV/VG. Actually, we can call vg_set_fid(vg, NULL) instead of calling destroy_instance for all PV structs and a VG struct - it's the same code we already have in the vg_set_fid. Also, allow exactly the same fid to be set again for the same PV/VG Before, this could end up with the fid destroyed because we destroyed existing fid first and then we used the new one and we didn't care whether existing one == new one by chance.	2011-04-01 14:54:20 +00:00
Zdenek Kabelac	3d04380691	Use created hash tables for quick check of LV, PV. Instead of searching linear list of all LVs, PVs - use created hash tables also for quick mapping between LV. (Note - for small number of PVs or LVs the overhead of the hash is bigger). TODO: Use hash tables in volume_group structure directly.	2011-03-30 13:35:51 +00:00
Zdenek Kabelac	d992bbbaa3	Keep the cache content when the exported vg buffer is matching Instead of regenerating config tree and parsing same data again, check whether export_vg_to_buffer does not produce same string as the one already cached - in this case keep it, otherwise throw cached content away. For the code simplicity calling _free_cached_vgmetadata() with vgmetadata == NULL as the function handles this itself. Note: sometimes export_vg_to_buffer() generates almost the same data with just different time stamp, but for the patch simplicity, data are reparsed in this case. This patch currently helps for vgrefresh.	2011-03-30 13:14:34 +00:00
Zdenek Kabelac	a66bff47f1	Few more files filtered from memory locking Code located in these files should not be used in critical section.	2011-03-30 13:06:13 +00:00
Zdenek Kabelac	df336e72d2	Optimise error message write to _lvm_errmsg Isn't usually perfomance critical - but log_error is used i.e.for debuging, this code noticable slows down the processing. Added 512KB limit to avoid memory exhastions in case of some endless loop. TODO: use _lvm_errmsg buffer only when lvm2api needs it.	2011-03-30 12:53:04 +00:00
Zdenek Kabelac	01fb91c615	Valgrind updates Avoid locking sum testing with valgrind compilation. Make memory unaccessible in the valgrind for dm_pool_abadon_object. Valgrind hinting should not be needed in _free_chunk for dm_free.	2011-03-30 12:43:32 +00:00
Zdenek Kabelac	142d2e8740	Fix reading of unitialized memory Could be reached via few of our lvm2 test cases: ==11501== Invalid read of size 8 ==11501== at 0x49B2E0: _area_length (import-extents.c:204) ==11501== by 0x49B40C: _read_linear (import-extents.c:222) ==11501== by 0x49B952: _build_segments (import-extents.c:323) ==11501== by 0x49B9A0: _build_all_segments (import-extents.c:334) ==11501== by 0x49BB4C: import_extents (import-extents.c:364) ==11501== by 0x497655: _format1_vg_read (format1.c:217) ==11501== by 0x47E43E: _vg_read (metadata.c:2901) cut from t-vgcvgbackup-usage.sh -- pvcreate -M1 $(cat DEVICES) vgcreate -M1 -c n $vg $(cat DEVICES) lvcreate -l1 -n $lv1 $vg $dev1 -- Idea of the fix is rather defensive - to allocate one extra element to 'map' array which is then used in _area_length() - where the loop checks, whether next map entry is continuous. By placing there always one extra zero entry - we fix the read of unallocated memory, and we make sure the data would not make a continous block. FIXME: there could be a problem if some special broken lvm1 data would be imported. As the format1 is currently not really used - leave it for future fix and use this small hotfix for now.	2011-03-30 12:30:39 +00:00
Zdenek Kabelac	1bedd3a97b	Use id_equal instead of strncmp() More consistent and easier to read.	2011-03-29 21:57:56 +00:00
Zdenek Kabelac	3aef5ae7fb	Fix access to released memory Invalid primary_vginfo was supposed to move all its lvmcache_infos to orphan_vginfo - however it has called _drop_vginfo() inside the loop that released primary_vginfo itself - thus made the loop using released memory. Use _vginfo_detach_info() instead and call _drop_vginfo after th loop is finished. Valgrind trace it should fix: Invalid read of size 8 at 0x41E960: _lvmcache_update_vgname (lvmcache.c:1229) by 0x41EF86: lvmcache_update_vgname_and_id (lvmcache.c:1360) by 0x441393: _text_read (text_label.c:329) by 0x442221: label_read (label.c:289) by 0x41CF92: lvmcache_label_scan (lvmcache.c:635) by 0x45B303: _vg_read_by_vgid (metadata.c:3342) by 0x45B4A6: lv_from_lvid (metadata.c:3381) by 0x41B555: lv_activation_filter (activate.c:1346) by 0x415868: do_activate_lv (lvm-functions.c:343) by 0x415E8C: do_lock_lv (lvm-functions.c:532) by 0x40FD5F: do_command (clvmd-command.c:120) by 0x413D7B: process_local_command (clvmd.c:1686) Address 0x63eba10 is 16 bytes inside a block of size 160 free'd at 0x4C2756E: free (vg_replace_malloc.c:366) by 0x41DE70: _free_vginfo (lvmcache.c:980) by 0x41DEDA: _drop_vginfo (lvmcache.c:998) by 0x41E854: _lvmcache_update_vgname (lvmcache.c:1238) by 0x41EF86: lvmcache_update_vgname_and_id (lvmcache.c:1360) by 0x441393: _text_read (text_label.c:329) by 0x442221: label_read (label.c:289) by 0x41CF92: lvmcache_label_scan (lvmcache.c:635) by 0x45B303: _vg_read_by_vgid (metadata.c:3342) by 0x45B4A6: lv_from_lvid (metadata.c:3381) by 0x41B555: lv_activation_filter (activate.c:1346) by 0x415868: do_activate_lv (lvm-functions.c:343) problematic line: dm_list_iterate_items_safe(info2, info3, &primary_vginfo->infos)	2011-03-29 21:34:18 +00:00
Zdenek Kabelac	7f0d89f8b4	Fix sending uninitilised bytes in cluster messages Fix 2 more functions sending cluster messages to avoid passing uninitilised bytes and compensate 1 extra byte attached to the message from the clvm_header.args[1] member variable.	2011-03-29 21:05:39 +00:00
Zdenek Kabelac	aaf92617b0	Fix -Wold-style-definition gcc warnings	2011-03-29 20:30:05 +00:00
Zdenek Kabelac	f77736cab5	Remove double braces Clang gives notice about possible confusion as commonly double bracces are used when some assignment is done inside them.	2011-03-29 20:19:03 +00:00
Jonathan Earl Brassow	60c10a45ce	s/MIRROR_NOTSYNCED/LV_NOTSYNCED/ - Flag will may refer to more than just mirrors	2011-03-29 12:51:57 +00:00
Alasdair Kergon	9c58641e74	Rename _check_version	2011-03-27 13:44:08 +00:00
Jonathan Earl Brassow	be226be635	Fix unhandled condition in _move_lv_segments If _move_lv_segments is passed a 'lv_from' that does not yet have any segments, it will screw things up because the code that does the segment copy assumes there is at least one segment. See copy code here: lv_to->segments = lv_from->segments; lv_to->segments.n->p = &lv_to->segments; lv_to->segments.p->n = &lv_to->segments; If 'segments' is an empty list, the first statement copies over the values, but the next two reset those values to point to the other LV's list structure. 'lv_to' now appears to have one segment, but it is really an ill-set pointer.	2011-03-25 22:02:27 +00:00
Jonathan Earl Brassow	58bdd1654b	Replace malloc with zalloc when creating segment_type's	2011-03-25 21:59:42 +00:00
Petr Rockai	5ef2808bc7	In some cases, we could end up with a mirrored LV without a MIRRORED flag. In other cases, the code could wind up removing wrong number of mirrors. In yet other cases, we could remove the right number of mirrors, but fail to respect the removal preferences (i.e. keep an image that was requested to be removed while removing an image that was requested to be kept). Under some circumstances, remove_mirror_images could also get stuck in an infinite loop. This patch should fix all of the above undesirable behaviours. Signed-off-by: Petr Rockai <prockai@redhat.com> Reviewed-by: Jonathan Brassow <jbrassow@redhat.com>	2011-03-24 12:28:02 +00:00
Milan Broz	52a5cd31c4	Mitigate some warnings if running as non-root user. LVM doesn't behave correctly if running as non-root user, there is warning when it detects it. Despite this, it produces many error messages, saying nothing. See https://bugzilla.redhat.com/show_bug.cgi?id=620571 This patch fixes two things: 1) Removes eror message from device_is_usable() which has no information value anyway (real warning is printed inside it). 2) it fixes device-mapper initialization, if we support core dm module autoload and device node is present, it should fail early and not try recreate existing and correct node. (non-root == permission denied here) N.B. In future code should support user roles, some more drastic checks in code are probably contraproductive now.	2011-03-18 12:17:57 +00:00
Zdenek Kabelac	b8ccce3500	Add missing \0 for grown debug object Attach \0 for proper char* display - otherwise somewhat random message could be displayed in debug more and read of unpredictable read of uninitilized memory values could happen.	2011-03-14 17:00:57 +00:00
Zdenek Kabelac	612e606392	Revert this commit This buffer allocation must have been problem somewhere else. (as sizeof() already has the 'extra' '\0' included). For now reverting this commit.	2011-03-13 23:18:30 +00:00
Zdenek Kabelac	844b75f4d6	Fix allocation of system_id As code uses strncpy(system_id, NAME_LEN) and doesn't set '\0' Fix it by always allocating NAME_LEN + 1 buffer size and with zalloc we always get '\0' as the last byte. This bug may trigger some unexpected behavior of the string operation code - depends on the pool allocator. FIXME: refactor this code to alloc_vg.	2011-03-13 23:05:48 +00:00
Zdenek Kabelac	e1cb521dd9	Use proper size of strncpy Avoid reading extra character if we expect to have there '\0'.	2011-03-13 23:01:08 +00:00
Zdenek Kabelac	c9c1730705	Fix buffer allocation size for uuid string We have 3 components and traling '\0' so allocate proper room for all of them. Problem was nicely hidden by allocation from pool and allocation aligment offset - so to trigger real problem with this one is actually hard.	2011-03-13 22:57:51 +00:00
Zdenek Kabelac	218f657794	Fix usage of readlink Return value of readlink limits valid string size. Characters after returned size present some garbage to printf. Fix it by placing '\0' on the return size value.	2011-03-13 22:52:16 +00:00
Peter Rajnoha	ff4479414c	Use format instance mempool where possible and adequate.	2011-03-11 15:10:16 +00:00
Peter Rajnoha	e8d4946ec7	Various cleanups for fid mem and ref_count changes. Missing free_vg on error_path in lvmcache_get_vg fn. Call destroy_instance only if the fid is not part of the vg in backup_read_vg fn (otherwise it's part of the VG we're returning and we definitely don't want to destroy it!).	2011-03-11 15:08:31 +00:00
Peter Rajnoha	2feb2a66fd	Call destroy_instance for any PVs found in VG structure during vg_free call. This is necessary for proper format instance ref_count support. We iterate over vg->pvs and vg->removed_pvs list and the ref_count is decremented and then it is destroyed if not referenced anymore.	2011-03-11 15:06:13 +00:00
Peter Rajnoha	84f48499a3	Add new free_pv_fid fn and use it throughout to free all attached fids. Since format instances will use own memory pool, it's necessary to properly deallocate it. For now, only fid is deallocated. The PV structure itself still uses cmd mempool mostly, but anytime we'd like to add a mempool in the struct physical_volume, we can just rename this fn to free_pv and add the code (like we have free_vg fn for VGs).	2011-03-11 14:56:56 +00:00
Peter Rajnoha	1307ddf4cf	Use only vg_set_fid and new pv_set_fid fn to assign the format instance. This is essential for proper format instance ref_count support. We must use these functions to set the fid everywhere from now on, even the NULL value!	2011-03-11 14:50:13 +00:00
Peter Rajnoha	293481107f	Make create_text_context fn static and move it inside create_instance fn. We'd like to use the fid mempool for text_context that is stored in the instance (we used cmd mempool before, so the order of initialisation was not a matter, but now it is since we need to create the fid mempool first which happens in create_instance fn). The text_context initialisation is not needed anywhere outside the create_instance fn so move it there.	2011-03-11 14:45:17 +00:00
Peter Rajnoha	a1bec4e685	Add mem and ref_count fields to struct format_instance for own mempool use. Format instances can be created anytime on demand and it contains metadata area information mostly (at least for now, but in the future, we may store more things here to update/edit in a PV/VG). In case we have lots of metadata areas, memory consumption will rise. Using cmd context mempool is not quite optimal here because it is destroyed too late. So let's use a separate mempool for format instances. Reference counting is used because fids could be shared, e.g. each PV has either a PV-based fid or VG-based fid. If it's VG-based, each PV has a shared fid with the VG - a reference to VG's fid.	2011-03-11 14:38:38 +00:00
Peter Rajnoha	56f5b12eed	Use new alloc_fid fn for common format instance initialisation.	2011-03-11 14:30:27 +00:00
Zdenek Kabelac	a6f38f9d6a	Missed merge fix in vg_validate patch	2011-03-10 22:39:36 +00:00
Zdenek Kabelac	027a55d0fb	Optimise _eat_space and _get_token Makes the code more readable and has a smaller number of memory accesses thus it's small optimisation as well. For _get_token() optimize number parsing. Check for '.' char only if it's not a digit. Move pointer incrementation into one place. For _eat_space() check only p->te for '\0' in skipping of comment line. Avoid check for '\0' when we know it is space. Also master while loop doesn't need checking p->tb for '\0'. We just need to check p->tb isn't already at the end of buffer. This could give 'extra' loop cycle if we are already there - but safes memory access in every other case.	2011-03-10 14:51:35 +00:00
Zdenek Kabelac	442dbf9ad8	Refactor code for _lv_postoder Add _lv_postorder_vg() - for calling _lv_postorder() for every LV from VG. We use this in 2 places - vg_mark_partial_lvs() and vg_validate() so make it as a one function. Benefit here is - to use only one cleanup code and avoid potentially duplicate scans of same LVs.	2011-03-10 14:40:32 +00:00
Zdenek Kabelac	4ee2b4965f	Use hash tables for validating names Accelerate validation loop by using lvname, lvid, pvid hash tables. Also merge pvl loop into one cycle now - no need to scan the list twice. List scan is stopped when dm_hash_insert fails. The error message with loop_counter1 is no longer provided - however the message has been misleading anyway.	2011-03-10 13:11:59 +00:00
Zdenek Kabelac	3019419e95	Refactor vg allocation code Create new function alloc_vg() to allocate VG structure. It takes pool_name (for easier debugging). and also take vg_name to futher simplify code. Move remainder of _build_vg_from_pds to _pool_vg_read and use vg memory pool for import functions. (it's been using smem -> fid mempool -> cmd mempool) (FIXME: remove mempool parameter for import functions and use vg). Move remainder of the _build_vg to _format1_vg_read	2011-03-10 12:43:29 +00:00
Alasdair Kergon	9cfdf8031e	Avoid possible endless loop in _free_vginfo when 4 or more VGs have same name.	2011-03-10 03:03:03 +00:00
Alasdair Kergon	2f25c320fb	Use empty string instead of /dev// for LV path when there's no VG. Don't allocate unused VG mempool in _pvsegs_sub_single.	2011-03-09 12:44:42 +00:00
Zdenek Kabelac	2bd6583a34	Use lvm_getpagesize wrapper	2011-03-06 17:52:07 +00:00
Milan Broz	affd9c086d	Fix hardcoded page size, fixing test fails with 8k page and new kernel.	2011-03-06 16:47:43 +00:00
Zdenek Kabelac	55f6627427	Fix reading of released memory lvseg_segtype_dup used memory pool vg memory pool for strind duplication. However this one gets released before reporting happens so the command like: pvs -o segtype prints data from already released memory pool. Thanks to the fact there is not much allocation happing after the VG is released, the memory stays unmodified and correct result is printed. Fix adds support for mempool passed parameter (like other similar query commands) and uses dm_report memory pool for string duplication.	2011-03-05 12:14:00 +00:00
Milan Broz	be3510b204	PE size overflows, on most architectures it is catch by "PE cannot be 0" but s390x unfortunately return something usable. Always use unit64 in inital parameter check.	2011-03-02 20:00:09 +00:00
Peter Rajnoha	15b9215534	Use a copy if moving an mda from pv fid to vg fid. We'll destroy the pv fid (with all mdas in it) after merging all pv mdas to a vg in _text_pv_setup fn, hence we need to use a copy here!	2011-03-02 10:23:29 +00:00
Peter Rajnoha	0b100565ae	Make add_metadata_area_to_pv/remove_metadata_area_from_pv static. No need to put these in format-text.h, it's not used anywhere else actually.	2011-03-02 10:19:14 +00:00
Milan Broz	cbedb99e4c	Fix some compile warnings on RHEL5 - returned char not needed to be explicitly const - warn if pipe() fails in clvmd (more fixes here needed for error paths...) - assign (and ignore) read() output in drain buffer	2011-03-01 20:17:56 +00:00
Milan Broz	0cb777d642	Rephrase backup message.	2011-02-28 20:50:01 +00:00
Zdenek Kabelac	36653e8903	Add fall through comments Add comments to switch case construct.	2011-02-28 19:53:03 +00:00
Peter Rajnoha	150e43a05c	Use pv->vg_name directly instead of pv->vg->name in _text_pv_write. This also prevents a possible segfault during an automatic repair when the PV does not belong to a VG anymore and we call pv_write_orphan.	2011-02-28 17:05:48 +00:00
Peter Rajnoha	3b97e8d643	Allow non-orphan PVs with two metadata areas to be resized. We allow writing non-orphan PVs only for resize now. The "orphan PV" assert in pv_write fn uses the "allow_non_orphan" parameter to control this assert. However, we should find a more elaborate solution so we can remove this restriction altogether (pv_write together with vg_write is not atomic, we need to find a safe mechanism so there's an easy revert possible in case of an error).	2011-02-28 13:19:02 +00:00
Alasdair Kergon	1a52fa6858	Fix check for log-only allocation in new alloc normal loop.	2011-02-27 01:16:52 +00:00
Alasdair Kergon	92ffcda183	Various changes to the allocation algorithms: Expect some fallout. There is a lot to test. Two new config settings added that are intended to make the code behave closely to the way it did before - worth a try if you find problems.	2011-02-27 00:38:31 +00:00
Peter Rajnoha	4b8f066c19	vgconvert is fixed now to work with the changes in metadata area handling - enable the tests. Add a small fix that preserves pe_start for lvm1 PVs when being converted. (this fix needs to be replaced with something more clever, but let's have this working now)	2011-02-25 14:12:14 +00:00
Peter Rajnoha	4a304dc1d8	Allow only orphan PVs to be resized even with two metadata areas.	2011-02-25 14:08:54 +00:00
Peter Rajnoha	f74bd57ec9	Revert the patch for vgconvert to work with recent changes in metadata area handling. This should work now with the help of the patch from previous commit.	2011-02-25 14:02:53 +00:00
Peter Rajnoha	38b0564cab	Read PV metadata information from cache if pv_setup called with pv->fid == vg->fid. If the PV is already part of the VG (so the pv->fid == vg->fid), it makes no sense to attach the mdas information from PV to a VG. Instead, we read new PV metadata information from cache and attach it to the VG fid.	2011-02-25 13:59:47 +00:00
Peter Rajnoha	ea4a41e961	Fix a bug in metadata location calculation, cleanup pv_add_metadata_area fn. This bug (a missing line) caused the 2nd MDA area location to be calculated incorrectly and it didn't fit the disk size properly. (https://www.redhat.com/archives/lvm-devel/2011-February/msg00127.html)	2011-02-25 13:50:02 +00:00
Peter Rajnoha	c901a92aa5	%ld -> PRIu64	2011-02-21 13:09:27 +00:00
Peter Rajnoha	9c0035c129	Fix metadata balance code to work with recent changes in metadata handling interface (with the changes in format_instance).	2011-02-21 12:33:16 +00:00
Peter Rajnoha	51aed1992f	Add old_uuid field to struct physical_volume so we can still reference a PV with its old UUID when we're changig it (the cache as well as metadata area index has the old uuid that we need to use to access the information!)	2011-02-21 12:31:28 +00:00
Peter Rajnoha	6bdc80743e	Fix vgconvert code to work with changes in metadata area handling and changes in format_instance. Add new 'vg_convert' function.	2011-02-21 12:29:21 +00:00
Peter Rajnoha	cb2396730a	Change pvresize code to work with new metadata handling interface and allow resizing a PV with two metadata areas.	2011-02-21 12:27:26 +00:00
Peter Rajnoha	17ad2b1115	Change pv_write code to work with the changes in metadata handling interface and changes in format_instance.	2011-02-21 12:26:27 +00:00
Peter Rajnoha	903d7db050	Remove unused _mda_setup fn. This functionality is covered by new pv_add_metadata_area fn.	2011-02-21 12:25:16 +00:00
Peter Rajnoha	94d91fdda1	Change the code throughout to use new pv_initialise and modified pv_setup fn. Change pv_create code to work with these changes together with using new pv_add_metadata_area fn to add metadata areas for a PV being created.	2011-02-21 12:24:15 +00:00
Peter Rajnoha	617b900d85	Separate new pv_initialise function out of the original pv_setup code. pv_initiliase initialises a new PV pv_setup sets up an existing PV with a VG	2011-02-21 12:20:18 +00:00
Peter Rajnoha	981895a860	Add new pv_remove_metadata_area interface function.	2011-02-21 12:17:54 +00:00
Peter Rajnoha	8d5d20a526	Add new pv_add_metadata_area interface function.	2011-02-21 12:17:26 +00:00
Peter Rajnoha	305816232d	Remove useless mdas parameter for pv_read (from now on, we store mdas in a format instance)	2011-02-21 12:15:59 +00:00
Peter Rajnoha	6e0b348d34	Add format instance support for pv_read code.	2011-02-21 12:13:40 +00:00
Peter Rajnoha	56280d0d3a	Initialise a new PV-based format instance for a PV that is being created.	2011-02-21 12:12:32 +00:00
Peter Rajnoha	f8b78ec613	Add vg_set_fid function to change VG format instance. This function also sets a reference to a new VG format instance for all PVs that are part of the VG so the PV-VG interconnection is consistent after the change.	2011-02-21 12:10:58 +00:00
Peter Rajnoha	c0c21864c6	Change the code throughout for recent changes in format_instance handling.	2011-02-21 12:07:03 +00:00
Peter Rajnoha	88129db5e1	Change create_instance to create PV-based as well as VG-based format instances. Add supporting functions to work with the format instance and metadata area structures stored within the format instance. Add support for simple indexing of metadata areas using PV id and mda order (for on-disk PV only for now, we can extend the indexing even for other mdas if needed - we only need to define a proper key for the index).	2011-02-21 12:05:49 +00:00
Peter Rajnoha	716c4ebe52	Change and generalise struct format_instance for PV and VG use.	2011-02-21 12:01:22 +00:00
Alasdair Kergon	a8d13f9499	Handle decimal digits with --units instead of ignoring them silently. Fix remaining warnings and compile with -Wpointer-arith.	2011-02-18 23:09:55 +00:00
Zdenek Kabelac	476ef1886f	Memory unlock allows 1 page difference As the kernel seems to be doing weird things during mlock -> munlock - allow 1 page locking difference without warning - and log just debug message for a 1 page difference. Allocation happens outside critical section probably during log_warn printing. Should make tests passing for now.	2011-02-18 14:51:04 +00:00
Zdenek Kabelac	aec2115410	Const fixing Fixing some const warnings - with API change in: int vg_extend(struct volume_group vg, int pv_count, const char const pv_names, Change is needed - as lvm2api expects const behaviour here. So vg_extend() is doing local strdup for unescaping. skip_dev_dir return const char from const char* vg_name. Rest of the patch is cleanup of related warnings. Also using dm_report_filed_string() API change to simplify casting in _string_disp and _lvname_disp.	2011-02-18 14:47:28 +00:00
Zdenek Kabelac	4ebc6404ee	Void* arithmetic replaced with char*	2011-02-18 14:34:41 +00:00
Zdenek Kabelac	ab8b85fb80	Fix !DEVMAPPER_SUPPORT build Fix build when devmapper is disabled.	2011-02-18 14:29:39 +00:00
Zdenek Kabelac	44376ffe22	Remove fs_unlock after failed suspend Explicit fs_unlock() after failed suspend is not need - as it will happen automatically with nearest lv_info() or vg_unlock().	2011-02-18 14:26:31 +00:00
Zdenek Kabelac	b1bcff7424	Critical section New strategy for memory locking to decrease the number of call to to un/lock memory when processing critical lvm functions. Introducing functions for critical section. Inside the critical section - memory is always locked. When leaving the critical section, the memory stays locked until memlock_unlock() is called - this happens with sync_local_dev_names() and sync_dev_names() function call. memlock_reset() is needed to reset locking numbers after fork (polldaemon). The patch itself is mostly rename: memlock_inc -> critical_section_inc memlock_dec -> critical_section_dec memlock -> critical_section Daemons (clmvd, dmevent) are using memlock_daemon_inc&dec (mlockall()) thus they will never release or relock memory they've already locked memory. Macros sync_local_dev_names() and sync_dev_names() are functions. It's better for debugging - and also we do not need to add memlock.h to locking.h header (for memlock_unlock() prototyp).	2011-02-18 14:16:11 +00:00
Zdenek Kabelac	794e94fe16	Replace PV_MIN_SIZE with function pv_min_size() Add configurable option to define minimal size of of block device usable as a PV. pv_min_size() is added to lvm-globals and it's being initialized through _process_config. Macro PV_MIN_SIZE is unused and removed. New define DEFAULT_PV_MIN_SIZE_KB is added to lvm-global and unlike PV_MIN_SIZE it uses KB units. Should help users with various slow devices attached to the system, which cannot be easily filtered out (like FDD on /dev/sdX): https://bugzilla.redhat.com/show_bug.cgi?id=644578	2011-02-18 14:11:22 +00:00
Zdenek Kabelac	9dd091e4f4	Support 64bit ints in config	2011-02-18 14:08:22 +00:00
Jonathan Earl Brassow	c054e7cc56	Fix for bug 677739: removing final exclusive cmirror snapshot, results in clvmd deadlock When a logical volume is activated exclusively in a cluster, the local (non-cluster-aware) target is used. However, when creating a snapshot on the exclusive LV, the resulting suspend/resume fails to load the appropriate device-mapper table - instead loading the cluster-aware target. This patch adds an 'exclusive' parameter to the pertinent resume functions to allow for the right target type to be loaded.	2011-02-18 00:36:04 +00:00
Petr Rockai	21849a8587	Fix an lv_postorder bug where it failed to clear temporary flags, making it impossible to use twice with the same LV(s). Discovered by Milan.	2011-02-14 19:27:05 +00:00
Zdenek Kabelac	2fdd451b19	Fix CRC32 calculation on big endian CPU Fix regresion from 2.02.75 speedup - so currently crc32 is a little bit more complicated on big-endian CPU as the uint32_t needs to be shifted on here.	2011-02-08 12:41:08 +00:00
Zdenek Kabelac	d0df875d48	Add configure option --with-device-nodes-on Make configurable default behaviour how to deal with device node creates. With udev system natural options should be 'resume'. For older systems where user expect there is node in /dev/mapper immediately after dmsetup create --notable - use 'create' FIXME: Code needs fixing passing this flag through udev cookie.	2011-02-04 22:17:54 +00:00
Alasdair Kergon	6c7b95f281	pre-release	2011-02-04 22:07:43 +00:00
Jonathan Earl Brassow	27ff8813da	Allow snapshots in a cluster as long as they are exclusively activated. In order to achieve this, we need to be able to query whether the origin is active exclusively (a condition of being able to add an exclusive snapshot). Once we are able to query the exclusive activation of an LV, we can safely create/activate the snapshot. A change to 'hold_lock' was also made so that a request to aquire a WRITE lock did not replace an EX lock, which is already a form of write lock.	2011-02-04 20:30:17 +00:00
Zdenek Kabelac	09d288535b	Remove extra sync calls. Remove temporaly added fs_unlock() calls to fix clmvd usablity. Now when the message passing is properly working - they are no longer needed. Simplify no_locking check for VG unlock - as message is always send for all targets - clustered & non-clustered.	2011-02-04 19:21:47 +00:00
Zdenek Kabelac	fa6a525c2d	Use cluster-wide message to request device name sync Thanks to CLVMD_CMD_SYNC_NAMES propagation fix the message passing started to work. So starts to send a message before the VG is unlocked. Removing also implicit sync in VG unlock from clmvd as now the message is delievered and processed in do_command(). Also add support for this new message into external locking and mask this event from further processing.	2011-02-04 19:18:16 +00:00
Zdenek Kabelac	f5f6dcbc62	Fix operation node stacking for consecutive dm ops With the ability to stack many operations in one udev transaction - in same cases we are adding and removing same device at the same time (i.e. deactivate followed by activate). This leads to a problem of checking stacked operations: i.e. remove /dev/node1 followed by create /dev/node1 If the node creation is handled with udev - there is a problem as stacked operation gives warning about existing node1 and will try to remove it - while next operation needs to recreate it. Current code removes all previous stacked operation if the fs op is FS_DEL - patch adds similar behavior for FS_ADD - it will try to remove any 'delete' operation if udev is in use. For FS_RENAME operation it seems to be more complex. But as we are always stacking FS_READ_AHEAD after FS_ADD operation - should be safe to remove all previous operation on the node when udev is running. Code does same checking for stacking libdm and liblvm operations. As a very simple optimization counters were added for each stacked ops type to avoid unneeded list scans if some operation does not exists in the list. Enable skipping of fs_unlock() (udev sync) if only DEL operations are staked. as we do not use lv_info for already deleted nodes.	2011-02-04 19:14:39 +00:00
Zdenek Kabelac	135af49da5	Increase hash table size to 1024 lv names and 64 pv uuids	2011-02-03 16:03:13 +00:00
Zdenek Kabelac	3a00204a23	Remove fs_unlock from lv_resume path Keep it within clvmd until message for SYNC starts to work.	2011-02-03 01:58:20 +00:00
Zdenek Kabelac	16f000bcb4	Fix wipe size when seting up mda.	2011-02-03 01:41:03 +00:00
Zdenek Kabelac	401a40d941	Do not check for open_count when not needed. Disable open_count checking in lv_info it it's not used. Fix previous commit (comment out unsable code for now).	2011-02-03 01:24:46 +00:00
Zdenek Kabelac	56cab8cc03	Synchronize with udev for lv_info In case the open_count is requested via lv_info - check if there are any udev operations in-progress - and wait for them before checking for lv_info	2011-02-03 01:16:35 +00:00
Alasdair Kergon	c221ae8cdb	a few more comments	2011-02-02 23:57:48 +00:00
Alasdair Kergon	12e36e7ea7	Allow CLVMD_CMD_SYNC_NAMES to be propagated around the cluster if requested.	2011-02-02 23:39:39 +00:00
Zdenek Kabelac	fccfa9e929	Better fix for no-locking udev sync and clvmd This is better way how to fix clustered synchronization with udev. As the code for message passing needs fixed - put currently fs_unlock() after every active/deactive command in clvmd to ensure nodes are properly created in time.	2011-02-02 20:04:39 +00:00
Zdenek Kabelac	9dc3afb1fa	Revert wrong fix for nolock locking missing fs_unlock Patch was wrond and introduced recursive lock_vol Reverting it.	2011-02-02 13:34:00 +00:00
Jonathan Earl Brassow	e7cb9788c4	fix bad 'strcmp's in 'decode_lock_type' - missing !'s There was no effect from having this wrong yet, because the tree of callers only ever cared about the answer to the first condition (!response), which determines whether a lock is held or not. Correct responses, however, are needed soon.	2011-02-01 17:31:40 +00:00
Zdenek Kabelac	116cbc267c	Fix udev synchronization for no-locking mode Instead of implicitly syncing udev operation in clustered and file locking code - call synchronization directly in lock_vol() when the operation unlocks VG The problem is missing implicit fs_unlock() in the no_locking code. This is used with --sysinit on read-only filesystem locking dir. In this case vgchange -ay could exit before all udev nodes are properly synchronised and may cause problems with accessing such node right after vgchange --sysinint command is finished. Add test case for vgchange --sysinit.	2011-01-31 19:52:40 +00:00
Zdenek Kabelac	7f8badfe5e	Use memcpy and add error message strncpy (which check each byte for \0) is not need as we always copy the length size - so using memcpy is a bit cheaper. Add missing log_error message for failed allocation.	2011-01-28 10:19:00 +00:00
Zdenek Kabelac	a5c6acf22a	Skip NULL check before dm_free dm_free checks for NULL itself.	2011-01-28 10:16:04 +00:00
Zdenek Kabelac	65fc4dae3a	Avoid rebuilding of uuid validation table Small CPU relax...	2011-01-28 10:14:08 +00:00
Mike Snitzer	3e3591904b	Improve lvcreate "insufficient extents" errors to "insufficient free space".	2011-01-28 02:58:00 +00:00
Alasdair Kergon	a1d4ec1d6e	Use O_DIRECT when reading block devices.	2011-01-27 00:21:37 +00:00
Alasdair Kergon	cef065f63f	Fix lvchange --test to exit cleanly.	2011-01-24 14:19:05 +00:00
Zdenek Kabelac	6184b70cf7	Do not scan devices unnecessarily for reseting error counter For reseting error counter use directly btree cached elements and do not create whole dev_iterator.	2011-01-17 15:16:55 +00:00
Zdenek Kabelac	96eda8b9b3	Skip unnecessary lock_vol() call after volume deactivation Improve condition within lock_vol so we are not calling extra unlock if the volume just has been deactivated. Patch uses lck_type and replaces negative 'and' condition to more readable 'or' condition. Few missing strace traces added.	2011-01-13 14:56:17 +00:00
Zdenek Kabelac	b1b38215ba	Add exec_cmd paramater sync_needed As sync_local_dev_names() cannot be called within activation context, add new parametr which allows to select if the sync call is needed before executing new command.	2011-01-13 14:51:32 +00:00
Alasdair Kergon	a8de276520	Replace fs_unlock by sync_local_dev_names to notify local clvmd. (2.02.80) Introduce sync_local_dev_names and CLVMD_CMD_SYNC_NAMES to issue fs_unlock.	2011-01-12 20:42:50 +00:00
Alasdair Kergon	b84bb4d9c2	add fio	2011-01-12 15:28:33 +00:00
Jonathan Earl Brassow	6a095ca99f	s/log_verbose/log_error/ - Increase log level on error message.	2011-01-11 17:21:01 +00:00
Jonathan Earl Brassow	025e69a15a	Add disk to mirrored log type conversion.	2011-01-11 17:05:08 +00:00
Zdenek Kabelac	22b06cdcce	Fix missing declaration for fs_unlock	2011-01-10 19:49:42 +00:00
Zdenek Kabelac	ad50450a22	Avoid cookie sharing between forked processes Before fork, ensure cookie is reset so it's not shared between processes.	2011-01-10 19:31:02 +00:00
Zdenek Kabelac	937a21f0d2	Speedup consequent activation calls Stop calling fs_unlock() from lv_de/activate(). Start using internal lvm fs cookie for dm_tree. Stop directly calling dm_udev_wait() and dm_tree_set/get_cookie() from activate code - it's now called through fs_unlock() function. Add lvm_do_fs_unlock() Call fs_unlock() when unlocking vg where implicit unlock solves the problem also for cluster - thus no extra command for clustering environment is required - only lvm_do_fs_unlock() function is added to call lvm's fs_unlock() while holding lvm_lock mutex in clvmd. Add fs_unlock() also to set_lv() so the command waits until devices are ready for regular open (i.e. wiping its begining). Move fs_unlock() prototype to activation.h to keep fs.h private in lib/activate dir and not expose other functions from this header.	2011-01-10 14:02:30 +00:00
Zdenek Kabelac	f6fdfd56e4	Add internal fs cookie Add functions for handling internal lvm cookie used for all dm_tree operations until fs_unlock is called.	2011-01-10 13:44:39 +00:00
Zdenek Kabelac	2dd15068fb	Cache config_tree Start to use config_tree for cache just like vgmetadata. When vgmetadata are erased destroy its cached config tree.	2011-01-10 13:15:57 +00:00
Zdenek Kabelac	6feecf76d4	Change import_vg_from_buffer to use config_tree Change function import_vg_from_buffer() to import_vg_from_config_tree(). Instead of creating config tree inside the function allow config tree to be passed as parameter - usable later for caching.	2011-01-10 13:13:42 +00:00
Alasdair Kergon	f11781c50e	Using Fedora 14's autoreconf.	2011-01-07 14:38:34 +00:00
Zdenek Kabelac	b2c682e462	Fix memory leak in filter creation error path If some allocation for peristent filter fails its memory reference was lost, fix it by calling filter's destructor. Fix log_error messages for failing allocation.	2011-01-06 15:29:24 +00:00
Zdenek Kabelac	ff4a77c5ca	Intentionaly ignore result from get_config_uint32	2011-01-06 15:25:07 +00:00
Zdenek Kabelac	2d6e83ea19	Check result of dm_snprintf for error	2011-01-05 15:10:30 +00:00
Zdenek Kabelac	5fc79ef6dc	Add sys_debug loging for unlink This unlink intentionally silently ignores any errors. It's still worth to trace its error status in debug mode.	2011-01-05 15:06:10 +00:00
Zdenek Kabelac	006e5fa0ea	Add missing error path tests	2011-01-05 14:03:36 +00:00
Zdenek Kabelac	1936d75b3c	Return PERCENT_INVALID for error case If the percent value could not be determined return PERCENT_INVALID. Indent function with tabs.	2011-01-05 12:33:51 +00:00
Zdenek Kabelac	a5d006d515	Remove unused variable mirr_state and its assignment	2011-01-05 12:27:56 +00:00
Zdenek Kabelac	0ddb15964a	Remove check for existance of vg pointer Checking for vg being != NULL in this place is not needed. Pointer vg is already dereferced in this function above this code line. Also this internal function _read_pv is always called with valid 'vg' pointer.	2010-12-22 15:44:09 +00:00
Zdenek Kabelac	2ae2ca89bf	Add backtraces for backup and backup_remove fail paths	2010-12-22 15:36:41 +00:00
Zdenek Kabelac	bd43da4f9d	Hide unused code into if 0 Make it obvious for lcov coverage and static analyzis we are not interested in this piece of code.	2010-12-22 15:32:15 +00:00
Zdenek Kabelac	1102378e1c	Add backtraces for archive and backup_locally If archive or back_locally fails - add stack trace.	2010-12-22 13:45:33 +00:00
Zdenek Kabelac	b7149bbe45	Add missing test for reallocation error.	2010-12-20 14:38:22 +00:00
Zdenek Kabelac	446e4a6a79	Verbose log old_umask value Use old_umask value and print its content through verbose log.	2010-12-20 14:34:49 +00:00
Zdenek Kabelac	952cd45167	Add internal error if pointer is uninitialized Add simple check for existance of 'pl' and printer internal error message if device is missing instead of plain crash.	2010-12-20 14:20:52 +00:00
Zdenek Kabelac	4675e4f17d	Remove unused variable label Variable 'label' is unused in _format1_pv_write().	2010-12-20 14:06:33 +00:00
Zdenek Kabelac	f2554b9d2a	Remove dead assignment of segh Variable 'segh' is never read again after this assignment.	2010-12-20 14:04:43 +00:00
Zdenek Kabelac	f7e7f3e3ed	Add checks for allocation errors in config node clonning. Add checks for clonning allocation a fail-out when something is not allocated correctly. Also move var declaration to the begining of the function and fix log_error messages.	2010-12-20 13:53:10 +00:00
Zdenek Kabelac	9376ec18cd	Fix error path if regex engine cannot be created in _build_matcher(). Fix only 'stack' printing with full function error exit.	2010-12-20 13:45:39 +00:00
Zdenek Kabelac	9b30dfb967	Use const char * for name and old_name in vg Switch to use const char pointers to avoid changes of these structure members and having better control over, were these members could be modified.	2010-12-20 13:40:46 +00:00
Zdenek Kabelac	d40d166f91	Switch void* to char* arithmetic	2010-12-20 13:37:26 +00:00
Zdenek Kabelac	9d9de35dca	Remove const usage from destroy callbacks As const segment_type or const format_type are never released use their non-const version and remove const downcast from dm_free calls. This change fixes many gcc warnings we were getting from them.	2010-12-20 13:32:49 +00:00
Zdenek Kabelac	303923fbf1	Use const char* const * for dm_regex_create() Change API interface to accept even completely const array patterns. This should present no change for libdm users and allows to pass pattern arrays without cast to const char **.	2010-12-20 13:23:11 +00:00
Zdenek Kabelac	ba96eb24fa	Some const cleanups Minor const warning fixes and internal API updates.	2010-12-20 13:19:13 +00:00
Zdenek Kabelac	760d1fac55	Add more strict const pointers around config tree To have better control were the config tree could be modified use more const pointers and very carefully downcast them back to non-const (for config tree merge).	2010-12-20 13:12:55 +00:00
Alasdair Kergon	22bb69eb99	Fix device.c #include to ensure 64-bit fopen64 use. (2.02.51) (robbat2)	2010-12-15 12:49:55 +00:00
Petr Rockai	8961b1d503	Add getters for copy_percent and snap_percent to the lvm2app API.	2010-12-14 23:20:58 +00:00
Petr Rockai	ebfe96cad5	Add further consistency checking to vg_validate, ensuring that all segment areas point to LVs or PVs that are listed in the respective VG.	2010-12-14 17:51:09 +00:00
Petr Rockai	75b2f3507a	Add a validation step for pvmoveN internal LVs to vg_validate.	2010-12-14 17:07:35 +00:00
Milan Broz	dd1f2c0959	Update configure.	2010-12-13 11:03:10 +00:00
Peter Rajnoha	7dfce0e467	Add new dm_prepare_selinux_context fn to libdevmapper and use it throughout. Detect existence of new SELinux selabel interface during configure. Use new dm_prepare_selinux_context instead of dm_set_selinux_context. We should set the SELinux context before the actual file system object creation. The new dm_prepare_selinux_context function sets this using the selabel_lookup fn in conjuction with the setfscreatecon fn. If selinux/label.h interface (that should be a part of the selinux library) is not found during configure, we fallback to the original matchpathcon function instead.	2010-12-13 10:43:56 +00:00
Alasdair Kergon	acb037657c	Fix scanning of VGs without in-PV mdas. Set cmd->independent_metadata_areas if metadata/dirs or disk_areas in use. - Identify and record this state. Don't skip full scan when independent mdas are present even if memlock is set. - Clusters and OOM aren't supported, so no problem doing the proper scans. Avoid revalidating the label cache immediately after scanning. - A simple optimisation. Support scanning for a single VG in independent mdas. - Not used by the fix but I left it in anyway as later patches might use it.	2010-12-10 22:39:52 +00:00
Alasdair Kergon	2b82bd79f5	Rename vg_release to free_vg.	2010-12-08 20:50:48 +00:00
Alasdair Kergon	e8bed35ddf	Cope better with an undefined target_percent operation in _percent_run.	2010-12-08 19:26:35 +00:00
Zdenek Kabelac	54fca7b1ca	Remove reset of vg->vgmem pointer as it is access of already release memory This reset of vgmem pointer causes access of already released memory. (_vg_make_handle allocates vg from vgmem pool itself - which is a bit tricky) Interestingly this memory fault was missed by our test suite.	2010-12-08 10:45:37 +00:00
Zdenek Kabelac	98e6fdec8b	Check str_list_add() success Report error if str_list_add fails.	2010-12-01 13:05:06 +00:00
Zdenek Kabelac	414813e349	Check lv_info() success Add log_error message for lv_info failure and exit from futher processing. Replace 'leg' occurence in debug message with 'image' which is used in other messages.	2010-12-01 13:01:36 +00:00
Zdenek Kabelac	166597d998	Add backtraces for errors Add stack; backtraces when error is reported from dev_set() or dev_close_immediate().	2010-12-01 12:56:39 +00:00
Zdenek Kabelac	e3552d738c	Check result of vginfo_from_vgname Check for some potential internal error.	2010-12-01 10:39:28 +00:00
Zdenek Kabelac	2937b51eaa	Fallback to full rescan for missing device Fix bug when NULL could have been passsed as 'data' to _add_pv_to_list() if 'dev' is NULL. Now it fallbacks to complete scan.	2010-12-01 10:33:55 +00:00
Zdenek Kabelac	bf8ea32876	Remove unneeded test for NULL Remove check for system_id (it is defined as int8_t[], so cannot be NULL).	2010-11-30 22:57:35 +00:00
Zdenek Kabelac	81e606ab2c	Remove check for lv is NULL 'lv' is deferenced in the begining of the function so any check later is not helpful. Parameters for dev_manager_transien() are marked as nonnull.	2010-11-30 22:28:06 +00:00
Zdenek Kabelac	14caa4a2d0	Add missing test for failed pool allocation Add test for NULL from dm_poll_create. Reorder dm_pool_destroy() before file close and add label out:. Avoid leaking file descriptor if the allocation fails.	2010-11-30 22:23:35 +00:00
Petr Rockai	8191fe4f4a	Refactor the percent (mirror sync, snapshot usage) handling code to use fixed-point values instead of a combination of a float value and an enum.	2010-11-30 11:53:31 +00:00
Petr Rockai	97e8048e05	Avoid the automatic MISSING_PV recovery path in commands with special MISSING_PV handling (cmd->handles_missing_pvs is set).	2010-11-30 11:15:54 +00:00
Alasdair Kergon	1415afcdba	Fix memory leak when VG allocation policy in metadata is invalid. Ignore unrecognised allocation policy found in metadata instead of aborting. Fix another missing vg_release() in _vg_read_by_vgid.	2010-11-29 18:35:37 +00:00
Zdenek Kabelac	21ba805499	Fix memory leak in error path Nicely hidden memory leak in outf macro error path. This macro is using out_text() and does automagical return_0. That would leak tag_buffer allocated memory. As there was same code for tags output - create _out_tags() function.	2010-11-29 12:19:58 +00:00
Zdenek Kabelac	dce59eb407	Remove unused 'i' in _pv_analyze_mda_raw 'i' is unused in the function - remove it.	2010-11-29 11:16:58 +00:00
Zdenek Kabelac	73e3226012	Remove dead assignment in _lock_for_cluster 'saved_errno' is not read from this initialization and before its usage is assigned again before _cluster_free_request() call.	2010-11-29 11:13:12 +00:00
Zdenek Kabelac	201222ebad	Reset vg pointer after release Set vg to NULL after releasing it as the following memlock() test may lead to goto for the second call of vg_release() with the already released vg pointer.	2010-11-29 11:08:14 +00:00
Zdenek Kabelac	99aacef51c	Fix check for empty system_dir Fixing check for zero length system_dir string.	2010-11-29 10:58:32 +00:00
Petr Rockai	0bc382eae4	All 'size' values of lvm2app properties should be in bytes. Fix 'seg_size' to return bytes. Signed-off-by: Dave Wysochanski <wysochanski@pobox.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-11-25 14:39:02 +00:00
Zdenek Kabelac	e8ec0ba2e3	Fix resource leak of dlopened pointer Add missing dlclose in _init_formats() error path. Use return_0 to print stack trace from the call.	2010-11-24 09:34:34 +00:00
Zdenek Kabelac	0a178d853a	Add missing closedir() - fixes resource leak	2010-11-23 15:28:54 +00:00
Zdenek Kabelac	7d11d708d8	Move va_end() so it is also used before error path return	2010-11-23 15:08:57 +00:00
Alasdair Kergon	728074ac83	Suppress 'No PV label' message when removing several PVs without mdas.	2010-11-23 01:55:53 +00:00
Petr Rockai	4543dac58e	Add the macro and specific 'get' functions for pvsegs. Signed-off-by: Dave Wysochanski <wysochanski@pobox.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-11-17 20:11:27 +00:00
Petr Rockai	c1abd569f2	Add the macro and specific 'get' functions for lvsegs. Signed-off-by: Dave Wysochanski <wysochanski@pobox.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-11-17 20:08:14 +00:00
Petr Rockai	cbff112651	Make value.string const char *, in properties.h, to fix a warning introduced by the previous patch set.	2010-11-17 19:50:15 +00:00
Petr Rockai	fd82d8c129	Add generic infrastructure to internal library to 'set' a property. Similar to 'get' property internal functions. Add specific 'set' function for vg_mda_copies. Signed-off-by: Dave Wysochanski <wysochanski@pobox.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-11-17 19:15:10 +00:00
Alasdair Kergon	10955b8289	Remove tag length restriction and allow / = ! : # & characters.	2010-11-17 10:19:29 +00:00
Alasdair Kergon	f8452d8cfd	Support repetition of --addtag and --deltag arguments. Add infrastructure for specific cmdline arguments to be repeated in groups. Split the_args cmdline arguments and values into arg_props and arg_values.	2010-11-11 17:29:05 +00:00
Zdenek Kabelac	64dff85ce4	Preserve const for char pointer Keep char pointers 'const' (introduced with cling commit).	2010-11-11 12:32:33 +00:00
Alasdair Kergon	eb82bd0525	Extend cling allocation policy to recognise PV tags (cling_by_tags). Add allocation/cling_tag_list to lvm.conf.	2010-11-09 12:34:40 +00:00
Peter Rajnoha	f7e3a19f75	Clarify error messages when activation fails due to activation filter use.	2010-11-05 18:18:11 +00:00
Zdenek Kabelac	2955b913ea	Use new status code from fsadm check Patch updates exec_cmd() and adds 3rd parameter with pointer for status value, so caller might examine returned status code. If the passed pointer is NULL, behavior is unmodified. Patch allows to confinue with lvresize if the failure from fsadm check is caused by mounted filesystem as many of filesystem resize tools do support online filesystem resize. (originally user had to use flag '-n' to bypass this filesystem check)	2010-11-01 14:17:35 +00:00
Zdenek Kabelac	d0604a856d	Macro uninitialized_var gives warnings in static analysis Deactivate uninitialized_var() macro for clang static analysis.	2010-10-26 10:04:34 +00:00
Zdenek Kabelac	419d5219cb	Fix NULL pointer dereference for too large MDA error path Replace dereference of NULL vg with passed vgname to the function _vg_read_raw_area() in the error path for too large MDA.	2010-10-26 09:13:13 +00:00
Zdenek Kabelac	962ebfe4b4	Remove bufused for calculation As bufused is assigned 0 in preceding source line clang Idempotent operation	2010-10-26 08:53:25 +00:00
Dave Wysochanski	a90221d824	Add 'is_integer' flag into internal lvm_property_type. Add 'is_integer' flag similar to 'is_string'. Suggested in review by Petr Rockai.	2010-10-25 14:08:32 +00:00
Alasdair Kergon	2aa06d73ca	pre-release	2010-10-25 13:54:29 +00:00
Zdenek Kabelac	3f329bd02c	Use const config node	2010-10-25 13:38:11 +00:00
Zdenek Kabelac	d35c864521	Fix constness warning Keep using const pointers.	2010-10-25 13:36:09 +00:00
Zdenek Kabelac	91e56ffb29	Fix constness warning for _vg_read_by_vgid() uuid usage	2010-10-25 13:35:13 +00:00
Zdenek Kabelac	faa1268182	Fix constness warnings After update of device_from_pvid() API fix constness warnings Also fix info_from_pvid() constness warning for char* usage.	2010-10-25 13:33:42 +00:00
Zdenek Kabelac	710784945c	Use 'const' struct id *pvid for device_from_pvid() Update interface for device_from_pvid and use const pointer.	2010-10-25 13:02:26 +00:00
Zdenek Kabelac	aa4f87d6e3	Switch to char* arithmetic from void*	2010-10-25 13:00:35 +00:00
Alasdair Kergon	eacd3a0916	fix header #defines	2010-10-25 12:01:59 +00:00
Alasdair Kergon	b83af51668	Add global/metadata_read_only to use unrepaired metadata in read-only cmds.	2010-10-25 11:20:54 +00:00
Alasdair Kergon	727d065f6e	restrict last checkin to devs consisting entirely of error target	2010-10-25 10:37:34 +00:00
Mike Snitzer	06808d3357	Never scan a device which is using the error target A merged snapshot's DM device is made to use the "error" target as part of lvm's transaction to merge a snapshot. This snapshot merge use-case aside, any device using the error target shouldn't be scanned.	2010-10-24 17:36:58 +00:00
Dave Wysochanski	12873010e5	Add lv_get_property() internal lvm function.	2010-10-21 18:51:16 +00:00
Dave Wysochanski	60fc088d70	Rename fields in lvm_property_type. Based on review comments, rename a few fields in lvm_property_type. In particular, change 'is_writeable' to 'is_settable', which is more intuitive to the intent of the bitfield (a 'set' function exists for this field/property). Also, remove the char array for 'id' - unnecessary as we can just use the string passed in to do the strcmp. Finally rename the union members from n_val to 'integer' and 's_val' to 'string'.	2010-10-21 14:49:43 +00:00
Dave Wysochanski	d53d92f2e1	Add lv_read_ahead and lv_kernel_read_ahead 'get' functions.	2010-10-21 14:49:31 +00:00
Dave Wysochanski	f1fc310730	Refactor and add code for (lv) 'lv_origin' get function.	2010-10-21 14:49:20 +00:00
Dave Wysochanski	6103254393	Refactor and add code for (lv) 'lv_name' get function.	2010-10-21 14:49:10 +00:00
Zdenek Kabelac	f7311db64f	Fix strict-aliasing compile warning in partition table scanning	2010-10-20 15:07:30 +00:00
Petr Rockai	a341cab721	Implement automatic snapshot extension with dmeventd, and add two new options to lvm.conf in the activation section: 'snapshot_autoextend_threshold' and 'snapshot_autoextend_percent', that define how to handle automatic snapshot extension. The former defines when the snapshot should be extended: when its space usage exceeds this many percent. The latter defines how much extra space should be allocated for the snapshot, in percent of its current size.	2010-10-15 16:28:14 +00:00
Zdenek Kabelac	d1ad03efce	Speedup memory un/locking Move the call of find_config_tree_node() from inner loop to outer section of maps scanning.	2010-10-15 09:48:23 +00:00
Jonathan Earl Brassow	2c33c8b80c	Fix for bug 637936: killing both redundant logs causes deadlock Problem: When both legs of a mirrored log fail, neither the log nor the parent mirror can proceed. The repair code must be careful to replace the log with an error target before operating on the parent - otherwise, the parent can get stuck trying to suspend because it can't push through any writes. The steps to replace the log device with an error target were incomplete and resulted in the replacement not happening at all! The code originally had all the necessary logic to complete the replacement task, but was pulled out in a effort to clean-up that section of code, while fixing another bug: <offending commit msg> In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). </offending commit msg> http://sources.redhat.com/cgi-bin/cvsweb.cgi/LVM2/lib/metadata/mirror.c?cvsroot=lvm2&f=h#rev1.130 I've reverted the "clean-up" changes associated with that fix, but not what that commit was actually fixing. Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-10-14 20:03:12 +00:00
Mike Snitzer	9443b5d4cd	Convey need for snapshot-merge target in lvconvert error message and man page. Add ->target_name to segtype_handler to allow a more specific target name to be returned based on the state of the segment. Result of trying to merge a snapshot using a kernel that doesn't have the snapshot-merge target: Before: # lvconvert --merge vg/snap Can't expand LV lv: snapshot target support missing from kernel? Failed to suspend origin lv After: # lvconvert --merge vg/snap Can't process LV lv: snapshot-merge target support missing from kernel? Failed to suspend origin lv Unable to merge LV "snap" into it's origin.	2010-10-13 21:26:37 +00:00
Petr Rockai	976b95d929	Limit repeated accesses to broken devices. Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-10-13 15:40:38 +00:00
Petr Rockai	042312952c	Give correct error message when creating a too-small snapshot (BZ 587063)	2010-10-13 13:52:53 +00:00
Zdenek Kabelac	7c9fd3ea84	Don't use floor() in _bitset_with_random_bits Use _even_rand() function instead of floor() in _bitset_with_random_bits(). floor() function is missing in dietlibc (on architectures other than x86). Moreover using floor() to clip rand results does not assure even result distribution. _even_rand() uses integer arithmetic only and is designed to return evenly distributed results. > Looks OK to me. It took a while to decipher what is the exact meaning of > the loop in _even_rand (to a non-pseudorandomness-expert) but I am > fairly comfortable with it now. If I understand this correctly, it > rejects numbers that come from an "incomplete" slice of the RAND_MAX > space (considering the number space [0, RAND_MAX] is divided into some > "max"-sized slices and at most a single smaller slice, between [n*max, > RAND_MAX] for suitable n -- numbers from this last slice are discarded > because they could distort the distribution in favour of smaller > numbers). Signed-off-by: Przemyslaw Iskra <sparky <at> pld-linux.org> Reviewed-by: Petr Rockai <prockai <at> redhat.com>	2010-10-13 12:18:53 +00:00
Dave Wysochanski	f70468ce0b	Fix lv_modules_dup segfault.	2010-10-12 17:09:23 +00:00
Petr Rockai	98351ffbd5	Make lvconvert respect --yes/--force in the inactive log conversion prompt. Fixes BZs 642055, 621281. Patch by Taka. Signed-off-by: Takahiro Yasui <tyasui@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-10-12 16:41:17 +00:00
Dave Wysochanski	2eba846043	Refactor and add code for (lv) 'modules' get function.	2010-10-12 16:13:06 +00:00
Dave Wysochanski	d88090b0ae	Refactor and add code for (lv) 'mirror_log' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:50 +00:00
Dave Wysochanski	40c6c80723	Refactor and add code for (lv) 'lv_kernel_{major\|minor}' get functions. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:33 +00:00
Dave Wysochanski	e27833fb9c	Refactor and add code for (lv) 'convert_lv' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:18 +00:00
Dave Wysochanski	af579eccc3	Refactor and add code for (lv) 'move_pv' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:02 +00:00
Dave Wysochanski	29636f38e3	Refactor and add code for (lv) 'origin_size' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:11:48 +00:00
Dave Wysochanski	802e252b29	Refactor and add code for (lv) 'lv_path' get function.	2010-10-12 16:11:34 +00:00
Dave Wysochanski	a88a278698	Add some lv 'get' functions that require no refactoring. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:11:20 +00:00
Dave Wysochanski	637ac19e60	Rename 'flags' to 'status' for struct metadata_area. In other LVM memory structures such as volume_group, the field used to store flags is called "status", and on-disk fields are called 'flags', so rename the one inside metadata_area to be consistent. Not only is it more consistent with existing code but is cleaner to say "the status of this mda is ignored". Background for this patch - prajnoha pinged me on IRC this morning about a fix he was working on related to metadataignore when metadata/dirs was set. I was reviewing my patches from this year and realized the 'flags' field was probably not the best choice when I originally did the metadataignore patches.	2010-10-05 17:34:05 +00:00
Milan Broz	b0485a996f	Restrict lvm1 partial mode. Current lvm1 allocation code seems to not properly map segments on missing PVs. For now disable this functionality. (It never worked and previous commit just introduced segfault here.) So the partial mode in lvm1 can only process missing PVs with no LV segments only. Also do not use random PV UUID for missing part but use fixed string derived from VG UUID (to not confuse clvmd tests).	2010-10-04 18:59:01 +00:00
Alasdair Kergon	ac0252ca07	Add dm_zalloc and use it and dm_pool_zalloc throughout.	2010-09-30 21:06:50 +00:00
Dave Wysochanski	0ca1492ca5	Fix copyright dates on new files lib/metadata/{lv\|vg\|pv}.[ch].	2010-09-30 20:47:18 +00:00
Peter Rajnoha	5936dd0381	Fix memory leak of vg_read while using live copies of metadata in directories.	2010-09-30 14:12:14 +00:00
Dave Wysochanski	4df3d5ad25	Add pv_get_property and create generic internal _get_property function. We need to use a similar function for pv and lv properties, so just make a generic _get_property() function that contains most of the required functionality. Also, add a check to ensure the field name matches the object passed in by re-using report_type_t enum. For pv properties, the report_type might be either PVS or LABEL. In addition, add 'const' to 'get' functions object parameter, but not 'set' functions. Add _not_implemented_set() and _not_implemented_get() functions.	2010-09-30 14:09:45 +00:00
Dave Wysochanski	f9bbf60213	Add pv 'get' functions for all pv properties. Add 'get' functions for all pv properties. Multiply by SECTOR_SIZE for pv properties pv_mda_free, pv_mda_size, pe_start, pv_size, pv_free, pv_used.	2010-09-30 14:09:33 +00:00
Dave Wysochanski	b184f791d4	Add pv_name_dup() and pv_fmt_dup() helper functions.	2010-09-30 14:09:22 +00:00
Dave Wysochanski	1cd292af8f	Add pv_mda_size, pv_mda_free, and pv_used functions, call from 'disp' functions.	2010-09-30 14:09:10 +00:00
Dave Wysochanski	1f4f0d1926	Add 'get' functions for vg fields. Add 'get' functions based on generic macros for VG, PV, and LV. Add 'get' functions for vg string fields, vg_name, vg_fmt, vg_sysid, vg_uuid, vg_attr, and vg_tags, and all numeric fields.	2010-09-30 14:08:58 +00:00
Dave Wysochanski	5d74ec6400	Make generic GET_*_PROPERTY_FN macros and define secondary macro for vg, pv, lv. Will need similar macros for VG, PV and LV, so define a generic one, and just pass in the struct name and variable name for the specific macro.	2010-09-30 14:08:46 +00:00
Dave Wysochanski	b1ef78d000	Add supporting functions vg_name_dup, vg_fmt_dup, vg_system_id_dup. Add supporting functions for vg_name, vg_fmt, vg_system_id. Append "_dup" to end of supporting functions to make clear the strings are dup'd and to avoid namespace conflict with vg_name.	2010-09-30 14:08:33 +00:00
Dave Wysochanski	c508945ca9	Add pv_tags_dup, vg_tags_dup, lv_tags_dup functions that call tags_format_and_copy.	2010-09-30 14:08:19 +00:00
Dave Wysochanski	f15033c0e1	Add tags_format_and_copy() common function and call from _tags_disp. Add a common function to allocate memory and format a string of tags. Call tags_format_and_copy() from _tags_disp().	2010-09-30 14:08:07 +00:00
Dave Wysochanski	254d672dcc	Add pv_uuid_dup, vg_uuid_dup, and lv_uuid_dup, and call id_format_and_copy. Add supporting functions for pv_uuid, vg_uuid, and lv_uuid. Call new function id_format_and_copy. Use 'const' where appropriate. Add "_dup" suffix to indicate memory is being allocated. Call {pv\|vg\|lv}_uuid_dup from lvm2app uuid functions.	2010-09-30 14:07:47 +00:00
Dave Wysochanski	f4fd41552d	Add id_format_and_copy() common function and call from _uuid_disp. Add supporting uuid function to allocate memory and call id_write_format. Call id_format_and_copy from _uuid_disp.	2010-09-30 14:07:33 +00:00
Dave Wysochanski	4bbadbe1cf	Simplify logic to create 'attr' strings. This patch addresses code review request to simplify creation of 'attr' strings. The simplification is done in this separate patch to more easily review and ensure the simplification is done without error.	2010-09-30 14:07:19 +00:00
Dave Wysochanski	14663348d0	Add {pv\|vg\|lv}_attr_dup() functions and refactor 'disp' functions. Move the creating of the 'attr' strings into a common function so they can be called from the 'disp' functions as well as the new 'get' property functions. Add "_dup" suffix to indicate memory is allocated. Refactor pvstatus_disp to take pv argument and call pv_attr_dup().	2010-09-30 13:52:55 +00:00
Dave Wysochanski	e32e2eb011	Add lib/metadata/vg.[ch] and lib/metadata/lv.[ch]. These got missed when git cvsexportcommit was used.	2010-09-30 13:16:55 +00:00
Dave Wysochanski	b88b638d6e	Add lib/metadata/pv.[ch] new files. Apparently git cvsexportcommit does not properly add new files from a git commit.	2010-09-30 13:15:42 +00:00
Dave Wysochanski	b171907fc5	Refactor metadata.[ch] into lv.[ch] for lv functions. This patch is similar to the other patches for pv and vg functionality, and separates lv functionality into separate files, concentrating on reporting fields and simple functions.	2010-09-30 13:05:45 +00:00
Dave Wysochanski	f42b708eae	Refactor metadata.[ch] into pv.[ch] for pv functions. The metadata.[ch] files are very large. This patch makes a first attempt at separating out pv functions and data, particularly related to the reporting fields calculations. More code could be moved here but for now I'm stopping at reporting functions 'get' / 'set' functions.	2010-09-30 13:05:20 +00:00
Dave Wysochanski	81f0124a58	Refactor metadata.[ch] into vg.[ch] for vg functions. The metadata.[ch] files are very large. This patch makes a first attempt at separating out vg functions and data, particularly related to the reporting fields calculations.	2010-09-30 13:04:55 +00:00
Zdenek Kabelac	c631be7684	Maps fix Read complete content of /proc/self/maps into one buffer without realocation in the middle of reading and before doing any m/unlock operation with these lines - as some of them gets change. With previous implementation we've read some mappings twice ([stack])	2010-09-30 11:32:40 +00:00
Alasdair Kergon	f6b1c45bf1	Speed up unquoting of quoted double quotes and backslashes.	2010-09-28 01:29:06 +00:00
Alasdair Kergon	8a075c6123	drop an unnecessary 'stack'	2010-09-27 19:15:13 +00:00
Alasdair Kergon	46d4a6acf8	was renamed	2010-09-27 19:10:46 +00:00
Alasdair Kergon	44a31a9c2f	Speed up CRC32 calculations by using a larger lookup table. Use -DDEBUG_CRC32 to revert to old function and check new one gives same result.	2010-09-27 19:09:34 +00:00
Peter Rajnoha	bad35c6554	Add escape sequence for ':' and '@' found in device names used as PVs.	2010-09-23 12:02:33 +00:00
Alasdair Kergon	0cb07b65f3	Replace alloca with dm_malloc in _aligned_io. (This section of code dates from 2.4 and could be written more efficiently nowadays.)	2010-09-22 22:31:45 +00:00
Milan Broz	980d2d8683	Fix handling of partial VG for lvm1 format metadata If some lvm1 device is missing, lvm fails on all operations # vgcfgbackup -f bck -P vg_test Partial mode. Incomplete volume groups will be activated read-only. 3 PV(s) found for VG vg_test: expected 4 PV segment VG free_count mismatch: 152599 != 228909 PV segment VG extent_count mismatch: 152600 != 228910 Internal error: PV segments corrupted in vg_test. Volume group "vg_test" not found Allow loading of lvm1 partial VG by allocating "new" missing PV, which covers lost space. Also this fake mising PV inform code that it is partial VG. https://bugzilla.redhat.com/show_bug.cgi?id=501390	2010-09-22 13:45:21 +00:00
Alasdair Kergon	ec8a4dac46	Fix name in msg in last checkin. (The problem the last checkin addressed was a segfault in 'pvs -a' if .cache didn't contain every PV in a VG.)	2010-09-22 01:50:38 +00:00
Alasdair Kergon	a171bb6e85	Track recursive filter iteration to avoid refreshing while in use. (2.02.56)	2010-09-22 01:36:13 +00:00
Peter Rajnoha	064ed484b4	"goto_bad" should be used in alloc_printed_tags function, not "goto bad".	2010-09-21 10:42:02 +00:00
Peter Rajnoha	70431c8146	Revert to old glibc behaviour for vsnprintf used in emit_to_buffer function. Revert to old glibc behaviour for vsnprintf used in emit_to_buffer fn. Otherwise, the check that follows would be wrong for new glibc versions. This caused the rh bug #633033 to be undetected and pass throught the check, corrupting the metadata!	2010-09-20 14:25:27 +00:00
Peter Rajnoha	48ae64529a	Use dynamic allocation for metadata's tag buffer (removes 4096 char. limit).	2010-09-20 14:23:20 +00:00
Dave Wysochanski	97709450ca	Update vg_mda_free 'get' function to multiply by SECTOR_SIZE.	2010-09-09 19:38:03 +00:00
Peter Rajnoha	d20ce59b80	Add random suffix to archive file names to prevent races when being created. In certain configurations, we're not under a VG rw lock while trying to write a new archive file with VG metadata. A common example is using "vgs" while having the content of backup and archive directories empty. The code scans the content of these directories and tries to determine the final index that should be used in archive name. Since we're not under a lock, we can get into a race while choosing the index which could end up showing errors about not being able to rename to final archive name. Let's add random number suffix to these archive file names so we can avoid the race.	2010-09-09 13:13:12 +00:00
Peter Rajnoha	dc8478458e	Reinitialize archive and backup handling on toolcontext refresh. For example, when using '--config "backup { ... }"' line, the values from lvm.conf (or default values) should be overridden. This patch adds reinitialisation of archive and backup handling on toolcontext refresh which makes these settings to be applied.	2010-09-09 13:07:13 +00:00
Jonathan Earl Brassow	a71d6051ed	This patch fixes a potential for I/O to hang and LVM commands to block when a mirror under a snapshot suffers a failure. The problem has to do with label scanning. When a mirror suffers a failure, the kernel blocks I/O to prevent corruption. When LVM attempts to repair the mirror, it scans the devices on the system for LVM labels. While mirrors are skipped during this scanning process, snapshot-origins are not. When the origin is scanned, it kicks up I/O to the mirror (which is blocked) underneath - causing the label scan (an thus the repair operation) to hang. This patch simply bypasses snapshot-origin devices when doing labels scans (while ignore_suspended_devices() is set). This fixes the issue.	2010-08-26 14:21:50 +00:00
Milan Broz	fc86426b56	Fix previous const removal.	2010-08-26 12:22:05 +00:00
Milan Broz	c7af31dbd7	Fix return type qualifier to avoid compiler warning. introduced in commit `b16b4d92a7` "Improve various log messages." fixes a lot of ../include/metadata.h:148: warning: type qualifiers ignored on function return type	2010-08-26 12:08:19 +00:00
Alasdair Kergon	4e19541b8d	autoreconf also updates configure.h.in	2010-08-21 00:16:37 +00:00
Mike Snitzer	7063efe1bd	Switch to using configure --with-default-data-alignment=<NUM> to establish DEFAULT_DATA_ALIGNMENT. Again, 0=64KiB, 1=1MiB, 2=2MiB Default is 1.	2010-08-20 22:24:58 +00:00
Mike Snitzer	4efb1d9cbb	Update heuristic used for default and detected data alignment. Add "devices/default_data_alignment" to lvm.conf to control the internal default that LVM2 uses: 0==64k, 1==1MB, 2==2MB, etc. If --dataalignment (or lvm.conf's "devices/data_alignment") is specified then it is always used to align the start of the data area. This means the md_chunk_alignment and data_alignment_detection are disabled if set. (Same now applies to pvcreate --dataalignmentoffset, the specified value will be used instead of the result from data_alignment_offset_detection) set_pe_align() still looks to use the determined default alignment (based on lvm.conf's default_data_alignment) if the default is a multiple of the MD or topology detected values.	2010-08-20 20:59:05 +00:00
Dave Wysochanski	614469b544	Define GET_NUM_PROPERTY_FN macro to simplify numeric property 'get' functions.	2010-08-20 13:02:39 +00:00
Dave Wysochanski	cc171eb8ee	Add implmentation for simple numeric 'get' property functions. Add 'get' functions based on the simple macro function definition for a numeric property. Add 'get' functions for the following: _vg_extent_count_get, _vg_free_count_get, _max_lv_get, _max_pv_get, _pv_count_get, _lv_count_get, _snap_count_get, _vg_seqno_get, _vg_size_get, _vg_free_get, vg_mda_*. For size functions, multiply by SECTOR_SIZE to return the value in bytes.	2010-08-20 12:45:09 +00:00
Dave Wysochanski	1af822bff0	Define GET_NUM_PROPERTY_FN macro to simplify numeric property 'get' functions.	2010-08-20 12:44:58 +00:00
Dave Wysochanski	fc65b9038e	Add properties.[ch] to lib/report, defined based on columns.h. Extend the existing reporting infrastructure definitions and structures to include a 'get' and 'set' function for each field. We will provide a 'get' and 'set' function for each of these fields, which will be utilized by exported lvm2app functions. Define a default _not_implemented 'get' and 'set' function that just sets an errno and returns 0. Future patches will actually implement the specific 'get' and 'set' functions for each property. For read-only properties, only the 'get' function will be implemented. Define vg_get_property() function to query a property. We will call this from a lvm2app function.	2010-08-20 12:44:47 +00:00
Dave Wysochanski	7bdc15c8bb	Remove explicit double quotes from columns.h 'id' entries. The 'id' entries in columns.h are the report field names. Since these are unique, we'd like to use them in generation of 'get' / 'set' functions. As a step towards using them for this purpose, remove the explicit double quotes and use the macro '#' character to add the double quotes back when placing them into the '_fields' array 'id' member.	2010-08-20 12:44:17 +00:00
Dave Wysochanski	d5722ebb21	Add 'flags' field to columns.h and define FIELD_MODIFIABLE. Add a 'flags' field to columns.h, and set it to 0 by default. Define FIELD_MODIFIABLE flag to indicate whether a 'set' function exists to change the field's value.	2010-08-20 12:44:03 +00:00
Dave Wysochanski	69d67dc2ca	Add vg_mda_size and vg_mda_free functions. Add supporting functions to get vg_mda_size and vg_mda_free fields. Should be no functional change.	2010-08-20 12:43:49 +00:00
Milan Broz	586b56b18c	Fix wrong use of LCK_WRITE In all top vg read functions only LCK_VG_READ/WRITE can be used. All other vg lock definitions are low-level backend machinery. Moreover, LCK_WRITE cannot be tested through bitmask. This patch fixes these mistakes. For _recover_vg() we do not need lock_flags, it can be only two of above and we always upgrading to LCK_VG_WRITE lock there. (N.B. that code is racy) There is no functional change in code (despite wrong masking it produces correct bits:-)	2010-08-19 23:26:31 +00:00
Milan Broz	727f7bfa49	Detect LUKS signature in pvcreate One shiny day we should use libblkid here. But now using LUKS is very common together with LVM and pvcreate destroys LUKS completely. So for user's convenience, try to detect LUKS signature and allow abort.	2010-08-19 23:08:18 +00:00
Milan Broz	c37a14506a	Fix file descriptor leak in swap signature detection	2010-08-19 23:05:45 +00:00
Milan Broz	2d5e2b52ca	Change the pvcreate swap/md logic pvcreate detects MD and swap signature. The logic hidden there is not only documented but it is also user unfriendly. Who invented this logic should run pvcreate on its own critical MD device to see why;-) This patch - creates one function instead of duplication code - asks if user want to overwrite signature - allows aborting (!) (Please note that writing LVM signatute without wiping old is wrong, it confuses blkid, MD will not work anyway and swap and LUKS is broken too.)	2010-08-19 23:03:34 +00:00
Alasdair Kergon	22149572e8	Use 'SINGLENODE' instead of 'dead' in clvmd singlenode messages. Ignore snapshots when performing mirror recovery beneath an origin. Pass LCK_ORIGIN_ONLY flag around cluster. Add suspend_lv_origin and resume_lv_origin using LCK_ORIGIN_ONLY.	2010-08-17 19:25:05 +00:00
Alasdair Kergon	2d6fcbf67d	Allow internal suspend and resume of origin without its snapshots.	2010-08-17 16:25:32 +00:00
Alasdair Kergon	85ed403002	Fix dev_manager_transient to access -real device not snapshot-origin. (brassow) Another reminder why cloning functions impedes maintenance.	2010-08-17 01:51:12 +00:00
Alasdair Kergon	f92b4f9482	Monitor origin -real device below snapshot instead of overlay device. (brassow)	2010-08-17 01:16:41 +00:00
Alasdair Kergon	85a80e0505	Don't really change monitoring status when in test mode.	2010-08-16 23:29:09 +00:00
Alasdair Kergon	d1e8046f56	Various small cleanups and fixes related to monitoring.	2010-08-16 22:54:35 +00:00
Jonathan Earl Brassow	d0191bf9f4	Fix for bug 612291: dm devices of split off mirror images are not removed DM devices were not handled properly on nodes in a cluster that were not where the splitmirrors command was issued. This was happening because suspend_lv/resume_lv were being used in a place where activate_lv should have been used. When the suspend/resume are issued on (effectively) new LVs, their 'resource' (UUID) is not located in the lv_hash. Thus, both operations turn into no-ops. You can see this from the output of clvmd from one of the remote nodes: <snip> do_suspend_lv, lock not already held <snip> do_resume_lv, lock not already held 'activate_lv' enjoins the other nodes in the cluster to process the lock and activate the new LV. clvmd output from remote node as follows: do_lock_lv: resource 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S', cmd = 0x19 LCK_LV_ACTIVATE (READ\|LV\|NONBLOCK), flags = 0x84 (DMEVENTD_MONITOR ), memlock = 1 sync_lock: 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S' mode:1 flags=1 sync_lock: returning lkid 27b0001 Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-08-16 18:02:14 +00:00
Mike Snitzer	b123a82d73	Change default alignment of pe_start to 1MB. The new standard in the storage industry is to default alignment of data areas to 1MB. fdisk, parted, and mdadm have all been updated to this default. Update LVM to align the PV's data area start (pe_start) to 1MB. This provides a more useful default than the previous default of 64K (which generally ended up being a 192K pe_start once the first metadata area was created). Before this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 188.00k 192.00k After this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 1020.00k 1.00m The heuristic for setting the default alignment for LVM data areas is: - If the default value (1MB) is a multiple of the detected alignment then just use the default. - Otherwise, use the detected value. In practice this means we'll almost always use 1MB -- that is unless: - the alignment was explicitly specified with --dataalignment - or MD's full stripe width, or the {minimum,optimal}_io_size exceeds 1MB - or the specified/detected value is not a power-of-2	2010-08-12 04:11:48 +00:00
Mike Snitzer	dff224669d	Require --restorefile when using pvcreate --uuid. Introduce --norestorefile to allow user to override the new requirement. This can also be overridden with "devices/require_restorefile_with_uuid" in lvm.conf -- however the default is 1. Signed-off-by: Mike Snitzer <snitzer@redhat.com>	2010-08-12 04:08:59 +00:00
Peter Rajnoha	626242c1bd	Recognise and give preference to md device partitions (blkext major). We can already detect MD devices internally. But when using MD partitions, these have "block extended major" (blkext) assigned (259). Blkext major is also used in general, so we need to check whether the original device is an MD device actually.	2010-08-11 12:14:23 +00:00
Petr Rockai	f3ad0dcfde	Never scan internal LVM devices.	2010-08-09 14:05:16 +00:00
Jonathan Earl Brassow	8d2d4f1fa0	Fix for bug 619221 - log device splitting regression An incorrect fix on July 13, 2010 for an annoyance has caused a regression. The offending check-in was part of the 2.02.71 release of LVM. That check-in caused any PVs specified on the command line to be ignored when performing a mirror split. This patch reverses the aforementioned check-in (solving the regressions) and posits a new solution to the list reversal problem. The original problem was that we would always take the lowest mimage LVs from a mirror when performing a split, but what we really want is to take the highest mimage LVs. This patch accomplishes that by working through the list in reverse order - choosing the higher numbered mimages first. (This also reduces the amount of processing necessary.) Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Takahiro Yasui <takahiro.yasui@hds.com>	2010-08-06 15:38:32 +00:00
Petr Rockai	851aaf4ecc	Reduce severity of the "mirror transient status" log message (this was never intended to be a log_error).	2010-08-04 15:55:03 +00:00
Mike Snitzer	14a9722185	Avoid changing aligned pe_start as a side-effect of very verbose logging.	2010-08-03 18:19:42 +00:00
Peter Rajnoha	97df4e4675	Use built-in rules for device aliases: block/ < dm- < disk/ < mapper/ < other.	2010-08-03 13:39:27 +00:00
Zdenek Kabelac	3eadbbeb12	Fix const warning in dev_manager_info() and _dev_manager_lv_rmnodes().	2010-08-03 13:13:01 +00:00
Zdenek Kabelac	c10f7fd039	Fix constness warning in archive_file structure from archive.c.	2010-08-03 13:09:21 +00:00
Zdenek Kabelac	9f926fd060	Use void parameter for function definition.	2010-08-03 13:06:35 +00:00
Jonathan Earl Brassow	cbd41292a4	Taka's fix for handling failure of all mirrored log devices and all but one mirror leg. <patch header> To handle a double failure of a mirrored log, Jon's two patches are commited, however, lvconvert command can't still handle an error when mirror leg and mirrored log got failure at the same time. [Patch]: Handle both devices of a mirrored log failing (bug 607347) posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00009.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00027.html [Patch]: Handle both devices of a mirrored log failing (bug 607347) - additional fix posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00093.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00101.html In the second patch, the target type of mirrored log is replaced with error target when remove_log is set to 1, but this procedure should be also used in other cases such as the number of mirror leg is 1. This patch relocates the procedure to the main path. In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). - With Jon's second patch, we think that we don't have to call remove_mirror_log() in _lv_update_mirrored_log() because will be handled remove_mirror_images() in _lvconvert_mirrors_repaire(). </patch header> Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Reviewed-by: Petr Rockai <prockai@redhat.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-08-02 21:07:40 +00:00
Jonathan Earl Brassow	efaaf3146d	Disallow mirrored logs in cluster mirrors. The cluster log daemon (cmirrord) is not multi-threaded and can handle only one request at a time. When a log is stacked on top of a mirror (which itself contains a 'core' log), it creates a situation that cannot be solved without threading. When the top level mirror issues a "resume", the log daemon attempts to read from the log device to retrieve the log state. However, the log is a mirror which, before issuing the read, attempts to determine the 'sync' status of the region of the mirror which is to be read. This sync status request cannot be completed by the daemon because it is blocked on a read I/O to the very mirror requesting the sync status.	2010-08-02 19:03:45 +00:00
Dave Wysochanski	936541ec56	Remove irrelevant comments relating to vg_mda_copies.	2010-07-30 16:47:27 +00:00
Alasdair Kergon	8bae0a1ecf	Change clvmd to communicate with lvm via a socket in /var/run/lvm. (mbroz) https://bugzilla.redhat.com/show_bug.cgi?id=614248 [CVE-2010-2526]	2010-07-28 13:55:42 +00:00
Dave Wysochanski	81bf06ea38	Clarify help text for vg_mda_count.	2010-07-21 19:44:25 +00:00
Jonathan Earl Brassow	9baacefc77	Building without the '--enable-cmirrord' option means that CMIRRORD_PIDFILE is not defined. This makes the build fail. Therefore, we need to conditionalize the check for cmirrord based on if CMIRRORD_PIDFILE is defined.	2010-07-21 15:21:24 +00:00
Jonathan Earl Brassow	405c4a45d8	It's not enough to check for the kernel module in the case of cluster mirrors, we must also check that the log daemon (cmirrord) is running. The log module can be auto-loaded, but the daemon cannot be "auto-started". Failing to check for the daemon produces cryptic messages that customers have a hard time deciphering. (The system messages do report that the log daemon is not running, but people don't seem to find this message easily.) Here are examples of what is printed when the module is available, but the log daemon has not been started. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg Shared cluster mirrors are not available. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg -v Setting logging type to disk Finding volume group "vg" Archiving volume group "vg" metadata (seqno 3). Creating logical volume lv Executing: /sbin/modprobe dm-log-userspace Cluster mirror log daemon is not running Shared cluster mirrors are not available. Creating volume group backup "/etc/lvm/backup/vg" (seqno 4).	2010-07-21 13:40:21 +00:00
Jonathan Earl Brassow	60f425d1b3	Fix for bug 614164: No check for existing name when splitting mirror The user could use the same name as an existing LV when specifying a name for an LV split off from a mirror. This causes all sorts of issues.	2010-07-13 22:24:39 +00:00
Jonathan Earl Brassow	c42b084793	Fix for bugs: 612248 & 612291 Split mirror issues The main problem with these bugs was that the newly split off LV was not being suspended properly. This meant that the memlock count was not being balanced, the DM devices were not being renamed, and some DM devices which should have been removed were not. I've also renamed some of the variables and added comments to make things clearer as to what is going on. (I can break this patch in two if it means easier review.)	2010-07-13 21:48:16 +00:00
Fabio M. Di Nitto	8c4e8a185a	Add dm_create_lockfile to libdm to handle pidfiles for all daemons. Switch dmeventd to use dm_create_lockfile and drop duplicate code. Allow clvmd pidfile to be configurable. Switch cmirrord and clvmd to use dm_create_lockfile.	2010-07-13 13:51:01 +00:00
Peter Rajnoha	3122f963b0	Addendum for previous patch - show VG/LV name everywhere so the messages are consistent.	2010-07-12 12:38:35 +00:00
Peter Rajnoha	fefa43235f	Add more verbose messages while checking volume_list and hosttags settings. This should bring less confusion when there are some settings left and people just forgot about it and then they run into problems. These messages should give them a hint of what's really going on.	2010-07-12 11:37:49 +00:00
Jonathan Earl Brassow	a93fb6299f	Failed to test for the case where a log was requested to be removed even though there was no log. A simple run through the in-tree test suite would have caught this. :( - if (lv_is_mirrored(detached_log_lv) && + if (detached_log_lv && lv_is_mirrored(detached_log_lv) && Also, made some cosmetic changes suggested by kabi after my last check-in (e.g. s/return 0/return_0/ and adding an error message).	2010-07-09 17:57:51 +00:00
Dave Wysochanski	f77fb62b2a	Add log_error when strdup fails in {vg\|lv}_change_tag(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-09 16:57:44 +00:00
Alasdair Kergon	08f1ddea6c	Use __attribute__ consistently throughout.	2010-07-09 15:34:40 +00:00
Alasdair Kergon	80e569104b	Remove superfluous fn prototypes.	2010-07-09 15:21:10 +00:00
Jonathan Earl Brassow	aa5734f2a3	Finish fix for bug 607347: failing both redundant mirror log legs... A previous check-in added logic to handle the case where both images of a mirrored log failed. It solved the problem by simply removing the log entirely - leaving the parent mirror with a 'core' log. This worked for most cases. However, if there was a small delay between the failures of the two mirrored log devices, the mirror would hang, LVM would hang, and no additional LVM commands could be issued. When the first leg of the log fails, it signals the need for repair. Before 'lvconvert --repair' is run by dmeventd, the second leg fails. 'lvconvert' would see both devices as failed and try to remove the log entirely. When it came time to suspend the parent mirror to update the configuration, the suspend would hang because it couldn't get any I/O through the mirrored log, which was plugged waiting for corrective action. The solution is to replace the log with an error target to clear any pending writes before removing it. This allows the parent mirror to suspend and make the proper changes.	2010-07-09 15:08:12 +00:00
Dave Wysochanski	a5fb2bbff3	Pass metadataignore to pv_create, pv_setup, _mda_setup, and add_mda. Pass metadataignore through PV creation / setup paths. As a result of this cleanup, we can remove the unnecessary setting of mda_ignore bits inside pvcreate_single(), after call to pv_create. For now, just set metadataignore to '0' in some places. This is equivalent to the prior functionality, although the 0 is given by the caller not hardcoded in _mda_setup() call. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-08 18:24:29 +00:00
Dave Wysochanski	dce204cec5	Init mda->list in mda_copy. This patch should be no functional change as all callers initialize mda->list.	2010-07-08 17:41:46 +00:00
Zdenek Kabelac	764eb41086	Fix format string from patch apply mistake	2010-07-08 14:47:46 +00:00
Zdenek Kabelac	37036b0215	Small update of memlock debug messages. Gives slightly better alligned lines for reading.	2010-07-08 13:05:27 +00:00
Zdenek Kabelac	4ec2ae8632	Do not log backtrace in valid _lv_resume() code path	2010-07-08 12:24:04 +00:00
Dave Wysochanski	7041b476ac	Add warning to vgextend and pvchange if metadataignore given on cmdline. Warn the user then change the value of vg_mda_copies. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-07 18:59:45 +00:00
Alasdair Kergon	7f7af46862	Adjust auto-metadata repair and caching logic to try to cope with empty mdas. - If a PV contained empty mdas, the auto-recovery code was not kicking in. - The 'inconsistent' state was getting lost when metadata was cached so recovery didn't kick in. But leave the behaviour alone when using precommitted metadata because of a warning in a confusing FIXME. In my testing, pvs and vgs didn't repair inconsistent metadata like they used to do. (How many other tools fail similarly now?) And there should be no need to cache inconsistent metadata because it is supposed to get repaired under the protection of a write lock immediately it is discovered. This code is in need of a redesign based on first principles. I still see bugs in this code and this commit is risky.	2010-07-07 02:53:16 +00:00
Alasdair Kergon	6c8655ce9b	fix code in 2nd mda unignore loop to match 1st loop	2010-07-06 20:09:38 +00:00
Alasdair Kergon	68f4e0c734	s/flags/mda/	2010-07-06 17:29:50 +00:00
Alasdair Kergon	0db1bbc3c3	shorten mesg	2010-07-06 17:27:32 +00:00
Alasdair Kergon	643f234119	fix jumbled args in 'Adjusting' message	2010-07-06 17:26:08 +00:00
Alasdair Kergon	d911ec67a9	Randomly select which mdas to use or ignore. Add some missing standard configure.in checks.	2010-07-05 22:23:15 +00:00
Alasdair Kergon	db3c1ac1c8	Add printf format attributes to yes_no_prompt & dm_{sn,as}printf and fix a calle	2010-07-02 21:16:50 +00:00
Alasdair Kergon	d0709eed62	remove unneeded header	2010-07-02 10:25:16 +00:00
Alasdair Kergon	9b95a5a939	Always pass unsuspended dm devices through persistent filter to other filters. Move test for suspended dm devices ahead of other filters.	2010-07-02 02:09:57 +00:00
Alasdair Kergon	12eadbabdd	improve vgmetadatacopies unmanaged message	2010-06-30 20:03:52 +00:00
Dave Wysochanski	3b9d1b1a96	Check for missing_pv in vg_remove loop. If a pv is missing, we should just skip it rather than checking the device size and failing the vgremove.	2010-06-30 19:55:43 +00:00
Alasdair Kergon	d8886386bd	more mda ignore cleanups	2010-06-30 19:28:35 +00:00
Dave Wysochanski	40b4d1c3ae	Refactor vg_remove_check to place pv removal into separate function.	2010-06-30 18:03:52 +00:00
Alasdair Kergon	23177eda88	more metadataignore message/code cleanup	2010-06-30 17:13:05 +00:00
Alasdair Kergon	efe75fd705	revert that	2010-06-30 14:54:29 +00:00
Alasdair Kergon	a6c4427188	suppress useless compiler warning	2010-06-30 14:52:29 +00:00
Dave Wysochanski	ef7b409966	Only attempt to guarantee 1 mda ignored if there's at least one mda in the vg.	2010-06-30 14:48:07 +00:00
Alasdair Kergon	67b91d0848	Only attempt to guarantee 1 mda ignored if there's at least one mda in the vg.	2010-06-30 14:27:40 +00:00
Alasdair Kergon	647c64c796	Improve various log messages.	2010-06-30 13:51:11 +00:00
Dave Wysochanski	7985f80c63	Add pvmetadatacopies to lvm.conf and pvcreate man pages.	2010-06-30 12:49:28 +00:00
Dave Wysochanski	a5bf70018b	Add --metadataignore to pvcreate. Allow metadataignore flag to be passed in to pvcreate. Ideally, more refactoring of the mda allocation / initialization is warranted, but for now, we just add another parameter to 'add_mda' to take an existing mda ignored flag. We need to do this or pv_write loses the state of the mda 'ignored' flag before copying and writing to disk.	2010-06-30 12:17:24 +00:00
Dave Wysochanski	6af5155529	Improve logging for setting --vgmetadatacopies. Example of logging: metadata/metadata.c:1127 Setting mda_copies = 3 on vg vgtest metadata/pv_manip.c:296 /dev/loop2 0: 0 25: NULL(0:0) metadata/pv_manip.c:296 /dev/loop3 0: 0 25: NULL(0:0) metadata/pv_manip.c:296 /dev/loop4 0: 0 25: NULL(0:0) metadata/metadata.c:1072 Adjusting ignored mdas on vg vgtest, vg_mda_used_count=5, vg_mda_copies=3 metadata/metadata.c:1015 Setting ignore flag for 2 mdas on vg vgtest metadata/metadata.c:4151 Setting mda ignored flag for metadata_locn /dev/loop2. metadata/metadata.c:4151 Setting mda ignored flag for metadata_locn /dev/loop3.	2010-06-29 22:41:28 +00:00
Dave Wysochanski	d37dd5b2d3	Improve logging for metadata ignore by printing device name. Print device name when setting or clearing metadata ignore bit. Example: label/label.c:160 /dev/loop2: lvm2 label detected cache/lvmcache.c:1136 lvmcache: /dev/loop2: now in VG #orphans_lvm2 (#orphans_lvm2) metadata/metadata.c:4142 Setting mda ignored flag for metadata_locn /dev/loop2. format_text/text_label.c:318 Skipping mda with ignored flag on device /dev/loop2 at offset 4096	2010-06-29 22:37:32 +00:00
Dave Wysochanski	710c9373bf	Add some log_verbose debug statements related to metadataignore. Logging isn't ideal, especially for mda_set_ignore. Ideally we'd like to display the device name and offset in this case but this requires a bit more work and a per-format 'mda_description' function pointer definition (we don't have access to mda_context in metadata.c).	2010-06-29 22:25:58 +00:00
Dave Wysochanski	a375ced300	Move code into pv_change_metadataignore library function. In preparation to call this from both pvcreate as well as pvchange, move the guts of metadataignore into a library function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-29 21:32:44 +00:00
Dave Wysochanski	559aee44ab	Add error message if backup_to_file fails because of empty in_use mdas list.	2010-06-29 15:03:59 +00:00
Dave Wysochanski	5778fdeeb8	Add more initializations of 'mda->flags' field. Mda allocation needs refactored into a single function but as an interim step, ensure mda->flags is initialized properly.	2010-06-29 14:52:56 +00:00
Dave Wysochanski	fa832e3a55	Attempt to fix intermittent failure with non-debug configured vgcfgbackup. There's an intermittent failure with vgcfgbackup that seems to have been introduced with the metadataignore / vgmetadatacopies patchset. Intermittent failures are often the result of uninitialized data, so this patch calls zalloc in a few places it might matter.	2010-06-29 13:29:53 +00:00
Dave Wysochanski	a9d8bf269a	Allow 'all' and 'unmanaged' values for --vgmetadatacopies. Allowing an 'all' and 'unmanaged' value is more intuitive, and provides a simple way for users to get back to original LVM behavior of metadata written to all PVs in the volume group. If the user requests "--vgmetadatacopies unmanaged", this instructs LVM not to manage the ignore bits to achieve a specific number of metadata copies in the volume group. The user is free to use "pvchange --metadataignore" to control the mdas on a per-PV basis. If the user requests "--vgmetadatacopies all", this instructs LVM to do 2 things: 1) clear all ignore bits, and 2) set the "unmanaged" policy going forward. Internally, we use the special MAX_UINT32 value to indicate 'all'. This 'just' works since it's the largest value possible for the field and so all 'ignore' bits on all mdas in the VG will get cleared inside _vg_metadata_balance(). However, after we've called the _vg_metadata_balance function, we check for the special 'all' value, and if set, we write the "unmanaged" value into the metadata. As such, the 'all' value is never written to disk. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:40:01 +00:00
Dave Wysochanski	a09a8efb66	Update check in vg_split_mdas to account for ignored mdas list. The check in vg_split_mdas will trigger an error if the 'from' vg list is empty. However, this might be ok in some instances now that we have ignored mdas. Relax this check so an error is triggered only in the case where there's truly no more mdas in the 'from' vg. One example of where this makes a difference is with vgreduce. If we try to vgreduce a PV with un-ignored mdas, this should trigger the balancing function to un-ignore mdas on another PV in the VG. However, we don't get to vg_write() before we fail because this list size check fails, and we see an error message indicating: "Cannot remove final metadata area ..." Another example is with vgsplit into a new VG, where the PVs being moved contain all ignored mdas. We must move the mdas on fid->metadata_areas_ignored from 'vg_from' to 'vg_to'. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:38:56 +00:00
Dave Wysochanski	f61cd7b249	Ensure fid mda lists are populated correctly during vgextend. The vgextend path calls add_pv_to_vg(). Inside add_pv_to_vg(), we must ensure we pass the correct mdas list into pv_setup(), as copies of mdas are placed on the vg->fid list. If we don't place the mdas on the correct vg->fid list, the various counts may be incorrect and the metadata balance algorithm will not work when called from vg_write() path. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:38:39 +00:00
Dave Wysochanski	1b54343328	Implement _vg_adjust_ignored_mdas and call from vg_write() path. Compare the value of the newly added vg_mda_copies field (--vgmetadatacopies parameter) with the current count of in-use mdas and ignoring or unignoring mdas as necessary to get to the target count. Also, as a safety check before returning, ensure we have at least one mda enabled. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:54 +00:00
Dave Wysochanski	3534fb40df	Add vg_mda_copies display field to 'vgs' command. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:23 +00:00
Dave Wysochanski	7042e06a2a	Make vg->mda_copies persistent in on disk vg metadata. This patch adds the ability to read/write the vg->mda_copies values from/to the vg metadata. If we read the VG metadata and this field does not exist, we set mda_copies to the default value of 0. Later in the code, we use this special '0' value to indicate a disable of metadata balancing. This should preserve existing LVM behavior and ensure metadata balancing can be turned off should the need arise. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:37:10 +00:00
Dave Wysochanski	821f0cc5ea	Add vg get/set methods for VG metadata copies. This patch adds the get and partially implemented set function. The 'set' function should probably ignore or un-ignore metadata areas based on new values. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:36:56 +00:00
Dave Wysochanski	88d7dc1af8	Add mda_copies to VG structures and initialization. Add a field to struct volume_group to later implement metadata balancing: - mda_copies: target # of non-ignored mdas in the VG; default 0 (do not control pv 'ignore mdas' bit. This patch just adds the parameter to the structures with the default values but does not modify any commands. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:36:37 +00:00
Dave Wysochanski	0f2f8a5c3a	Before committing each mda, arrange mdas so ignored mdas get committed first. Arrange mdas so mdas that are to be ignored come first. This is an optimization that ensures consistency on disk for the longest period of time. This was noted by agk in review of the v4 patchset of pvchange-based mda balance. Note the following example for an explanation of the background: Assume the initial state on disk is as follows: PV0 (v1, non-ignored) PV1 (v1, non-ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) If we did not sort the list, we would have a commit sequence something like this: PV0 (v2, non-ignored) PV1 (v2, ignored) PV2 (v2, ignored) PV3 (v2, non-ignored) After the commit of PV0's mdas, we'd have an on-disk state like this: PV0 (v2, non-ignored) PV1 (v1, non-ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) This is an inconsistent state of the disk. If the machine fails, the next time it was brought back up, the auto-correct mechanism in vg_read would update the metadata on PV1-PV3. However, if possible we try to avoid inconsistent on-disk states. Clearly, because we did not sort, we have a greater chance of on-disk inconsistency - from the time the commit of PV0 is complete until the time PV3 is complete. We could improve the amount of time the on-disk state is consistent by simply sorting the commit order as follows: PV1 (v2, ignored) PV2 (v2, ignored) PV0 (v2, non-ignored) PV3 (v2, non-ignored) Thus, after the first PV is committed (in this case PV1), on-disk we would have: PV0 (v1, non-ignored) PV1 (v2, ignored) PV2 (v1, non-ignored) PV3 (v1, non-ignored) This is clearly a consistent state. PV1 will be read but the mda will be ignored. All other PVs contain v1 metadata, and no auto-correct will be required. In fact, if we commit all PVs with ignored mdas first, we'll only have an inconsistent state when we start writing non-ignored PVs, and thus the chances we'll get an inconsistent state on disk is much less with the sorted method. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:49 +00:00
Dave Wysochanski	77e0ed4be7	Refactor vg_commit() to add _vg_commit_mdas(). Factor out calling mda->ops->vg_commit() for each mda. No functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:33 +00:00
Dave Wysochanski	69d1732334	Update _vg_read and _text_create_text_instance to use fid_add_mda[s]. When we are constructing the vg, we may need to adjust the list of metadata_areas if there are ignored mdas. At label read time, we do not read the metadata of ignored mdas, and as a result, they do not get placed on vg->fid->metadata_areas inside _text_create_text_instance since lvmcache does not have these areas attached to vginfo->infos. However, when we're checking the pvids inside _vg_read, after having read another metadata area from another PV, we do have the opportunity to update the metadata_area and metadata_areas_ignored lists based on the read metadata_area. We need accurate mda lists for the reporting functions that count the ignored mdas, as well as general correctness of mda balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:35:17 +00:00
Dave Wysochanski	bb723d7897	Use mdas_empty_or_ignored() in place of checks for empty mda list. With the addition of ignored mdas, we replace all checks for an empty mda list with a new function to look for either an empty mda list or ignored mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:58 +00:00
Dave Wysochanski	f9c307cd07	Add mdas_empty_or_ignored() helper function. Add a helper function to consolidate checking for an empty mdas list or ignored mdas. Ignored mdas should behave almost identically to an empty mda list - the metadata areas should not be read or written to. This function will make it easier to implement metadata balancing and easier to track pvs with an empty mda list or ignored mdas. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:40 +00:00
Dave Wysochanski	e6bd367b57	Implement ignore of mda if bit set by skipping r/w of metadata. We implement ignore of an mda at label_read time by checking for the ignore bit, and then skipping the reading of the vgname and other information in the metadata. This will have an effect similar to a PV found with no mdas. Thus, it will look like an orphan in the cache until we scan the rest of the system and find a PV with metadata, and the mda will not be on the vg->fid->metadata_areas list so no read/writes will be done to the metadata area. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:34:24 +00:00
Dave Wysochanski	cdbe475fe3	Define new functions and vgs/pvs fields related to mda ignore. Define a new pvs field, pv_mda_used_count, and a new vgs field, vg_mda_used_count to match the existing pv_mda_count and vg_mda_count. These new fields count the number of mdas that have the 'ignored' bit clear (they are in use on the PV / VG). Also define various supporting functions to implement the counting as well as setting the ignored flag and determining if an mda is ignored. These high level functions call into the lower level location independent mda ignore functions defined by earlier patches. Note that counting ignored mdas in a vg requires traversing both lists and checking for the ignored bit on the mda. The count of 'ignored' mdas then is defined by having the bit set, not by which list the mda is on. The list does determine whether LVM actually does read/write to the mda, though we must count the bits in order to return accurate numbers for the various counts. Also, pv_mda_set_ignored must search both vg lists for ignored mda. If the state changes and needs to be committed to disk, the ignored mda will be on the non-ignored list. Note also in pv_mda_set_ignored(), we must properly manage the mda lists. If we change the ignored state of an mda, we must change any mdas on vg->fid->metadata_areas that correspond to this pv. Also, we may need to allocate a copy of the mda, as is done when fid->metadata_areas is populated from _vg_read(), if we are un-ignoring an ignored mda. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:44 +00:00
Dave Wysochanski	9ccac021a7	Add metadata_areas_ignored list and functions to manage ignored mdas. Add a second mda list, metadata_areas_ignored to fid, and a couple functions, fid_add_mda() and fid_add_mdas() to help manage the lists. These functions are needed to properly count the ignored mdas and manage the lists attached to the 'fid' and ultimately the 'vg'. Ensure metadata_areas_ignored is initialized in other formats, even if the list is never used. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:33:22 +00:00
Dave Wysochanski	f55a20eb36	Rename fid->metadata_areas to fid->metadata_areas_in_use. Rename the metadata_areas list to an 'in_use' list to prepare for future 'ignored' list.	2010-06-28 20:32:44 +00:00
Dave Wysochanski	6b596f685f	Use vg_mda_count() in vgdisplay. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:32:21 +00:00
Dave Wysochanski	ef4fa155a5	Add mda location specific mda_copy constructor. Because of the way mdas are handled internally, where a PV in a VG has mdas on both info->mdas and vg->fid->metadata_areas list, we need a location independent copy constructor for struct metadata_area. Break up the existing format-text specific copy constructor into a format independent piece and a format dependent piece. This function is necessary to properly implement pv_set_mda_ignored(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:59 +00:00
Dave Wysochanski	29f24d4634	Add mda_locns_match() internal library function for mapping pv/device to VG mda. A metadata_area is defined independent of the location. One downside is that there is no obvious mapping from a pv to an mda. For a PV in a VG, we need a way to start with a PV and end up with an MDA, if we are to manage mdas starting with a device/pv. This function provides us a way to go down the list of PVs on a VG, and identify which ones match a particular PV. I'm not entirely happy with this approach, but it does fit into the existing structures in a reasonable way. An alternative solution might be to refactor the VG - PV interface such that mdas are a list tied to a PV. However, this seemed a bit tricky since a PV does not come into existence until after the list of mdas is constructed (see _vg_read() - we create a 'fid' and attach mdas to it, then we go through them and attach pvs). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:31:38 +00:00
Dave Wysochanski	a6b36a5901	Ensure in-memory state matches on-disk state of mda ignore bit. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:18 +00:00
Dave Wysochanski	09e0f43ba0	Allow raw_read_mda_header to be called from text_label.c. We'd like to pass in mda_header to vgname_from_mda(). In order to do this, we need to call raw_read_mda_header() from text_label.c, _text_read(), which gets called from the label_read() path, and peers into the metadata and update vginfo cache. We should check the disable bit here, and if set, not peer into the vg metadata, thus reducing the I/O to disk. In the process, move vgname_from_mda() to layout.h, since the fn only gets called from format_text code, and we need the mda_header definition from the private layout.h. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:31:01 +00:00
Dave Wysochanski	da0b4d8770	Move dev_open/dev_close outside vgname_from_mda(). Refactor vgname_from_mda() so caller must open/close the device. Should be no functional change. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:46 +00:00
Dave Wysochanski	96597c2eab	Move dev_open / dev_close outside _vg_read_raw_area(). This refactoring moves the device open/close up one level to the caller of _vg_read_raw_area(). Should be no functional change and facilitate future changes related to metadata balancing. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:30:30 +00:00
Dave Wysochanski	322c5868b3	Add location independent flag and functions to ignore mdas. First we add a 'flags' field to the location independent metadata_area structure, and a MDA_IGNORE flag. The mda_is_ignored and mda_set_ignored functions are added to manage the flag. Adding the flag and functions gives a library interface to ignore metadata areas independent of the underlying location (disk, file, etc). The location specific read/write functions must then handle the specifics of what this flag means to the location. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-by: Alasdair G Kergon <agk@redhat.com>	2010-06-28 20:30:14 +00:00
Dave Wysochanski	d144d5eeb7	Add text format specific 'rlocn' ignore flag and access functions. Adding a flag to the 'rlocn' structure in the mda header of the text format allows us to flip a bit to ignore an area on disk that stores the metadata via the text format specific mda_header. This patch defines the flag and access functions to manage the flag. Other patches will manage the ignore on a format-independent basis, by using a flag in the metadata_area structure. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:57 +00:00
Dave Wysochanski	7c604e7649	Change 'filler' to 'flags' in on-disk 'raw_locn' structure. Future patches will make use of a specific flag in the on-disk 'raw_locn' structure to enable/disable metadata areas, and facilitate metadata balancing. Note that 'filler' is always set to '0' (see add_mda() - memset), so use of this area as a non-zero flags field is a safe way to provide future code features. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-06-28 20:29:42 +00:00
Jonathan Earl Brassow	68c31a2a36	Fix for bz608048 from Taka... The same region size is used for both mirror volume and mirrored log volume, but when the physical extent size is bigger than region size, the size of mirror leg for mirrored log is smaller than the region size and lvcreate command fails. This patch adjusts a region size of mirrored log to a smaller value of region size or physical extent size. [This patch ensures that the region_size of the mirrored log does not exceed the size of the mirrored log itself, which would violate the kernel constraint: (region_size <= ti->len).] Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-06-28 14:19:41 +00:00
Zdenek Kabelac	d301e5917f	Preload libc locale messages. Preload libc.mo file for localized lvm before taking memory lock - this way we prevent disk access for some error paths in libdm, that prints localized errno messages while they are still in memory locked state.	2010-06-24 08:29:30 +00:00
Jonathan Earl Brassow	42f7fd0590	The function that runs to compress a stacked mirror after converting from 2-way to 3-way mirror (collapse_mirrored_lv) was calling '_remove_mirror_images' with the 'remove_log' parameter set. When the code was put in to fix 599898 to honor log parameters during conversion, this argument was suddenly being honored. Thus, when someone would convert from a 2-way to 3-way mirror, the log would get removed. 'collapse_mirrored_lv' should not be calling '_remove_mirror_images' with 'remove_log' set.	2010-06-23 13:57:26 +00:00
Alasdair Kergon	07ae1d4943	Add lv_path to reports to offer full /dev pathname.	2010-06-23 12:32:08 +00:00
Milan Broz	f9e177d281	Fix "allocated" warning typo.	2010-06-22 21:10:53 +00:00
Dave Wysochanski	58f55600d0	Add device name to output of error messages in raw_read_mda_header(). It would be helpful if we had the device name when something like a mda_header checksum error occurs. Before: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0 After: ./tools/lvm pvs -opv_name,vg_name,uuid,mda_count,pv_mda_count_ignored,vg_mda_count,vg_mda_count_ignored,vg_mda_copies Incorrect metadata area header checksum on /dev/loop2 at offset 4096 PV VG PV UUID #PMda #PMdaIgn #VMda #VMdaIgn #VMdaCps /dev/loop0 vgtest2 sVv26t-gjpb-Rcau-uBDO-Cx04-GbRR-6Ssq7e 2 0 4 0 4 /dev/loop1 vgtest2 zXWStT-qE8F-mbkc-RfgH-aytv-mptF-Y5Ce09 2 0 4 0 4 /dev/loop2 riCpK9-9G8r-LlIp-i2oh-mb3N-CUzk-u5YpuR 1 0 0 0 0 /dev/loop3 vgtest tQCUjm-rmyd-i92d-4eeE-UYBW-v1vQ-kRaA17 2 0 4 2 0 /dev/loop4 vgtest ZRvpeI-p8F1-ccVW-BBac-xhl1-aGXU-CbP0oo 2 2 4 2 0	2010-06-22 19:18:27 +00:00
Jonathan Earl Brassow	a7d355a28c	Mirrors can be layered - as in the case of an converting 2-way to 3-way mirror. When conversion operations are performed on these types of mirrors, log options can be confused/ignored. In the case of a converting 3-way mirror, we have a top-level 2-way corelog mirror whose legs are 1) a 2-way disk-log mirror and 2) a linear device. If we wish to convert this 3-way mirror to a 2-way mirror, the linear device is removed and the extra top layer is eliminated. If we also wished to convert the disk log to a core log in the same step, ambiguity creeps in. It is somewhat obvious what the user wants - a 2-way mirror with a corelog. However, looking at the top level mirror before compression, it seems that the mirror already has a core log. This is why the operation seemed to fail. This patch simply re-evaluates what mirrored_seg points to after a compression and then considers the log argument. This is a fix for bug 599898.	2010-06-21 16:12:33 +00:00
Alasdair Kergon	b4ee00356b	Various cleanups following recent commits.	2010-06-21 15:56:57 +00:00
Milan Broz	d2031f6a16	Clean up cluster lock mode and flags definition. Code is mixing up internal DLM and LVM definitions of lock modes and flags. OpenAIS and singlenode locking do not depend on DLM but code currently cannot be compiled without libdlm.h! LCK_* flags is LVM abstraction, used through all the code. Only low-level backend (clvmd-cman etc) should use DLM definitions, also this code should do all needed conversions. Because there are two DLM flags used in generic code (NOQUEUE, CONVERT) we define it similar way like lock modes. (So all needed binary-compatible flags are on one place in locking.h) (Further code cleaning still needed, though:-)	2010-06-17 12:48:54 +00:00

... 7 8 9 10 11 ...

2783 Commits