shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Jonathan Earl Brassow	532e6c8ae3	Thanks to Zdenek Kabelac (kabi) for pointing out that I was using dm_pool_free incorrectly. This check-in fixes that incorrect usage. I've also added a WHATS_NEW line to reflect the changes I made to allow lv_extend to operate on 0 length intrinsically layered LVs (i.e mirrors and RAID). I forgot that in the last commit.	2011-04-07 21:49:29 +00:00
Jonathan Earl Brassow	fe93c99ad9	This patch adds the ability to extend 0 length layered LVs. This allows us to allocate all images of a mirror (or RAID array) at one time during create. The current mirror implementation still requires a separate allocation for the log, however.	2011-04-06 21:32:20 +00:00
Peter Rajnoha	29684f590c	Cleanup fid finalization code in free_vg and allow exactly the same fid to be set again for a PV/VG. Actually, we can call vg_set_fid(vg, NULL) instead of calling destroy_instance for all PV structs and a VG struct - it's the same code we already have in the vg_set_fid. Also, allow exactly the same fid to be set again for the same PV/VG Before, this could end up with the fid destroyed because we destroyed existing fid first and then we used the new one and we didn't care whether existing one == new one by chance.	2011-04-01 14:54:20 +00:00
Zdenek Kabelac	3d04380691	Use created hash tables for quick check of LV, PV. Instead of searching linear list of all LVs, PVs - use created hash tables also for quick mapping between LV. (Note - for small number of PVs or LVs the overhead of the hash is bigger). TODO: Use hash tables in volume_group structure directly.	2011-03-30 13:35:51 +00:00
Zdenek Kabelac	1bedd3a97b	Use id_equal instead of strncmp() More consistent and easier to read.	2011-03-29 21:57:56 +00:00
Zdenek Kabelac	f77736cab5	Remove double braces Clang gives notice about possible confusion as commonly double bracces are used when some assignment is done inside them.	2011-03-29 20:19:03 +00:00
Jonathan Earl Brassow	60c10a45ce	s/MIRROR_NOTSYNCED/LV_NOTSYNCED/ - Flag will may refer to more than just mirrors	2011-03-29 12:51:57 +00:00
Jonathan Earl Brassow	be226be635	Fix unhandled condition in _move_lv_segments If _move_lv_segments is passed a 'lv_from' that does not yet have any segments, it will screw things up because the code that does the segment copy assumes there is at least one segment. See copy code here: lv_to->segments = lv_from->segments; lv_to->segments.n->p = &lv_to->segments; lv_to->segments.p->n = &lv_to->segments; If 'segments' is an empty list, the first statement copies over the values, but the next two reset those values to point to the other LV's list structure. 'lv_to' now appears to have one segment, but it is really an ill-set pointer.	2011-03-25 22:02:27 +00:00
Petr Rockai	5ef2808bc7	In some cases, we could end up with a mirrored LV without a MIRRORED flag. In other cases, the code could wind up removing wrong number of mirrors. In yet other cases, we could remove the right number of mirrors, but fail to respect the removal preferences (i.e. keep an image that was requested to be removed while removing an image that was requested to be kept). Under some circumstances, remove_mirror_images could also get stuck in an infinite loop. This patch should fix all of the above undesirable behaviours. Signed-off-by: Petr Rockai <prockai@redhat.com> Reviewed-by: Jonathan Brassow <jbrassow@redhat.com>	2011-03-24 12:28:02 +00:00
Zdenek Kabelac	b8ccce3500	Add missing \0 for grown debug object Attach \0 for proper char* display - otherwise somewhat random message could be displayed in debug more and read of unpredictable read of uninitilized memory values could happen.	2011-03-14 17:00:57 +00:00
Zdenek Kabelac	844b75f4d6	Fix allocation of system_id As code uses strncpy(system_id, NAME_LEN) and doesn't set '\0' Fix it by always allocating NAME_LEN + 1 buffer size and with zalloc we always get '\0' as the last byte. This bug may trigger some unexpected behavior of the string operation code - depends on the pool allocator. FIXME: refactor this code to alloc_vg.	2011-03-13 23:05:48 +00:00
Peter Rajnoha	ff4479414c	Use format instance mempool where possible and adequate.	2011-03-11 15:10:16 +00:00
Peter Rajnoha	e8d4946ec7	Various cleanups for fid mem and ref_count changes. Missing free_vg on error_path in lvmcache_get_vg fn. Call destroy_instance only if the fid is not part of the vg in backup_read_vg fn (otherwise it's part of the VG we're returning and we definitely don't want to destroy it!).	2011-03-11 15:08:31 +00:00
Peter Rajnoha	2feb2a66fd	Call destroy_instance for any PVs found in VG structure during vg_free call. This is necessary for proper format instance ref_count support. We iterate over vg->pvs and vg->removed_pvs list and the ref_count is decremented and then it is destroyed if not referenced anymore.	2011-03-11 15:06:13 +00:00
Peter Rajnoha	84f48499a3	Add new free_pv_fid fn and use it throughout to free all attached fids. Since format instances will use own memory pool, it's necessary to properly deallocate it. For now, only fid is deallocated. The PV structure itself still uses cmd mempool mostly, but anytime we'd like to add a mempool in the struct physical_volume, we can just rename this fn to free_pv and add the code (like we have free_vg fn for VGs).	2011-03-11 14:56:56 +00:00
Peter Rajnoha	1307ddf4cf	Use only vg_set_fid and new pv_set_fid fn to assign the format instance. This is essential for proper format instance ref_count support. We must use these functions to set the fid everywhere from now on, even the NULL value!	2011-03-11 14:50:13 +00:00
Peter Rajnoha	a1bec4e685	Add mem and ref_count fields to struct format_instance for own mempool use. Format instances can be created anytime on demand and it contains metadata area information mostly (at least for now, but in the future, we may store more things here to update/edit in a PV/VG). In case we have lots of metadata areas, memory consumption will rise. Using cmd context mempool is not quite optimal here because it is destroyed too late. So let's use a separate mempool for format instances. Reference counting is used because fids could be shared, e.g. each PV has either a PV-based fid or VG-based fid. If it's VG-based, each PV has a shared fid with the VG - a reference to VG's fid.	2011-03-11 14:38:38 +00:00
Peter Rajnoha	56f5b12eed	Use new alloc_fid fn for common format instance initialisation.	2011-03-11 14:30:27 +00:00
Zdenek Kabelac	a6f38f9d6a	Missed merge fix in vg_validate patch	2011-03-10 22:39:36 +00:00
Zdenek Kabelac	442dbf9ad8	Refactor code for _lv_postoder Add _lv_postorder_vg() - for calling _lv_postorder() for every LV from VG. We use this in 2 places - vg_mark_partial_lvs() and vg_validate() so make it as a one function. Benefit here is - to use only one cleanup code and avoid potentially duplicate scans of same LVs.	2011-03-10 14:40:32 +00:00
Zdenek Kabelac	4ee2b4965f	Use hash tables for validating names Accelerate validation loop by using lvname, lvid, pvid hash tables. Also merge pvl loop into one cycle now - no need to scan the list twice. List scan is stopped when dm_hash_insert fails. The error message with loop_counter1 is no longer provided - however the message has been misleading anyway.	2011-03-10 13:11:59 +00:00
Zdenek Kabelac	3019419e95	Refactor vg allocation code Create new function alloc_vg() to allocate VG structure. It takes pool_name (for easier debugging). and also take vg_name to futher simplify code. Move remainder of _build_vg_from_pds to _pool_vg_read and use vg memory pool for import functions. (it's been using smem -> fid mempool -> cmd mempool) (FIXME: remove mempool parameter for import functions and use vg). Move remainder of the _build_vg to _format1_vg_read	2011-03-10 12:43:29 +00:00
Alasdair Kergon	2f25c320fb	Use empty string instead of /dev// for LV path when there's no VG. Don't allocate unused VG mempool in _pvsegs_sub_single.	2011-03-09 12:44:42 +00:00
Zdenek Kabelac	55f6627427	Fix reading of released memory lvseg_segtype_dup used memory pool vg memory pool for strind duplication. However this one gets released before reporting happens so the command like: pvs -o segtype prints data from already released memory pool. Thanks to the fact there is not much allocation happing after the VG is released, the memory stays unmodified and correct result is printed. Fix adds support for mempool passed parameter (like other similar query commands) and uses dm_report memory pool for string duplication.	2011-03-05 12:14:00 +00:00
Milan Broz	be3510b204	PE size overflows, on most architectures it is catch by "PE cannot be 0" but s390x unfortunately return something usable. Always use unit64 in inital parameter check.	2011-03-02 20:00:09 +00:00
Zdenek Kabelac	36653e8903	Add fall through comments Add comments to switch case construct.	2011-02-28 19:53:03 +00:00
Peter Rajnoha	3b97e8d643	Allow non-orphan PVs with two metadata areas to be resized. We allow writing non-orphan PVs only for resize now. The "orphan PV" assert in pv_write fn uses the "allow_non_orphan" parameter to control this assert. However, we should find a more elaborate solution so we can remove this restriction altogether (pv_write together with vg_write is not atomic, we need to find a safe mechanism so there's an easy revert possible in case of an error).	2011-02-28 13:19:02 +00:00
Alasdair Kergon	1a52fa6858	Fix check for log-only allocation in new alloc normal loop.	2011-02-27 01:16:52 +00:00
Alasdair Kergon	92ffcda183	Various changes to the allocation algorithms: Expect some fallout. There is a lot to test. Two new config settings added that are intended to make the code behave closely to the way it did before - worth a try if you find problems.	2011-02-27 00:38:31 +00:00
Peter Rajnoha	4a304dc1d8	Allow only orphan PVs to be resized even with two metadata areas.	2011-02-25 14:08:54 +00:00
Peter Rajnoha	f74bd57ec9	Revert the patch for vgconvert to work with recent changes in metadata area handling. This should work now with the help of the patch from previous commit.	2011-02-25 14:02:53 +00:00
Peter Rajnoha	38b0564cab	Read PV metadata information from cache if pv_setup called with pv->fid == vg->fid. If the PV is already part of the VG (so the pv->fid == vg->fid), it makes no sense to attach the mdas information from PV to a VG. Instead, we read new PV metadata information from cache and attach it to the VG fid.	2011-02-25 13:59:47 +00:00
Peter Rajnoha	c901a92aa5	%ld -> PRIu64	2011-02-21 13:09:27 +00:00
Peter Rajnoha	9c0035c129	Fix metadata balance code to work with recent changes in metadata handling interface (with the changes in format_instance).	2011-02-21 12:33:16 +00:00
Peter Rajnoha	51aed1992f	Add old_uuid field to struct physical_volume so we can still reference a PV with its old UUID when we're changig it (the cache as well as metadata area index has the old uuid that we need to use to access the information!)	2011-02-21 12:31:28 +00:00
Peter Rajnoha	6bdc80743e	Fix vgconvert code to work with changes in metadata area handling and changes in format_instance. Add new 'vg_convert' function.	2011-02-21 12:29:21 +00:00
Peter Rajnoha	cb2396730a	Change pvresize code to work with new metadata handling interface and allow resizing a PV with two metadata areas.	2011-02-21 12:27:26 +00:00
Peter Rajnoha	17ad2b1115	Change pv_write code to work with the changes in metadata handling interface and changes in format_instance.	2011-02-21 12:26:27 +00:00
Peter Rajnoha	94d91fdda1	Change the code throughout to use new pv_initialise and modified pv_setup fn. Change pv_create code to work with these changes together with using new pv_add_metadata_area fn to add metadata areas for a PV being created.	2011-02-21 12:24:15 +00:00
Peter Rajnoha	617b900d85	Separate new pv_initialise function out of the original pv_setup code. pv_initiliase initialises a new PV pv_setup sets up an existing PV with a VG	2011-02-21 12:20:18 +00:00
Peter Rajnoha	981895a860	Add new pv_remove_metadata_area interface function.	2011-02-21 12:17:54 +00:00
Peter Rajnoha	8d5d20a526	Add new pv_add_metadata_area interface function.	2011-02-21 12:17:26 +00:00
Peter Rajnoha	305816232d	Remove useless mdas parameter for pv_read (from now on, we store mdas in a format instance)	2011-02-21 12:15:59 +00:00
Peter Rajnoha	6e0b348d34	Add format instance support for pv_read code.	2011-02-21 12:13:40 +00:00
Peter Rajnoha	56280d0d3a	Initialise a new PV-based format instance for a PV that is being created.	2011-02-21 12:12:32 +00:00
Peter Rajnoha	f8b78ec613	Add vg_set_fid function to change VG format instance. This function also sets a reference to a new VG format instance for all PVs that are part of the VG so the PV-VG interconnection is consistent after the change.	2011-02-21 12:10:58 +00:00
Peter Rajnoha	c0c21864c6	Change the code throughout for recent changes in format_instance handling.	2011-02-21 12:07:03 +00:00
Peter Rajnoha	88129db5e1	Change create_instance to create PV-based as well as VG-based format instances. Add supporting functions to work with the format instance and metadata area structures stored within the format instance. Add support for simple indexing of metadata areas using PV id and mda order (for on-disk PV only for now, we can extend the indexing even for other mdas if needed - we only need to define a proper key for the index).	2011-02-21 12:05:49 +00:00
Peter Rajnoha	716c4ebe52	Change and generalise struct format_instance for PV and VG use.	2011-02-21 12:01:22 +00:00
Zdenek Kabelac	aec2115410	Const fixing Fixing some const warnings - with API change in: int vg_extend(struct volume_group vg, int pv_count, const char const pv_names, Change is needed - as lvm2api expects const behaviour here. So vg_extend() is doing local strdup for unescaping. skip_dev_dir return const char from const char* vg_name. Rest of the patch is cleanup of related warnings. Also using dm_report_filed_string() API change to simplify casting in _string_disp and _lvname_disp.	2011-02-18 14:47:28 +00:00
Zdenek Kabelac	b1bcff7424	Critical section New strategy for memory locking to decrease the number of call to to un/lock memory when processing critical lvm functions. Introducing functions for critical section. Inside the critical section - memory is always locked. When leaving the critical section, the memory stays locked until memlock_unlock() is called - this happens with sync_local_dev_names() and sync_dev_names() function call. memlock_reset() is needed to reset locking numbers after fork (polldaemon). The patch itself is mostly rename: memlock_inc -> critical_section_inc memlock_dec -> critical_section_dec memlock -> critical_section Daemons (clmvd, dmevent) are using memlock_daemon_inc&dec (mlockall()) thus they will never release or relock memory they've already locked memory. Macros sync_local_dev_names() and sync_dev_names() are functions. It's better for debugging - and also we do not need to add memlock.h to locking.h header (for memlock_unlock() prototyp).	2011-02-18 14:16:11 +00:00
Zdenek Kabelac	794e94fe16	Replace PV_MIN_SIZE with function pv_min_size() Add configurable option to define minimal size of of block device usable as a PV. pv_min_size() is added to lvm-globals and it's being initialized through _process_config. Macro PV_MIN_SIZE is unused and removed. New define DEFAULT_PV_MIN_SIZE_KB is added to lvm-global and unlike PV_MIN_SIZE it uses KB units. Should help users with various slow devices attached to the system, which cannot be easily filtered out (like FDD on /dev/sdX): https://bugzilla.redhat.com/show_bug.cgi?id=644578	2011-02-18 14:11:22 +00:00
Petr Rockai	21849a8587	Fix an lv_postorder bug where it failed to clear temporary flags, making it impossible to use twice with the same LV(s). Discovered by Milan.	2011-02-14 19:27:05 +00:00
Jonathan Earl Brassow	27ff8813da	Allow snapshots in a cluster as long as they are exclusively activated. In order to achieve this, we need to be able to query whether the origin is active exclusively (a condition of being able to add an exclusive snapshot). Once we are able to query the exclusive activation of an LV, we can safely create/activate the snapshot. A change to 'hold_lock' was also made so that a request to aquire a WRITE lock did not replace an EX lock, which is already a form of write lock.	2011-02-04 20:30:17 +00:00
Mike Snitzer	3e3591904b	Improve lvcreate "insufficient extents" errors to "insufficient free space".	2011-01-28 02:58:00 +00:00
Alasdair Kergon	cef065f63f	Fix lvchange --test to exit cleanly.	2011-01-24 14:19:05 +00:00
Alasdair Kergon	a8de276520	Replace fs_unlock by sync_local_dev_names to notify local clvmd. (2.02.80) Introduce sync_local_dev_names and CLVMD_CMD_SYNC_NAMES to issue fs_unlock.	2011-01-12 20:42:50 +00:00
Jonathan Earl Brassow	6a095ca99f	s/log_verbose/log_error/ - Increase log level on error message.	2011-01-11 17:21:01 +00:00
Jonathan Earl Brassow	025e69a15a	Add disk to mirrored log type conversion.	2011-01-11 17:05:08 +00:00
Zdenek Kabelac	937a21f0d2	Speedup consequent activation calls Stop calling fs_unlock() from lv_de/activate(). Start using internal lvm fs cookie for dm_tree. Stop directly calling dm_udev_wait() and dm_tree_set/get_cookie() from activate code - it's now called through fs_unlock() function. Add lvm_do_fs_unlock() Call fs_unlock() when unlocking vg where implicit unlock solves the problem also for cluster - thus no extra command for clustering environment is required - only lvm_do_fs_unlock() function is added to call lvm's fs_unlock() while holding lvm_lock mutex in clvmd. Add fs_unlock() also to set_lv() so the command waits until devices are ready for regular open (i.e. wiping its begining). Move fs_unlock() prototype to activation.h to keep fs.h private in lib/activate dir and not expose other functions from this header.	2011-01-10 14:02:30 +00:00
Zdenek Kabelac	6feecf76d4	Change import_vg_from_buffer to use config_tree Change function import_vg_from_buffer() to import_vg_from_config_tree(). Instead of creating config tree inside the function allow config tree to be passed as parameter - usable later for caching.	2011-01-10 13:13:42 +00:00
Zdenek Kabelac	2ae2ca89bf	Add backtraces for backup and backup_remove fail paths	2010-12-22 15:36:41 +00:00
Zdenek Kabelac	b7149bbe45	Add missing test for reallocation error.	2010-12-20 14:38:22 +00:00
Zdenek Kabelac	9b30dfb967	Use const char * for name and old_name in vg Switch to use const char pointers to avoid changes of these structure members and having better control over, were these members could be modified.	2010-12-20 13:40:46 +00:00
Zdenek Kabelac	9d9de35dca	Remove const usage from destroy callbacks As const segment_type or const format_type are never released use their non-const version and remove const downcast from dm_free calls. This change fixes many gcc warnings we were getting from them.	2010-12-20 13:32:49 +00:00
Zdenek Kabelac	ba96eb24fa	Some const cleanups Minor const warning fixes and internal API updates.	2010-12-20 13:19:13 +00:00
Zdenek Kabelac	760d1fac55	Add more strict const pointers around config tree To have better control were the config tree could be modified use more const pointers and very carefully downcast them back to non-const (for config tree merge).	2010-12-20 13:12:55 +00:00
Petr Rockai	ebfe96cad5	Add further consistency checking to vg_validate, ensuring that all segment areas point to LVs or PVs that are listed in the respective VG.	2010-12-14 17:51:09 +00:00
Petr Rockai	75b2f3507a	Add a validation step for pvmoveN internal LVs to vg_validate.	2010-12-14 17:07:35 +00:00
Alasdair Kergon	acb037657c	Fix scanning of VGs without in-PV mdas. Set cmd->independent_metadata_areas if metadata/dirs or disk_areas in use. - Identify and record this state. Don't skip full scan when independent mdas are present even if memlock is set. - Clusters and OOM aren't supported, so no problem doing the proper scans. Avoid revalidating the label cache immediately after scanning. - A simple optimisation. Support scanning for a single VG in independent mdas. - Not used by the fix but I left it in anyway as later patches might use it.	2010-12-10 22:39:52 +00:00
Alasdair Kergon	2b82bd79f5	Rename vg_release to free_vg.	2010-12-08 20:50:48 +00:00
Zdenek Kabelac	54fca7b1ca	Remove reset of vg->vgmem pointer as it is access of already release memory This reset of vgmem pointer causes access of already released memory. (_vg_make_handle allocates vg from vgmem pool itself - which is a bit tricky) Interestingly this memory fault was missed by our test suite.	2010-12-08 10:45:37 +00:00
Zdenek Kabelac	166597d998	Add backtraces for errors Add stack; backtraces when error is reported from dev_set() or dev_close_immediate().	2010-12-01 12:56:39 +00:00
Petr Rockai	8191fe4f4a	Refactor the percent (mirror sync, snapshot usage) handling code to use fixed-point values instead of a combination of a float value and an enum.	2010-11-30 11:53:31 +00:00
Petr Rockai	97e8048e05	Avoid the automatic MISSING_PV recovery path in commands with special MISSING_PV handling (cmd->handles_missing_pvs is set).	2010-11-30 11:15:54 +00:00
Alasdair Kergon	1415afcdba	Fix memory leak when VG allocation policy in metadata is invalid. Ignore unrecognised allocation policy found in metadata instead of aborting. Fix another missing vg_release() in _vg_read_by_vgid.	2010-11-29 18:35:37 +00:00
Zdenek Kabelac	201222ebad	Reset vg pointer after release Set vg to NULL after releasing it as the following memlock() test may lead to goto for the second call of vg_release() with the already released vg pointer.	2010-11-29 11:08:14 +00:00
Alasdair Kergon	728074ac83	Suppress 'No PV label' message when removing several PVs without mdas.	2010-11-23 01:55:53 +00:00
Petr Rockai	c1abd569f2	Add the macro and specific 'get' functions for lvsegs. Signed-off-by: Dave Wysochanski <wysochanski@pobox.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-11-17 20:08:14 +00:00
Alasdair Kergon	f8452d8cfd	Support repetition of --addtag and --deltag arguments. Add infrastructure for specific cmdline arguments to be repeated in groups. Split the_args cmdline arguments and values into arg_props and arg_values.	2010-11-11 17:29:05 +00:00
Zdenek Kabelac	64dff85ce4	Preserve const for char pointer Keep char pointers 'const' (introduced with cling commit).	2010-11-11 12:32:33 +00:00
Alasdair Kergon	eb82bd0525	Extend cling allocation policy to recognise PV tags (cling_by_tags). Add allocation/cling_tag_list to lvm.conf.	2010-11-09 12:34:40 +00:00
Peter Rajnoha	f7e3a19f75	Clarify error messages when activation fails due to activation filter use.	2010-11-05 18:18:11 +00:00
Alasdair Kergon	2aa06d73ca	pre-release	2010-10-25 13:54:29 +00:00
Zdenek Kabelac	91e56ffb29	Fix constness warning for _vg_read_by_vgid() uuid usage	2010-10-25 13:35:13 +00:00
Alasdair Kergon	eacd3a0916	fix header #defines	2010-10-25 12:01:59 +00:00
Alasdair Kergon	b83af51668	Add global/metadata_read_only to use unrepaired metadata in read-only cmds.	2010-10-25 11:20:54 +00:00
Dave Wysochanski	d53d92f2e1	Add lv_read_ahead and lv_kernel_read_ahead 'get' functions.	2010-10-21 14:49:31 +00:00
Dave Wysochanski	f1fc310730	Refactor and add code for (lv) 'lv_origin' get function.	2010-10-21 14:49:20 +00:00
Dave Wysochanski	6103254393	Refactor and add code for (lv) 'lv_name' get function.	2010-10-21 14:49:10 +00:00
Jonathan Earl Brassow	2c33c8b80c	Fix for bug 637936: killing both redundant logs causes deadlock Problem: When both legs of a mirrored log fail, neither the log nor the parent mirror can proceed. The repair code must be careful to replace the log with an error target before operating on the parent - otherwise, the parent can get stuck trying to suspend because it can't push through any writes. The steps to replace the log device with an error target were incomplete and resulted in the replacement not happening at all! The code originally had all the necessary logic to complete the replacement task, but was pulled out in a effort to clean-up that section of code, while fixing another bug: <offending commit msg> In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). </offending commit msg> http://sources.redhat.com/cgi-bin/cvsweb.cgi/LVM2/lib/metadata/mirror.c?cvsroot=lvm2&f=h#rev1.130 I've reverted the "clean-up" changes associated with that fix, but not what that commit was actually fixing. Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-10-14 20:03:12 +00:00
Mike Snitzer	9443b5d4cd	Convey need for snapshot-merge target in lvconvert error message and man page. Add ->target_name to segtype_handler to allow a more specific target name to be returned based on the state of the segment. Result of trying to merge a snapshot using a kernel that doesn't have the snapshot-merge target: Before: # lvconvert --merge vg/snap Can't expand LV lv: snapshot target support missing from kernel? Failed to suspend origin lv After: # lvconvert --merge vg/snap Can't process LV lv: snapshot-merge target support missing from kernel? Failed to suspend origin lv Unable to merge LV "snap" into it's origin.	2010-10-13 21:26:37 +00:00
Petr Rockai	042312952c	Give correct error message when creating a too-small snapshot (BZ 587063)	2010-10-13 13:52:53 +00:00
Zdenek Kabelac	7c9fd3ea84	Don't use floor() in _bitset_with_random_bits Use _even_rand() function instead of floor() in _bitset_with_random_bits(). floor() function is missing in dietlibc (on architectures other than x86). Moreover using floor() to clip rand results does not assure even result distribution. _even_rand() uses integer arithmetic only and is designed to return evenly distributed results. > Looks OK to me. It took a while to decipher what is the exact meaning of > the loop in _even_rand (to a non-pseudorandomness-expert) but I am > fairly comfortable with it now. If I understand this correctly, it > rejects numbers that come from an "incomplete" slice of the RAND_MAX > space (considering the number space [0, RAND_MAX] is divided into some > "max"-sized slices and at most a single smaller slice, between [n*max, > RAND_MAX] for suitable n -- numbers from this last slice are discarded > because they could distort the distribution in favour of smaller > numbers). Signed-off-by: Przemyslaw Iskra <sparky <at> pld-linux.org> Reviewed-by: Petr Rockai <prockai <at> redhat.com>	2010-10-13 12:18:53 +00:00
Dave Wysochanski	f70468ce0b	Fix lv_modules_dup segfault.	2010-10-12 17:09:23 +00:00
Petr Rockai	98351ffbd5	Make lvconvert respect --yes/--force in the inactive log conversion prompt. Fixes BZs 642055, 621281. Patch by Taka. Signed-off-by: Takahiro Yasui <tyasui@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-10-12 16:41:17 +00:00
Dave Wysochanski	2eba846043	Refactor and add code for (lv) 'modules' get function.	2010-10-12 16:13:06 +00:00
Dave Wysochanski	d88090b0ae	Refactor and add code for (lv) 'mirror_log' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:50 +00:00
Dave Wysochanski	40c6c80723	Refactor and add code for (lv) 'lv_kernel_{major\|minor}' get functions. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:33 +00:00
Dave Wysochanski	e27833fb9c	Refactor and add code for (lv) 'convert_lv' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:18 +00:00
Dave Wysochanski	af579eccc3	Refactor and add code for (lv) 'move_pv' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:12:02 +00:00
Dave Wysochanski	29636f38e3	Refactor and add code for (lv) 'origin_size' get function. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com> Reviewed-By: Petr Rockai <prockai@redhat.com>	2010-10-12 16:11:48 +00:00
Dave Wysochanski	802e252b29	Refactor and add code for (lv) 'lv_path' get function.	2010-10-12 16:11:34 +00:00
Dave Wysochanski	637ac19e60	Rename 'flags' to 'status' for struct metadata_area. In other LVM memory structures such as volume_group, the field used to store flags is called "status", and on-disk fields are called 'flags', so rename the one inside metadata_area to be consistent. Not only is it more consistent with existing code but is cleaner to say "the status of this mda is ignored". Background for this patch - prajnoha pinged me on IRC this morning about a fix he was working on related to metadataignore when metadata/dirs was set. I was reviewing my patches from this year and realized the 'flags' field was probably not the best choice when I originally did the metadataignore patches.	2010-10-05 17:34:05 +00:00
Dave Wysochanski	0ca1492ca5	Fix copyright dates on new files lib/metadata/{lv\|vg\|pv}.[ch].	2010-09-30 20:47:18 +00:00
Dave Wysochanski	b184f791d4	Add pv_name_dup() and pv_fmt_dup() helper functions.	2010-09-30 14:09:22 +00:00
Dave Wysochanski	1cd292af8f	Add pv_mda_size, pv_mda_free, and pv_used functions, call from 'disp' functions.	2010-09-30 14:09:10 +00:00
Dave Wysochanski	b1ef78d000	Add supporting functions vg_name_dup, vg_fmt_dup, vg_system_id_dup. Add supporting functions for vg_name, vg_fmt, vg_system_id. Append "_dup" to end of supporting functions to make clear the strings are dup'd and to avoid namespace conflict with vg_name.	2010-09-30 14:08:33 +00:00
Dave Wysochanski	c508945ca9	Add pv_tags_dup, vg_tags_dup, lv_tags_dup functions that call tags_format_and_copy.	2010-09-30 14:08:19 +00:00
Dave Wysochanski	f15033c0e1	Add tags_format_and_copy() common function and call from _tags_disp. Add a common function to allocate memory and format a string of tags. Call tags_format_and_copy() from _tags_disp().	2010-09-30 14:08:07 +00:00
Dave Wysochanski	254d672dcc	Add pv_uuid_dup, vg_uuid_dup, and lv_uuid_dup, and call id_format_and_copy. Add supporting functions for pv_uuid, vg_uuid, and lv_uuid. Call new function id_format_and_copy. Use 'const' where appropriate. Add "_dup" suffix to indicate memory is being allocated. Call {pv\|vg\|lv}_uuid_dup from lvm2app uuid functions.	2010-09-30 14:07:47 +00:00
Dave Wysochanski	4bbadbe1cf	Simplify logic to create 'attr' strings. This patch addresses code review request to simplify creation of 'attr' strings. The simplification is done in this separate patch to more easily review and ensure the simplification is done without error.	2010-09-30 14:07:19 +00:00
Dave Wysochanski	14663348d0	Add {pv\|vg\|lv}_attr_dup() functions and refactor 'disp' functions. Move the creating of the 'attr' strings into a common function so they can be called from the 'disp' functions as well as the new 'get' property functions. Add "_dup" suffix to indicate memory is allocated. Refactor pvstatus_disp to take pv argument and call pv_attr_dup().	2010-09-30 13:52:55 +00:00
Dave Wysochanski	e32e2eb011	Add lib/metadata/vg.[ch] and lib/metadata/lv.[ch]. These got missed when git cvsexportcommit was used.	2010-09-30 13:16:55 +00:00
Dave Wysochanski	b88b638d6e	Add lib/metadata/pv.[ch] new files. Apparently git cvsexportcommit does not properly add new files from a git commit.	2010-09-30 13:15:42 +00:00
Dave Wysochanski	b171907fc5	Refactor metadata.[ch] into lv.[ch] for lv functions. This patch is similar to the other patches for pv and vg functionality, and separates lv functionality into separate files, concentrating on reporting fields and simple functions.	2010-09-30 13:05:45 +00:00
Dave Wysochanski	f42b708eae	Refactor metadata.[ch] into pv.[ch] for pv functions. The metadata.[ch] files are very large. This patch makes a first attempt at separating out pv functions and data, particularly related to the reporting fields calculations. More code could be moved here but for now I'm stopping at reporting functions 'get' / 'set' functions.	2010-09-30 13:05:20 +00:00
Dave Wysochanski	81f0124a58	Refactor metadata.[ch] into vg.[ch] for vg functions. The metadata.[ch] files are very large. This patch makes a first attempt at separating out vg functions and data, particularly related to the reporting fields calculations.	2010-09-30 13:04:55 +00:00
Peter Rajnoha	bad35c6554	Add escape sequence for ':' and '@' found in device names used as PVs.	2010-09-23 12:02:33 +00:00
Milan Broz	c7af31dbd7	Fix return type qualifier to avoid compiler warning. introduced in commit `b16b4d92a7` "Improve various log messages." fixes a lot of ../include/metadata.h:148: warning: type qualifiers ignored on function return type	2010-08-26 12:08:19 +00:00
Mike Snitzer	4efb1d9cbb	Update heuristic used for default and detected data alignment. Add "devices/default_data_alignment" to lvm.conf to control the internal default that LVM2 uses: 0==64k, 1==1MB, 2==2MB, etc. If --dataalignment (or lvm.conf's "devices/data_alignment") is specified then it is always used to align the start of the data area. This means the md_chunk_alignment and data_alignment_detection are disabled if set. (Same now applies to pvcreate --dataalignmentoffset, the specified value will be used instead of the result from data_alignment_offset_detection) set_pe_align() still looks to use the determined default alignment (based on lvm.conf's default_data_alignment) if the default is a multiple of the MD or topology detected values.	2010-08-20 20:59:05 +00:00
Dave Wysochanski	69d67dc2ca	Add vg_mda_size and vg_mda_free functions. Add supporting functions to get vg_mda_size and vg_mda_free fields. Should be no functional change.	2010-08-20 12:43:49 +00:00
Milan Broz	586b56b18c	Fix wrong use of LCK_WRITE In all top vg read functions only LCK_VG_READ/WRITE can be used. All other vg lock definitions are low-level backend machinery. Moreover, LCK_WRITE cannot be tested through bitmask. This patch fixes these mistakes. For _recover_vg() we do not need lock_flags, it can be only two of above and we always upgrading to LCK_VG_WRITE lock there. (N.B. that code is racy) There is no functional change in code (despite wrong masking it produces correct bits:-)	2010-08-19 23:26:31 +00:00
Milan Broz	727f7bfa49	Detect LUKS signature in pvcreate One shiny day we should use libblkid here. But now using LUKS is very common together with LVM and pvcreate destroys LUKS completely. So for user's convenience, try to detect LUKS signature and allow abort.	2010-08-19 23:08:18 +00:00
Milan Broz	2d5e2b52ca	Change the pvcreate swap/md logic pvcreate detects MD and swap signature. The logic hidden there is not only documented but it is also user unfriendly. Who invented this logic should run pvcreate on its own critical MD device to see why;-) This patch - creates one function instead of duplication code - asks if user want to overwrite signature - allows aborting (!) (Please note that writing LVM signatute without wiping old is wrong, it confuses blkid, MD will not work anyway and swap and LUKS is broken too.)	2010-08-19 23:03:34 +00:00
Alasdair Kergon	22149572e8	Use 'SINGLENODE' instead of 'dead' in clvmd singlenode messages. Ignore snapshots when performing mirror recovery beneath an origin. Pass LCK_ORIGIN_ONLY flag around cluster. Add suspend_lv_origin and resume_lv_origin using LCK_ORIGIN_ONLY.	2010-08-17 19:25:05 +00:00
Alasdair Kergon	2d6fcbf67d	Allow internal suspend and resume of origin without its snapshots.	2010-08-17 16:25:32 +00:00
Jonathan Earl Brassow	d0191bf9f4	Fix for bug 612291: dm devices of split off mirror images are not removed DM devices were not handled properly on nodes in a cluster that were not where the splitmirrors command was issued. This was happening because suspend_lv/resume_lv were being used in a place where activate_lv should have been used. When the suspend/resume are issued on (effectively) new LVs, their 'resource' (UUID) is not located in the lv_hash. Thus, both operations turn into no-ops. You can see this from the output of clvmd from one of the remote nodes: <snip> do_suspend_lv, lock not already held <snip> do_resume_lv, lock not already held 'activate_lv' enjoins the other nodes in the cluster to process the lock and activate the new LV. clvmd output from remote node as follows: do_lock_lv: resource 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S', cmd = 0x19 LCK_LV_ACTIVATE (READ\|LV\|NONBLOCK), flags = 0x84 (DMEVENTD_MONITOR ), memlock = 1 sync_lock: 'zMseY7CBuO3Ty09vXlplPAHzD0Y0CovjrTdv0R1VcwggMwPdYhutHErRcwm5Nd2S' mode:1 flags=1 sync_lock: returning lkid 27b0001 Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Petr Rockai <prockai@redhat.com>	2010-08-16 18:02:14 +00:00
Mike Snitzer	b123a82d73	Change default alignment of pe_start to 1MB. The new standard in the storage industry is to default alignment of data areas to 1MB. fdisk, parted, and mdadm have all been updated to this default. Update LVM to align the PV's data area start (pe_start) to 1MB. This provides a more useful default than the previous default of 64K (which generally ended up being a 192K pe_start once the first metadata area was created). Before this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 188.00k 192.00k After this patch: # pvs -o name,vg_mda_size,pe_start PV VMdaSize 1st PE /dev/sdd 1020.00k 1.00m The heuristic for setting the default alignment for LVM data areas is: - If the default value (1MB) is a multiple of the detected alignment then just use the default. - Otherwise, use the detected value. In practice this means we'll almost always use 1MB -- that is unless: - the alignment was explicitly specified with --dataalignment - or MD's full stripe width, or the {minimum,optimal}_io_size exceeds 1MB - or the specified/detected value is not a power-of-2	2010-08-12 04:11:48 +00:00
Jonathan Earl Brassow	8d2d4f1fa0	Fix for bug 619221 - log device splitting regression An incorrect fix on July 13, 2010 for an annoyance has caused a regression. The offending check-in was part of the 2.02.71 release of LVM. That check-in caused any PVs specified on the command line to be ignored when performing a mirror split. This patch reverses the aforementioned check-in (solving the regressions) and posits a new solution to the list reversal problem. The original problem was that we would always take the lowest mimage LVs from a mirror when performing a split, but what we really want is to take the highest mimage LVs. This patch accomplishes that by working through the list in reverse order - choosing the higher numbered mimages first. (This also reduces the amount of processing necessary.) Signed-off-by: Jonathan Brassow <jbrassow@redhat.com> Reviewed-by: Takahiro Yasui <takahiro.yasui@hds.com>	2010-08-06 15:38:32 +00:00
Jonathan Earl Brassow	cbd41292a4	Taka's fix for handling failure of all mirrored log devices and all but one mirror leg. <patch header> To handle a double failure of a mirrored log, Jon's two patches are commited, however, lvconvert command can't still handle an error when mirror leg and mirrored log got failure at the same time. [Patch]: Handle both devices of a mirrored log failing (bug 607347) posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00009.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00027.html [Patch]: Handle both devices of a mirrored log failing (bug 607347) - additional fix posted: https://www.redhat.com/archives/lvm-devel/2010-July/msg00093.html commit: https://www.redhat.com/archives/lvm-devel/2010-July/msg00101.html In the second patch, the target type of mirrored log is replaced with error target when remove_log is set to 1, but this procedure should be also used in other cases such as the number of mirror leg is 1. This patch relocates the procedure to the main path. In addition, I added following three changes. - Removed tmp_orphan_lvs handling procedure It seems that _delete_lv() can handle detached_log_lv properly without adding mirror legs in mirrored log to tmp_orphan_lvs. Therefore, I removed the procedure. - Removed vg_write()/vg_commit() Metadata is saved by vg_write()/vg_commit() just after detached_log_lv is handled. Therefore, I removed vg_write()/vg_commit(). - With Jon's second patch, we think that we don't have to call remove_mirror_log() in _lv_update_mirrored_log() because will be handled remove_mirror_images() in _lvconvert_mirrors_repaire(). </patch header> Signed-off-by: Takahiro Yasui <takahiro.yasui@hds.com> Reviewed-by: Petr Rockai <prockai@redhat.com> Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>	2010-08-02 21:07:40 +00:00
Jonathan Earl Brassow	efaaf3146d	Disallow mirrored logs in cluster mirrors. The cluster log daemon (cmirrord) is not multi-threaded and can handle only one request at a time. When a log is stacked on top of a mirror (which itself contains a 'core' log), it creates a situation that cannot be solved without threading. When the top level mirror issues a "resume", the log daemon attempts to read from the log device to retrieve the log state. However, the log is a mirror which, before issuing the read, attempts to determine the 'sync' status of the region of the mirror which is to be read. This sync status request cannot be completed by the daemon because it is blocked on a read I/O to the very mirror requesting the sync status.	2010-08-02 19:03:45 +00:00
Dave Wysochanski	936541ec56	Remove irrelevant comments relating to vg_mda_copies.	2010-07-30 16:47:27 +00:00
Jonathan Earl Brassow	405c4a45d8	It's not enough to check for the kernel module in the case of cluster mirrors, we must also check that the log daemon (cmirrord) is running. The log module can be auto-loaded, but the daemon cannot be "auto-started". Failing to check for the daemon produces cryptic messages that customers have a hard time deciphering. (The system messages do report that the log daemon is not running, but people don't seem to find this message easily.) Here are examples of what is printed when the module is available, but the log daemon has not been started. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg Shared cluster mirrors are not available. [root@bp-01 LVM2]# lvcreate -m1 -l1 -n lv vg -v Setting logging type to disk Finding volume group "vg" Archiving volume group "vg" metadata (seqno 3). Creating logical volume lv Executing: /sbin/modprobe dm-log-userspace Cluster mirror log daemon is not running Shared cluster mirrors are not available. Creating volume group backup "/etc/lvm/backup/vg" (seqno 4).	2010-07-21 13:40:21 +00:00
Jonathan Earl Brassow	60f425d1b3	Fix for bug 614164: No check for existing name when splitting mirror The user could use the same name as an existing LV when specifying a name for an LV split off from a mirror. This causes all sorts of issues.	2010-07-13 22:24:39 +00:00
Jonathan Earl Brassow	c42b084793	Fix for bugs: 612248 & 612291 Split mirror issues The main problem with these bugs was that the newly split off LV was not being suspended properly. This meant that the memlock count was not being balanced, the DM devices were not being renamed, and some DM devices which should have been removed were not. I've also renamed some of the variables and added comments to make things clearer as to what is going on. (I can break this patch in two if it means easier review.)	2010-07-13 21:48:16 +00:00
Jonathan Earl Brassow	a93fb6299f	Failed to test for the case where a log was requested to be removed even though there was no log. A simple run through the in-tree test suite would have caught this. :( - if (lv_is_mirrored(detached_log_lv) && + if (detached_log_lv && lv_is_mirrored(detached_log_lv) && Also, made some cosmetic changes suggested by kabi after my last check-in (e.g. s/return 0/return_0/ and adding an error message).	2010-07-09 17:57:51 +00:00
Dave Wysochanski	f77fb62b2a	Add log_error when strdup fails in {vg\|lv}_change_tag(). Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-09 16:57:44 +00:00
Alasdair Kergon	08f1ddea6c	Use __attribute__ consistently throughout.	2010-07-09 15:34:40 +00:00
Alasdair Kergon	80e569104b	Remove superfluous fn prototypes.	2010-07-09 15:21:10 +00:00
Jonathan Earl Brassow	aa5734f2a3	Finish fix for bug 607347: failing both redundant mirror log legs... A previous check-in added logic to handle the case where both images of a mirrored log failed. It solved the problem by simply removing the log entirely - leaving the parent mirror with a 'core' log. This worked for most cases. However, if there was a small delay between the failures of the two mirrored log devices, the mirror would hang, LVM would hang, and no additional LVM commands could be issued. When the first leg of the log fails, it signals the need for repair. Before 'lvconvert --repair' is run by dmeventd, the second leg fails. 'lvconvert' would see both devices as failed and try to remove the log entirely. When it came time to suspend the parent mirror to update the configuration, the suspend would hang because it couldn't get any I/O through the mirrored log, which was plugged waiting for corrective action. The solution is to replace the log with an error target to clear any pending writes before removing it. This allows the parent mirror to suspend and make the proper changes.	2010-07-09 15:08:12 +00:00
Dave Wysochanski	a5fb2bbff3	Pass metadataignore to pv_create, pv_setup, _mda_setup, and add_mda. Pass metadataignore through PV creation / setup paths. As a result of this cleanup, we can remove the unnecessary setting of mda_ignore bits inside pvcreate_single(), after call to pv_create. For now, just set metadataignore to '0' in some places. This is equivalent to the prior functionality, although the 0 is given by the caller not hardcoded in _mda_setup() call. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-08 18:24:29 +00:00
Dave Wysochanski	dce204cec5	Init mda->list in mda_copy. This patch should be no functional change as all callers initialize mda->list.	2010-07-08 17:41:46 +00:00
Dave Wysochanski	7041b476ac	Add warning to vgextend and pvchange if metadataignore given on cmdline. Warn the user then change the value of vg_mda_copies. Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>	2010-07-07 18:59:45 +00:00
Alasdair Kergon	7f7af46862	Adjust auto-metadata repair and caching logic to try to cope with empty mdas. - If a PV contained empty mdas, the auto-recovery code was not kicking in. - The 'inconsistent' state was getting lost when metadata was cached so recovery didn't kick in. But leave the behaviour alone when using precommitted metadata because of a warning in a confusing FIXME. In my testing, pvs and vgs didn't repair inconsistent metadata like they used to do. (How many other tools fail similarly now?) And there should be no need to cache inconsistent metadata because it is supposed to get repaired under the protection of a write lock immediately it is discovered. This code is in need of a redesign based on first principles. I still see bugs in this code and this commit is risky.	2010-07-07 02:53:16 +00:00
Alasdair Kergon	6c8655ce9b	fix code in 2nd mda unignore loop to match 1st loop	2010-07-06 20:09:38 +00:00
Alasdair Kergon	68f4e0c734	s/flags/mda/	2010-07-06 17:29:50 +00:00
Alasdair Kergon	0db1bbc3c3	shorten mesg	2010-07-06 17:27:32 +00:00
Alasdair Kergon	643f234119	fix jumbled args in 'Adjusting' message	2010-07-06 17:26:08 +00:00
Alasdair Kergon	d911ec67a9	Randomly select which mdas to use or ignore. Add some missing standard configure.in checks.	2010-07-05 22:23:15 +00:00

1 2 3 4 5 ...

998 Commits