shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	5034bb8d18	cleanup: use local var to read struct	2016-03-31 12:21:40 +02:00
Zdenek Kabelac	b10253ab4d	cleanup: use TARGET define	2016-03-31 12:21:40 +02:00
Zdenek Kabelac	86db143307	cleanup: debug message fix Reported-by: Ming-Hung Tsai <mingnus gmail com>	2016-03-31 12:21:25 +02:00
Zdenek Kabelac	fc7dacaa4c	thin: display highest mapped sector Use meta% to expose highest mapped sector in thinLV. so showing there 100.00% means thinLV maps latest sector. Currently using a 'trick' with total_numerator to pass-in device size when 'seg==NULL' TODO: Improve device status API per target - current 'percentage' is not really extensible.	2016-03-31 12:20:43 +02:00
Zdenek Kabelac	8bbec41bd4	thin: no thin-pool flush when reading metadata status Previous fix missed the fact that we do query for 'percent' with seg value either set or unset (API overload...) When 'seg' was unset, we still issue flush with status. Fix it by checking segtype by target_type. As we check for segtype - we could also skip whole percentage if the 'segtype' is unknown by code directly. Reported-by: Ming-Hung Tsai <mingnus gmail com>	2016-03-31 12:15:47 +02:00
Peter Rajnoha	109b7e2095	revert: `6129d2e64d` Unfortunately, commit `6129d2e64d` may cause performance issue. There's going to be a better fix...	2016-03-24 14:06:12 +01:00
Peter Rajnoha	6129d2e64d	monitoring: sync /dev content before contacting dmeventd for monitor/unmonitor dmeventd daemon may call further code itself that looks at /dev, e.g. via dmeventd_lvm2_command call. We need to have a consistent view of the /dev content at that time. Therefore, sync /dev content before calling monitoring hook which contacts dmeventd. This problem was quite hidden before, but now it has manifested itself because of recent additions to dev-cache code where we started looking at device holders as seen in sysfs. What happened here was that the device was already in sysfs, but not yet under /dev and this triggered the new error message sometimes: log_error("%s: failed to find associated device structure for holder %s.", devname, devpath); This problem has manifested recently in our api/pytest.sh test from testsuite where we create thin pool LVs and thin LVs and hence it also causes dmeventd to be used as well and these error messages were visible there.	2016-03-24 12:40:19 +01:00
Alasdair G Kergon	1216efdf15	activate: Use macros for target and module names.	2016-03-22 17:46:15 +00:00
Zdenek Kabelac	5c415afd85	cache: check for cache fail during flush Just WARN if the cache can't be flushed because it's failed.	2016-03-10 18:38:53 +01:00
Zdenek Kabelac	e04a0184cb	cleanup: use lv_is_partial Check for PARTIAL_LV flag in standard way.	2016-03-03 10:17:03 +01:00
Zdenek Kabelac	ddfec5b51a	coverity: use same arithmetic for both major and minor Run all arithmetic in the same 'dev_t' type.	2016-02-23 21:40:17 +01:00
Zdenek Kabelac	68955a8102	coverity: ensure non-null pointers are used Here is too complex for Coverity to guess those pointers cannot be NULL, but it's very easy to add little checks here.	2016-02-23 21:40:16 +01:00
Zdenek Kabelac	dbc71dc05e	gcc: cleanup some sign warnings When comparing unsigned with int, the comparison is made as 'unsigned' type, so make it rather explicit which type is being compared.	2016-02-23 12:25:25 +01:00
Zdenek Kabelac	293aabe4cd	cache: enforce header check Currently it's been checked for 'zero' header for thin-pool, but lets use it always for cache as well - since it's relatively 'cheap' detection of read 'error' problems as thin/cache tools currently do not work fast enough in this case.	2016-02-23 12:25:25 +01:00
Zdenek Kabelac	f501f083bf	thin: fix read size compare Fix the compare with 'unsigned' sizeof() and error read -1 result. So the read error is correctly recognized.	2016-02-23 12:22:18 +01:00
Zdenek Kabelac	f31d596c0d	thin: report needs_check and fail state Fix reporting of Fail thin-pool target status as attr[8] letter 'F'. Report 'needs_check' status from thin-pool target via attr field [4] (letter 'c'/'C'), and also via CheckNeeded field. TODO: think about better name here? TODO: lots of prop_not_implemented_set	2016-02-18 16:49:34 +01:00
M.H. Tsai	53058e5234	debug: cut_and_paste type in message Typo in debug message.	2016-02-11 18:38:40 +01:00
Zdenek Kabelac	22810155a6	cleanup: add missing prototype Commit `2304286f68` missed to add function prototype.	2016-01-21 13:29:08 +01:00
Zdenek Kabelac	fcbef05aae	doc: change fsf address Hmm rpmlint suggest fsf is using a different address these days, so lets keep it up-to-date	2016-01-21 12:11:37 +01:00
Alasdair G Kergon	2da7525c83	activation: remote node check doesn't work yet	2016-01-20 02:54:11 +00:00
Alasdair G Kergon	2304286f68	activation: Add lv_is_active_remotely.	2016-01-19 22:01:59 +00:00
Alasdair G Kergon	c812c2dbc7	locking: Add node parameter to query_resource.	2016-01-19 21:42:22 +00:00
Zdenek Kabelac	8b16efd17c	debug: correct stack tracing Here the 'goto' is correct path, as !device_is_usable is traceable with <backtrace>. Keep the 'stack' for unusable device.	2015-12-04 22:10:30 +01:00
Zdenek Kabelac	86e7894ecc	cleanup: use dm_get_status_mirror Use libdm function to parse mirror status report.	2015-12-01 13:03:16 +01:00
Zdenek Kabelac	6336ef98d4	lib: pass mem pool to check_transient_status check_transient_status() may need to allocate some memory, so pass in already existing mem pool.	2015-12-01 13:01:28 +01:00
Zdenek Kabelac	922fccc656	cleanup: using display_lvname Use for showing vgname/lvname in messages. No functional change.	2015-11-26 09:27:37 +01:00
Zdenek Kabelac	b7b59ad932	cleanup: remove unused code Remove long-outstanding unused code lines, which had already been obsoleted by other code. Statuses and snapshot tree creation is already handled differently. Also drop some 'extra' log_error() and use only stack; since error has already been reported.	2015-11-26 09:27:37 +01:00
Zdenek Kabelac	528695ec20	cleanup: avoid allocation for vg_name Since we do not use dev_manager in a way we would have destroyed VG content while in-use - we could safely keep just pointer. So dropping strdup. Also it seems we actually no longer use vg_name for anything so it may possibly go away completely unless it would be useful for debugging...	2015-11-26 09:27:37 +01:00
Zdenek Kabelac	4312b09635	cleanup: change ondisk committed Patch has no functional change.	2015-11-25 11:39:26 +01:00
Zdenek Kabelac	0285066e10	thin: fix previous update of partial tree building We do want to preserve 'active' thin-pool, so add this 'fake' layer only when activating. TODO: think how to use thin-pool without fake LV layer.	2015-11-24 23:24:11 +01:00
Zdenek Kabelac	5e50e5f0b4	thin: skip detach preload from pools lv preload for detached LVs started to be used also for various other types which just happens to pass through weak if() condition. TODO: find here better solution to rather explicitly check for types we really need to preload.	2015-11-23 23:42:59 +01:00
Zdenek Kabelac	6d6c233768	cleanup: move towards using direct LV pointers We do not want to 'expose' internals of VG struct. ATM we use lists to keep all LVs - we may want to switch to better struct for quicker 'search'. Since we do not need 'lists' but always actual LV, switch find_lv_in_vg_by_lvid() to return LV, and replace some use cases of find_lv_in_vg() with 'better' working find_lv() which already returns LV.	2015-11-23 23:42:59 +01:00
Zdenek Kabelac	94c9453659	thin: work with active thin-pool When 'lvextend -L+XX vg/thinpool' do not leave inactive table loaded for 'wrapping' LV on top of resized thin-pool (ATM we use linear LV for this with same size as thin-pool).	2015-11-23 23:41:36 +01:00
Zdenek Kabelac	a45cc0fe14	raid: fix the string compare Coverity noticed this condition is always false and the error path could never be visited. So check for all mismatches of supported messages and actually mark log_error as internal error.	2015-11-10 21:40:28 +01:00
Zdenek Kabelac	164d7e72bf	devmanager: validate target params Coverity: ensure we do not read through NULL pointers for target_type and params.	2015-11-09 10:19:20 +01:00
Zdenek Kabelac	80c3fb786c	thin: fix error path mem leak Coverity: when parsing of thin-pool status would have failed, it could have leaked memory pool and dmt struct.	2015-11-09 10:19:19 +01:00
Zdenek Kabelac	ba41ee1dc9	thin: limit no-flush using only for thin-pool For this release keep usage of 'noflush' only for thin-volume/pool. For rest of keep - keep usage of 'noflush' flag purely for non-resized mirrors.	2015-10-26 23:57:31 +01:00
Zdenek Kabelac	f898cf7539	dev_manager: no flush for extension Recognize the target only 'extends' and do not enforce 'flush' in this case. Only the size reduction still requires flush (so disables usage of no_flush flag). If some other targets do require flush before suspend, they have to explicitly ask for it.	2015-10-25 21:09:31 +01:00
Zdenek Kabelac	844b009584	dev_manager: enabled no_flush for suspend While the activation code tries to evaluate which target really needs flush with suspend and which may go without flush, it has stayed effectively disabled by original commit: `33f732c5e9` since here it only allows to pass non-pvmoving 'mirrors'. So remove check for mirror LV type and only disable no_flush for 'pvmove'.. TODO: Looking into history - it also seemed like raid target would have always required flushing but it's been later removed without clear explanation. If some more targets really do need 'no_flush' it should be handled at their 'level' - since we now stack multiple targets over itself.	2015-10-25 21:07:37 +01:00
Alasdair G Kergon	39a97d86f0	segtypes: Add and use new segtype macros. Includes fixing an inverted raid10 segtype check in _raid_add_target_line.	2015-09-24 14:59:07 +01:00
Alasdair G Kergon	214e2cddf6	segtypes: Use SEG_TYPE_NAME_ string constants.	2015-09-22 19:04:12 +01:00
Zdenek Kabelac	ee8200f1c6	cleanup: use just 2 decimal digits	2015-09-03 23:34:37 +02:00
Zdenek Kabelac	872ea3b987	thin: do not flush when quering for thin percent Since we may easily get blocked when checking for percentage of thin-pool - do not flush and just show current values. This avoids holding VG locked when pool is overfilled.	2015-09-03 23:34:36 +02:00
Zdenek Kabelac	a01eb9c451	thin: detect unusable thins Try to detect thin-pool which may block lvm2 command from further processing (i.e. lvextend). Check if pool is read-only or out-of-space and in this case thins will be skipped from being scanned (so user may miss some PVs located on thin volumes).	2015-09-03 23:34:36 +02:00
Peter Rajnoha	ac3143c093	config: {thin,cache}_{check,repair}_options are never undefined Require global/{thin,cache}_{check,repair}_options to be always defined. If not defined directly by user in the configuration and if there's no concrete default option to use, make "" (empty string) the default one - it's then clearly visible in the "lvmconfig --type default" (and generated lvm.conf) and also it makes its handling in the code more straightforward so we don't need to handle undefined values. This means, if there are no default values for these settings defined, we end up with this generated now: {thin,cache}_{check,repair}_options = [ "" ] So the value is never undefined and if it is, it's an error. (The cache_repair_options is actually not used in the code at the moment, but once the code using this setting is in, it will follow the same logic as used for thin_repair_options.)	2015-07-14 10:13:41 +02:00
Peter Rajnoha	3b6840e099	config: replace find_config_tree_node with find_config_tree_array where appropriate	2015-07-08 13:03:08 +02:00
Zdenek Kabelac	0ac20a8fdb	cache: support clear-needs-check Support newer cache tool which support new option --clear-needs-check-flag. Code does same as for thin_check.	2015-07-07 09:57:27 +02:00
Zdenek Kabelac	a900d150e4	thin: move pool messaging from resume to suspend Existing messaging interface for thin-pool has a few 'weak' points: * Messages were posted with each 'resume' operation, thus not allowing activation of thin-pool with the existing state. * Acceleration skipped suspend step has not worked in cluster, since clvmd resumes only nodes which are suspended (have proper lock state). * Resume may fail and code is not really designed to 'fail' in this phase (generic rule here is resume DOES NOT fail unless something serious is wrong and lvm2 tool usually doesn't handle recovery path in this case.) * Full thin-pool suspend happened when taking a thin-volume snapshot. With this patch the new method relocates message passing into suspend state. This has a few drawbacks with current API, but overall it performs better and gives more possibilities to deal with errors. Patch introduces a new logic for 'origin-only' suspend of thin-pool and this also relates to thin-volume when taking snapshot. When suspend_origin_only operation is invoked on a pool with queued messages then only those messages are posted to thin-pool and actual suspend of thin pool and data and metadata volume is skipped. This makes taking a snapshot of thin-volume lighter operation and avoids blocking of other unrelated active thin volumes. Also fail now happens in 'suspend' state where the 'Fail' is more expected and it is better handled through error paths. Activation of thin-pool is now not sending any message and leaves it up to a tool to decide later how to finish unfinished double-commit transaction. Problem which needs some API improvements relates to the lvm2 tree construction. For the suspend tree we do not add target table line into the tree, but only a device is inserted into a tree. Current mechanism to attach messages for thin-pool requires the libdm to know about thin-pool target, so lvm2 currently assumes the node is really a thin-pool and fills in the table line for this node (which should be ensured by the PRELOAD phase, but it's a misuse of internal API) we would possibly need to be able to attach message to 'any' node. Other thing to notice - current messaging interface in thin-pool target requires to suspend thin volume origin first and then send a create message, but this could not have any 'nice' solution on lvm2 side and IMHO we should introduce something like 'create_after_resume' message. Patch also changes the moment, where lvm2 transaction id is increased. Now it happens only after successful finish of kernel transaction id change. This change was needed to handle properly activation of pool, which is in the middle of unfinished transaction, and also this corrects usage of thin-pool by external apps like Docker.	2015-07-03 16:13:14 +02:00
Alasdair G Kergon	4c629a5257	locking: Add missing error handling. Add missing error logging and detection to unlock_vg and callers of sync_local_dev_names etc.	2015-06-30 18:54:38 +01:00
Peter Rajnoha	0a203070f5	cleanup: missing target_type check in device_is_usable filter	2015-06-17 14:27:48 +02:00
Peter Rajnoha	5577f2f4f0	cleanup: || instead of | More efficient with same result here.	2015-06-17 14:12:58 +02:00
Peter Rajnoha	1e6a926e85	filter: filter-usable: consider snapshot and origin LV as unusable if its component is suspended Note that this is just a quick fix and it needs more robust fix to encompass any combination, not just the (old) snapshot one! This started with this report: https://bugzilla.redhat.com/show_bug.cgi?id=1219222 If we have devices/ignore_suspended_devices=1 set based on which we filter out suspended devices as unusable (or if we ignore suspended devices by force, e.g. during lvconvert called from dmeventd) and when we have snapshot and snapshot origin devices in the play, we need to look at their components underneath (-real and -cow) to check if they're not suspended. If they are, the snapshot/snapshot origin is not usable as well and hence it needs to be filtered out by filter-usable.c code which does suspended device filtering. Not going into much detail here, more details are in the bugzilla mentioned above. However, this is a quick fix since snapshot and this exact situation is not the only one. So this is something that needs to be revisited and fixed properly with full dm tree and checking the whole stack to state whether the device at the very top is usable or not.	2015-06-17 13:37:53 +02:00
Zdenek Kabelac	e7eb5b0696	debug: better tracing messages Enhance traced output.	2015-06-15 14:48:06 +02:00
David Teigland	95da21cc18	config: fix check_options array The code used it as both a single string, and as an array of strings in different places. Fix it so that it's an array of strings everywhere.	2015-04-23 10:35:34 -05:00
Zdenek Kabelac	0b99d648ef	cleanup: typo in comment	2015-04-13 16:38:30 +02:00
Zdenek Kabelac	40102ae014	thin: fix upgrade regression Older lvm2 tools were always providing linear mapping for thin pool. Recent lvm2 version however supports external usage of thin pool and empty/unused pools are loaded without such external linear mapping. So this patch covers 'upgrade' problem, where older tool has activated thin-pool with 'linear' layer mapping, and newer tools didn't expect such mapping to exist and were not able to deactivate such table. So before checking for new layout in dm-table, check if there is not an old one already there.	2015-01-30 16:22:11 +01:00
Zdenek Kabelac	578b236a19	revert "cache: add pool deps for preload" This reverts commit `c3bb6d77dd`. Since we now have for_each_sub_lv() scanning all sub LVs, this commit could be safely reverted.	2015-01-30 12:33:52 +01:00
Zdenek Kabelac	bfeabea631	raid: preload splitted LV only when active Check splitted leg is active before preload. (Since splitmirrors currently only does work active raid volumes it's not a change for current code flow). Minor optimization included - when already positively checked for raid image don't check again for raid metadata.	2015-01-28 18:30:08 +01:00
Zdenek Kabelac	c3bb6d77dd	cache: add pool deps for preload for_each_sub_lv() normally does not put pool_lv into deps. So for now go around it in 'lv_preload()' and add explicit call with pool. TODO: think about a better way, we want pool_lv deps only in certain moments, so maybe for_each_sub_lv() needs new arg for this.	2015-01-28 16:29:35 +01:00
Zdenek Kabelac	d2d3f0d747	cleanup: use macro lv_is_visible()	2015-01-28 13:45:27 +01:00
Zdenek Kabelac	b254d330e4	raid: fix tree preload for splitting raid images When raid is being splitted, extracted leg & metadata is still floating in the table - and thus we need to detect this case and properly preload their matching table so consequent activation of extracted LVs properly renames (and FREES) existing raid images, so ongoing image name shifting will work.	2015-01-28 13:44:06 +01:00
Zdenek Kabelac	3b78d5237d	cleanup: indent	2015-01-20 15:02:19 +01:00
Zdenek Kabelac	ae8b9baa04	report: update report_object API Internal API change - pass single struct for both info & seg_status.	2015-01-20 14:58:43 +01:00
Zdenek Kabelac	b3a348c03c	report: use same info also for lv_attr Recently the single 'status' code has been used for number of cache features. Extend the API a little bit to allow usage also for lv_attr_dup. As the function itself is used in lvm2api - add a new function: lv_attr_dup_with_info_and_seg_status() that is able to use grabbed info & status information. report_init() is now using directly passed lvdm struct pointer which holds the information whether lv_info() was correctly obtained or there was some error when trying to read it. Move 'health' attribute to status. TODO convert raid function to use the already known status.	2015-01-20 14:58:41 +01:00
Zdenek Kabelac	e34b004422	report: reporting unknown status Add SEG_STATUS_UNKNOWN when status cannot be parsed. Also add 'info_ok' variable when info was correctly obtained.	2015-01-20 14:53:07 +01:00
Zdenek Kabelac	1e050a77ff	cleanup: missed for build without devmapper configure --disable-devmapper build fixes.	2015-01-14 14:50:08 +01:00
Zdenek Kabelac	0869631d7d	lv_status: enable lv_status for thinpool Support also status for thin pools.	2015-01-14 14:50:08 +01:00
Zdenek Kabelac	0b7ccf835b	lv_status: track layered device For info of i.e. thin-pool we need layered device. Needs some more thinking about proper interface here. For now it's usable for cache and thin-pool.	2015-01-14 14:50:08 +01:00
Zdenek Kabelac	d0f26440ee	cleanup: properly align code lines Misaligned indentation in branches.	2015-01-14 14:50:08 +01:00
Zdenek Kabelac	d202f43fff	cleanup: update API for segment reporting API for seg reporting is breaking internal lvm coding - it cannot use vgmem mem pool for allocation of reported value. So use separate pool instead of 'vgmem' for non vg related allocations Add consts for many function params - but still many other are left for now as non-const - needs deeper level of change even on libdm side.	2015-01-14 14:50:08 +01:00
Peter Rajnoha	c0e17bca90	dev_manager: do not mark snapshot origins as unusable devices just because of possible blocked mirror underneath At first, all snapshot-origins where marked as unusable unconditionally here, but we can't cut off whole snapshot-origin use in a stack just because of this possible mirror state. This whole "device_is_usable" check was even incorrectly part of persistent filter before commit a843d0d97c66aae1872c05b0f6cf4bda176aae2 (where filter cleanup was done). The persistent filter is used only if obtain_device_list_from_udev=0, which means that the former check for snapshot-origin here had not even been hit with default configuration for a few years before commit a843d0d97c66aae1872c05b0f6cf4bda176aae2 (the check for snapshot-origin and skipping of this LV was introduced with commit `a71d6051ed` back in 2010). The obtain_device_list_from_udev=1 (and hence not using persistent filter and hence not hitting this check for snapshot-origins and skipping) has been in action since commit `edcda01a1e` (that is 2011). So for 3 years this condition was not even checked with default configuration, making it superfluous. This all changed in 2014 with commit `8a843d0d97` where "filter-usable" is introduced and since then all snapshot-origins have been marked as unusable more often than before and making snapshot-origins practically unusable in a stack. This patch removes this incorrect check from commit `a71d6051ed` which caused snapshot-origins to be unusable more often recently. If we want to fix this eventually in a correct way, we need to look down the stack and if snapshot-origin is hit and there's a blocked mirror underneath, only then mark the device as unusable. But mirrors in stack are not supported anymore so it's questionable whether it's worth spending more time on this at all...	2015-01-09 11:24:16 +01:00
Peter Rajnoha	cba6186325	cmirror: check for cmirror availability during cluster mirror creation and activation When creating/activating clustered mirrors, we should have cmirrord available and running. If it's not, we ended up with rather cryptic errors like: $ lvcreate -l1 -m1 --type mirror vg Error locking on node 1: device-mapper: reload ioctl on failed: Invalid argument Failed to activate new LV. $ vgchange -ay vg Error locking on node node 1: device-mapper: reload ioctl on failed: Invalid argument This patch adds check for cmirror availability and it errors out properly, also giving a more precise error message so users are able to identify the source of the problem easily: $ lvcreate -l1 -m1 --type mirror vg Shared cluster mirrors are not available. $ vgchange -ay vg Error locking on node 1: Shared cluster mirrors are not available. Exclusively activated cluster mirror LVs are OK even without cmirrord: $ vgchange -aey vg 1 logical volume(s) in volume group "vg" now active	2015-01-05 16:54:07 +01:00
Peter Rajnoha	c8890e3ac1	coverity: remove dead code in lv_info_with_seg_status (continued)	2014-11-26 11:58:25 +01:00
Peter Rajnoha	86ae68a5f7	coverity: remove dead code in lv_info_with_seg_status Just call return 0 directly on error path, without using "goto" - the code is short, no need to use it this way (the dead code appeared as part of further changes in this function).	2014-11-26 11:30:01 +01:00
Zdenek Kabelac	4dc602f79b	dev_manager: fix mknodes Fix regression introduced with `a2c1024f6a` _setup_task(mknodes ? name : NULL... has been replaced with: _setup_task(type != MKNODES ? name : NULL.... Use '=='	2014-11-22 09:57:31 +01:00
Zdenek Kabelac	428b9fcd87	cleanup: validate pointers Mostly on almost impossible to happen paths - but stay safe.	2014-11-13 17:49:42 +01:00
Peter Rajnoha	83308fdff9	cleanup: cleanup internal interface to acquire segment status - Add separate lv_status fn (if we're interested only in seg status, but not lv info at the same time as it is with existing lv_info_with_seg_status fn). So we 3 fns: - lv_info (existing one, runs only info ioctl, fills in struct lvinfo only) - lv_status (new one, runs status ioctl, fills in struct lv_seg_status only) - lv_info_with_seg_status (existing one, runs status ioctl, fills in struct lvinfo as well as lv_seg_status) - Add more comments in the code explaining the difference between lv_info, lv_status and lv_info_with_seg_status and their return values. - Move decision whether lv_info_with_seg_status needs to call only status ioctl (in case the segment for which we require status is from the LV for which we require info) or separate status and info ioctl (in case the segment for which we require status is from different LV that the one for which we require info) into lv_info_with_seg_status fn so caller doesn't need to bother about this at all. - Cleanup internal interface for this seg status so it's more readable.	2014-11-13 14:28:51 +01:00
Zdenek Kabelac	c3e2990359	cleanu: drop duplicate const	2014-11-13 13:15:58 +01:00
Zdenek Kabelac	fba86dd42b	cache: improve pending_delete We need to stop guessing deleted names - so rather collect deleted UUID into a string list - and then remove them properly in _clean_tree. Restore original _clean_tree behaviour for currently unconverted removal of snapshots. Pending delete feature now properly tracks whole subtree of cache (so i.e. data or metadata as raid volumes). It properly replaces all related volumes with 'errors' in suspend preload, then resume them as error and remove collected UUIDs from root - since they are no longer part of any volume deps.	2014-11-13 11:54:41 +01:00
Peter Rajnoha	359dc6fa76	coverity: commit `ba2302346` - report log_sys_error properly log_sys_error uses errno, hence we need to report the first failure before reporting another failure that uses errno as well.	2014-11-12 15:16:54 +01:00
Peter Rajnoha	ce8730b508	coverity: fix possible integer overflow LVM2.2.02.112/lib/metadata/cache_manip.c:73: overflow_before_widen: Potentially overflowing expression "pool_metadata_extents * vg->extent_size" with type "unsigned int" (32 bits, unsigned) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type "uint64_t" (64 bits, unsigned). LVM2.2.02.112/lib/activate/dev_manager.c:217: overflow_before_widen: Potentially overflowing expression "seg_status->seg->len * extent_size" with type "unsigned int" (32 bits, unsigned) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type "uint64_t" (64 bits, unsigned). LVM2.2.02.112/lib/activate/dev_manager.c:217: overflow_before_widen: Potentially overflowing expression "seg_status->seg->le * extent_size" with type "unsigned int" (32 bits, unsigned) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type "uint64_t" (64 bits, unsigned).	2014-11-12 10:03:27 +01:00
Peter Rajnoha	60cc666c94	coverity: fix compiler warning LVM2.2.02.112/lib/activate/dev_manager.c:196:5: warning: 'dmtask' may be used uninitialized in this function [-Wmaybe-uninitialized] In _info_run fn: switch (type) { case INFO: ... case STATUS: ... case MKNODES: ... } The "type" is enum and currently only those three types are supported, but if we added a new type in the future, this would end up with a bug (if we forgot to add the new "case" in that "switch"). So let's make sure proper internal error is printed: default: log_error(INTERNAL_ERROR "_info_run: unhandled info type"); return 0;	2014-11-12 09:55:12 +01:00
Zdenek Kabelac	57c618b0ed	cache: fix clean_tree Fix `8121074fda` - the patch incorrectly removed also other top-level nodes. It needs to deactivate purely subnodes of _corig.	2014-11-12 09:40:27 +01:00
Peter Rajnoha	ba23023464	coverity: fix resource leaks LVM2.2.02.112/tools/toollib.c:1991: leaked_storage: Variable "iter" going out of scope leaks the storage it points to. LVM2.2.02.112/lib/filters/filter-usable.c:89: leaked_storage: Variable "f" going out of scope leaks the storage it points to. LVM2.2.02.112/lib/activate/dev_manager.c:1874: leaked_handle: Handle variable "fd" going out of scope leaks the handle.	2014-11-12 09:19:14 +01:00
Peter Rajnoha	9704515c1e	dev_manager: only support status for cache segment at the moment When getting status for LV segment types, we need to be sure that proper segment is selected for the status ioctl. When reporting fields that require status ioctl, the "_choose_lv_segment_for_status_report" fn in tools/reporter.c must be completed properly to choose the proper segment for all the LV types (at the moment, it just takes the first LV segment by default). This works fine with cache LVs surely. The other segment types need more auditing. We use this status ioctl only for cache status fields at the moment only, so restrict it to the cache only. Once the _choose_lv_segment_for_status_report is completed properly, release the restriction in _get_segment_status_from_target_params.	2014-11-11 15:02:21 +01:00
Zdenek Kabelac	8121074fda	cache: pending_delete fixes	2014-11-11 13:32:41 +01:00
Zdenek Kabelac	9a6e3683a2	cache: never create new table entry for deleted cache	2014-11-11 13:32:41 +01:00
Zdenek Kabelac	42a3305ec7	cache: no status for pending deleted cache	2014-11-11 13:32:41 +01:00
Peter Rajnoha	a2c1024f6a	dev_manager: enhance dev_manager_info to acquire LV segment status if requested, add lv_info_with_seg_status fn	2014-11-11 13:04:02 +01:00
Peter Rajnoha	d7e5f03888	refactor: rename struct lv_with_info used in reporting code to lv_with_info_and_seg_status The former struct lv_with_info is renamed to lv_with_info_and_seg_status as it can hold more than just "info", there's lv's segment status now in addition: struct lv_with_info_and_seg_status { struct logical_volume *lv; struct lvinfo *info; struct lv_seg_status *seg_status; } Where struct lv_seg_status is: struct lv_seg_status { struct dm_pool *mem; struct lv_segment *lv_seg; lv_seg_status_type_t type; void *status; /* struct dm_status_* */ } Where lv_seg points to lv's segment that is being reported or processed in general. New struct lv_seg_status keeps the information about segment status - the status retrieved via DM_DEVICE_STATUS ioctl. This information will be used for reporting dm device target status for the LV segment specified. So this patch introduces third level of LV information that is kept for reuse while reporting fields within one reporting line, causing only one DM_DEVICE_STATUS ioctl call per LV segment line reported (otherwise we'd need to call the DM_DEVICE_STATUS for each segment status field in one LV segment/reporting line which is not efficient). This is following exactly the same principle as already introduced by commit `ecb2be5d16`. So currently we have three levels of information that can be used to report an LV/LV segment: - LV metadata itself (struct logical_volume lv) - LV's DM_DEVICE_INFO ioctl result (struct lvinfo info) - LV's segment DM_DEVICE_STATUS ioctl result (this status must be bound to a segment, not the whole LV as the whole LV may be composed of several segments of course) (this is the new struct lv_seg_status seg_status)	2014-11-11 08:53:28 +01:00
Zdenek Kabelac	ca509c9746	dev_manager: workaround to allow top-level _tmeta, _tdata	2014-11-11 00:53:37 +01:00
Zdenek Kabelac	02f49caa35	debug: log tree type is created Print tree type and use internal_error for unknown type.	2014-11-10 22:05:49 +01:00
Zdenek Kabelac	e5d3f81285	cleanup: indents comments backtraces	2014-11-10 22:05:49 +01:00
Zdenek Kabelac	f5e265a07f	cache: use LV_PENDING_DELETE	2014-11-10 22:05:49 +01:00
Zdenek Kabelac	0dc73f7dbd	dmeventd: time scaling for status retry In the normal case, waiting 1 second by default is too slow. So start with a short sleep and increase it between status retests.	2014-11-10 22:05:48 +01:00
Zdenek Kabelac	6e5790f2d2	activate: check all snap segs are inactive When deactivating an origin, we may have left the table in a broken state, where the origin is not active but the snapshot volume is still present. Let's ensure that deactivation of the origin also detects whether all associated snapshots are inactive - otherwise do not skip deactivation. (so e.g. 'vgchange -an' would detect errors)	2014-11-05 15:30:58 +01:00
Zdenek Kabelac	00a45ca491	thin: new pool is activated without overlay Activation of a new/unused/empty thin pool volume skips the 'overlay' part and directly provides a 'visible' thin-pool LV to the user. Such a thin pool still gets the 'private' -tpool UUID suffix for easier udev detection of protected lvm2 devices, and also gets udev flags to avoid any scan. Such a pool device is a 'public' LV with a regular /dev/vgname/poolname link, but it's still a 'udev' hidden device for any other use. To display the proper active state we need to do a few explicit tests for this condition. Before it's used for any lvm2 thin volume, deactivation is now needed to avoid any 'race' with external usage.	2014-11-04 15:29:22 +01:00
Zdenek Kabelac	ee627884de	thin: no validation skip of new thin pools Allowing 'external' use of thin-pools requires to validate even so far 'unused' new thin pools. Later we may have 'smarter' way to resolve which thin-pools are owned by lvm2 and which are external.	2014-11-04 15:28:00 +01:00
Zdenek Kabelac	1b439a0b8e	cleanup: rename function Make more clear dm_info type.	2014-11-03 14:19:34 +01:00
Zdenek Kabelac	d6c5445bea	cleanup: correcting tracing Use log_error for real error.	2014-11-03 14:19:34 +01:00
Zdenek Kabelac	29bd3cccc8	cache: support activation of empty cache-pool When the cache pool is unused, lvm2 code will internally allow activation of such a cache-pool. The cache-pool is activated as a metadata LV, so lvm2 can easily wipe such a volume before the cache-pool is reused.	2014-11-03 14:19:33 +01:00
Zdenek Kabelac	ab49120465	cache: lv_cache_status Replace lv_cache_block_info() and lv_cache_policy_info() with lv_cache_status() which directly returns dm_status_cache structure together with some calculated values. After use mem pool stored inside lv_status_cache structure needs to be destroyed.	2014-11-03 14:19:33 +01:00
Zdenek Kabelac	13e6369d7f	cleanup: add arg to _setup_task Add init of no_open_count into _setup_task(). Report problem as warning (cannot happen anyway). Also drop some duplicated debug messages - we have already printed the info about operation so make log a bit shorter.	2014-11-03 14:19:33 +01:00
Zdenek Kabelac	7f35d42a99	thin: reporting of thin volumes simplified Simplify reporting of percentage. Allows easier support for more types. Move testing of device availability into activate.c	2014-11-03 14:19:32 +01:00
Peter Rajnoha	00d8ab8492	refactor: make it possible to select what to check exactly when calling device_is_usable fn Currently, there are 5 things that device_is_usable function checks (for DM devices only, of course): - is device empty? - is device blocked? (mirror) - is device suspended? - is device composed of an error target? - is device name/uuid reserved? If answer to any of these questions is "yes", then the device is not usable. This patch just adds the possibility to choose what to check for exactly - the device_is_usable function now accepts struct dev_usable_check_params to make this selection possible. This is going to be used by subsequent patches.	2014-09-30 13:11:58 +02:00
Zdenek Kabelac	bc5031c283	debug: add debug message Since we leave error printing to the caller of deactivation, at least put a message in the debug log for this case.	2014-09-24 10:54:48 +02:00
Zdenek Kabelac	7531a9169e	debug: monitor_dev_for_events stack trace	2014-09-24 10:54:48 +02:00
Zdenek Kabelac	03aeb86762	cleanup: reindent Save some code lines.	2014-09-24 10:54:48 +02:00
Zdenek Kabelac	ec4ffeb51c	cleanup: use supplied cmd pointer	2014-09-24 10:54:48 +02:00
Zdenek Kabelac	914be0696d	cleanup: replace error with print message These are not error messages. They inform the user about misconfigured options which do not change the resulting error status.	2014-09-24 10:54:47 +02:00
Zdenek Kabelac	bc0a3e2355	cleanup: simpler _lv_passes_volumes_filter Don't recreate string and just check components directly.	2014-09-24 10:54:47 +02:00
Zdenek Kabelac	a8aee7dba2	activate: update lv_check_not_in_use: API Use lv_info() internally in lv_check_not_in_use(), so it can always use with_open_count properly. Skip sysfs() testing in open_count == 0 case. Accept just 'lv' pointer like other functions. The function has a 'built-in' lv_is_active_locally check, which however is not what we need to check in many places. For now at least remotely active snapshot merge is detected and for this case merge on next activation is scheduled.	2014-09-24 10:54:47 +02:00
Zdenek Kabelac	84cdf85bd2	cleanup: constify activation usage of lv pointer Let's enforce checking of write access to LV by the compiler. The activation part never needs to write anything to the LV, so keep the LV pointer const.	2014-09-24 10:54:47 +02:00
Zdenek Kabelac	0f2adcc9ef	activate: lv_check_not_in_use no check of closed Don't perform expensive sysfs tests when the device is closed. (having open_count == 0).	2014-09-24 10:47:00 +02:00
Alasdair G Kergon	979be63f25	mirrors: Fix checks for mirror/raid/pvmove LVs. Try to enforce consistent macro usage along these lines: lv_is_mirror - mirror that uses the original dm-raid1 implementation (segment type "mirror") lv_is_mirror_type - also includes internal mirror image and log LVs lv_is_raid - raid volume that uses the new dm-raid implementation (segment type "raid") lv_is_raid_type - also includes internal raid image / log / metadata LVs lv_is_mirrored - LV is mirrored using either kernel implementation (excludes non-mirror modes like raid5 etc.) lv_is_pvmove - internal pvmove volume	2014-09-16 00:13:46 +01:00
Alasdair G Kergon	2360ce3551	cleanup: Use lv_is_ macros. Use lv_is_* macros throughout the code base, introducing lv_is_pvmove, lv_is_locked, lv_is_converting and lv_is_merging. lv_is_mirror_type no longer includes pvmove.	2014-09-15 21:33:53 +01:00
Zdenek Kabelac	25fe716b12	cleanup: indent and stacktrace Add missing stacktrace on error path and newline indent.	2014-08-26 14:13:07 +02:00
Zdenek Kabelac	24df01f735	cleanup: avoid double assign Skip setting a value to a variable which is never used and overwritten/set afterwards.	2014-08-19 14:33:06 +02:00
Alasdair G Kergon	7cff640d9a	activation: Fix upgrades using uuid suffixes. 2.02.106 added suffixes to some LV uuids in the kernel. If any of these LVs is activated with 2.02.105 or earlier, and then a later version is used, the LVs appear invisible and activation commands fail. The code now has to check the kernel for both old and new uuids.	2014-07-30 21:55:11 +01:00
Alasdair G Kergon	52217f6ebd	raid: Fix partial activation logic for non-raid.	2014-07-23 16:13:12 +01:00
Alasdair G Kergon	99e3c13012	raid: Moved degraded activation code to raid_manip. Adjust some messages & fn names.	2014-07-22 20:50:29 +01:00
Alasdair G Kergon	513fd029a6	config: Adjust description of activation_mode.	2014-07-21 15:50:47 +01:00
Zdenek Kabelac	c0c1ada88e	pool: callback handle cache Extend the callback functionality to handle also cache pools. cache_check is now executed on cachepool metadata when it's activated and deactivated.	2014-07-11 12:57:45 +02:00
Jonathan Brassow	be75076dfc	activation: Add "degraded" activation mode Currently, we have two modes of activation, an unnamed nominal mode (which I will refer to as "complete") and "partial" mode. The "complete" mode requires that a volume group be 'complete' - that is, no missing PVs. If there are any missing PVs, no affected LVs are allowed to activate - even RAID LVs which might be able to tolerate a failure. The "partial" mode allows anything to be activated (or at least attempted). If a non-redundant LV is missing a portion of its addressable space due to a device failure, it will be replaced with an error target. RAID LVs will either activate or fail to activate depending on how badly their redundancy is compromised. This patch adds a third option, "degraded" mode. This mode can be selected via the '--activationmode {complete|degraded|partial}' option to lvchange/vgchange. It can also be set in lvm.conf. The "degraded" activation mode allows RAID LVs with a sufficient level of redundancy to activate (e.g. a RAID5 LV with one device failure, a RAID6 with two device failures, or RAID1 with n-1 failures). RAID LVs with too many device failures are not allowed to activate - nor are any non-redundant LVs that may have been affected. This patch also makes the "degraded" mode the default activation mode. The degraded activation mode does not yet work in a cluster. A new cluster lock flag (LCK_DEGRADED_MODE) will need to be created to make that work. Currently, there is limited space for this extra flag and I am looking for possible solutions. One possible solution is to usurp LCK_CONVERT, as it is not used. When the locking_type is 3, the degraded mode flag simply gets dropped and the old ("complete") behavior is exhibited.	2014-07-09 22:56:11 -05:00
Peter Rajnoha	cfed0d09e8	report: select: refactor: move percent handling code to libdm for reuse	2014-06-17 16:27:21 +02:00
Zdenek Kabelac	2f260c9909	activation: retry cleanup deactivation Enable 'retry' deactivation also in the 'cleanup' phase. It shouldn't be needed in most cases - however udev now produces more and more completely non-synchronizable device opens, so even for orphan devices we can't easily predict where udevd opens devices. So it's preferable here to log an error about the device being open and retry the cleanup, but let the command proceed.	2014-06-10 10:51:24 +02:00
Zdenek Kabelac	2adaef8272	revert: restore original timeout It was committed accidentally - but it has also shown that on heavily loaded systems (like our test machine could be) slightly bigger timeouts, which wait longer for udev rules processing, do help and avoid occasional refusal of deactivation because the device is still open. (i.e. lvcreate...; lvchange -an...) Unsure how we could now synchronize for this. On a very slow(/loaded) system a 5 second timeout is simply not enough. TODO: introduce at least an lvm.conf configurable setting to allow longer 'retry' loops.	2014-05-28 15:33:41 +02:00
Zdenek Kabelac	ae43d1afa2	activate: cleanup lv_check_not_in_use Reindent lv_check_not_in_use to simplify internal loop code. Also return always '0/1' (drop -1) - since we only check for failure (0) - and we don't really know why lv_info() has failed.	2014-05-27 17:08:49 +02:00
Zdenek Kabelac	cb7bba9ffe	dev_manager: disable extra udev loop Disable code which has postprocessed whole tree and reset udev flags. We need to find out which case was troublesome - since this loop was just hiding a bug in other code parts (most probably preload tree)	2014-05-23 21:36:55 +02:00
Zdenek Kabelac	675fcfe9b7	devmapper: fix compilation without devmapper Fix compilation when configured with --disable-devmapper option.	2014-04-30 10:26:29 +02:00
Alasdair G Kergon	b5f8f452ac	tools: Add --readonly support. Offer lock-free access to display virtual machine or clustered VG metadata while it might be in use.	2014-04-18 02:46:34 +01:00
Zdenek Kabelac	9eab84aa2b	debug: catch invalid request for tree In general for non-toplevel LVs we shouldn't allow any _tree_action. For now error on request for cache_pool activation which doesn't even exist in dm-table.	2014-04-08 11:00:15 +02:00
Zdenek Kabelac	96cf9dc017	raid: use internal variables for array alloc Don't use passed pointer when allocating policies' array. (In case policy_argc would be NULL, this would have caused NULL dereference).	2014-04-08 11:00:13 +02:00
Zdenek Kabelac	e2ea3cd7ba	cleanup: cache use const char policy Policy should be const char pointer.	2014-04-01 20:54:09 +02:00
Zdenek Kabelac	a018c57f0b	cache: never activate cache pool Since cache-pool is purely lvm abstraction layer LV, it never need any device node, so do not add even 'error' device for it.	2014-04-01 20:17:10 +02:00
Zdenek Kabelac	356fdda46d	lv_manip: drop cmd pointer from for_each_sub_lv Drop unused passed cmd pointer from function. TODO: We have two similar functions (though not identical) lv_manip.c: for_each_sub_lv() metadata.c: _lv_each_dependency() They seem to not always match - we should probably convert to use only a single function.	2014-03-27 13:10:13 +01:00
Zdenek Kabelac	4a6f05e420	cleanup: use trigraph	2014-03-25 11:22:58 +01:00
Zdenek Kabelac	0ca16c6946	activate: report release with critical section This function is typically called for cmd context refresh or destroy. In the non-clustered case we have already unlocked all messages, however when e.g. 'clvmd' gets a break signal it may still have a couple of messages queued. For now just report an error.	2014-03-21 22:29:22 +01:00
Zdenek Kabelac	c0f1eb5f0f	dev_manager: check prohibited devices earlier Reorder detection for internal device - since this test is much simpler than target analysis, check it sooner. Replace test for '68' with sizeof & ID_LEN Add FIXME about device alias problem with is_reserved_lvname, since this test fails on devices like /dev/dm-X so we need to convert tests to UUID.	2014-03-12 19:38:34 +01:00
Zdenek Kabelac	4cc5c689b8	thin: add pool uuid suffix for pool volume Even though we make pool volume as a public visible LV, we still do not want tools to look at this volume. While we do not create /dev/vg/lv link, device is still accessible via /dev/mapper/vg-lv and there is no easy way to recognize it's private without lvm2 metadata. Enhance UUID with -pool suffix and directly skip any LV with a suffix in device_is_usable() call. TODO: enhance other targets with this logic. blkid may probably use same simple logic.	2014-03-12 00:16:27 +01:00
Zdenek Kabelac	6a0d97a65c	lvm: change build_dm_uuid API Pass directly 'lv' into this build routine, so we can eventually add more private UUID suffixes.	2014-03-12 00:16:20 +01:00
Zdenek Kabelac	4d64e91efd	thin: do not check of empty pool with messages The empty pool is also the pool which has yet queued list of messages and transaction_id == 1. Problem is exposed when pool is created inactive. lvcreate -L10 -T vg/pool -an lvcreate -V10 -T vg/pool	2014-03-12 00:15:22 +01:00
Zdenek Kabelac	07ba047116	cleanup: relocate segment flags Move flags for segments to segtype header where it seems more closely related as the features are related to segtype and not activation. Use unsigned #define - since it's more common in lvm2 source code for bit flags.	2014-02-27 14:46:11 +01:00
Zdenek Kabelac	40e6176d25	snapshots: fix incorrect calculation of cow size Code uses target driver version for better estimation of max size of COW device for snapshot. The bug can be tested with this script: VG=vg1 lvremove -f $VG/origin set -e lvcreate -L 2143289344b -n origin $VG lvcreate -n snap -c 8k -L 2304M -s $VG/origin dd if=/dev/zero of=/dev/$VG/snap bs=1M count=2044 oflag=direct The bug happens when these two conditions are met * origin size is divisible by (chunk_size/16) - so that the last metadata area is filled completely * the miscalculated snapshot metadata size is divisible by extent size - so that there is no padding to extent boundary which would otherwise save us Signed-off-by:Mikulas Patocka <mpatocka@redhat.com>	2014-02-26 14:25:09 +01:00
Zdenek Kabelac	95fe823eba	raid: use feature attributes for raid10 Test raid10 availability as a target feature (instead of doing it in all the places where raid10 should be checked). TODO: activation needs runtime validation - so metadata with raid10 are skipped from activation in user-friendly way in lvm2.	2014-02-24 21:10:13 +01:00
Zdenek Kabelac	9974136b90	cleanup: indent	2014-02-17 22:25:53 +01:00
Zdenek Kabelac	f0f4248333	activation: drop test r/w vg state for activing LV VG status read/write is meant to influence only VG metadata. It's not related to the read/write status of the LV itself.	2014-02-15 11:34:54 +01:00
Jonathan Brassow	96626f64fa	cache: Code to allow the create/remove of cache LVs This patch allows users to create cache LVs with 'lvcreate'. An origin or a cache pool LV must be created first. Then, while supplying the origin or cache pool to the lvcreate command, the cache can be created. Ex1: Here the cache pool is created first, followed by the origin which will be cached. ~> lvcreate --type cache_pool -L 500M -n cachepool vg /dev/small_n_fast ~> lvcreate --type cache -L 1G -n lv vg/cachepool /dev/large_n_slow Ex2: Here the origin is created first, followed by the cache pool - allowing a cache LV to be created covering the origin. ~> lvcreate -L 1G -n lv vg /dev/large_n_slow ~> lvcreate --type cache -L 500M -n cachepool vg/lv /dev/small_n_fast The code determines which type of LV was supplied (cache pool or origin) by checking its type. It ensures the right argument was given by ensuring that the origin is larger than the cache pool. If the user wants to remove just the cache for an LV. They specify the LV's associated cache pool when removing: ~> lvremove vg/cachepool If the user wishes to remove the origin, but leave the cachepool to be used for another LV, they specify the cache LV. ~> lvremove vg/lv In order to remove it all, specify both LVs. This patch also includes tests to create and remove cache pools and cache LVs.	2014-02-04 16:50:16 -06:00
Jonathan Brassow	75b8ea195c	cache: New functions for gathering info on cache devices Building on the new DM function that parses DM cache status, we introduce the following LVM level functions to acquire information about cache devices: - lv_cache_block_info: retrieves information on the cache's block/chunk usage - lv_cache_policy_info: retrieves information on the cache's policy	2014-01-28 12:24:51 -06:00
Zdenek Kabelac	902b343e0e	thin: validate resize of thin LV with ext. origin When a thin volume is using an external origin, the current thin target is not able to supply the 'extended' size with empty pages. lvm2 detects the version and disables extension of the LV past the external origin size in this case. The thin LV could however still be reduced and extended freely below this size.	2014-01-23 14:20:34 +01:00
Zdenek Kabelac	c3d82d717c	Revert "tree_action: destroy devices from failing activation" This reverts commit `24639be558`. Ok - seems we could be a bit too active here - and we may remove devices which are unusable for reasons we are not aware of - thus taking down the whole device could be way too big a hammer. So we still need some solution to recover from failing preload and activation - but it needs more tuning.	2013-12-17 15:21:28 +01:00
Zdenek Kabelac	24639be558	tree_action: destroy devices from failing activation When activation fails - we may leak large tree of partially loaded devices in the dm table (i.e. failure in snapshot activation) The best we can do here is try to deactivate whole device and remove as much inactive table entries as we can.	2013-12-17 14:08:54 +01:00
Zdenek Kabelac	1200b7e7c2	thin: deactivation of merging thin snapshot Before trying to deactivate merging thin snapshot (which is invisible) check if it's not in-use.	2013-12-04 14:30:26 +01:00
Zdenek Kabelac	664a695561	thin: merge support for device tree When a thin snapshot merge is requested, the tree must detect if the user tries to activate such an LV while the origin or snapshot is still active.	2013-12-04 14:30:25 +01:00
Zdenek Kabelac	572983d793	thin: read table line with thin device id Add functions to parse thin table line to obtain thin device id.	2013-12-04 14:30:25 +01:00
Zdenek Kabelac	84b3852ee5	snapshots: use lv_check_not_in_use Switch from a simple 'open_count' test on opened snapshot to a more 'skilled' lv_check_not_in_use().	2013-12-04 14:30:24 +01:00
Zdenek Kabelac	6bf6430ae9	cleanup: convert log_error with log_warn Collapse 2 ifs and replace log_error() with log_warn(), since the reported message is not causing tools error. (and cannot be probably triggered anyway).	2013-11-28 12:48:01 +01:00
Zdenek Kabelac	79991aa769	snapshot: drop find_merging_snapshot Drop find_merging_snapshot() function. Use find_snapshot() called after check for lv_is_merging_origin() which is the commonly used code path - so we avoid duplicated tests and the potential risk of dereferencing a NULL pointer in an unhandled error path.	2013-11-28 12:42:43 +01:00
Zdenek Kabelac	069fa6c49d	activate: modify read_only when dev_manager exists Change opts only when dm has been successfully created. So on the error path we leave structure unmodified.	2013-11-22 20:58:13 +01:00
Alasdair G Kergon	527db4645f	gcc: replace #ifdef linux with __linux__	2013-11-13 13:56:29 +00:00
Zdenek Kabelac	c3e674ad30	activation: _lv_activate is ok when filtered. If the volume_list filters out volume from activation, it is still success result for this function. Change the error message back to verbose level. Detect if the volume is active locally before zeroing, so we report error a bit later for cases, where volume could not be activated because it doesn't pass through volume list (but user still could create volume when he disables zeroing)	2013-11-01 13:02:36 +01:00
Jonathan Brassow	d5896f0afd	Mirror: Fix hangs and lock-ups caused by attempting label reads of mirrors There is a problem with the way mirrors have been designed to handle failures that is resulting in stuck LVM processes and hung I/O. When mirrors encounter a write failure, they block I/O and notify userspace to reconfigure the mirror to remove failed devices. This process is open to a couple of races: 1) Any LVM process other than the one that is meant to deal with the mirror failure can attempt to read the mirror, fail, and block other LVM commands (including the repair command) from proceeding due to holding a lock on the volume group. 2) If there are multiple mirrors that suffer a failure in the same volume group, a repair can block while attempting to read the LVM label from one mirror while trying to repair the other. Mitigation of these races has been attempted by disallowing label reading of mirrors that are either suspended or are indicated as blocking by the kernel. While this has closed the window of opportunity for hitting the above problems considerably, it hasn't closed it completely. This is because it is still possible to start an LVM command, read the status of the mirror as healthy, and then perform the read for the label at the moment after the failure is discovered by the kernel. I can see two solutions to this problem: 1) Allow users to configure whether mirrors can be candidates for LVM labels (i.e. whether PVs can be created on mirror LVs). If the user chooses to allow label scanning of mirror LVs, it will be at the expense of a possible hang in I/O or LVM processes. 2) Instrument a way to allow asynchronous label reading - allowing blocked label reads to be ignored while continuing to process the LVM command. This action would allow LVM commands to continue even though they would have otherwise blocked trying to read a mirror. They can then release their lock and allow a repair command to commence. In the event of #2 above, the repair command already in progress can continue and repair the failed mirror. This patch brings solution #1. If solution #2 is developed later on, the configuration option created in #1 can be negated - allowing mirrors to be scanned for labels by default once again.	2013-10-22 19:14:33 -05:00
Peter Rajnoha	039bdad732	activation: flag temporary LVs internally Add LV_TEMPORARY flag for LVs with limited existence during command execution. Such LVs are temporary in way that they need to be activated, some action done and then removed immediately. Such LVs are just like any normal LV - the only difference is that they are removed during LVM command execution. This is also the case for LVs representing future pool metadata spare LVs which we need to initialize by using the usual LV before they are declared as pool metadata spare. We can optimize some other parts like udev to do a better job if it knows that the LV is temporary and any processing on it is just useless. This flag is orthogonal to LV_NOSCAN flag introduced recently as LV_NOSCAN flag is primarily used to mark an LV for the scanning to be avoided before the zeroing of the device happens. The LV_TEMPORARY flag makes a difference between a full-fledged LV visible in the system and the LV just used as a temporary overlay for some action that needs to be done on underlying PVs. For example: lvcreate --thinpool POOL --zero n -L 1G vg - first, the usual LV is created to do a clean up for pool metadata spare. The LV is activated, zeroed, deactivated. - between "activated" and "zeroed" stage, the LV_NOSCAN flag is used to avoid any scanning in udev - between "zeroed" and "deactivated" stage, we need to avoid the WATCH udev rule, but since the LV is just a usual LV, we can't make a difference. The LV_TEMPORARY internal LV flag helps here. If we create the LV with this flag, the DM_UDEV_DISABLE_DISK_RULES and DM_UDEV_DISABLE_OTHER_RULES flag are set (just like as it is with "invisible" and non-top-level LVs) - udev is directed to skip WATCH rule use. - if the LV_TEMPORARY flag was not used, there would normally be a WATCH event generated once the LV is closed after "zeroed" stage. This would cause problems with the immediate deactivation that follows.	2013-10-23 14:09:37 +02:00
Peter Rajnoha	48df36b8c5	activation: check for open count with a timeout before removal/deactivation of an LV This patch reinstates the lv_info call to check for open count of the LV we're removing/deactivating - this was changed with commit `125712b` some time ago and we relied on the ioctl retry logic deeper in the libdm while calling the exact 'remove' ioctl. However, there are still some situations in which it's still required to check for open count before we do any 'remove' actions - this mainly applies to LVs which consist of several sub LVs, like it is for virtual snapshot devices. The commit `1146691` fixed the issue with ordering of actions during virtual snapshot removal while the snapshot is still open. But the check for the open status of the snapshot is still prone to marking the snapshot as in use with an immediate exit even though this could be a temporary asynchronous open only, most notably because of udev and its WATCH udev rule with accompanying scans for the event which is asynchronous. The situation where this crops up most often is when we're closing the LV that was open for read-write and then calling lvremove immediately. This patch reinstates the original lv_info call for the open status of the LV in the lv_check_not_in_use fn that gets called before we do any LV removal/deactivation. In addition to original logic, this patch adds its own retry loop with a delay (25x0.2 seconds) besides the existing ioctl retry loop.	2013-10-15 12:44:42 +02:00
Jonathan Brassow	d97583cfd3	RAID: Better error message when attempting scrubbing op on thinpool LV Component LVs of a thinpool can be RAID LVs. Users who attempt a scrubbing operation directly on a thinpool will be prompted to specify the sub-LV they wish the operation to be performed on. If neither of the sub-LVs are RAID, then a message telling them that the operation can only be performed on a RAID LV will be given.	2013-10-14 15:14:16 -05:00
Zdenek Kabelac	1146691afc	snapshot: deactivate virtual snapshot first Since the virtual snapshot has no reason to stay alive once we detach the related snapshot - deactivate the whole thing before snapshot removal - otherwise the code would get tricky to support in a cluster. The correct full solution would require transactions for libdm operations. Also enable the check for the snapshot being open prior to the origin deactivation, otherwise we could easily end up with the origin being deactivated but the snapshot still kept active, desynchronizing the locking state in the cluster.	2013-10-14 00:25:15 +02:00
Peter Rajnoha	ce7489ed22	activation: add support for flagging an LV to skip udev scanning during activation A common scenario is during new LV creation when we need to wipe the newly created LV and avoid any udev scanning before this stage otherwise it could cause the device (the LV) to be claimed by some other subsystem for which there were stale metadata within LV data. This patch adds possibility to mark the LV we're just about to wipe with a flag that gets passed to udev via DM_COOKIE as a subsystem specific flag - DM_SUBSYSTEM_UDEV_FLAG0 (in this case the subsystem is "LVM") so LVM udev rules will take care of handling that.	2013-10-08 13:43:14 +02:00
Peter Rajnoha	b4637bd298	fix: make it possible to compile with --disable-devmapper again Some code has been added recently which makes it impossible to compile when "configure --disable-devmapper" is used. This patch just shuffles the code around so it's under proper #ifdef DEVMAPPER_SUPPORT.	2013-09-27 13:58:55 +02:00
Zdenek Kabelac	1fdead8d97	activation: use improved lv_info Call lv_info() with info == NULL to query for local active presence.	2013-09-23 12:13:08 +02:00
Zdenek Kabelac	3b604e5c8e	lvinfo: allow to use lv_info with NULL info When NULL info struct is passed in - function is usable as a quick query for lv_is_active_locally() - with a bonus we may query for layered device. So it could be seen as a more efficient lv_is_active_locally().	2013-09-23 12:13:06 +02:00
Zdenek Kabelac	85b9c12e92	cleanup: release all memory in error path Just ensure no memory will stay in pool even in error path.	2013-09-23 11:35:15 +02:00
Zdenek Kabelac	861a3b2f19	cleanup: monitoring more readable Put continue path into one code segment.	2013-09-23 11:35:15 +02:00
Jonathan Brassow	2c41c8b886	RAID: Don't allow syncaction changes on non-RAID LVs Don't allow syncaction or other RAID-type messages on non-RAID logical volumes.	2013-09-19 22:33:01 -05:00
Zdenek Kabelac	f5832d8c49	deactivate: drop readahead calc in deactivation Skip readahead when device will be deactivated.	2013-09-07 09:13:20 +02:00
Zdenek Kabelac	655296609e	thin: fix monitoring of thin pool volume Properly skip unmonitoring of thin pool volume in deactivation code path. Code makes sure if there is just any thin pool user it stays monitored with all its resources.	2013-09-07 03:31:04 +02:00
Alasdair G Kergon	c0f987949b	activation: Fix segfault with inactive pvmove LV. Set flag to avoid recursion back through an inactive pvmove LV when populating deptree.	2013-08-28 22:56:23 +01:00
Jonathan Brassow	c95f17ea64	Mirror: Fix issue preventing PV creation on mirror LVs Commit `b248ba0a39` attempted to prevent mirror devices which had a failed device in their mirrored log from being usable/readable by LVM. This was to protect against circular dependencies where one LVM command could be blocked trying to read one of these affected mirrors while the LVM command to fix/unblock that mirror was stuck behind the currently running command. The above commit went wrong when it used 'device_is_usable()' to recurse on the mirrored log device to check if it was suspended or blocked. The 'device_is_usable' function also contains a check for reserved names - like *_mlog, etc. This last check always triggered when checking a mirror's log simply because of the name, not because it was suspended or blocked - a false positive. The solution is to create a new function like 'device_is_usable', but without the check for reserved names. Using this new function (device_is_suspended_or_blocked), we can check the status of a mirror's log device properly.	2013-08-07 17:42:26 -05:00
Zdenek Kabelac	aed4e9c703	coverity: pointer validation Check for metadata_lv and make sure we have got proper thin pool segment. Check we are working with merging snapshot when adding merging target.	2013-07-22 12:41:21 +02:00
Jonathan Brassow	4eea660191	RAID: Fix segfault when reporting raid_syncaction field on older kernel The status printed for dm-raid targets on older kernels does not include the syncaction field. This is handled by dev_manager_raid_status() just fine by populating the raid status structure with NULL for that field. However, lv_raid_sync_action() does not properly handle that field being NULL. So, check for it and return 0 if it is NULL.	2013-07-19 10:01:48 -05:00
Zdenek Kabelac	fd31cc9dfc	cleanup: stack and remove braces Add stack trace for error path. Remove unneeded braces.	2013-07-18 18:16:17 +02:00
Zdenek Kabelac	9a06094824	thin: improve external origin tree creation When the tree for thin LVs was using external_lv, there was a far less optimal solution that tried to add certain existing dependencies only when a new node was added. However, this led to overly complex tree construction, since many repeated checks were made during such a tree build. This patch moves this detection to the proper _partial_tree generation code and uses a new 'activation' flag for it, which is set when the tree for ACTIVATION or PRELOAD is generated. It increases performance when thins with external origins are used. (in release update)	2013-07-15 16:00:06 +02:00
Zdenek Kabelac	57be501aa3	dev_manager: lower memory usage The dlid created for the test is not needed afterward, so lower the memory usage, as this call is used repeatedly when building a large tree. TODO: create a function that uses a given buffer on the stack, which is much cheaper.	2013-07-15 15:59:20 +02:00
Zdenek Kabelac	0443c42e3b	thin: add sub volumes as whole volumes Do not use origin_only when add log_lv and metadata as a subvolume. The stacked volume needs to access whole volume in this case.	2013-07-15 15:58:07 +02:00
Zdenek Kabelac	97d36d5750	thin: check and use layered origin lv Code needs to check if the layer origin device is suspended. It's valid to create a thin volume snapshot of a thin volume which is also used as an old-style snapshot. In this case we need to check that -real is suspended. When adding origin_only - add only the layer thin volume. (in case it's also an old-snapshot, add only the -real device)	2013-07-15 15:51:39 +02:00
Zdenek Kabelac	55d90b6420	cleanup: update commented-out code part Just make it in-sync with latest proposal.	2013-07-15 15:40:46 +02:00
Mike Snitzer	f9e0adcce5	snapshot: Rename snapshot segment returning methods from find__cow to find__snapshot find_cow -> find_snapshot, find_merging_cow -> find_merging_snapshot. Will allow thin snapshot code to reuse these methods without confusion.	2013-07-02 16:26:03 -04:00
Peter Rajnoha	d6a91da4be	config: add profile arg to find_config_tree_bool	2013-07-02 15:19:09 +02:00
Peter Rajnoha	8ac4fcf8ff	config: add profile arg to find_config_tree_str_allow_empty	2013-07-02 15:19:09 +02:00
Peter Rajnoha	06dd66af54	config: add profile arg to find_config_tree_str	2013-07-02 15:19:09 +02:00
Peter Rajnoha	eeb7b0f7fa	config: add profile arg to find_config_tree_node	2013-07-02 15:19:09 +02:00
Jonathan Brassow	1acad23d68	RAID: Remove optimization using static vars in lv_raid_dev_health Revert commit `37ffe6a`. If static variables are to be used then we will put them elsewhere and limit the optimization to reporting code, rather than have it be used in the general case.	2013-06-17 13:03:15 -05:00
Zdenek Kabelac	17a3ddf89e	cleanup: drop unused headers Drop headers which do not provide any used symbols.	2013-06-16 00:07:32 +02:00
Alasdair G Kergon	c2dc21d89f	text: miscellaneous comments & message tweaks	2013-06-15 01:28:54 +01:00
Zdenek Kabelac	55a3859632	thin: detect online metadata resize support	2013-06-11 14:03:28 +02:00
Petr Rockai	7d644443e0	activation: Pass both ondisk and incore LV to suspend.	2013-06-10 17:26:38 +02:00
Petr Rockai	f65dd341a5	locking: Make it possible to pass down an LV to activation code. Previously, we have relied on UUIDs alone, and on lvmcache to make getting a "new copy" of VG metadata fast. If the code which triggers the activation has the correct VG metadata at hand (the version which is currently on disk), it can now hand it to the activation code directly.	2013-06-10 17:26:38 +02:00
Zdenek Kabelac	9966842810	snapshot: skip monitor for large cows If snapshot cow device is already big enough to cover whole origin, do not monitor it.	2013-05-27 10:35:43 +02:00
Zdenek Kabelac	3ba3bc0d66	cleanup: drop backtrace After log_error/log_warn there is no point to show <backtrace> in debug log trace from the next code line.	2013-05-27 10:28:32 +02:00
Jonathan Brassow	06ac797f42	Clean-up: Replace 'lv_is_active' with more correct/specific variants There are places where 'lv_is_active' was being used where it was more correct to use 'lv_is_active_locally'. For example, when checking for the existence of a kernel instance before asking for its status. Most of the time these would work correctly. (RAID is only allowed on non-clustered VGs at the moment, which means that 'lv_is_active' and 'lv_is_active_locally' would give the same result.) However, it is more correct to use the proper variant and it helps with future scenarios where targets might be allowed exclusively (or clustered) in a cluster VG.	2013-05-16 10:36:56 -05:00
Alasdair G Kergon	f12d88f840	activation: fix lv_is_active regressions Try to fix commit `bf2741376d`. lv_is_active is not the same as lv_info(cmd, org, 0, &info, 0, 0). Introduce and use lv_is_active_locally.	2013-05-15 02:13:31 +01:00
Alasdair G Kergon	2fbe1e6e00	rephrasing: miscellaneous changes Miscellaneous changes to messages, man pages, comments and WHATS_NEW.	2013-05-15 01:50:42 +01:00
Peter Rajnoha	4407133113	toolcontext: check dm version lazily for udev_fallback setting Setting the cmd->default_settings.udev_fallback also requires DM driver version check. However, this caused useless mapper/control access with ioctl if not needed actually. For example if we're not using activation code, we don't need to know the udev_fallback as there's no node and symlink processing. For example, this premature mapper/control access caused problems when using lvm2app even when no activation happens - there are situations in which we don't need to use mapper/control, but still need some of the lvm2app functionality. This is also the case for lvm2-activation systemd generator which just needs to look at the lvm2 configuration, but it shouldn't touch mapper/control.	2013-05-13 11:53:53 +02:00
Zdenek Kabelac	f39f5b86c3	cleanup: use dm_list_iterate_items	2013-04-25 17:33:24 +02:00
Zdenek Kabelac	4e1ac7faf1	cleanup: add some FIXMEs	2013-04-21 23:14:05 +02:00
Zdenek Kabelac	dd4fdce16c	cleanup: drop unused assignment Assigned values are unused.	2013-04-21 23:14:04 +02:00
Jonathan Brassow	c363c74a25	CLEAN-UP: Better string checking to avoid substring matches Commit `9fd7ac7d03` introduced a method of avoiding reading from mirrors with a device failure. If a device was found to be dead, the mapping table was checked for 'handle_errors' or 'block_on_error'. These strings were checked for in the table string via 'strstr', which could also match on strings like 'no_handle_errors' or 'no_block_on_error'. No such strings exist, but we don't want to have problems in the future if they do. So, we check for ' <string>{'\0'|' '}'.	2013-04-12 11:30:04 -05:00
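The whole-token check described in commit `c363c74a25` can be sketched roughly as follows. This is an illustrative sketch, not the code from the commit: the function name `table_has_token` is hypothetical, and, unlike the pattern quoted in the message, it also accepts a match at the very start of the string.

```c
#include <string.h>

/*
 * Hypothetical helper (not from the lvm2 tree): return 1 if 'token'
 * appears in the space-separated 'table' string as a whole word.
 * A plain strstr() would also match "no_handle_errors" when asked
 * for "handle_errors"; here both neighbours must be a space or a
 * string boundary.
 */
int table_has_token(const char *table, const char *token)
{
	size_t len = strlen(token);
	const char *p = table;

	if (!len)
		return 0;

	while ((p = strstr(p, token))) {
		int starts = (p == table) || (p[-1] == ' ');
		int ends = (p[len] == '\0') || (p[len] == ' ');

		if (starts && ends)
			return 1;
		p++;	/* keep searching after a partial match */
	}

	return 0;
}
```

With this check, a table string containing only `no_handle_errors` no longer counts as containing `handle_errors`.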
Jonathan Brassow	ff64e3500f	RAID: Add scrubbing support for RAID LVs New options to 'lvchange' allow users to scrub their RAID LVs. Synopsis: lvchange --syncaction {check|repair} vg/raid_lv RAID scrubbing is the process of reading all the data and parity blocks in an array and checking to see whether they are coherent. 'lvchange' can now initiate the two scrubbing operations: "check" and "repair". "check" will go over the array and record the number of discrepancies but not repair them. "repair" will correct the discrepancies as it finds them. 'lvchange --syncaction repair vg/raid_lv' is not to be confused with 'lvconvert --repair vg/raid_lv'. The former initiates a background synchronization operation on the array, while the latter is designed to repair/replace failed devices in a mirror or RAID logical volume. Additional reporting has been added for 'lvs' to support the new operations. Two new printable fields (which are not printed by default) have been added: "syncaction" and "mismatches". These can be accessed using the '-o' option to 'lvs', like: lvs -o +syncaction,mismatches vg/lv "syncaction" will print the current synchronization operation that the RAID volume is performing. It can be one of the following: - idle: All sync operations complete (doing nothing) - resync: Initializing an array or recovering after a machine failure - recover: Replacing a device in the array - check: Looking for array inconsistencies - repair: Looking for and repairing inconsistencies The "mismatches" field will print the number of discrepancies found during a check or repair operation. The 'Cpy%Sync' field already available to 'lvs' will print the progress of any of the above syncactions, including check and repair. Finally, the lv_attr field has changed to accommodate the scrubbing operations as well. The role of the 'p'artial character in the lv_attr report field has expanded. 
"Partial" is really an indicator for the health of a logical volume and it makes sense to extend this to include other health indicators as well, specifically: 'm'ismatches: Indicates that there are discrepancies in a RAID LV. This character is shown after a scrubbing operation has detected that portions of the RAID are not coherent. 'r'efresh : Indicates that a device in a RAID array has suffered a failure and the kernel regards it as failed - even though LVM can read the device label and considers the device to be ok. The LV should be 'r'efreshed to notify the kernel that the device is now available, or the device should be 'r'eplaced if it is suspected of failing.	2013-04-11 15:33:59 -05:00
Jonathan Brassow	38f8f4a958	RAID: Capture new RAID kernel sync_action status fields I've updated the dm_status_raid structure and dm_get_status_raid() function to make it handle the new kernel status fields that will be coming in dm-raid v1.5.0. It is backwards compatible with the old status line - initializing the new fields to '0'. The new structure is also more amenable to future changes. It includes a 'reserved' field that is currently initialized to zero but could be used to hold flags describing new features. It also now uses pointers for the character strings instead of attempting to allocate their space along with the structure (causing the size of the structure to be variable). This allows future fields to be appended. The new fields that are available are: - sync_action : shows what the sync thread in the kernel is doing (idle, frozen, resync, recover, check, repair, or reshape) - mismatch_count: shows the number of discrepancies which were found or repaired by a "check" or "repair" process, respectively.	2013-04-08 15:04:08 -05:00
Zdenek Kabelac	b9fe52e811	cleanup: move comment	2013-03-13 15:13:50 +01:00
Zdenek Kabelac	293a06c39a	cleanup: indent	2013-03-13 15:13:42 +01:00
Jonathan Brassow	ed6f3945fd	clean-up: Typo 's/should had/should have/'	2013-03-06 08:42:03 -06:00
Peter Rajnoha	386886f71c	config: refer to config nodes using assigned IDs For example, the old call and reference: find_config_tree_str(cmd, "devices/dir", DEFAULT_DEV_DIR) ...now becomes: find_config_tree_str(cmd, devices_dir_CFG) So we're referring to the named configuration ID instead of passing the configuration path and the default value is taken from central config definition in config_settings.h automatically.	2013-03-06 10:14:33 +01:00
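The idea in commit `386886f71c` of referring to settings by ID, with the path and default held in one central table, can be sketched roughly as below. Only `devices_dir_CFG` and the path `devices/dir` come from the commit message. The struct layout, `find_cfg_str`, `sample_lookup`, `example_name_CFG` and the default values shown are hypothetical stand-ins, not the contents of config_settings.h.

```c
#include <stddef.h>
#include <string.h>

/* IDs for configuration nodes; example_name_CFG is a made-up entry. */
enum cfg_id { devices_dir_CFG, example_name_CFG, CFG_COUNT };

/* One central definition per setting: its path and its default. */
struct cfg_def {
	const char *path;
	const char *default_str;
};

/* Assumed defaults, for illustration only. */
const struct cfg_def cfg_defs[CFG_COUNT] = {
	[devices_dir_CFG]  = { "devices/dir",  "/dev"  },
	[example_name_CFG] = { "example/name", "unset" },
};

/* Stand-in for a parsed config tree: overrides devices/dir only. */
const char *sample_lookup(const char *path)
{
	return strcmp(path, "devices/dir") ? NULL : "/mydev";
}

/*
 * Callers pass only the ID; the path and the fallback default are
 * taken from the table rather than repeated at every call site.
 */
const char *find_cfg_str(enum cfg_id id,
			 const char *(*lookup)(const char *path))
{
	const char *value = lookup ? lookup(cfg_defs[id].path) : NULL;

	return value ? value : cfg_defs[id].default_str;
}
```

The benefit of this layout is that a default changes in one place only, and a mistyped path can no longer slip into an individual call.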
Zdenek Kabelac	71f4934500	activation: fix pvmove partial tree creation Do not try to add LV again into the partial tree, if it's been already added. Otherwise we may end in endless loop.	2013-02-23 12:09:12 +01:00
Zdenek Kabelac	b73de73151	thin: lvconvert support for external origin Add basic support for converting LV into an external origin volume. Syntax: lvconvert --thinpool vg/pool --originname renamed_origin -T origin It will convert volume 'origin' into a thin volume, which will use 'renamed_origin' as an external read-only origin. All read/write into origin will go via 'pool'. renamed_origin volume is read-only volume, that could be activated only in read-only mode, and cannot be modified.	2013-02-23 10:38:20 +01:00
Zdenek Kabelac	87331dc419	thin: add support for external origin Add internal support for thin volume's external origin.	2013-02-23 10:36:58 +01:00
Zdenek Kabelac	3679bb1cd9	activation: simplify activation code Reorder activation code to look similar for preload tree and activation tree. It also gives much better support for device stacking, since now we also support activation of a snapshot which might then be used for other devices.	2013-02-23 10:30:03 +01:00
Zdenek Kabelac	0631d233d8	activation: add _add_layer_target_to_dtree Add function for creation of simple linear mapping over layer device.	2013-02-23 10:29:08 +01:00
Zdenek Kabelac	520cc9a7f8	thin: replace _thin_layer with lv_layer() Use the lv_layer function consistently internally for thin pool layer name.	2013-02-23 10:28:04 +01:00
Zdenek Kabelac	78b23f3595	activation: extend _cached_info Add layer string to support check of layered devices.	2013-02-23 10:28:01 +01:00
Jonathan Brassow	f5cd9c3563	clean-up: Another function that can use 'lv_layer' lib/activate/dev_manager.c:dev_manager_raid_status() can also use the new 'lv_layer' function.	2013-02-04 17:10:16 -06:00
Zdenek Kabelac	a4870c79ca	thin: use noflush for obtaining transaction_id Do not flush thin pool data when reading transaction_id status.	2013-02-04 19:05:56 +01:00
Zdenek Kabelac	8ed0b6f312	thin: replace is_active with send_messages Since is_active is only used for thinp replace struct member with more meaningful send_messages flag	2013-02-04 19:01:10 +01:00
Zdenek Kabelac	4af4241ba4	use lv_layer	2013-02-04 19:01:10 +01:00
Zdenek Kabelac	ca7abbce8a	activate: add lv_layer function Add function to return layer name for LV.	2013-02-04 19:01:10 +01:00
Zdenek Kabelac	9f433e6ee3	cleanup: postpone lv_is_thin_volume check Code move to make it easier to follow and call _add_dev_to_dtree() in the separate if() branch for thin volumes.	2013-02-04 19:00:19 +01:00
Jonathan Brassow	37ffe6a13a	RAID: Cache previous results of lv_raid_dev_health for future use We can avoid many dev_manager (ioctl) calls by caching the results of previous calls to lv_raid_dev_health. Just considering the case where 'lvs -a' is called to get the attributes of a RAID LV and its sub-lvs, this function would be called many times. (It would be called at least 7 times for a 3-way RAID1 - once for the health of each sub-LV and once for the health of the top-level LV.) This is a good idea because the sub-LVs are processed in groups along with their parent RAID LV and in each case, it is the parent LV whose status will be queried. Therefore, there only needs to be one trip through dev_manager for each time the group is processed.	2013-02-01 11:32:18 -06:00
Jonathan Brassow	c8242e5cf4	RAID: Add RAID status accessibility functions Similar to the way thin* accesses its kernel status, we add a method for RAID to grab the various values in its status output without the higher levels (LVM) having to understand how to parse the output. Added functions include: - lib/activate/dev_manager.c:dev_manager_raid_status() Pulls the status line from the kernel - libdm/libdm-deptree.c:dm_get_status_raid() Parses status line and puts components into dm_status_raid struct - lib/activate/activate.c:lv_raid_dev_health() Accesses dm_status_raid to deliver raid dev_health string The new structure and functions can provide a more unified way to access status information. ('lv_raid_percent' could switch to using these functions, for example.)	2013-02-01 11:31:47 -06:00
Alasdair G Kergon	06abb2dd4c	logging: classify log_debug messages Place most log_debug() messages into a class.	2013-01-07 22:30:29 +00:00
Zdenek Kabelac	ec49f07b0d	mirrors: fix leak in device_is_usable mirror check Function _ignore_blocked_mirror_devices was not releasing allocated strings images_health and log_health. In error paths it was also not releasing dm_task structure. Swapped return code of _ignore_blocked_mirror_devices and use 1 as success. In _parse_mirror_status use log_error if memory allocation fails and for a few more errors so they are not going unnoticed as debug messages. On error path always clear return values and free strings. For dev_create_file use cache mem pool to avoid memleak.	2012-12-11 11:15:22 +01:00
Peter Rajnoha	35a4d70aad	activation: don't miss the log on empty {auto_activation|read_only|}_volume_list Addendum to previous commit...	2012-12-04 14:12:36 +01:00
Peter Rajnoha	e2be2652ad	Allow empty activation/{auto_activation|read_only|}_volume_list config option. In case we don't want to activate, autoactivate or have the VG/LV read-only. Primarily targeted for the auto_activation_volume_list, but it does no harm for other settings (the part of the code that reads these three settings is shared, but there's no reason to separate it only for this change).	2012-12-04 10:33:54 +01:00
Zdenek Kabelac	683b1f0625	thin: detect discards for non-power-2 Check if target supports discards for chunk sizes, that are not power of 2 (just multiple of 64K), and enable it in case it's supported by thin kernel target.	2012-11-26 12:14:47 +01:00
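The decision described in commit `683b1f0625` can be illustrated with a small sketch. It assumes chunk sizes expressed in bytes and a boolean saying whether the kernel thin target reports support for non-power-of-2 chunk sizes. The names `discards_usable` and `is_power_of_2` are hypothetical, and the real feature detection in lvm2 is not reproduced here.

```c
#include <stdint.h>

#define CHUNK_MULTIPLE (64 * 1024)	/* 64K granularity from the commit message */

/* A non-zero value with exactly one bit set is a power of 2. */
int is_power_of_2(uint64_t n)
{
	return n && !(n & (n - 1));
}

/*
 * Hypothetical decision helper: discards are usable for power-of-2
 * chunk sizes, and for other multiples of 64K only when the thin
 * target supports them.
 */
int discards_usable(uint64_t chunk_bytes, int target_supports_non_pow2)
{
	if (!chunk_bytes || (chunk_bytes % CHUNK_MULTIPLE))
		return 0;	/* not a valid multiple of 64K */

	return is_power_of_2(chunk_bytes) || target_supports_non_pow2;
}
```

For example, a 192K chunk size is a multiple of 64K but not a power of 2, so in this sketch it depends entirely on the target capability flag.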
Jonathan Brassow	b248ba0a39	mirror: Avoid reading mirrors with failed devices in mirrored log Commit `9fd7ac7d03` did not handle mirrors that contained mirrored logs. This is because the status line of the mirror does not give an indication of the health of the mirrored log, as you can see here: [root@bp-01 lvm2]# dmsetup status vg-lv vg-lv_mlog vg-lv: 0 409600 mirror 2 253:6 253:7 400/400 1 AA 3 disk 253:5 A vg-lv_mlog: 0 8192 mirror 2 253:3 253:4 7/8 1 AD 1 core Thus, the possibility for LVM commands to hang still persists when mirror have mirrored logs. I discovered this while performing some testing that does polling with 'pvs' while doing I/O and killing devices. The 'pvs' managed to get between the mirrored log device failure and the attempt by dmeventd to repair it. The result was a very nasty block in LVM commands that is very difficult to remove - even for someone who knows what is going on. Thus, it is absolutely essential that the log of a mirror be recursively checked for mirror devices which may be failed as well. Despite what the code comment says in the aforementioned commit... + * _mirrored_transient_status(). FIXME: It is unable to handle mirrors + * with mirrored logs because it does not have a way to get the status of + * the mirror that forms the log, which could be blocked. ... it is possible to get the status of the log because the log device major/minor is given to us by the status output of the top-level mirror. We can use that to query the log device for any DM status and see if it is a mirror that needs to be bypassed. This patch does just that and is now able to avoid reading from mirrors that have failed devices in a mirrored log.	2012-10-25 00:42:45 -05:00
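As a rough illustration of the health check discussed in commits `b248ba0a39` and `9fd7ac7d03`, the sketch below scans the parameters of a mirror status line for a 'D' health character. The field layout (device count, device list, sync ratio, one count field, then the health characters) is inferred only from the `dmsetup status` example quoted above. The function name is hypothetical and this is not the parser used in dev_manager.c.

```c
#include <stdlib.h>
#include <string.h>

/*
 * 'params' is the text following the "mirror" target name, e.g.
 *   "2 253:3 253:4 7/8 1 AD 1 core"
 * Returns 1 if any image is marked 'D' (dead), 0 if none is,
 * and -1 if the line does not match the assumed layout.
 */
int mirror_params_have_dead_image(const char *params)
{
	char buf[256];
	char *tok;
	unsigned long n, i;

	if (strlen(params) >= sizeof(buf))
		return -1;
	strcpy(buf, params);	/* strtok() modifies its input */

	if (!(tok = strtok(buf, " ")))
		return -1;
	if (!(n = strtoul(tok, NULL, 10)))
		return -1;

	/* Skip the n device tokens, the sync ratio and the count field. */
	for (i = 0; i < n + 2; i++)
		if (!strtok(NULL, " "))
			return -1;

	if (!(tok = strtok(NULL, " ")))
		return -1;

	return strchr(tok, 'D') != NULL;
}
```

Applied to the two lines in the example, the top-level mirror (`AA`) looks healthy while its mirrored log (`AD`) has a failed device, which is why the log has to be queried separately.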
Jonathan Brassow	9fd7ac7d03	mirror: Avoid reading from mirrors that have failed devices Addresses: rhbz855398 (Allow VGs to be built on cluster mirrors), and other issues. The LVM code attempts to avoid reading labels from devices that are suspended to try to avoid situations that may cause the commands to block indefinitely. When scanning devices, 'ignore_suspended_devices' can be set so the code (lib/activate/dev_manager.c:device_is_usable()) checks any DM devices it finds and avoids them if they are suspended. The mirror target has an additional mechanism that can cause I/O to be blocked. If a device in a mirror fails, all I/O will be blocked by the kernel until a new table (a linear target or a mirror with replacement devices) is loaded. The mirror indicates that this condition has happened by marking a 'D' for the faulty device in its status output. This condition must also be checked by 'device_is_usable()' to avoid the possibility of blocking LVM commands indefinitely due to an attempt to read the blocked mirror for labels. Until now, mirrors were avoided if the 'ignore_suspended_devices' condition was set. This check seemed to suggest, "if we are concerned about suspended devices, then let's ignore mirrors altogether just in case". This is insufficient and doesn't solve any problems. All devices that are suspended are already avoided if 'ignore_suspended_devices' is set; and if a mirror is blocking because of an error condition, it will block the LVM command regardless of the setting of that variable. Rather than avoiding mirrors whenever 'ignore_suspended_devices' is set, this patch causes mirrors to be avoided whenever they are blocking due to an error. (As mentioned above, the case where a DM device is suspended is already covered.) This solves a number of issues that weren't handled before. 
For example, pvcreate (or any command that does a pv_read or vg_read, which eventually call device_is_usable()) will be protected from blocked mirrors regardless of how 'ignore_suspended_devices' is set. Additionally, a mirror that is neither suspended nor blocking is /allowed/ to be read regardless of how 'ignore_suspended_devices' is set. (The latter point being the source of the fix for rhbz855398.)	2012-10-23 23:10:33 -05:00
Zdenek Kabelac	cf8e1a0093	thin: origin only suspend Skip tree creating when used with origin_only flag.	2012-10-03 15:05:55 +02:00
Zdenek Kabelac	eb08f86521	cleanup: initialize percent to INVALID Always initialize percent to INVALID value, in case the target forgets to set up this value somehow.	2012-08-23 14:38:48 +02:00
Zdenek Kabelac	5d0e7fb4ed	activation: report error message If the monitoring activation failed and we have not yet reported error - give the user error message for failure reason.	2012-08-23 14:38:48 +02:00
Zdenek Kabelac	fd417db274	check: add internal errors for unexpected paths Adding couple INTERNAL_ERROR reports for unwanted parameters: Ensure the 'top' metadata node cannot be NULL for lvmetad. Make obvious vginfo2 cannot be NULL. Report internal error if handler and vg is undefined. Check for handle in poll_vg(). Ensure seg is not NULL in dev_manager_transient(). Report missing read_ahead for _lv_read_ahead_single(). Check for report handler in dm_report_object(). Check missing VG in _vgreduce_single().	2012-08-23 14:37:52 +02:00
Zdenek Kabelac	286cd2006b	cleanup: drop unneeded included header files These headers were not resolving anything used for compiled .c files. Remove unused util.c file.	2012-08-23 14:37:20 +02:00
Alasdair G Kergon	701b4a8363	thin: use discards as plural rather than singular Global change from --discard to --discards, as that feels more natural.	2012-08-07 21:24:41 +01:00
Alasdair G Kergon	0650a16a22	activation: log target version present Log (very verbose) the target version present in target_version.	2012-08-07 18:47:33 +01:00
Zdenek Kabelac	260e8f2476	thin: detect supported features from thinp target Add shell variable to override reported min version for testing: LVM_THIN_VERSION_MIN	2012-07-18 14:35:17 +02:00
Peter Rajnoha	ec8f377748	cleanup: static volume filter fn, lvm.conf comment Change 'lv_passes_volumes_filter' fn back to static as it's not actually needed in the other code (a remnant from devel version). Fix lvm.conf comment referencing '--autoactivate' which was finally decided to be '--activate ay'.	2012-06-29 10:28:53 +02:00
Peter Rajnoha	95ced7a7be	activate: add autoactivation hooks Define an 'activation_handler' that gets called automatically on PV appearance/disappearance while processing the lvmetad_pv_found and lvmetad_pv_gone functions that are supposed to update the lvmetad state based on PV availability state. For now, the actual support is for PV appearance only, leaving room for PV disappearance support as well (which is a more complex problem to solve as this needs to count with possible device stack). Add a new activation change mode - CHANGE_AAY exposed as '--activate ay/-aay' argument ('activate automatically'). Factor out the vgchange activation functionality for use in other tools (like pvscan...).	2012-06-28 09:42:47 -04:00
Zdenek Kabelac	2f99e5e35a	Sync filesystem for thin snapshots Add missing lockfs option when suspend origin, before thin volume snapshot is created	2012-06-15 14:43:07 +02:00
Alasdair Kergon	56d49cbf13	Re-enable partial activation of non-thin LVs until it can be fixed. (2.02.90) - The test should be checking the LV as a whole, not just individual segments.	2012-05-16 12:50:14 +00:00
Alasdair Kergon	067184f32d	Handle replacement of an active device that goes missing with an error device. (E.g. lvchange --refresh --partial on striped LV if a PV disappeared.)	2012-04-24 00:51:26 +00:00
Jonathan Earl Brassow	c62f9f0b2f	Unlike 'mirror' segtype, 'raid1' should perform flush on suspend. The 'mirror' segtype and 'raid1' segtype both set the 'MIRRORED' flag. However, due to differences in the way these device-mapper targets behave 'mirror' must be suspended with the 'noflush' option and 'raid1' does not have to be. This patch ensures that when the 'MIRRORED' flag is checked to see if 'noflush' is needed that it does not also set it for 'raid1' by mistake.	2012-04-20 14:17:44 +00:00
Zdenek Kabelac	2caa558e7c	Update and fix monitoring of thin pool devices Code adds better support for monitoring of thin pool devices. update_pool_lv uses DMEVENTD_MONITOR_IGNORE to not manipulate monitoring. vgchange & lvchange are checking the real thin pool device for existence, as we are using the _tpool real device and the visible LV pool device might not even be active (_tpool is activated implicitly for any thin volume). monitor_dev_for_events is another _lv_postorder-like code, so it might be worth thinking about reusing it here - for now update the code to properly monitor thin volume deps. For unmonitoring add extra code to check the usage of thin pool - in case it's in use, unmonitoring of thin volume is skipped.	2012-03-23 09:58:04 +00:00
Zdenek Kabelac	e866931169	Improve thin_check option passing Update a way we handle option passing - so we now support path and options with space inside. Fix dm name usage for thin pools with '-' in name. Use new lvm.conf option thin_check_options to pass in options as string array.	2012-03-14 17:12:05 +00:00
Zdenek Kabelac	aeaec150c0	Some more missing supposedly 64bit operations. Avoid using 32bit math for extent_size.	2012-03-05 15:05:24 +00:00
Zdenek Kabelac	975b5b42d2	Improve warning Use thin_dump --repair suggestion in log error message and use just warning on deactivation path without repair info (since node has been deactivated). Also check whether there is not 16 args for thin_check configured.	2012-03-05 14:15:50 +00:00
Zdenek Kabelac	6c7a6c07ee	Add support for thin check Use libdm callback to execute thin_check before thin pool activation and after deactivation as well. Supporting thin_check_executable which may pass in extra options for the tool.	2012-03-02 21:49:43 +00:00
Zdenek Kabelac	fbf6b89a84	Using enum types for enums alloc_policy_t, dm_string_mangling_t, percent_range_t, sign_t	2012-02-28 14:24:57 +00:00
Zdenek Kabelac	499a161640	Use const for lv lv_is_active doesn't need modifiable LV struct so keep it const. Remove lv_send_message() left bits from code - they were never released in 2.02.89.	2012-02-23 22:41:57 +00:00
Petr Rockai	dae0822698	The lvmetad client-side integration. Only active when use_lvmetad = 1 is set in lvm.conf and lvmetad is running.	2012-02-23 13:11:07 +00:00
Jonathan Earl Brassow	a30832cedd	Fix bug that caused RAID devices to be unable to activate if sub-LV was missing. Commit `02f6f4902f` introduced a bug that caused RAID devices to fail to activate if the device for a single sub-LV failed. The special case of LVM mirror was handled, but not LVM RAID. EXAMPLE: [root@bp-01 ~]# devices vg LV Copy% Devices lv 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] /dev/sde1(1) [lv_rimage_1] /dev/sdh1(1) [lv_rmeta_0] /dev/sde1(0) [lv_rmeta_1] /dev/sdh1(0) [root@bp-01 ~]# vgchange -an vg 0 logical volume(s) in volume group "vg" now active [root@bp-01 ~]# off.sh sdh Turning off sdh [root@bp-01 ~]# vgchange -ay vg --partial Partial mode. Incomplete logical volumes will be processed. Couldn't find device with uuid fbI0YO-GX7x-firU-Vy5o-vzwx-vAKZ-feRxfF. Cannot activate vg/lv_rimage_1: all segments missing. 0 logical volume(s) in volume group "vg" now active AFTER this patch: [root@bp-01 ~]# vgchange -ay vg --partial Partial mode. Incomplete logical volumes will be processed. Couldn't find device with uuid fbI0YO-GX7x-firU-Vy5o-vzwx-vAKZ-feRxfF. 1 logical volume(s) in volume group "vg" now active [root@bp-01 ~]# devices vg Couldn't find device with uuid fbI0YO-GX7x-firU-Vy5o-vzwx-vAKZ-feRxfF. LV Copy% Devices lv 100.00 lv_rimage_0(0),lv_rimage_1(0) [lv_rimage_0] /dev/sde1(1) [lv_rimage_1] unknown device(1) [lv_rmeta_0] /dev/sde1(0) [lv_rmeta_1] unknown device(0) [root@bp-01 ~]# dmsetup table vg-lv; dmsetup status vg-lv 0 1024000 raid raid1 3 0 region_size 1024 2 253:2 253:3 - - 0 1024000 raid raid1 2 AD 1024000/1024000 No WHATSNEW update necessary because this is an intrarelease fix. brassow	2012-02-13 17:59:21 +00:00
Alasdair Kergon	72b50d7fd2	give standard error message if lstat fails unexpectedly	2012-02-12 20:17:12 +00:00
Zdenek Kabelac	7b408a08ef	Check result of lstat If lstat returns errno different from ENOENT, do not use the content of struct stat 'buf'.	2012-02-08 10:43:42 +00:00
Zdenek Kabelac	ab852ffe66	Disable partial activation for thin LVs and LVs with all missing segments Count number of error and existing areas and if there is no existing area for the LV avoid its activation. Always disable partial activation for thin volumes. For mirrors currently put in hack to let it pass with a special name since current mirror code needs to activate such LV during some operations.	2012-02-01 13:47:27 +00:00
Zdenek Kabelac	15fd61e492	Fix data% reporting For reading % of mapped size of thin volume use as origin for old style snapshot '-real' device needs to be queried. Fix log_error report given for lvs -a in this case.	2012-01-28 20:12:26 +00:00
Zdenek Kabelac	209da6efee	Fix missing dmt destructor Also always initialize maj,min,patchlevel when success is returned.	2012-01-25 22:16:04 +00:00
Zdenek Kabelac	b185993628	Fix compilation with disabled devmapper During release preparation things have changed, so make sure we are compilable with --disable-devmapper.	2012-01-25 13:12:59 +00:00
Zdenek Kabelac	e8905d9816	Rename origin_only to more generic use_layer flag Since now we have more layered devices i.e. thin volumes - support selection of layer via flag.	2012-01-25 13:10:26 +00:00
Alasdair Kergon	c3f0ed04a6	Make commented out code more obvious	2012-01-25 11:10:06 +00:00
Zdenek Kabelac	2258242f6c	Thin use origin_only for thin pools as well Extend the usage of origin_only flag to allow resume of thin pool LV (when it's active) to pass only the messages. origin_only flag will skip detection of already resumed tree for thin_pool, so we do not need to suspend the tree and we just send messages.	2012-01-25 09:13:10 +00:00
Zdenek Kabelac	efc8ca105d	Thin add support for origin_only suspend of thin volumes Pass in the origin_only flag also for thin volumes - but currently the flag is not used to its best. FIXME: achieve the state where only thin volume snapshot origin is suspended without its children - let's explore whether this may happen automatically inside libdm (might be generic for other targets). So the code would not need to annotate the node for this.	2012-01-25 09:10:13 +00:00
Zdenek Kabelac	78c3b21bfa	Thin add messages only for activation tree Extend lv_activate_opts with bool flag to know for which purpose dtree is created - and add message only for activation tree (since that's the only place that may send them). Extend validation check for thin snapshot creation and test whether active snapshot origin is suspended before its snapshot is created (useful in recover scenarios) - in this case also detect, whether transaction has been already completed and avoid such suspend check failure in that case.	2012-01-25 09:06:43 +00:00
Zdenek Kabelac	3c4be983d5	lv_info using -real layer only for origin_only LV If the origin_only flag is passed for non lv_is_origin LVs, the extension is not added. Thin volumes may also use origin_only flag.	2012-01-25 09:00:18 +00:00
Zdenek Kabelac	5c8b148605	Comment cleanups Move comment where it applies and remove unused attribute when the var is actually used.	2012-01-25 08:51:29 +00:00
Zdenek Kabelac	bdba904d7c	Thin add lv_thin_pool_transaction_id Easy function to get transaction_id status value.	2012-01-25 08:48:42 +00:00
Jonathan Earl Brassow	d5617bccab	Fix the way RAID meta LVs are added to the dependency tree. Similar to the "mirror" segment type's log device, _add_dev_to_dtree should be called and not _add_lv_to_dtree when adding metadata sub-LVs to the deptree. Since _add_lv_to_dtree was being called, 'origin_only' could be set if a snapshot sits on top of the RAID device. This would cause the actual device that needed to be added to be skipped in favor of the non-existent device, "<foo>-real".	2012-01-23 20:56:42 +00:00
Alasdair Kergon	f5bfc8b10d	Attempt to improve clustered 'lvchange -aey' behaviour to try local node before remote nodes and address some existing anomalies.	2012-01-21 05:29:51 +00:00
Mike Snitzer	23e34c729b	Differentiate between snapshot status of "Invalid" and "Merge failed".	2012-01-20 22:02:04 +00:00
Mike Snitzer	861c624acb	Lookup snapshot usage percent of origin when a snapshot is merging.	2012-01-20 21:56:01 +00:00
Alasdair Kergon	fd7d09e39a	improve comment	2012-01-20 03:46:52 +00:00
Jonathan Earl Brassow	25d1410592	Preserve exclusive activation of cluster mirror when converting. This patch to the suspend code - like the similar change for resume - queries the lock mode of a cluster volume and records whether it is active exclusively. This is necessary for suspend due to the possibility of preloading targets. Failure to check to exclusivity causes the cluster target of an exclusively activated mirror to be used when converting - rather than the single machine target.	2012-01-20 00:27:18 +00:00
Zdenek Kabelac	76ee08995e	Thin add function to read thin volume percent This value returns percentage of 'mapped' size compared with total LV size. (Without passed seg pointer it return highest mapped size - but it's not used yet.)	2012-01-19 15:27:54 +00:00
Zdenek Kabelac	6336898318	Thin updated support for thin pool percent Support to check also for metadata percent (By checking whether seg pointer is set)	2012-01-19 15:25:37 +00:00
Zdenek Kabelac	d8106dfee2	Thin rename seg var pool_metadata_lv to metadata_lv Better fits the code.	2012-01-19 15:23:50 +00:00
Zdenek Kabelac	64e353daec	Thin rename local static Use '_' for local const char.	2012-01-19 15:19:18 +00:00
Peter Rajnoha	5d5c80ace7	Missing const. "warning: assignment discards 'const' qualifier..."	2012-01-12 09:08:55 +00:00
Alasdair Kergon	a18dcfb533	Add activation/read_only_volume_list to override LV permission in metadata.	2012-01-12 01:51:56 +00:00
Zdenek Kabelac	34507894e9	Thin add lv_thin_pool_percent	2011-12-21 13:10:05 +00:00
Zdenek Kabelac	c0fcaacb8d	Thin add dev_manager_thin_pool_percent dev manager function to read percent info from thin pool.	2011-12-21 13:09:33 +00:00
Zdenek Kabelac	2bc1d7598e	Thin add dmeventd support This is basic version with still few unresolved issue mainly in case, when the pool resize is failing.	2011-12-21 13:08:11 +00:00
Zdenek Kabelac	d3b4a0f322	Check lv pointer for NULL before dereference.	2011-12-21 12:59:22 +00:00
Zdenek Kabelac	0d59090eaf	Thin move layer suffix into local static const	2011-12-21 12:55:22 +00:00
Alasdair Kergon	8dd6036da4	Add activation/use_linear_target enabled by default. (prajnoha) LVM metadata knows only of striped segments - not linear ones. The activation code detects segments with a single stripe and switches them to use the linear target. If the new lvm.conf setting is set to 0 (e.g. in a test script), this 'optimisation' is turned off.	2011-11-28 20:37:51 +00:00
Zdenek Kabelac	647c8edf82	Drop pool memory allocated in lv_has_target_type Remove FIXMES - there should not be any pool free call since the memory pool is from device manager, and pool is destroyed after the operation, so doing extra free here would not help here. However lv_has_target_type() is using cmd mempool so here the extra call for dm_pool_free makes sense.	2011-11-18 19:42:03 +00:00
Zdenek Kabelac	900f5f8187	Replace dynamic buffer allocations for PATH_MAX Use static buffer instead of stack allocated buffer. This reduces stack size usage of lvm tool and the change is very simple. Since the whole library is not thread safe - it should not add any new problems - and if there will be some conversion it's easy to convert this to use some preallocated buffer.	2011-11-18 19:31:09 +00:00
Zdenek Kabelac	3de08fc9de	Thin clean Reuse seg pointer already set in _add_lv_to_dtree to have the value of first_seg(lv) (and is used in other parts of this function).	2011-11-15 17:25:05 +00:00
Zdenek Kabelac	ed2368538a	Simplify iteration Since nothing is removed in dm_list snapshot_segs during the loop, there is no reason to use _safe iteration, so switch to simpler dm_list_iterate().	2011-11-15 17:21:02 +00:00
Zdenek Kabelac	8ec016236a	Thin fix tpool layer Since we support snapshots of thin volumes, we could have more layers, so we have to check whether tpool layer is going to be inserted. As the _add_segment_to_dtree() is the only place that adds tpool segment, we may just check pointer (no strcmp for layer). Switch to use seg_is_ function instead of lv_is_.	2011-11-15 17:15:03 +00:00
Milan Broz	a3390bb507	Remove unneeded parameter.	2011-11-11 16:41:37 +00:00
Milan Broz	d1b36fbe7f	Fix function name in previous patch.	2011-11-11 15:14:05 +00:00
Milan Broz	07113beea3	Do not scan device if it is part of active multipath. Add filter which tries to check if scanned device is part of active multipath. Firstly, only SCSI major number devices are handled in filter. Then it checks if device has exactly one holder (in sysfs) and if it is device-mapper device and DM-UUID is prefixed by "MPATH-". If so, this device is filtered out. The whole filter can be switched off by setting mpath_component_detection in lvm.conf. https://bugzilla.redhat.com/show_bug.cgi?id=597010 Signed-off-by: Milan Broz <mbroz@redhat.com>	2011-11-11 15:11:08 +00:00
Zdenek Kabelac	87371d48cc	Thin revert code for exclusive pool activation There are no limits on thin-pool activation now. Revert code that is no longer needed.	2011-11-07 10:58:13 +00:00
Zdenek Kabelac	a0c4e85c48	Add -tpool layer in activation tree Let's put the overlay device over real thin pool device. So we can get the proper locking on cluster. Otherwise the pool LV would be activated once implicitly and in other case explicitly, confusing locking mechanism. This patch makes the activation of pool LV independent of activation of thin LV since they will both implicitly use real -thin pool device.	2011-11-03 14:52:09 +00:00
Zdenek Kabelac	5cc2f9a257	Avoid creation of /dev/vg/thinpool	2011-10-28 20:34:45 +00:00
Zdenek Kabelac	a1d5aaf725	Thin pool activation change To ensure we properly handle LV cluster locking - explicitly do not allow to change the availability of the thin pool that is in use for some thin LV. As soon as the thin volume is created the only way to activate pool is via implicit dependency. Ignore thinpool open count for lv/vgchange operations.	2011-10-28 20:28:00 +00:00
Zdenek Kabelac	92cdc25882	Drop messages from lvm app context (revert) Thinp target uses activation context.	2011-10-17 14:18:07 +00:00
Zdenek Kabelac	7f815706ca	Fix lv_info open_count test When verify_udev_operations was disabled, code for stacking fs operation for lvm links was completely disabled - but this code was also used for collecting information, that a new node is being created. Add a new flag which is set when a creation of lv symlinks is requested which should restore old behaviour of lv_info function, that has called fs_sync() before query for open count on device.	2011-10-14 13:23:47 +00:00
Zdenek Kabelac	7a6600b148	Use constant for the repeated dlid size specification	2011-10-11 10:02:28 +00:00
Zdenek Kabelac	df251f14dc	Use shorter way for if()	2011-10-11 09:03:33 +00:00
Zdenek Kabelac	3df790d9fd	Skip backtrace after log_error	2011-10-11 09:02:20 +00:00
Zdenek Kabelac	2abe28a8c6	Replace with debug Since the dm_tree_create already reports reason of error, use log_debug for this message.	2011-10-11 09:01:38 +00:00
Zdenek Kabelac	de75bc6688	Improve backtrace reporting Add <backtrace> so the function appears logged for the fail path.	2011-10-11 08:59:42 +00:00
Zdenek Kabelac	4007ac814f	Change message severity Using log_warn to report missing symlinks as warning, since the command itself returns as successful, we should not produce log_error(). log_warn is better fit here.	2011-10-11 08:57:13 +00:00
Zdenek Kabelac	409bf6e6d8	Skip r assignment Cosmetic, since r is already 0 for the error path, no need to assign it there, and r is assigned to 1 after switch command. Also makes the code more readable.	2011-10-11 08:54:01 +00:00
Jonathan Earl Brassow	b19f01212e	Fix splitmirror in cluster having different DM/LVM views of storage. This patch also does some clean-up of the splitmirrors code. I've attempted to clean-up the splitmirrors code to make it easier to understand with fewer operations. I've tried to reduce the number of metadata operations without compromising the intermediate stages which are necessary for easy clean-up in the event of failure. These changes now correctly handle cluster situations - including exclusive cluster mirrors. Whereas before, a splitmirror operation would result in remote nodes having LVM commands report the newly split LV with a proper name while DM commands would report the old (pre-split) names of the device. IOW, there was a kernel/userspace mismatch.	2011-10-06 14:55:39 +00:00
Jonathan Earl Brassow	83c606ae30	This patch fixes issues with improper udev flags on sub-LVs. The current code does not always assign proper udev flags to sub-LVs (e.g. mirror images and log LVs). This shows up especially during a splitmirror operation in which an image is split off from a mirror to form a new LV. A mirror with a disk log is actually composed of 4 different LVs: the 2 mirror images, the log, and the top-level LV that "glues" them all together. When a 2-way mirror is split into two linear LVs, two of those LVs must be removed. The segments of the image which is not split off to form the new LV are transferred to the top-level LV. This is done so that the original LV can maintain its major/minor, UUID, and name. The sub-lv from which the segments were transferred gets an error segment as a transitory process before it is eventually removed. (Note that if the error target was not put in place, a resume_lv would result in two LVs pointing to the same segment! If the machine crashes before the eventual removal of the sub-LV, the result would be a residual LV with the same mapping as the original (now linear) LV.) So, the two LVs that need to be removed are now the log device and the sub-LV with the error segment. If udev_flags are not properly set, a resume will cause the error LV to come up and be scanned by udev. This causes I/O errors. Additionally, when udev scans sub-LVs (or former sub-LVs), it can cause races when we are trying to remove those LVs. This is especially bad during failure conditions. When the mirror is suspended, the top-level along with its sub-LVs are suspended. The changes (now 2 linear devices and the yet-to-be-removed log and error LV) are committed. When the resume takes place on the original LV, there are no longer links to the other sub-lvs through the LVM metadata. The links are implicitly handled by querying the kernel for a list of dependencies. 
This is done in the '_add_dev' function (which is recursively called for each dependency found) - called through the following chain: _add_dev dm_tree_add_dev_with_udev_flags <* DM / LVM divide *> _add_dev_to_dtree _add_lv_to_dtree _create_partial_dtree _tree_action dev_manager_activate _lv_activate_lv _lv_resume lv_resume_if_active When udev flags are calculated by '_get_udev_flags', it is done by referencing the 'logical_volume' structure. Those flags are then passed down into 'dm_tree_add_dev_with_udev_flags', which in turn passes them to '_add_dev'. Unfortunately, when '_add_dev' is finding the dependencies, it has no way to calculate their proper udev_flags. This is because it is below the DM/LVM divide - it doesn't have access to the logical_volume structure. In fact, '_add_dev' simply reuses the udev_flags given for the initial device! This virtually guarantees the udev_flags are wrong for all the dependencies unless they are reset by some other mechanism. The current code provides no such mechanism. Even if '_add_new_lv_to_dtree' were called on the sub-devices - which it isn't - entries already in the tree are simply passed over, failing to reset any udev_flags. The solution must retain its implicit nature of discovering dependencies and be able to go back over the dependencies found to properly set the udev_flags. My solution simply calls a new function before leaving '_add_new_lv_to_dtree' that iterates over the dtree nodes to properly reset the udev_flags of any children. It is important that this function occur after the '_add_dev' has done its job of querying the kernel for a list of dependencies. It is this list of children that we use to look up their respective LVs and properly calculate the udev_flags. This solution has worked for single machine, cluster, and cluster w/ exclusive activation.	2011-10-06 14:45:40 +00:00
Zdenek Kabelac	a00cb3a6b0	Add lvm functions for sending messages. Functions are currently only needed for thin provisioning.	2011-10-03 18:37:47 +00:00
Zdenek Kabelac	87663d5f88	Add preload support for thin and thin_pool	2011-10-03 18:24:47 +00:00
Zdenek Kabelac	aebf2d5cdc	Add experimental code for activation of thinp targets No dm messages yet - just a base functionality in the steps of other targets. For now usable only for debugging and tracing.	2011-09-29 08:56:38 +00:00
Alasdair Kergon	10d0d9c7c4	Introduce revert_lv for better pvmove cleanup. (One further fix needed to remove the stray pvmove LVs left behind.)	2011-09-27 22:43:40 +00:00
Peter Rajnoha	c3e5b4976d	Add log_error even for general device in use when we can't do the sysfs checks.	2011-09-26 10:17:51 +00:00
Peter Rajnoha	9fa1d30a1c	Add activation/retry_deactivation to lvm.conf to retry deactivation of an LV.	2011-09-22 17:39:56 +00:00
Peter Rajnoha	125712bea0	Replace open_count check with holders/mounted_fs check on lvremove path. Before, we used to display "Can't remove open logical volume" which was generic. There 3 possibilities of how a device could be opened: - used by another device - having a filesystem on that device which is mounted - opened directly by an application With the help of sysfs info, we can distinguish the first two situations. The third one will be subject to "remove retry" logic - if it's opened quickly (e.g. a parallel scan from within a udev rule run), this will finish quickly and we can remove it once it has finished. If it's a legitimate application that keeps the device opened, we'll do our best to remove the device, but we will fail finally after a few retries.	2011-09-22 17:33:50 +00:00
Petr Rockai	e59e2f7c3c	Move the core of the lib/config/config.c functionality into libdevmapper, leaving behind the LVM-specific parts of the code (convenience wrappers that handle `struct device` and `struct cmd_context`, basically). A number of functions have been renamed (in addition to getting a dm_ prefix) -- namely, all of the config interface now has a dm_config_ prefix.	2011-08-30 14:55:15 +00:00
Jonathan Earl Brassow	6d04311efa	Add the ability to split an image from the mirror and track changes. ~> lvconvert --splitmirrors 1 --trackchanges vg/lv The '--trackchanges' option allows a user the ability to use an image of a RAID1 array for the purposes of temporary read-only access. The image can be merged back into the array at a later time and only the blocks that have changed in the array since the split will be resync'ed. This operation can be thought of as a partial split. The image is never completely extracted from the array, in that the array reserves the position the device occupied and tracks the differences between the array and the split image via a bitmap. The image itself is rendered read-only and the name (<LV>_rimage_*) cannot be changed. The user can complete the split (permanently splitting the image from the array) by re-issuing the 'lvconvert' command without the '--trackchanges' argument and specifying the '--name' argument. ~> lvconvert --splitmirrors 1 --name my_split vg/lv Merging the tracked image back into the array is done with the '--merge' option (included in a follow-on patch). ~> lvconvert --merge vg/lv_rimage_<n> The internal mechanics of this are relatively simple. The 'raid' device- mapper target allows for the specification of an empty slot in an array via '- -'. This is what will be used if a partial activation of an array is ever required. (It would also be possible to use 'error' targets in place of the '- -'.) If a RAID image is found to be both read-only and visible, then it is considered separate from the array and '- -' is used to hold its position in the array. So, all that needs to be done to temporarily split an image from the array /and/ cause the kernel target's bitmap to track (aka "mark") changes made is to make the specified image visible and read-only. To merge the device back into the array, the image needs to be returned to the read/write state of the top-level LV and made invisible.	2011-08-18 19:38:26 +00:00
Jonathan Earl Brassow	2100c90dd7	Add missing checks for function return codes. Some functions were being called without having their return values checked.	2011-08-11 19:38:00 +00:00
Jonathan Earl Brassow	4aebd52c4c	Add ability to down-convert RAID1 arrays. Also, add some simple RAID tests to testsuite.	2011-08-11 18:24:40 +00:00
Jonathan Earl Brassow	ff58e019d8	Add RAID metadata devices to considered devices in _add_lv_to_dtree. _add_lv_to_dtree must also add RAID metadata devices.	2011-08-11 04:18:17 +00:00
Zdenek Kabelac	077a6755ff	Replace free_vg with release_vg Move the free_vg() to vg.c and replace free_vg with release_vg and make the _free_vg internal. Patch is needed for sharing VG in vginfo cache so the release_vg function name is a better fit here.	2011-08-10 20:25:29 +00:00
Jonathan Earl Brassow	cac52ca4ce	Add basic RAID segment type(s) support. Implementation described in doc/lvm2-raid.txt. Basic support includes: - ability to create RAID 1/4/5/6 arrays - ability to delete RAID arrays - ability to display RAID arrays Notable missing features (not included in this patch): - ability to clean-up/repair failures - ability to convert RAID segment types - ability to monitor RAID segment types	2011-08-02 22:07:20 +00:00
Alasdair Kergon	a73e9a6cfa	Need to snapshot lookup by uuid instead of name in case it's renamed.	2011-07-08 15:35:50 +00:00
Alasdair Kergon	ee840ff14c	Move snapshot deactivation logic into lib/activate, fixing the teardown sequence. (Previously the snapshot was deactivated while its origin was active and before its removal was committed to disk, so restarting after a crash at the point would leave corruption.)	2011-07-08 12:48:41 +00:00
Alasdair Kergon	f5f3defc02	Cope with a PV only discovered missing when creating deptree.	2011-07-06 00:29:44 +00:00
Alasdair Kergon	86b15c7c90	Abort operation if dm_tree_node_add_target_area fails.	2011-07-05 23:10:14 +00:00
Alasdair Kergon	3a8eb3870e	Always perform preload logic before suspending - not only in the case when we have precommitted metadata. (Necessary to avoid loading tables while suspend in lvchange --refresh.)	2011-07-05 18:36:37 +00:00
Alasdair Kergon	2aef1b08f0	Snapshots LVs are never loaded in their own right, only along with their origin.	2011-07-05 01:08:42 +00:00
Alasdair Kergon	b5750a61f1	Fix conditions using no_merging: only those using lv_is_merging_cow() should have been converted, not pure lv_is_cow ones. (Merging has no impact on how the pre-merged cow segment itself is loaded.)	2011-07-05 01:01:19 +00:00
Alasdair Kergon	fbbd54d123	reinstate accidentally-removed lines to fix pvmove again	2011-07-04 14:56:58 +00:00
Alasdair Kergon	2243718fae	Add framework for validation of ioctls. Doesn't do any checks yet. dmsetup --checks libdevmapper: dm_task_enable_checks() lvm.conf: activation/checks=1	2011-07-01 14:09:19 +00:00
Alasdair Kergon	0f2a4ca2b5	When suspending, automatically preload newly-visible existing LVs Let's find out if this makes things better or worse overall...	2011-06-30 18:25:18 +00:00
Jonathan Earl Brassow	9e277b9e2c	Fix issue preventing cluster mirror creation. Mirrors used to be created by first creating a linear device and then adding the other images plus the log. Now mirrors are created by creating all the images in one go and then adding the log separately. The new way ran into the condition that cluster mirrors cannot change the log type (in the case of creation, from core -> disk) while the mirror is not active. (It isn't active because it is in the process of being created.) The reason this condition is in place is because a remote node may have the mirror active, and we don't want to alter the log underneath it. What we really needed was a way of checking if the mirror was active remotely but not locally, and in that case do not allow a change of the log. I've added this check, and cluster mirrors can now be created again.	2011-06-22 21:31:21 +00:00
Peter Rajnoha	418663b61c	Disable udev fallback by default and add activation/udev_fallback to lvm.conf. We've used udev fallback code till now to check whether udev created/removed the entries in /dev correctly and if not, a repair was done (giving a warning message about that). This patch adds a possibility to enable this additional check and subsequent fallback only when required (debugging purposes mostly) and trust udev completely. So let's disable the fallback code by default and add a new configuration option "activation/udev_fallback". (The original code for creating the nodes will still be used in case the device directory that is set in lvm.conf differs from the one that udev uses and also when activation/udev_rules is set to 0 - otherwise we would end up with no nodes/symlinks at all)	2011-06-17 14:50:53 +00:00
Zdenek Kabelac	93a98c2672	Remove unused internal flag ACTIVATE_EXCL from the code	2011-06-17 14:30:58 +00:00
Zdenek Kabelac	f3d8974dc9	Add couple FIXMEs around suspicious code	2011-06-17 14:24:18 +00:00
Zdenek Kabelac	c6168a14c9	Use lv_activate_opts struct instead of ACTIVATE_EXCL status flag Let's hope all conditions have been properly converted.	2011-06-17 14:22:48 +00:00
Zdenek Kabelac	81beded3af	Add lv_activate_opts structure To avoid modification of 'read-only' volume group structure add a new structure to pass local data around the code for LV activation. As origin_only is one such flag - replace this parameter with new struct lv_activate_opts. More parameters might eventually become part of lv_activate_opts.	2011-06-17 14:14:19 +00:00
Alasdair Kergon	7df72b3c88	Fix last snapshot removal to avoid table reload while a device is suspended.	2011-06-13 22:28:04 +00:00
Alasdair Kergon	df390f1799	Major pvmove fix to issue ioctls in the correct order when multiple LVs are affected by the move. (Currently it's possible for I/O to become trapped between suspended devices amongst other problems. The current fix was selected so as to minimise the testing surface. I hope eventually to replace it with a cleaner one that extends the deptree code. Some lvconvert scenarios still suffer from related problems.	2011-06-11 00:03:06 +00:00
Zdenek Kabelac	a1eba521e3	Fix some unmatching sign comparison gcc warnings Simple replacement for unsigned type - usually in for() loops.	2011-04-08 14:40:18 +00:00
Zdenek Kabelac	aaf92617b0	Fix -Wold-style-definition gcc warnings	2011-03-29 20:30:05 +00:00
Milan Broz	52a5cd31c4	Mitigate some warnings if running as non-root user. LVM doesn't behave correctly if running as non-root user, there is warning when it detects it. Despite this, it produces many error messages, saying nothing. See https://bugzilla.redhat.com/show_bug.cgi?id=620571 This patch fixes two things: 1) Removes error message from device_is_usable() which has no information value anyway (real warning is printed inside it). 2) it fixes device-mapper initialization, if we support core dm module autoload and device node is present, it should fail early and not try recreate existing and correct node. (non-root == permission denied here) N.B. In future code should support user roles, some more drastic checks in code are probably counterproductive now.	2011-03-18 12:17:57 +00:00
Zdenek Kabelac	36653e8903	Add fall through comments Add comments to switch case construct.	2011-02-28 19:53:03 +00:00
Zdenek Kabelac	aec2115410	Const fixing Fixing some const warnings - with API change in: int vg_extend(struct volume_group vg, int pv_count, const char const pv_names, Change is needed - as lvm2api expects const behaviour here. So vg_extend() is doing local strdup for unescaping. skip_dev_dir return const char from const char* vg_name. Rest of the patch is cleanup of related warnings. Also using dm_report_filed_string() API change to simplify casting in _string_disp and _lvname_disp.	2011-02-18 14:47:28 +00:00
Zdenek Kabelac	ab8b85fb80	Fix !DEVMAPPER_SUPPORT build Fix build when devmapper is disabled.	2011-02-18 14:29:39 +00:00

1043 Commits