shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
David Teigland	4670e9f698	skip some clvmd-specific code in common cases This, or something like it, can probably be done in many other places.	2018-04-25 16:40:08 -05:00
David Teigland	47bfac21ca	clvmd: skip dev rescan after full scan When clvmd does a full label scan just prior to calling _vg_read(), pass a new flag into _vg_read to indicate that the normal rescan of VG devs is not needed.	2018-04-25 16:39:43 -05:00
David Teigland	1fec86571f	clvmd: reuse a vg struct for sequential LV operations After reading a VG, stash it in lvmcache as "saved_vg". Before reading the VG again, try to use the saved_vg. The saved_vg is dropped on VG lock operations.	2018-04-25 16:39:43 -05:00
David Teigland	f8616ac2d8	lvmcache: rename suspended_vg to saved_vg The copy of the VG which clvmd stashes in lvmcache should not only be used between suspend and resume, but between sequential LV operations in clvmd, so that clvmd does not need to reread the VG for each one. Prepare for that by renaming the stashed VG as "saved_vg".	2018-04-25 16:39:43 -05:00
Zdenek Kabelac	c492fbb51c	debug: more explanatory error message	2018-04-23 22:42:18 +02:00
Zdenek Kabelac	fcdac700f9	gcc: remove duplicate typedef	2018-04-23 22:42:18 +02:00
David Teigland	1409c4a1c2	clvm: rescan when VG or PV not found Rescan devices to update lvmcache content when clvmd vg_read doesn't find a VG or PV.	2018-04-20 16:09:49 -05:00
David Teigland	aee27dc7ba	scan: skip device rescan in vg_read For reporting commands (pvs,vgs,lvs,pvdisplay,vgdisplay,lvdisplay) we do not need to repeat the label scan of devices in vg_read if they all had matching metadata in the initial label scan. The data read by label scan can just be reused for the vg_read. This cuts the amount of device i/o in half, from two reads of each device to one. We have to be careful to avoid repairing the VG if we've skipped rescanning. (The VG repair code is very poor, and will be redone soon.)	2018-04-20 11:23:14 -05:00
David Teigland	aa833bdd8a	bcache: intercept test mode before write Don't allow writes in test mode. test mode should be more sophisticated than just faking writes, and this should be a last defense for cases where test mode is not being checked correctly.	2018-04-20 11:22:48 -05:00
David Teigland	9b6a62f944	lvmcache: simplify Recent changes allow some major simplification of the way lvmcache works and is used. lvmcache_label_scan is now called in a controlled fashion at the start of commands, and not via various unpredictable side effects. Remove various calls to it from other places. lvmcache_label_scan should not be called from anywhere during a command, because it produces an incorrect representation of PVs with no MDAs, and misclassifies them as orphans. This has been a long standing problem. The invalid flag and rescanning based on that is no longer used and removed. The 'force' variation is no longer needed and removed.	2018-04-20 11:22:48 -05:00
David Teigland	c0973e70a5	dev_cache: clean up scan Pull out all of the twisted logic and simply call dev_cache_scan at the start of the command prior to label scan.	2018-04-20 11:22:48 -05:00
David Teigland	45e5e702c1	scan: improve io error checking and reporting	2018-04-20 11:22:48 -05:00
David Teigland	6d05859862	bcache: let caller see an error	2018-04-20 11:22:48 -05:00
David Teigland	ae21305ee7	scan: drop bcache between lvm shell commands A running lvm shell keeps all lvm devices open unless the bcache is dropped.	2018-04-20 11:22:48 -05:00
David Teigland	a9b0aa5c17	lvmetad: more fixes related to bcache Need to open devs prior to bcache io.	2018-04-20 11:22:48 -05:00
David Teigland	e351f8bc66	lvmetad: need to set up bcache in another place We need to find one common place to set up bcache for the lvmetad case, instead of adding calls in various places.	2018-04-20 11:22:48 -05:00
David Teigland	ddb5de7a98	clvm: fix bcache scan handling We can't let clvmd keep all scanned devs open, which prevents them from being removed. So drop the bcache data (and close fds) after doing a label scan. Also set up bcache before the clvm-specific vg_read (which needs to rescan the vg's devs using bcache) and destroy the bcache after.	2018-04-20 11:22:48 -05:00
David Teigland	196579af1f	scan: check for errors in text layer The scanning code in the format_text layer has previously ignored errors. Start checking for and returning them.	2018-04-20 11:22:47 -05:00
David Teigland	44726ed9cb	scan: remove lvmcache info for failed devs When scanning a device fails, drop an lvmcache info struct for it.	2018-04-20 11:22:47 -05:00
David Teigland	1717d4cb17	lvmcache: add shorter way to delete dev info Don't make the caller look up the info first.	2018-04-20 11:22:47 -05:00
David Teigland	570c6239ee	bcache: fix error handling The error handling code wasn't working, but it appears that just removing it is what we need. The caller doesn't really need any different behavior related to bcache blocks on an io error, it just wants to know if there was an error.	2018-04-20 11:22:47 -05:00
David Teigland	217f3f8741	scan: add function to drop bcache blocks which can be a little more efficient than destroy.	2018-04-20 11:22:47 -05:00
David Teigland	da2b155a9d	scan: invalidate bcache for dev after errors If there are errors reading or writing dev, invalidate bcache for it.	2018-04-20 11:22:47 -05:00
David Teigland	4331182964	bcache: add some error messages for debugging	2018-04-20 11:22:47 -05:00
David Teigland	21057676a1	scan: create bcache with minimum number of blocks In some odd cases (e.g. tests) there are very few devices which results in creating too few blocks in bcache, so create bcache with a minimum number of blocks.	2018-04-20 11:22:47 -05:00
David Teigland	e49b114f7e	bcache: use wrappers for bcache read write in lvm Using a wrapper makes it easier to disable bcache if needed.	2018-04-20 11:22:47 -05:00
David Teigland	8065492046	bcache: do all writes through bcache	2018-04-20 11:22:47 -05:00
David Teigland	8b26a007b1	misc bcache fixes from ejt	2018-04-20 11:22:47 -05:00
David Teigland	0da296003d	vgchange: invalidate bcache for stacked LVs when deactivating An LV with a stacked PV will be open in bcache and needs to be invalidated to close the fd before attempting to deactivate.	2018-04-20 11:22:47 -05:00
David Teigland	c2b10daf69	scan: put dev back on caller's list Commit 6e442875613915e506440e59a290b56756df2521 missed adding devs back to caller's list.	2018-04-20 11:22:47 -05:00
David Teigland	e7670d3338	pvck: use bcache	2018-04-20 11:22:47 -05:00
David Teigland	b504bb809e	scan: use 128K bcache block size	2018-04-20 11:22:46 -05:00
David Teigland	28255e3eee	scan: always setup bcache for commands using lvmetad Do this at the start of the command so that it doesn't need to be checked and set up in every function that could need it.	2018-04-20 11:22:46 -05:00
David Teigland	f328532f05	scan: leave the caller's dev list unchanged When scanning the list of devs from the caller they are moved to another temporary list, but were never returned to the original list.	2018-04-20 11:22:46 -05:00
David Teigland	7bce66c5e8	scan: setup bcache for commands using lvmetad Commands using lvmetad will not begin with a proper label_scan which initializes bcache, but may later decide they need to scan a set of devs, in which case they'll need bcache set up at that point.	2018-04-20 11:22:46 -05:00
David Teigland	6e580465b5	vgremove: fix force remove on devs with damaged metadata The improved detection of bad metadata when scanning (where errors were ignored before) means we now have to override some errors when forcibly erasing damaged metadata.	2018-04-20 11:22:46 -05:00
David Teigland	37471bb477	scan: skip extra scan in vg_read Drop an extra label scan in the recovery part of vg_read. This is a temporary improvement until the pending replacement for the broken recovery code buried in vg_read.	2018-04-20 11:22:46 -05:00
David Teigland	e4f478d86d	scan: handle request to scan missing dev	2018-04-20 11:22:46 -05:00
David Teigland	89f54a5094	remove debugging print	2018-04-20 11:22:46 -05:00
David Teigland	a1e3398ffc	scan: handle no devices Still create bcache.	2018-04-20 11:22:46 -05:00
David Teigland	9d2add1361	scan: add a dev to bcache before each read to handle write path This is a temporary hacky workaround to the problem of reads going through bcache and writes not using bcache. The write path wants to read parts of data that it is incrementally writing to disk, but the reads (using bcache) don't work because the writes are not in the bcache. For now, add a dev to bcache before each attempt to read it in case it's being used on the write path.	2018-04-20 11:22:46 -05:00
David Teigland	6c67c7557c	scan: use separate fd for bcache Create a new dev->bcache_fd that the scanning code owns and is in charge of opening/closing. This prevents other parts of lvm code (which do various open/close) from interfering with the bcache fd. A number of dev_open and dev_close are removed from the reading path since the read path now uses the bcache. With that in place, open(O_EXCL) for pvcreate/pvremove can then be fixed. That wouldn't work previously because of other open fds.	2018-04-20 11:22:46 -05:00
David Teigland	f17c2cf7c6	pvremove: device check doesn't require label_read It just needs to check if the device was found during the scan, which means checking if it exists in lvmcache.	2018-04-20 11:22:45 -05:00
David Teigland	29c6c17121	format-text.c log message fixes	2018-04-20 11:22:45 -05:00
David Teigland	d9a77e8bb4	lvmcache: simplify metadata cache The copy of VG metadata stored in lvmcache was not being used in general. It pretended to be a generic VG metadata cache, but was not being used except for clvmd activation. There it was used to avoid reading from disk while devices were suspended, i.e. in resume. This removes the code that attempted to make this look like a generic metadata cache, and replaces it with something narrowly targeted to what it's actually used for. This is a way of passing the VG from suspend to resume in clvmd. Since in the case of clvmd one caller can't simply pass the same VG to both suspend and resume, suspend needs to stash the VG somewhere that resume can grab it from. (resume doesn't want to read it from disk since devices are suspended.) The lvmcache vginfo struct is used as a convenient place to stash the VG to pass it from suspend to resume, even though it isn't related to the lvmcache or vginfo. These suspended_vg* vginfo fields should not be used or touched anywhere else, they are only to be used for passing the VG data from suspend to resume in clvmd. The VG data being passed between suspend and resume is never modified, and will only exist in the brief period between suspend and resume in clvmd. suspend has both old (current) and new (precommitted) copies of the VG metadata. It stashes both of these in the vginfo prior to suspending devices. When vg_commit is successful, it sets a flag in vginfo as before, signaling the transition from old to new metadata. resume grabs the VG stashed by suspend. If the vg_commit happened, it grabs the new VG, and if the vg_commit didn't happen it grabs the old VG. The VG is then used to resume LVs. This isolates clvmd-specific code and usage from the normal lvm vg_read code, making the code simpler and the behavior easier to verify.
Sequence of operations: - lv_suspend() has both vg_old and vg_new and stashes a copy of each onto the vginfo: lvmcache_save_suspended_vg(vg_old); lvmcache_save_suspended_vg(vg_new); - vg_commit() happens, which causes all clvmd instances to call lvmcache_commit_metadata(vg). A flag is set in the vginfo indicating the transition from the old to new VG: vginfo->suspended_vg_committed = 1; - lv_resume() needs either vg_old or vg_new to use in resuming LVs. It doesn't want to read the VG from disk since devices are suspended, so it gets the VG stashed by lv_suspend: vg = lvmcache_get_suspended_vg(vgid); If the vg_commit did not happen, suspended_vg_committed will not be set, and in this case, lvmcache_get_suspended_vg() will return the old VG instead of the new VG, and it will resume LVs based on the old metadata.	2018-04-20 11:22:45 -05:00
David Teigland	79c4971210	label_scan: remove extra label scan and read for orphan PVs When process_each_pv() calls vg_read() on the orphan VG, the internal implementation was doing an unnecessary lvmcache_label_scan() and two unnecessary label_read() calls on each orphan. Some of those unnecessary label scans/reads would sometimes be skipped due to caching, but the code was always doing at least one unnecessary read on each orphan. The common format_text case was also unnecessarily calling into the format-specific pv_read() function which actually did nothing. By analyzing each case in which vg_read() was being called on the orphan VG, we can say that all of the label scans/reads in vg_read_orphans are unnecessary: 1. reporting commands: the information saved in lvmcache by the original label scan can be reported. There is no advantage to repeating the label scan on the orphans a second time before reporting it. 2. pvcreate/vgcreate/vgextend: these all share a common implementation in pvcreate_each_device(). That function already rescans labels after acquiring the orphan VG lock, which ensures that the command is using valid lvmcache information.	2018-04-20 11:22:45 -05:00
David Teigland	e3e5beec74	lvmetad: use new label_scan for update from pvscan Take advantage of the common implementation with aio and reduced disk reads.	2018-04-20 11:22:43 -05:00
David Teigland	9c71fa0214	lvmetad: use new label_scan for update from lvmlockd When lvmlockd indicates that the lvmetad cache is out of date because of changes by another node, lvmetad_pvscan_vg() rescans the devices in the VG to update lvmetad. Use the new label_scan in this function to use the common code and take advantage of the new aio and reduced reads.	2018-04-20 11:21:41 -05:00
David Teigland	098c843c50	independent metadata areas: fix bogus code Fix mixing bitwise & and logical && which was always 1 in any case.	2018-04-20 11:21:41 -05:00
David Teigland	d9ef9eb330	label_scan: fix independent metadata areas This fixes the use of lvmcache_label_rescan_vg() in the previous commit for the special case of independent metadata areas. label scan is about discovering VG name to device associations using information from disks, but devices in VGs with independent metadata areas have no information on disk, so the label scan does nothing for these VGs/devices. With independent metadata areas, only the VG metadata found in files is used. This metadata is found and read in vg_read in the processing phase. lvmcache_label_rescan_vg() drops lvmcache info for the VG devices before repeating the label scan on them. In the case of independent metadata areas, there is no metadata on devices, so the label scan of the devices will find nothing, so will not recreate the necessary vginfo/info data in lvmcache for the VG. Fix this by setting a flag in the lvmcache vginfo struct indicating that the VG uses independent metadata areas, and label rescanning should be skipped. In the case of independent metadata areas, it is the metadata processing in the vg_read phase that sets up the lvmcache vginfo/info information, and label scan has no role.	2018-04-20 11:21:41 -05:00
David Teigland	748f29b42a	scan: do scanning at the start of a command Move the location of scans to make it clearer and avoid unnecessary repeated scanning. There should be one scan at the start of a command which is then used through the rest of command processing. Previously, the initial label scan was called as a side effect from various utility functions. This would lead to it being called unnecessarily. It is an expensive operation, and should only be called when necessary. Also, this is a primary step in the function of the command, and as such it should be called prominently at the top level of command processing, not as a hidden side effect of a utility function. lvm knows exactly where and when the label scan needs to be done. Because of this, move the label scan calls from the internal functions to the top level of processing. Other specific instances of lvmcache_label_scan() are still called unnecessarily or unclearly by specific commands that do not use the common process_each functions. These will be improved in future commits. During the processing phase, rescanning labels for devices in a VG needs to be done after the VG lock is acquired in case things have changed since the initial label scan. This was being done by way of rescanning devices that had the INVALID flag set in lvmcache. This usually approximated the right set of devices, but it was not exact, and obfuscated the real requirement. Correct this by using a new function that rescans the devices in the VG: lvmcache_label_rescan_vg(). Apart from being inexact, the rescanning was extremely well hidden. _vg_read() would call ->create_instance(), _text_create_text_instance(), _create_vg_text_instance() which would call lvmcache_label_scan() which would call _scan_invalid() which repeats the label scan on devices flagged INVALID. lvmcache_label_rescan_vg() is now called prominently by _vg_read() directly.	2018-04-20 11:21:38 -05:00
David Teigland	4507ba3596	scan: use new label_scan for lvmcache_label_scan To do label scanning, lvm code calls lvmcache_label_scan(). Change lvmcache_label_scan() to use the new label_scan() based on bcache. Also add lvmcache_label_rescan_vg() which calls the new label_scan_devs() which does label scanning on only the specified devices. This is for a subsequent commit and is not yet used.	2018-04-20 11:19:32 -05:00
David Teigland	a7cb76ae94	scan: use bcache for label scan and vg read New label_scan function populates bcache for each device on the system. The two read paths are updated to get data from bcache. The bcache is not yet used for writing. bcache blocks for a device are invalidated when the device is written.	2018-04-20 11:19:24 -05:00
David Teigland	93fc937429	[device/bcache] bcache_read_bytes should put blocks	2018-04-20 11:12:50 -05:00
David Teigland	7be54bd687	[device/bcache] fix min() function	2018-04-20 11:12:50 -05:00
David Teigland	d9e6298edb	[device/bcache] fix missing max_io fn in bcache async engine	2018-04-20 11:12:50 -05:00
Joe Thornber	dc8034f5eb	[device/bcache] more work on bcache	2018-04-20 11:12:50 -05:00
Joe Thornber	6a57ed17a2	[device/bcache] add bcache_prefetch_bytes() and bcache_read_bytes() Not tested yet.	2018-04-20 11:12:50 -05:00
Joe Thornber	467adfa082	[device/bcache] More tests and some bug fixes	2018-04-20 11:12:50 -05:00
Joe Thornber	19647d1cd4	[device/bcache] fix bug in _alloc_block	2018-04-20 11:12:50 -05:00
Joe Thornber	1563b93691	[device/bcache] Add bcache_max_prefetches() Ignore prefetches if max io is in flight.	2018-04-20 11:12:50 -05:00
Joe Thornber	c4c4acfd42	[device/bcache] Add a couple of invalidate methods	2018-04-20 11:12:50 -05:00
Joe Thornber	0f0eb04edb	[device/bcache] some more work on bcache	2018-04-20 11:12:50 -05:00
Joe Thornber	46867a45d2	[device/bcache] stub a unit test	2018-04-20 11:12:50 -05:00
Joe Thornber	da7e13ef88	[lib/device/bcache] Tweaks after Kabi's review	2018-04-20 11:10:45 -05:00
Joe Thornber	acb42ec465	[device/bcache] Initial code drop. Compiles. Not written tests yet.	2018-04-20 11:10:45 -05:00
Joe Thornber	00f1b208a1	[io paths] Unpick agk's aio stuff	2018-04-20 11:03:58 -05:00
Zdenek Kabelac	73cda0437f	cleanup: correcting macro wrapping Use proper do {} while(0) so ';' after macros are correctly interpreted.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	9731d48691	cleanup: enhance debug message	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	d437bd86ff	cleanup: display_lvname update message Add more display_lvname usage. Update some error messages. Indent.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	7323557379	cleanup: add _mb_ to regionsize option As with the others, mention the default unit in the function name.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	e878c3fc32	cleanup: correct casting	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	27a1a0e5c0	cleanup: reorder condition There is no point to wait for sync for non-locally active LV.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	d81e3f9b06	mirror: use vg mempool Use vg mempool with mirror log metadata update.	2018-04-20 12:16:14 +02:00
Zdenek Kabelac	05f954ee9b	mirror: checking for mirror segtype Checking more correctly for mirror segtype here instead of mirrored one which can be also 'raid'.	2018-04-20 12:16:14 +02:00
Zdenek Kabelac	79d214032b	mirror: validate region_size for mirrors Check for region size properties of mirror segments.	2018-04-20 12:16:13 +02:00
Zdenek Kabelac	1693fef529	mirror: properly reload table for log init Since mirror can be stacked, we need to properly reload whole table stack, otherwise we may mishandle devices in dm table.	2018-04-20 12:15:36 +02:00
Zdenek Kabelac	55d83f9f6e	mirror: block_on_error only with monitoring When the user configured lvm2 to NOT use monitoring, an activated mirror would actually hang upon error, leaving it quite unusable. So instead warn those 'brave' non-monitoring users about the possible problem and activate the mirror without blocking error handling. This also makes it a bit simpler for the test suite to handle trouble cases when a test is running without dmeventd.	2018-04-20 12:13:51 +02:00
Zdenek Kabelac	66400d003d	mirror: fix region_size for clustered VG When adjusting region size for clustered VG it always needs to fit 2 full bitset into 1MB due to old limits of CPG. This is relatively big amount of bits, but we have still limitation for region size to fit into 32bits (0x8000000). So for too big mirrors this operation needs to fail - so whenever function returns now 0, it means we can't find matching region_size. Since return 0 is now 'error' we need to also pass proper region_size when creating pvmove mirror.	2018-04-20 12:13:48 +02:00
Zdenek Kabelac	a19456b868	mirror: fix calcs for maximal region_size Since extent_size is no longer power_of_2 this max region size evaluation was rather producing random bitsize as a combination of lowest bit from number of extents and extent size itself. Correct calculation to use whole LV size and pick biggest possible power of 2 value smaller than UINT32_MAX.	2018-04-20 12:13:08 +02:00
Zdenek Kabelac	91965af9b1	mirror: improve mirror log size estimation Drop mirrored mirror log limitation that applies only in very limited use-case and actually mirrored mirror log is deprecated anyway. So 'disk' mirror log is selecting the correct minimal size, and bigger size is only enforced with real mirrored mirror log. Also for mirrored mirror log we let use 'smaller' region size if needed so if user uses 1G region size, we still keep small mirror log with much smaller region size in this case when needed. Also mirror log extent calculation is now properly detecting error with too big mirrors where previously trimmed uint32_t was applied unintentionally.	2018-04-20 12:11:42 +02:00
Zdenek Kabelac	73189170f5	mirror: fix 32bit size calculation On 32bit arch size_t remains 4-byte wide - so size can't get correct result for multiplication of 32bit numbers.	2018-04-20 12:08:57 +02:00
Zdenek Kabelac	ff3ffe30e4	activation: add generic rule for visibility change Whenever we make visible LV out of previously invisible one, reload its table - this is mandatory for proper udev rule processing as well as to ensure content of dm table is correct. TODO: this new generic rule probably makes extra raid rules unnecessary.	2018-04-20 12:07:36 +02:00
Zdenek Kabelac	4e0c0417ce	cleanup: typo fix	2018-03-19 12:05:57 +01:00
Zdenek Kabelac	8d7ece126b	cache: disallow to combine format 2 with mq Only policy 'smq' is meant to be used with format version 2. Code used to let pass 'mq' policy also with format 2. But 'mq' is obsoleted with smq and kernel currently matches it. But this is incompatible with older original mq logic - so disallow creation of this rather useless combination.	2018-03-19 12:02:08 +01:00
Zdenek Kabelac	f4383a70ba	coverity: drop unused local static var	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	aa75e181be	coverity: drop unneeded header files	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	f2d0eefa77	coverity: make use of defined variable Since we declare 'r', let's use the value for something.	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	67fbe980a7	raid: fix version check of target Comparison missed to check patch level for matching minor version. However since all checked patchlevels were 0 - the fix doesn't change result.	2018-03-17 23:30:14 +01:00
Zdenek Kabelac	689af32313	pools: skip checks when tools are missing If the tools for checking thin_pool or cache metadata are missing, issue rather just a WARNING, but let the operation of activation continue. This has the advantage that if the user is missing those tools, but has already started to use thinpool or caching, he can access these volumes with a WARNING. Also if the user is using too old tools i.e. for CacheV2 format dmpd tool 0.7 is required - provide informative WARNING and skip failure from older tool version which can't understand new format V2.	2018-03-17 23:29:11 +01:00
Heinz Mauelshagen	d68d71013f	lvcreate: remove RaidLV on creation failure In case a newly created RaidLV is blacklisted using config "activation { volume_list = [ ... ] }" (i.e. its SubLVs stay inactive), the metadata SubLVs can't get wiped thus failing the creation. As a result, the RaidLV together with its SubLVs is left behind in an inconsistent state. Fix by removing the RaidLV and provide a hint about volume_list reasoning. Resolves: rhbz1161347	2018-03-16 15:57:53 +01:00
Zdenek Kabelac	9553dc7761	activation: separate prioritized counter While prioritized_section() based on raised priority works nicely for standard lvm commands - separate counter is actually needed when it's used in daemons like clvmd/dmeventd where priority stays raised all the time.	2018-03-15 12:30:45 +01:00
Zdenek Kabelac	285413b502	cleanup: missing dots and indent	2018-03-15 11:01:04 +01:00
Zdenek Kabelac	d794444715	activation: check for prioritized_section Detect we are in prioritized section instead of critical one, since these operations were supposed to NOT be happening during whole set of operation. This patch fixes verification of udev operations.	2018-03-15 11:01:04 +01:00
Zdenek Kabelac	6365f011b0	locking: introduce prioritized_section Introduce prioritized_section() as a closer match to previous logic of critical_section() that has been held over longer sequence of ioctl commands - essentially it's matching operation on a single cookie. While 'critical_section()' now corresponds to locked memory - we hold this memory only between suspend/resume thus notion of 'cookie' was lost. This patch restores some logic unintentionally lost with dropping memory locking for just activation/deactivation calls.	2018-03-15 10:59:42 +01:00
Zdenek Kabelac	70ad633638	devcache: add reason and always log_error With these read errors it's useful to know the reason. Also avoid to log error just once so we know exactly how many times we did failing read. On the other hand reduce repeated log_error() on code 'backtrace' path and change severity of message to just log_debug() so the actual read error is printed once for one read.	2018-03-15 10:50:28 +01:00
Zdenek Kabelac	e9cadbe105	cleanup: matching signedness	2018-03-13 12:58:57 +01:00
Zdenek Kabelac	29b2cfba06	mirror: correct locking for mirror log initialization The code was not acking proper lock holding LVs when trying to initialize mirror log to predefined values.	2018-03-13 12:58:27 +01:00
Zdenek Kabelac	1bd57b4c1d	scanning: skip more private devices Just like lvm2 has internal devices like _tdata which is using UUID with suffix, there is similar private type of device for crypto device where they are using CRYPT-TEMP uuid prefix. Also ignore stratis.	2018-03-13 12:57:33 +01:00
Zdenek Kabelac	e095586d9e	cleanup: use path on stack	2018-03-13 12:57:08 +01:00
Zdenek Kabelac	0edd89fadc	raid: skip frozen raid devices Some kernel versions suffer from bad state transition where a device steps into 'frozen' mode. Any application that tries to read such raid gets unfortunately blocked. As some sort of protection try to skip such raid device from being scanned to minimize chances to block lvm2 command on such scan. When such device is found, warning gets printed.	2018-03-13 12:57:01 +01:00
Zdenek Kabelac	a8a579b154	cleanup: all tests needs target_type Simplify code.	2018-03-13 12:53:59 +01:00
Heinz Mauelshagen	0646fd465e	dev_manager: always activate RAID SubLVs readwrite RaidLVs on read_only_volume_list have their SubLVs activated readonly thus disabling metadata updates or image resynchronization/recovery. Bug also causes automatic repairs to fail. Fix by always activating the RAID SubLVs readwrite. Resolves: rhbz1208269	2018-03-12 22:29:54 +01:00
Heinz Mauelshagen	dd88a0f05c	raid: support raid5_n convenience type on conversion to raid10 Fix requesting a conversion on raid5_{ls,rs,la,ra} -> raid10 not offering interim convenience type raid5_n. Resolves: rhbz1468600	2018-03-09 21:23:16 +01:00
Zdenek Kabelac	6cb2c35d16	cleanup: use log_warn These messages are not causing command failure thus turn them into warnings.	2018-03-08 10:40:27 +01:00
Zdenek Kabelac	ee37838b11	cache: fix lock usage for cache conversion Just like with lvcreate, this lvconvert case also need to properly check which LV actually holds lock for cached origin - as it might be i.e. thin-pool tdata subLV.	2018-03-08 10:39:47 +01:00
Zdenek Kabelac	7421252edc	snapshot: skip invalid snapshots When scanning DM device, skip automatically invalid snapshot devices. They behave just like 'error' device.	2018-03-08 10:39:44 +01:00
Zdenek Kabelac	a6fdb9d9d7	snapshot: keep COW writable for read-only volumes When snapshot is created in read-only mode with 'lvcreate -s -pr...', lvm2 still needs to be able to write to layered -cow volume to store metadata and exceptions blocks. TODO: in some case we might be able to do full tree with read-only volume but this probably needs further validation: 1. checking snapshot header already exist 2. origin & snapshot are both in read-only mode.	2018-03-08 10:39:03 +01:00
Zdenek Kabelac	eb3597acb3	activation: support proper /dev names for component LVs When LV is activated AS component LV - ensure there will be /dev/vgname/lvname link present for such LV.	2018-03-06 15:42:49 +01:00
Zdenek Kabelac	112846ce0b	activation: support activation of component LVs Occasionally users may need to peek into 'component' devices. Normally lvm2 does not let users activate components. This patch adds special mode where user can activate component LV in a 'read-only' mode i.e.: lvchange -ay vg/pool_tdata All devices can be deactivated with: lvchange -an vg | vgchange -an....	2018-03-06 15:42:46 +01:00
Zdenek Kabelac	6134a71a90	lvconvert: support for conversion with active component devices If component devices could be activated alone, ensure they are not breaking common commands. TODO: most likely this is not a definite list of all needed checks and more will come later.	2018-03-06 15:42:07 +01:00
Zdenek Kabelac	f92b6f9930	lvremove: ensure no subLV is active Since component activation is going to be enabled, ensure no subLV is active when we deactivate LV.	2018-03-06 15:42:07 +01:00
Zdenek Kabelac	73e93ef5e5	lvremove: validate removed component LV is not active This is the 'last' place where a LV is present in metadata. Any removed device should not be left active in dm table. So this check is an extra validation protection to capture any forgotten deactivation (adding 1 extra ioctl into lvremove path)	2018-03-06 15:42:07 +01:00
Zdenek Kabelac	ca9cbd92c4	activation: add base lv component function Introduce: lv_is_component() checks if LV is actually a component device. lv_component_is_active() checks if any component device is active. lv_holder_is_active() checks if any component holding device is active.	2018-03-06 15:42:05 +01:00
Zdenek Kabelac	6481471c9d	debug: update comment	2018-03-06 15:40:34 +01:00
Zdenek Kabelac	b6e7a0b490	cleanup: more usage of dm_strncpy Use existing wrapper function around strncpy + buf[] = 0;	2018-03-06 15:40:34 +01:00
Zdenek Kabelac	f04abd1f8a	lvremove: drop duplicate check for active LV Since this code branch already tested LV is active, avoid repeating same query.	2018-03-06 15:40:31 +01:00
Zdenek Kabelac	b2f1254c14	raid: move VG update after archiving happened Update of LV le_count needs to happen after archive().	2018-03-06 15:38:15 +01:00
Zdenek Kabelac	ce199db848	raid: fix error path for lv_raid_data_offset Avoid using allocated status on error path.	2018-03-06 15:36:11 +01:00
Zdenek Kabelac	406d6de651	cleanup: indent	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	16c209c613	cleanup: use lv_is_used_cache_pool Use lv_is_used_cache_pool() to simplify the code. Function was introduced later and this code missed to use it.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	e643de6e61	cleanup: explicitly ignore result code ATM too long prefix is silently ignored.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	805bf6ec74	cleanup: unused header file	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	6ba94fdd81	debug: change message severity Although it's internal issue - in this case command continues without any reported error - thus hide this internal error into debug.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	052f28746d	lvresize: check external origin with new size Instead of checking with existing size of external origin LV, use correctly the new 'wanted' size of this LV whether it fits the limitation requirements for older thin-pool target. Otherwise code started the resize, updated metadata and just failed during 'resize' in case the LV was active. For inactive LV operation could have actually passed.	2018-02-28 21:15:55 +01:00
Zdenek Kabelac	b09ea3b6f7	lvremove: drop unneeded check Checking here for cache_pool is not necessary and in effect the check is not even right - since there are internal states that do allow activating such LV.	2018-02-28 21:08:40 +01:00
Zdenek Kabelac	bc1adc32cb	lv_manip: enhance for_each_sub_lv Fix missing 'externalLV' traversing for thins with external origins. Replace extra for_each_sub_lv_except_pools() with better internal logic allowing selectively to cut off processed subLV tree. Extend error code for function 'fn()': when it returns -1 it will stop further tree scan for given LV. Also a bit simplify code to have only one place that is calling 'fn()' and use level counter to know depth of traversing. Update renaming traversing to skip trees for pools and external origins.	2018-02-28 21:08:38 +01:00
Zdenek Kabelac	6b48868cf0	io: keep 64b arithmetic Widen to 64b arithmetic from start.	2018-02-28 21:05:18 +01:00
Zdenek Kabelac	261e6c3df6	raid: add free for error path Recent patch forgot to release now allocated 'dso' on error path.	2018-02-28 21:05:18 +01:00
Zdenek Kabelac	e7f1329cae	debug: capture internal error for too long resource name Should never happen, so just put in internal error instead of silently passing some shortened resource name.	2018-02-19 16:45:10 +01:00
Zdenek Kabelac	c3bb2b29d4	locking: move cache dropping to primary locking code While 'file-locking' code always dropped cached VG before lock was taken - other locking types actually missed this. So while the cache dropping has been implemented for i.e. clvmd, actually running command in cluster kept using cache even when the lock has been i.e. dropped and taken again. This rather 'hard-to-hit' error was noticeable in some tests running in cluster where content of PV has been changed (metadata-balance.sh) Fix the code by moving cache dropping directly into lock_vol() function. TODO: it's kind of strange we should ever need drop_cached_metadata() used in several places - this all should happen automatically, thus some further thinking here is likely needed.	2018-02-19 16:45:05 +01:00
Zdenek Kabelac	a2d2fe3a8c	locking: exclusive can be either remote or local When LOCK is exclusive and LV is already locally active, it cannot be active remotely.	2018-02-15 13:54:55 +01:00
Zdenek Kabelac	a1195aaa66	cleanup: add missing WARNING ATM log_warn() is supposed to be used with WARNING: prefix.	2018-02-15 13:52:02 +01:00
Marian Csontos	d67f160200	mirror: Add deprecation warning for mirrored log	2018-02-14 13:32:04 +01:00
Zdenek Kabelac	c3642957c5	gcc: remove warns about free of const	2018-02-13 19:56:02 +01:00
Zdenek Kabelac	0eb9daf602	segtype: no libmem pool usage for name allocation Allocate name with plain malloc & free.	2018-02-13 19:11:28 +01:00
Zdenek Kabelac	32febed8d5	segtype: replace mempool allocation So this is a bit more complex and possibly worth further checking. ATM clvmd drops cmd->mem mempool AFTER refresh of cmd. So anything allocating from cmd->mem during toolcontext init will likely die at some point in time. As a quick fix - just use regular malloc/free for 'dso' allocation. It's worth to note - cmd->libmem seems to be often misused causing hidden memleaking for clvmd.	2018-02-13 19:11:28 +01:00
Zdenek Kabelac	e40768ac32	debug: add stack tracking	2018-02-12 22:15:03 +01:00
Zdenek Kabelac	27399755fd	segtype: better get_monitor_dso_path api Instead of allocating always 4K for dso path, use only real needed size. Also simplify API call and move common functionality into function itself.	2018-02-12 22:15:03 +01:00
Zdenek Kabelac	e113df129e	cleanup: decode dso path just once Build dso plugin name during segtype initialisation and just use the string during command life-time. Also slightly update message verbosity and make it very_verbose when operation is going to be made and 'verbose' when it's done.	2018-02-12 22:15:03 +01:00
Zdenek Kabelac	6dff5dc653	activation: cleanup error to warning Since for the code it's not fatal to fail on monitoring, issue correct warning message instead of error.	2018-02-12 22:15:03 +01:00
Zdenek Kabelac	d90a647802	activation: separate reporting of error and monitoring status Avoid using same return code for reporting 2 different things and strictly report error code by return value and add new parameter for reporting monitoring status. This makes easier to recognize which error we got from dm_event and continue only with ENOENT.	2018-02-12 22:14:59 +01:00
Zdenek Kabelac	12fba201be	cleanup: detect dmeventd_executable just once Avoid repeating debug messages about dmeventd executable and just remember it once for whole cmd lifetime.	2018-02-12 22:14:25 +01:00
Zdenek Kabelac	7239a45b79	clean: drop unneeded -1 for snprintf man gives: snprintf() and vsnprintf() write at most size bytes (including the terminating null byte ('\0')) to str.	2018-02-12 22:14:25 +01:00
Zdenek Kabelac	afdbb28f72	toolcontext: light context missed to set-up mem mempool If cmd->mem was null, then systemd generator was failing on: (gdb) bt dm_pool_alloc_aligned (p=0x0, s=96, alignment=8) at mm/pool-fast.c:95 dm_pool_alloc (p=0x0, s=96) at mm/pool-fast.c:90 dm_pool_zalloc (p=0x0, s=96) at mm/pool.c:74 config_file_read_fd (mem=0x0, cft=0x55f4339dbad0, dev=0x55f4339dfac0, reason=DEV_IO_MDA_CONTENT, offset=0, size=82293, offset2=0, size2=0, checksum_fn=0x0, checksum=0, checksum_only=0, no_dup_node_check=0, ioflags=0, config_file_read_fd_callback=0x0, config_file_read_fd_context=0x0) at config/config.c:567 config_file_read (mem=0x0, cft=0x55f4339dbad0) at config/config.c:658 config_file_open_and_read (config_file=0x7f49aef14540 <config_file> "/var/tmp/lvm/etc/lvm/lvm.conf", source=CONFIG_FILE, cmd=0x55f4339d6260) at config/config.c:282 _load_config_file (cmd=0x55f4339d6260, tag=0x7f49aeca15da "", local=0) at commands/toolcontext.c:824 _init_lvm_conf (cmd=0x55f4339d6260) at commands/toolcontext.c:853 create_config_context () at commands/toolcontext.c:1814 lvm_config_find_bool (libh=0x0, config_path=0x55f431a884ad "global/use_lvmetad", fail=0) at lvm_base.c:144 main ()	2018-02-12 22:13:53 +01:00
Zdenek Kabelac	83258e3385	toolcontext: do not change stream for pthreaded programs With pthreaded daemons like 'dmeventd' using liblvm via plugin, lvm2 actually should not 'play' with streams at all - as there could be parallel outputs running. As a current quick workaround just disable change for pthreaded program (gettid() != getpid()). TODO: it's possible the change of buffering actually doesn't serve us any measurable benefit and could be dropped as whole later... Meanwhile this patch is fixing this occasional valgrind race report: Invalid read of size 4 at 0x571892C: vfprintf (in /usr/lib64/libc-2.26.9000.so) by 0x57216B3: fprintf (in /usr/lib64/libc-2.26.9000.so) by 0x5042886: dm_event_log (libdevmapper-event.c:925) by 0x10B015: _dmeventd_log (dmeventd.c:125) by 0x10D289: _unregister_for_event (dmeventd.c:1146) by 0x10E52E: _handle_request (dmeventd.c:1583) by 0x10E6D7: _do_process_request (dmeventd.c:1631) by 0x10E7C6: _process_request (dmeventd.c:1660) by 0x1101A4: main (dmeventd.c:2285) Address 0x6264d30 is 192 bytes inside a block of size 552 free'd at 0x4C2ED68: free (vg_replace_malloc.c:530) by 0x573907D: fclose@@GLIBC_2.2.5 (in /usr/lib64/libc-2.26.9000.so) by 0x6AC5C00: reopen_standard_stream (log.c:189) by 0x6A8E62C: destroy_toolcontext (toolcontext.c:2271) by 0x6BA5C22: lvm_fin (lvmcmdline.c:3339) by 0x6BD5EF3: lvm2_exit (lvmcmdlib.c:123) by 0x6856013: dmeventd_lvm2_exit (dmeventd_lvm.c:103) by 0x66535B8: unregister_device (dmeventd_thin.c:432) by 0x10CBBC: _do_unregister_device (dmeventd.c:926) by 0x10CD74: _monitor_unregister (dmeventd.c:979) by 0x10D094: _monitor_thread (dmeventd.c:1066) by 0x54B35E0: start_thread (in /usr/lib64/libpthread-2.26.9000.so) by 0x57C30EE: clone (in /usr/lib64/libc-2.26.9000.so) Block was alloc'd at at 0x4C2DBBB: malloc (vg_replace_malloc.c:299) by 0x573932B: fdopen@@GLIBC_2.2.5 (in /usr/lib64/libc-2.26.9000.so) by 0x6AC5DC2: reopen_standard_stream (log.c:200) by 0x6A8D11D: create_toolcontext (toolcontext.c:1898) by 0x6BA5B6B: init_lvm (lvmcmdline.c:3319) by 0x6BD5BC8: cmdlib_lvm2_init (lvmcmdlib.c:34) by 0x6BD5F04: lvm2_init (lvm2cmd.c:20) by 0x6855EA7: dmeventd_lvm2_init (dmeventd_lvm.c:67) by 0x665305F: register_device (dmeventd_thin.c:352) by 0x10CB7A: _do_register_device (dmeventd.c:916) by 0x10CEE4: _monitor_thread (dmeventd.c:1006) by 0x54B35E0: start_thread (in /usr/lib64/libpthread-2.26.9000.so) by 0x57C30EE: clone (in /usr/lib64/libc-2.26.9000.so) .... Process terminating with default action of signal 6 (SIGABRT): dumping core at 0x570016B: raise (in /usr/lib64/libc-2.26.9000.so) by 0x5701520: abort (in /usr/lib64/libc-2.26.9000.so) by 0x57437D8: __libc_message (in /usr/lib64/libc-2.26.9000.so) by 0x5743831: __libc_fatal (in /usr/lib64/libc-2.26.9000.so) by 0x5744056: _IO_vtable_check (in /usr/lib64/libc-2.26.9000.so) by 0x574751C: __overflow (in /usr/lib64/libc-2.26.9000.so) by 0x574191A: fputc (in /usr/lib64/libc-2.26.9000.so) by 0x50428E3: dm_event_log (libdevmapper-event.c:934) by 0x10B015: _dmeventd_log (dmeventd.c:125) by 0x10D289: _unregister_for_event (dmeventd.c:1146) by 0x10E52E: _handle_request (dmeventd.c:1583) by 0x10E6D7: _do_process_request (dmeventd.c:1631) by 0x10E7C6: _process_request (dmeventd.c:1660) by 0x1101A4: main (dmeventd.c:2285)	2018-02-09 10:56:40 +01:00
Alasdair G Kergon	1b6d0346a3	format_text: Use versionsort to sort archive files Ensure that vg_100000-* follows vg_99999-* so that the expiry logic doesn't stop too early. https://bugzilla.redhat.com/1481085	2018-02-09 01:08:55 +00:00
Alasdair G Kergon	d6cabbbc53	device: Fix basic async I/O error handling	2018-02-08 20:19:21 +00:00
Alasdair G Kergon	3e29c80122	device: Queue any aio beyond defined limits.	2018-02-08 20:15:37 +00:00
Alasdair G Kergon	db41fe6c5d	lvmcache: Use asynchronous I/O when scanning devices.	2018-02-08 20:15:29 +00:00
Alasdair G Kergon	8c7bbcfb0f	device: Basic config and setup to support async I/O.	2018-02-08 20:15:14 +00:00
Alasdair G Kergon	7a9af3cd0e	device: Add flag to indicate that a code path can support AIO Until the whole source supports AIO, library code can check for AIO_SUPPORTED_CODE_PATH to determine whether or not it is OK to use AIO.	2018-02-06 01:11:00 +00:00
Alasdair G Kergon	e869a52cc4	callbacks: Miscellaneous fixes for recent changes	2018-02-06 01:09:39 +00:00
Zdenek Kabelac	083c221cbe	pvmove: reinstantiate clustered pvmove In fact pvmove does support 'clustered-core' target for clustered pvmove of LVs activated on multiple nodes. This patch restores support for activation of pvmove on all nodes for LVs that are also activate on all nodes.	2018-02-01 21:55:20 +01:00
Zdenek Kabelac	a1cfef9f26	dev_io: fix writes for unaligned buffers Actually the removed code is necessary - since not all writes are getting aligned buffer - older compilers seem to be not able to create 4K aligned buffers on stack - thus the aligning code still needs to be present for write path.	2018-01-23 13:36:12 +01:00
Zdenek Kabelac	6e9148e7ab	debug: drop DEBUG_MEM path Memory is not allocated so no DEBUG_MEM part is needed.	2018-01-23 11:45:18 +01:00
Alasdair G Kergon	9194610f42	device: Add ioflags parameter to transfer additional state. Flags are set on the initial I/O and passed to any callbacks that may in turn issue further I/O using the inherited flags.	2018-01-21 21:10:23 +00:00
Zdenek Kabelac	f3c75bb201	activation: move check later Check for lv when it's known in all cases.	2018-01-17 15:15:43 +01:00
Zdenek Kabelac	38b81e6537	cleanup: enhance messages Add extra info about failing local exclusive activation (as in cluster the LV can be active on some other nodes).	2018-01-17 14:45:48 +01:00
Zdenek Kabelac	a8bcdef4fd	activation: guard exclusive activation Add protective internal error whenever we spot activation of 'exclusive' only segments in 'non-exclusive' mode. TODO: possibly the activation locking could be enhanced to handle this fully behind the scene - as for now this works purely for lvchange/vgchange activation.	2018-01-17 14:45:47 +01:00
Zdenek Kabelac	e86910b052	lvconvert: use excl activation for conversion Use properly exclusive activation when reactivating origin after snapshot merge (since origin must have been previously also exclusively activated). Same applies when converting volumes to thin-pool or cache. Previously used 'only' local activation incorrectly allowed local activation of some targets (i.e. raid) - thus 'leaking' chance to activate same device on another node - which can be a problem for device types like raid.	2018-01-17 14:43:34 +01:00
Alasdair G Kergon	c26458339e	device: Move buffer allocation nearer to the I/O. Don't allocate memory until it's needed - later we'll add some of the I/O to an internal queue instead of issuing it immediately.	2018-01-16 01:12:08 +00:00
Alasdair G Kergon	081902b4c1	device: Merge _dev_read and dev_read_callback.	2018-01-16 00:41:42 +00:00
Alasdair G Kergon	0a3c6bf8c6	format_text: Refactor mda counting in label processing.	2018-01-15 23:47:44 +00:00
Alasdair G Kergon	73b5ee64e7	format_text: Change update_mda_baton to use label not labeller	2018-01-15 20:13:53 +00:00
Alasdair G Kergon	b825987b2f	device: Rearrange _aligned_io().	2018-01-15 20:10:54 +00:00
Alasdair G Kergon	c90582344d	device: Add reason to devbuf.	2018-01-15 19:38:18 +00:00
Alasdair G Kergon	1f01eaa612	device: Store offset to data instead of pointer. We want to save the relative offset before we've allocated the buffer's memory.	2018-01-15 19:32:59 +00:00
Alasdair G Kergon	61d3296f2a	device: Reorder device.h before change.	2018-01-15 19:24:01 +00:00
Alasdair G Kergon	35cdd9cf48	label: Clean up storing of device and label sector. No longer use the external 'result' pointer internally to set up the cached label. The callback _set_label_read_result() is now given the internal label pointer directly Callers that don't need the result are no longer required to pass a label pointer into label_read().	2018-01-11 02:54:00 +00:00
Alasdair G Kergon	f771d3f870	config: Move use_mmap to local variable.	2018-01-10 20:35:02 +00:00
Alasdair G Kergon	6210c1ec28	device: Mark read-only device buffers const.	2018-01-10 19:57:10 +00:00
Alasdair G Kergon	c350f96c09	device: Eliminate unnecessary buffer from dev_read.	2018-01-10 18:48:01 +00:00
Alasdair G Kergon	366493a1d1	device: Suppress repeated reads of the same data. If the data being requested is present in last_[extra_]devbuf, return that directly instead of reading it from disk again. Typical LVM2 access patterns request data within two adjacent 4k blocks so we eliminate some read() system calls by always reading at least 8k.	2018-01-10 15:52:03 +00:00
Alasdair G Kergon	dcb2a5a611	device: Remove some data copying between buffers. Callers that read larger amounts of data now get a pointer to read-only data directly without copying it through an intermediate buffer. This data is owned by the device layer so the callers no longer free it.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	4d568b709c	device: Free cached device bufs when metadata invalid or dev closed.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	bd0967a4b1	device: Keep the last data buffer read off each device. If there's a second metadata area on device, we record that separately. Note that the memory requirements aren't restricted yet.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	bacc942333	allocation: Avoid exceeding array bounds in allocation tag code If _limit_to_one_area_per_tag() changes nothing it writes beyond the array.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	e2438b5b9f	format_text: Use malloc aligned for export buffer	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	b65246499b	label: Rename a variable	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	ea96381534	libdm: Introduce dm_malloc_aligned	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	f4675af4cf	format_text: Use vgsummary callbacks	2018-01-09 03:14:30 +00:00
Alasdair G Kergon	4b02d4e22e	label: Add label_read callback.	2018-01-08 23:30:50 +00:00
Alasdair G Kergon	6d322e68f3	label: Add callback fns (partially)	2018-01-08 17:04:56 +00:00
Alasdair G Kergon	5e7d3ad749	device: Introduce dev_read_callback If it obtains the data, it passes it into the supplied callback function and returns 1. Otherwise the callback receives failed = 1. Updated config_file_read_fd to use this and similarly return the data via a callback fn of its own.	2018-01-06 02:40:12 +00:00
Alasdair G Kergon	946f07af3e	metadata: Use a consistent format for callback fn parameters	2018-01-05 14:24:56 +00:00
Alasdair G Kergon	a0ddfad94b	metadata: Change the new data processing fns to void. Move the existing fn return codes into the new structs.	2018-01-05 03:12:22 +00:00
Alasdair G Kergon	c70c9f6565	format_text: Split vgname_from_mda into three pieces.	2018-01-04 21:13:44 +00:00
Alasdair G Kergon	d61b1369d0	format_text: Split out raw_read_mda_header processing	2018-01-04 15:52:59 +00:00
Alasdair G Kergon	139209ef42	format_text: Split up _update_mda. Dedicated functions are now used to process each piece of data obtained, so the refactoring in this file gives us one for the vgsummary and one for the metadata header. This new type of function takes two parameters (for now), the obtained data plus a single struct (that must not reference any data on the stack) that wraps up the entire context needed to process it.	2018-01-04 12:25:24 +00:00
Alasdair G Kergon	111a9fcff5	format_text: Allocate update_mda baton from mempool. Also store return code. Note that fatal and non-fatal errors while handling the mda aren't currently distinguished.	2018-01-03 23:53:00 +00:00
Alasdair G Kergon	5a846e0929	format_text: Split the text import fns into two pieces.	2018-01-03 20:48:02 +00:00
Alasdair G Kergon	4b9806ab6f	toolcontext: Add paired label_init to refresh_toolcontext. label_init() and label_exit() should be paired.	2018-01-02 22:00:31 +00:00
Alasdair G Kergon	22b6c482ec	config: Split config buffer processing into new fn. Wrap its parameters into struct process_config_file_params allocated from a mempool now passed into the config_file_read* fns.	2018-01-02 21:10:46 +00:00
Alasdair G Kergon	3db51e3f0e	label: Wrap _find_labeller params into a struct. Move the actual buffer reading up to _label_read() so _find_labeller() just examines the buffer supplied.	2018-01-02 17:15:32 +00:00
Alasdair G Kergon	9b830791ea	label: Move _set_label_read_result call into _find_labeller. Move responsibility for setting the label_read() result parameter down into _find_labeller().	2018-01-02 15:30:58 +00:00
Alasdair G Kergon	4f4ddb806d	label: Move setting result of label_read into separate fn.	2018-01-02 14:19:20 +00:00
Alasdair G Kergon	e6b4b41881	label: Add mempool.	2018-01-02 13:37:12 +00:00
Zdenek Kabelac	3a841515af	lvm-string: add function to detect component LV suffix Add is_component_lvname() function to recognize component LV name.	2017-12-19 15:28:07 +01:00
Alasdair G Kergon	17649d4ac8	device: Move dev_read memory allocation into device layer. Rename dev_read() to dev_read_buf() - the function that reads data into a supplied buffer. Introduce a new dev_read() that allocates the buffer it returns and switch the important users over to this. No caller may change the returned data. (For now, callers are responsible for freeing it after use, but later the device layer will take full ownership.) dev_read_buf() should only be used for tiny buffers or unimportant code (such as the old disk formats).	2017-12-19 01:31:50 +00:00
David Teigland	3f9ae846b8	lvmlockd: clear coverity complaint from previous coverity fix, it's never happy.	2017-12-18 15:19:17 -06:00
Alasdair G Kergon	5f45cb90a7	format_text: Transfer circular buf alloc to device layer. Instead of the caller passing dev_read_circular() a buffer to fill with data, the device layer itself now allocates it.	2017-12-15 22:34:26 +00:00
Alasdair G Kergon	beee9940a5	format_text: Separate out code paths for buffer wraparound The creation of wrapped around metadata - where the start of metadata is written up to the end of the buffer and the remainder follows back at the start of the buffer - is now restricted to cases where writing the metadata in one piece wouldn't fit. This shouldn't happen in 'normal' usage so let's begin treating the code for this as a special case that can be ignored when optimising 'normal' cases.	2017-12-15 21:12:19 +00:00
Alasdair G Kergon	145ded10c2	format_text: Supply mempool directly to raw_read_mda_header.	2017-12-15 14:57:05 +00:00
Alasdair G Kergon	3edc25dbdf	format_text: Round size written up to multiple of 4096. Zero-fill metadata up to the next 4096 boundary then write out a multiple of 4096 bytes to avoid triggering a read-modify-write.	2017-12-12 22:52:22 +00:00
Alasdair G Kergon	78ffa44fc5	format_text: Change metadata alignment from 512 to 4096. If there is sufficient space in the metadata area, align the next metadata to a disk offset that is a multiple of 4096 bytes and don't write it circularly. If it doesn't all fit at the end of the metadata area, go back to the start and write it all there contiguously. If there is insufficient space to use the new stricter rules, revert to the original behaviour, aligning on 512-byte boundaries wrapping around the circular buffer as required.	2017-12-12 20:57:36 +00:00
Alasdair G Kergon	643df602c7	format_text: More refactoring of metadata offset calcs	2017-12-12 18:51:32 +00:00
Alasdair G Kergon	4002f5e206	format_text: Refactor and document metadata offset calculation.	2017-12-12 18:36:54 +00:00
Alasdair G Kergon	e932c5da50	device: Fix an unpaired device close. dev_open_flags contains an unpaired dev_close_immediate so increment open_count before calling it.	2017-12-12 17:56:58 +00:00
Alasdair G Kergon	b96862ee11	metadata: Consistently skip metadata areas that failed. Even after writing some metadata encountered problems, some commands continue (rightly or wrongly) and attempt to make further changes. Once an mda is marked MDA_FAILED, don't try to use it again. This also applies when reverting, where one loop already skips failed mdas but the other doesn't. This fixes some device open_count warnings on relevant failure paths.	2017-12-12 17:52:45 +00:00
Alasdair G Kergon	c5ef76bf27	device: Internal error if writing 0 bytes to dev.	2017-12-12 12:57:25 +00:00
Alasdair G Kergon	b76c6951aa	format_text: Adjust metadata alignment calculation. Use new ALIGN_ABSOLUTE macro when calculating the start location of new metadata and adjust the end of buffer detection so that there is no longer an imposed gap between old and new metadata.	2017-12-11 20:25:03 +00:00
Alasdair G Kergon	053d35de47	format_text: Use absolute alignment to calculate metadata usage Currently both start and offset should always be divisible by alignment, so this should have no effect, but a later patch will increase alignment so these variables can no longer be optimised out.	2017-12-11 17:14:38 +00:00
Alasdair G Kergon	2db67a8ea0	format_text: Move metadata size checking into separate fn. Move checks into _metadata_fits_into_buffer() and add macro for alignment.	2017-12-11 17:08:29 +00:00
Alasdair G Kergon	46393bfca0	format_text: Log additional circular buffer information.	2017-12-11 16:07:34 +00:00
Alasdair G Kergon	49d486319f	format_text: Replace PRI with FMT.	2017-12-11 15:39:25 +00:00
Zdenek Kabelac	71485ebfc7	thin: regression fix for metadata checking Fix regression from commit `f173274fe4` and restore support for 'disabled' checking via lvm.conf.	2017-12-08 13:21:15 +01:00
Zdenek Kabelac	455b26b8db	activation: keep priority till memlock_unlock Although it doesn't look like it can be a measurable problem and costs some time to flip priorities outside of activation window. So just like with memory locking preserve priority until call memlock_unlock() appears. (addition to commit `c086dfadc3`).	2017-12-08 13:21:15 +01:00
Alasdair G Kergon	14b1e5270d	format_text: Use explicit alignment in wrapping calc. Expand out the metadata wrapping calculations to prepare to support a larger alignment. The current alignment is 512 bytes so (mdac_area_start + rlocn->offset) % alignment is zero.	2017-12-08 01:18:46 +00:00
Zdenek Kabelac	f173274fe4	cleanup: reorder calling of pool checking tools Test for zero header before even starting to create argument list for execution of thin/cache_check tool.	2017-12-07 21:00:39 +01:00
Alasdair G Kergon	2166d7be72	lvmetad: drop stray underscore	2017-12-07 16:24:14 +00:00
Alasdair G Kergon	d591d04103	device: Tag I/O for each mda on a device separately in log messages. Mark the first metadata area on each text format PV as MDA_PRIMARY. Pass this information down to the device layer so that when there are two metadata areas on a block device, we can easily distinguish two independent streams of I/O.	2017-12-07 03:48:11 +00:00
David Teigland	54154dc6f1	lvmlockd: clear coverity complaint	2017-12-06 10:49:31 -06:00
David Teigland	b910c34f09	lvmlockd: use pool lock for tmeta access When a command is run on a named tmeta LV, use the lock on the pool.	2017-12-05 14:31:03 -06:00
David Teigland	b9e4198500	lvmlockd: fix log print from previous commit	2017-12-05 13:48:30 -06:00
David Teigland	5d5807b238	lvmlockd: improve error message for VG lock conflict When there is significant VG lock contention which retries have not been able to mask, print a better error message.	2017-12-05 11:53:03 -06:00
Heinz Mauelshagen	94632eb155	deactivate_lvs: deactivate any missing RaidLV legs In case of failed legs, raid replaces those with e.g. "vg-lv_rimage_0-missing_0_0" mapped to an error target. Those erroneously remain on deactivation. Fix by removing them on deactivation/removal of the RaidLV.	2017-12-05 18:48:06 +01:00
Alasdair G Kergon	7195df5aca	device: Skip read-modify-write if replacing whole block.	2017-12-05 01:00:38 +00:00
Alasdair G Kergon	e4805e4883	device: categorise block i/o Introduce enum dev_io_reason to categorise block device I/O in debug messages so it's obvious what it is for. DEV_IO_SIGNATURES /* Scanning device signatures */ DEV_IO_LABEL /* LVM PV disk label */ DEV_IO_MDA_HEADER /* Text format metadata area header */ DEV_IO_MDA_CONTENT /* Text format metadata area content */ DEV_IO_FMT1 /* Original LVM1 metadata format */ DEV_IO_POOL /* Pool metadata format */ DEV_IO_LV /* Content written to an LV */ DEV_IO_LOG /* Logging messages */	2017-12-04 23:45:26 +00:00
Zdenek Kabelac	698483b5a1	activation: also lock memory for clustered locking Commit `c086dfadc3` missed to lock memory for clustering suspend part since it's using different locking reason.	2017-12-04 23:33:02 +01:00
Zdenek Kabelac	110dac870c	cleanup: use existing define with prefix	2017-12-04 15:38:50 +01:00
Zdenek Kabelac	2a22576b2d	cleanup: drop unused header DM_UUID_LEN is no longer needed.	2017-12-04 15:38:50 +01:00
Heinz Mauelshagen	4daad1cf11	lv_manip: allow extension on --nosync raid lv If the recovery of the replaced leg(s) of a RaidLV created without initial resynchronization (i.e. "lvcreate --nosync ...") got interrupted, it can't be extended because of the < 100% sync rate.	2017-12-01 18:38:18 +01:00
Heinz Mauelshagen	d3d18e637c	raid: ignore --stripesize on raid4/5 conversion to 1 stripe In case caller passes in changed stripe size when reshaping raid4/5 to 1 stripe aiming to convert to raid1 and optionally to linear, ignore it to prevent data corruption.	2017-12-01 15:00:09 +01:00
Zdenek Kabelac	a42c3a0e90	cleanup: remove debug code	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	4dc8184803	suspend: optimize generated list Avoid adding same LV multiple times into the list. Just saves couple extra calls and ioctls and makes log shorter.	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	7e794b7748	activation: avoid rechecking pvmove node Use new 3rd. state of trace_pvmove_deps == 2. In this state we know, we have already seen the node and can skip further testing. Remaining value 1 signals we want to track, and value 0 is for ignoring tracking, but node is still checked in this case. Reduces large amount of duplicate ioctl queries.	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	e4db42e476	activation: extend resume validation Check also all snapshots when resume is requested, the origin volume is already resumed, but possibly some subLV or snapshot LV could be suspended if we are still in critical_section.	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	c086dfadc3	activation: split priority from memory locking When entering any critical section, lvm2 used to lock process memory and raised task priority to avoid problem with page swapping and minimize time of having non-resumed devices in table. With this patch, memory locking which is expensive is only used when entering 'suspending' section as only in this section there is risk lvm could be suspending a device which later can be needed for paging. Raised priority is still kept for all section entrances as this is low-cost operation and may accelerate table resumes - although the real impact can be still considered later.	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	c489dd2e17	pvmove: add missing segment merging When pvmove is finished and metadata are updated, the code missed to merge possible mergeable segments - so add explicit merging call after pvmoved volumes are unlocked. This avoids weird results where i.e. lvs could have been reporting non-matching segments as lvs upon metadata read is doing silent segment merging while dm table left after pvmove was still preserving non-merged segments.	2017-12-01 12:19:09 +01:00
Zdenek Kabelac	fbd8b456db	pvmove: move code from tools to lib Move code manipulating with locking flags into /lib part of lvm.	2017-12-01 12:18:32 +01:00
Alasdair G Kergon	a9812ec9d3	label: Remove unused verify functions. label_verify has never been used so remove it.	2017-11-28 01:36:55 +00:00
Zdenek Kabelac	02e934c444	cleanup: reuse existing macro Use existing macro to detect striped raid segment.	2017-11-27 10:34:30 +01:00
Zdenek Kabelac	f70404addb	pvmove: enhance delayed_resume logic ATM we want to support delayed resume purely in pvmove case. So have libdm logic internal to recognize difference between pvmove and other targets that do use delayed resume. This fixes problem introduced with commit `aa68b898ff` for mirror-on-mirror or snapshot-on-mirror problem. TODO: likely added new API call and let libdm user select delayed nodes explicitly.	2017-11-26 00:36:48 +01:00
Zdenek Kabelac	8c6fd0933f	activation: enhance holders detection Use code which detects handlers in a way, which is more backward-compatible friendly. Replace read of 'sysfs' uuid entry with dm ioctl call. Use /sys/block/dm-X/holders path instead of new path /sys/dev/block/major:minor/holders. TODO: There are few more occurrences of this logic around the code so some abstract interface should be considered.	2017-11-26 00:31:26 +01:00
Zdenek Kabelac	54741aeac5	cleanup: reduce couple debug lines	2017-11-24 16:09:59 +01:00
Zdenek Kabelac	5e88d3a89b	cache: use conditional in warning message In some cases the message could be slightly misleading, so rather use a conditional here. TODO: In future we may further tune the message in case we are certain the level of redundancy protection has not been reduced.	2017-11-24 16:09:59 +01:00
Zdenek Kabelac	2a6981a697	activation: automatically discover pvmove holders When pvmove is finished and does 'suspend/resume' on the PVMOVE LV, on the resume path the committed metadata already shows a 'standalone' pvmove LV prepared just for removal. However, the code should be able to 'resume' preloaded LVs that were participating in the pvmove operation. Previously this was all done in the 'tools' part of lvm2 code, so lvconvert had to explicitly call 'resume' on every such LV when pvmove finished. Now 'smarter' activation code is able to deduce and combine all information from the active dm table and committed metadata, so a single call resolves it all in one go. Internally, holders are detected by reading the sysfs directory to capture all needed UUIDs, which are then looked up in lvm2 metadata, and all such LVs are automatically collected into the dmtree.	2017-11-24 16:09:59 +01:00
Zdenek Kabelac	ddbe763eb8	mirror: use lv_update_and_reload_origin Replace complex code with standard lv_update_and_reload_origin(). Extra suspends should not be necessary. (If they were, the dependency tree would have a bug that needs fixing.)	2017-11-24 16:05:21 +01:00
Zdenek Kabelac	b5be7420d9	locking: pvmove is locking holding LV As we do get a lock for the pvmove LV, it's the lockholder ATM.	2017-11-24 16:05:21 +01:00


6218 Commits