shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
David Teigland	f17c2cf7c6	pvremove: device check doesn't require label_read It just needs to check if the device was found during the scan, which means checking if it exists in lvmcache.	2018-04-20 11:22:45 -05:00
David Teigland	29c6c17121	format-text.c log message fixes	2018-04-20 11:22:45 -05:00
David Teigland	d9a77e8bb4	lvmcache: simplify metadata cache The copy of VG metadata stored in lvmcache was not being used in general. It pretended to be a generic VG metadata cache, but was not being used except for clvmd activation. There it was used to avoid reading from disk while devices were suspended, i.e. in resume. This removes the code that attempted to make this look like a generic metadata cache, and replaces with with something narrowly targetted to what it's actually used for. This is a way of passing the VG from suspend to resume in clvmd. Since in the case of clvmd one caller can't simply pass the same VG to both suspend and resume, suspend needs to stash the VG somewhere that resume can grab it from. (resume doesn't want to read it from disk since devices are suspended.) The lvmcache vginfo struct is used as a convenient place to stash the VG to pass it from suspend to resume, even though it isn't related to the lvmcache or vginfo. These suspended_vg* vginfo fields should not be used or touched anywhere else, they are only to be used for passing the VG data from suspend to resume in clvmd. The VG data being passed between suspend and resume is never modified, and will only exist in the brief period between suspend and resume in clvmd. suspend has both old (current) and new (precommitted) copies of the VG metadata. It stashes both of these in the vginfo prior to suspending devices. When vg_commit is successful, it sets a flag in vginfo as before, signaling the transition from old to new metadata. resume grabs the VG stashed by suspend. If the vg_commit happened, it grabs the new VG, and if the vg_commit didn't happen it grabs the old VG. The VG is then used to resume LVs. This isolates clvmd-specific code and usage from the normal lvm vg_read code, making the code simpler and the behavior easier to verify. Sequence of operations: - lv_suspend() has both vg_old and vg_new and stashes a copy of each onto the vginfo: lvmcache_save_suspended_vg(vg_old); lvmcache_save_suspended_vg(vg_new); - vg_commit() happens, which causes all clvmd instances to call lvmcache_commit_metadata(vg). A flag is set in the vginfo indicating the transition from the old to new VG: vginfo->suspended_vg_committed = 1; - lv_resume() needs either vg_old or vg_new to use in resuming LVs. It doesn't want to read the VG from disk since devices are suspended, so it gets the VG stashed by lv_suspend: vg = lvmcache_get_suspended_vg(vgid); If the vg_commit did not happen, suspended_vg_committed will not be set, and in this case, lvmcache_get_suspended_vg() will return the old VG instead of the new VG, and it will resume LVs based on the old metadata.	2018-04-20 11:22:45 -05:00
David Teigland	79c4971210	label_scan: remove extra label scan and read for orphan PVs When process_each_pv() calls vg_read() on the orphan VG, the internal implementation was doing an unnecessary lvmcache_label_scan() and two unnecessary label_read() calls on each orphan. Some of those unnecessary label scans/reads would sometimes be skipped due to caching, but the code was always doing at least one unnecessary read on each orphan. The common format_text case was also unecessarily calling into the format-specific pv_read() function which actually did nothing. By analyzing each case in which vg_read() was being called on the orphan VG, we can say that all of the label scans/reads in vg_read_orphans are unnecessary: 1. reporting commands: the information saved in lvmcache by the original label scan can be reported. There is no advantage to repeating the label scan on the orphans a second time before reporting it. 2. pvcreate/vgcreate/vgextend: these all share a common implementation in pvcreate_each_device(). That function already rescans labels after acquiring the orphan VG lock, which ensures that the command is using valid lvmcache information.	2018-04-20 11:22:45 -05:00
David Teigland	e3e5beec74	lvmetad: use new label_scan for update from pvscan Take advantage of the common implementation with aio and reduced disk reads.	2018-04-20 11:22:43 -05:00
David Teigland	9c71fa0214	lvmetad: use new label_scan for update from lvmlockd When lvmlockd indicates that the lvmetad cache is out of date because of changes by another node, lvmetad_pvscan_vg() rescans the devices in the VG to update lvmetad. Use the new label_scan in this function to use the common code and take advantage of the new aio and reduced reads.	2018-04-20 11:21:41 -05:00
David Teigland	098c843c50	independent metadata areas: fix bogus code Fix mixing bitwise & and logical && which was always 1 in any case.	2018-04-20 11:21:41 -05:00
David Teigland	d9ef9eb330	label_scan: fix independent metadata areas This fixes the use of lvmcache_label_rescan_vg() in the previous commit for the special case of independent metadata areas. label scan is about discovering VG name to device associations using information from disks, but devices in VGs with independent metadata areas have no information on disk, so the label scan does nothing for these VGs/devices. With independent metadata areas, only the VG metadata found in files is used. This metadata is found and read in vg_read in the processing phase. lvmcache_label_rescan_vg() drops lvmcache info for the VG devices before repeating the label scan on them. In the case of independent metadata areas, there is no metadata on devices, so the label scan of the devices will find nothing, so will not recreate the necessary vginfo/info data in lvmcache for the VG. Fix this by setting a flag in the lvmcache vginfo struct indicating that the VG uses independent metadata areas, and label rescanning should be skipped. In the case of independent metadata areas, it is the metadata processing in the vg_read phase that sets up the lvmcache vginfo/info information, and label scan has no role.	2018-04-20 11:21:41 -05:00
David Teigland	748f29b42a	scan: do scanning at the start of a command Move the location of scans to make it clearer and avoid unnecessary repeated scanning. There should be one scan at the start of a command which is then used through the rest of command processing. Previously, the initial label scan was called as a side effect from various utility functions. This would lead to it being called unnecessarily. It is an expensive operation, and should only be called when necessary. Also, this is a primary step in the function of the command, and as such it should be called prominently at the top level of command processing, not as a hidden side effect of a utility function. lvm knows exactly where and when the label scan needs to be done. Because of this, move the label scan calls from the internal functions to the top level of processing. Other specific instances of lvmcache_label_scan() are still called unnecessarily or unclearly by specific commands that do not use the common process_each functions. These will be improved in future commits. During the processing phase, rescanning labels for devices in a VG needs to be done after the VG lock is acquired in case things have changed since the initial label scan. This was being done by way of rescanning devices that had the INVALID flag set in lvmcache. This usually approximated the right set of devices, but it was not exact, and obfuscated the real requirement. Correct this by using a new function that rescans the devices in the VG: lvmcache_label_rescan_vg(). Apart from being inexact, the rescanning was extremely well hidden. _vg_read() would call ->create_instance(), _text_create_text_instance(), _create_vg_text_instance() which would call lvmcache_label_scan() which would call _scan_invalid() which repeats the label scan on devices flagged INVALID. lvmcache_label_rescan_vg() is now called prominently by _vg_read() directly.	2018-04-20 11:21:38 -05:00
David Teigland	4507ba3596	scan: use new label_scan for lvmcache_label_scan To do label scanning, lvm code calls lvmcache_label_scan(). Change lvmcache_label_scan() to use the new label_scan() based on bcache. Also add lvmcache_label_rescan_vg() which calls the new label_scan_devs() which does label scanning on only the specified devices. This is for a subsequent commit and is not yet used.	2018-04-20 11:19:32 -05:00
David Teigland	a7cb76ae94	scan: use bcache for label scan and vg read New label_scan function populates bcache for each device on the system. The two read paths are updated to get data from bcache. The bcache is not yet used for writing. bcache blocks for a device are invalidated when the device is written.	2018-04-20 11:19:24 -05:00
David Teigland	93fc937429	[device/bcache] bcache_read_bytes should put blocks	2018-04-20 11:12:50 -05:00
David Teigland	7be54bd687	[device/bcache] fix min() function	2018-04-20 11:12:50 -05:00
David Teigland	d9e6298edb	[device/bcache] fix missing max_io fn in bcache async engine	2018-04-20 11:12:50 -05:00
Joe Thornber	dc8034f5eb	[device/bcache] more work on bcache	2018-04-20 11:12:50 -05:00
Joe Thornber	6a57ed17a2	[device/bcache] add bcache_prefetch_bytes() and bcache_read_bytes() Not tested yet.	2018-04-20 11:12:50 -05:00
Joe Thornber	467adfa082	[device/bcache] More tests and some bug fixes	2018-04-20 11:12:50 -05:00
Joe Thornber	19647d1cd4	[device/bcache] fix bug in _alloc_block	2018-04-20 11:12:50 -05:00
Joe Thornber	1563b93691	[device/bcache] Add bcache_max_prefetches() Ignore prefetches if max io is in flight.	2018-04-20 11:12:50 -05:00
Joe Thornber	c4c4acfd42	[device/bcache] Add a couple of invalidate methods	2018-04-20 11:12:50 -05:00
Joe Thornber	0f0eb04edb	[device/bcache] some more work on bcache	2018-04-20 11:12:50 -05:00
Joe Thornber	46867a45d2	[device/bcache] stub a unit test	2018-04-20 11:12:50 -05:00
Joe Thornber	da7e13ef88	[lib/device/bcache] Tweaks after Kabi's review	2018-04-20 11:10:45 -05:00
Joe Thornber	acb42ec465	[device/bcache] Initial code drop. Compiles. Not written tests yet.	2018-04-20 11:10:45 -05:00
Joe Thornber	00f1b208a1	[io paths] Unpick agk's aio stuff	2018-04-20 11:03:58 -05:00
Zdenek Kabelac	73cda0437f	cleanup: correcting macro wrapping Use proper do {} while(0) so ';' after macros are correctly interpretted..	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	9731d48691	cleanup: enhance debug message	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	d437bd86ff	cleanup: display_lvname update message Add more display_lvname usage. Update some error messages. Indent.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	7323557379	cleanup: add _mb_ to regiosize option Just like with others mentions default unit in function name.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	e878c3fc32	cleanup: correct casting	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	27a1a0e5c0	cleanup: reorder condition There is no point to wait for sync for non-locally active LV.	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	d81e3f9b06	mirror: use vg mempool Use vg mempool with mirror log metadata update.	2018-04-20 12:16:14 +02:00
Zdenek Kabelac	05f954ee9b	mirror: checking for mirror segtype Checking more correctly for mirror segtype here instead of mirrored one which can be also 'raid'.	2018-04-20 12:16:14 +02:00
Zdenek Kabelac	79d214032b	mirror: validate region_size for mirrors Check for region size properties of mirror segments.	2018-04-20 12:16:13 +02:00
Zdenek Kabelac	1693fef529	mirror: properly reload table for log init Since mirror can be stacked, we need to properly reload whole table stack, otherwice we may mishandle devices in dm table.	2018-04-20 12:15:36 +02:00
Zdenek Kabelac	55d83f9f6e	mirror: block_on_error only with monitoring When user configured lvm2 to NOT user monitoring, activated mirror actually hang upon error and it's quite unusable moment. So instead Warn those 'brave' non-monitoring users about possible problem and activation mirror without blocking error handling. This also makes it a bit simpler for test suite to handle trouble cases when test is running without dmeventd.	2018-04-20 12:13:51 +02:00
Zdenek Kabelac	66400d003d	mirror: fix region_size for clustered VG When adjusting region size for clustered VG it always needs to fit 2 full bitset into 1MB due to old limits of CPG. This is relatively big amount of bits, but we have still limitation for region size to fit into 32bits (0x8000000). So for too big mirrors this operation needs to fail - so whenever function returns now 0, it means we can't find matching region_size. Since return 0 is now 'error' we need to also pass proper region_size when creating pvmove mirror.	2018-04-20 12:13:48 +02:00
Zdenek Kabelac	a19456b868	mirror: fix calcs for maximal region_size Since extent_size is no longer power_of_2 this max region size evalution was rather producing random bitsize as a combination of lowest bit from number of extents and extent size itself. Correct calculation to use whole LV size and pick biggest possible power of 2 value smaller then UINT32_MAX.	2018-04-20 12:13:08 +02:00
Zdenek Kabelac	91965af9b1	mirror: improve mirror log size estimation Drop mirrored mirror log limitation that applies only in very limited use-case and actually mirrored mirror log is deprecated anyway. So 'disk' mirror log is selecting the correct minimal size, and bigger size is only enforced with real mirrored mirror log. Also for mirrored mirror log we let use 'smalled' region size if needed so if user uses 1G region size, we still keep small mirror log with much smaller region size in this case when needed. Also mirror log extent calculation is now properly detecting error with too big mirrors where previosly trimmed uint32_t was applies unintentionally.	2018-04-20 12:11:42 +02:00
Zdenek Kabelac	73189170f5	mirror: fix 32bit size calculation On 32bit arch size_t remains 4-byte wide - so size can't get correct result for multiplication of 32bit numbers.	2018-04-20 12:08:57 +02:00
Zdenek Kabelac	ff3ffe30e4	activation: add generic rule for visibility change Whenever we make visible LV out of previously invisible one, reload it's table - the is mandator for proper udev rule processing as well as ensure content of dm table is correct. TODO: this new generic rule probably make extra raid rules unnecessary.	2018-04-20 12:07:36 +02:00
Zdenek Kabelac	4e0c0417ce	cleanup: typo fix	2018-03-19 12:05:57 +01:00
Zdenek Kabelac	8d7ece126b	cache: disallow to combine format 2 with mq Only policy 'smq' is meant to be used with format version 2. Code used to let pass 'mq' policy also with format 2. But 'mq' is obsoloted wth smq and kernel currently matches it. But this is incompatible with older original mq logic - so disallow creation of this rather useless combination.	2018-03-19 12:02:08 +01:00
Zdenek Kabelac	f4383a70ba	coverity: drop unused local static var	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	aa75e181be	coverity: drop unneeded header files	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	f2d0eefa77	coverity: make use of defined variable Since we declare 'r', let's use the value for something.	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	67fbe980a7	raid: fix version check of target Comparision missed to check patch level for matching minor version. Howerver since all checked patchlevels were 0 - the fix doesn't change result.	2018-03-17 23:30:14 +01:00
Zdenek Kabelac	689af32313	pools: skip checks when tools are missing If the tools for checking thin_pool or cache metadata are missing, issue rather just a WARNING, but let the operation of activation continue. This has the advantage, the if user is missing those tools, but he already started to use thinpool or cacheing, he can access these volumes with a WARNING. Also if the user is using too old tools i.e. for CacheV2 format dmpd tool 0.7 is required - provide informative WARNING and skip failure from older tool version which can't understand new format V2.	2018-03-17 23:29:11 +01:00
Heinz Mauelshagen	d68d71013f	lvcreate: remove RaidLV on creation failure In case a newly created RaidLV is blacklisted using config \"activation { volume list = [ ... ] }\" (i.e. its SubLVs stay inactive), the metadata SubLVs can't get wiped thus failing the creation. As a result, the RaidLV together with its SubLVs is left behind in an inconsistent state. Fix by removing the RaidLV and provide a hint about volume_list reasoning. Resolves: rhbz1161347	2018-03-16 15:57:53 +01:00
Zdenek Kabelac	9553dc7761	activation: separate prioritized counter While prioritized_section() based on raised priority works nicely for standard lvm comman - separate counter is actually needed when it's used in daemons like clvmd/dmeventd where priority stays raised all the time.	2018-03-15 12:30:45 +01:00

1 2 3 4 5 ...

5976 Commits