shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-21 13:34:40 +03:00

Author	SHA1	Message	Date
David Teigland	bee9f4efdd	filter-mpath: work with nvme devices Recognize when a device is nvme, and apply filter-mpath to nvme devices in addition to scsi devices.	2021-02-02 13:01:20 -06:00
David Teigland	450f272b31	devices: support printing the filter that rejects a device Use of this new message function needs to be added to various commands to improve the output.	2020-10-01 12:00:09 -05:00
David Teigland	1570e76233	bcache: use indirection table for fd Add a "device index" (di) for each device, and use this in the bcache api to the rest of lvm. This replaces the file descriptor (fd) in the api. The rest of lvm uses new functions bcache_set_fd(), bcache_clear_fd(), and bcache_change_fd() to control which fd bcache uses for io to a particular device. . lvm opens a dev and gets and fd. fd = open(dev); . lvm passes fd to the bcache layer and gets a di to use in the bcache api for the dev. di = bcache_set_fd(fd); . lvm uses bcache functions, passing di for the dev. bcache_write_bytes(di, ...), etc. . bcache translates di to fd to do io. . lvm closes the device and clears the di/fd bcache state. close(fd); bcache_clear_fd(di); In the bcache layer, a di-to-fd translation table (int *_fd_table) is added. When bcache needs to perform io on a di, it uses _fd_table[di]. In the following commit, lvm will make use of the new bcache_change_fd() function to change the fd that bcache uses for the dev, without dropping cached blocks.	2020-09-18 15:10:11 -05:00
David Teigland	09bc2d0fd1	devices: clean up block size functions Replace calls to the old dev_get_block_size function with calls to the new dev_get_direct_block_size function, and remove the old function.	2019-08-07 11:48:10 -05:00
David Teigland	7f347698e3	Fix rounding writes up to sector size Do this at two levels, although one would be enough to fix the problem seen recently: - Ignore any reported sector size other than 512 of 4096. If either sector size (physical or logical) is reported as 512, then use 512. If neither are reported as 512, and one or the other is reported as 4096, then use 4096. If neither is reported as either 512 or 4096, then use 512. - When rounding up a limited write in bcache to be a multiple of the sector size, check that the resulting write size is not larger than the bcache block itself. (This shouldn't happen if the sector size is 512 or 4096.)	2019-07-26 14:21:08 -05:00
David Teigland	db98a6e362	Additional MD component checking If udev info is missing for a device, (which would indicate if it's an MD component), then do an end-of-device read to check if a PV is an MD component. (This is skipped when using hints since we already know devs in hints are good.) A new config setting md_component_checks can be used to disable the additional end-of-device MD checks, or to always enable end-of-device MD checks. When both hints and udev info are disabled/unavailable, the end of PVs will now be scanned by default. If md devices with end-of-device superblocks are not being used, the extra I/O overhead can be avoided by setting md_component_checks="start".	2019-06-07 13:27:16 -05:00
David Teigland	60bf9c9f33	hints: exclude md components In some cases md components could be included in the hints, so add a check to hint creation to make sure they are excluded.	2019-05-21 11:58:01 -05:00
David Teigland	3ed9256985	remove unused io functions	2019-02-28 10:58:00 -06:00
David Teigland	6620dc9475	add device hints to reduce scanning Save the list of PVs in /run/lvm/hints. These hints are used to reduce scanning in a number of commands to only the PVs on the system, or only the PVs in a requested VG (rather than all devices on the system.)	2019-01-15 10:23:47 -06:00
David Teigland	42f7caf1c2	scan: work around udev problems by avoiding open RDWR udev creates a train wreck of events if we open devices with RDWR. Until we can fix/disable/scrap udev, work around this by opening RDONLY and then closing/reopening RDWR when a write is needed. This invalidates the bcache blocks for the device before writing so it can trigger unnecessary rereading.	2018-06-20 14:08:12 -05:00
David Teigland	328303d4d4	Remove unused device error counting	2018-06-15 14:04:39 -05:00
Joe Thornber	d5da55ed85	device_mapper: remove dbg_malloc. I wrote dbg_malloc before we had valgrind. These days there's just no need.	2018-06-08 13:40:53 +01:00
Joe Thornber	dbba1e9b93	Merge branch 'master' into 2018-05-11-fork-libdm	2018-06-01 13:04:12 +01:00
David Teigland	3c9ed33f83	scan: move warnings about duplicate devices We have been warning about duplicate devices (and disabling lvmetad) immediately when the dup was detected (during label_scan). Move the warnings (and the disabling) to happen later, after label_scan is finished. This lets us avoid an unwanted warning message about duplicates in the special case were md components are eliminated during the duplicate device resolution.	2018-05-21 16:48:02 -05:00
Joe Thornber	89fdc0b588	Merge branch 'master' into 2018-05-11-fork-libdm	2018-05-16 13:43:02 +01:00
Joe Thornber	7f97c7ea9a	build: Don't generate symlinks in include/ dir As we start refactoring the code to break dependencies (see doc/refactoring.txt), I want us to use full paths in the includes (eg, #include "base/data-struct/list.h"). This makes it more obvious when we're breaking abstraction boundaries, eg, including a file in metadata/ from base/	2018-05-14 10:30:20 +01:00
David Teigland	bbb8040456	dev_cache: drop open_list devices are now held open only in bcache, so drop the dev_cache list of open devices which is unused.	2018-05-11 12:47:56 -05:00
David Teigland	57bb46c5e7	filter: use bcache for filter reads Filters are still applied before any device reading or the label scan, but any filter checks that want to read the device are skipped and the device is flagged. After bcache is populated, but before lvm looks for devices (i.e. before label scan), the filters are reapplied to the devices that were flagged above. The filters will then find the data they need in bcache.	2018-05-10 16:03:19 -05:00
David Teigland	6c67c7557c	scan: use separate fd for bcache Create a new dev->bcache_fd that the scanning code owns and is in charge of opening/closing. This prevents other parts of lvm code (which do various open/close) from interfering with the bcache fd. A number of dev_open and dev_close are removed from the reading path since the read path now uses the bcache. With that in place, open(O_EXCL) for pvcreate/pvremove can then be fixed. That wouldn't work previously because of other open fds.	2018-04-20 11:22:46 -05:00
David Teigland	a7cb76ae94	scan: use bcache for label scan and vg read New label_scan function populates bcache for each device on the system. The two read paths are updated to get data from bcache. The bcache is not yet used for writing. bcache blocks for a device are invalidated when the device is written.	2018-04-20 11:19:24 -05:00
Joe Thornber	00f1b208a1	[io paths] Unpick agk's aio stuff	2018-04-20 11:03:58 -05:00
Alasdair G Kergon	d6cabbbc53	device: Fix basic async I/O error handling	2018-02-08 20:19:21 +00:00
Alasdair G Kergon	3e29c80122	device: Queue any aio beyond defined limits.	2018-02-08 20:15:37 +00:00
Alasdair G Kergon	db41fe6c5d	lvmcache: Use asynchronous I/O when scanning devices.	2018-02-08 20:15:29 +00:00
Alasdair G Kergon	8c7bbcfb0f	device: Basic config and setup to support async I/O.	2018-02-08 20:15:14 +00:00
Alasdair G Kergon	7a9af3cd0e	device: Add flag to indicate that a code path can support AIO Until the whole source supports AIO, library code can check for AIO_SUPPORTED_CODE_PATH to determine whether or not it is OK to use AIO.	2018-02-06 01:11:00 +00:00
Alasdair G Kergon	9194610f42	device: Add ioflags parameter to transfer additional state. Flags are set on the initial I/O and passed to any callbacks that may in turn issue further I/O using the inherited flags.	2018-01-21 21:10:23 +00:00
Alasdair G Kergon	c90582344d	device: Add reason to devbuf.	2018-01-15 19:38:18 +00:00
Alasdair G Kergon	1f01eaa612	device: Store offset to data instead of pointer. We want to save the relative offset before we've allocated the buffer's memory.	2018-01-15 19:32:59 +00:00
Alasdair G Kergon	61d3296f2a	device: Reorder device.h before change.	2018-01-15 19:24:01 +00:00
Alasdair G Kergon	6210c1ec28	device: Mark read-only device buffers const.	2018-01-10 19:57:10 +00:00
Alasdair G Kergon	dcb2a5a611	device: Remove some data copying between buffers. Callers that read larger amounts of data now get a pointer to read-only data directly without copying it through an intermediate buffer. This data is owned by the device layer so the callers no longer free it.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	bd0967a4b1	device: Keep the last data buffer read off each device. If there's a second metadata area on device, we record that separately. Note that the memory requirements aren't restricted yet.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	5e7d3ad749	device: Introduce dev_read_callback If it obtains the data, it passes it into the supplied callback function and returns 1. Otherwise the callback receives failed = 1. Updated config_file_read_fd to use this and similarly return the data via a callback fn of its own.	2018-01-06 02:40:12 +00:00
Alasdair G Kergon	946f07af3e	metadata: Use a consistent format for callback fn parameters	2018-01-05 14:24:56 +00:00
Alasdair G Kergon	17649d4ac8	device: Move dev_read memory allocation into device layer. Rename dev_read() to dev_read_buf() - the function that reads data into a supplied buffer. Introduce a new dev_read() that allocates the buffer it returns and switch the important users over to this. No caller may change the returned data. (For now, callers are responsible for freeing it after use, but later the device layer will take full ownership.) dev_read_buf() should only be used for tiny buffers or unimportant code (such as the old disk formats).	2017-12-19 01:31:50 +00:00
Alasdair G Kergon	5f45cb90a7	format_text: Transfer circular buf alloc to device layer. Instead of the caller passing dev_read_circular() a buffer to fill with data, the device layer itself now allocates it.	2017-12-15 22:34:26 +00:00
Alasdair G Kergon	d591d04103	device: Tag I/O for each mda on a device separately in log messages. Mark the first metadata area on each text format PV as MDA_PRIMARY. Pass this information down to the device layer so that when there are two metadata areas on a block device, we can easily distinguish two independent streams of I/O.	2017-12-07 03:48:11 +00:00
Alasdair G Kergon	e4805e4883	device: categorise block i/o Introduce enum dev_io_reason to categorise block device I/O in debug messages so it's obvious what it is for. DEV_IO_SIGNATURES /* Scanning device signatures / DEV_IO_LABEL / LVM PV disk label / DEV_IO_MDA_HEADER / Text format metadata area header / DEV_IO_MDA_CONTENT / Text format metadata area content / DEV_IO_FMT1 / Original LVM1 metadata format / DEV_IO_POOL / Pool metadata format / DEV_IO_LV / Content written to an LV / DEV_IO_LOG / Logging messages */	2017-12-04 23:45:26 +00:00
Alasdair G Kergon	b5314c2a6a	device: Retry open without O_NOATIME if it fails.	2016-05-12 01:05:52 +01:00
Peter Rajnoha	03b0a78640	dev: detect mismatch between devices used and devices assumed for an LV It's possible for an LVM LV to use a device during activation which then differs from device which LVM assumes based on metadata later on. For example, such device mismatch can occur if LVM doesn't have complete view of devices during activation or if filters are misbehaving or they're incorrectly set during activation. This patch adds code that can detect this mismatch by creating VG UUID and LV UUID index while scanning devices for device cache. The VG UUID index maps VG UUID to a device list. Each device in the list has a device layered above as a holder which is an LVM LV device and for which we know the VG UUID (and similarly for LV UUID index). We can acquire VG and LV UUID by reading /sys/block/<dm_dev_name>/dm/uuid. So these indices represent the actual state of PV device use in the system by LVs and then we compare that to what LVM assumes based on metadata. For example: [0] fedora/~ # lsblk /dev/sdq /dev/sdr /dev/sds /dev/sdt NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sdq 65:0 0 104M 0 disk \|-vg-lvol0 253:2 0 200M 0 lvm `-mpath_dev1 253:3 0 104M 0 mpath sdr 65:16 0 104M 0 disk `-mpath_dev1 253:3 0 104M 0 mpath sds 65:32 0 104M 0 disk \|-vg-lvol0 253:2 0 200M 0 lvm `-mpath_dev2 253:4 0 104M 0 mpath sdt 65:48 0 104M 0 disk `-mpath_dev2 253:4 0 104M 0 mpath In this case the vg-lvol0 is mapped onto sdq and sds becauset this is what was available and seen during activation. Then later on, sdr and sdt appeared and mpath devices were created out of sdq+sdr (mpath_dev1) and sds+sdt (mpath_dev2). Now, LVM assumes (correctly) that mpath_dev1 and mpath_dev2 are the PVs that should be used, not the mpath components (sdq/sdr, sds/sdt). [0] fedora/~ # pvs Found duplicate PV xSUix1GJ2SK82ACFuKzFLAQi8xMfFxnO: using /dev/mapper/mpath_dev1 not /dev/sdq Using duplicate PV /dev/mapper/mpath_dev1 from subsystem DM, replacing /dev/sdq Found duplicate PV MvHyMVabtSqr33AbkUrobq1LjP8oiTRm: using /dev/mapper/mpath_dev2 not /dev/sds Using duplicate PV /dev/mapper/mpath_dev2 from subsystem DM, ignoring /dev/sds WARNING: Device mismatch detected for vg/lvol0 which is accessing /dev/sdq, /dev/sds instead of /dev/mapper/mpath_dev1, /dev/mapper/mpath_dev2. PV VG Fmt Attr PSize PFree /dev/mapper/mpath_dev1 vg lvm2 a-- 100.00m 0 /dev/mapper/mpath_dev2 vg lvm2 a-- 100.00m 0	2016-03-21 11:40:40 +01:00
Peter Rajnoha	65d9f742f8	device: add DEV_OPEN_FAILURE flag DEV_OPEN_FAILURE flag is set if the most recent "open" for a device failed and it's unset if any subsequent "open" succeeds.	2016-03-21 11:06:05 +01:00
Peter Rajnoha	d090d6574e	device: also cache device size Add "size" and "size_seqno" to struct device to cache device's size and also to control its lifetime - the cached value is valid as long as the global _dev_size_seqno is equal to the device's size_seqno, otherwise we need to get the size again and cache the new value. This patch also adds new dev_size_seqno_inc() fn for the appropriate parts of the code to increment current global value of _dev_size_seqno and hence to cause all currently cached values for device sizes to be invalidated. The device size is now cached because we're planning to reuse this information for further checks and we want to avoid checking it more than necessary to save resources.	2016-01-22 14:13:34 +01:00
Zdenek Kabelac	fcbef05aae	doc: change fsf address Hmm rpmlint suggest fsf is using a different address these days, so lets keep it up-to-date	2016-01-21 12:11:37 +01:00
Zdenek Kabelac	e262d5e596	cleanup: keep using enum typedef Using enum instead of unsigned.	2015-11-09 10:22:52 +01:00
Alasdair G Kergon	623b46a17d	device: Don't try to close config file on failure. $file: open failed: Permission denied Failed to load config file $file Attempt to close device '$file' which is not open.	2015-08-17 12:57:01 +01:00
Zdenek Kabelac	6f68f4364b	devices: avoid extra open() syscall If the device is already opened by lvm's device cache, avoid extra syscall opening devices for obtaining its size.	2015-03-06 13:17:39 +01:00
Peter Rajnoha	fbfde21e7c	device: add infrastructure to support external device info	2015-01-30 13:01:12 +01:00
Peter Rajnoha	5abdb52fdc	report: select: refactor: move str_list to libdm The list of strings is used quite frequently and we'd like to reuse this simple structure for report selection support too. Make it part of libdevmapper for general reuse throughout the code. This also simplifies the LVM code a bit since we don't need to include and manage lvm-types.h anymore (the string list was the only structure defined there).	2014-06-17 16:27:20 +02:00
Peter Rajnoha	32080c4ff7	device: add physical block size info and make sure VG extent size >= PV's phys. block size	2013-12-12 15:02:36 +01:00

1 2 3

102 Commits