shaba/lvm2 - lvm2 - Gitea: Git with a cup of tea

shaba/lvm2

mirror of git://sourceware.org/git/lvm2.git synced 2024-12-22 17:35:59 +03:00

Author	SHA1	Message	Date
Zdenek Kabelac	2c597c73a8	dev-cache: better code reuse for _add_alias Move path copying into _add_alish together with hashing. Remove duplicated code.	2021-02-08 23:43:38 +01:00
Zdenek Kabelac	be9b731f44	dev-cache: check for nvme name while adding alias Instead of repeated list retest, compare name once during add of alias.	2021-02-08 23:43:38 +01:00
David Teigland	bee9f4efdd	filter-mpath: work with nvme devices Recognize when a device is nvme, and apply filter-mpath to nvme devices in addition to scsi devices.	2021-02-02 13:01:20 -06:00
David Teigland	37227b8ad6	devs: remove invalid path name aliases Make dev_cache_get() verify aliases and drop any that are invalid before returning a dev for a given name.	2021-01-15 16:31:50 -06:00
David Teigland	c601ec0d6e	filters: allow filter wipe for one device as passes_filter already does	2020-10-21 16:24:16 -05:00
Zdenek Kabelac	dd8212365d	debug: update messages	2020-10-02 21:04:16 +02:00
Zdenek Kabelac	b44db5d1a7	bcache: use flexible arrays Cleanup, allocate whole struct with a single malloc call.	2020-10-02 21:00:26 +02:00
Zdenek Kabelac	b3c7a2b3f0	bcache: support interrupts when waiting on IO Since lvm2 normally block signals during protected phase where it does not want to be interrupted. Support interruptible processing when allowed in section between sigint_allow() ... sigint_restore()) and let the 'io_getenvents()' finish with EINTR.	2020-10-02 20:57:50 +02:00
Zdenek Kabelac	0fe58fc54f	bcache: fix busy loop with too many errors When bcache tries to write data to a faulty device, it may get out of caching blocks and then just busy-loops on a CPU - so this check protects this by checking if there is already max_io (~64) errored blocks.	2020-10-02 20:56:55 +02:00
Zdenek Kabelac	41f9e372c0	bcache: fix waiting problem for completed IO Call _wait_all() which does check whether there is still some pending IO before sleep. Otherwise it may happen our submitted IO operations have been already dispatched and this call then endlessly waits for IO which are all done. This can be reproduced when device returns quickly errors on write requests.	2020-10-02 20:53:41 +02:00
David Teigland	450f272b31	devices: support printing the filter that rejects a device Use of this new message function needs to be added to various commands to improve the output.	2020-10-01 12:00:09 -05:00
Zdenek Kabelac	6728788bf5	debug: remove stacktrace on regular path Here _insert is expected to also fail, so just regular 'return 0'.	2020-09-29 10:43:56 +02:00
Zdenek Kabelac	6c769eb460	bache: fix error return value Return 0 as failure (as checked for). Also add INTERNAL_ERROR if 'DI' would be -1.	2020-09-19 23:00:50 +02:00
David Teigland	1570e76233	bcache: use indirection table for fd Add a "device index" (di) for each device, and use this in the bcache api to the rest of lvm. This replaces the file descriptor (fd) in the api. The rest of lvm uses new functions bcache_set_fd(), bcache_clear_fd(), and bcache_change_fd() to control which fd bcache uses for io to a particular device. . lvm opens a dev and gets and fd. fd = open(dev); . lvm passes fd to the bcache layer and gets a di to use in the bcache api for the dev. di = bcache_set_fd(fd); . lvm uses bcache functions, passing di for the dev. bcache_write_bytes(di, ...), etc. . bcache translates di to fd to do io. . lvm closes the device and clears the di/fd bcache state. close(fd); bcache_clear_fd(di); In the bcache layer, a di-to-fd translation table (int *_fd_table) is added. When bcache needs to perform io on a di, it uses _fd_table[di]. In the following commit, lvm will make use of the new bcache_change_fd() function to change the fd that bcache uses for the dev, without dropping cached blocks.	2020-09-18 15:10:11 -05:00
Zdenek Kabelac	a481f42630	cov: always initialized values Make sure values are initialized for all possible paths.	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	fd96f1014b	gcc: zero-sized array to fexlible array C99 Switch remaining zero sized struct to flexible arrays to be C99 complient. These simple rules should apply: - The incomplete array type must be the last element within the structure. - There cannot be an array of structures that contain a flexible array member. - Structures that contain a flexible array member cannot be used as a member of another structure. - The structure must contain at least one named member in addition to the flexible array member. Although some of the code pieces should be still improved.	2020-09-01 17:57:50 +02:00
Zdenek Kabelac	7880896f0d	gcc: calc size in compile time	2020-08-28 21:43:02 +02:00
Zdenek Kabelac	ce202c3b1c	gcc: keep unsigned arithmetic Avoid conversion to int.	2020-08-28 21:43:02 +02:00
David Teigland	00c9a788cc	devices: simplify md superblock checking code	2020-07-09 10:48:34 -05:00
David Teigland	23774f997e	devices: detect md ddf and imsm superblocks	2020-07-09 10:48:21 -05:00
Zdenek Kabelac	6eb9eba59b	bcache: support longer writes When initiated larger write request, it may have happened, bcache got out of free chunks - fix the loop, that is supposed to wait until next free chunk becomes avain available.	2020-06-24 15:01:03 +02:00
David Teigland	8e2938c963	improve get_fs_block_size string to number	2020-06-11 15:05:47 -05:00
David Teigland	9fbad5bb0f	fix libblkid BLOCK_SIZE check	2020-06-11 12:43:07 -05:00
Zhao Heming	b59127a838	Change dev->bcache_fd default value from 0 to -1 This fix can avoid bcache_fd will mistakenly open/close in later. Signed-off-by: Zhao Heming <heming.zhao@suse.com>	2020-06-01 12:22:15 -05:00
David Teigland	2f29765e7f	devs: add some checks for a dev with no path name It's possible for a dev-cache entry to remain after all paths for it have been removed, and other parts of the code expect that a dev always has a name. A better fix may be to remove a device from dev-cache after all paths to it have been removed.	2020-05-13 16:26:26 -05:00
David Teigland	d9e8895a96	Allow dm-integrity to be used for raid images dm-integrity stores checksums of the data written to an LV, and returns an error if data read from the LV does not match the previously saved checksum. When used on raid images, dm-raid will correct the error by reading the block from another image, and the device user sees no error. The integrity metadata (checksums) are stored on an internal LV allocated by lvm for each linear image. The internal LV is allocated on the same PV as the image. Create a raid LV with an integrity layer over each raid image (for raid levels 1,4,5,6,10): lvcreate --type raidN --raidintegrity y [options] Add an integrity layer to images of an existing raid LV: lvconvert --raidintegrity y LV Remove the integrity layer from images of a raid LV: lvconvert --raidintegrity n LV Settings Use --raidintegritymode journal\|bitmap (journal is default) to configure the method used by dm-integrity to ensure crash consistency. Initialization When integrity is added to an LV, the kernel needs to initialize the integrity metadata/checksums for all blocks in the LV. The data corruption checking performed by dm-integrity will only operate on areas of the LV that are already initialized. The progress of integrity initialization is reported by the "syncpercent" LV reporting field (and under the Cpy%Sync lvs column.) Example: create a raid1 LV with integrity: $ lvcreate --type raid1 -m1 --raidintegrity y -n rr -L1G foo Creating integrity metadata LV rr_rimage_0_imeta with size 12.00 MiB. Logical volume "rr_rimage_0_imeta" created. Creating integrity metadata LV rr_rimage_1_imeta with size 12.00 MiB. Logical volume "rr_rimage_1_imeta" created. Logical volume "rr" created. $ lvs -a foo LV VG Attr LSize Origin Cpy%Sync rr foo rwi-a-r--- 1.00g 4.93 [rr_rimage_0] foo gwi-aor--- 1.00g [rr_rimage_0_iorig] 41.02 [rr_rimage_0_imeta] foo ewi-ao---- 12.00m [rr_rimage_0_iorig] foo -wi-ao---- 1.00g [rr_rimage_1] foo gwi-aor--- 1.00g [rr_rimage_1_iorig] 39.45 [rr_rimage_1_imeta] foo ewi-ao---- 12.00m [rr_rimage_1_iorig] foo -wi-ao---- 1.00g [rr_rmeta_0] foo ewi-aor--- 4.00m [rr_rmeta_1] foo ewi-aor--- 4.00m	2020-04-15 12:10:32 -05:00
David Teigland	957904933b	reduce device path error messsages When /dev entries or sysfs entries are changing due to concurrent lvm commands, it can cause warning/error messages about missing paths.	2020-03-12 10:18:51 -05:00
Zdenek Kabelac	f439716b75	container_of: use offsetof from stddef Use standardized offsetof() macro from stddef. Helps to build valid code with latest gcc10 with -O2.	2020-03-05 17:38:55 +01:00
Zdenek Kabelac	c5e5ae4c95	bcache: fix memleak on error path clang: free io on error path.	2020-02-04 17:22:06 +01:00
Zdenek Kabelac	cff16b062b	debug: avoid to slashes in debug message	2019-12-10 15:44:16 +01:00
David Teigland	56a295f78c	bcache: add invalidate_bytes function	2019-11-26 16:52:28 -06:00
Zdenek Kabelac	43f149526d	devtype: simplify code Update code with simpler form and check for fclose().	2019-11-14 18:06:14 +01:00
Joe Thornber	25e7bf021a	[bcache] bcache_invalidate_fd, only remove prefixes on success.	2019-10-29 15:21:11 +00:00
Joe Thornber	7e8296f478	[bcache] reverse earlier patch. It broke some unit tests, for v. little benefit	2019-10-29 15:14:07 +00:00
Joe Thornber	2b3c39e402	[bcache] pass up the error from io_submit rather than using generic -EIO Author: Heming Zhao	2019-10-29 10:39:20 +00:00
Joe Thornber	2938b4dcca	[bcache] add bcache_abort() This gives us a way to cope with write failures.	2019-10-28 15:00:53 +00:00
Zdenek Kabelac	a7563dc6a1	gcc: older version can't see udev is always set	2019-10-22 13:39:22 +02:00
David Teigland	fcbffbdbc0	bcache: change log level for prefetch message The "new new blocks" message was printed as an error but it's not an error condition.	2019-09-03 12:02:09 -05:00
David Teigland	0534cd9cd4	pvscan: disable sleeping and retrying for udev When systemd is running pvscans, udev may not be entirely initialized, so the pvscan should not sleep and retry waiting for udev info.	2019-08-16 14:41:26 -05:00
David Teigland	eb6aa5fefe	devices: put ifdef around BLKPBSZGET BLKPBSZGET is not defined before kernel version 2.6.32 (e.g. rhel5)	2019-08-08 15:45:03 -05:00
David Teigland	09bc2d0fd1	devices: clean up block size functions Replace calls to the old dev_get_block_size function with calls to the new dev_get_direct_block_size function, and remove the old function.	2019-08-07 11:48:10 -05:00
David Teigland	7f347698e3	Fix rounding writes up to sector size Do this at two levels, although one would be enough to fix the problem seen recently: - Ignore any reported sector size other than 512 of 4096. If either sector size (physical or logical) is reported as 512, then use 512. If neither are reported as 512, and one or the other is reported as 4096, then use 4096. If neither is reported as either 512 or 4096, then use 512. - When rounding up a limited write in bcache to be a multiple of the sector size, check that the resulting write size is not larger than the bcache block itself. (This shouldn't happen if the sector size is 512 or 4096.)	2019-07-26 14:21:08 -05:00
David Teigland	4567c6a2b2	enable full md component detection at the right time An active md device with an end superblock causes lvm to enable full md component detection. This was being done within the filter loop instead of before, so the full filtering of some devs could be missed. Also incorporate the recently added config setting that controls the md component detection.	2019-07-10 13:30:50 -05:00
David Teigland	db98a6e362	Additional MD component checking If udev info is missing for a device, (which would indicate if it's an MD component), then do an end-of-device read to check if a PV is an MD component. (This is skipped when using hints since we already know devs in hints are good.) A new config setting md_component_checks can be used to disable the additional end-of-device MD checks, or to always enable end-of-device MD checks. When both hints and udev info are disabled/unavailable, the end of PVs will now be scanned by default. If md devices with end-of-device superblocks are not being used, the extra I/O overhead can be avoided by setting md_component_checks="start".	2019-06-07 13:27:16 -05:00
David Teigland	60bf9c9f33	hints: exclude md components In some cases md components could be included in the hints, so add a check to hint creation to make sure they are excluded.	2019-05-21 11:58:01 -05:00
David Teigland	19ef399ea7	devs: rename dev_is_md dev_is_md_component The naming was confusing and misleading since it it's testing if a device is an md component, not an md device.	2019-05-21 11:44:39 -05:00
David Teigland	6f18186bfd	pvscan: print more reasons for ignoring devices	2019-04-05 15:48:12 -05:00
David Teigland	3ed9256985	remove unused io functions	2019-02-28 10:58:00 -06:00
David Teigland	3ebce8dbd2	apply obtain_device_list_from_udev to all libudev usage udev_dev_is_md_component and udev_dev_is_mpath_component are not used for obtaining the device list, but they still use libudev for device info. When there are problems with udev, these functions can get stuck. So, use the existing obtain_device_list_from_udev config setting to also control whether these "is component" functions are used, which gives us a way to avoid using libudev entirely when it's causing problems.	2019-02-05 10:15:40 -06:00
David Teigland	6620dc9475	add device hints to reduce scanning Save the list of PVs in /run/lvm/hints. These hints are used to reduce scanning in a number of commands to only the PVs on the system, or only the PVs in a requested VG (rather than all devices on the system.)	2019-01-15 10:23:47 -06:00
Zdenek Kabelac	2724a09e58	debug: tracing close errors	2018-12-21 21:45:08 +01:00
Zdenek Kabelac	82f66834ef	bcache: fix memory leak on error path Coverity noticed missing free of io struct on error path.	2018-12-21 21:45:03 +01:00
Zdenek Kabelac	cc5cfb88d7	cleanup: some local headers first	2018-12-14 15:14:48 +01:00
Zdenek Kabelac	0b19387dae	headers: use configure.h as 1st. header Ensure configure.h is always 1st. included header. Maybe we could eventually introduce gcc -include option, but for now this better uses dependency tracking. Also move _REENTRANT and _GNU_SOURCE into configure.h so it doesn't need to be present in various source files. This ensures consistent compilation of headers like stdio.h since it may produce different declaration.	2018-12-14 15:09:13 +01:00
David Teigland	a063d2d123	devs: use udev info to improve md component detection Use udev info to supplement native md component detection.	2018-12-03 12:58:28 -06:00
Peter Rajnoha	cb04b84c79	scan: md metadata version 0.90 is at the end of disk commit `de28637` scan: use full md filter when md 1.0 devices are present missed the fact that md superblock version 0.90 also puts metadata at the end of the device, so the full md filter needs to be used when either 0.90 or 1.0 is present.	2018-11-29 12:35:54 -06:00
David Teigland	7e721ca048	bcache: sync io fixes fix lseek error check fix read/write error checks handle zero return from read and write don't return an error for short io fix partial read/write loop	2018-11-20 09:19:18 -06:00
David Teigland	ca66d52032	io: use sync io if aio fails io_setup() for aio may fail if a system has reached the aio request limit. In this case, fall back to using sync io. Also, lvm use of aio can be disabled entirely with config setting global/use_aio=0. The system limit for aio requests can be seen from /proc/sys/fs/aio-max-nr The current usage of aio requests can be seen from /proc/sys/fs/aio-nr The system limit for aio requests can be increased by setting fs.aio-max-nr using sysctl. Also add last-byte limit to the sync io code.	2018-11-20 09:13:20 -06:00
David Teigland	1dc5603f73	devices: reuse bcache fd when getting block size This avoids an unnecessary open() on the device.	2018-11-06 16:36:18 -06:00
David Teigland	3ae5569570	Add dm-writecache support dm-writecache is used like dm-cache with a standard LV as the cache. $ lvcreate -n main -L 128M -an foo /dev/loop0 $ lvcreate -n fast -L 32M -an foo /dev/pmem0 $ lvconvert --type writecache --cachepool fast foo/main $ lvs -a foo -o+devices LV VG Attr LSize Origin Devices [fast] foo -wi------- 32.00m /dev/pmem0(0) main foo Cwi------- 128.00m [main_wcorig] main_wcorig(0) [main_wcorig] foo -wi------- 128.00m /dev/loop0(0) $ lvchange -ay foo/main $ dmsetup table foo-main_wcorig: 0 262144 linear 7:0 2048 foo-main: 0 262144 writecache p 253:4 253:3 4096 0 foo-fast: 0 65536 linear 259:0 2048 $ lvchange -an foo/main $ lvconvert --splitcache foo/main $ lvs -a foo -o+devices LV VG Attr LSize Devices fast foo -wi------- 32.00m /dev/pmem0(0) main foo -wi------- 128.00m /dev/loop0(0)	2018-11-06 14:18:41 -06:00
Zdenek Kabelac	9a6f0e64f9	debug: missing backtrace	2018-11-05 17:25:11 +01:00
Zdenek Kabelac	aa8b2d6a0f	cleanup: move cast to det_t into MKDEV macro	2018-11-05 17:25:11 +01:00
Zdenek Kabelac	70e3d0a613	cov: remove unused assigns	2018-11-05 17:25:11 +01:00
David Teigland	aecf542126	metadata: prevent writing beyond metadata area lvm uses a bcache block size of 128K. A bcache block at the end of the metadata area will overlap the PEs from which LVs are allocated. How much depends on alignments. When lvm reads and writes one of these bcache blocks to update VG metadata, it can also be reading and writing PEs that belong to an LV. If these overlapping PEs are being written to by the LV user (e.g. filesystem) at the same time that lvm is modifying VG metadata in the overlapping bcache block, then the user's updates to the PEs can be lost. This patch is a quick hack to prevent lvm from writing past the end of the metadata area.	2018-10-29 16:53:17 -05:00
Zdenek Kabelac	fdd76da33d	cov: drop uneeded header files	2018-10-15 17:49:44 +02:00
Zdenek Kabelac	b1ff52ca14	cov: check dev_close_immediate Function can report log_error() on fail path.	2018-10-15 17:49:44 +02:00
Zdenek Kabelac	eb566e034f	cov: add check for positive value As pgsize parameter for _init_free_list() can't be negative, report problem in case for any reason we would get negative number.	2018-10-15 17:49:44 +02:00
Zdenek Kabelac	9b85ecb85b	cov: fix memleak on bcache io error path Drop allocated IO. merge free bache	2018-10-15 17:49:44 +02:00
Joe Thornber	3255e384db	[bcache] Remove unused 'hash' field from blocks. We use a radix tree these days rather than a hash table.	2018-09-11 13:17:29 +01:00
David Teigland	fade9ca3b6	bcache: reduce MAX_IO to 256 This is the number of concurrent async io requests that the scan layer will submit to the bcache layer. There will be an open fd for each of these, so it is best to keep this well below the default limit for max open files (1024), otherwise lvm may get EMFILE from open(2) when there are around 1024 devices to scan on the system.	2018-08-24 14:55:12 -05:00
Zdenek Kabelac	c8b4f9414c	dev_io: no discard in testmode When lvm2 command is executed in test mode, discard ioctl is skipped. This may cause even data-loose in case, issuing discard for released areas was enabled and user 'tested' lvreduce.	2018-07-09 00:19:30 +02:00
Marian Csontos	a14f21bf1d	bcache: Fix null pointer dereferencing	2018-06-26 17:04:18 +02:00
Zdenek Kabelac	c728d88e11	build: include configure.h It's important to consistenly include configure.h as the 1st. header. It containts #defines influencing behavior of other included header files.	2018-06-22 23:11:44 +02:00
David Teigland	15826214f9	Remove code for using files as devices It appears this has not been used in a long time, and it seems to have no point since loop devices exist.	2018-06-21 09:33:21 -05:00
David Teigland	42f7caf1c2	scan: work around udev problems by avoiding open RDWR udev creates a train wreck of events if we open devices with RDWR. Until we can fix/disable/scrap udev, work around this by opening RDONLY and then closing/reopening RDWR when a write is needed. This invalidates the bcache blocks for the device before writing so it can trigger unnecessary rereading.	2018-06-20 14:08:12 -05:00
David Teigland	f85a010a6b	bcache: remove extraneous error message an error from io_submit is already recognized by the caller like errors during completion.	2018-06-18 12:02:22 -05:00
David Teigland	328303d4d4	Remove unused device error counting	2018-06-15 14:04:39 -05:00
David Teigland	3fd75d1bcd	scan: use full md filter when md 1.0 devices are present The md filter can operate in two native modes: - normal: reads only the start of each device - full: reads both the start and end of each device md 1.0 devices place the superblock at the end of the device, so components of this version will only be identified and excluded when lvm uses the full md filter. Previously, the full md filter was only used in commands that could write to the device. Now, the full md filter is also applied when there is an md 1.0 device present on the system. This means the 'pvs' command can avoid displaying md 1.0 components (at the cost of doubling the i/o to every device on the system.) (The md filter can operate in a third mode, using udev, but this is disabled by default because there have been problems with reliability of the info returned from udev.)	2018-06-15 12:21:25 -05:00
David Teigland	8eab37593e	Add cmd arg to more functions so that it can be used in the filter code	2018-06-15 11:03:55 -05:00
Joe Thornber	d5da55ed85	device_mapper: remove dbg_malloc. I wrote dbg_malloc before we had valgrind. These days there's just no need.	2018-06-08 13:40:53 +01:00
Joe Thornber	286c1ba336	device_mapper: rename libdevmapper.h -> all.h I'm paranoid a file will include the global one in /usr/include by accident.	2018-06-08 12:31:45 +01:00
David Teigland	1539e51721	devices: clean up io error messages Remove the io error message from bcache.c since it is not very useful without the device path. Make the io error messages from dev_read_bytes/dev_write_bytes more user friendly.	2018-06-07 16:17:04 +01:00
Joe Thornber	dbba1e9b93	Merge branch 'master' into 2018-05-11-fork-libdm	2018-06-01 13:04:12 +01:00
Joe Thornber	d4d39d0f90	Merge branch 'master' into 2018-05-30-bcache-radix-tree	2018-05-31 16:36:04 +01:00
David Teigland	6d14d5d16b	scan: removed failed paths for devices Drop a device path when the scan fails to open it.	2018-05-30 09:05:18 -05:00
Joe Thornber	7635df8cce	bcache: switch to storing blocks in a radix tree. Rather than a hash table. This will make invalidate_fd() more efficient since we can iterate just those blocks that are on a particular dev.	2018-05-30 14:17:26 +01:00
David Teigland	28c8e95d19	scan: refresh paths and retry open If scanning fails to open any devices, refresh the device paths in dev cache, and retry the opens.	2018-05-25 13:09:07 -05:00
David Teigland	3c9ed33f83	scan: move warnings about duplicate devices We have been warning about duplicate devices (and disabling lvmetad) immediately when the dup was detected (during label_scan). Move the warnings (and the disabling) to happen later, after label_scan is finished. This lets us avoid an unwanted warning message about duplicates in the special case were md components are eliminated during the duplicate device resolution.	2018-05-21 16:48:02 -05:00
Joe Thornber	5052970da3	bcache: Don't call sysconf for every io	2018-05-17 10:05:10 +01:00
Alex Bennée	c6ca81a38d	bcache: don't use PAGE_SIZE compile const PAGE_SIZE is not a compile time constant. Use sysconf instead like elsewhere in the code. Signed-off-by: Alex Bennée <alex.bennee@linaro.org>	2018-05-17 10:38:16 +02:00
Joe Thornber	89fdc0b588	Merge branch 'master' into 2018-05-11-fork-libdm	2018-05-16 13:43:02 +01:00
Joe Thornber	ccc35e2647	device-mapper: Fork libdm internally. The device-mapper directory now holds a copy of libdm source. At the moment this code is identical to libdm. Over time code will migrate out to appropriate places (see doc/refactoring.txt). The libdm directory still exists, and contains the source for the libdevmapper shared library, which we will continue to ship (though not neccessarily update). All code using libdm should now use the version in device-mapper.	2018-05-16 13:00:50 +01:00
Joe Thornber	e296f784c9	Merge branch 'master' of git://sourceware.org/git/lvm2	2018-05-16 10:11:58 +01:00
Joe Thornber	df2acbbb97	bcache: nr_ios_pending wasn't being incremented ... but it was being decremented on completion. Which meant it wrapped, and no prefetches were ever issued after the first completion.	2018-05-16 10:09:17 +01:00
Joe Thornber	7f97c7ea9a	build: Don't generate symlinks in include/ dir As we start refactoring the code to break dependencies (see doc/refactoring.txt), I want us to use full paths in the includes (eg, #include "base/data-struct/list.h"). This makes it more obvious when we're breaking abstraction boundaries, eg, including a file in metadata/ from base/	2018-05-14 10:30:20 +01:00
Zdenek Kabelac	ac768a9d2b	bcache: do not use libdm header files Logging for libdm differs from lvm logging - keep using consisten logging function calls.	2018-05-12 18:18:23 +02:00
David Teigland	09fcc8eaa8	scan: ignore duplicates that are md component devs md devices using an older superblock version have superblocks at the end of the md device. For commands that skip reading the end of devices during filtering, the md component devs will be scanned, and will appear as duplicate PVs to the original md device. Remove these md components from the list of unused duplicate devices, so they are treated as if they had been ignored during filtering. This avoids the restrictions that are placed on using PVs with duplicates.	2018-05-11 15:52:22 -05:00
David Teigland	73578e36fa	dev_cache: remove the lvmcache check when closing fd This is no longer used since devices are not held open in dev_cache.	2018-05-11 14:30:10 -05:00
David Teigland	3e3cb22f2a	dev_cache: fix close in utility functions All these functions are now used as utilities, e.g. for ioctl (not for io), and need to open/close the device each time they are called. (Many of the opens can probably be eliminated by just using the bcache fd for the ioctl.)	2018-05-11 14:25:08 -05:00
David Teigland	b5d9914628	devs: recognize md devices in subsystem check If md components appear as duplicate PVs, let the existing subsystem check recognize the md device.	2018-05-11 14:00:19 -05:00
David Teigland	ccab54677c	dev_cache: fix close in dev_get_block_size	2018-05-11 13:53:19 -05:00
David Teigland	bbb8040456	dev_cache: drop open_list devices are now held open only in bcache, so drop the dev_cache list of open devices which is unused.	2018-05-11 12:47:56 -05:00
Joe Thornber	3b02b35c3e	Merge branch 'master' of git+ssh://sourceware.org/git/lvm2	2018-05-11 05:39:27 +01:00
Joe Thornber	5f780813f2	bcache/sync io engine: handle short ios	2018-05-11 05:37:47 +01:00
David Teigland	57bb46c5e7	filter: use bcache for filter reads Filters are still applied before any device reading or the label scan, but any filter checks that want to read the device are skipped and the device is flagged. After bcache is populated, but before lvm looks for devices (i.e. before label scan), the filters are reapplied to the devices that were flagged above. The filters will then find the data they need in bcache.	2018-05-10 16:03:19 -05:00
Joe Thornber	ae50374811	bcache: Add sync io engine Something to fall back to when testing.	2018-05-10 14:29:26 +01:00
Joe Thornber	67b80e2d9d	bcache: knock out err param. Dave used this for debugging. Not needed in general.	2018-05-10 13:26:08 +01:00
Joe Thornber	1c5c99afce	bcache-utils: bcache_set_bytes()	2018-05-09 11:05:29 +01:00
Joe Thornber	dfc320f5b8	bcache-utils: rewrite They take care to avoid redundant reads now.	2018-05-03 11:36:29 +01:00
Joe Thornber	2688aafefb	bcache: rename bcache_write_zeroes() -> bcache_zero_bytes() Now matches the other util functions: bcache_{prefetch,read,write,zero}_bytes()	2018-05-03 10:21:14 +01:00
Joe Thornber	8b755f1e04	bcache: rewrite bcache_write_zeros() It now uses GF_ZERO to avoid reading blocks that are going to be completely zeroed.	2018-05-03 10:14:56 +01:00
Joe Thornber	dc30d4b2f2	bcache: switch off_t -> uint64_t We always want it to be 64bit	2018-05-03 09:37:43 +01:00
Joe Thornber	efad84ebc2	bcache: Move the utils to a separate file. This makes it clearer that they don't access the cache internals.	2018-05-03 09:34:41 +01:00
Joe Thornber	b3c41bce3d	bcache: add bcache_block_sectors() query fn	2018-05-03 09:33:55 +01:00
Joe Thornber	65912ce44d	bcache: add a comment	2018-05-03 09:21:10 +01:00
Joe Thornber	90d0ff6636	bcache: reorder includes in .c file too	2018-05-02 19:45:06 +01:00
Joe Thornber	8fd300f7df	device/bcache: reorder includes	2018-05-02 18:59:43 +01:00
David Teigland	24e7745d7a	devices: ignore lvm1 and pool devices	2018-05-01 15:18:47 -05:00
Joe Thornber	bfc61a9543	bcache: squash some warnings on rhel6	2018-05-01 13:21:53 +01:00
Joe Thornber	f564e78d98	bcache: rewrite bcache_{write,zero}_bytes These are utility functions so should only use the public interface. Also write_bytes was flushing, which will kill performance.	2018-05-01 12:07:33 +01:00
Joe Thornber	e890c37704	[bcache] Some work on bcache_invalidate() bcache_invalidate() now returns a bool to indicate success. If fails if the block is currently held, or the block is dirty and writeback fails. Added a bunch of unit tests for the invalidate functions. Fixed some bugs to do with invalidating errored blocks.	2018-04-27 10:56:13 +01:00
Joe Thornber	1c97fda425	[bcache] get all unit tests passing again	2018-04-26 13:13:27 +01:00
Zdenek Kabelac	fcdac700f9	gcc: remove duplicate typedef	2018-04-23 22:42:18 +02:00
David Teigland	c0973e70a5	dev_cache: clean up scan Pull out all of the twisted logic and simply call dev_cache_scan at the start of the command prior to label scan.	2018-04-20 11:22:48 -05:00
David Teigland	6d05859862	bcache: let caller see an error	2018-04-20 11:22:48 -05:00
David Teigland	570c6239ee	bcache: fix error handling The error handling code wasn't working, but it appears that just removing it is what we need. The doesn't really need any different behavior related to bcache blocks on an io error, it just wants to know if there was an error.	2018-04-20 11:22:47 -05:00
David Teigland	4331182964	bcache: add some error messages for debugging	2018-04-20 11:22:47 -05:00
David Teigland	e49b114f7e	bcache: use wrappers for bcache read write in lvm Using a wrapper makes it easier to disable bcache if needed.	2018-04-20 11:22:47 -05:00
David Teigland	8065492046	bcache: do all writes through bcache	2018-04-20 11:22:47 -05:00
David Teigland	8b26a007b1	misc bcache fixes from ejt	2018-04-20 11:22:47 -05:00
David Teigland	6c67c7557c	scan: use separate fd for bcache Create a new dev->bcache_fd that the scanning code owns and is in charge of opening/closing. This prevents other parts of lvm code (which do various open/close) from interfering with the bcache fd. A number of dev_open and dev_close are removed from the reading path since the read path now uses the bcache. With that in place, open(O_EXCL) for pvcreate/pvremove can then be fixed. That wouldn't work previously because of other open fds.	2018-04-20 11:22:46 -05:00
David Teigland	a7cb76ae94	scan: use bcache for label scan and vg read New label_scan function populates bcache for each device on the system. The two read paths are updated to get data from bcache. The bcache is not yet used for writing. bcache blocks for a device are invalidated when the device is written.	2018-04-20 11:19:24 -05:00
David Teigland	93fc937429	[device/bcache] bcache_read_bytes should put blocks	2018-04-20 11:12:50 -05:00
David Teigland	7be54bd687	[device/bcache] fix min() function	2018-04-20 11:12:50 -05:00
David Teigland	d9e6298edb	[device/bcache] fix missing max_io fn in bcache async engine	2018-04-20 11:12:50 -05:00
Joe Thornber	dc8034f5eb	[device/bcache] more work on bcache	2018-04-20 11:12:50 -05:00
Joe Thornber	6a57ed17a2	[device/bcache] add bcache_prefetch_bytes() and bcache_read_bytes() Not tested yet.	2018-04-20 11:12:50 -05:00
Joe Thornber	467adfa082	[device/bcache] More tests and some bug fixes	2018-04-20 11:12:50 -05:00
Joe Thornber	19647d1cd4	[device/bcache] fix bug in _alloc_block	2018-04-20 11:12:50 -05:00
Joe Thornber	1563b93691	[device/bcache] Add bcache_max_prefetches() Ignore prefetches if max io is in flight.	2018-04-20 11:12:50 -05:00
Joe Thornber	c4c4acfd42	[device/bcache] Add a couple of invalidate methods	2018-04-20 11:12:50 -05:00
Joe Thornber	0f0eb04edb	[device/bcache] some more work on bcache	2018-04-20 11:12:50 -05:00
Joe Thornber	46867a45d2	[device/bcache] stub a unit test	2018-04-20 11:12:50 -05:00
Joe Thornber	da7e13ef88	[lib/device/bcache] Tweaks after Kabi's review	2018-04-20 11:10:45 -05:00
Joe Thornber	acb42ec465	[device/bcache] Initial code drop. Compiles. Not written tests yet.	2018-04-20 11:10:45 -05:00
Joe Thornber	00f1b208a1	[io paths] Unpick agk's aio stuff	2018-04-20 11:03:58 -05:00
Zdenek Kabelac	e878c3fc32	cleanup: correct casting	2018-04-20 12:17:01 +02:00
Zdenek Kabelac	f2d0eefa77	coverity: make use of defined variable Since we declare 'r', let's use the value for something.	2018-03-17 23:33:58 +01:00
Zdenek Kabelac	285413b502	cleanup: missing dots and indent	2018-03-15 11:01:04 +01:00
Zdenek Kabelac	70ad633638	devcache: add reason and always log_error With these read errors it's useful to know the reason. Also avoid to log error just once so we know exactly how many times we did failing read. On the other hand reduce repeated log_error() on code 'backtrace' path and change severity of message to just log_debug() so the actual read error is printed once for one read.	2018-03-15 10:50:28 +01:00
Zdenek Kabelac	b6e7a0b490	cleanup: more usage of dm_strncpy Use existing wrapper function arournd strncpy + buf[] = 0;	2018-03-06 15:40:34 +01:00
Zdenek Kabelac	6b48868cf0	io: keep 64b arithmetic Widen to 64b arithmetic from start.	2018-02-28 21:05:18 +01:00
Alasdair G Kergon	d6cabbbc53	device: Fix basic async I/O error handling	2018-02-08 20:19:21 +00:00
Alasdair G Kergon	3e29c80122	device: Queue any aio beyond defined limits.	2018-02-08 20:15:37 +00:00
Alasdair G Kergon	db41fe6c5d	lvmcache: Use asynchronous I/O when scanning devices.	2018-02-08 20:15:29 +00:00
Alasdair G Kergon	8c7bbcfb0f	device: Basic config and setup to support async I/O.	2018-02-08 20:15:14 +00:00
Alasdair G Kergon	7a9af3cd0e	device: Add flag to indicate that a code path can support AIO Until the whole source supports AIO, library code can check for AIO_SUPPORTED_CODE_PATH to determine whether or not it is OK to use AIO.	2018-02-06 01:11:00 +00:00
Alasdair G Kergon	e869a52cc4	callbacks: Miscellaneous fixes for recent changes	2018-02-06 01:09:39 +00:00
Zdenek Kabelac	a1cfef9f26	dev_io: fix writes for unaligned buffers Actually the removed code is necessary - since not all writes are getting alligned buffer - older compilers seems to be not able to create 4K aligned buffers on stack - this the aligning code still need to be present for write path.	2018-01-23 13:36:12 +01:00
Zdenek Kabelac	6e9148e7ab	debug: drop DEBUG_MEM path Memory is not allocated so no DEBUG_MEM part is needed.	2018-01-23 11:45:18 +01:00
Alasdair G Kergon	9194610f42	device: Add ioflags parameter to transfer additional state. Flags are set on the initial I/O and passed to any callbacks that may in turn issue further I/O using the inherited flags.	2018-01-21 21:10:23 +00:00
Alasdair G Kergon	c26458339e	device: Move buffer allocation nearer to the I/O. Don't allocate memory until it's needed - later we'll add some of the I/O to an internal queue instead of issuing it immediately.	2018-01-16 01:12:08 +00:00
Alasdair G Kergon	081902b4c1	device: Merge _dev_read and dev_read_callback.	2018-01-16 00:41:42 +00:00
Alasdair G Kergon	b825987b2f	device: Rearrange _aligned_io().	2018-01-15 20:10:54 +00:00
Alasdair G Kergon	c90582344d	device: Add reason to devbuf.	2018-01-15 19:38:18 +00:00
Alasdair G Kergon	1f01eaa612	device: Store offset to data instead of pointer. We want to save the relative offset before we've allocated the buffer's memory.	2018-01-15 19:32:59 +00:00
Alasdair G Kergon	61d3296f2a	device: Reorder device.h before change.	2018-01-15 19:24:01 +00:00
Alasdair G Kergon	6210c1ec28	device: Mark read-only device buffers const.	2018-01-10 19:57:10 +00:00
Alasdair G Kergon	c350f96c09	device: Eliminate unnecessary buffer from dev_read.	2018-01-10 18:48:01 +00:00
Alasdair G Kergon	366493a1d1	device: Suppress repeated reads of the same data. If the data being requested is present in last_[extra_]devbuf, return that directly instead of reading it from disk again. Typical LVM2 access patterns request data within two adjacent 4k blocks so we eliminate some read() system calls by always reading at least 8k.	2018-01-10 15:52:03 +00:00
Alasdair G Kergon	dcb2a5a611	device: Remove some data copying between buffers. Callers that read larger amounts of data now get a pointer to read-only data directly without copying it through an intermediate buffer. This data is owned by the device layer so the callers no longer free it.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	4d568b709c	device: Free cached device bufs when metadata invalid or dev closed.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	bd0967a4b1	device: Keep the last data buffer read off each device. If there's a second metadata area on device, we record that separately. Note that the memory requirements aren't restricted yet.	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	ea96381534	libdm: Introduce dm_malloc_aligned	2018-01-10 15:48:03 +00:00
Alasdair G Kergon	f4675af4cf	format_text: Use vgsummary callbacks	2018-01-09 03:14:30 +00:00
Alasdair G Kergon	5e7d3ad749	device: Introduce dev_read_callback If it obtains the data, it passes it into the supplied callback function and returns 1. Otherwise the callback receives failed = 1. Updated config_file_read_fd to use this and similarly return the data via a callback fn of its own.	2018-01-06 02:40:12 +00:00
Alasdair G Kergon	946f07af3e	metadata: Use a consistent format for callback fn parameters	2018-01-05 14:24:56 +00:00
Alasdair G Kergon	22b6c482ec	config: Split config buffer processing into new fn. Wrap its parameters into struct process_config_file_params allocated from a mempool now passed into the config_file_read* fns.	2018-01-02 21:10:46 +00:00
Alasdair G Kergon	17649d4ac8	device: Move dev_read memory allocation into device layer. Rename dev_read() to dev_read_buf() - the function that reads data into a supplied buffer. Introduce a new dev_read() that allocates the buffer it returns and switch the important users over to this. No caller may change the returned data. (For now, callers are responsible for freeing it after use, but later the device layer will take full ownership.) dev_read_buf() should only be used for tiny buffers or unimportant code (such as the old disk formats).	2017-12-19 01:31:50 +00:00
Alasdair G Kergon	5f45cb90a7	format_text: Transfer circular buf alloc to device layer. Instead of the caller passing dev_read_circular() a buffer to fill with data, the device layer itself now allocates it.	2017-12-15 22:34:26 +00:00
Alasdair G Kergon	beee9940a5	format_text: Separate out code paths for buffer wraparound The creation of wrapped around metadata - where the start of metadata is written up to the end of the buffer and the remainder follows back at the start of the buffer - is now restricted to cases where writing the metadata in one piece wouldn't fit. This shouldn't happen in 'normal' usage so let's begin treating the code for this as a special case that can be ignored when optimising 'normal' cases.	2017-12-15 21:12:19 +00:00
Alasdair G Kergon	e932c5da50	device: Fix an unpaired device close. dev_open_flags contains an unpaired dev_close_immediate so increment open_count before calling it.	2017-12-12 17:56:58 +00:00
Alasdair G Kergon	c5ef76bf27	device: Internal error if writing 0 bytes to dev.	2017-12-12 12:57:25 +00:00
Alasdair G Kergon	d591d04103	device: Tag I/O for each mda on a device separately in log messages. Mark the first metadata area on each text format PV as MDA_PRIMARY. Pass this information down to the device layer so that when there are two metadata areas on a block device, we can easily distinguish two independent streams of I/O.	2017-12-07 03:48:11 +00:00
Alasdair G Kergon	7195df5aca	device: Skip read-modify-write if replacing whole block.	2017-12-05 01:00:38 +00:00
Alasdair G Kergon	e4805e4883	device: categorise block i/o Introduce enum dev_io_reason to categorise block device I/O in debug messages so it's obvious what it is for. DEV_IO_SIGNATURES /* Scanning device signatures / DEV_IO_LABEL / LVM PV disk label / DEV_IO_MDA_HEADER / Text format metadata area header / DEV_IO_MDA_CONTENT / Text format metadata area content / DEV_IO_FMT1 / Original LVM1 metadata format / DEV_IO_POOL / Pool metadata format / DEV_IO_LV / Content written to an LV / DEV_IO_LOG / Logging messages */	2017-12-04 23:45:26 +00:00
Alasdair G Kergon	115e66e9be	device: log debug when I/O bounce buffer used	2017-11-16 19:16:10 +00:00
Alasdair G Kergon	02e9876665	log: Add io debug class	2017-11-15 01:02:15 +00:00
Alasdair G Kergon	6bf0f04ae2	log: Improve various device-related messages - Use 'lvmcache' consistently instead of 'metadata cache' - Always use 5 characters for source line number - Remember to convert uuids into printable form - Use <no name> rather than (null) when VG has no name.	2017-11-13 19:45:33 +00:00
Zdenek Kabelac	e9206fb93d	devcache: track more udev errors Add a bit more details for failing udev function.	2017-10-30 13:16:50 +01:00
Alasdair G Kergon	f1cc5b12fd	tidy: Add missing underscores to statics.	2017-10-18 15:58:13 +01:00
Alasdair G Kergon	146745ad88	device: Separate errors for dev not found and filtered. Replaced the confusing device error message "not found (or ignored by filtering)" by either "not found" or "excluded by a filter". (Later we should be able to say which filter.) Left the the liblvm code paths alone.	2017-10-17 02:12:41 +01:00
Zdenek Kabelac	48ce8c7a49	tidy: drop unneeded cast Avoid casting to the same type.	2017-07-20 11:20:44 +02:00
Zdenek Kabelac	0bf836aa14	tidy: prefer not using else after return clang-tidy: avoid using 'else' after return - give more readable code, and also saves indention level.	2017-07-20 11:18:29 +02:00
Zdenek Kabelac	0d0a3397c2	cleanup: add braces in macro	2017-07-20 11:18:29 +02:00
Zdenek Kabelac	767a5e1281	dev-cache: avoid hashing same data again Before hashing device again with path, check if it's not already hashed. TODO: maybe bigger chunk of executed code might be actually skipped.	2017-07-17 12:33:17 +02:00
Alasdair G Kergon	6a20b22151	devices: Recognise Veritas Dynamic Multipathing VxDMP doesn't interact very well with udev so always set devices/obtain_device_list_from_udev = 0 in lvm.conf on these systems.	2017-01-10 22:23:23 +00:00
Zdenek Kabelac	1d58074d9f	debug: more stacktrace corrections Continue previous patch dropping some unneeded stack traces after printed log_error/warn messages.	2016-11-25 14:58:28 +01:00
Peter Rajnoha	c8a14a29cd	dev-type: check for DEVLINKS udev db variable existence if udev_device_get_is_initialized fn is not present Older udev versions (udev < v165), don't have the official udev_device_get_is_initialized function available to query for device initialization state in udev database. Also, devices don't have USEC_INITIALIZED udev db variable set - this is bound to the udev_device_get_is_initialized fn functionality. In this case, check for "DEVLINKS" variable instead - all block devices have at least one symlink set for the node (the "/dev/block/<major:minor>". This symlink is set by default basic udev rules provided by udev directly. We'll use this as an alternative for the check that initial udev processing for a device has already finished.	2016-09-06 13:21:29 +02:00
Peter Rajnoha	d7b282c601	dev-type: use more appropriate messages in udev_dev_is_mpath_component and use 10s timeout	2016-09-05 14:37:13 +02:00
Peter Rajnoha	5d323c37f3	refactor: move and rename _dev_is_mpath_component in lvmetad.c to udev_dev_is_mpath_component in dev-type.c	2016-09-05 12:55:25 +02:00
Zdenek Kabelac	d37a26b680	devices: handle partscan loop devices Treat loop device created with 'losetup -P' as regular partitioned device - so if it has partition table, prevent its usage in commands like 'pvcreate'. Before 'pvcreate /dev/loop0' could have erased and formated as PV, after this patch, device is filtered out and cannot be used.	2016-06-01 17:37:47 +02:00
Alasdair G Kergon	b5314c2a6a	device: Retry open without O_NOATIME if it fails.	2016-05-12 01:05:52 +01:00
Zdenek Kabelac	447daa9179	coverity: use wider type for whole expression Coverity likes when the types are same through the whole expression. And since dev_t is 64b - widen int type early.	2016-04-22 01:12:34 +02:00
Zdenek Kabelac	cbf99be43a	coverity: fix memory access Commit `52e0d0db44` introduced regression as code may access buf[0 - 1]. Reorder code to first remove '\n' and then check buffer size for empty.	2016-04-22 01:11:57 +02:00
Zdenek Kabelac	556eba1835	cleanup: use kdev_t header in lvm tree Reuse libdm header in lvm so we have single definition of MAJOR/MINOR/MKDEV macros in use.	2016-04-22 00:23:28 +02:00
Zdenek Kabelac	0f7975cb35	cleanup: avoid declaring var in the middle of code Easier to read code.	2016-04-12 11:47:51 +02:00
Zdenek Kabelac	fe65a86cbc	devcache: do not insert devices without device node When not obtaining device from udev, we are doing deep devdir scan, and at the same time we try to insert everything what /sys/dev/block knows about. However in case lvm2 is configured to use nonstardard devdir this way it will see (and scan) devices from a real system. lvm2 test suite is using its own test devdir with its own device nodes. To avoid touching real /dev devices, validate the device node exist in give dir and do not insert such device into a cache. With obtain list from udev this patch has no effect (the normal user path).	2016-04-11 10:33:14 +02:00
Zdenek Kabelac	07a60b59f7	devcache: index devices also without udev We have _insert_dirs() for udev and non-udev compilation. Compiling without udev missed to call dev_cache_index_devs(). Move the call after _insert_dirs() call so both compilation gets it.	2016-04-11 10:32:19 +02:00
Peter Rajnoha	42f04a0f77	dev-cache: skip VGID/LVID indexing if /sys/dev/block is not present /sys/dev/block is available since kernel version 2.2.26 (~ 2008): https://www.kernel.org/doc/Documentation/ABI/testing/sysfs-dev The VGID/LVID indexing code relies on this feature so skip indexing if it's not available to avoid error messages about inability to open /sys/dev/block directory. We're not going to provide fallback code to read the /sys/block/ instead in this case as that's not that efficient - it needs extra reads for getting major:minor and reading partitions would also pose further reads and that's not worth it.	2016-04-01 17:09:15 +02:00
Peter Rajnoha	15d1824fac	dev-cache: iterate devices in sysfs for VGID/LVID index if obtain_device_list_from_udev=0 If obtain_device_list_from_udev=0, LVM can make use of persistent .cache file. This cache file contains only devices which underwent filters in previous LVM command run. But we need to iterate over all block devices to create the VGID/LVID index completely for the device mismatch check to be complete as well. This patch iterates over block devices found in sysfs to generate the VGID/LVID index in dev cache if obtain_device_list_from_udev=0 (if obtain_device_list_from_udev=1, we always read complete list of block devices from udev and we ignore .cache file so we don't need to look in sysfs for the complete list).	2016-04-01 14:49:39 +02:00
Peter Rajnoha	7ed5a65ee5	dev-cache: also add dev name for device found in sysfs only For the case when we print device name associated with struct device that was not found in /dev, but in sysfs, for example when printing devices where LV device mismatch is found.	2016-04-01 14:48:56 +02:00
Peter Rajnoha	91d32f9d1b	refactor: dev-cache: move code adding sysfs-only device into separate fn	2016-04-01 11:47:06 +02:00
Peter Rajnoha	0e774d5ae7	refactor: dev-cache: use btree instead of hash table for sysfs-only devices major:minor btree is more convenient and more suitable than dev name hash table here.	2016-04-01 11:42:25 +02:00
Peter Rajnoha	9a086a6607	dev-cache: fix check for already indexed dev in _index_dev_by_vgid_and_lvid	2016-03-30 15:57:57 +02:00
Peter Rajnoha	8b258a005b	dev-cache: dev_cache_index_devs fn is available unconditionally The new dev_cache_index_devs fn was under ifdef UDEV_SYNC_SUPPORT by mistake, move it out of this ifdef.	2016-03-30 13:06:20 +02:00
Peter Rajnoha	52e0d0db44	dev-cache: remove spurious error msg if no value found in /sys/dev/block/<major>:<minor>/dm/uuid during dev scan It's correct to have a DM device that has no DM UUID assigned so no need to issue error message in this case. Also, if the device doesn't have DM UUID, it's also clear it's not an LVM LV (...when looking for VGID/LVID while creating VGID/LVID indices in dev cache). For example: $ dmsetup create test --table "0 1 linear /dev/sda 0" And there's no PV in the system. Before this patch (spurious error message issued): $ pvs _get_sysfs_value: /sys/dev/block/253:2/dm/uuid: no value With this patch applied (no spurious error message): $ pvs	2016-03-30 11:30:09 +02:00
Peter Rajnoha	8c27c52749	dev-cache: also index VGIDs and LVIDs if using persistent .cache file If we're using persistent .cache file, we're reading this file instead of traversing the /dev content. Fix missing indexing by VGID and LVID here - hook this into persistent_filter_load where we populate device cache from persistent .cache file instead of scanning /dev. For example, inducing situation in which we warn about different device actually used than what LVM thinks should be used based on metadata: $ lsblk -s /dev/vg/lvol0 NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT vg-lvol0 253:4 0 124M 0 lvm `-loop1 7:1 0 128M 0 loop $ lvmconfig --type diff global { use_lvmetad=0 } devices { obtain_device_list_from_udev=0 } (obtain_device_list_from_udev=0 also means the persistent .cache file is used) Before this patch - pvs is fine as it does the dev scan, but lvs relies on persistent .cache file and it misses the VGID/LVID indices to check and warn about incorrect devices used: $ pvs Found duplicate PV B9gXTHkIdEIiMVwcOoT2LX3Ywh4YIHgR: using /dev/loop0 not /dev/loop1 Using duplicate PV /dev/loop0 without holders, ignoring /dev/loop1 WARNING: Device mismatch detected for vg/lvol0 which is accessing /dev/loop1 instead of /dev/loop0. PV VG Fmt Attr PSize PFree /dev/loop0 vg lvm2 a-- 124.00m 0 $ lvs Found duplicate PV B9gXTHkIdEIiMVwcOoT2LX3Ywh4YIHgR: using /dev/loop0 not /dev/loop1 Using duplicate PV /dev/loop0 without holders, ignoring /dev/loop1 LV VG Attr LSize lvol0 vg -wi-a----- 124.00m With this patch applied - both pvs and lvs is fine - the indices are always created correctly (lvs just an example here, other LVM commands that rely on persistent .cache file are fixed with this patch too): $ pvs Found duplicate PV B9gXTHkIdEIiMVwcOoT2LX3Ywh4YIHgR: using /dev/loop0 not /dev/loop1 Using duplicate PV /dev/loop0 without holders, ignoring /dev/loop1 WARNING: Device mismatch detected for vg/lvol0 which is accessing /dev/loop1 instead of /dev/loop0. PV VG Fmt Attr PSize PFree /dev/loop0 vg lvm2 a-- 124.00m 0 $ lvs Found duplicate PV B9gXTHkIdEIiMVwcOoT2LX3Ywh4YIHgR: using /dev/loop0 not /dev/loop1 Using duplicate PV /dev/loop0 without holders, ignoring /dev/loop1 WARNING: Device mismatch detected for vg/lvol0 which is accessing /dev/loop1 instead of /dev/loop0. LV VG Attr LSize lvol0 vg -wi-a----- 124.00m	2016-03-30 11:00:01 +02:00
Peter Rajnoha	91bb202ded	dev-cache: handle situation where device is referenced in sysfs, but the node is not yet in dev dir It's possible that while a device is already referenced in sysfs, the node is not yet in /dev directory. This may happen in some rare cases right after LVs get created - we sync with udev (or alternatively we create /dev content ourselves) while VG lock is held. However, dev scan is done without VG lock so devices may already be in sysfs, but /dev may not be updated yet if we call LVM command right after LV creation (so the fact that fs_unlock is done within VG lock is not usable here much). This is not a problem with devtmpfs as there's at least kernel name for device in /dev as soon as the sysfs item exists, but we still support environments without devtmpfs or where different directory for dev nodes is used (e.g. our test suite). This patch covers these situations by tracking such devices in _cache.sysfs_only_names helper hash for the vgid/lvid check to work still. This also resolves commit `6129d2e64d` which was then reverted by commit `109b7e2095` due to performance issues it may have brought (...and it didn't resolve the problem fully anyway).	2016-03-30 10:56:46 +02:00
Peter Rajnoha	94f78e0183	coverity: fix some issues reported by coverity for recent code	2016-03-22 16:03:55 +01:00
Peter Rajnoha	ed002ed22a	dev: also count with suffixes in UUID for LVs when constructing VGID and LVID index UUID for LV is either "LVM-<vg_uuid><lv_uuid>" or "LVM-<vg_uuid><lv_uuid>-<suffix>". The code before just checked the length of the UUID based on the first template, not the variant with suffix - so LVs with this suffix were not processed properly. For example a thin pool LV (as an example of an LV that contains sub LVs where UUIDs have suffixes): [0] fedora/~ # lsblk -s /dev/vg/lvol1 NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT vg-lvol1 253:8 0 4M 0 lvm `-vg-pool-tpool 253:6 0 116M 0 lvm \|-vg-pool_tmeta 253:2 0 4M 0 lvm \| `-sda 8:0 0 128M 0 disk `-vg-pool_tdata 253:3 0 116M 0 lvm `-sda 8:0 0 128M 0 disk Before this patch (spurious warning message about device mismatch): [0] fedora/~ # pvs WARNING: Device mismatch detected for vg/lvol1 which is accessing /dev/mapper/vg-pool-tpool instead of (null). PV VG Fmt Attr PSize PFree /dev/sda vg lvm2 a-- 124.00m 0 With this patch applied (no spurious warning message about device mismatch): [0] fedora/~ # pvs PV VG Fmt Attr PSize PFree /dev/sda vg lvm2 a-- 124.00m 0	2016-03-22 10:52:24 +01:00
Peter Rajnoha	2a47f0957f	dev: also check for blank sysfs value containing only '\n'	2016-03-22 09:29:24 +01:00
Peter Rajnoha	8fad9b9e5d	dev: be safer when reading sysfs properties Check if the value we read from sysfs is not blank and replace the '\n' at the end only when needed ('\n' should usually be there for sysfs values, but better check this).	2016-03-21 15:50:32 +01:00
Peter Rajnoha	03b0a78640	dev: detect mismatch between devices used and devices assumed for an LV It's possible for an LVM LV to use a device during activation which then differs from device which LVM assumes based on metadata later on. For example, such device mismatch can occur if LVM doesn't have complete view of devices during activation or if filters are misbehaving or they're incorrectly set during activation. This patch adds code that can detect this mismatch by creating VG UUID and LV UUID index while scanning devices for device cache. The VG UUID index maps VG UUID to a device list. Each device in the list has a device layered above as a holder which is an LVM LV device and for which we know the VG UUID (and similarly for LV UUID index). We can acquire VG and LV UUID by reading /sys/block/<dm_dev_name>/dm/uuid. So these indices represent the actual state of PV device use in the system by LVs and then we compare that to what LVM assumes based on metadata. For example: [0] fedora/~ # lsblk /dev/sdq /dev/sdr /dev/sds /dev/sdt NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sdq 65:0 0 104M 0 disk \|-vg-lvol0 253:2 0 200M 0 lvm `-mpath_dev1 253:3 0 104M 0 mpath sdr 65:16 0 104M 0 disk `-mpath_dev1 253:3 0 104M 0 mpath sds 65:32 0 104M 0 disk \|-vg-lvol0 253:2 0 200M 0 lvm `-mpath_dev2 253:4 0 104M 0 mpath sdt 65:48 0 104M 0 disk `-mpath_dev2 253:4 0 104M 0 mpath In this case the vg-lvol0 is mapped onto sdq and sds becauset this is what was available and seen during activation. Then later on, sdr and sdt appeared and mpath devices were created out of sdq+sdr (mpath_dev1) and sds+sdt (mpath_dev2). Now, LVM assumes (correctly) that mpath_dev1 and mpath_dev2 are the PVs that should be used, not the mpath components (sdq/sdr, sds/sdt). [0] fedora/~ # pvs Found duplicate PV xSUix1GJ2SK82ACFuKzFLAQi8xMfFxnO: using /dev/mapper/mpath_dev1 not /dev/sdq Using duplicate PV /dev/mapper/mpath_dev1 from subsystem DM, replacing /dev/sdq Found duplicate PV MvHyMVabtSqr33AbkUrobq1LjP8oiTRm: using /dev/mapper/mpath_dev2 not /dev/sds Using duplicate PV /dev/mapper/mpath_dev2 from subsystem DM, ignoring /dev/sds WARNING: Device mismatch detected for vg/lvol0 which is accessing /dev/sdq, /dev/sds instead of /dev/mapper/mpath_dev1, /dev/mapper/mpath_dev2. PV VG Fmt Attr PSize PFree /dev/mapper/mpath_dev1 vg lvm2 a-- 100.00m 0 /dev/mapper/mpath_dev2 vg lvm2 a-- 100.00m 0	2016-03-21 11:40:40 +01:00
Peter Rajnoha	65d9f742f8	device: add DEV_OPEN_FAILURE flag DEV_OPEN_FAILURE flag is set if the most recent "open" for a device failed and it's unset if any subsequent "open" succeeds.	2016-03-21 11:06:05 +01:00
Alasdair G Kergon	2159a1429d	pre-release	2016-03-11 00:19:16 +00:00
Zdenek Kabelac	5aade9c402	topology: handle reported sizes smaller then sector Recent kernel (4.4) start to report values smaller then sector size (but in reporting size for SSD which support data zeroing on discard). For now log warning and assume it really means 1 sector. Addressing RHBZ: https://bugzilla.redhat.com/show_bug.cgi?id=1313377	2016-03-10 18:38:54 +01:00
David Teigland	57cd94b9e3	pvs: replace 'unknown device' with [unknown] A config setting can restore the old string.	2016-03-01 11:12:03 -06:00
Zdenek Kabelac	9267e0c5e7	cleanup: use braces around macro params	2016-02-23 21:40:17 +01:00
Zdenek Kabelac	c0b836e316	gcc: logical-op warning go away Don't be too much inventive and shutdown gcc6 warning: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69602	2016-02-23 14:41:24 +01:00
Peter Rajnoha	ec43f55445	filters: partitioned: fix partition table filter with external_device_info_source="udev" and blkid<2.20 Non-dm devices have ID_PART_TABLE_TYPE variable exported in udev db from blkid scan for both whole devices and partitions. We used ID_PART_ENTRY_DISK in addition to decide whether this is the whole device or partition and then we filtered out only whole devices where the partition table really is. However, ID_PART_ENTRY_DISK was added in blkid 2.20 so we need to use a different set of variables to decide on whole devices and partitions on systems where older blkid is still used. Now, we use ID_PART_TABLE_TYPE to detect that there's something related to partitioning with this device and we use DEVTYPE variable instead to decide between whole device (DEVTYPE="disk") and partition (DEVTYPE="partition"). For dm devices it's simpler, we have ID_PART_TABLE_TYPE variable\ set in udev db for whole devices. It's not set for partitions, hence we don't need more variable in addition to make the decision on whole device vs. partition (dm devices do not have regular partitions, hence DEVTYPE can't be used anyway, it's always set to "disk" for whole disks and partitions).	2016-02-02 13:28:11 +01:00
Peter Rajnoha	d090d6574e	device: also cache device size Add "size" and "size_seqno" to struct device to cache device's size and also to control its lifetime - the cached value is valid as long as the global _dev_size_seqno is equal to the device's size_seqno, otherwise we need to get the size again and cache the new value. This patch also adds new dev_size_seqno_inc() fn for the appropriate parts of the code to increment current global value of _dev_size_seqno and hence to cause all currently cached values for device sizes to be invalidated. The device size is now cached because we're planning to reuse this information for further checks and we want to avoid checking it more than necessary to save resources.	2016-01-22 14:13:34 +01:00
Zdenek Kabelac	fcbef05aae	doc: change fsf address Hmm rpmlint suggest fsf is using a different address these days, so lets keep it up-to-date	2016-01-21 12:11:37 +01:00
David Teigland	92e1422707	process_each_pv: do full scan earlier to find new devices Before commit `c1f246fedf`, _get_all_devices() did a full device scan before get_vgnameids() was called. The full scan in _get_all_devices() is from calling dev_iter_create(f, 1). The '1' arg forces a full scan. By doing a full scan in _get_all_devices(), new devices were added to dev-cache before get_vgnameids() began scanning labels. So, labels would be read from new devices. (e.g. by the first 'pvs' command after the new device appeared.) After that commit, _get_all_devices() was called after get_vgnameids() was finished scanning labels. So, new devices would be missed while scanning labels. When _get_all_devices() saw the new devices (after labels were scanned), those devices were added to the .cache file. This meant that the second 'pvs' command would see the devices because they would be in .cache. Now, the full device scan is factored out of _get_all_devices() and called by itself at the start of the command so that new devices will be known before get_vgnameids() scans labels.	2015-12-14 10:02:29 -06:00
Zdenek Kabelac	dccbc3b621	cleanup: simplify dev_cache_exit Just set whole _cache struct into unitialized state just like with lib init start usage. Lists are initialized with dev_cache_init().	2015-11-16 01:16:11 +01:00
Zdenek Kabelac	5a4676fea9	cleanup: add _free on error path Just like with failing allocation above also _free(dev). TODO: rework this to always use mempool and drop unneeded comlexity we have in this function.	2015-11-16 01:16:11 +01:00
Peter Rajnoha	b8779e706e	configure: check for udev_device_get_is_initialized is available The udev_device_get_is_initialized is available since libudev version 165. Older versions are still used somewhere (e.g. RHEL6). So better check for this fn and use it only if it's available.	2015-11-11 15:15:50 +01:00
Peter Rajnoha	f82e0210b7	dev-ext: issue error if external_device_info_source=udev and udev db record incomplete Udev db records are marked as not initialized (incomplete) on timeout. Issue an error message whenever LVM finds such records so users are aware that something's going wrong with udev db. This is important in case we use devices/external_device_info_source="udev" where udev database records are used to do various filtering decisions. For example: udev log of timed out worker: Nov 11 13:02:25 raw.virt systemd-udevd[607]: seq 1997 '/devices/virtual/block/dm-2' is taking a long time Nov 11 13:04:25 raw.virt systemd-udevd[607]: seq 1997 '/devices/virtual/block/dm-2' killed Nov 11 13:04:25 raw.virt systemd-udevd[607]: worker [11221] terminated by signal 9 (Killed) Nov 11 13:04:25 raw.virt systemd-udevd[607]: worker [11221] failed while handling '/devices/virtual/block/dm-2' ... LVM also issues error message visibly if incomplete udev db record is found, devices/external_device_info_source="udev" is set: $ pvs Udev database has incomplete information about device /dev/dm-2. Failed to get external handle for device /dev/dm-2 [udev]. ...	2015-11-11 13:14:07 +01:00
Zdenek Kabelac	e262d5e596	cleanup: keep using enum typedef Using enum instead of unsigned.	2015-11-09 10:22:52 +01:00
Zdenek Kabelac	fa1d730847	dev-type: fix TOCTOU order Doing 'stat' checking first and later opening is racy. And since we do not really care about any 'status' info here and we read 'sysfs' here - just drop whole 'stat()' call and directly handle error from failing 'fopen()'.	2015-11-09 10:19:20 +01:00
Alasdair G Kergon	65ec00ce20	device: Tidy DASD CDL format detection code.	2015-10-27 15:27:52 +00:00
Lidong Zhong	729f489009	pvcreate: don't support unpartitioned DASD devices with CDL formatted The former patch(`dab3ebce4c`) is a little bit strict. For example, it is OK to create PV on unpartitioned DASD devices with LDL formatted. So after lvm version containing the patch, LVs created on those devices could not be found. Signed-off-by: Lidong Zhong <lzhong@suse.com>	2015-10-27 11:42:47 +01:00
Peter Rajnoha	5ac81657e5	wiping: make libblkid detect all copies of the same signature if use_blkid_wiping=1 Some signatures are spread around the disk in several copies, mainly for backup. Make libblkid to detect these extra copies - there was missing "blkid_probe_step_back" fn call after successful wipe of previous signature copy. An example with FAT table which has copies: $ mkfs.vfat /dev/sda1 Before this patch: $ pvcreate /dev/sda1 WARNING: vfat signature detected on /dev/sda1 at offset 54. Wipe it? [y/n]: y Wiping vfat signature on /dev/sda1. Physical volume "/dev/sda1" successfully created With this patch applied: $ pvcreate /dev/sda1 WARNING: vfat signature detected on /dev/sda1 at offset 54. Wipe it? [y/n]: y Wiping vfat signature on /dev/sda1. WARNING: vfat signature detected on /dev/sda1 at offset 0. Wipe it? [y/n]: y Wiping vfat signature on /dev/sda1. WARNING: vfat signature detected on /dev/sda1 at offset 510. Wipe it? [y/n]: y Wiping vfat signature on /dev/sda1. Physical volume "/dev/sda1" successfully created	2015-10-13 12:22:09 +02:00
Peter Rajnoha	2081071bee	wiping: warn if use_blkid_wiping=1 is set and LVM not compiled with blkid_wiping support	2015-09-22 11:11:26 +02:00
Peter Rajnoha	55c13f3de4	dev-cache: fix use of uninitialized device status if reading outdated .cache record As part of fix that came with `cf700151eb`, I forgot to add the check whether the result of stat was successful or not. This bug caused uninitialized buffer to be used for entries from .cache file which are no longer valid. This bug may have caused these uninitialized values to be used further, for example (see the unreal (2567,590944) representing major:minor pair): $ pvs /dev/abc: stat failed: No such file or directory Path /dev/abc no longer valid for device(2567,590944) PV VG Fmt Attr PSize PFree /dev/mapper/test lvm2 --- 104.00m 104.00m /dev/vda2 rhel lvm2 a-- 9.51g 0	2015-09-04 18:00:29 +02:00
Peter Rajnoha	d1d00fdeec	dev-cache: append (major:minor) to debug messages about adding device or its alias to cache device/dev-cache.c:350 /dev/sda: Added to device cache (8:0) device/dev-cache.c:346 /dev/disk/by-id/lvm-pv-uuid-5nPovF-EWp4-vBwd-ylCJ-9Y0B-yzHQ-ek1li2: Aliased to /dev/sda in device cache (8:0) ...	2015-09-03 14:36:15 +02:00
Alasdair G Kergon	623b46a17d	device: Don't try to close config file on failure. $file: open failed: Permission denied Failed to load config file $file Attempt to close device '$file' which is not open.	2015-08-17 12:57:01 +01:00
Peter Rajnoha	cf700151eb	cache: fix regression causing some PVs to bypass filters This is a regression introduced by commit `6c0e44d5a2` which changed the way dev_cache_get fn works - before this patch, when a device was not found, it fired a full rescan to correct the cache. However, the change coming with that commit missed this full_rescan call, causing the lvmcache to still contain info about PVs which should be filtered now. Such situation may have happened by coincidence of using old persistent cache (/etc/lvm/cache/.cache) which does not reflect the actual state anymore, a device name/symlink which now points to a device which should be filtered and a fact we keep info about usable DM devices in .cache no matter what the filter setting is. This bug could be hidden though by changes introduced in commit `f1a000a477` as it calls full_rescan earlier before this problem is hit. But we need to fix this anyway for the dev_cache_get to be correct if we happen to use the same code path again somewhere sometime. For example, simple reproducer was (before commit 1a000a477558e157532d5f2cd2f9c9139d4f87c): - /dev/sda contains a PV header with UUID y5PzRD-RBAv-7sBx-V3SP-vDmy-DeSq-GUh65M - lvm.conf: filter = [ "r\|.\|" ] - rm -f .cache (to start with clean state) - dmsetup create test --table "0 8388608 linear /dev/sda 0" (8388608 is just the size of the /dev/sda device I use in the reproducer) - pvs (this will create .cache file which contains "/dev/disk/by-id/lvm-pv-uuid-y5PzRD-RBAv-7sBx-V3SP-vDmy-DeSq-GUh65M" as well as "/dev/mapper/test" and the target node "/dev/dm-1" - all the usable DM mappings (and their symlinks) get into the .cache file even though the filter "is set to "ignore all" - we do this - so far it's OK) - dmsetup remove test (so we end up with /dev/disk/by-id/lvm-pv-uuid-... pointing to the /dev/sda now since it's the underlying device containing the actual PV header) - now calling "pvs" with such .cache file and we get: $ pvs PV VG Fmt Attr PSize PFree /dev/disk/by-id/lvm-pv-uuid-y5PzRD-RBAv-7sBx-V3SP-vDmy-DeSq-GUh65M vg lvm2 a-- 4.00g 0 Even though we have set filter = [ "r\|.\|" ] in the lvm.conf file!	2015-07-29 10:19:12 +02:00
Peter Rajnoha	c3fddb0fbb	wiping: add "Wiping skipped." for the message context to be complete	2015-07-21 11:00:43 +02:00
Peter Rajnoha	697fb353dc	wiping: log_warn instead of log_error if blkid wipe ignored for a signature Comply with the rules we have for log_error and log_warn... $ pvcreate /dev/sda1 Failed to get offset of the xfs_external_log signature on /dev/sda1. 1 existing signature left on the device. Aborting pvcreate on /dev/sda1. $ pvcreate /dev/sda1 --force WARNING: Failed to get offset of the xfs_external_log signature on /dev/sda1. Physical volume "/dev/sda1" successfully created	2015-07-21 10:34:04 +02:00

... 3 4 5 6 7 ...

724 Commits