samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-23 17:34:34 +03:00

Author	SHA1	Message	Date
Martin Schwenke	6fe6a54e7f	ctdb-client: Add client code for disable/enable controls BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	15a6489c28	ctdb_daemon: Implement controls DISABLE_NODE/ENABLE_NODE BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	60c1ef1465	ctdb-daemon: Start as disabled means PERMANENTLY_DISABLED DISABLED is UNHEALTHY \| PERMANENTLY_DISABLED, which is not what is intended here. Luckily, it doesn't do any harm because nodes are marked unhealthy at startup anyway. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	1ac7bc7532	ctdb-daemon: Factor out a function to get node structure from PNN BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	e0a7b5a9e8	ctdb-daemon: Add a helper variable Simplifies a subsequent change. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	6845dca87e	ctdb-protocol: Add marshalling for controls DISABLE_NODE/ENABLE_NODE BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	49dc5d8cd2	ctdb-protocol: Add new controls to disable and enable nodes These are CTDB_CONTROL_DISABLE_NODE and CTDB_CONTROL_ENABLE_NODE. For consistency these match CTDB_CONTROL_STOP_NODE and CTDB_CONTROL_CONTINUE_NODE. It would be possible to add a single control but it would need to take data. The aim is to finally fix races in flag handling. Previous fixes have improved the situation but they have only narrowed the race window. The problem is that the recovery daemon on the master node pushes flags to nodes the same way that disable and enable are implemented. So the following sequence is still racy: 1. Node A is disabled 2. Recovery master pulls flags from all nodes including A 3. Node A is enabled 4. Recovery master notices A is disabled and pushes a flag update to all nodes including node A 5. Node A is erroneously marked disabled Node A can not tell if the MODIFY_FLAGS control is from a "ctdb disable" command or a flag update from the recovery master. The solution is to use a different mechanism for disable/enable and for a node to ignore MODIFY_FLAGS controls for their own flags. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	8305f6a7f1	ctdb-recoverd: Push flags for a node if any remote node disagrees This will usually happen if flags on the node in question change, so keeping the code simple and pushing to all nodes won't hurt. When all nodes come up there might be differences in connected nodes, causing such "fix ups". Receiving nodes will ignore no-op pushes. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	620d078714	ctdb-recoverd: Update the local node map before pushing out flags The resulting code structure looks a little weird. However, there is another condition that requires the flags to be pushed that will be inserted before the continue statement in a subsequent commit.. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	82a075d4d7	ctdb-recoverd: Add a helper variable Improves readability and simplifies subsequent changes. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14784 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-09-09 01:46:49 +00:00
Martin Schwenke	b724c1e6a6	utils: Avoid pylint warning pylint warns: Use lazy % formatting in logging functions Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jul 20 05:29:18 UTC 2021 on sn-devel-184	2021-07-20 05:29:18 +00:00
Martin Schwenke	319e27343d	utils: Reformat lines that are longer than 80 columns Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	98c7a38b71	utils: Tweak exception handling to stop flake8 complaining Don't bother with "as e" to avoid warning about unused variable. Don't use bare "except:" (though pylint still complains about this version). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	12d3e215a6	utils: Simplify log level logic, drop global variable Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	e323d16a9d	utils: Inline defaults and help strings Removes an unnecessary level of indirection: defaults and help strings are now where they are expected. Also removes some global variables. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	af5aecced1	utils: Move argument processing into function and call from main() Removes the need for the global variables currently associated with this processing. Also removes unnecessarily double-handling the defaults, which are assigned to the global variables and set via add_argument(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	e66637a079	utils: Reorder imports so that standard imports are first Avoids numerous pylint warnings. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	bd0b2bb6ee	utils: Clean up ctdb_etcd_lock using autopep8 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	939aed0498	utils: Use Python 3 Due to the number of flake8 and pylint warnings it is unclear if the source has Python 3 incompatibilities. These will be cleaned up in subsequent commits. Signed-off-by: "L.P.H. van Belle" <belle@bazuin.nl> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Jose A. Rivera <jarrpa@samba.org>	2021-07-20 04:43:37 +00:00
Martin Schwenke	466aa8b6f5	ctdb-scripts: Ignore ShellCheck SC3013 for test -nt In ShellCheck 0.7.2, POSIX compatibility warnings got their own SC3xxx error codes, so now both the old and new codes need to be ignored. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Jun 25 10:06:48 UTC 2021 on sn-devel-184	2021-06-25 10:06:48 +00:00
Martin Schwenke	fc0da6b0f8	ctdb-tests: Force stub version of service in eventscript tests Fedora 34 now has a shell function for the which command, which causes these uses of which to return the enclosing function definition rather than the executable file as expected. The event script unit tests always expect the stub service command to be used, so the conditional in these functions is unnecessary. $CTDB_HELPER_BINDIR already conveniently points to the stub directory, so use it here. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Amitay Isaacs <amitay@gmail.com>	2021-06-25 09:16:31 +00:00
Martin Schwenke	23b2fab2c8	ctdb-common: Drop unused include of mkdir_p.h Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-06-25 09:16:31 +00:00
Martin Schwenke	e40d452722	ctdb-daemon: Close server socket when switching to client The socket is set close-on-exec but that doesn't help for processes that do not exec(). This should be done for all child processes. This has been seen in testing where "ctdb shutdown" waits for the socket to close before succeeding. It appears that lingering vacuuming processes have not closed the socket when becoming clients so they cause "ctdb shutdown" to hang even though the main daemon process has exited. The cause of the lingering vacuuming processes has been previously examined but still isn't understood. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-06-25 09:16:31 +00:00
Martin Schwenke	f7cf8132b0	ctdb-tests: Add debug_locks.sh tests for mutexes Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri May 28 07:34:23 UTC 2021 on sn-devel-184	2021-05-28 07:34:23 +00:00
Amitay Isaacs	99c3b49260	ctdb-scripts: Add lock debugging for tdb mutex locks Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net>	2021-05-28 06:46:29 +00:00
Amitay Isaacs	cb55b68b3e	ctdb-utils: Add tdb_mutex_check utility Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2021-05-28 06:46:29 +00:00
Martin Schwenke	dd5972b699	ctdb-scripts: Simplify logic in debug_via_proc_locks() The path of the TDB is known, so calculate the file ID (device number + inode number) from it and use this to directly filter /proc/locks to find processes holding locks. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Martin Schwenke	e62ae53ef6	ctdb-scripts: Update debug_locks.sh to handle arguments Don't use the arguments yet. They will be used in a simplified version of the code. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Martin Schwenke	1dfff9751b	ctdb-scripts: Move current lock debugging to a function Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Amitay Isaacs	d07875330a	ctdb-locking: Pass additional arguments to debug locks script 1. PID of lock helper waiting for lock 2. Scope of lock: "record" or "db" 3. Path to database that lock helper is trying to lock 4. Whether the database uses mutexes: "mutex" or "fcntl" Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2021-05-28 06:46:29 +00:00
Martin Schwenke	2c7dbb043f	ctdb-tests: Add debug_locks.sh testing Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Martin Schwenke	a3e7fd9c61	ctdb-tests: Fix nonsense arguments to ps stub These were fine (though still lazy) when these tests were the only user of this stub. However, the ps stub is about to be enhanced, so fix these uses of it to represent the intended usage. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Martin Schwenke	ffb56c9143	ctdb-scripts: Avoid direct /proc access The main reason for this is to facilitate testing. Avoid some /proc accesses entirely by using ps(1) (which can be replaced by a stub when testing) because this script might as well be more portable in case anyone wants to add lock debugging for a non-Linux platform. While the "state" format specification isn't POSIX-compliant, it works on both Linux and FreeBSD so it is a reasonable improvement. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Martin Schwenke	55d4b3438f	ctdb-scripts: Factor out function dump_stacks() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2021-05-28 06:46:29 +00:00
Volker Lendecke	adef87a621	ctdb: Fix a crash in run_proc_signal_handler() If a script times out the caller can talloc_free() the script_list output of run_event_recv, which talloc_free's proc->output from run_proc.c as well. If the script generates further output after the timeout and then exits after a while, the SIGCHLD handler in the eventd tries to read into proc->output, which was already free'ed. Fix this by not doing just a talloc_steal but a talloc_move. This way proc_read_handler() called from run_proc_signal_handler() does not try to realloc the stale reference to proc->output but gets a NULL reference. I don't really know how to do a knownfail in ctdb, so this commit actually activates catching the signal by waiting long enough for 22.bar to exit and generate the SIGCHLD. Bug: https://bugzilla.samba.org/show_bug.cgi?id=14475 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org>	2021-05-18 10:42:32 +00:00
Volker Lendecke	f320d1a7ab	ctdb: Introduce output before and after the 10-second timeout This will lead to a crash in run_event_test.c soon Bug: https://bugzilla.samba.org/show_bug.cgi?id=14475 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org>	2021-05-18 10:42:32 +00:00
Volker Lendecke	19290f10c7	ctdb: Wait for SIGCHLD if script timed out Bug: https://bugzilla.samba.org/show_bug.cgi?id=14475 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org>	2021-05-18 10:42:32 +00:00
Volker Lendecke	07ab9b7a71	ctdb: Introduce a helper variable in run_event_test.c Bug: https://bugzilla.samba.org/show_bug.cgi?id=14475 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org>	2021-05-18 10:42:32 +00:00
Volker Lendecke	9398d4b912	ctdb: Call run_event_recv() in a callback function Triggers a different code path in run_event_* and aligns it more what the ctdb eventd really does. Bug: https://bugzilla.samba.org/show_bug.cgi?id=14475 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org>	2021-05-18 10:42:32 +00:00
Volker Lendecke	f188c9d732	ctdb: fix typos Bug: https://bugzilla.samba.org/show_bug.cgi?id=14475 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org>	2021-05-18 10:42:32 +00:00
Volker Lendecke	cf43f331be	lib: Make pidfile_path_create() return the existing PID on conflict Use F_GETLK to get the lock holder PID, this is more accurate than reading the file contents: A conflicting process might not have written its PID yet. Also, F_GETLK easily allows to do a retry if the lock holder just died. Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-03-16 17:09:32 +00:00
Volker Lendecke	06b740e2fb	ctdb: Fix a typo Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-03-09 22:36:28 +00:00
Martin Schwenke	6a81f43177	ctdb-tests: Actually wait for record to migrate to lmaster node This test has been failing with: Wait until record is migrated to lmaster node 0 <30\|BAD: node 0 is not dmaster dmaster: 1 rsn: 8 flags: 0x00010000 MIGRATED_WITH_DATA data(6) = "value1" *** TEST COMPLETED (RC=1) AT 2021-02-02 06:18:48, CLEANING UP... This should never happen. If this really fails then the wait should time out. The problem is that wait_until() does: "$@" \|\| _rc=$? and vacuum_test_key_dmaster() currently calls ctdb_test_fail() on failure, which causes the shell to exit. Instead, pass a variant to wait_until() that simply returns the correct status instead of exiting. An alternative would be to change the statement in wait_until() to do: ("$@") \|\| _rc=$? so it captures the exit. However, this is a global change and requires more thought. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-02-08 22:33:14 +00:00
Volker Lendecke	e593f96960	lib: Make accept_recv() return the listening socket This is helpful if you are in a listening loop with the same receiver for many sockets doing the same thing. Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-01-22 19:54:38 +00:00
Volker Lendecke	40e4958953	lib: Make accept_recv() return struct samba_sockaddr Avoid casting problems by using the samba_sockaddr union Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-01-22 19:54:38 +00:00
Volker Lendecke	6aa672a41c	ctdb: Use hex_byte() in hex_to_data() Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Ralph Boehme <slow@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-01-08 20:31:33 +00:00
Martin Schwenke	65ab8cb014	ctdb-daemon: Do not attempt to chown Unix domain socket in test mode If run with UID wrapper and UID_WRAPPER_ROOT=1 then securing the socket will fail. Test mode means that local daemons are in use, so securing the socket is not important. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Volker Lendecke <vl@samba.org>	2020-11-02 08:58:31 +00:00
Martin Schwenke	78c3b5b6a8	ctdb-daemon: Clean up call to bind socket Variable res is only used once and ret is re-used many times. Drop res, use ret, which doesn't need to be initialised. Modernise debug macro. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Volker Lendecke <vl@samba.org>	2020-11-02 08:58:31 +00:00
Martin Schwenke	9404f8631e	ctdb-daemon: Clean up socket bind/secure/listen Obey the coding style, modernise debug macros, clean up whitespace. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Volker Lendecke <vl@samba.org>	2020-11-02 08:58:31 +00:00
Amitay Isaacs	6aa396b0cd	ctdb-common: Avoid aliasing errors during code optimization When compiling with GCC 10.x and -O3 optimization, the IP checksum calculation code generates wrong checksum. The function uint16_checksum gets inlined during optimization and ip4pkt->tcp data gets wrongly aliased. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14537 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Oct 21 05:52:28 UTC 2020 on sn-devel-184	2020-10-21 05:52:28 +00:00
Martin Schwenke	b68105b8f7	ctdb-tests: Strengthen node state checking in ctdb disable/enable test Check that the desired state is set on all nodes instead of just the test node. This ensures that node flags have correctly propagated across the cluster. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14513 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Oct 6 04:32:06 UTC 2020 on sn-devel-184	2020-10-06 04:32:06 +00:00
Martin Schwenke	4b01f54041	ctdb-recoverd: Drop unnecessary and broken code update_flags() has already updated the recovery master's canonical node map, based on the flags from each remote node, and pushed out these flags to all nodes. If i == j then the node map has already been updated from this remote node's flags, so simply drop this case. Although update_flags() has updated flags for all nodes, it did not update each node map in remote_nodemaps[] to reflect this. This means that remote_nodemaps[] may contain inconsistent flags for some nodes so it should not be used to check consistency when i != j. Further, a meaningful difference in flags can only really occur if update_flags() failed. In that case this code is never reached. These observations combine to imply that this whole loop should be dropped. This leaves potential sub-second inconsistencies due to out-of-band healthy/unhealthy flag changes pushed via CTDB_SRVID_PUSH_NODE_FLAGS. These updates could be dropped (takeover run asks each node for available IPs rather than making centralised decisions based on node flags) but for now they will be fixed in the next iteration of main_loop(). BUG: https://bugzilla.samba.org/show_bug.cgi?id=14513 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-10-06 03:12:35 +00:00
Martin Schwenke	3ab52b5286	ctdb-recoverd: Drop unnecessary code This has already been done in update_flags(). BUG: https://bugzilla.samba.org/show_bug.cgi?id=14513 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-10-06 03:12:35 +00:00
David Disseldorp	68b981ee8a	ctdb/test_ceph_rados_reclock: check for service registration Signed-off-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Samuel Cabrero <scabrero@samba.org> Autobuild-User(master): David Disseldorp <ddiss@samba.org> Autobuild-Date(master): Thu Sep 24 00:52:42 UTC 2020 on sn-devel-184	2020-09-24 00:52:42 +00:00
David Disseldorp	55dbd1080d	ctdb/doc: mention ctdb_mutex_ceph_rados_helper mgr registration Signed-off-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Samuel Cabrero <scabrero@samba.org>	2020-09-23 23:29:41 +00:00
David Disseldorp	ff36cb7402	ctdb/ceph: register recovery lock holder with ceph-mgr The Ceph Manager's service map is useful for tracking the status of Ceph related services. By registering the CTDB recovery lock holder, Ceph storage administrators can more easily identify where and when a CTDB cluster is up and running. Signed-off-by: David Disseldorp <ddiss@samba.org> Reviewed-by: Samuel Cabrero <scabrero@samba.org>	2020-09-23 23:29:41 +00:00
Martin Schwenke	d98f68f918	ctdb-daemon: Drop implementation of old-style database pull/push controls Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Sep 11 06:29:32 UTC 2020 on sn-devel-184	2020-09-11 06:29:32 +00:00
Martin Schwenke	7d826731d4	ctdb-protocol: Drop marshalling functions for old-style database pull/push Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	3bbb4a8535	ctdb-protocol: Drop client functions for old-style database pull/push Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	2898695473	ctdb-client: Drop unused synchronous functions for database pull/push Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	2efce7d477	ctdb-recovery: Simplify database push function names Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	f4e2206e88	ctdb-recovery: Drop unnecessary database push wrapper Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	225a699633	ctdb-recovery: Drop passing of capabilities into database pull This is no longer necessary because the capability new style database pull is assumed to always be available. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	595c1a7c0f	ctdb-recovery: Simplify database pull function names Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	f968576642	ctdb-recovery: Remove use of old pull and push controls Removes use of the old controls without cleaning up the code. Clean up can be done later. After this change the CTDB_CAP_FRAGMENTED_CONTROLS capability is no longer checked. This capability can be removed along with the controls. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
Martin Schwenke	d9d8bf8c54	ctdb-tests: Simplify comment in large database recovery test The older style controls mentioned are being removed. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-09-11 05:06:42 +00:00
David Mulder	6f5b0fef59	ctdb: Prevent man page duplication The new waf detects a duplicate instance of ctdb_mutex_ceph_rados_helper.7.xml, which is due to manpages_extra being a pointer to manpages_misc, therefore each call to build() added duplicate entries to the manpages_misc global entry. Signed-off-by: David Mulder <dmulder@suse.com> Reviewed-by: Andrew Bartlett <abartlet@samba.org>	2020-09-11 03:43:40 +00:00
Martin Schwenke	8bb6a6607d	ctdb-recoverd: Broadcast takeover run message when verifying IPs This makes it consistent with the monitoring code. If the master has changed then this means the master will always get the message. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Aug 18 06:24:11 UTC 2020 on sn-devel-184	2020-08-18 06:24:11 +00:00
Martin Schwenke	4aa8e72d60	ctdb-recoverd: Rename update_local_flags() -> update_flags() This also updates remote flags so the name is misleading. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	702c7c4934	ctdb-recoverd: Change update_local_flags() to use already retrieved nodemaps BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	910a0b3b74	ctdb-recoverd: Get remote nodemaps earlier update_local_flags() will be changed to use these nodemaps. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	d50919b0cb	ctdb-recoverd: Do not fetch the nodemap from the recovery master The nodemap has already been fetched from the local node and is actually passed to this function. Care must be taken to avoid referencing the "remote" nodemap for the recovery master. It also isn't useful to do so, since it would be the same nodemap. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	762d1d8a96	ctdb-recoverd: Change get_remote_nodemaps() to use connected nodes The plan here is to use the nodemaps retrieved by get_remote_nodes() in update_local_flags(). This will improve efficiency, since get_remote_nodes() fetches flags from nodes in parallel. It also means that get_remote_nodes() can be used exactly once early on in main_loop() to retrieve remote nodemaps. Retrieving nodemaps multiple times is unnecessary and racy - a single monitoring iteration should not fetch flags multiple times and compare them. This introduces a temporary behaviour change but it will be of no consequence when the above changes are made. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	368c83bfe3	ctdb-recoverd: Fix node_pnn check and assignment of nodemap into array This array is indexed by the same index as nodemap, not the PNN. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	10ce0dbf1c	ctdb-recoverd: Add fail callback to assign banning credits Also drop error handling in main_loop() that is replaced by this change. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	a079ee3169	ctdb-recoverd: Add an intermediate state struct for nodemap fetching This will allow an error callback to be added. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	2eaa0af616	ctdb-recoverd: Move memory allocation into get_remote_nodemaps() BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	3324dd272c	ctdb-recoverd: Change signature of get_remote_nodemaps() Change 1st argument to a rec context, since this will be needed later. Drop the nodemap argument and access it via rec->nodemap instead. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	d2d90f2502	ctdb-recoverd: Fix a local memory leak The memory is allocated off the memory context used by the current iteration of main loop. It is freed when main loop completes the fix doesn't require backporting to stable branches. However, it is sloppy so it is worth fixing. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	52f520d39c	ctdb-recoverd: Basic cleanups for get_remote_nodemaps() Don't log an error on failure - let the caller can do this. Apart from this: fix up coding style and modernise the remaining error message. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14466 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-18 05:02:25 +00:00
Martin Schwenke	0cb61c6fb6	ctdb-doc: Link to CTDB page in wiki Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Mon Aug 17 06:13:11 UTC 2020 on sn-devel-184	2020-08-17 06:13:11 +00:00
Martin Schwenke	971c20e9dc	ctdb-tools: Drop "ctdb isnotrecmaster" command This isn't used anywhere and can easily be checked via "ctdb pnn" and "ctdb recmaster" commands. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-08-17 04:51:32 +00:00
Ralph Boehme	2327471756	lib: relicense smb_strtoul(l) under LGPLv3 Signed-off-by: Ralph Boehme <slow@samba.org> Reviewed-by: Swen Schillig <swen@linux.ibm.com> Reviewed-by: Volker Lendecke <vl@samba.org> Autobuild-User(master): Jeremy Allison <jra@samba.org> Autobuild-Date(master): Mon Aug 3 22:21:04 UTC 2020 on sn-devel-184	2020-08-03 22:21:02 +00:00
Martin Schwenke	642dc6ded6	ctdb-scripts: Use nfsconf as a last resort get nfsd thread count If nfsconf exists then use it as last resort to attempt to extract [nfsd]:threads from /etc/nfs.conf. Invocation of nfsconf requires "\|\| true" because this script uses "set -e". Add a stub that always fails to at least test this much. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14444 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Mon Jul 27 07:06:58 UTC 2020 on sn-devel-184	2020-07-27 07:06:57 +00:00
Martin Schwenke	334dd8cedd	ctdb-scripts: Use nfsconf as a last resort to set NFS_HOSTNAME If nfsconf exists then use it as last resort to attempt to extract [statd]:name from /etc/nfs.conf. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14444 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-27 05:42:31 +00:00
Martin Schwenke	f37b3cf2a6	ctdb: Change LVS to use leader/follower Instead of master/slave. Nearly all of these are simple textual substitutions, which preserve the case of the original. A couple of minor cleanups were made in the documentation (such as "LVSMASTER" -> "LVS leader"). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 08:37:31 +00:00
Martin Schwenke	16b848553d	ctdb: Change NAT gateway to use leader/follower Instead of master/slave. Nearly all of these are simple textual substitutions, which preserve the case of the original. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 08:37:31 +00:00
Martin Schwenke	5ce6133a75	ctdb-recoverd: Simplify calculation of new flags Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Jul 24 06:03:23 UTC 2020 on sn-devel-184	2020-07-24 06:03:23 +00:00
Martin Schwenke	3654e41677	ctdb-recoverd: Correctly find nodemap entry for pnn Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	9475ab0441	ctdb-recoverd: Do not retrieve nodemap from recovery master It is already in rec->nodemap. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	0c6a7db3ba	ctdb-recoverd: Flatten update_flags_on_all_nodes() The logic currently in ctdb_ctrl_modflags() will be optimised so that it no longer matches the pattern for a control function. So, remove this function and squash its functionality into the only caller. Although there are some superficial changes, the behaviour is unchanged. Flattening the 2 functions produces some seriously weird logic for setting the new flags, to the point where using ctdb_ctrl_modflags() for this purpose now looks very strange. The weirdness will be cleaned up in a subsequent commit. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	a88c10c5a9	ctdb-recoverd: Move ctdb_ctrl_modflags() to ctdb_recoverd.c This file is the only user of this function. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	b1e631ff92	ctdb-recoverd: Improve a call to update_flags_on_all_nodes() This should take a PNN, not an array index. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	915d24ac12	ctdb-recoverd: Use update_flags_on_all_nodes() This is clearer than using the MODFLAGS control directly. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	f681c0e947	ctdb-recoverd: Introduce some local variables to improve readability Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	cb3a3147b7	ctdb-recoverd: Change update_flags_on_all_nodes() to take rec argument This makes fields such as recmaster and nodemap easily available if required. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	6982fcb3e6	ctdb-recoverd: Drop unused nodemap argument from update_flags_on_all_nodes() An unused argument needlessly extends the length of function calls. A subsequent change will allow rec->nodemap to be used if necessary. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-24 04:41:25 +00:00
Martin Schwenke	484a764e83	ctdb-tests: Improve test portability/quality Avoid use of non-portable md5sum by constructing database names using index. Improve indentation, use more modern commands, code improvements (shellcheck). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Jul 22 09:14:35 UTC 2020 on sn-devel-184	2020-07-22 09:14:35 +00:00
Martin Schwenke	f4c2c77ff7	ctdb-tests: Improve test quality Simplify code, use more modern commands, code improvements (shellcheck). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	c6c81ea287	ctdb-tests: Improve test portability "wc -l" on some platforms (e.g. FreeBSD) contains leading spaces, so strip them. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	244eaad76a	ctdb-tests: Improve test quality Select test node with IPs instead of using a fixed node. Remove unnecessary code, use more modern commands, code improvements (shellcheck). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	760c3039b0	ctdb-tests: Improve test portability "wc -l" on some platforms (e.g. FreeBSD) contains leading spaces and stops "$num from being a number. Create a more portable solution and put it in a function instead of repeating the logic. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	41ff58338a	ctdb-tests: Drop uses of "onnode any ..." in testcases It would be nice to get rid of "onnode any". There's no use making tests nondeterministic. If covering different cases matters then they should be explicitly handled. In most places "any" is replaced by "$test_node". In some cases, where $test_node is not set, a fixed node that is already used elsewhere can be reused. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	ce3de39894	ctdb-tests: Don't bother shutting down daemons in ctdb_init() They'll never be up here... Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	37c26a9590	ctdb-tests: Separate custom cluster startup from test initialisation Separate cluster startup from test initialisation for tests that start the cluster with customised configuration. In these cases the result of the cluster startup is actually the point of the test. Additionally, pubips.013.failover_noop.sh claims to have completed test initialisation twice, which just seems wrong. The result is: * ctdb_test_init() takes one option (-n) to indicate when it should not configure/start the cluster * New function ctdb_nodes_start_custom() accepts options for special cluster configuration, only operates on local daemons and triggers a test failure rather than a test error on failure. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	a766136df4	ctdb-tests: Do not trigger ctdb_test_error() from ctdb_init() The only caller calls ctdb_test_error() on failure and nesting this calls can be confusing. A future change will make this even more confusing. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	a369bedf8c	ctdb-tests: Make unit.sh pass shellcheck Mostly avoidance of quoting warnings. Silencing warnings about unquoted $CTDB_TEST_CAT_RESULTS_OPTS is handled by passing '-' to cat when that variable's value is empty. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	be3065ea95	ctdb-tests: Make integration.bash pass shellcheck Apart from the non-constant sourcing of include files. Mostly avoidance of quoting warnings. One subtle change is to simply pass "120" to wait_until_ready() to stop warnings that it expects arguments but none are passed (both SC2119 and SC2120). There seems no way to indicate to structure function argument handling so that shellcheck realises arguments are optional. In later shellcheck versions, disabling SC2120 for a function also silences complaints about its callers... but not all of our testing uses "later" shellcheck versions. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:36 +00:00
Martin Schwenke	d667352805	ctdb-tests: Use "#!/usr/bin/env bash" for improved portability Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	8b24cae630	ctdb-tests: Update preamble for INTEGRATION tests * Use "#!/usr/bin/env bash" for improved portability * Drop test_info() definition and replace it with a comment The use of test_info() is pointless. * Drop call to cluster_is_healthy() This is a holdover from when the previous test would restart daemons to get things ready for a test. There was also a bug where going into recovery during the restart would sometimes cause the cluster to become unhealthy. If we really need something like this then we can add it to ctdb_test_init(). * Make order of preamble consistent Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	0f201dd67a	ctdb-tests: Drop unreachable line ctdb_test_skip() will exit. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	29a3fce28f	ctdb-tests: Redirect stderr too when checking for shellcheck Avoid: .../UNIT/shellcheck/scripts/local.sh: line 14: type: shellcheck: not found The "type" command in dash prints the "not found" message to stdout but the bash version prints to stderr, so redirect stderr too. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	1565446508	ctdb-tests: Show hung script debugging output The output in a test failure appears to contain no pstree output because "00\.test\.script,.*" does not match. However, this is just a guess because the output is not shown. Showing the output makes it easier to understand test failures. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	70c38d404b	ctdb-tests: Enable SOCKET_WRAPPER_DIR_ALLOW_ORIG This will allow local daemons to be used in more contexts, especially in tests run by Jenkins where the directory names for some targets can be very long. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	066c205e5f	ctdb-build: Don't build/install tests in top-level build by default The standalone build still includes tests, as does the top-level build when --enable-selftest is used. The latter is consistent with the use of --enable-selftest in the rest of the tree. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	3ff8765d04	ctdb-tests: Stop cat command failure from causing test failure In certain circumstance, which aren't obvious, cat(1) can fail when attempting to write a lot of data. This is due to something (probably write(2)) returning EAGAIN. Given that the -v option should only really be used for test debugging, ignore the failure instead of spending time debugging it. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14446 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 07:53:35 +00:00
Martin Schwenke	6436c74ebf	Revert "ctdb-build: Don't build/install tests in top-level build by default" Fix missing Reviewed-by: tag. This reverts commit `91c36c16c8`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Jul 22 06:29:43 UTC 2020 on sn-devel-184	2020-07-22 06:29:43 +00:00
Martin Schwenke	bdd89d5276	Revert "ctdb-tests: Enable SOCKET_WRAPPER_DIR_ALLOW_ORIG" Fix missing Reviewed-by: tag. This reverts commit `9694ba6fe4`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:47 +00:00
Martin Schwenke	6a3372e895	Revert "ctdb-tests: Show hung script debugging output" Fix missing Reviewed-by: tag. This reverts commit `c78de201f8`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	e4b1cdc709	Revert "ctdb-tests: Redirect stderr too when checking for shellcheck" Fix missing Reviewed-by: tag. This reverts commit `847aa0e367`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	a694c07126	Revert "ctdb-tests: Drop unreachable line" Fix missing Reviewed-by: tag. This reverts commit `a55dd6f17b`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	4438e44f88	Revert "ctdb-tests: Update preamble for INTEGRATION tests" Fix missing Reviewed-by: tag. This reverts commit `65f56505e2`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	271ad95e23	Revert "ctdb-tests: Use "#!/usr/bin/env bash" for improved portability" Fix missing Reviewed-by: tag. This reverts commit `9a7cabd342`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	60d999ad94	Revert "ctdb-tests: Make integration.bash pass shellcheck" Fix missing Reviewed-by: tag. This reverts commit `0f04b8a70b`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	548f2021df	Revert "ctdb-tests: Make unit.sh pass shellcheck" Fix missing Reviewed-by: tag. This reverts commit `30293baae5`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	da654f9795	Revert "ctdb-tests: Do not trigger ctdb_test_error() from ctdb_init()" Fix missing Reviewed-by: tag. This reverts commit `44e05ac851`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:46 +00:00
Martin Schwenke	e11526ad54	Revert "ctdb-tests: Separate custom cluster startup from test initialisation" Fix missing Reviewed-by: tag. This reverts commit `e9df17b500`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	941a2d0a3b	Revert "ctdb-tests: Don't bother shutting down daemons in ctdb_init()" Fix missing Reviewed-by: tag. This reverts commit `58f9f699f1`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	c9dfdeaddc	Revert "ctdb-tests: Drop uses of "onnode any ..." in testcases" Fix missing Reviewed-by: tag. This reverts commit `aa5b214eaa`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	635d5cfa31	Revert "ctdb-tests: Improve test portability" Fix missing Reviewed-by: tag. This reverts commit `1079d6e3ae`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	c83ece42e5	Revert "ctdb-tests: Improve test quality" Fix missing Reviewed-by: tag. This reverts commit `ea1cbff624`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	cf3b1fb390	Revert "ctdb-tests: Improve test portability" Fix missing Reviewed-by: tag. This reverts commit `1f6556916e`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	979a6c8c5f	Revert "ctdb-tests: Improve test quality" Fix missing Reviewed-by: tag. This reverts commit `a308f2534d`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	d035b69b53	Revert "ctdb-tests: Improve test portability/quality" Fix missing Reviewed-by: tag. This reverts commit `d2f8cd835d`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	5948a57920	Revert "ctdb-tests: Stop cat command failure from causing test failure" Fix missing Reviewed-by: tag. This reverts commit `5707781ccf`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-22 05:07:45 +00:00
Martin Schwenke	5707781ccf	ctdb-tests: Stop cat command failure from causing test failure In certain circumstance, which aren't obvious, cat(1) can fail when attempting to write a lot of data. This is due to something (probably write(2)) returning EAGAIN. Given that the -v option should only really be used for test debugging, ignore the failure instead of spending time debugging it. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14446 Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Jul 22 04:10:47 UTC 2020 on sn-devel-184	2020-07-22 04:10:47 +00:00
Martin Schwenke	d2f8cd835d	ctdb-tests: Improve test portability/quality Avoid use of non-portable md5sum by constructing database names using index. Improve indentation, use more modern commands, code improvements (shellcheck). Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	a308f2534d	ctdb-tests: Improve test quality Simplify code, use more modern commands, code improvements (shellcheck). Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	1f6556916e	ctdb-tests: Improve test portability "wc -l" on some platforms (e.g. FreeBSD) contains leading spaces, so strip them. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	ea1cbff624	ctdb-tests: Improve test quality Select test node with IPs instead of using a fixed node. Remove unnecessary code, use more modern commands, code improvements (shellcheck). Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	1079d6e3ae	ctdb-tests: Improve test portability "wc -l" on some platforms (e.g. FreeBSD) contains leading spaces and stops "$num from being a number. Create a more portable solution and put it in a function instead of repeating the logic. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	aa5b214eaa	ctdb-tests: Drop uses of "onnode any ..." in testcases It would be nice to get rid of "onnode any". There's no use making tests nondeterministic. If covering different cases matters then they should be explicitly handled. In most places "any" is replaced by "$test_node". In some cases, where $test_node is not set, a fixed node that is already used elsewhere can be reused. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	58f9f699f1	ctdb-tests: Don't bother shutting down daemons in ctdb_init() They'll never be up here... Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	e9df17b500	ctdb-tests: Separate custom cluster startup from test initialisation Separate cluster startup from test initialisation for tests that start the cluster with customised configuration. In these cases the result of the cluster startup is actually the point of the test. Additionally, pubips.013.failover_noop.sh claims to have completed test initialisation twice, which just seems wrong. The result is: * ctdb_test_init() takes one option (-n) to indicate when it should not configure/start the cluster * New function ctdb_nodes_start_custom() accepts options for special cluster configuration, only operates on local daemons and triggers a test failure rather than a test error on failure. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	44e05ac851	ctdb-tests: Do not trigger ctdb_test_error() from ctdb_init() The only caller calls ctdb_test_error() on failure and nesting this calls can be confusing. A future change will make this even more confusing. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:38 +00:00
Martin Schwenke	30293baae5	ctdb-tests: Make unit.sh pass shellcheck Mostly avoidance of quoting warnings. Silencing warnings about unquoted $CTDB_TEST_CAT_RESULTS_OPTS is handled by passing '-' to cat when that variable's value is empty. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	0f04b8a70b	ctdb-tests: Make integration.bash pass shellcheck Apart from the non-constant sourcing of include files. Mostly avoidance of quoting warnings. One subtle change is to simply pass "120" to wait_until_ready() to stop warnings that it expects arguments but none are passed (both SC2119 and SC2120). There seems no way to indicate to structure function argument handling so that shellcheck realises arguments are optional. In later shellcheck versions, disabling SC2120 for a function also silences complaints about its callers... but not all of our testing uses "later" shellcheck versions. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	9a7cabd342	ctdb-tests: Use "#!/usr/bin/env bash" for improved portability Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	65f56505e2	ctdb-tests: Update preamble for INTEGRATION tests * Use "#!/usr/bin/env bash" for improved portability * Drop test_info() definition and replace it with a comment The use of test_info() is pointless. * Drop call to cluster_is_healthy() This is a holdover from when the previous test would restart daemons to get things ready for a test. There was also a bug where going into recovery during the restart would sometimes cause the cluster to become unhealthy. If we really need something like this then we can add it to ctdb_test_init(). * Make order of preamble consistent Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	a55dd6f17b	ctdb-tests: Drop unreachable line ctdb_test_skip() will exit. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	847aa0e367	ctdb-tests: Redirect stderr too when checking for shellcheck Avoid: .../UNIT/shellcheck/scripts/local.sh: line 14: type: shellcheck: not found The "type" command in dash prints the "not found" message to stdout but the bash version prints to stderr, so redirect stderr too. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	c78de201f8	ctdb-tests: Show hung script debugging output The output in a test failure appears to contain no pstree output because "00\.test\.script,.*" does not match. However, this is just a guess because the output is not shown. Showing the output makes it easier to understand test failures. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	9694ba6fe4	ctdb-tests: Enable SOCKET_WRAPPER_DIR_ALLOW_ORIG This will allow local daemons to be used in more contexts, especially in tests run by Jenkins where the directory names for some targets can be very long. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	91c36c16c8	ctdb-build: Don't build/install tests in top-level build by default The standalone build still includes tests, as does the top-level build when --enable-selftest is used. The latter is consistent with the use of --enable-selftest in the rest of the tree. Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-07-22 02:42:37 +00:00
Martin Schwenke	0e287127cb	ctdb-tools: Improve onnode's ShellCheck credibility Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Thu Jul 16 06:51:47 UTC 2020 on sn-devel-184	2020-07-16 06:51:47 +00:00
Martin Schwenke	5f217d6037	ctdb-tools: Allow onnode -P to respect ONNODE_SSH Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-16 05:28:42 +00:00
Martin Schwenke	00eb88b241	ctdb-tools: Whitespace fixups Drop some unnecessary whitespace and re-indent push(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-16 05:28:42 +00:00
Martin Schwenke	bc174243d7	ctdb-tools: Drop undocumented ONNODE_SSH_OPTS variable Options can be set in ONNODE_SSH, so this variable is unnecessary. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-07-16 05:28:42 +00:00
Martin Schwenke	1e55591bc5	ctdb-tests: Add a new fetch ring test that also checks hot keys Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri May 22 08:05:54 UTC 2020 on sn-devel-184	2020-05-22 08:05:54 +00:00
Martin Schwenke	fb38252677	ctdb-tests: Update fetch_ring to take database and key on command line Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:45 +00:00
Martin Schwenke	53b73b9b0f	ctdb-daemon: Fix sorting of hot keys The current code only ever swaps with slot 0. This will only ever happen with slots 0 and 1, so probably never sorts. Replace with qsort(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:45 +00:00
Martin Schwenke	5c8dfbbf9b	ctdb-daemon: Add extra logging of hot keys ctdbd currently only logs when a new hot key is added. If a key gets hotter then nothing new is logged. Log hot key updates when the number of migrations has doubled since the last time that key was logged. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:45 +00:00
Martin Schwenke	baf058dcf7	ctdb-daemon: Update hot key logging This message indicates that a hot key was added, so say that. After all the hot key slots have been filled the id will always be 0, so don't bother logging it. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:44 +00:00
Martin Schwenke	1ab39b3270	ctdb-daemon: Fix bug in slot 0 comparison optimisation This is only valid if all slots are in use. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:44 +00:00
Martin Schwenke	f9f60c2a60	ctdb-daemon: Switch some variables to unsigned These should be unsigned but luck is currently on our side. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:44 +00:00
Martin Schwenke	21b9844bcb	ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:44 +00:00
Martin Schwenke	c28914bfa7	ctdb-build: Fix a typo Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-05-22 06:41:44 +00:00
Ralph Boehme	6e419dda71	ctdb: increase TasksMax limit, the systemd default is just 512 In 2015 systemd introduced a TasksMax which limits the number of processes in a unit: https://lists.freedesktop.org/archives/systemd-devel/2015-November/035006.html The default of 512 may be too low in certain situations leading to vfork() failing with errno=EAGAIN when trying to spawn lock-helper processes. With the default for LockProcessesPerDB being 200 the increased TasksMax limit should cover the problematic scenario. Additional background: the failing vfork()s have been seen on production clusters and were tracked down to being logged in the context of ctdb calling tdb_repack(). Links: `9ded9cd14c` https://www.suse.com/support/kb/doc/?id=000015901 https://success.docker.com/article/how-to-reserve-resource-temporarily-unavailable-errors-due-to-tasksmax-setting https://www.percona.com/blog/2019/01/02/tasksmax-another-setting-that-can-cause-mysql-error-messages/ Signed-off-by: Ralph Boehme <slow@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed May 13 13:30:12 UTC 2020 on sn-devel-184	2020-05-13 13:30:12 +00:00
Amitay Isaacs	23c2195e2c	ctdb-build: Add messages_dgm build to ctdb Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Volker Lendecke <vl@samba.org> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed May 6 01:47:16 UTC 2020 on sn-devel-184	2020-05-06 01:47:16 +00:00
Amitay Isaacs	a59fd8164c	lib/util: Build genrand for util core messages_dgm depends on genrand. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Volker Lendecke <vl@samba.org>	2020-05-06 00:06:40 +00:00
Volker Lendecke	d9ccd853c3	ctdb: Implement CTDB_CONTROL_ECHO_DATA Testing control: 4 bytes msec delay plus a blob, return the request after the delay. This is an enhanced "ping" which can be used to test asynchronous clients. Doesn't have the full protocol implementation yet Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-04-28 09:08:39 +00:00
Volker Lendecke	bdabf78122	ctdb-protocol: Add marshalling for control ECHO_DATA Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-04-28 09:08:39 +00:00
Volker Lendecke	6f56f45639	ctdb-protocol: Add marshalling for struct ctdb_echo_data Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-04-28 09:08:39 +00:00
Volker Lendecke	4f3db63d5e	ctdb-protocol: Add new control CTDB_CONTROL_ECHO_DATA Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-04-28 09:08:39 +00:00
Volker Lendecke	861dd8c48a	ctdb: Fix duplicate ;; Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-04-28 09:08:39 +00:00
Renaud Fortier	fdfc480a56	ctdb-scripts: Update nfs-ganesha-callout On debian buster, this variable doesn't exist anymore. Look at this PR as a reference: https://github.com/gluster/storhaug/pull/30 Signed-off-by: Renaud Fortier <renaud.fortier@fsaa.ulaval.ca> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Andrew Bartlett <abartlet@samba.org> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Thu Apr 23 08:07:51 UTC 2020 on sn-devel-184	2020-04-23 08:07:51 +00:00
Volker Lendecke	ad4b53f2d9	ctdb: Fix a memleak Bug: https://bugzilla.samba.org/show_bug.cgi?id=14348 Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Apr 17 08:32:35 UTC 2020 on sn-devel-184	2020-04-17 08:32:35 +00:00
Martin Schwenke	f8f3d7954d	ctdb-vacuum: Reschedule vacuum event if VacuumInterval has increased The vacuuming integration tests set VacuumInterval to a very high number to avoid vacuuming collisions. This is done after the cluster is healthy, so Samba will have already been started and vacuuming will already be scheduled at the default interval for databases attached by Samba. This means that vacuuming controls used by vacuuming tests can still collide with the scheduled vacuuming events. Add some logic to reschedule a vacuuming event that has fired but where VacuumInterval has increased since it was originally scheduled. The increase in VacuumInterval is used as the time offset for rescheduling the event. Although this changes production behaviour for the convenience of testing, the new behaviour is completely reasonable and obeys the principle of least surprise. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Apr 7 03:04:57 UTC 2020 on sn-devel-184	2020-04-07 03:04:57 +00:00
Martin Schwenke	5d03a3c86e	ctdb-vacuum: Store value of VacuumInterval in ctdb_vacuum_handle No behaviour change. This is final staging to make the next change completely obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-04-07 01:26:41 +00:00
Martin Schwenke	7ad7c0b932	ctdb-vacuum: Use vacuum_handle local variables No behaviour change. This just makes future changes clearer by avoiding reformatting (or introducing local variables). Clean up error handling while touching a relevant line. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-04-07 01:26:41 +00:00
Martin Schwenke	716f52f68b	ctdb-recoverd: Avoid dereferencing NULL rec->nodemap Inside the nested event loop in ctdb_ctrl_getnodemap(), various asynchronous handlers may dereference rec->nodemap, which will be NULL. One example is lost_reclock_handler(), which causes rec->nodemap to be unconditionally dereferenced in list_of_nodes() via this call chain: list_of_nodes() list_of_active_nodes() set_recovery_mode() force_election() lost_reclock_handler() Instead of attempting to trace all of the cases, just avoid leaving rec->nodemap set to NULL. Attempting to use an old value is generally harmless, especially since it will be the same as the new value in most cases. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14324 Reported-by: Volker Lendecke <vl@samba.org> Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Mar 24 01:22:45 UTC 2020 on sn-devel-184	2020-03-24 01:22:45 +00:00
Martin Schwenke	147afe77de	ctdb-daemon: Don't allow attach from recovery if recovery is not active Neither the recovery daemon nor the recovery helper should attach databases outside of the recovery process. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	052f1bdb9c	ctdb-daemon: Remove more unused old client database functions BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	3a66d181b6	ctdb-recovery: Remove old code for creating missing databases BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	76a8174279	ctdb-recovery: Create database on nodes where it is missing BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	e6e63f8fb8	ctdb-recovery: Fetch database name from all nodes where it is attached BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	1bdfeb3fdc	ctdb-recovery: Pass db structure for each database recovery Instead of db_id and db_flags. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	c6f74e590f	ctdb-recovery: GET_DBMAP from all nodes This builds a complete list of databases across the cluster so it can be used to create databases on the nodes where they are missing. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	4c0b9c3605	ctdb-recovery: Replace use of ctdb_dbid_map with local db_list This will be used to build a merged list of databases from all nodes, allowing the recovery helper to create missing databases. It would be possible to also include the db_name field in this structure but that would cause a lot of churn. This field is used locally in the recovery of each database so can continue to live in the relevant state structure(s). BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	7e5a8a4884	ctdb-daemon: Respect CTDB_CTRL_FLAG_ATTACH_RECOVERY when attaching databases This is currently only set by the recovery daemon when it attaches missing databases, so there is no obvious behaviour change. However, attaching missing databases can now be moved to the recovery helper as long as it sets this flag. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	98e3d0db2b	ctdb-recovery: Use CTDB_CTRL_FLAG_ATTACH_RECOVERY to attach during recovery ctdb_ctrl_createdb() is only called by the recovery daemon, so this is a safe, temporary change. This is temporary because ctdb_ctrl_createdb(), create_missing_remote_databases() and create_missing_local_databases() will all go away soon. Note that this doesn't cause a change in behaviour. The main daemon will still only defer attaches from non-recoverd processes during recovery. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:38 +00:00
Martin Schwenke	17ed042590	ctdb-protocol: Add control flag CTDB_CTRL_FLAG_ATTACH_RECOVERY BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:37 +00:00
Martin Schwenke	fc23cd1b9c	ctdb-daemon: Remove unused old client database functions BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:37 +00:00
Martin Schwenke	c6c89495fb	ctdb-daemon: Fix database attach deferral logic Commit `3cc230b5ee` says: Dont allow clients to connect to databases untile we are well past and through the initial recovery phase It is unclear what this commit was attempting to do. The commit message implies that more attaches should be deferred but the code change adds a conjunction that causes less attaches to be deferred. In particular, no attaches will be deferred after startup is complete. This seems wrong. To implement what seems to be stated in the commit message an "or" needs to be used so that non-recovery daemon attaches are deferred either when in recovery or before startup is complete. Making this change highlights that attaches need to be allowed during the "startup" event because this is when smbd is started. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-23 23:45:37 +00:00
Amitay Isaacs	1c56d6413f	ctdb-recovery: Refactor banning a node into separate computation If a node is marked for banning, confirm that it's not become inactive during the recovery. If yes, then don't ban the node. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-23 23:45:37 +00:00
Amitay Isaacs	c6a0ff1bed	ctdb-recovery: Don't trust nodemap obtained from local node It's possible to have a node stopped, but recovery master not yet updated flags on the local ctdb daemon when recovery is started. So do not trust the list of active nodes obtained from the local node. Query the connected nodes to calculate the list of active nodes. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-23 23:45:37 +00:00
Amitay Isaacs	6e2f8756f1	ctdb-recovery: Consolidate node state This avoids passing multiple arguments to async computation. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-23 23:45:37 +00:00
Amitay Isaacs	072ff4d12b	ctdb-recovery: Fetched vnnmap is never used, so don't fetch it New vnnmap is constructed using the information from all the connected nodes. So there is no need to fetch the vnnmap from recovery master. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14294 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-23 23:45:37 +00:00
Martin Schwenke	319c93f0c6	ctdb-tcp: Do not stop outbound connection in ctdb_tcp_node_connect() The only place the outgoing connection needs to be stopped is when there is a timeout when waiting for the connection to become writable. Add a new function ctdb_tcp_node_connect_timeout() to handle this case. All of the other cases are attempts to establish a new outgoing connection (initial attempt, retry after an error or disconnect, ...) so drop stopping the connection in those cases. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Thu Mar 12 05:29:20 UTC 2020 on sn-devel-184	2020-03-12 05:29:20 +00:00
Martin Schwenke	3c8747fe29	ctdb-tcp: Factor out function ctdb_tcp_start_outgoing() BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Ralph Boehme	2c73dbafba	ctdb-tcp: add ctdb_tcp_stop_incoming() No change in behaviour. This makes the code self-documenting. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Ralph Boehme <slow@samba.org> Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Ralph Boehme	1e2a967ff4	ctdb-tcp: rename ctdb_tcp_stop_connection() to ctdb_tcp_stop_outgoing() No change in behaviour. This makes the code self-documenting. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Ralph Boehme <slow@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Ralph Boehme	ea37ecdcd5	ctdb-tcp: Remove redundant restart in ctdb_tcp_tnode_cb() The node dead upcall has already restarted the outgoing connection. There's no need to repeat it. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Ralph Boehme <slow@samba.org> Signed-off-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Ralph Boehme	b83ef98c74	ctdb-tcp: always call node_dead() upcall in ctdb_tcp_tnode_cb() ctdb_tcp_tnode_cb() is called when we receive data on the outgoing connection. This can happen when we get an EOF on the connection because the other side as closed. In this case data will be NULL. It would also be called if we received data from the peer. In this case data will not be NULL. The latter case is a fatal error though and we already call ctdb_tcp_stop_connection() for this case as well, which means even though the node is not fully connected anymore, by not calling the node_dead() upcall NODE_FLAGS_DISCONNECTED will not be set. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Ralph Boehme <slow@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Noel Power	0ff1b78fc2	ctdb-tcp: move free of inbound queue to TCP restart Since commit `77deaadca8`, a nodeA which had previously accepted a connection from nodeB (where nodeB dies e.g. as as result of fencing) when nodeB attempts to connect again after restarting is always rejected with ctdb_listen_event: Incoming queue active, rejecting connection from w.x.y.z messages. Consolidate dead node handling in the TCP restart handling. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Noel Power <noel.power@suse.com> Reviewed-by: Ralph Boehme <slow@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Martin Schwenke	15762a3455	ctdb-daemon: more logical whitespace, debug modernisation BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Ralph Boehme <slow@samba.org>	2020-03-12 03:47:30 +00:00
Ralph Boehme	6a4fa0785f	ctdb-daemon: ensure restart() callback is called in half-connected state If NODE_FLAGS_DISCONNECTED is set the node can be in half-connected state. With this change we ensure to restart the transport for this case. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14295 Signed-off-by: Ralph Boehme <slow@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-03-12 03:47:30 +00:00
Martin Schwenke	9f9dcfb6c3	ctdb-tests: Use built-in hexdump() in system socket tests Better compatibility, since od output isn't consistent on FreeBSD. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Mar 10 09:17:12 UTC 2020 on sn-devel-184	2020-03-10 09:17:12 +00:00
Martin Schwenke	602694522f	ctdb-tests: Split system socket test One test for each of types, TCP, ARP. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-10 07:37:34 +00:00
Martin Schwenke	b10e79f208	ctdb-tests: Skip "ctdb process-exists" tests when not on Linux Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-10 07:37:34 +00:00
Martin Schwenke	c5dd476715	ctdb-tests: Add function ctdb_test_check_supported_OS Skips test if not on one of the supported OSes. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-10 07:37:34 +00:00
Martin Schwenke	8402dabf88	ctdb-tests: Use ctdb_test_skip() when initscript can not be found Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-10 07:37:34 +00:00
Martin Schwenke	30180ef6c2	ctdb-tests: Use ctdb_test_skip() when shellcheck is not installed When the tests are run interactively this will make it more noticeable that shellcheck is not installed because the test summary will indicate missing tests. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-10 07:37:34 +00:00
Martin Schwenke	77f6977102	ctdb-tests: Skipped tests should not cause failure Skipped tests return a status that indicates failure. In combination with the -e option this results in an exit with failure on the first skipped test. Convert skipped test status to success. The skip has already been counted. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-03-10 07:37:34 +00:00
Martin Schwenke	be90ab01bb	ctdb-docs: Improve recovery lock documentation Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Christof Schmitt <cs@samba.org> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Mon Mar 9 02:27:18 UTC 2020 on sn-devel-184	2020-03-09 02:27:18 +00:00
Martin Schwenke	7cff3ed12c	ctdb-tests: Use a local "ctdb shutdown" command to avoid a race When "ctdb shutdown" is run with -n <N> it does not wait for the node <N>'s ctdbd to go down but exits immediately. This means that the local_daemons.sh shutdown command can find the PID file still present and then attempt the shutdown, but the daemon can have exited between the check and the shutdown. Although the test waits until the node is disconnected, the transport is taken down just before the exit, so this does not guarantee the daemon has exited. A local shutdown command (no -n <N>) waits until the socket disconnects and this happens after the PID file is gone, so this is safe to use with the local_daemons.sh shutdown command. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Mon Mar 2 10:39:28 UTC 2020 on sn-devel-184	2020-03-02 10:39:28 +00:00
Martin Schwenke	1e72fbdde0	ctdb-tests: Silence a ShellCheck warning Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Ralph Boehme <slow@samba.org> Autobuild-User(master): Ralph Böhme <slow@samba.org> Autobuild-Date(master): Sat Feb 29 11:53:42 UTC 2020 on sn-devel-184	2020-02-29 11:53:42 +00:00
Ralph Boehme	c2dba1f53b	ctdb: add tail logs option to local_daemons.sh Signed-off-by: Ralph Boehme <slow@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Sat Feb 29 08:02:50 UTC 2020 on sn-devel-184	2020-02-29 08:02:50 +00:00
Anoop C S	959235fffb	ctdb-docs: Move CTDB_SERVICE_NMB to new 48.netbios section Signed-off-by: Anoop C S <anoopcs@redhat.com> Reviewed-by: Guenther Deschner <gd@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Thu Feb 27 07:34:53 UTC 2020 on sn-devel-184	2020-02-27 07:34:53 +00:00
Anoop C S	512fa29cce	ctdb-scripts: Change CTDB_SERVICE_NMB default value to 'nmb' Till now 50.samba script was based on RHEL versions <=6 where we didn't have separate start up script for nmb and smbd used to start nmbd when required. Now that nmbd has its own start up script named "nmb" it is reasonable to have "nmb" as default value for CTDB_SERVICE_NMB inside new 48.netbios ctdb script. Signed-off-by: Anoop C S <anoopcs@redhat.com> Reviewed-by: Guenther Deschner <gd@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-02-27 06:07:41 +00:00
Günther Deschner	26e1556819	ctdb-scripts: add new 48.netbios script for starting nmbd This change basically moves out nmbd references from 50.samba script to a new 48.netbios script. Accordingly ctdb test scripts are tweaked to cope with newly added script. Signed-off-by: Guenther Deschner <gd@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-02-27 06:07:41 +00:00
Martin Schwenke	4de1e3207b	ctdb-docs: Provide example commands for "ctdb event ..." The example output doesn't tell a user what command generated it. Adding the command makes the examples much more useful. Reported-by: Stefan Kania <stefan@kania-online.de> Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Feb 18 04:22:56 UTC 2020 on sn-devel-184	2020-02-18 04:22:56 +00:00
Martin Schwenke	3d5de9b26d	ctdb-tests: Flag setup, startup, shutdown failures as test errors Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-18 02:56:38 +00:00
Martin Schwenke	455d931a16	ctdb-tests: Dump logs on shutdown failure Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-18 02:56:38 +00:00
Martin Schwenke	03403aacfe	ctdb-tests: Avoid shutdown error when daemon already cleanly shut down This depends on a small amount of internal knowledge but is the cleanest way of avoiding errors for nodes that have already been shut down. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-18 02:56:38 +00:00
Martin Schwenke	dc076b835f	ctdb-tests: Rationalise node stop/start/restart Separate functions are not needed for stopping/starting/restarting individual nodes. The stop and start functions essentially just use onnode, though for local daemons this is embedded in local_daemons.sh. So, just provide one stop and one start function that takes an optional nodespec, defaulting to all nodes. Restarting becomes common. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-18 02:56:38 +00:00
Martin Schwenke	a20403adf8	ctdb-daemon: Fix signed/unsigned comparison csbuild says: ctdb/server/ctdb_lock.c: scope_hint: In function ‘ctdb_find_lock_context’ ctdb/server/ctdb_lock.c:671:33: warning: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Wsign-compare] Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-18 02:56:38 +00:00
Martin Schwenke	c9405aec70	ctdb-daemon: Check for lock count underflow This is a programming error. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-18 02:56:38 +00:00
Amitay Isaacs	c16da0e8f0	ctdb-common: Remove signed/unsigned comparisons Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2020-02-18 02:56:38 +00:00
Martin Schwenke	bd279d3f98	ctdb-tests: Fix getdbmap test so that it actually works sanely * Typo in variable name db_map_pattern * Variable num_db_init used before set * dbmap_pattern does not cover database flags Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Feb 12 04:38:47 UTC 2020 on sn-devel-184	2020-02-12 04:38:47 +00:00
Martin Schwenke	224e897872	ctdb-tests: Fix handling of --no-event-scripts option Shellcheck noticed that pnn was never referenced. Not sure this ever worked or whether it got broken somewhere along the way. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-12 03:11:39 +00:00
Martin Schwenke	a6d464aa2e	ctdb-tests: Use a here document to improve readability Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-12 03:11:39 +00:00
Martin Schwenke	b9f23f5b49	ctdb-tests: Use select_test_node() select_test_node_and_ips() is not required in these cases. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-12 03:11:39 +00:00
Martin Schwenke	0162fd87ed	ctdb-tests: Increase to dumping up to 500 lines of logs on error 100 lines are not enough to debug a current issue. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-12 03:11:39 +00:00
Martin Schwenke	5a702b01f6	ctdb-tests: Fix return value of DB test tool delete command Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-12 03:11:39 +00:00
Martin Schwenke	a40fc709cc	ctdb-tcp: Make error handling for outbound connection consistent If we can't bind the local end of an outgoing connection then something has gone wrong. Retrying is better than failing into a zombie state. The interface might come back up and/or the address my be reconfigured. While here, do the same thing for the other (potentially transient) failures. The unknown address family failure is special but just handle it via a retry. Technically it can't happen because the node address parsing can only return values with address family AF_INET or AF_INET6. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14274 Reported-by: 耿纪超 <gengjichao@jd.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-12 03:11:39 +00:00
Martin Schwenke	0b3db29bd5	ctdb-tests: Add some tool unit tests to ensure that timeouts work Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Mon Feb 10 05:34:08 UTC 2020 on sn-devel-184	2020-02-10 05:34:08 +00:00
Martin Schwenke	0e59cd25e1	ctdb-tools: Allow shorter runtime limit to be specified Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:39 +00:00
Martin Schwenke	39206fd327	ctdb-tools: When in test mode set process group in top-level ctdb tool If ctdbd hangs when shutting down in post-test clean-up then killing the process group can kill the test. When in test mode, create a process group but only in the top-level ctdb tool - the natgw and lvs helpers also run the ctdb tool. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:39 +00:00
Martin Schwenke	3b0b830e40	ctdb-tests: Use $PWD/bin/ if it exists when running in-tree When running tests from a top-level build, a stale build in ctdb/bin/ will be preferred and may cause confusing results. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:39 +00:00
Martin Schwenke	b0b14e4edd	ctdb-tests: Make $ctdb_dir absolute This is used to set several variables so it might as well be cd-proof. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:39 +00:00
Martin Schwenke	1a0e1f8924	ctdb-daemon: Fork when not interactive and test mode is enabled There is no sane way of keeping stdin open when using the shell to background ctdbd in local_daemons.sh. Instead, have ctdbd fork when not interactive and when test mode is enabled. become_daemon() can't be used for this: if it forks then it also closes stdin. For the interactive case, become_daemon() wasn't doing anything special, so do nothing instead. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:39 +00:00
Martin Schwenke	a220e9454a	ctdb-daemon: Make some conditions more explicit These don't need to depend on do_fork. Child logging should be set up whenever the daemon is not interactive. The stdin handler should be setup whenever test mode is enabled. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:39 +00:00
Martin Schwenke	cefb3327c6	ctdb-daemon: Pass more information to ctdb_start_daemon() No functional changes. This is staging for a change that makes ctdbd fork when test mode is enabled but interactive is not set. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:38 +00:00
Martin Schwenke	3509aa28d4	ctdb-tests: Don't actually close stdin in fake ssh A subsequent file descriptor allocation may return 0 and unexpected things may then happen. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:38 +00:00
Martin Schwenke	8de1bb75e5	ctdb-tests: Redirect stdin from /dev/null when running a test Otherwise, if the test is run via ssh it will "unexpectedly" find itself at the other end of a pipe. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:38 +00:00
Martin Schwenke	0737849b90	Revert "ctdb-tests: Enable job control when keeping stdin open" This doesn't work when stdin is not a tty. This reverts commit `ea754bfdec`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-02-10 04:07:38 +00:00
Volker Lendecke	3d40efaed8	ctdb-test: Fix a typo Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Andreas Schneider <asn@samba.org> Autobuild-User(master): Andreas Schneider <asn@cryptomilk.org> Autobuild-Date(master): Thu Jan 30 13:53:22 UTC 2020 on sn-devel-184	2020-01-30 13:53:22 +00:00
Martin Schwenke	ea754bfdec	ctdb-tests: Enable job control when keeping stdin open POSIX says: If job control is disabled (see set, -m), the standard input for an asynchronous list, before any explicit redirections are performed, shall be considered to be assigned to a file that has the same properties as /dev/null. This shall not happen if job control is enabled. In all cases, explicit redirection of standard input shall override this activity. ctdbd is backgrounded at startup, so the above causes stdin to be redirected from /dev/null. Enable job control to work around this. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jan 28 11:24:35 UTC 2020 on sn-devel-184	2020-01-28 11:24:35 +00:00
Martin Schwenke	2380b13bf8	ctdb-tests: Don't close stdin when starting local daemons Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-28 09:57:33 +00:00
Martin Schwenke	cf460bd9c4	ctdb-daemon: Shut down if interactive and stdin is closed This allows a test environment to simply close its end of a pipe to cleanly shutdown ctdbd. Like in smbd, this is only done if stdin is a pipe or a socket. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-28 09:57:32 +00:00
Martin Schwenke	d79e2dcfc8	ctdb-daemon: Only stop monitoring if it has been initialised This avoids a crash if ctdb_shutdown_sequence() is called before monitoring is initialised. Switch to using TALLOC_FREE() while touching this function. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-28 09:57:32 +00:00
Martin Schwenke	aa2977e151	ctdb-mutex: Change default re-check time for fcntl helper to 5s Testing against a commonly used cluster filesystem has shown no performance impact, as expected. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-21 11:39:40 +00:00
Martin Schwenke	14b1dffc27	ctdb-tests: Add some tests to check recovery from recovery lock issues Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-21 11:39:40 +00:00
Martin Schwenke	64501f5193	ctdb-tests: Put recovery lock for local daemons into a subdirectory This makes it more like the way it works with a cluster filesystem. It also allows the subdirectory to be manipulated in tests. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-21 11:39:40 +00:00
Martin Schwenke	93fc31858f	ctdb-tests: Add local_daemons.sh option for recovery lock recheck interval Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-21 11:39:40 +00:00
Volker Lendecke	42a3e2e503	ctdbd: Use struct initialization 2 lines less Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2020-01-19 18:29:39 +00:00
Martin Schwenke	9edf15afc2	ctdb-tests: Skip some tests that don't work with IPv6 See the comments added to the tests. It may be possible to rewrite these so they do something sane for IPv6... some other time. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14227 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Jan 3 00:00:55 UTC 2020 on sn-devel-184	2020-01-03 00:00:55 +00:00
Martin Schwenke	693080abe4	ctdb-scripts: Strip square brackets when gathering connection info ss added square brackets around IPv6 addresses in versions > 4.12.0 via commit aba9c23a6e1cb134840c998df14888dca469a485. CentOS 7 added this feature somewhere mid-release. So, backward compatibility is obviously needed. As per the comment protocol/protocol_util.c should probably print and parse such square brackets. However, for backward compatibility the brackets would have to be stripped in both places in update_tickles()... or added to the ss output when missing. Best to leave this until we have a connection tracking daemon. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14227 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2020-01-02 22:36:34 +00:00
Amitay Isaacs	963a639101	ctdb-tests: Add tests for cmdline_add() api Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Thu Nov 14 12:03:46 UTC 2019 on sn-devel-184	2019-11-14 12:03:46 +00:00
Amitay Isaacs	e469d6c119	ctdb-common: Add api to add new section/commands to cmdline Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-11-14 10:38:34 +00:00
Amitay Isaacs	977a6f7fad	ctdb-common: Change cmdline implementation to support multiple sections Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-11-14 10:38:34 +00:00
Amitay Isaacs	7a008c6b74	ctdb-tests: Update cmdline tests for section name Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-11-14 10:38:34 +00:00
Amitay Isaacs	b2b24c91fa	ctdb-common: Add section to group commands in cmdline Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-11-14 10:38:34 +00:00
Amitay Isaacs	29948d7b1e	ctdb-common: Generate usage message from cmdline_parse() If any of the option parsing or command parsing fails, generate usage message. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-11-14 10:38:34 +00:00
Martin Schwenke	e45feaf28d	ctdb-tcp: Simplify freeing of transport data on shutdown The type-checking is superfluous and gets in the way of readability. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Thu Nov 14 03:45:44 UTC 2019 on sn-devel-184	2019-11-14 03:45:44 +00:00
Martin Schwenke	750f3938e4	ctdb-daemon: Rename ctdb_context private_data to transport_data This gives a casual reader a useful clue. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-14 02:20:46 +00:00
Martin Schwenke	53f8492caa	ctdb-daemon: Rename ctdb_node private_data to transport_data This gives a casual reader a useful clue. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-14 02:20:46 +00:00
Volker Lendecke	a6d99d9e5c	ctdb-tcp: Close inflight connecting TCP sockets after fork Commit `c68b6f96f2` changed the talloc hierarchy such that outgoing TCP sockets while sitting in the async connect() syscall are not freed via ctdb_tcp_shutdown() anymore, they are hanging off a longer-running structure. Free this structure as well. If an outgoing TCP socket leaks into a long-running child process (possibly the recovery daemon), this connection will never be closed as seen by the destination node. Because with recent changes incoming connections will not be accepted as long as any incoming connection is alive, with that socket leak into the recovery daemon we will never again be able to successfully connect to the node that is affected by this leak. Further attempts to connect will be discarded by the destination as long as the recovery daemon keeps this socket alive. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14175 RN: Avoid communication breakdown on node reconnect Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-14 02:20:46 +00:00
Amitay Isaacs	816205027a	ctdb-ib: Fix build errors for infiniband transport Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Nov 13 13:31:10 UTC 2019 on sn-devel-184	2019-11-13 13:31:10 +00:00
Andrew Bartlett	92ce387ed0	build: Remove workaround for missing os.path.relpath in Python < 2.6 Signed-off-by: Andrew Bartlett <abartlet@samba.org> Reviewed-by: David Mulder <dmulder@suse.com> Reviewed-by: Andreas Schneider <asn@samba.org>	2019-11-13 08:42:30 +00:00
Volker Lendecke	f5f89b1b99	ctdb: Use TALLOC_FREE() in a few places We have a macro for NULLing out the pointer Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Nov 8 01:35:11 UTC 2019 on sn-devel-184	2019-11-08 01:35:11 +00:00
Martin Schwenke	bf99f82077	ctdb-tests: Make process exists test more resilient This can fail as follows: --==--==--==--==--==--==--==--==--==--==--==--==--==--==--==--==--==--==-- Running test ./tests/UNIT/tool/ctdb.process-exists.003.sh (02:26:30) --==--==--==--==--==--==--==--==--==--==--==--==--==--==--==--==--==--==-- ctdb.process-exists.003 - ctdbd process with multiple connections on node 0 Setting up fake ctdbd <10\|\|0\| OK <10\|PID 26107 exists \|0\| OK ================================================== Running "ctdb -d NOTICE process-exists 26107 0x1234567812345678" PASSED ================================================== Running "ctdb -d NOTICE process-exists 26107 0xaebbccdd12345678" Registered SRVID 0xaebbccdd12345678 -------------------------------------------------- Output (Exit status: 1): -------------------------------------------------- PID 26107 with SRVID 0xaebbccdd12345678 does not exist -------------------------------------------------- Required output (Exit status: 0): -------------------------------------------------- PID 26107 with SRVID 0xaebbccdd12345678 exists FAILED connection to daemon closed, exiting ========================================================================== TEST FAILED: ./tests/UNIT/tool/ctdb.process-exists.003.sh (status 1) (duration: 0s) ========================================================================== This happens when dummy_client has not registered the SRVID (for its 10th connection) before the 2nd simple_test. Change the initial wait to ensure that the SRVID is registered. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Nov 6 02:46:24 UTC 2019 on sn-devel-184	2019-11-06 02:46:24 +00:00
Martin Schwenke	dd9d5ec5c8	ctdb-tests: Improve code quality in ctdb_init() Improve quoting and indentation. Print a clear error if the cluster goes back into recovery and doesn't come back out. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-06 01:22:30 +00:00
Martin Schwenke	3b5ed00054	ctdb-tests: No longer retry starting the cluster Retrying like this hides bugs. The cluster should come up first time, every time. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-06 01:22:30 +00:00
Martin Schwenke	bf47bc18bb	ctdb-tcp: Drop tracking of file descriptor for incoming connections This file descriptor is owned by the incoming queue. It will be closed when the queue is torn down. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14175 RN: Avoid communication breakdown on node reconnect Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-06 01:22:30 +00:00
Martin Schwenke	d0baad257e	ctdb-tcp: Avoid orphaning the TCP incoming queue CTDB's incoming queue handling does not check whether an existing queue exists, so can overwrite the pointer to the queue. This used to be harmless until commit `c68b6f96f2` changed the read callback to use a parent structure as the callback data. Instead of cleaning up an orphaned queue on disconnect, as before, this will now free the new queue. At first glance it doesn't seem possible that 2 incoming connections from the same node could be processed before the intervening disconnect. However, the incoming connections and disconnect occur on different file descriptors. The queue can become orphaned on node A when the following sequence occurs: 1. Node A comes up 2. Node A accepts an incoming connection from node B 3. Node B processes a timeout before noticing that outgoing the queue is writable 4. Node B tears down the outgoing connection to node A 5. Node B initiates a new connection to node A 6. Node A accepts an incoming connection from node B Node A processes then the disconnect of the old incoming connection from (2) but tears down the new incoming connection from (6). This then occurs until the originally affected node is restarted. However, due to the number of outgoing connection attempts and associated teardowns, this induces the same behaviour on the corresponding incoming queue on all nodes that node A attempts to connect to. Therefore, other nodes become affected and need to be restarted too. As a result, the whole cluster probably needs to be restarted to recover from this situation. The problem can occur any time CTDB is started on a node. The fix is to avoid accepting new incoming connections when a queue for incoming connections is already present. The connecting node will simply retry establishing its outgoing connection. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14175 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-06 01:22:30 +00:00
Martin Schwenke	e62b3a05a8	ctdb-tcp: Check incoming queue to see if incoming connection is up This makes it consistent with the reverse case. Also, in_fd will soon be removed. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14175 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-11-06 01:22:30 +00:00
Björn Jacke	a456c2bb02	ctdb/utils/smnotify/smnotify.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:39 +00:00
Björn Jacke	1e73161bdd	ctdb/utils/scsi_io/scsi_io.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:39 +00:00
Björn Jacke	f3754b6487	ctdb/server/ctdb_daemon.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:39 +00:00
Björn Jacke	5d2a257c2e	ctdb/server/ctdb_client.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:39 +00:00
Björn Jacke	7722bd80fc	ctdb/server/ctdb_call.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	9fa37484c3	ctdb/include/ctdb_private.h: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	493705dc27	ctdb/ib/ibwrapper_test.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	540325d3cb	ctdb/ib/ibw_ctdb.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	b9d7b85afd	ctdb/doc/readonlyrecords.txt: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	cb867b29c9	ctdb/doc/ctdb.1.xml: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	93859b3394	ctdb/doc/ctdb-tunables.7.xml: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	37398acca5	ctdb/doc/ctdb-statistics.7.xml: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	1b51b44487	ctdb/common/srvid.h: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Björn Jacke	c71a1df18a	ctdb/client/client.h: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-31 00:43:38 +00:00
Martin Schwenke	6de5706b4d	ctdb-tests: Add vacuuming tests Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Thu Oct 24 05:28:21 UTC 2019 on sn-devel-184	2019-10-24 05:28:21 +00:00
Martin Schwenke	49262a6bc4	ctdb-tests: Add handling of process clean-up on a cluster node Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:44 +00:00
Martin Schwenke	b9654085f5	ctdb-tests: Factor out function check_cattdb_num_records() This can be use in multiple vacuuming tests. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:44 +00:00
Martin Schwenke	5a6d319eea	ctdb-tests: Add ctdb-db-test tool Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:44 +00:00
Martin Schwenke	439ef65d29	ctdb-client: Factor out function client_db_tdb() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:44 +00:00
Martin Schwenke	41a41d5f3e	ctdb-daemon: Implement DB_VACUUM control Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Martin Schwenke	d462d64cdf	ctdb-vacuum: Only schedule next vacuum event if vacuuuming is scheduled At the moment vacuuming is always scheduled. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Martin Schwenke	13cedaf019	ctdb-daemon: Factor out code to create vacuuming child This changes the behaviour for some failures from exiting to simply attempting to schedule the next run. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Martin Schwenke	5539edfdbe	ctdb-vacuum: Simplify recording of in-progress vacuuming child There can only be one, so simplify the logic. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Martin Schwenke	496204feb0	ctdb-protocol: Add marshalling for control DB_VACUUM Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Martin Schwenke	a896486b62	ctdb-protocol: Add marshalling for struct ctdb_db_vacuum Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Martin Schwenke	b314835341	ctdb-protocol: Add new control CTDB_CONTROL_DB_VACUUM Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:43 +00:00
Amitay Isaacs	d0cc9edc05	ctdb-vacuum: Avoid processing any more packets All the vacuum operations if required have an event loop to ensure completion of pending operations. Once all the steps are complete, there is no reason to process any more packets. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:43 +00:00
Amitay Isaacs	680df07630	ctdb-daemon: Avoid memory leak when packet is deferred Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:43 +00:00
Amitay Isaacs	c6427dddf5	ctdb-recoverd: No need for database detach handler The only reason for recoverd attaching to databases was to migrate records to the local node as part of vacuuming. Recovery daemon does not take part in database vacuuming any more. The actual database recovery is handled via the recovery_helper and recovery daemon should not need to attach to the databases any more. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:43 +00:00
Amitay Isaacs	fc81729dd2	ctdb-recoverd: Drop VACUUM_FETCH message handling This is now implemented in the ctdb daemon using VACUMM_FETCH control. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:43 +00:00
Amitay Isaacs	498932c0e8	ctdb-vacuum: Replace VACUUM_FETCH message with control Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	86521837b6	ctdb-vacuum: Add processing of fetch queue Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	da617f90d9	ctdb-daemon: Add implementation of VACUUM_FETCH control Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	36f9b4953a	ctdb-tests: Add marshalling tests for new control Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	b71d8cd80f	ctdb-protocol: Add marshalling for new control VACUUM_FETCH Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	0872c52ef0	ctdb-protocol: Add new control VACUUM_FETCH Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	913bd331f6	ctdb-tests: Drop code releated to obsolete controls Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Amitay Isaacs	688567f080	ctdb-protocol: Drop code related to obsolete controls Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-24 04:06:42 +00:00
Volker Lendecke	9f41a9fc1e	ctdb: Avoid malloc/memcpy/free in ctdb_ltdb_fetch() Make use of tdb_parse_record() Signed-off-by: Volker Lendecke <vl@samba.org> Signed-off-by: Amitay Isaacs <amitay@gmail.com>	2019-10-24 04:06:42 +00:00
Martin Schwenke	aa37668218	ctdb-tests: Add -l option to set number of local daemons This is the only place where setting an environment variable by hand is recommended, so remove the anomaly. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Ralph Böhme <slow@samba.org> Autobuild-Date(master): Tue Oct 22 21:02:11 UTC 2019 on sn-devel-184	2019-10-22 21:02:11 +00:00
Martin Schwenke	fe80038d07	ctdb-tests: Prefix remaining environment variables with CTDB_ Now they are clearly all part of CTDB. TEST_SOCKET_WRAPPER_SO_PATH gets too long in integration_local_daemons.bash, so change it to CTDB_TEST_SWRAP_SO_PATH instead of just prefixing. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-22 19:39:41 +00:00
Martin Schwenke	e8dc125ed2	ctdb-tests: Drop setting of test state directory for testonly target This is the default and deciding this should be left to run_tests.sh. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-22 19:39:41 +00:00
Martin Schwenke	fc16b8dbc6	ctdb-tests: Enable printing of logs on failure in autobuild Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-22 19:39:41 +00:00
Martin Schwenke	787662604d	ctdb-tests: Add run_tests.sh option to print logs on test failure Implement this for local daemons integration tests, dumping last 100 lines of logs. This makes it possible to debug some failures in automated tests where the logs are unavailable for analysis. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-22 19:39:40 +00:00
Martin Schwenke	5ae330e5a5	ctdb-tests: Avoid running valgrind under valgrind When run from integration tests $CTDB already includes $VALGRIND, if set. So only add $VALGRIND if $CTDB is not set. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-22 19:39:40 +00:00
Martin Schwenke	b8461b422d	ctdb-tests: Simplify tool unit test runner There is no good reason why the code needs to be this way. The intervening code was removed years ago leaving a more complex version of something very simple. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-22 19:39:40 +00:00
Martin Schwenke	0bddee8dac	ctdb-tests: Rename functions to test_header() and test_footer() That's all they do now. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Oct 4 10:58:10 UTC 2019 on sn-devel-184	2019-10-04 10:58:10 +00:00
Martin Schwenke	435d903ad8	ctdb-tests: Move test duration calculation to ctdb_test_run() It makes sense to do this in one place in case other headers/footers are added. Reindent ctdb_test_begin() while touching this function. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:29 +00:00
Martin Schwenke	23982477f3	ctdb-tests: Add handling for skipped tests Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	473a6fed11	ctdb-tests: Add a special failure code when a test error occurs Use it when a test is not executable. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	55dd0f047f	ctdb-tests: Move test status interpretation to ctdb_test_run() It makes sense to do this in one place in case other headers/footers are added. Simplify ctdb_test_end() accordingly, reindenting because nearly all lines are modified. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	47c9b79262	ctdb-tests: Move use of show_progress() into ctdb_test_run() This allows more variables to be set in this function because they are no longer in a sub-shell. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	e7e6f4067e	ctdb-tests: Simplify ctdb_test_run() Only the test file name is ever passed. Reindent while touching many existing lines. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	dc8ddbb084	ctdb-tests: Switch TEST_CLEANUP and TEST_TIMEOUT to script variables These are not used outside this script so they do not need to be environment variables. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	0ec83f32fa	ctdb-tests: Add new test functions for running commands on nodes * ctdb_onnode() * testprog_onnode() * function_onnode() These encapsulate familiar patterns found when running try_command_on_node(). The new function names are more concise and encourage more readable tests. Test writers can do less thinking about the subtleties of running different types of commands on nodes. For example, these functions ensure that $CTDB and $VALGRIND are used in the correct contexts. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	38b838b59c	ctdb-tests: try_command_on_node() should return status of command There is no point folding this down to 1. Tests should be able to see the original value, if required. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	e494eb3e8c	ctdb-tests: Drop unused function ctdb_test_check_real_cluster() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	38138b42f7	ctdb-tests: Update preamble for CLUSTER tests The main change is to source cluster.bash instead of integration.bash. While touching the preamble, the following additional changes are also made: * Drop test_info() definition and replace it with a comment The use of test_info() is pointless. * Drop call to ctdb_test_check_real_cluster() cluster.bash now does this. * Drop call to cluster_is_healthy() This is a holdover from when the previous test would restart daemons to get things ready for a test. There was also a bug where going into recovery during the restart would sometimes cause the cluster to become unhealthy. If we really need something like this then we can add it to ctdb_test_init(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	653b35764a	ctdb-tests: Add cluster.bash include file Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	5ad356c282	ctdb-tests: Add function ctdb_test_skip_on_cluster() Use it in relevant tests. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	59055f4da1	ctdb-tests: Add function ctdb_test_on_cluster() This centralises this logic. Use it in a subset of tests - there are other cases but these will be cleaned up soon. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:28 +00:00
Martin Schwenke	65ca431c95	ctdb-tests: Add functions for terminating tests on failure, skip, error This allows standard exit codes for failed and skipped tests, and test errors. Skipped tests currently just succeed and a test error is the same as a failure. These can be easily changed later when run_tests.sh is ready to handle them. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 09:41:27 +00:00
Martin Schwenke	2c54f6df71	ctdb-common: Mark VacuumLimit tunable as obsolete Use of this tunable was dropped over 5 years ago in commit `16837bc309`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Oct 4 07:07:21 UTC 2019 on sn-devel-184	2019-10-04 07:07:21 +00:00
Martin Schwenke	815ae64400	ctdb-vacuum: Drop debug level of repacking message to NOTICE This occurs rarely but can adversely impact performance, so it is worth logging it more frequently. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 05:47:35 +00:00
Martin Schwenke	a8c4e7d1f6	ctdb-protocol: Initialise request->rdata.opcode where missing Otherwise it is uninitialised, so... ==22889== Conditional jump or move depends on uninitialised value(s) ==22889== at 0x12257B: ctdb_req_control_data_len (protocol_control.c:39) ==22889== by 0x1228E9: ctdb_req_control_len (protocol_control.c:1786) ==22889== by 0x12A51C: ctdb_client_control_send (client_control.c:101) ==22889== by 0x138BE1: ctdb_tunnel_setup_send (client_tunnel.c:100) ==22889== by 0x10EE4F: tunnel_test_send (tunnel_test.c:135) ==22889== by 0x10EE4F: main (tunnel_test.c:463) and similar. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-10-04 05:47:35 +00:00
Amitay Isaacs	33f1c9d965	ctdb-vacuum: Process all records not deleted on a remote node This currently skips the last record. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14147 RN: Avoid potential data loss during recovery after vacuuming error Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2019-10-04 05:47:34 +00:00
Martin Schwenke	24f04c1cc5	ctdb-tests: Update README Bring this up to date. Drop descriptions of command-line options because these tend to bit-rot - refer to "run_tests.sh -h" instead. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Thu Sep 26 06:01:33 UTC 2019 on sn-devel-184	2019-09-26 06:01:33 +00:00
Martin Schwenke	8a5c4a60e1	ctdb-tests: Move simple tests to INTEGRATION/ subdirectory Split some tests out into database/ and failover/ subdirectories. Rename the remaining tests in simple/. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-09-26 04:45:37 +00:00
Martin Schwenke	658068184f	ctdb-tests: Move complex tests to CLUSTER/ subdirectory Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-09-26 04:45:37 +00:00
Martin Schwenke	df6800e330	ctdb-tests: Convert local daemons include file into top-level include Do the same with the alternative code for real clusters. Both of these can now be used by other test suites. Fix some basic shellcheck warnings (e.g. avoid word-splitting by quoting) while moving code and add the new files to the shellcheck test. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-09-26 04:45:37 +00:00
Martin Schwenke	384381fbff	ctdb-tests: Drop use of array in run_tests() This doesn't accomplish anything. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-09-26 04:45:37 +00:00
Martin Schwenke	c438e0db45	ctdb-tests: Drop custom handling for unit tests Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-09-26 04:45:37 +00:00

... 5 6 7 8 9 ...

9087 Commits