samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-22 13:34:15 +03:00

Author	SHA1	Message	Date
Andreas Schneider	7749df4992	ctdb:server: Fix code spelling Best reviewed with: `git show --word-diff` Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com>	2023-03-24 07:01:31 +00:00
Andreas Schneider	59af504999	ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com>	2023-03-24 07:01:31 +00:00
Martin Schwenke	a76374070d	ctdb-daemon: Drop implementation of {GET,SET}_RECMASTER controls Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2022-01-17 10:21:33 +00:00
Volker Lendecke	06b740e2fb	ctdb: Fix a typo Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org>	2021-03-09 22:36:28 +00:00
Martin Schwenke	d98f68f918	ctdb-daemon: Drop implementation of old-style database pull/push controls Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Fri Sep 11 06:29:32 UTC 2020 on sn-devel-184	2020-09-11 06:29:32 +00:00
Martin Schwenke	e9f2e205ee	ctdb-daemon: Make node inactive in the NODE_STOP control Currently some of this is supported by a periodic check in the recovery daemon's main_loop(), which notices the flag change, sets recovery mode active and freezes databases. If STOP_NODE returns immediately then the associated recovery can complete and the node can be continued before databases are actually frozen. Instead, immediately do all of the things that make a node inactive. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14087 RN: Stop "ctdb stop" from completing before freezing databases Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Aug 20 08:32:27 UTC 2019 on sn-devel-184	2019-08-20 08:32:27 +00:00
Martin Schwenke	a42bcaabb6	ctdb-daemon: Factor out new function ctdb_node_become_inactive() This is a superset of ctdb_local_node_got_banned() so will replace that function, and will also be used in the NODE_STOP control. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14087 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-08-20 07:15:41 +00:00
Martin Schwenke	fa7bd35b6a	ctdb-recovery: Fix signed/unsigned comparisons by declaring as unsigned Simple cases where variables need to be declared as an unsigned type instead of an int. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2019-06-05 10:25:50 +00:00
Martin Schwenke	944c92a15d	ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144	2018-12-18 10:13:50 +01:00
Martin Schwenke	cdca0d7e78	ctdb-daemon Add extra debug during record deletion for vacuuming It isn't currently possible to distinguish these 2 cases. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2018-12-18 07:12:10 +01:00
Amitay Isaacs	d18385ea2a	ctdb-daemon: Drop implementation of RECEIVE_RECORDS control BUG: https://bugzilla.samba.org/show_bug.cgi?id=13641 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2018-10-08 02:46:21 +02:00
Amitay Isaacs	040401ca3a	ctdb-daemon: Don't pull any records if records are invalidated This avoids unnecessary work during recovery to pull records from nodes that were INACTIVE just before the recovery. BUG: https://bugzilla.samba.org/show_bug.cgi?id=13641 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2018-10-08 02:46:20 +02:00
Amitay Isaacs	b886a95eca	ctdb-daemon: Switch to using ETIMEDOUT instead of ETIME BUG: https://bugzilla.samba.org/show_bug.cgi?id=13520 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2018-07-28 03:50:10 +02:00
Martin Schwenke	4656b0816a	ctdb-daemon: Don't explicitly disable monitoring around recovery Monitoring can fail during recovery due to databases (e.g. registry) being unavailable. This has been avoided by explicitly disabling monitoring around recovery via the START_RECOVERY and END_RECOVERY controls. With this approach only there is still a window between enabling recovery mode and START_RECOVERY when monitoring could be attempted. However, explicitly disabling monitoring is unnecessary because monitoring is not done when a node is in recovery. So remove the explicit disable/enable of monitoring and rely on monitoring being skipped when recovery mode is active. The only possible change of behaviour with this change is that there is now a window between setting recovery mode to normal and the END_RECOVERY control where monitoring is enabled. However, at this point databases would be available and the "recovered" event will cancel any in-progress monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:15 +02:00
Martin Schwenke	173aa683d5	ctdb-daemon: Don't explicitly disable monitoring when stopping a node Monitoring is now avoided for inactive nodes anyway. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:15 +02:00
Amitay Isaacs	027689a2cf	ctdb-daemon: Increase priority of logs when recovery happens Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-07-04 13:11:16 +02:00
Amitay Isaacs	c6f2624287	ctdb-daemon: Increase priority of logs when node is stopped/continued Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-07-04 13:11:16 +02:00
Amitay Isaacs	1992404326	ctdb-daemon: Increase priority of logs for recmaster changes Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-07-04 13:11:16 +02:00
Amitay Isaacs	7c462b0df8	ctdb-daemon: Store db_flags instead of individual boolean flags Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-29 10:34:27 +02:00
Amitay Isaacs	4e43a344cc	ctdb-daemon: Add accessors for CTDB_DB_FLAGS_STICKY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-29 10:34:27 +02:00
Amitay Isaacs	d0fa710ea1	ctdb-daemon: Add accessors for CTDB_DB_FLAGS_READONLY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-29 10:34:26 +02:00
Amitay Isaacs	94af277c48	ctdb-daemon: Add accessors for CTDB_DB_FLAGS_PERSISTENT flag This allows to differentiate between the two database models. ctdb_db_persistent() - replicated and permanent ctdb_db_volatile() - distributed and temporary Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-29 10:34:26 +02:00
Amitay Isaacs	f8200153b2	ctdb-recovery: Finish processing for recovery mode ACTIVE first BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 This simplifies the code and avoids complicated conditions. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-24 10:28:21 +02:00
Amitay Isaacs	d74dadd7f2	ctdb-recovery: Simplify logging of recovery mode setting BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-24 10:28:21 +02:00
Amitay Isaacs	f2771fcbf4	ctdb-recovery: Setting up of recmode should be idempotent BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 If the recovery mode is already set to the expected value, there is nothing to do. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2017-06-24 10:28:21 +02:00
Martin Schwenke	c6a7f680ce	ctdb-daemon: Fix CID 1363067 Resource leak (RESOURCE_LEAK) Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-08-03 05:29:24 +02:00
Martin Schwenke	74aca5f4c6	ctdb-daemon: Fix CID 1363233 Resource leak (RESOURCE_LEAK) Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-08-03 05:29:24 +02:00
Amitay Isaacs	79b6b4b621	ctdb-daemon: Drop priorities from freeze/thaw code Parallel database recovery freezes databases in parallel and irrespective of database priority. So drop priority from freeze/thaw code. Database priority will be dropped completely soon. Now FREEZE and THAW controls operate on all the databases. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2016-07-25 21:29:42 +02:00
Amitay Isaacs	7c8c6ce74e	ctdb-daemon: Improve log message Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2016-07-05 10:53:14 +02:00
Amitay Isaacs	e6818c8e3c	ctdb-recoverd: Improve election win messages Logging that node has lost election is less useful than knowing which node has won the election. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2016-07-05 10:53:14 +02:00
Amitay Isaacs	c620bf5deb	ctdb-daemon: Reset push_started flag once DB_PUSH_CONFIRM is done Once DB_PUSH_START is processed as part of recovery, push_started flag tracks if there are multiple attempts to send DB_PUSH_START. In DB_PUSH_CONFIRM, once the record count is confirmed, all information related to DB_PUSH should be reset. However, the push_started flag was not reset when the push_state was reset. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Jun 8 14:31:52 CEST 2016 on sn-devel-144	2016-06-08 14:31:52 +02:00
Martin Schwenke	95a7920d22	ctdb-cluster-mutex: Register an extra handler for when mutex is lost Pass NULL if not needed. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:29 +02:00
Martin Schwenke	4f0ca0107c	ctdb-cluster-mutex: ctdb_cluster_mutex() registers handler and private data Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:29 +02:00
Martin Schwenke	145ddcbe37	ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:29 +02:00
Martin Schwenke	8cf74f335e	ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:29 +02:00
Martin Schwenke	5c4744e69d	ctdb-cluster-mutex: Pass a talloc context to allocate the handle off Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:28 +02:00
Martin Schwenke	fdd214ce6a	ctdb-daemon: Rename recovery lock file to just recovery lock It isn't necessarily a file. Don't bother changing the control, since it doesn't pervade the code. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:28 +02:00
Martin Schwenke	091d4d2dbb	ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-06-08 00:51:28 +02:00
Martin Schwenke	3e272e081f	ctdb-recover: Avoid duplicate deferred attach processing Deferred attach processing is done unconditionally at this point. It is then done again if recovery lock checking is done and completes successfully. If the recovery lock checking fails then it should not be done at all. Move this processing so it is done with the early exit when the recovery lock is not being used. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-05-06 11:39:09 +02:00
Martin Schwenke	bcb838ba1e	ctdb-recovery: Move recovery lock functions to recovery daemon code ctdb_recovery_have_lock(), ctdb_recovery_lock(), ctdb_recovery_unlock() are only used by recovery daemon, so move them there. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:17 +02:00
Martin Schwenke	df99d9e273	ctdb-cluster-mutex: Factor out cluster mutex code Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:17 +02:00
Martin Schwenke	ecc6751c6b	ctdb-recovery: Factor out setting of cluster mutex handler This means that the cluster mutex handle can now be treated as opaque. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:17 +02:00
Martin Schwenke	94fb2cf0ec	ctdb_recovery: ctdb_cluster_mutex() now takes an argstring argument All of the ctdb_cluster_mutex_* infrastructure can now handle an arbitrary mutex. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:17 +02:00
Martin Schwenke	46684867b1	ctdb-recovery: Recovery lock setting can now include helper command The underlying change is to allow the cluster mutex argstring to optionally contain a helper command. When the argument string starts with '!' then the first word is the helper command to run. This is now the standard way of changing the helper from the default. CTDB_CLUSTER_MUTEX_HELPER should now only be used to change the location of the default helper when testing. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:17 +02:00
Martin Schwenke	918b0d9a9c	ctdb-recovery: Parse recovery lock setting This is currently just treated as the name of a lock file. However, it is really some arbitrary arguments to lock helper. Therefore, it should be parsed and passed as separate arguments to the lock helper. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:17 +02:00
Martin Schwenke	64d557200e	ctdb-recovery: Reimplement ctdb_recovery_lock() using ctdb_cluster_mutex() Replace the file descriptor for the recovery lock in the CTDB context with the cluster mutex handle, where non-NULL means locked. Attempting to take the recovery lock is now asynchronous and no longer blocks the recovery daemon. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:16 +02:00
Martin Schwenke	0b0b954ff2	ctdb-recovery: Kill cluster mutex helper with a signal that can be caught Unlike fcntl(2), some other helper might need to explicitly take action to release a mutex. This can be done by catching SIGTERM. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:16 +02:00
Martin Schwenke	e679a1731c	ctdb-recovery: Switch ctdb_cluster_mutex() to use helper Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:16 +02:00
Martin Schwenke	978404ecde	ctdb-recovery: Add optional timeout argument to ctdb_cluster_mutex() Timeout in seconds, 0 means no timeout. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:16 +02:00
Martin Schwenke	43e9f58d6a	ctdb-recovery: Factor out reclock testing into ctdb_cluster_mutex() This is currently only used to check whether the recovery lock can be taken. However, name it more generally in anticipation of using it for general cluster mutex taking and testing. No functional changes. A couple of debug message simplifications and code rearrangements. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-04-28 09:39:16 +02:00
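
Commits 918b0d9a9c and 46684867b1 above describe how the recovery lock setting is interpreted: it is parsed into separate arguments for a lock helper, and when the string starts with '!' the first word names the helper command to run. The following is a minimal illustrative sketch of that rule only, not the CTDB implementation (which is written in C and is not shown on this page). The function name, the placeholder default helper name, the example paths, and the choice to split on whitespace are all assumptions made for illustration.

```python
# Hypothetical placeholder; the real default helper name is not given here.
DEFAULT_HELPER = "default-mutex-helper"


def parse_cluster_mutex_argstring(argstring, default_helper=DEFAULT_HELPER):
    """Split a recovery lock setting into (helper, args).

    Assumption: words are separated by whitespace. If the string starts
    with '!', the first word after it is the helper command and the
    remaining words are its arguments. Otherwise the default helper is
    used and every word is passed to it as an argument.
    """
    if argstring.startswith("!"):
        words = argstring[1:].split()
        if not words:
            raise ValueError("no helper command after '!'")
        return words[0], words[1:]
    return default_helper, argstring.split()


if __name__ == "__main__":
    # Plain setting: treated as arguments to the default helper.
    print(parse_cluster_mutex_argstring("/clusterfs/.ctdb/reclock"))
    # Setting with an explicit helper command (paths are made up).
    print(parse_cluster_mutex_argstring("!/usr/local/bin/my_helper arg1 arg2"))
```

Returning the helper and its arguments as separate values mirrors the intent stated in 918b0d9a9c, where the setting is treated as arbitrary arguments to a lock helper rather than as a single file name.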

228 Commits