samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-23 17:34:34 +03:00

Author	SHA1	Message	Date
Amitay Isaacs	b886a95eca	ctdb-daemon: Switch to using ETIMEDOUT instead of ETIME BUG: https://bugzilla.samba.org/show_bug.cgi?id=13520 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2018-07-28 03:50:10 +02:00
Amitay Isaacs	db548f4852	ctdb-daemon: Add client code to talk to new event daemon This fixes the build and now new eventd is integrated completely in CTDB. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2018-07-05 06:52:44 +02:00
Martin Schwenke	7052f87903	ctdb-daemon: Drop unused function ctdb_set_notification_script() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2018-05-01 13:31:18 +02:00
Amitay Isaacs	d7a5cd589b	ctdb-daemon: Send STARTUP control after startup event BUG: https://bugzilla.samba.org/show_bug.cgi?id=13154 STARTUP control is primarily used to synchronise tcp tickles from running nodes to a node which has just started up. Earlier STARTUP control was sent (using BROADCAST_ALL) after setup event. Once the other nodes in the cluster connected to this node, the queued up messages would be sent and the tcp tickles would get synchronised. Recent fix to drop messages to disconnected or not-yet-connected nodes, the STARTUP control was never sent to the remote nodes and the tcp tickles did not get synchronised. To fix this problem send the STARTUP control (using BROADCAST_CONNECTED) after startup event. By this time all the running nodes in the cluster are connected. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Thu Nov 30 15:29:48 CET 2017 on sn-devel-144	2017-11-30 15:29:48 +01:00
Martin Schwenke	d0d805977f	Revert "ctdb-daemon: Remove unused function ctdb_stop_monitoring()" This reverts commit `b119104267`. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-19 01:28:13 +02:00
Martin Schwenke	b119104267	ctdb-daemon: Remove unused function ctdb_stop_monitoring() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:16 +02:00
Martin Schwenke	64225c63dd	ctdb-daemon: Drop monitoring mode Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:16 +02:00
Martin Schwenke	b00e360515	ctdb-daemon: Drop implementation of monitor controls Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:16 +02:00
Martin Schwenke	12cf6640e4	ctdb-daemon: Skip monitoring when not in RUNNING runstate Monitoring does not need to be done in other states. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:15 +02:00
Martin Schwenke	873db694c9	ctdb-daemon: Skip monitoring when node is inactive This is currently handled by explicitly disabling monitoring in various places. However, those places shouldn't need to know about monitoring but it is OK for monitoring to know about global node states. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-14 14:49:15 +02:00
Martin Schwenke	bff8d410f9	ctdb-daemon: Don't release all IPs before "startup" event This doesn't belong in the monitoring/startup code and it is already done in the 10.interface "init" event. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2017-09-12 12:23:19 +02:00
Amitay Isaacs	aaeef14ae5	ctdb-daemon: Remove setting of debug_extra Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2016-12-05 08:09:22 +01:00
Martin Schwenke	df2d6518e7	ctdb-daemon: Don't call ctdb_local_node_got_banned() on flag changes This function is currently called twice each time a node is banned. ctdb_local_node_got_banned() is already called from the banning code, either due to a received banning control or a node banning itself. Given that other nodes can't set a node's BANNED flag, a node can only be banned via the above mechanisms, so drop the redundant call. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-12-02 00:24:28 +01:00
Martin Schwenke	1be2cd9dd2	ctdb-daemon: Fix CID 1125575 Operands don't affect result This is related to an error, so repeatedly log at error level instead of trying to avoid repetition. BUG: https://bugzilla.samba.org/show_bug.cgi?id=12157 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-08-17 23:00:24 +02:00
Amitay Isaacs	0a759bc3ff	ctdb-daemon: Use consistent naming for monitoring mode Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Andrew Bartlett <abartlet@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net>	2016-07-28 05:00:17 +02:00
Martin Schwenke	4bef374e31	ctdb-daemon: Don't use CTDB_SRVID_TAKEOVER_RUN_RESPONSE Nobody registers a handler for this message type. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2016-05-06 11:39:09 +02:00
Martin Schwenke	56ce230de7	ctdb-recoverd: Fix some uninitialised memory issues The first element of these structures is a 32-bit PNN. On 64-bit systems this field can be followed by 32-bits of padding. When the structures are copied this can cause uninitialised memory to be copied. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Michael Adam <obnox@samba.org>	2016-01-12 19:16:17 +01:00
Christof Schmitt	03b27bd139	ctdb: Use prctl_set_comment from lib/util Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-11-18 04:05:13 +01:00
Martin Schwenke	9166c30a41	ctdb-daemon: Rename EventScriptTimeoutCount to MonitorTimeoutCount This only applies to monitor events so renaming clarifies this. Note that this change is not backward compatible. Users with CTDB_SET_EventScriptTimeoutCount=<n> in their configuration will get failures when starting CTDB but the cause will be clearly logged. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-11-16 08:42:11 +01:00
Martin Schwenke	55ad4d80d4	ctdb-daemon: Move script timeout count into monitor state It is only used by the monitoring code. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-11-16 08:42:11 +01:00
Martin Schwenke	0d5db1c007	ctdb-daemon: Reset script timeout count in monitor code This is the only place it is used. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-11-16 08:42:11 +01:00
Amitay Isaacs	f50db5cba5	ctdb-server: Replace ctdb_logging.h with common/logging.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Michael Adam <obnox@samba.org>	2015-11-16 00:46:15 +01:00
Mathieu Parent	c315fce17e	Fix various spelling errors Reviewed-by: Andrew Bartlett <abartlet@samba.org> Reviewed-by: Michael Adam <obnox@samba.org> Autobuild-User(master): Andrew Bartlett <abartlet@samba.org> Autobuild-Date(master): Fri Nov 6 13:43:45 CET 2015 on sn-devel-104	2015-11-06 13:43:45 +01:00
Amitay Isaacs	cf1ac77b3a	ctdb-daemon: Rename struct srvid_request to ctdb_srvid_message Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2015-11-04 00:47:15 +01:00
Amitay Isaacs	4647787773	ctdb-daemon: Separate prototypes for common client/server functions This groups function prototypes for common client/server functions in common/common.h and removes them from ctdb_private.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2015-10-30 02:00:27 +01:00
Amitay Isaacs	01c6c90e98	ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2015-10-30 02:00:27 +01:00
Amitay Isaacs	2fdb332fad	ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2015-10-30 02:00:27 +01:00
Amitay Isaacs	b900adc55c	ctdb-daemon: Separate prototypes for system specific functions This groups function prototypes for system specific functions in common/system.h and removes them from ctdb_private.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2015-10-30 02:00:27 +01:00
Amitay Isaacs	7afabb1285	ctdb-daemon: Avoid the use of ctdb->freeze_handle variable These variables are used for state information related to freezing databases. Instead use the API functions to check if the databases are frozen. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2015-10-07 14:53:26 +02:00
Martin Schwenke	a1d6b3fb4b	ctdb-daemon: Move release of all IPs to startup This means that DisableIPFailover will be set if it should be. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-05-13 06:42:13 +02:00
Martin Schwenke	e6f99fcba3	ctdb-daemon: Broadcast IP rellocation request from monitor code No need to just send it to the recovery master. This reduces the need for main daemon code to know which node is the recovery master. The end goal is for the main daemon to not need to know which node is the recovery master - this information would be stored in the recovery daemon (and subsequently a separate cluster management daemon). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-05-10 03:22:13 +02:00
Martin Schwenke	ae9cd037ee	ctdb-daemon: Pass on consistent flag information to recovery daemon Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2015-04-07 07:43:12 +02:00
Martin Schwenke	e6304d1e1a	ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com>	2014-01-17 17:59:41 +11:00
Amitay Isaacs	7aa20ccb5c	ctdb-daemon: No need to call event scripts with CTDB_CALLED_BY_USER This was added to support external monitoring using CTDB event scripts. However, it was never used. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net>	2014-01-16 11:41:12 +11:00
Martin Schwenke	4c3f8dc3bb	recoverd: Make the SRVID request structure generic No need for a separate one for each SRVID. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d9c22b04d5aa7938a3965bd3144568664eb772ce)	2013-09-19 12:54:30 +10:00
Martin Schwenke	a3bef911f3	ctdbd: Allow extra recovery to repair persistent DBs during first recovery Commit 8076773a9924dcf8aff16f7d96b2b9ac383ecc28 introduced a potential regression because a node may not have completed the "recovered" event (so might still be in CTDB_RUNSTATE_FIRST_RECOVERY) when another node becomes healthy. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 57ef5d3827ea3417a32703e259a53ce6fd10ac45)	2013-07-19 15:35:41 +10:00
Amitay Isaacs	1c21f37e57	ctdbd: Set process names for child processes This helps distinguish processes in process list in top, perf, etc. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 2493f57ce268d6fe7e4c40a87852c347fd60d29e)	2013-07-10 14:33:19 +10:00
Amitay Isaacs	c6914e3891	banning: Make ctdb_local_node_got_banned() a void function When this function is called, we are already committed to banning and there is no point in failing this function. In case, freezing of databases fails, it will be fixed from recovery daemon. (This used to be ctdb commit bb178338658b4ae32382a1f62f7c21cee1d4878f)	2013-07-02 12:59:08 +10:00
Martin Schwenke	44e885e98e	ctdbd: Fix panic on overlapping shutdowns The runstate can't be set to SHUTDOWN twice, so the current naive code causes a panic on the 2nd shutdown. This regression was introduced in commit 8076773a9924dcf8aff16f7d96b2b9ac383ecc28. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f1b7ca8dc3f34a59c7b3e55748f974ac9ed8f458)	2013-06-22 15:51:16 +10:00
Martin Schwenke	6a52a87028	ctdbd: Refactor shutdown sequence Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b32fd04bfbf33062d45365b37a7247e272a76ceb)	2013-06-22 15:51:02 +10:00
Martin Schwenke	6d9667f01c	ctdbd: Add new runstate CTDB_RUNSTATE_FIRST_RECOVERY This adds more serialisation to the startup, ensuring that the "startup" event runs after everything to do with the first recovery (including the "recovered" event). Given that it now takes longer to get to the "startup" state, the initscript needs to wait until ctdbd gets to "first_recovery". Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ed6814ff0a59ddbb1c1b3128b505380f60d7aeb7)	2013-05-24 14:08:07 +10:00
Martin Schwenke	63577c96db	ctdbd: Replace ctdb->done_startup with ctdb->runstate This allows states, including startup and shutdown states, to be clearly tracked. This doesn't include regular runtime "states", which are handled by node flags. Introduce new functions ctdb_set_runstate(), runstate_to_string() and runstate_from_string(). Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 8076773a9924dcf8aff16f7d96b2b9ac383ecc28)	2013-05-24 14:08:06 +10:00
Amitay Isaacs	4a6fa39ff9	daemon: Protect against double free of callback state while shutting down When CTDB is shut down and monitoring has been stopped, monitor_context gets freed and all the callback states hanging off it. This includes callback state for current_monitor, if the current monitor event has not yet finished. As a result, when the shutdown event is called, current_monitor->callback state is not NULL, but it's actually freed and it's a dangling reference. So before executing callback function and freeing callback state check if ctdb->monitor->monitor_context is not NULL. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7d8546ee4353851f0543d0ca2c4c67cb0cc75aea)	2013-01-09 14:39:23 +11:00
Amitay Isaacs	4392591555	Remove explicit include of lib/tevent/tevent.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 0681014ca5ed2a9b56f63fdace7f894beccf8a9a)	2012-04-13 17:28:14 +10:00
Ronnie Sahlberg	0581fd85e6	Eventscripts: Add special -ECANCELED status for monitor events that are cancelled When a monitor event is canceled by a higher priority script, make sure we return status -ECANCELED to the callback in ctdB_monitor.c Also treat -ECANCELED as a simple "try monitor event again" and skip modifying any HEALTHY/UNHEALTHY flags when this happens (This used to be ctdb commit a15ec57c26d1bc82af85f74eebae0bd8abde3233)	2011-11-18 12:22:22 +11:00
Ronnie Sahlberg	a9eba762d7	remove a non-error logmessage about persistent databases being healthy, as expected S1026492 (This used to be ctdb commit da9e02085523e27fa29e35c60034f6a8aaaa81e8)	2011-08-04 13:49:48 +10:00
Ronnie Sahlberg	629f4da55a	remove a log message we dont need about "allow clients to attach to databases" S1026492 (This used to be ctdb commit 42c3e4c5216000c370814441e38c7a8180047aaf)	2011-08-04 13:49:38 +10:00
Ronnie Sahlberg	ae35e9e5b2	Cleanup of logging messages/spamming Reduce an infomational message about not performing ip reallocation from NOTICE(the default) to INFO. These messages are normal during startup or when stopped/banned when we will be in recovery mode for a while. Remove a messager in the loop waiting for initial startup to complete about the generation being invalid. It is always invalid at this stage before we have finished initial recovery. Rate-limit the informational messages for CTDB_WAIT_UNTIL_RECOVERED so that we only print them once per second for the first 60 seconds and after that only once per 10 minutes. These messages are normal during startup, but we should not be logging them every second for cases where we will remain in recovery mode during startup for an extended period of time. Such as if suspended or permabanned. CQ S1023302 (This used to be ctdb commit 3a0af8780dc595acbed880f288fcbc4f62c862fb)	2011-05-04 10:42:32 +10:00
Ronnie Sahlberg	3cc230b5ee	Dont allow clients to connect to databases untile we are well past and through the initial recovery phase CQ S1022412 Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit e02bbd915b7151c615ff64f09ad9abc9720bef7d)	2011-03-14 13:35:53 +01:00
Ronnie Sahlberg	40bd94bd5e	If the node is stopped, put a log entry in /var/log/* to indicate this is why we never become ready (This used to be ctdb commit ef1de8211f83259ea37dcd57562139a3b63d9631)	2011-02-02 14:09:56 +11:00

1 2 3

112 Commits