samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-23 17:34:34 +03:00

533 lines

15 KiB

C

Raw Normal View History

add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`/*`
			`monitoring links to all other nodes to detect dead nodes`


			`Copyright (C) Ronnie Sahlberg 2007`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`

			`This program is distributed in the hope that it will be useful,`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`

			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`*/`

			`#include "includes.h"`
			`#include "system/filesys.h"`
			`#include "system/wait.h"`
			`#include "../include/ctdb_private.h"`

exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`struct ctdb_monitor_state {`
			`uint32_t monitoring_mode;`
			`TALLOC_CTX *monitor_context;`
			`uint32_t next_interval;`
			`};`

added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`static void ctdb_check_health(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data);`

add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`/*`
			`setup the notification script`
			`*/`
			`int ctdb_set_notification_script(struct ctdb_context ctdb, const char script)`
			`{`
			`ctdb->notification_script = talloc_strdup(ctdb, script);`
			`CTDB_NO_MEMORY(ctdb, ctdb->notification_script);`
			`return 0;`
			`}`

			`static int ctdb_run_notification_script_child(struct ctdb_context ctdb, const char event)`
			`{`
			`struct stat st;`
			`int ret;`
			`char *cmd;`

			`if (stat(ctdb->notification_script, &st) != 0) {`
			`DEBUG(DEBUG_ERR,("Could not stat notification script %s. Can not send notifications.\n", ctdb->notification_script));`
			`return -1;`
			`}`
			`if (!(st.st_mode & S_IXUSR)) {`
			`DEBUG(DEBUG_ERR,("Notification script %s is not executable.\n", ctdb->notification_script));`
			`return -1;`
			`}`

			`cmd = talloc_asprintf(ctdb, "%s %s\n", ctdb->notification_script, event);`
			`CTDB_NO_MEMORY(ctdb, cmd);`

			`ret = system(cmd);`
			`/* if the system() call was successful, translate ret into the`
			`return code from the command`
			`*/`
			`if (ret != -1) {`
			`ret = WEXITSTATUS(ret);`
			`}`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,("Notification script \"%s\" failed with error %d\n", cmd, ret));`
			`}`

			`return ret;`
			`}`

server: add "init" event This is needed because the "startup" event runs after the initial recovery, but we need to do some actions before the initial recovery. metze (This used to be ctdb commit e953808449c102258abb6cba6f4abf486dda3b82) 2010-01-19 12:07:14 +03:00			`void ctdb_run_notification_script(struct ctdb_context ctdb, const char event)`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`{`
			`pid_t child;`

			`if (ctdb->notification_script == NULL) {`
			`return;`
			`}`

Add ctdb_fork(0 which will fork a child process and drop the real-time scheduler for the child. Use ctdb_fork() from callers where we dont want the child to be running at real-time privilege. (This used to be ctdb commit 58795a4c9e0624e20fa3e0023b65127053edd103) 2011-01-10 05:57:49 +03:00			`child = ctdb_fork(ctdb);`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`if (child == (pid_t)-1) {`
			`DEBUG(DEBUG_ERR,("Failed to fork() a notification child process\n"));`
			`return;`
			`}`
			`if (child == 0) {`
			`int ret;`

ctdbd: Set process names for child processes This helps distinguish processes in process list in top, perf, etc. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 2493f57ce268d6fe7e4c40a87852c347fd60d29e) 2013-07-09 06:32:53 +04:00			`ctdb_set_process_name("ctdb_notification");`
logging: give a unique logging name to each forked child. This means we can distinguish which child is logging, esp. via syslog where we have no pid. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 68b3761a0874429b90731741f0531f76dcfbb081) 2010-07-19 13:59:09 +04:00			`debug_extra = talloc_asprintf(NULL, "notification-%s:", event);`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`ret = ctdb_run_notification_script_child(ctdb, event);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Notification script failed\n"));`
			`}`
			`_exit(0);`
			`}`

			`return;`
			`}`

added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`/*`
			`called when a health monitoring event script finishes`
			`*/`
			`static void ctdb_health_callback(struct ctdb_context ctdb, int status, void p)`
			`{`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`struct ctdb_node *node = ctdb->nodes[ctdb->pnn];`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`TDB_DATA data;`
			`struct ctdb_node_flag_change c;`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`uint32_t next_interval;`
when we change state between healthy/unhealthy, make sure we ask the recovery master to perform an explicit ip reallocation. This is more reliable and faster than having the recovery dameon track these changes, and since we now have an explicit method to ask the recovery daemon to perform an explicit ip reallocation, we should use this. (This used to be ctdb commit 3807681e74f4bfe92befdae6ed616ff5f1a99880) 2009-10-14 04:59:16 +04:00			`int ret;`
			`TDB_DATA rddata;`
recoverd: Make the SRVID request structure generic No need for a separate one for each SRVID. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d9c22b04d5aa7938a3965bd3144568664eb772ce) 2013-08-16 14:10:10 +04:00			`struct srvid_request rd;`
server/monitor: ask for a takeoverrun after propagating our new flags metze (This used to be ctdb commit 942f44123350d4d0c4ad7f3fcd5ff2d0d175739b) 2010-08-24 11:22:49 +04:00			`const char *state_str = NULL;`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
change ctdb_node_flags_change.vnn to ctdb_node_flags_changed.pnn change ctdb_ban_info.vnn to ctdb_ban_info.pnn (This used to be ctdb commit fcedd40e0493948829e1c921d4fe30e9196e398a) 2007-09-04 04:33:10 +04:00			`c.pnn = ctdb->pnn;`
change the structure used for node flag change messages so that we can see both the old flags as well as the new flags (so we can tell which flags changed) send the CTDB_SRVID_RECONFIGURE messages to connected nodes only, not to every node, connected or not, in the cluster. in the handler inside the recovery daemon which is invoked for node flag change messages, only do a takeover_run() and redistribute the ip addresses IF it was the disabled or the unhealthy flags that changed. Also send out the cluster reconfigured message in this case. If any of the other flags changed we dont need to do the takeover_run(0 here since that will be done during recovery. (This used to be ctdb commit 5549b2058e2c148a8ca9d419123acf3247bb8829) 2007-08-21 11:25:15 +04:00			`c.old_flags = node->flags;`

when we change state between healthy/unhealthy, make sure we ask the recovery master to perform an explicit ip reallocation. This is more reliable and faster than having the recovery dameon track these changes, and since we now have an explicit method to ask the recovery daemon to perform an explicit ip reallocation, we should use this. (This used to be ctdb commit 3807681e74f4bfe92befdae6ed616ff5f1a99880) 2009-10-14 04:59:16 +04:00			`rd.pnn = ctdb->pnn;`
			`rd.srvid = CTDB_SRVID_TAKEOVER_RUN_RESPONSE;`

			`rddata.dptr = (uint8_t *)&rd;`
			`rddata.dsize = sizeof(rd);`

Eventscripts: Add special -ECANCELED status for monitor events that are cancelled When a monitor event is canceled by a higher priority script, make sure we return status -ECANCELED to the callback in ctdB_monitor.c Also treat -ECANCELED as a simple "try monitor event again" and skip modifying any HEALTHY/UNHEALTHY flags when this happens (This used to be ctdb commit a15ec57c26d1bc82af85f74eebae0bd8abde3233) 2011-11-17 06:34:29 +04:00			`if (status == -ECANCELED) {`
			`DEBUG(DEBUG_ERR,("Monitoring event was cancelled\n"));`
			`goto after_change_status;`
			`}`

eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`if (status == -ETIME) {`
			`ctdb->event_script_timeouts++;`

Rename the tunable EventScriptBanCount to EventScriptTimeoutCount since we no longer ban nodes when dodgy scripts continue to hang. We now only mark nodes as unhealthy if monitor events fail or timeout. Never ban. (This used to be ctdb commit 5c8e56fc7a518e115bceac257867739283cf6a1e) 2009-12-14 07:53:23 +03:00			`if (ctdb->event_script_timeouts >= ctdb->tunable.script_timeout_count) {`
			`DEBUG(DEBUG_ERR, ("Maximum timeout count %u reached for eventscript. Making node unhealthy\n", ctdb->tunable.script_timeout_count));`
eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`} else {`
			`/* We pretend this is OK. */`
eventscript: don't make ourselves healthy if we're under ban_count If we've timed out, but we've not timed out more than ctdb->tunable.script_ban_count, we pretend we haven't. There's a logic bug in the way this is done: if we were unhealthy before, this would set us to "healthy" again (status == 0). I don't think this would happen in real life, but it's a little surprising. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e6488c0e05bab5c4c2c0a6370930b0b27e5ed56e) 2009-12-07 16:22:01 +03:00			`goto after_change_status;`
eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`}`
			`}`

merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`if (status != 0 && !(node->flags & NODE_FLAGS_UNHEALTHY)) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("monitor event failed - disabling node\n"));`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`node->flags \|= NODE_FLAGS_UNHEALTHY;`
dont run the monitor event so frequently after a event has failed. use _exit() instead of exit() when terminating an eventscript. (This used to be ctdb commit cc30ee2f4f33cb75b2be980c2d4dff6c7c23852f) 2009-10-27 05:51:45 +03:00			`ctdb->monitor->next_interval = 5;`
add a new tunable DisableWhenUnhealthy which when set will cause a node to automatically become DISABLED anytime monitoring fails and the node becomes UNHEALTHY. Use with caution. (This used to be ctdb commit c20293360db67f9876b0c84e5e9e12a5868964cb) 2008-02-22 02:33:09 +03:00
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`ctdb_run_notification_script(ctdb, "unhealthy");`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`} else if (status == 0 && (node->flags & NODE_FLAGS_UNHEALTHY)) {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_NOTICE,("monitor event OK - node re-enabled\n"));`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`node->flags &= ~NODE_FLAGS_UNHEALTHY;`
dont run the monitor event so frequently after a event has failed. use _exit() instead of exit() when terminating an eventscript. (This used to be ctdb commit cc30ee2f4f33cb75b2be980c2d4dff6c7c23852f) 2009-10-27 05:51:45 +03:00			`ctdb->monitor->next_interval = 5;`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00
			`ctdb_run_notification_script(ctdb, "healthy");`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`}`

eventscript: don't make ourselves healthy if we're under ban_count If we've timed out, but we've not timed out more than ctdb->tunable.script_ban_count, we pretend we haven't. There's a logic bug in the way this is done: if we were unhealthy before, this would set us to "healthy" again (status == 0). I don't think this would happen in real life, but it's a little surprising. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e6488c0e05bab5c4c2c0a6370930b0b27e5ed56e) 2009-12-07 16:22:01 +03:00			`after_change_status:`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`next_interval = ctdb->monitor->next_interval;`

			`ctdb->monitor->next_interval *= 2;`
			`if (ctdb->monitor->next_interval > ctdb->tunable.monitor_interval) {`
			`ctdb->monitor->next_interval = ctdb->tunable.monitor_interval;`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`}`

exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
->monitor_context is NULL when monitoring is disabled. Check whether monitoring is enabled or not before creating new events and log why the event is not set up othervise (This used to be ctdb commit 2f352b2606c04a65ce461fc2e99e6d6251ac4f20) 2007-11-30 01:02:37 +03:00			`timeval_current_ofs(next_interval, 0),`
			`ctdb_check_health, ctdb);`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00
			`if (c.old_flags == node->flags) {`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`return;`
			`}`

change the structure used for node flag change messages so that we can see both the old flags as well as the new flags (so we can tell which flags changed) send the CTDB_SRVID_RECONFIGURE messages to connected nodes only, not to every node, connected or not, in the cluster. in the handler inside the recovery daemon which is invoked for node flag change messages, only do a takeover_run() and redistribute the ip addresses IF it was the disabled or the unhealthy flags that changed. Also send out the cluster reconfigured message in this case. If any of the other flags changed we dont need to do the takeover_run(0 here since that will be done during recovery. (This used to be ctdb commit 5549b2058e2c148a8ca9d419123acf3247bb8829) 2007-08-21 11:25:15 +04:00			`c.new_flags = node->flags;`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
			`data.dptr = (uint8_t *)&c;`
			`data.dsize = sizeof(c);`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`/* ask the recovery daemon to push these changes out to all nodes */`
			`ctdb_daemon_send_message(ctdb, ctdb->pnn,`
			`CTDB_SRVID_PUSH_NODE_FLAGS, data);`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
server/monitor: ask for a takeoverrun after propagating our new flags metze (This used to be ctdb commit 942f44123350d4d0c4ad7f3fcd5ff2d0d175739b) 2010-08-24 11:22:49 +04:00			`if (c.new_flags & NODE_FLAGS_UNHEALTHY) {`
			`state_str = "UNHEALTHY";`
			`} else {`
			`state_str = "HEALTHY";`
			`}`

			`/* ask the recmaster to reallocate all addresses */`
ctdb-daemon: Broadcast IP rellocation request from monitor code No need to just send it to the recovery master. This reduces the need for main daemon code to know which node is the recovery master. The end goal is for the main daemon to not need to know which node is the recovery master - this information would be stored in the recovery daemon (and subsequently a separate cluster management daemon). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-03-30 12:51:51 +03:00			`DEBUG(DEBUG_ERR,`
			`("Node became %s. Ask recovery master to reallocate IPs\n",`
			`state_str));`
			`ret = ctdb_daemon_send_message(ctdb, CTDB_BROADCAST_CONNECTED, CTDB_SRVID_TAKEOVER_RUN, rddata);`
server/monitor: ask for a takeoverrun after propagating our new flags metze (This used to be ctdb commit 942f44123350d4d0c4ad7f3fcd5ff2d0d175739b) 2010-08-24 11:22:49 +04:00			`if (ret != 0) {`
ctdb-daemon: Broadcast IP rellocation request from monitor code No need to just send it to the recovery master. This reduces the need for main daemon code to know which node is the recovery master. The end goal is for the main daemon to not need to know which node is the recovery master - this information would be stored in the recovery daemon (and subsequently a separate cluster management daemon). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-03-30 12:51:51 +03:00			`DEBUG(DEBUG_ERR,`
			`(__location__`
			`" Failed to send IP takeover run request\n"));`
server/monitor: ask for a takeoverrun after propagating our new flags metze (This used to be ctdb commit 942f44123350d4d0c4ad7f3fcd5ff2d0d175739b) 2010-08-24 11:22:49 +04:00			`}`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`}`


ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`static void ctdb_run_startup(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data);`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`/*`
			`called when the startup event script finishes`
			`*/`
			`static void ctdb_startup_callback(struct ctdb_context ctdb, int status, void p)`
			`{`
			`if (status != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("startup event failed\n"));`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(5, 0),`
			`ctdb_run_startup, ctdb);`
			`return;`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`}`

ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`DEBUG(DEBUG_NOTICE,("startup event OK - enabling monitoring\n"));`
			`ctdb_set_runstate(ctdb, CTDB_RUNSTATE_RUNNING);`
			`ctdb->monitor->next_interval = 2;`
			`ctdb_run_notification_script(ctdb, "startup");`

			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_ACTIVE;`

			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`timeval_current_ofs(ctdb->monitor->next_interval, 0),`
			`ctdb_check_health, ctdb);`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`}`

ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`static void ctdb_run_startup(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data,`
			`struct ctdb_context);`
			`int ret;`

			`/* This is necessary to avoid the "startup" event colliding`
			`* with the "ipreallocated" event from the takeover run`
			`* following the first recovery. We might as well serialise`
			`* these things if we can.`
			`*/`
			`if (ctdb->runstate < CTDB_RUNSTATE_STARTUP) {`
			`DEBUG(DEBUG_NOTICE,`
			`("Not yet in startup runstate. Wait one more second\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_run_startup, ctdb);`
			`return;`
			`}`

			`DEBUG(DEBUG_NOTICE,("Running the \"startup\" event.\n"));`
			`ret = ctdb_event_script_callback(ctdb,`
			`ctdb->monitor->monitor_context,`
			`ctdb_startup_callback,`
			`ctdb, CTDB_EVENT_STARTUP, "%s", "");`

			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,("Unable to launch startup event script\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(5, 0),`
			`ctdb_run_startup, ctdb);`
			`}`
			`}`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`/*`
			`wait until we have finished initial recoveries before we start the`
			`monitoring events`
			`*/`
			`static void ctdb_wait_until_recovered(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`int ret;`
Cleanup of logging messages/spamming Reduce an infomational message about not performing ip reallocation from NOTICE(the default) to INFO. These messages are normal during startup or when stopped/banned when we will be in recovery mode for a while. Remove a messager in the loop waiting for initial startup to complete about the generation being invalid. It is always invalid at this stage before we have finished initial recovery. Rate-limit the informational messages for CTDB_WAIT_UNTIL_RECOVERED so that we only print them once per second for the first 60 seconds and after that only once per 10 minutes. These messages are normal during startup, but we should not be logging them every second for cases where we will remain in recovery mode during startup for an extended period of time. Such as if suspended or permabanned. CQ S1023302 (This used to be ctdb commit 3a0af8780dc595acbed880f288fcbc4f62c862fb) 2011-05-04 02:54:02 +04:00			`static int count = 0;`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00
Cleanup of logging messages/spamming Reduce an infomational message about not performing ip reallocation from NOTICE(the default) to INFO. These messages are normal during startup or when stopped/banned when we will be in recovery mode for a while. Remove a messager in the loop waiting for initial startup to complete about the generation being invalid. It is always invalid at this stage before we have finished initial recovery. Rate-limit the informational messages for CTDB_WAIT_UNTIL_RECOVERED so that we only print them once per second for the first 60 seconds and after that only once per 10 minutes. These messages are normal during startup, but we should not be logging them every second for cases where we will remain in recovery mode during startup for an extended period of time. Such as if suspended or permabanned. CQ S1023302 (This used to be ctdb commit 3a0af8780dc595acbed880f288fcbc4f62c862fb) 2011-05-04 02:54:02 +04:00			`count++;`

			`if (count < 60 \|\| count%600 == 0) {`
			`DEBUG(DEBUG_NOTICE,("CTDB_WAIT_UNTIL_RECOVERED\n"));`
			`if (ctdb->nodes[ctdb->pnn]->flags & NODE_FLAGS_STOPPED) {`
			`DEBUG(DEBUG_NOTICE,("Node is STOPPED. Node will NOT recover.\n"));`
			`}`
If the node is stopped, put a log entry in /var/log/* to indicate this is why we never become ready (This used to be ctdb commit ef1de8211f83259ea37dcd57562139a3b63d9631) 2011-01-31 09:40:26 +03:00			`}`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00
			`if (ctdb->vnn_map->generation == INVALID_GENERATION) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`ctdb->db_persistent_startup_generation = INVALID_GENERATION;`

When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`

			`if (ctdb->recovery_mode != CTDB_RECOVERY_NORMAL) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`ctdb->db_persistent_startup_generation = INVALID_GENERATION;`

When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`DEBUG(DEBUG_NOTICE,(__location__ " in recovery. Wait one more second\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`


speed startup: add --sloppy-start. The extra recovery interval wait was introduced in 821333afb458 but no explanation was provided in that message. Nonetheless, if starting the entire cluster for the first time, it should be safe to skip this. We use the commandline arg --sloppy-start which should discourage people from using it outside testing. Seconds between ctdbd first log message and node healthy: BEFORE: 16.10 AFTER: 4.03 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 509e2e89ae233a0e91998d95267bf62f296a73cd) 2010-06-22 17:22:34 +04:00			`if (!fast_start && timeval_elapsed(&ctdb->last_recovery_finished) < (ctdb->tunable.rerecovery_timeout + 3)) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`ctdb->db_persistent_startup_generation = INVALID_GENERATION;`

When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`DEBUG(DEBUG_NOTICE,(__location__ " wait for pending recoveries to end. Wait one more second.\n"));`

			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`

server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`if (ctdb->vnn_map->generation == ctdb->db_persistent_startup_generation) {`
			`DEBUG(DEBUG_INFO,(__location__ " skip ctdb_recheck_persistent_health() "`
			`"until the next recovery\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`

			`ctdb->db_persistent_startup_generation = ctdb->vnn_map->generation;`
			`ret = ctdb_recheck_persistent_health(ctdb);`
			`if (ret != 0) {`
			`ctdb->db_persistent_check_errors++;`
			`if (ctdb->db_persistent_check_errors < ctdb->max_persistent_check_errors) {`
			`DEBUG(ctdb->db_persistent_check_errors==1?DEBUG_ERR:DEBUG_WARNING,`
			`(__location__ "ctdb_recheck_persistent_health() "`
			`"failed (%llu of %llu times) - retry later\n",`
			`(unsigned long long)ctdb->db_persistent_check_errors,`
			`(unsigned long long)ctdb->max_persistent_check_errors));`
			`event_add_timed(ctdb->ev,`
			`ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`
			`DEBUG(DEBUG_ALERT,(__location__`
			`"ctdb_recheck_persistent_health() failed (%llu times) - prepare shutdown\n",`
			`(unsigned long long)ctdb->db_persistent_check_errors));`
ctdbd: Refactor shutdown sequence Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b32fd04bfbf33062d45365b37a7247e272a76ceb) 2013-06-19 04:58:14 +04:00			`ctdb_shutdown_sequence(ctdb, 11);`
ctdbd: Fix panic on overlapping shutdowns The runstate can't be set to SHUTDOWN twice, so the current naive code causes a panic on the 2nd shutdown. This regression was introduced in commit 8076773a9924dcf8aff16f7d96b2b9ac383ecc28. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f1b7ca8dc3f34a59c7b3e55748f974ac9ed8f458) 2013-06-22 09:44:28 +04:00			`/* In case above returns due to duplicate shutdown */`
			`return;`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`}`
			`ctdb->db_persistent_check_errors = 0;`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`timeval_current(), ctdb_run_startup, ctdb);`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`}`


added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`/*`
			`see if the event scripts think we are healthy`
			`*/`
			`static void ctdb_check_health(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`bool skip_monitoring = false;`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`int ret = 0;`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
don't do the first startup event until we are out of recovery (This used to be ctdb commit 689940eb6e23f16ee063331caf3986613a8963ea) 2007-11-12 05:10:15 +03:00			`if (ctdb->recovery_mode != CTDB_RECOVERY_NORMAL \|\|`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`ctdb->monitor->monitoring_mode == CTDB_MONITORING_DISABLED) {`
			`skip_monitoring = true;`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`} else {`
Dont run eventscript monitor when the databases are frozen. The databases can become frozen a while before we do the actual recovery since we have the re-recovery timeout. There is no point in doing much monitoring if we are waiting for a recovery, or if we are banned. This will eliminate some annoying log entries where certain tests will fail if the databases are locked. (This used to be ctdb commit ff824676fab94168707aada7423ae766bc0f711c) 2009-10-15 09:03:43 +04:00			`int i;`
			`for (i=1; i<=NUM_DB_PRIORITIES; i++) {`
if a lock wait child died/finished, we could have released the lockwait handle and set it to NULL before we call the destructors for releaseing the waiters. The waiters reference the locakwait handle in order to remove itself from the li nked list which caused a SEGV. We dont actually need to remove ourselves from this list here since if the parent freeze_handle holding the list is freed, then all waiters are rele ased as well, and the only place we actually need to relink the waiter is in ctd b_freeze_lock_handler, where we want to respond back to the clients and release the waiters but we still want to keep the freeze_handle hanging around. (This used to be ctdb commit e01ab46bafad09a5e320d420734db129d35863bc) 2009-10-22 06:41:28 +04:00			`if (ctdb->freeze_handles[i] != NULL) {`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`DEBUG(DEBUG_ERR,`
			`("Skip monitoring since databases are frozen\n"));`
			`skip_monitoring = true;`
Dont run eventscript monitor when the databases are frozen. The databases can become frozen a while before we do the actual recovery since we have the re-recovery timeout. There is no point in doing much monitoring if we are waiting for a recovery, or if we are banned. This will eliminate some annoying log entries where certain tests will fail if the databases are locked. (This used to be ctdb commit ff824676fab94168707aada7423ae766bc0f711c) 2009-10-15 09:03:43 +04:00			`break;`
			`}`
			`}`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`}`

ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`if (skip_monitoring) {`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(ctdb->monitor->next_interval, 0),`
			`ctdb_check_health, ctdb);`
			`return;`
			`}`

			`ret = ctdb_event_script_callback(ctdb,`
			`ctdb->monitor->monitor_context,`
			`ctdb_health_callback,`
			`ctdb, CTDB_EVENT_MONITOR, "%s", "");`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Unable to launch monitor event script\n"));`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(5, 0),`
			`ctdb_check_health, ctdb);`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`}`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`}`

add ctdb_disable/enable_monitoring() that only modifies the monitoring flag. change calling of the recovered/takeip/releaseip event scripts to use these enable/disable functions instead of stopping/starting monitoring. when we disable monitoring we want all events to still be running in particular the events to monitor for dead nodes and we only want to supress running the monitor event scripts (This used to be ctdb commit a006dcc4f75aba950dd701ad7d1a84e89df285e8) 2007-11-30 02:09:54 +03:00			`/*`
			`(Temporaily) Disabling monitoring will stop the monitor event scripts`
			`from running but node health checks will still occur`
			`*/`
			`void ctdb_disable_monitoring(struct ctdb_context *ctdb)`
			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_DISABLED;`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,("Monitoring has been disabled\n"));`
add ctdb_disable/enable_monitoring() that only modifies the monitoring flag. change calling of the recovered/takeip/releaseip event scripts to use these enable/disable functions instead of stopping/starting monitoring. when we disable monitoring we want all events to still be running in particular the events to monitor for dead nodes and we only want to supress running the monitor event scripts (This used to be ctdb commit a006dcc4f75aba950dd701ad7d1a84e89df285e8) 2007-11-30 02:09:54 +03:00			`}`

			`/*`
			`Re-enable running monitor events after they have been disabled`
			`*/`
			`void ctdb_enable_monitoring(struct ctdb_context *ctdb)`
			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_ACTIVE;`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,("Monitoring has been enabled\n"));`
add ctdb_disable/enable_monitoring() that only modifies the monitoring flag. change calling of the recovered/takeip/releaseip event scripts to use these enable/disable functions instead of stopping/starting monitoring. when we disable monitoring we want all events to still be running in particular the events to monitor for dead nodes and we only want to supress running the monitor event scripts (This used to be ctdb commit a006dcc4f75aba950dd701ad7d1a84e89df285e8) 2007-11-30 02:09:54 +03:00			`}`

			`/* stop any monitoring`
			`this should only be done when shutting down the daemon`
			`*/`
added timeouts in all event scripts (This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1) 2007-06-06 07:45:12 +04:00			`void ctdb_stop_monitoring(struct ctdb_context *ctdb)`
			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`talloc_free(ctdb->monitor->monitor_context);`
			`ctdb->monitor->monitor_context = NULL;`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_DISABLED;`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("Monitoring has been stopped\n"));`
added timeouts in all event scripts (This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1) 2007-06-06 07:45:12 +04:00			`}`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
- up rx_cnt on all packet types - notice when a node becomes available again (This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a) 2007-05-18 17:23:36 +04:00			`/*`
			`start watching for nodes that might be dead`
			`*/`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`void ctdb_wait_for_first_recovery(struct ctdb_context *ctdb)`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`{`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`ctdb_set_runstate(ctdb, CTDB_RUNSTATE_FIRST_RECOVERY);`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor = talloc(ctdb, struct ctdb_monitor_state);`
			`CTDB_NO_MEMORY_FATAL(ctdb, ctdb->monitor);`

			`ctdb->monitor->monitor_context = talloc_new(ctdb->monitor);`
			`CTDB_NO_MEMORY_FATAL(ctdb, ctdb->monitor->monitor_context);`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
ctdb/daemon: Untangle serialisation of 1st recovery -> startup -> monitor At the moment ctdb_check_healthy() is overloaded to wait until the first recovery is complete, handle the "startup" event and also actually handle monitoring. This is untidy and hard to follow. Instead, have the daemon explicitly wait for 1st recovery after the "setup" event. When first recovery is complete, schedule a function to handle the "startup" event. When the "startup" event succeeds then explicitly enable monitoring. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2013-12-18 08:37:11 +04:00			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00

			`/*`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00			`modify flags on a node`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`*/`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00			`int32_t ctdb_control_modflags(struct ctdb_context *ctdb, TDB_DATA indata)`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`{`
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`struct ctdb_node_flag_change c = (struct ctdb_node_flag_change )indata.dptr;`
			`struct ctdb_node *node;`
dont let other nodes modify the STOPPED flag for the local process when pushing out flags changes (This used to be ctdb commit 501a2747d839ca291b70c761098549cf6d47a158) 2009-07-09 07:20:14 +04:00			`uint32_t old_flags;`
initial attempt at freezing databases in priority order (This used to be ctdb commit e8d692590da1070c87a4144031e3306d190ebed2) 2009-10-12 05:08:39 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`if (c->pnn >= ctdb->num_nodes) {`
			`DEBUG(DEBUG_ERR,(__location__ " Node %d is invalid, num_nodes :%d\n", c->pnn, ctdb->num_nodes));`
			`return -1;`
			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`node = ctdb->nodes[c->pnn];`
dont let other nodes modify the STOPPED flag for the local process when pushing out flags changes (This used to be ctdb commit 501a2747d839ca291b70c761098549cf6d47a158) 2009-07-09 07:20:14 +04:00			`old_flags = node->flags;`
server: if takeover runs when the recovery master becomes unhealthy The problem was this: When the monitor event fails, the node->flags get updated, and an update (containing the old and new flags) is sent to the recovery master. If the recovery master sends the update to itself (the same process), it was compairing the node->flags variable with the received new flags. This check always found both flag values to be equal and never sets the rec->need_takeover_run variable to true. There were two problem, first the push_flags_handler() function didn't pass the received old flags. And the ctdb_control_modflags() function ignored the received old flags. metze (This used to be ctdb commit 8ec633b64a05a2d903c2b9639909f15f6375548f) 2009-10-09 17:47:49 +04:00			`if (c->pnn != ctdb->pnn) {`
			`c->old_flags = node->flags;`
			`}`
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`node->flags = c->new_flags & ~NODE_FLAGS_DISCONNECTED;`
			`node->flags \|= (c->old_flags & NODE_FLAGS_DISCONNECTED);`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00
dont let other nodes modify the STOPPED flag for the local process when pushing out flags changes (This used to be ctdb commit 501a2747d839ca291b70c761098549cf6d47a158) 2009-07-09 07:20:14 +04:00			`/* we dont let other nodes modify our STOPPED status */`
			`if (c->pnn == ctdb->pnn) {`
			`node->flags &= ~NODE_FLAGS_STOPPED;`
			`if (old_flags & NODE_FLAGS_STOPPED) {`
			`node->flags \|= NODE_FLAGS_STOPPED;`
			`}`
			`}`

new prototype banning code (This used to be ctdb commit 0c4c2240267af183d54ffd4c0aacda208f6eff6a) 2009-09-03 20:20:39 +04:00			`/* we dont let other nodes modify our BANNED status */`
			`if (c->pnn == ctdb->pnn) {`
			`node->flags &= ~NODE_FLAGS_BANNED;`
			`if (old_flags & NODE_FLAGS_BANNED) {`
			`node->flags \|= NODE_FLAGS_BANNED;`
			`}`
			`}`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`if (node->flags == c->old_flags) {`
			`DEBUG(DEBUG_INFO, ("Control modflags on node %u - Unchanged - flags 0x%x\n", c->pnn, node->flags));`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00			`return 0;`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`}`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`DEBUG(DEBUG_INFO, ("Control modflags on node %u - flags now 0x%x\n", c->pnn, node->flags));`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00
ctdbd: Allow extra recovery to repair persistent DBs during first recovery Commit 8076773a9924dcf8aff16f7d96b2b9ac383ecc28 introduced a potential regression because a node may not have completed the "recovered" event (so might still be in CTDB_RUNSTATE_FIRST_RECOVERY) when another node becomes healthy. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 57ef5d3827ea3417a32703e259a53ce6fd10ac45) 2013-06-24 09:49:48 +04:00			`if (node->flags == 0 && ctdb->runstate <= CTDB_RUNSTATE_STARTUP) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Node %u became healthy - force recovery for startup\n",`
			`c->pnn));`
			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`/* tell the recovery daemon something has changed */`
ctdb-daemon: Pass on consistent flag information to recovery daemon Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-04 09:18:12 +03:00			`c->new_flags = node->flags;`
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`ctdb_daemon_send_message(ctdb, ctdb->pnn,`
			`CTDB_SRVID_SET_NODE_FLAGS, indata);`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`/* if we have become banned, we should go into recovery mode */`
When we ban a node, only drop the IPs on the node being banned, not on every node (This used to be ctdb commit 46e8c3737e6ff54fc80de8e962e922924c27bc35) 2009-06-10 04:28:47 +04:00			`if ((node->flags & NODE_FLAGS_BANNED) && !(c->old_flags & NODE_FLAGS_BANNED) && (node->pnn == ctdb->pnn)) {`
banning: Make ctdb_local_node_got_banned() a void function When this function is called, we are already committed to banning and there is no point in failing this function. In case, freezing of databases fails, it will be fixed from recovery daemon. (This used to be ctdb commit bb178338658b4ae32382a1f62f7c21cee1d4878f) 2013-06-28 08:04:18 +04:00			`ctdb_local_node_got_banned(ctdb);`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
			`return 0;`
			`}`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00
			`/*`
			`return the monitoring mode`
			`*/`
			`int32_t ctdb_monitoring_mode(struct ctdb_context *ctdb)`
			`{`
			`if (ctdb->monitor == NULL) {`
			`return CTDB_MONITORING_DISABLED;`
			`}`
			`return ctdb->monitor->monitoring_mode;`
			`}`
add a mechanism to force a node to run the eventscripts with arbitrary arguments ctdb eventscript "command argument argument ..." (This used to be ctdb commit 118a16e763d8332c6ce4d8b8e194775fb874c8c8) 2008-04-02 04:13:30 +04:00
daemon: Protect against double free of callback state while shutting down When CTDB is shut down and monitoring has been stopped, monitor_context gets freed and all the callback states hanging off it. This includes callback state for current_monitor, if the current monitor event has not yet finished. As a result, when the shutdown event is called, current_monitor->callback state is not NULL, but it's actually freed and it's a dangling reference. So before executing callback function and freeing callback state check if ctdb->monitor->monitor_context is not NULL. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7d8546ee4353851f0543d0ca2c4c67cb0cc75aea) 2012-10-29 07:56:10 +04:00			`/*`
			`* Check if monitoring has been stopped`
			`*/`
			`bool ctdb_stopped_monitoring(struct ctdb_context *ctdb)`
			`{`
			`return (ctdb->monitor->monitor_context == NULL ? true : false);`
			`}`

533 lines 15 KiB C Raw Normal View History Unescape Escape

533 lines

15 KiB

C

Raw Normal View History