samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-23 17:34:34 +03:00

516 lines

15 KiB

C

Raw Normal View History

add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`/*`
			`monitoring links to all other nodes to detect dead nodes`


			`Copyright (C) Ronnie Sahlberg 2007`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`

			`This program is distributed in the hope that it will be useful,`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`

			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`*/`

			`#include "includes.h"`
event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version 7f29f817fa939ef1bbb740584f09e76e2ecd5b06. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726) 2010-08-18 03:46:31 +04:00			`#include "lib/tevent/tevent.h"`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`#include "system/filesys.h"`
			`#include "system/wait.h"`
			`#include "../include/ctdb_private.h"`

exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`struct ctdb_monitor_state {`
			`uint32_t monitoring_mode;`
			`TALLOC_CTX *monitor_context;`
			`uint32_t next_interval;`
			`};`

added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`static void ctdb_check_health(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data);`

add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`/*`
			`setup the notification script`
			`*/`
			`int ctdb_set_notification_script(struct ctdb_context ctdb, const char script)`
			`{`
			`ctdb->notification_script = talloc_strdup(ctdb, script);`
			`CTDB_NO_MEMORY(ctdb, ctdb->notification_script);`
			`return 0;`
			`}`

			`static int ctdb_run_notification_script_child(struct ctdb_context ctdb, const char event)`
			`{`
			`struct stat st;`
			`int ret;`
			`char *cmd;`

			`if (stat(ctdb->notification_script, &st) != 0) {`
			`DEBUG(DEBUG_ERR,("Could not stat notification script %s. Can not send notifications.\n", ctdb->notification_script));`
			`return -1;`
			`}`
			`if (!(st.st_mode & S_IXUSR)) {`
			`DEBUG(DEBUG_ERR,("Notification script %s is not executable.\n", ctdb->notification_script));`
			`return -1;`
			`}`

			`cmd = talloc_asprintf(ctdb, "%s %s\n", ctdb->notification_script, event);`
			`CTDB_NO_MEMORY(ctdb, cmd);`

			`ret = system(cmd);`
			`/* if the system() call was successful, translate ret into the`
			`return code from the command`
			`*/`
			`if (ret != -1) {`
			`ret = WEXITSTATUS(ret);`
			`}`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,("Notification script \"%s\" failed with error %d\n", cmd, ret));`
			`}`

			`return ret;`
			`}`

server: add "init" event This is needed because the "startup" event runs after the initial recovery, but we need to do some actions before the initial recovery. metze (This used to be ctdb commit e953808449c102258abb6cba6f4abf486dda3b82) 2010-01-19 12:07:14 +03:00			`void ctdb_run_notification_script(struct ctdb_context ctdb, const char event)`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`{`
			`pid_t child;`

			`if (ctdb->notification_script == NULL) {`
			`return;`
			`}`

			`child = fork();`
			`if (child == (pid_t)-1) {`
			`DEBUG(DEBUG_ERR,("Failed to fork() a notification child process\n"));`
			`return;`
			`}`
			`if (child == 0) {`
			`int ret;`

logging: give a unique logging name to each forked child. This means we can distinguish which child is logging, esp. via syslog where we have no pid. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 68b3761a0874429b90731741f0531f76dcfbb081) 2010-07-19 13:59:09 +04:00			`debug_extra = talloc_asprintf(NULL, "notification-%s:", event);`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`ret = ctdb_run_notification_script_child(ctdb, event);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Notification script failed\n"));`
			`}`
			`_exit(0);`
			`}`

			`return;`
			`}`

added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`/*`
			`called when a health monitoring event script finishes`
			`*/`
			`static void ctdb_health_callback(struct ctdb_context ctdb, int status, void p)`
			`{`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`struct ctdb_node *node = ctdb->nodes[ctdb->pnn];`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`TDB_DATA data;`
			`struct ctdb_node_flag_change c;`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`uint32_t next_interval;`
when we change state between healthy/unhealthy, make sure we ask the recovery master to perform an explicit ip reallocation. This is more reliable and faster than having the recovery dameon track these changes, and since we now have an explicit method to ask the recovery daemon to perform an explicit ip reallocation, we should use this. (This used to be ctdb commit 3807681e74f4bfe92befdae6ed616ff5f1a99880) 2009-10-14 04:59:16 +04:00			`int ret;`
			`TDB_DATA rddata;`
			`struct takeover_run_reply rd;`
server/monitor: ask for a takeoverrun after propagating our new flags metze (This used to be ctdb commit 942f44123350d4d0c4ad7f3fcd5ff2d0d175739b) 2010-08-24 11:22:49 +04:00			`const char *state_str = NULL;`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
change ctdb_node_flags_change.vnn to ctdb_node_flags_changed.pnn change ctdb_ban_info.vnn to ctdb_ban_info.pnn (This used to be ctdb commit fcedd40e0493948829e1c921d4fe30e9196e398a) 2007-09-04 04:33:10 +04:00			`c.pnn = ctdb->pnn;`
change the structure used for node flag change messages so that we can see both the old flags as well as the new flags (so we can tell which flags changed) send the CTDB_SRVID_RECONFIGURE messages to connected nodes only, not to every node, connected or not, in the cluster. in the handler inside the recovery daemon which is invoked for node flag change messages, only do a takeover_run() and redistribute the ip addresses IF it was the disabled or the unhealthy flags that changed. Also send out the cluster reconfigured message in this case. If any of the other flags changed we dont need to do the takeover_run(0 here since that will be done during recovery. (This used to be ctdb commit 5549b2058e2c148a8ca9d419123acf3247bb8829) 2007-08-21 11:25:15 +04:00			`c.old_flags = node->flags;`

when we change state between healthy/unhealthy, make sure we ask the recovery master to perform an explicit ip reallocation. This is more reliable and faster than having the recovery dameon track these changes, and since we now have an explicit method to ask the recovery daemon to perform an explicit ip reallocation, we should use this. (This used to be ctdb commit 3807681e74f4bfe92befdae6ed616ff5f1a99880) 2009-10-14 04:59:16 +04:00			`rd.pnn = ctdb->pnn;`
			`rd.srvid = CTDB_SRVID_TAKEOVER_RUN_RESPONSE;`

			`rddata.dptr = (uint8_t *)&rd;`
			`rddata.dsize = sizeof(rd);`

eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`if (status == -ETIME) {`
			`ctdb->event_script_timeouts++;`

Rename the tunable EventScriptBanCount to EventScriptTimeoutCount since we no longer ban nodes when dodgy scripts continue to hang. We now only mark nodes as unhealthy if monitor events fail or timeout. Never ban. (This used to be ctdb commit 5c8e56fc7a518e115bceac257867739283cf6a1e) 2009-12-14 07:53:23 +03:00			`if (ctdb->event_script_timeouts >= ctdb->tunable.script_timeout_count) {`
			`DEBUG(DEBUG_ERR, ("Maximum timeout count %u reached for eventscript. Making node unhealthy\n", ctdb->tunable.script_timeout_count));`
eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`} else {`
			`/* We pretend this is OK. */`
eventscript: don't make ourselves healthy if we're under ban_count If we've timed out, but we've not timed out more than ctdb->tunable.script_ban_count, we pretend we haven't. There's a logic bug in the way this is done: if we were unhealthy before, this would set us to "healthy" again (status == 0). I don't think this would happen in real life, but it's a little surprising. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e6488c0e05bab5c4c2c0a6370930b0b27e5ed56e) 2009-12-07 16:22:01 +03:00			`goto after_change_status;`
eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`}`
			`}`

merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`if (status != 0 && !(node->flags & NODE_FLAGS_UNHEALTHY)) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("monitor event failed - disabling node\n"));`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`node->flags \|= NODE_FLAGS_UNHEALTHY;`
dont run the monitor event so frequently after a event has failed. use _exit() instead of exit() when terminating an eventscript. (This used to be ctdb commit cc30ee2f4f33cb75b2be980c2d4dff6c7c23852f) 2009-10-27 05:51:45 +03:00			`ctdb->monitor->next_interval = 5;`
add a new tunable DisableWhenUnhealthy which when set will cause a node to automatically become DISABLED anytime monitoring fails and the node becomes UNHEALTHY. Use with caution. (This used to be ctdb commit c20293360db67f9876b0c84e5e9e12a5868964cb) 2008-02-22 02:33:09 +03:00
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00			`ctdb_run_notification_script(ctdb, "unhealthy");`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`} else if (status == 0 && (node->flags & NODE_FLAGS_UNHEALTHY)) {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_NOTICE,("monitor event OK - node re-enabled\n"));`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`node->flags &= ~NODE_FLAGS_UNHEALTHY;`
dont run the monitor event so frequently after a event has failed. use _exit() instead of exit() when terminating an eventscript. (This used to be ctdb commit cc30ee2f4f33cb75b2be980c2d4dff6c7c23852f) 2009-10-27 05:51:45 +03:00			`ctdb->monitor->next_interval = 5;`
add a mechanism where the ctdb daemon will run a usercontrolled script when the node status changes to/from UNHEALTHY state. This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes. (This used to be ctdb commit ce534a83a05dbd40238e4eee0669d60ff396f935) 2009-03-31 07:23:31 +04:00
			`ctdb_run_notification_script(ctdb, "healthy");`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`}`

eventscript: don't make ourselves healthy if we're under ban_count If we've timed out, but we've not timed out more than ctdb->tunable.script_ban_count, we pretend we haven't. There's a logic bug in the way this is done: if we were unhealthy before, this would set us to "healthy" again (status == 0). I don't think this would happen in real life, but it's a little surprising. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e6488c0e05bab5c4c2c0a6370930b0b27e5ed56e) 2009-12-07 16:22:01 +03:00			`after_change_status:`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`next_interval = ctdb->monitor->next_interval;`

			`ctdb->monitor->next_interval *= 2;`
			`if (ctdb->monitor->next_interval > ctdb->tunable.monitor_interval) {`
			`ctdb->monitor->next_interval = ctdb->tunable.monitor_interval;`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00			`}`

exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
->monitor_context is NULL when monitoring is disabled. Check whether monitoring is enabled or not before creating new events and log why the event is not set up othervise (This used to be ctdb commit 2f352b2606c04a65ce461fc2e99e6d6251ac4f20) 2007-11-30 01:02:37 +03:00			`timeval_current_ofs(next_interval, 0),`
			`ctdb_check_health, ctdb);`
run monitoring more quickly when unhealthy and at startup (This used to be ctdb commit ff1c205928e3ef5bcc6bf4e4b2122a19fa38d8f4) 2007-09-24 04:12:18 +04:00
			`if (c.old_flags == node->flags) {`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`return;`
			`}`

change the structure used for node flag change messages so that we can see both the old flags as well as the new flags (so we can tell which flags changed) send the CTDB_SRVID_RECONFIGURE messages to connected nodes only, not to every node, connected or not, in the cluster. in the handler inside the recovery daemon which is invoked for node flag change messages, only do a takeover_run() and redistribute the ip addresses IF it was the disabled or the unhealthy flags that changed. Also send out the cluster reconfigured message in this case. If any of the other flags changed we dont need to do the takeover_run(0 here since that will be done during recovery. (This used to be ctdb commit 5549b2058e2c148a8ca9d419123acf3247bb8829) 2007-08-21 11:25:15 +04:00			`c.new_flags = node->flags;`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
			`data.dptr = (uint8_t *)&c;`
			`data.dsize = sizeof(c);`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`/* ask the recovery daemon to push these changes out to all nodes */`
			`ctdb_daemon_send_message(ctdb, ctdb->pnn,`
			`CTDB_SRVID_PUSH_NODE_FLAGS, data);`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
server/monitor: ask for a takeoverrun after propagating our new flags metze (This used to be ctdb commit 942f44123350d4d0c4ad7f3fcd5ff2d0d175739b) 2010-08-24 11:22:49 +04:00			`if (c.new_flags & NODE_FLAGS_UNHEALTHY) {`
			`state_str = "UNHEALTHY";`
			`} else {`
			`state_str = "HEALTHY";`
			`}`

			`/* ask the recmaster to reallocate all addresses */`
			`DEBUG(DEBUG_ERR,("Node became %s. Ask recovery master %u to perform ip reallocation\n",`
			`state_str, ctdb->recovery_master));`
			`ret = ctdb_daemon_send_message(ctdb, ctdb->recovery_master, CTDB_SRVID_TAKEOVER_RUN, rddata);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Failed to send ip takeover run request message to %u\n", ctdb->recovery_master));`
			`}`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`}`


prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`/*`
			`called when the startup event script finishes`
			`*/`
			`static void ctdb_startup_callback(struct ctdb_context ctdb, int status, void p)`
			`{`
			`if (status != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("startup event failed\n"));`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`} else if (status == 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("startup event OK - enabling monitoring\n"));`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`ctdb->done_startup = true;`
Dont set next_interval to 0. This can cause ctdbd to spin at 100% in the eventsystem, creating a timed event that will immediately trigger again and again. On uniprocessors this cause the eventscript we are actually waiting for to basically become cpu starved and never complete. (This used to be ctdb commit 92c8408fba957a8ded13f7e285da290502735234) 2010-08-20 08:54:03 +04:00			`ctdb->monitor->next_interval = 2;`
add a new notification to trigger on when ctdb has started (This used to be ctdb commit b1fe04f2e9447f762a0b805763deb29296585ff8) 2009-10-01 08:05:30 +04:00			`ctdb_run_notification_script(ctdb, "startup");`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`}`

exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(ctdb->monitor->next_interval, 0),`
			`ctdb_check_health, ctdb);`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`}`


When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`/*`
			`wait until we have finished initial recoveries before we start the`
			`monitoring events`
			`*/`
			`static void ctdb_wait_until_recovered(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`int ret;`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00
			`DEBUG(DEBUG_NOTICE,("CTDB_WAIT_UNTIL_RECOVERED\n"));`

			`if (ctdb->vnn_map->generation == INVALID_GENERATION) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`ctdb->db_persistent_startup_generation = INVALID_GENERATION;`

When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`DEBUG(DEBUG_NOTICE,(__location__ " generation is INVALID. Wait one more second\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`

			`if (ctdb->recovery_mode != CTDB_RECOVERY_NORMAL) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`ctdb->db_persistent_startup_generation = INVALID_GENERATION;`

When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`DEBUG(DEBUG_NOTICE,(__location__ " in recovery. Wait one more second\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`


speed startup: add --sloppy-start. The extra recovery interval wait was introduced in 821333afb458 but no explanation was provided in that message. Nonetheless, if starting the entire cluster for the first time, it should be safe to skip this. We use the commandline arg --sloppy-start which should discourage people from using it outside testing. Seconds between ctdbd first log message and node healthy: BEFORE: 16.10 AFTER: 4.03 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 509e2e89ae233a0e91998d95267bf62f296a73cd) 2010-06-22 17:22:34 +04:00			`if (!fast_start && timeval_elapsed(&ctdb->last_recovery_finished) < (ctdb->tunable.rerecovery_timeout + 3)) {`
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`ctdb->db_persistent_startup_generation = INVALID_GENERATION;`

When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`DEBUG(DEBUG_NOTICE,(__location__ " wait for pending recoveries to end. Wait one more second.\n"));`

			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`

server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`if (ctdb->vnn_map->generation == ctdb->db_persistent_startup_generation) {`
			`DEBUG(DEBUG_INFO,(__location__ " skip ctdb_recheck_persistent_health() "`
			`"until the next recovery\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`

			`ctdb->db_persistent_startup_generation = ctdb->vnn_map->generation;`
			`ret = ctdb_recheck_persistent_health(ctdb);`
			`if (ret != 0) {`
			`ctdb->db_persistent_check_errors++;`
			`if (ctdb->db_persistent_check_errors < ctdb->max_persistent_check_errors) {`
			`DEBUG(ctdb->db_persistent_check_errors==1?DEBUG_ERR:DEBUG_WARNING,`
			`(__location__ "ctdb_recheck_persistent_health() "`
			`"failed (%llu of %llu times) - retry later\n",`
			`(unsigned long long)ctdb->db_persistent_check_errors,`
			`(unsigned long long)ctdb->max_persistent_check_errors));`
			`event_add_timed(ctdb->ev,`
			`ctdb->monitor->monitor_context,`
			`timeval_current_ofs(1, 0),`
			`ctdb_wait_until_recovered, ctdb);`
			`return;`
			`}`
			`DEBUG(DEBUG_ALERT,(__location__`
			`"ctdb_recheck_persistent_health() failed (%llu times) - prepare shutdown\n",`
			`(unsigned long long)ctdb->db_persistent_check_errors));`
			`ctdb_stop_recoverd(ctdb);`
			`ctdb_stop_keepalive(ctdb);`
			`ctdb_stop_monitoring(ctdb);`
			`ctdb_release_all_ips(ctdb);`
			`if (ctdb->methods != NULL) {`
			`ctdb->methods->shutdown(ctdb);`
			`}`
			`ctdb_event_script(ctdb, CTDB_EVENT_SHUTDOWN);`
			`DEBUG(DEBUG_ALERT,("ctdb_recheck_persistent_health() failed - Stopping CTDB daemon\n"));`
			`exit(11);`
			`}`
			`ctdb->db_persistent_check_errors = 0;`
			`DEBUG(DEBUG_NOTICE,(__location__`
			`"ctdb_start_monitoring: ctdb_recheck_persistent_health() OK\n"));`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00
			`DEBUG(DEBUG_NOTICE,(__location__ " Recoveries finished. Running the \"startup\" event.\n"));`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
speed startup: run startup immediately after recovery finished. Seconds between ctdbd first log message and node healthy: BEFORE: 17.08 AFTER: 16.10 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 372201d418f041d69646793105f6898ab12a7d91) 2010-06-22 17:20:45 +04:00			`timeval_current(),`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`ctdb_check_health, ctdb);`
			`}`


added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`/*`
			`see if the event scripts think we are healthy`
			`*/`
			`static void ctdb_check_health(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`int ret = 0;`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
don't do the first startup event until we are out of recovery (This used to be ctdb commit 689940eb6e23f16ee063331caf3986613a8963ea) 2007-11-12 05:10:15 +03:00			`if (ctdb->recovery_mode != CTDB_RECOVERY_NORMAL \|\|`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`(ctdb->monitor->monitoring_mode == CTDB_MONITORING_DISABLED && ctdb->done_startup)) {`
			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(ctdb->monitor->next_interval, 0),`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`ctdb_check_health, ctdb);`
			`return;`
			`}`

prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`if (!ctdb->done_startup) {`
			`ret = ctdb_event_script_callback(ctdb,`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitor_context, ctdb_startup_callback,`
Add flag to ctdb_event_script_callback indicating when called by client. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a1d654a982ca56fade82552f4e6b5586236d3233) 2009-11-26 07:49:49 +03:00			`ctdb, false,`
			`CTDB_EVENT_STARTUP, "%s", "");`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`} else {`
Dont run eventscript monitor when the databases are frozen. The databases can become frozen a while before we do the actual recovery since we have the re-recovery timeout. There is no point in doing much monitoring if we are waiting for a recovery, or if we are banned. This will eliminate some annoying log entries where certain tests will fail if the databases are locked. (This used to be ctdb commit ff824676fab94168707aada7423ae766bc0f711c) 2009-10-15 09:03:43 +04:00			`int i;`
			`int skip_monitoring = 0;`

			`if (ctdb->recovery_mode != CTDB_RECOVERY_NORMAL) {`
			`skip_monitoring = 1;`
			`DEBUG(DEBUG_ERR,("Skip monitoring during recovery\n"));`
			`}`
			`for (i=1; i<=NUM_DB_PRIORITIES; i++) {`
if a lock wait child died/finished, we could have released the lockwait handle and set it to NULL before we call the destructors for releaseing the waiters. The waiters reference the locakwait handle in order to remove itself from the li nked list which caused a SEGV. We dont actually need to remove ourselves from this list here since if the parent freeze_handle holding the list is freed, then all waiters are rele ased as well, and the only place we actually need to relink the waiter is in ctd b_freeze_lock_handler, where we want to respond back to the clients and release the waiters but we still want to keep the freeze_handle hanging around. (This used to be ctdb commit e01ab46bafad09a5e320d420734db129d35863bc) 2009-10-22 06:41:28 +04:00			`if (ctdb->freeze_handles[i] != NULL) {`
Dont run eventscript monitor when the databases are frozen. The databases can become frozen a while before we do the actual recovery since we have the re-recovery timeout. There is no point in doing much monitoring if we are waiting for a recovery, or if we are banned. This will eliminate some annoying log entries where certain tests will fail if the databases are locked. (This used to be ctdb commit ff824676fab94168707aada7423ae766bc0f711c) 2009-10-15 09:03:43 +04:00			`DEBUG(DEBUG_ERR,("Skip monitoring since databases are frozen\n"));`
			`skip_monitoring = 1;`
			`break;`
			`}`
			`}`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`if (skip_monitoring != 0) {`
Dont run eventscript monitor when the databases are frozen. The databases can become frozen a while before we do the actual recovery since we have the re-recovery timeout. There is no point in doing much monitoring if we are waiting for a recovery, or if we are banned. This will eliminate some annoying log entries where certain tests will fail if the databases are locked. (This used to be ctdb commit ff824676fab94168707aada7423ae766bc0f711c) 2009-10-15 09:03:43 +04:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
			`timeval_current_ofs(ctdb->monitor->next_interval, 0),`
			`ctdb_check_health, ctdb);`
			`return;`
			`} else {`
			`ret = ctdb_event_script_callback(ctdb,`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->monitor_context, ctdb_health_callback,`
Add flag to ctdb_event_script_callback indicating when called by client. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a1d654a982ca56fade82552f4e6b5586236d3233) 2009-11-26 07:49:49 +03:00			`ctdb, false,`
			`CTDB_EVENT_MONITOR, "%s", "");`
Dont run eventscript monitor when the databases are frozen. The databases can become frozen a while before we do the actual recovery since we have the re-recovery timeout. There is no point in doing much monitoring if we are waiting for a recovery, or if we are banned. This will eliminate some annoying log entries where certain tests will fail if the databases are locked. (This used to be ctdb commit ff824676fab94168707aada7423ae766bc0f711c) 2009-10-15 09:03:43 +04:00			`}`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`}`

added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Unable to launch monitor event script\n"));`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`timeval_current_ofs(5, 0),`
			`ctdb_check_health, ctdb);`
			`}`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00			`}`

add ctdb_disable/enable_monitoring() that only modifies the monitoring flag. change calling of the recovered/takeip/releaseip event scripts to use these enable/disable functions instead of stopping/starting monitoring. when we disable monitoring we want all events to still be running in particular the events to monitor for dead nodes and we only want to supress running the monitor event scripts (This used to be ctdb commit a006dcc4f75aba950dd701ad7d1a84e89df285e8) 2007-11-30 02:09:54 +03:00			`/*`
			`(Temporaily) Disabling monitoring will stop the monitor event scripts`
			`from running but node health checks will still occur`
			`*/`
			`void ctdb_disable_monitoring(struct ctdb_context *ctdb)`
			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_DISABLED;`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,("Monitoring has been disabled\n"));`
add ctdb_disable/enable_monitoring() that only modifies the monitoring flag. change calling of the recovered/takeip/releaseip event scripts to use these enable/disable functions instead of stopping/starting monitoring. when we disable monitoring we want all events to still be running in particular the events to monitor for dead nodes and we only want to supress running the monitor event scripts (This used to be ctdb commit a006dcc4f75aba950dd701ad7d1a84e89df285e8) 2007-11-30 02:09:54 +03:00			`}`

			`/*`
			`Re-enable running monitor events after they have been disabled`
			`*/`
			`void ctdb_enable_monitoring(struct ctdb_context *ctdb)`
			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_ACTIVE;`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,("Monitoring has been enabled\n"));`
add ctdb_disable/enable_monitoring() that only modifies the monitoring flag. change calling of the recovered/takeip/releaseip event scripts to use these enable/disable functions instead of stopping/starting monitoring. when we disable monitoring we want all events to still be running in particular the events to monitor for dead nodes and we only want to supress running the monitor event scripts (This used to be ctdb commit a006dcc4f75aba950dd701ad7d1a84e89df285e8) 2007-11-30 02:09:54 +03:00			`}`

			`/* stop any monitoring`
			`this should only be done when shutting down the daemon`
			`*/`
added timeouts in all event scripts (This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1) 2007-06-06 07:45:12 +04:00			`void ctdb_stop_monitoring(struct ctdb_context *ctdb)`
			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`talloc_free(ctdb->monitor->monitor_context);`
			`ctdb->monitor->monitor_context = NULL;`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_DISABLED;`
change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("Monitoring has been stopped\n"));`
added timeouts in all event scripts (This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1) 2007-06-06 07:45:12 +04:00			`}`
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem (This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f) 2007-06-06 04:25:46 +04:00
- up rx_cnt on all packet types - notice when a node becomes available again (This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a) 2007-05-18 17:23:36 +04:00			`/*`
			`start watching for nodes that might be dead`
			`*/`
added timeouts in all event scripts (This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1) 2007-06-06 07:45:12 +04:00			`void ctdb_start_monitoring(struct ctdb_context *ctdb)`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`{`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`if (ctdb->monitor != NULL) {`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00			`return;`
			`}`

exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor = talloc(ctdb, struct ctdb_monitor_state);`
			`CTDB_NO_MEMORY_FATAL(ctdb, ctdb->monitor);`

change the eventscript handling to allow EventScriptTimeout for each individual script isntead of for the entire set of scripts restructure the talloc hierarchy to allow this (This used to be ctdb commit 64da4402c6ad485f1d0a604878a7b0c01a0ea5f0) 2009-10-28 08:11:54 +03:00			`ctdb->monitor->next_interval = 5;`
added timeouts in all event scripts (This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1) 2007-06-06 07:45:12 +04:00
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitor_context = talloc_new(ctdb->monitor);`
			`CTDB_NO_MEMORY_FATAL(ctdb, ctdb->monitor->monitor_context);`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`event_add_timed(ctdb->ev, ctdb->monitor->monitor_context,`
get rid of monitor_retry as well (This used to be ctdb commit c957cf9c1d99d5d3f4ca726f7a867c829660a2b7) 2008-01-10 06:49:43 +03:00			`timeval_current_ofs(1, 0),`
When starting up ctdbd, wait until all initial recoveries have finished and until we have gone through a full re-recovery timeout without triggering any pending recoveries before we start up the services and start monitoring the node. (This used to be ctdb commit 821333afb458358f90446062b0242790695e5060) 2009-12-01 05:19:58 +03:00			`ctdb_wait_until_recovered, ctdb);`
dont manipulate ctdb->monitoring_mode directly from the SET_MON_MODE control, instead call ctdb_start/stop_monitoring() ctdb_stop_monitoring() dont allocate a new monitoring context, leave it NULL. Also set the monitoring_mode in this function so that ctdb_stop/start_monitoring() and ->monitoring_mode are kept in sync. Add a debug message to log that we have stopped monitoring. ctdb_start_monitoring() check whether monitoring is already active and make the function idempotent. Create the monitoring context when monitoring is started. Update ->monitoring_mode once the monitoring has been started. Add a debug message to log that we have started monitoring. When we temporarily stop monitoring while running an event script, restart monitoring after the event script wrapper returns instead of in the event script callback. Let monitoring_mode start out as DISABLED and let it be enabled once we call ctdb_start_monitoring. dont check for MONITORING_DISABLED in check_fore_dead_nodes(). If monitoring is disabled, this event handler will not be called. (This used to be ctdb commit 3a93ae8bdcffb1adbd6243844f3058fc742f76aa) 2007-11-30 00:44:34 +03:00
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00			`ctdb->monitor->monitoring_mode = CTDB_MONITORING_ACTIVE;`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("Monitoring has been started\n"));`
add a missing file :-) (This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5) 2007-05-18 14:06:29 +04:00			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00

			`/*`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00			`modify flags on a node`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`*/`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00			`int32_t ctdb_control_modflags(struct ctdb_context *ctdb, TDB_DATA indata)`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`{`
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`struct ctdb_node_flag_change c = (struct ctdb_node_flag_change )indata.dptr;`
			`struct ctdb_node *node;`
dont let other nodes modify the STOPPED flag for the local process when pushing out flags changes (This used to be ctdb commit 501a2747d839ca291b70c761098549cf6d47a158) 2009-07-09 07:20:14 +04:00			`uint32_t old_flags;`
initial attempt at freezing databases in priority order (This used to be ctdb commit e8d692590da1070c87a4144031e3306d190ebed2) 2009-10-12 05:08:39 +04:00			`int i;`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`if (c->pnn >= ctdb->num_nodes) {`
			`DEBUG(DEBUG_ERR,(__location__ " Node %d is invalid, num_nodes :%d\n", c->pnn, ctdb->num_nodes));`
			`return -1;`
			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`node = ctdb->nodes[c->pnn];`
dont let other nodes modify the STOPPED flag for the local process when pushing out flags changes (This used to be ctdb commit 501a2747d839ca291b70c761098549cf6d47a158) 2009-07-09 07:20:14 +04:00			`old_flags = node->flags;`
server: if takeover runs when the recovery master becomes unhealthy The problem was this: When the monitor event fails, the node->flags get updated, and an update (containing the old and new flags) is sent to the recovery master. If the recovery master sends the update to itself (the same process), it was compairing the node->flags variable with the received new flags. This check always found both flag values to be equal and never sets the rec->need_takeover_run variable to true. There were two problem, first the push_flags_handler() function didn't pass the received old flags. And the ctdb_control_modflags() function ignored the received old flags. metze (This used to be ctdb commit 8ec633b64a05a2d903c2b9639909f15f6375548f) 2009-10-09 17:47:49 +04:00			`if (c->pnn != ctdb->pnn) {`
			`c->old_flags = node->flags;`
			`}`
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`node->flags = c->new_flags & ~NODE_FLAGS_DISCONNECTED;`
			`node->flags \|= (c->old_flags & NODE_FLAGS_DISCONNECTED);`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00
dont let other nodes modify the STOPPED flag for the local process when pushing out flags changes (This used to be ctdb commit 501a2747d839ca291b70c761098549cf6d47a158) 2009-07-09 07:20:14 +04:00			`/* we dont let other nodes modify our STOPPED status */`
			`if (c->pnn == ctdb->pnn) {`
			`node->flags &= ~NODE_FLAGS_STOPPED;`
			`if (old_flags & NODE_FLAGS_STOPPED) {`
			`node->flags \|= NODE_FLAGS_STOPPED;`
			`}`
			`}`

new prototype banning code (This used to be ctdb commit 0c4c2240267af183d54ffd4c0aacda208f6eff6a) 2009-09-03 20:20:39 +04:00			`/* we dont let other nodes modify our BANNED status */`
			`if (c->pnn == ctdb->pnn) {`
			`node->flags &= ~NODE_FLAGS_BANNED;`
			`if (old_flags & NODE_FLAGS_BANNED) {`
			`node->flags \|= NODE_FLAGS_BANNED;`
			`}`
			`}`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`if (node->flags == c->old_flags) {`
			`DEBUG(DEBUG_INFO, ("Control modflags on node %u - Unchanged - flags 0x%x\n", c->pnn, node->flags));`
implement a scheme where nodes are banned if they continuously caused the cluster to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes) (This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c) 2007-06-07 09:18:55 +04:00			`return 0;`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00			`}`

reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`DEBUG(DEBUG_INFO, ("Control modflags on node %u - flags now 0x%x\n", c->pnn, node->flags));`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00
server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`if (node->flags == 0 && !ctdb->done_startup) {`
			`DEBUG(DEBUG_ERR, (__location__ " Node %u became healthy - force recovery for startup\n",`
			`c->pnn));`
			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`/* tell the recovery daemon something has changed */`
			`ctdb_daemon_send_message(ctdb, ctdb->pnn,`
			`CTDB_SRVID_SET_NODE_FLAGS, indata);`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00
reqrite the handling of flag updates across the cluster to eliminate a race between the ctdb tool and the recovery daemon both at once trying to push flag changes across the cluster. (This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa) 2008-11-19 06:43:46 +03:00			`/* if we have become banned, we should go into recovery mode */`
When we ban a node, only drop the IPs on the node being banned, not on every node (This used to be ctdb commit 46e8c3737e6ff54fc80de8e962e922924c27bc35) 2009-06-10 04:28:47 +04:00			`if ((node->flags & NODE_FLAGS_BANNED) && !(c->old_flags & NODE_FLAGS_BANNED) && (node->pnn == ctdb->pnn)) {`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00			`/* make sure we are frozen */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("This node has been banned - forcing freeze and recovery\n"));`
when a node becomes banned its databases are no longer part of ctdb and it should thus no longer serve any database access calls until it has been reintroduced into the cluster. when becoming banned, reset the local generation id to 1 to prevent any further database access calls from other nodes from being processed. (This used to be ctdb commit b531021db43ebaa5f5d0ace28c59913d359bd8a8) 2007-08-22 04:38:35 +04:00			`/* Reset the generation id to 1 to make us ignore any`
			`REQ/REPLY CALL/DMASTER someone sends to us.`
			`We are now banned so we shouldnt service database calls`
			`anymore.`
			`*/`
create a define to represent the 'invalid' generation id we used in two places. create a new helper function to generate new generation id values that know about the invalid id and avoids generating it. update the ctdb status tool to know about the invalid generation id and print the string INVALID instead (This used to be ctdb commit 4fbcd189543cb8a92227fdcd3d158472e558ccda) 2007-08-22 06:38:31 +04:00			`ctdb->vnn_map->generation = INVALID_GENERATION;`
when a node becomes banned its databases are no longer part of ctdb and it should thus no longer serve any database access calls until it has been reintroduced into the cluster. when becoming banned, reset the local generation id to 1 to prevent any further database access calls from other nodes from being processed. (This used to be ctdb commit b531021db43ebaa5f5d0ace28c59913d359bd8a8) 2007-08-22 04:38:35 +04:00
initial attempt at freezing databases in priority order (This used to be ctdb commit e8d692590da1070c87a4144031e3306d190ebed2) 2009-10-12 05:08:39 +04:00			`for (i=1; i<=NUM_DB_PRIORITIES; i++) {`
			`if (ctdb_start_freeze(ctdb, i) != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Failed to freeze db priority %u\n", i));`
			`}`
			`}`
- send tcp info to all connected nodes, not just vnnmap nodes - use a non-blocking freeze when banned - release all IPs when banned (This used to be ctdb commit 070e85e532b33b792f85c3e72eee205d906aaf85) 2007-06-10 02:46:33 +04:00			`ctdb_release_all_ips(ctdb);`
added admin commands to ban/unban nodes (This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad) 2007-06-07 10:34:33 +04:00			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`}`
merged admin enable/disable change from ronnie (This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a) 2007-06-07 05:15:22 +04:00
			`return 0;`
			`}`
exponential backoff in health monitoring for faster startup (This used to be ctdb commit 1b04a1f675f73b48366ba98803a58c3d8df1b6e1) 2008-01-10 06:40:56 +03:00
			`/*`
			`return the monitoring mode`
			`*/`
			`int32_t ctdb_monitoring_mode(struct ctdb_context *ctdb)`
			`{`
			`if (ctdb->monitor == NULL) {`
			`return CTDB_MONITORING_DISABLED;`
			`}`
			`return ctdb->monitor->monitoring_mode;`
			`}`
add a mechanism to force a node to run the eventscripts with arbitrary arguments ctdb eventscript "command argument argument ..." (This used to be ctdb commit 118a16e763d8332c6ce4d8b8e194775fb874c8c8) 2008-04-02 04:13:30 +04:00

516 lines 15 KiB C Raw Normal View History Unescape Escape

516 lines

15 KiB

C

Raw Normal View History