samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-23 17:34:34 +03:00

1321 lines

36 KiB

C

Raw Normal View History

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`/*`
			`ctdb daemon code`

			`Copyright (C) Andrew Tridgell 2006`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`

			`This program is distributed in the hope that it will be useful,`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`

			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`*/`

			`#include "includes.h"`
			`#include "db_wrap.h"`
			`#include "lib/tdb/include/tdb.h"`
event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version 7f29f817fa939ef1bbb740584f09e76e2ecd5b06. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726) 2010-08-18 03:46:31 +04:00			`#include "lib/tevent/tevent.h"`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`#include "lib/util/dlinklist.h"`
			`#include "system/network.h"`
			`#include "system/filesys.h"`
block SIGPIPE in the daemon to prevent a SIGPIPE on write to a dead socket (This used to be ctdb commit 02c09dc07c9bed57ca3692b14e41ac8cca0a29f4) 2007-04-17 09:33:20 +04:00			`#include "system/wait.h"`
libctdb: reorganize headers: remove ctdb.h, add ctdb_client.h and ctdb_protocol.h ctdb_client.h is the existing internal client interface (which was mainly in ctdb.h), and ctdb_protocol.h is the information needed for the wire protocol only. ctdb.h will be the new, shiny, libctdb API. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 4bba6b8cd47b352f98d41f9f06258d5ac3c9adef) 2010-05-20 09:48:30 +04:00			`#include "../include/ctdb_client.h"`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`#include "../include/ctdb_private.h"`
add improvements to tracking memory usage in ctdbd adn the recovery daemon and a ctdb command to pull the talloc memory map from a recovery daemon ctdb rddumpmemory (This used to be ctdb commit d23950be7406cf288f48b660c0f57a9b8d7bdd05) 2008-04-01 08:34:54 +04:00			`#include <sys/socket.h>`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
Add a double linked list to the ctdb_context to store a mapping between client pids and client structures. Add the mapping to the list everytime we accept() a new client connection and set it up to remove in the destructor when the client structure is freed. (This used to be ctdb commit f75d379377f5d4abbff2576ddc5d58d91dc53bf4) 2009-12-02 05:41:04 +03:00			`struct ctdb_client_pid_list {`
			`struct ctdb_client_pid_list next, prev;`
			`struct ctdb_context *ctdb;`
			`pid_t pid;`
			`struct ctdb_client *client;`
			`};`

make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`static void daemon_incoming_packet(void , struct ctdb_req_header );`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00
no longer wait at startup for services to become available, instead set the node initially unhealthy and let the status monitoring bring the node online. This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup and thus frozen (This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc) 2007-09-24 04:00:14 +04:00			`static void print_exit_message(void)`
			`{`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("CTDB daemon shutting down\n"));`
no longer wait at startup for services to become available, instead set the node initially unhealthy and let the status monitoring bring the node online. This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup and thus frozen (This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc) 2007-09-24 04:00:14 +04:00			`}`

We can not always rely on the recovery daemon pinging us in a timely manner so we need a "ticker" in the main ctdbd daemon too to ensure we get at least one event to process every second. This will improve the accuracy of "Time jumped" messages and remove false positives when the recovery daemon is "slow". (This used to be ctdb commit 70154e5e19e219de086b2995d41e8f6e069ee20d) 2011-01-14 01:46:04 +03:00

			`static void ctdb_time_tick(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`

			`if (getpid() != ctdbd_pid) {`
			`return;`
			`}`

			`event_add_timed(ctdb->ev, ctdb,`
			`timeval_current_ofs(1, 0),`
			`ctdb_time_tick, ctdb);`
			`}`

			`/* Used to trigger a dummy event once per second, to make`
			`* detection of hangs more reliable.`
			`*/`
			`static void ctdb_start_time_tickd(struct ctdb_context *ctdb)`
			`{`
			`event_add_timed(ctdb->ev, ctdb,`
			`timeval_current_ofs(1, 0),`
			`ctdb_time_tick, ctdb);`
			`}`


don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`/* called when the "startup" event script has finished */`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`static void ctdb_start_transport(struct ctdb_context *ctdb)`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`{`
ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_ALERT,(__location__ " startup event finished but transport is DOWN.\n"));`
			`ctdb_fatal(ctdb, "transport is not initialized but startup completed");`
			`}`

don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`/* start the transport running */`
			`if (ctdb->methods->start(ctdb) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("transport failed to start!\n"));`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`ctdb_fatal(ctdb, "transport failed to start");`
			`}`

			`/* start the recovery daemon process */`
			`if (ctdb_start_recoverd(ctdb) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("Failed to start recovery daemon\n"));`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`exit(11);`
			`}`
- use a CTDB_BROADCAST_ALL for the attach message so it goes to currently disconnected nodes - start node monitoring only after transport starts - check if a node is already disconnected in the node dead function (This used to be ctdb commit b81ab6d507797282237768380c6f0e5a4c6519a5) 2007-05-30 08:35:22 +04:00
no longer wait at startup for services to become available, instead set the node initially unhealthy and let the status monitoring bring the node online. This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup and thus frozen (This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc) 2007-09-24 04:00:14 +04:00			`/* Make sure we log something when the daemon terminates */`
			`atexit(print_exit_message);`

split node health monitoring and checking for connected/disconnected nodes into two separate files. move the monitoring of keepalives for detecting connected/disconnected remote nodes into ctdb_keepalive.c (This used to be ctdb commit 23a57b20c314d5f11a433cf251eb9d9de743849a) 2008-01-15 00:42:12 +03:00			`/* start monitoring for connected/disconnected nodes */`
			`ctdb_start_keepalive(ctdb);`

			`/* start monitoring for node health */`
- use a CTDB_BROADCAST_ALL for the attach message so it goes to currently disconnected nodes - start node monitoring only after transport starts - check if a node is already disconnected in the node dead function (This used to be ctdb commit b81ab6d507797282237768380c6f0e5a4c6519a5) 2007-05-30 08:35:22 +04:00			`ctdb_start_monitoring(ctdb);`
updated ctdb tickle management there is an array for each node/public address that contains tcp tickles we send a TCP_ADD as a broadcast to all nodes when a client is added if tcp tickles are removed, they are only removed immediately from the local node. once every 20 seconds a node will push/broadcast out the tickle list for all public addresses it manages. this will remove any deleted tickles from the remote nodes (This used to be ctdb commit e3c432a915222e1392d91835bc7a73a96ab61ac9) 2007-07-20 09:05:55 +04:00
			`/* start periodic update of tcp tickle lists */`
			`ctdb_start_tcp_tickle_update(ctdb);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00
			`/* start listening for recovery daemon pings */`
			`ctdb_control_recd_ping(ctdb);`
We can not always rely on the recovery daemon pinging us in a timely manner so we need a "ticker" in the main ctdbd daemon too to ensure we get at least one event to process every second. This will improve the accuracy of "Time jumped" messages and remove false positives when the recovery daemon is "slow". (This used to be ctdb commit 70154e5e19e219de086b2995d41e8f6e069ee20d) 2011-01-14 01:46:04 +03:00
			`/* start listening to timer ticks */`
			`ctdb_start_time_tickd(ctdb);`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`}`

block SIGPIPE in the daemon to prevent a SIGPIPE on write to a dead socket (This used to be ctdb commit 02c09dc07c9bed57ca3692b14e41ac8cca0a29f4) 2007-04-17 09:33:20 +04:00			`static void block_signal(int signum)`
			`{`
			`struct sigaction act;`

			`memset(&act, 0, sizeof(act));`

			`act.sa_handler = SIG_IGN;`
			`sigemptyset(&act.sa_mask);`
			`sigaddset(&act.sa_mask, signum);`
			`sigaction(signum, &act, NULL);`
			`}`

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`/*`
			`send a packet to a client`
			`*/`
			`static int daemon_queue_send(struct ctdb_client client, struct ctdb_req_header hdr)`
			`{`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(client->ctdb, client_packets_sent);`
When clients have blocked, perhaps because the node is banned or stopped and the client is blocked trying to tdb_fetch() a record, make sure we dont queue up too many REQ_MESSAGES. Add a new tunable to control the maximum queue size we allow to a blocked client before we start discarding REQ_MESSAGES instead of queueing them for delivery. This avoids having queued up very very large number of MESSAGES that samba semds between eachother to nodes that are blocked/banned/stopped for extended periods . (This used to be ctdb commit f76d6fed8f9630450263b9fa4b5fdf3493fb1e11) 2009-10-21 08:20:55 +04:00			`if (hdr->operation == CTDB_REQ_MESSAGE) {`
			`if (ctdb_queue_length(client->queue) > client->ctdb->tunable.max_queue_depth_drop_msg) {`
ctdb: when we fill the client packet queue we need to drop the client We can't just drop packets to the list, as those packets could be part of the core protocol the client is using. This happens (for example) when Samba is doing a traverse. If we drop a traverse packet then Samba hangs indefinately. We are better off dropping the ctdb socket to Samba. (This used to be ctdb commit a7a86dafa4d88a6bbc6a71b77ed79a178fd802a6) 2010-02-04 06:36:14 +03:00			`DEBUG(DEBUG_ERR,("CTDB_REQ_MESSAGE queue full - killing client connection.\n"));`
			`talloc_free(client);`
			`return -1;`
When clients have blocked, perhaps because the node is banned or stopped and the client is blocked trying to tdb_fetch() a record, make sure we dont queue up too many REQ_MESSAGES. Add a new tunable to control the maximum queue size we allow to a blocked client before we start discarding REQ_MESSAGES instead of queueing them for delivery. This avoids having queued up very very large number of MESSAGES that samba semds between eachother to nodes that are blocked/banned/stopped for extended periods . (This used to be ctdb commit f76d6fed8f9630450263b9fa4b5fdf3493fb1e11) 2009-10-21 08:20:55 +04:00			`}`
			`}`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`return ctdb_queue_send(client->queue, (uint8_t *)hdr, hdr->length);`
			`}`

partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`/*`
			`message handler for when we are in daemon mode. This redirects the message`
			`to the right client`
			`*/`
make srvid 64 bits instead of 32 bits (This used to be ctdb commit 723bcfbba1d5aa711496d37b9658190b78a2d66b) 2007-04-27 18:31:45 +04:00			`static void daemon_message_handler(struct ctdb_context *ctdb, uint64_t srvid,`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`TDB_DATA data, void *private_data)`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`{`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`struct ctdb_client *client = talloc_get_type(private_data, struct ctdb_client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`struct ctdb_req_message *r;`
			`int len;`

			`/* construct a message to send to the client containing the data */`
			`len = offsetof(struct ctdb_req_message, data) + data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdbd_allocate_pkt(ctdb, ctdb, CTDB_REQ_MESSAGE,`
			`len, struct ctdb_req_message);`
			`CTDB_NO_MEMORY_VOID(ctdb, r);`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`talloc_set_name_const(r, "req_message packet");`

			`r->srvid = srvid;`
			`r->datalen = data.dsize;`
			`memcpy(&r->data[0], data.dptr, data.dsize);`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`daemon_queue_send(client, &r->hdr);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00
			`talloc_free(r);`
			`}`

			`/*`
			`this is called when the ctdb daemon received a ctdb request to`
			`set the srvid from the client`
			`*/`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`int daemon_register_message_handler(struct ctdb_context *ctdb, uint32_t client_id, uint64_t srvid)`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`{`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`struct ctdb_client *client = ctdb_reqid_find(ctdb, client_id, struct ctdb_client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`int res;`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`if (client == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Bad client_id in daemon_request_register_message_handler\n"));`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`return -1;`
			`}`
			`res = ctdb_register_message_handler(ctdb, client, srvid, daemon_message_handler, client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to register handler %llu in daemon\n",`
fixed more warnings on 64 bit boxes (This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5) 2007-05-29 07:58:41 +04:00			`(unsigned long long)srvid));`
minor debug changes (This used to be ctdb commit 1950d96458238782c3bfd8e41a053c4be8330ef9) 2007-04-20 01:47:37 +04:00			`} else {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " Registered message handler for srvid=%llu\n",`
fixed more warnings on 64 bit boxes (This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5) 2007-05-29 07:58:41 +04:00			`(unsigned long long)srvid));`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`}`
added code to kill registered clients on a IP release (This used to be ctdb commit ca0243b544987ce0618a99ac87b4abf598991e93) 2007-06-18 21:54:06 +04:00
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`return res;`
			`}`

			`/*`
			`this is called when the ctdb daemon received a ctdb request to`
			`remove a srvid from the client`
			`*/`
			`int daemon_deregister_message_handler(struct ctdb_context *ctdb, uint32_t client_id, uint64_t srvid)`
			`{`
			`struct ctdb_client *client = ctdb_reqid_find(ctdb, client_id, struct ctdb_client);`
			`if (client == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Bad client_id in daemon_request_deregister_message_handler\n"));`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`return -1;`
			`}`
			`return ctdb_deregister_message_handler(ctdb, srvid, client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`}`


make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`/*`
			`destroy a ctdb_client`
			`*/`
			`static int ctdb_client_destructor(struct ctdb_client *client)`
			`{`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`struct ctdb_db_context *ctdb_db;`

added code to ctdb to send a tcp 'tickle' ack when we takeover an IP. A raw tcp ack is sent for each tcp connection held by clients before the IP takeover. These acks have a deliberately incorrect sequence number, and should cause the windows client to send its own ack which will in turn cause a tcp reset and thus cause windows clients to much more quickly reconnect to the new node. (This used to be ctdb commit eef38bfe8461b47489d169c61895d6bb8a8f79a1) 2007-05-27 09:26:29 +04:00			`ctdb_takeover_client_destructor_hook(client);`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`ctdb_reqid_remove(client->ctdb, client->client_id);`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(client->ctdb, num_clients);`
Add two new controls to start and cancel a persistent update. This allows ctdb to automatically start a new full blown recovery if a client has started updating the local tdb for a persistent database but is kill -9ed before it has ensured the update is distributed clusterwide. (This used to be ctdb commit 1ffccb3e0b3b5bd376c5302304029af393709518) 2008-07-17 07:50:55 +04:00
			`if (client->num_persistent_updates != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Client disconnecting with %u persistent updates in flight. Starting recovery\n", client->num_persistent_updates));`
			`client->ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`}`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`ctdb_db = find_ctdb_db(client->ctdb, client->db_id);`
			`if (ctdb_db) {`
			`DEBUG(DEBUG_ERR, (__location__ " client exit while transaction "`
			`"commit active. Forcing recovery.\n"));`
			`client->ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
daemon: correctly end a running trans3_commit if the client disconnects. (This used to be ctdb commit 9e0898db6df52d9bc799dd87bfea8c72d5f70ba0) 2011-02-23 19:37:42 +03:00
			`/* legacy trans2 transaction state: */`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`ctdb_db->transaction_active = false;`
daemon: correctly end a running trans3_commit if the client disconnects. (This used to be ctdb commit 9e0898db6df52d9bc799dd87bfea8c72d5f70ba0) 2011-02-23 19:37:42 +03:00
			`/*`
			`* trans3 transaction state:`
			`*`
			`* The destructor sets the pointer to NULL.`
			`*/`
			`talloc_free(ctdb_db->persistent_state);`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`}`
Add two new controls to start and cancel a persistent update. This allows ctdb to automatically start a new full blown recovery if a client has started updating the local tdb for a persistent database but is kill -9ed before it has ensured the update is distributed clusterwide. (This used to be ctdb commit 1ffccb3e0b3b5bd376c5302304029af393709518) 2008-07-17 07:50:55 +04:00
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return 0;`
			`}`


add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`/*`
			`this is called when the ctdb daemon received a ctdb request message`
			`from a local client over the unix domain socket`
			`*/`
			`static void daemon_request_message_from_client(struct ctdb_client *client,`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`struct ctdb_req_message *c)`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`{`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`TDB_DATA data;`
			`int res;`

			`/* maybe the message is for another client on this node */`
change ctdb_get_vnn to ctdb_get_pnn (This used to be ctdb commit 1e19930198c2bcc7ccb755e0ee51555fb823029a) 2007-09-04 04:18:44 +04:00			`if (ctdb_get_pnn(client->ctdb)==c->hdr.destnode) {`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`ctdb_request_message(client->ctdb, (struct ctdb_req_header *)c);`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`return;`
			`}`
add a special VNN that means "all" nodes so that a message can be broadcasted to all daemons in the cluster change the message dispatch routine for sending messages so that it allows several clients to use the same srvid messages are then passed on to all clients that have that srvid (This used to be ctdb commit 05d7ebb3556785f0f17a87d808f31ffe8dac288a) 2007-04-27 17:16:17 +04:00
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`/* its for a remote node */`
			`data.dptr = &c->data[0];`
			`data.dsize = c->datalen;`
			`res = ctdb_daemon_send_message(client->ctdb, c->hdr.destnode,`
			`c->srvid, data);`
			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to send message to remote node %u\n",`
more DEBUG() calls (This used to be ctdb commit 79f3d63eec5652d87f13875c76e90ead81a26ad9) 2007-04-17 16:27:17 +04:00			`c->hdr.destnode));`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`}`
			`}`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
			`struct daemon_call_state {`
			`struct ctdb_client *client;`
			`uint32_t reqid;`
			`struct ctdb_call *call;`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`struct timeval start_time;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`};`

			`/*`
			`complete a call from a client`
			`*/`
			`static void daemon_call_from_client_callback(struct ctdb_call_state *state)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`struct daemon_call_state *dstate = talloc_get_type(state->async.private_data,`
			`struct daemon_call_state);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`struct ctdb_reply_call *r;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`int res;`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`uint32_t length;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`struct ctdb_client *client = dstate->client;`
we actually need a ctdb_db variable (This used to be ctdb commit aba984f1b85f5a2d370b093061cf15843ee53758) 2008-11-03 13:54:52 +03:00			`struct ctdb_db_context *ctdb_db = state->ctdb_db;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`talloc_steal(client, dstate);`
			`talloc_steal(dstate, dstate->call);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`res = ctdb_daemon_call_recv(state, dstate->call);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " ctdbd_call_recv() returned error\n"));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(client->ctdb, pending_calls);`

Update latency countes to show min/max and average (This used to be ctdb commit 1919e949af4641ffe919123e44b02fb87c13ab9f) 2010-10-11 08:11:18 +04:00			`CTDB_UPDATE_LATENCY(client->ctdb, ctdb_db, "call_from_client_cb 1", call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`length = offsetof(struct ctdb_reply_call, data) + dstate->call->reply_data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdbd_allocate_pkt(client->ctdb, dstate, CTDB_REPLY_CALL,`
			`length, struct ctdb_reply_call);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`if (r == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Failed to allocate reply_call in ctdb daemon\n"));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(client->ctdb, pending_calls);`
Update latency countes to show min/max and average (This used to be ctdb commit 1919e949af4641ffe919123e44b02fb87c13ab9f) 2010-10-11 08:11:18 +04:00			`CTDB_UPDATE_LATENCY(client->ctdb, ctdb_db, "call_from_client_cb 2", call_latency, dstate->start_time);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`return;`
			`}`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`r->hdr.reqid = dstate->reqid;`
			`r->datalen = dstate->call->reply_data.dsize;`
			`memcpy(&r->data[0], dstate->call->reply_data.dptr, r->datalen);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`res = daemon_queue_send(client, &r->hdr);`
ctdb: when we fill the client packet queue we need to drop the client We can't just drop packets to the list, as those packets could be part of the core protocol the client is using. This happens (for example) when Samba is doing a traverse. If we drop a traverse packet then Samba hangs indefinately. We are better off dropping the ctdb socket to Samba. (This used to be ctdb commit a7a86dafa4d88a6bbc6a71b77ed79a178fd802a6) 2010-02-04 06:36:14 +03:00			`if (res == -1) {`
			`/* client is dead - return immediately */`
			`return;`
			`}`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Failed to queue packet from daemon to client\n"));`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`
Update latency countes to show min/max and average (This used to be ctdb commit 1919e949af4641ffe919123e44b02fb87c13ab9f) 2010-10-11 08:11:18 +04:00			`CTDB_UPDATE_LATENCY(client->ctdb, ctdb_db, "call_from_client_cb 3", call_latency, dstate->start_time);`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(client->ctdb, pending_calls);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`talloc_free(dstate);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`struct ctdb_daemon_packet_wrap {`
			`struct ctdb_context *ctdb;`
			`uint32_t client_id;`
			`};`

			`/*`
			`a wrapper to catch disconnected clients`
			`*/`
			`static void daemon_incoming_packet_wrap(void p, struct ctdb_req_header hdr)`
			`{`
			`struct ctdb_client *client;`
			`struct ctdb_daemon_packet_wrap *w = talloc_get_type(p,`
			`struct ctdb_daemon_packet_wrap);`
			`if (w == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,(__location__ " Bad packet type '%s'\n", talloc_get_name(p)));`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`return;`
			`}`

			`client = ctdb_reqid_find(w->ctdb, w->client_id, struct ctdb_client);`
			`if (client == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Packet for disconnected client %u\n",`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`w->client_id));`
			`talloc_free(w);`
			`return;`
			`}`
			`talloc_free(w);`

			`/* process it */`
			`daemon_incoming_packet(client, hdr);`
			`}`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00
merged ronnies code to delay client requests when in recovery mode (This used to be ctdb commit dfca37076d642f3407c63dfe3b685287d27c8f8d) 2007-05-10 01:43:18 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`/*`
			`this is called when the ctdb daemon received a ctdb request call`
			`from a local client over the unix domain socket`
			`*/`
			`static void daemon_request_call_from_client(struct ctdb_client *client,`
			`struct ctdb_req_call *c)`
			`{`
			`struct ctdb_call_state *state;`
			`struct ctdb_db_context *ctdb_db;`
			`struct daemon_call_state *dstate;`
			`struct ctdb_call *call;`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`struct ctdb_ltdb_header header;`
			`TDB_DATA key, data;`
			`int ret;`
			`struct ctdb_context *ctdb = client->ctdb;`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`struct ctdb_daemon_packet_wrap *w;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, total_calls);`
			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`ctdb_db = find_ctdb_db(client->ctdb, c->db_id);`
			`if (!ctdb_db) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Unknown database in request. db_id==0x%08x",`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`c->db_id));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`

server: Use tdb_check to verify persistent tdbs on startup Depending on --max-persistent-check-errors we allow ctdb to start with unhealthy persistent databases. The default is 0 which means to reject a startup with unhealthy dbs. The health of the persistent databases is checked after each recovery. Node monitoring and the "startup" is deferred until all persistent databases are healthy. Databases can become healthy automaticly by a completely HEALTHY node joining the cluster. Or by an administrator with "ctdb backupdb/restoredb" or "ctdb wipedb". metze (This used to be ctdb commit 15f133d5150ed1badb4fef7d644f10cd08a25cb5) 2009-12-07 15:28:11 +03:00			`if (ctdb_db->unhealthy_reason) {`
			`/*`
			`* this is just a warning, as the tdb should be empty anyway,`
			`* and only persistent databases can be unhealthy, which doesn't`
			`* use this code patch`
			`*/`
			`DEBUG(DEBUG_WARNING,("warn: db(%s) unhealty in daemon_request_call_from_client(): %s\n",`
			`ctdb_db->db_name, ctdb_db->unhealthy_reason));`
			`}`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`key.dptr = c->data;`
			`key.dsize = c->keylen;`

- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`w = talloc(ctdb, struct ctdb_daemon_packet_wrap);`
			`CTDB_NO_MEMORY_VOID(ctdb, w);`

			`w->ctdb = ctdb;`
			`w->client_id = client->client_id;`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`ret = ctdb_ltdb_lock_fetch_requeue(ctdb_db, key, &header,`
			`(struct ctdb_req_header *)c, &data,`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`daemon_incoming_packet_wrap, w, True);`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`if (ret == -2) {`
			`/* will retry later */`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`return;`
			`}`

- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`talloc_free(w);`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to fetch record\n"));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`return;`
			`}`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`dstate = talloc(client, struct daemon_call_state);`
			`if (dstate == NULL) {`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`

merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to allocate dstate\n"));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`dstate->start_time = timeval_current();`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`dstate->client = client;`
			`dstate->reqid = c->hdr.reqid;`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`talloc_steal(dstate, data.dptr);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
			`call = dstate->call = talloc_zero(dstate, struct ctdb_call);`
			`if (call == NULL) {`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`

merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to allocate call\n"));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
Update latency countes to show min/max and average (This used to be ctdb commit 1919e949af4641ffe919123e44b02fb87c13ab9f) 2010-10-11 08:11:18 +04:00			`CTDB_UPDATE_LATENCY(ctdb, ctdb_db, "call_from_client 1", call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`

			`call->call_id = c->callid;`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`call->key = key;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`call->call_data.dptr = c->data + c->keylen;`
			`call->call_data.dsize = c->calldatalen;`
The remote node needs to get the IMMEDIATE_MIGRATION flag to actually send the record (This used to be ctdb commit 9159434b1eef39b7de58b30626039f1e45a97306) 2007-04-19 19:44:45 +04:00			`call->flags = c->flags;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (header.dmaster == ctdb->pnn) {`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`state = ctdb_call_local_send(ctdb_db, call, &header, &data);`
			`} else {`
			`state = ctdb_daemon_call_send_remote(ctdb_db, call, &header);`
			`}`

add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to setup call send\n"));`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_DECREMENT_STAT(ctdb, pending_calls);`
Update latency countes to show min/max and average (This used to be ctdb commit 1919e949af4641ffe919123e44b02fb87c13ab9f) 2010-10-11 08:11:18 +04:00			`CTDB_UPDATE_LATENCY(ctdb, ctdb_db, "call_from_client 2", call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`
			`talloc_steal(state, dstate);`
			`talloc_steal(client, state);`

			`state->async.fn = daemon_call_from_client_callback;`
			`state->async.private_data = dstate;`
			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
			`static void daemon_request_control_from_client(struct ctdb_client *client,`
			`struct ctdb_req_control *c);`

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`/* data contains a packet from the client */`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`static void daemon_incoming_packet(void p, struct ctdb_req_header hdr)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`struct ctdb_client *client = talloc_get_type(p, struct ctdb_client);`
- merge volkers debug changes - fixed memory leaks in the 3 packet receive routines. The problem was that the ctdb_call logic would occasionally complete and free a incoming packet, which would then be freed again in the packet receive routine. The solution is to make the packet a child of a temporary context in the receive routine then free that temporary context. That allows other routines to keep or free the packet if they want to, while allowing us to safely free it (via a free of the temporary context) in the receive function (This used to be ctdb commit 304aaaa7235febbe97ff9ecb43875b7265ac48cd) 2007-04-18 05:20:24 +04:00			`TALLOC_CTX *tmp_ctx;`
- expanded status to include count of each call type - added lockwait latency (This used to be ctdb commit 0b5d196147e644cf8b172cb4b593fd46b1caa386) 2007-04-20 15:02:53 +04:00			`struct ctdb_context *ctdb = client->ctdb;`
- merge volkers debug changes - fixed memory leaks in the 3 packet receive routines. The problem was that the ctdb_call logic would occasionally complete and free a incoming packet, which would then be freed again in the packet receive routine. The solution is to make the packet a child of a temporary context in the receive routine then free that temporary context. That allows other routines to keep or free the packet if they want to, while allowing us to safely free it (via a free of the temporary context) in the receive function (This used to be ctdb commit 304aaaa7235febbe97ff9ecb43875b7265ac48cd) 2007-04-18 05:20:24 +04:00
			`/* place the packet as a child of a tmp_ctx. We then use`
			`talloc_free() below to free it. If any of the calls want`
			`to keep it, then they will steal it somewhere else, and the`
			`talloc_free() will be a no-op */`
			`tmp_ctx = talloc_new(client);`
			`talloc_steal(tmp_ctx, hdr);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`if (hdr->ctdb_magic != CTDB_MAGIC) {`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00			`ctdb_set_error(client->ctdb, "Non CTDB packet rejected in daemon\n");`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`goto done;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

			`if (hdr->ctdb_version != CTDB_VERSION) {`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00			`ctdb_set_error(client->ctdb, "Bad CTDB version 0x%x rejected in daemon\n", hdr->ctdb_version);`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`goto done;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

			`switch (hdr->operation) {`
			`case CTDB_REQ_CALL:`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, client.req_call);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`daemon_request_call_from_client(client, (struct ctdb_req_call *)hdr);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`break;`

add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`case CTDB_REQ_MESSAGE:`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, client.req_message);`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`daemon_request_message_from_client(client, (struct ctdb_req_message *)hdr);`
			`break;`
add proper support for ctdb_connect_wait in daemon mode (This used to be ctdb commit 8d110df5939b3e6a6341909956453887f4eb6b0d) 2007-04-11 08:54:47 +04:00
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`case CTDB_REQ_CONTROL:`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, client.req_control);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`daemon_request_control_from_client(client, (struct ctdb_req_control *)hdr);`
			`break;`

add store_unlock pdu's for the domain socket. note that the store_unlock does not actually do anything yet apart from passing the pdu from client to daemon and daemon responds. next is to make sure the daemon actually stores the data in a database (This used to be ctdb commit 167d6993e78f6a1d0f6607ef66925a14993ae6a1) 2007-04-13 03:41:15 +04:00			`default:`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,(__location__ " daemon: unrecognized operation %u\n",`
more DEBUG() calls (This used to be ctdb commit 79f3d63eec5652d87f13875c76e90ead81a26ad9) 2007-04-17 16:27:17 +04:00			`hdr->operation));`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`done:`
- merge volkers debug changes - fixed memory leaks in the 3 packet receive routines. The problem was that the ctdb_call logic would occasionally complete and free a incoming packet, which would then be freed again in the packet receive routine. The solution is to make the packet a child of a temporary context in the receive routine then free that temporary context. That allows other routines to keep or free the packet if they want to, while allowing us to safely free it (via a free of the temporary context) in the receive function (This used to be ctdb commit 304aaaa7235febbe97ff9ecb43875b7265ac48cd) 2007-04-18 05:20:24 +04:00			`talloc_free(tmp_ctx);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`/*`
			`called when the daemon gets a incoming packet`
			`*/`
- removed the non-daemon mode from ctdb, in order to simplify the code. It may be added back later once everything is working nicely, or simulated using a in-process pipe instead of a unix domain socket - rewrote the ctdb_fetch_lock() code to follow the new design (This used to be ctdb commit 5024dd1f305fe1ecc262db2240c56f773b4f28f0) 2007-04-17 08:52:51 +04:00			`static void ctdb_daemon_read_cb(uint8_t data, size_t cnt, void args)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`struct ctdb_client *client = talloc_get_type(args, struct ctdb_client);`
			`struct ctdb_req_header *hdr;`

Handle a client that exited correctly: We need to ignore SIGPIPE and when the read returns 0 bytes this means the client has exited. Close the connection then. (This used to be ctdb commit bd10f4e62146493848258df8a3dc3b9222337a12) 2007-04-11 15:17:36 +04:00			`if (cnt == 0) {`
			`talloc_free(client);`
			`return;`
			`}`

Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(client->ctdb, client_packets_recv);`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`if (cnt < sizeof(*hdr)) {`
fixed some warnings (This used to be ctdb commit b5434a40cf2db008eb1e681fcd2ceeff331324fa) 2007-04-28 13:35:49 +04:00			`ctdb_set_error(client->ctdb, "Bad packet length %u in daemon\n",`
			`(unsigned)cnt);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`hdr = (struct ctdb_req_header *)data;`
			`if (cnt != hdr->length) {`
removed some bogus debug lines (This used to be ctdb commit 25aa579058ecd2a33b13b4c1d6c7c75427bbdafa) 2007-04-26 20:31:13 +04:00			`ctdb_set_error(client->ctdb, "Bad header length %u expected %u\n in daemon",`
fixed some warnings (This used to be ctdb commit b5434a40cf2db008eb1e681fcd2ceeff331324fa) 2007-04-28 13:35:49 +04:00			`(unsigned)hdr->length, (unsigned)cnt);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`

change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`if (hdr->ctdb_magic != CTDB_MAGIC) {`
			`ctdb_set_error(client->ctdb, "Non CTDB packet rejected\n");`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`

change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`if (hdr->ctdb_version != CTDB_VERSION) {`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00			`ctdb_set_error(client->ctdb, "Bad CTDB version 0x%x rejected in daemon\n", hdr->ctdb_version);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`

added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_DEBUG,(__location__ " client request %u of type %u length %u from "`
fixed %d which should be %u (This used to be ctdb commit 2792cf718ff1e66fe99f870f683a13baa160f629) 2007-05-23 14:15:09 +04:00			`"node %u to %u\n", hdr->reqid, hdr->operation, hdr->length,`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`hdr->srcnode, hdr->destnode));`

change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`/* it is the responsibility of the incoming packet function to free 'data' */`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`daemon_incoming_packet(client, hdr);`
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
Add a double linked list to the ctdb_context to store a mapping between client pids and client structures. Add the mapping to the list everytime we accept() a new client connection and set it up to remove in the destructor when the client structure is freed. (This used to be ctdb commit f75d379377f5d4abbff2576ddc5d58d91dc53bf4) 2009-12-02 05:41:04 +03:00
			`static int ctdb_clientpid_destructor(struct ctdb_client_pid_list *client_pid)`
			`{`
			`if (client_pid->ctdb->client_pids != NULL) {`
			`DLIST_REMOVE(client_pid->ctdb->client_pids, client_pid);`
			`}`

			`return 0;`
			`}`


make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`static void ctdb_accept_client(struct event_context ev, struct fd_event fde,`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`uint16_t flags, void *private_data)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
initial ipv6 patch Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com> (This used to be ctdb commit 1f131f21386f428bbbbb29098d56c2f64596583b) 2008-08-19 08:58:29 +04:00			`struct sockaddr_un addr;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`socklen_t len;`
			`int fd;`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`struct ctdb_client *client;`
Add a double linked list to the ctdb_context to store a mapping between client pids and client structures. Add the mapping to the list everytime we accept() a new client connection and set it up to remove in the destructor when the client structure is freed. (This used to be ctdb commit f75d379377f5d4abbff2576ddc5d58d91dc53bf4) 2009-12-02 05:41:04 +03:00			`struct ctdb_client_pid_list *client_pid;`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#ifdef _AIX`
			`struct peercred_struct cr;`
			`socklen_t crl = sizeof(struct peercred_struct);`
			`#else`
decorate the memdump output with a nice field for ctdb_client structures to show the pid of the client that attached (This used to be ctdb commit 0d9314302d0b988b6ab5d533deef40c5b343c249) 2008-04-01 10:17:21 +04:00			`struct ucred cr;`
			`socklen_t crl = sizeof(struct ucred);`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#endif`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`memset(&addr, 0, sizeof(addr));`
			`len = sizeof(addr);`
			`fd = accept(ctdb->daemon.sd, (struct sockaddr *)&addr, &len);`
			`if (fd == -1) {`
			`return;`
			`}`
close sockets when we exec scripts (This used to be ctdb commit 0fac2164db4279db2d7d376a34be05b890304087) 2007-05-30 09:43:25 +04:00
			`set_nonblocking(fd);`
			`set_close_on_exec(fd);`
add logging everytime we create a filedescriptor in the main ctdb daemon so we can spot if there are leaks. plug two leaks for filedescriptors related to when sending ARP fail and one leak when we can not parse the local address during tcp connection establish (This used to be ctdb commit ddd089810a14efe4be6e1ff3eccaa604e4913c9e) 2009-10-15 04:24:54 +04:00
lower the debug levels for the "create FD messages" so we dont fill up the logs. (This used to be ctdb commit 87146db2769c2ec494813685bf9cec0d2a6336c3) 2009-10-21 08:26:24 +04:00			`DEBUG(DEBUG_DEBUG,(__location__ " Created SOCKET FD:%d to connected child\n", fd));`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`client = talloc_zero(ctdb, struct ctdb_client);`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#ifdef _AIX`
			`if (getsockopt(fd, SOL_SOCKET, SO_PEERID, &cr, &crl) == 0) {`
			`#else`
decorate the memdump output with a nice field for ctdb_client structures to show the pid of the client that attached (This used to be ctdb commit 0d9314302d0b988b6ab5d533deef40c5b343c249) 2008-04-01 10:17:21 +04:00			`if (getsockopt(fd, SOL_SOCKET, SO_PEERCRED, &cr, &crl) == 0) {`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#endif`
lower the loglevel for the message that a client has attached through a domian socket (This used to be ctdb commit de9e5236b20d70eac5ed29991703d6d25a103963) 2009-12-02 06:51:57 +03:00			`DEBUG(DEBUG_INFO,("Connected client with pid:%u\n", (unsigned)cr.pid));`
add improvements to tracking memory usage in ctdbd adn the recovery daemon and a ctdb command to pull the talloc memory map from a recovery daemon ctdb rddumpmemory (This used to be ctdb commit d23950be7406cf288f48b660c0f57a9b8d7bdd05) 2008-04-01 08:34:54 +04:00			`}`

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`client->ctdb = ctdb;`
			`client->fd = fd;`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`client->client_id = ctdb_reqid_new(ctdb, client);`
Use the PID we pick up from the domain socket when a client connects and store this in the client structure. There is no need to rely on the hack that samba sends some special message handle registrations that encodes the pid in the srvid any more. This might not work on AIX since I recall some issues to get the pid in this way on that platform. (This used to be ctdb commit b4a7efa7e53e060a91dea0e8e57b116e2aeacebf) 2009-12-02 05:17:12 +03:00			`client->pid = cr.pid;`
Add a double linked list to the ctdb_context to store a mapping between client pids and client structures. Add the mapping to the list everytime we accept() a new client connection and set it up to remove in the destructor when the client structure is freed. (This used to be ctdb commit f75d379377f5d4abbff2576ddc5d58d91dc53bf4) 2009-12-02 05:41:04 +03:00
			`client_pid = talloc(client, struct ctdb_client_pid_list);`
			`if (client_pid == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate client pid structure\n"));`
			`close(fd);`
			`talloc_free(client);`
			`return;`
			`}`
			`client_pid->ctdb = ctdb;`
			`client_pid->pid = cr.pid;`
			`client_pid->client = client;`

			`DLIST_ADD(ctdb->client_pids, client_pid);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
made all sockets handle partial IO abstract IO via ctdb_queue_*() functions (This used to be ctdb commit 636ae76f4632b29231db87be32c9114f58b37840) 2007-04-10 13:33:21 +04:00			`client->queue = ctdb_queue_setup(ctdb, client, fd, CTDB_DS_ALIGNMENT,`
Report client for queue errors. We've been seeing "Invalid packet of length 0" errors, but we don't know what is sending them. Add a name for each queue, and print nread. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e6cf0e8f14f4263fbd8b995418909199924827e9) 2010-07-01 17:08:49 +04:00			`ctdb_daemon_read_cb, client,`
			`"client-%u", client->pid);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`talloc_set_destructor(client, ctdb_client_destructor);`
Add a double linked list to the ctdb_context to store a mapping between client pids and client structures. Add the mapping to the list everytime we accept() a new client connection and set it up to remove in the destructor when the client structure is freed. (This used to be ctdb commit f75d379377f5d4abbff2576ddc5d58d91dc53bf4) 2009-12-02 05:41:04 +03:00			`talloc_set_destructor(client_pid, ctdb_clientpid_destructor);`
Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, num_clients);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`



			`/*`
			`create a unix domain socket and bind it`
			`return a file descriptor open on the socket`
			`*/`
			`static int ux_socket_bind(struct ctdb_context *ctdb)`
			`{`
			`struct sockaddr_un addr;`

			`ctdb->daemon.sd = socket(AF_UNIX, SOCK_STREAM, 0);`
			`if (ctdb->daemon.sd == -1) {`
			`return -1;`
			`}`

close sockets when we exec scripts (This used to be ctdb commit 0fac2164db4279db2d7d376a34be05b890304087) 2007-05-30 09:43:25 +04:00			`set_close_on_exec(ctdb->daemon.sd);`
			`set_nonblocking(ctdb->daemon.sd);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`memset(&addr, 0, sizeof(addr));`
			`addr.sun_family = AF_UNIX;`
			`strncpy(addr.sun_path, ctdb->daemon.name, sizeof(addr.sun_path));`

			`if (bind(ctdb->daemon.sd, (struct sockaddr *)&addr, sizeof(addr)) == -1) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,("Unable to bind on ctdb socket '%s'\n", ctdb->daemon.name));`
make sure the ctdb control socket is secure (This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861) 2007-05-13 03:20:16 +04:00			`goto failed;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`
From Chris Cowan secure the domain socket and set permissions properly (This used to be ctdb commit ac6a362fc2fc4a56b4c310478a96eb12daace176) 2008-04-10 00:51:53 +04:00
			`if (chown(ctdb->daemon.name, geteuid(), getegid()) != 0 \|\|`
			`chmod(ctdb->daemon.name, 0700) != 0) {`
fix compiler warning during a fatal error failing to lock down the socket (This used to be ctdb commit 0ad22de1a614dc2d1926546027be5f5eea3381ed) 2008-04-10 03:56:49 +04:00			`DEBUG(DEBUG_CRIT,("Unable to secure ctdb socket '%s', ctdb->daemon.name\n", ctdb->daemon.name));`
From Chris Cowan secure the domain socket and set permissions properly (This used to be ctdb commit ac6a362fc2fc4a56b4c310478a96eb12daace176) 2008-04-10 00:51:53 +04:00			`goto failed;`
			`}`


increase the listen queue. Now that the eventscripts may become clients and connect back to the server we do get a lot more concurrent connection attempts (takepip/teleaseip are performed in parallell) (This used to be ctdb commit 018f8b0b1823ef59b46f1a671aec5309d10628f4) 2009-04-06 08:00:41 +04:00			`if (listen(ctdb->daemon.sd, 100) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,("Unable to listen on ctdb socket '%s'\n", ctdb->daemon.name));`
make sure the ctdb control socket is secure (This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861) 2007-05-13 03:20:16 +04:00			`goto failed;`
			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`return 0;`
make sure the ctdb control socket is secure (This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861) 2007-05-13 03:20:16 +04:00
			`failed:`
			`close(ctdb->daemon.sd);`
			`ctdb->daemon.sd = -1;`
			`return -1;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00			`static void sig_child_handler(struct event_context *ev,`
			`struct signal_event *se, int signum, int count,`
			`void *dont_care,`
			`void *private_data)`
			`{`
			`// struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
			`int status;`
			`pid_t pid = -1;`

			`while (pid != 0) {`
			`pid = waitpid(-1, &status, WNOHANG);`
			`if (pid == -1) {`
			`DEBUG(DEBUG_ERR, (__location__ " waitpid() returned error. errno:%d\n", errno));`
			`return;`
			`}`
			`if (pid > 0) {`
			`DEBUG(DEBUG_DEBUG, ("SIGCHLD from %d\n", (int)pid));`
			`}`
			`}`
			`}`

server: add "setup" event This is needed because the "init" event can't use 'ctdb' commands. metze (This used to be ctdb commit 1493436b6b24eb05a23b7a339071ad85f70de8f4) 2010-02-12 13:24:08 +03:00			`static void ctdb_setup_event_callback(struct ctdb_context *ctdb, int status,`
			`void *private_data)`
			`{`
			`if (status != 0) {`
			`ctdb_fatal(ctdb, "Failed to run setup event\n");`
			`return;`
			`}`
			`ctdb_run_notification_script(ctdb, "setup");`

			`/* tell all other nodes we've just started up */`
			`ctdb_daemon_send_control(ctdb, CTDB_BROADCAST_ALL,`
			`0, CTDB_CONTROL_STARTUP, 0,`
			`CTDB_CTRL_FLAG_NOREPLY,`
			`tdb_null, NULL, NULL);`
			`}`

yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`/*`
			`start the protocol going as a daemon`
			`*/`
delay loading the public ip address file until after we have started the transport and discovered ouw own pnn number (This used to be ctdb commit 1b57fc866fc836b5dbd3ef7b646e5a0f4280e81e) 2010-11-10 04:59:25 +03:00			`int ctdb_start_daemon(struct ctdb_context ctdb, bool do_fork, bool use_syslog, const char public_address_list)`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`{`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`int res, ret = -1;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`struct fd_event *fde;`
			`const char *domain_socket_name;`
proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00			`struct signal_event *se;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`/* get rid of any old sockets */`
			`unlink(ctdb->daemon.name);`

			`/* create a unix domain stream socket to listen to */`
			`res = ux_socket_bind(ctdb);`
			`if (res!=0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,(__location__ " Failed to open CTDB unix domain socket\n"));`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`exit(10);`
			`}`

added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`if (do_fork && fork()) {`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`return 0;`
			`}`

			`tdb_reopen_all(False);`

added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`if (do_fork) {`
			`setsid();`
fixed the bug that make "onnode N service ctdb start" hang (This used to be ctdb commit b50dcb16f30a60abce42f491f9b0aae7948b8206) 2008-01-05 04:09:29 +03:00			`close(0);`
			`if (open("/dev/null", O_RDONLY) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,(__location__ " Failed to setup stdin on /dev/null\n"));`
fixed the bug that make "onnode N service ctdb start" hang (This used to be ctdb commit b50dcb16f30a60abce42f491f9b0aae7948b8206) 2008-01-05 04:09:29 +03:00			`exit(11);`
			`}`
added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`block_signal(SIGPIPE);`

for debugging add a global variable holding the pid of the main daemon. change the tracking of time() in the event loop to only check/warn when called from the main daemon (This used to be ctdb commit a10fc51f4c30e85ada6d4b7347b0f9a8ebc76637) 2009-10-27 05:18:52 +03:00			`ctdbd_pid = getpid();`
daemon: fill ctdb->ctdbd_pid early (This used to be ctdb commit 3da1e2e30bf34622f08e6ecd5b8fe55684e5007a) 2010-12-28 15:14:23 +03:00			`ctdb->ctdbd_pid = ctdbd_pid;`
start the syslog child a little later, after we have forked and detached from the local shell (This used to be ctdb commit 9ffd54b73c0d64b67e8e736d7cb54490e77ffa78) 2009-10-30 11:39:11 +03:00

for debugging add a global variable holding the pid of the main daemon. change the tracking of time() in the event loop to only check/warn when called from the main daemon (This used to be ctdb commit a10fc51f4c30e85ada6d4b7347b0f9a8ebc76637) 2009-10-27 05:18:52 +03:00			`DEBUG(DEBUG_ERR, ("Starting CTDBD as pid : %u\n", ctdbd_pid));`

Revert scheduling back to use real-time processes Revert this patch: commit 482c302d46e2162d0cf552f8456bc49573ae729d We may need to use real-time processes for the main daemon and the recovery daemon to handle the cases where systems come under very high loads. (This used to be ctdb commit 08bef9dcab6e4da15fc783f8624e5ed09aa060b5) 2011-01-10 05:35:39 +03:00			`if (ctdb->do_setsched) {`
			`/* try to set us up as realtime */`
			`ctdb_set_scheduler(ctdb);`
			`}`
make ctdbd realtime if possible (This used to be ctdb commit 8852f6cca52b64a5239c83ab7c6a99ae4edb2597) 2007-05-24 08:52:10 +04:00
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`/* ensure the socket is deleted on exit of the daemon */`
			`domain_socket_name = talloc_strdup(talloc_autofree_context(), ctdb->daemon.name);`
add missing checks on so far ignored return values Most of these were found during a review by Jim Meyering <meyering@redhat.com> (This used to be ctdb commit 3aee5ee1deb4a19be3bd3a4ce3abbe09de763344) 2009-05-20 14:08:13 +04:00			`if (domain_socket_name == NULL) {`
			`DEBUG(DEBUG_ALERT,(__location__ " talloc_strdup failed.\n"));`
			`exit(12);`
			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`ctdb->ev = event_context_init(NULL);`
We use eventloop nesting in a couple of places, notably the sync parts of the recovery daemon. Initialize all event contexts to allow nesting (This used to be ctdb commit 5bf6bd5e7f33aabbeb7b9707716ef99cf471e590) 2010-08-18 04:11:59 +04:00			`tevent_loop_allow_nesting(ctdb->ev);`
set up a handler to catch and log debug messages from the tevent layer (This used to be ctdb commit fdb4c02f595fa207310a9a48da3fefd653fa9e4b) 2010-09-22 04:59:01 +04:00			`ret = ctdb_init_tevent_logging(ctdb);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ALERT,("Failed to initialize TEVENT logging\n"));`
			`exit(1);`
			`}`
add a command line flag to ctdbd to start a recovery daemon. update the recovery test script to start all ctdb daemons with a recovery daemon (This used to be ctdb commit 47794e16df285cacefc30208d892d931a6e46b96) 2007-05-09 03:59:23 +04:00
added syslog support, and use a pipe to catch logging from child processes to the ctdbd logging functions (This used to be ctdb commit 1306b04cd01e996fd1aa1159a9521f2ff7b06165) 2008-01-16 14:03:01 +03:00			`ctdb_set_child_logging(ctdb);`

Add rolling statistics that are collected across 10 second intervals. Add a new command "ctdb stats [num]" that prints the [num] most recent statistics intervals collected. (This used to be ctdb commit e6e16fcd5a45ebd3739a8160c8fb5f44494edb9e) 2010-09-29 06:13:05 +04:00			`/* initialize statistics collection */`
			`ctdb_statistics_init(ctdb);`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* force initial recovery for election */`
			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`

			`if (strcmp(ctdb->transport, "tcp") == 0) {`
			`int ctdb_tcp_init(struct ctdb_context *);`
			`ret = ctdb_tcp_init(ctdb);`
			`}`
			`#ifdef USE_INFINIBAND`
			`if (strcmp(ctdb->transport, "ib") == 0) {`
			`int ctdb_ibw_init(struct ctdb_context *);`
			`ret = ctdb_ibw_init(ctdb);`
			`}`
			`#endif`
			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Failed to initialise transport '%s'\n", ctdb->transport));`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`return -1;`
			`}`

ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_ALERT,(__location__ " Can not initialize transport. ctdb->methods is NULL\n"));`
			`ctdb_fatal(ctdb, "transport is unavailable. can not initialize.");`
			`}`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* initialise the transport */`
			`if (ctdb->methods->initialise(ctdb) != 0) {`
			`ctdb_fatal(ctdb, "transport failed to initialise");`
			`}`
delay loading the public ip address file until after we have started the transport and discovered ouw own pnn number (This used to be ctdb commit 1b57fc866fc836b5dbd3ef7b646e5a0f4280e81e) 2010-11-10 04:59:25 +03:00			`if (public_address_list) {`
			`ret = ctdb_set_public_addresses(ctdb, public_address_list);`
			`if (ret == -1) {`
			`DEBUG(DEBUG_ALERT,("Unable to setup public address list\n"));`
			`exit(1);`
			`}`
			`}`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00
server: only do the mkdir() calls for db_directory* once at the start metze (This used to be ctdb commit f30f33685db50860b6cd6fd1b6bdc3066620a78f) 2009-11-29 14:39:23 +03:00			`/* attach to existing databases */`
			`if (ctdb_attach_databases(ctdb) != 0) {`
			`ctdb_fatal(ctdb, "Failed to attach to databases\n");`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`}`

server: add "init" event This is needed because the "startup" event runs after the initial recovery, but we need to do some actions before the initial recovery. metze (This used to be ctdb commit e953808449c102258abb6cba6f4abf486dda3b82) 2010-01-19 12:07:14 +03:00			`ret = ctdb_event_script(ctdb, CTDB_EVENT_INIT);`
			`if (ret != 0) {`
			`ctdb_fatal(ctdb, "Failed to run init event\n");`
			`}`
			`ctdb_run_notification_script(ctdb, "init");`

run the "init" event before we freeze the databases so that we can read from databases during this event (This used to be ctdb commit 6c93bf5a1219617bfb39b093aee3200c74c2c61a) 2010-08-25 02:34:35 +04:00			`/* start frozen, then let the first election sort things out */`
			`if (ctdb_blocking_freeze(ctdb)) {`
			`ctdb_fatal(ctdb, "Failed to get initial freeze\n");`
			`}`

start ctdb frozen, and let the election sort things out. This prevents a race on startup (This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd) 2007-05-23 06:23:07 +04:00			`/* now start accepting clients, only can do this once frozen */`
			`fde = event_add_fd(ctdb->ev, ctdb, ctdb->daemon.sd,`
event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version 7f29f817fa939ef1bbb740584f09e76e2ecd5b06. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726) 2010-08-18 03:46:31 +04:00			`EVENT_FD_READ,`
start ctdb frozen, and let the election sort things out. This prevents a race on startup (This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd) 2007-05-23 06:23:07 +04:00			`ctdb_accept_client, ctdb);`
event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version 7f29f817fa939ef1bbb740584f09e76e2ecd5b06. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726) 2010-08-18 03:46:31 +04:00			`tevent_fd_set_auto_close(fde);`
start ctdb frozen, and let the election sort things out. This prevents a race on startup (This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd) 2007-05-23 06:23:07 +04:00
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* release any IPs we hold from previous runs of the daemon */`
Add a new tunable : DisableIPFailover that when set to non 0 will stopp any ip reallocations at all from happening. (This used to be ctdb commit d8d37493478a26c5f1809a5f3df89ffd6e149281) 2010-11-09 07:19:06 +03:00			`if (ctdb->tunable.disable_ip_failover == 0) {`
			`ctdb_release_all_ips(ctdb);`
			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`/* start the transport going */`
			`ctdb_start_transport(ctdb);`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00
proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00			`/* set up a handler to pick up sigchld */`
			`se = event_add_signal(ctdb->ev, ctdb,`
			`SIGCHLD, 0,`
			`sig_child_handler,`
			`ctdb);`
			`if (se == NULL) {`
			`DEBUG(DEBUG_CRIT,("Failed to set up signal handler for SIGCHLD\n"));`
			`exit(1);`
			`}`
start the syslog child a little later, after we have forked and detached from the local shell (This used to be ctdb commit 9ffd54b73c0d64b67e8e736d7cb54490e77ffa78) 2009-10-30 11:39:11 +03:00
server: add "setup" event This is needed because the "init" event can't use 'ctdb' commands. metze (This used to be ctdb commit 1493436b6b24eb05a23b7a339071ad85f70de8f4) 2010-02-12 13:24:08 +03:00			`ret = ctdb_event_script_callback(ctdb,`
			`ctdb,`
			`ctdb_setup_event_callback,`
			`ctdb,`
			`false,`
			`CTDB_EVENT_SETUP,`
			`"");`
			`if (ret != 0) {`
			`DEBUG(DEBUG_CRIT,("Failed to set up 'setup' event\n"));`
			`exit(1);`
			`}`

start the syslog child a little later, after we have forked and detached from the local shell (This used to be ctdb commit 9ffd54b73c0d64b67e8e736d7cb54490e77ffa78) 2009-10-30 11:39:11 +03:00			`if (use_syslog) {`
			`if (start_syslog_daemon(ctdb)) {`
			`DEBUG(DEBUG_CRIT, ("Failed to start syslog daemon\n"));`
			`exit(10);`
			`}`
			`}`

ctdb: use mlockall, cautiously We don't want ctdb stalling due to paging; this can be far worse than scheduling delays. But if we simply do mlockall(MCL_FUTURE), it increases the risk that mmap (ie. tdb open) or malloc will fail, causing us to abort. This patch is a compromise: we mlock all current pages (including 10k of future stack for expansion) and then relock when a client asks us to open a TDB. We warn, but don't exit, if it fails. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 82f778e85440bc713d3f87c08ddc955d3cfce926) 2009-12-16 13:27:20 +03:00			`ctdb_lockdown_memory(ctdb);`
proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* go into a wait loop to allow other nodes to complete */`
			`event_loop_wait(ctdb->ev);`

merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,("event_loop_wait() returned. this should not happen\n"));`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`exit(1);`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`}`

factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`/*`
			`allocate a packet for use in daemon<->daemon communication`
			`*/`
			`struct ctdb_req_header _ctdb_transport_allocate(struct ctdb_context ctdb,`
			`TALLOC_CTX *mem_ctx,`
			`enum ctdb_operation operation,`
			`size_t length, size_t slength,`
			`const char *type)`
			`{`
			`int size;`
			`struct ctdb_req_header *hdr;`
don't zero beyond packet header unnecessarily (This used to be ctdb commit 4cf88ca2ce81db8fe10b0dfedb81d99a2bd93328) 2007-05-03 07:44:27 +04:00
			`length = MAX(length, slength);`
			`size = (length+(CTDB_DS_ALIGNMENT-1)) & ~(CTDB_DS_ALIGNMENT-1);`

ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Unable to allocate transport packet for operation %u of length %u. Transport is DOWN.\n",`
ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`operation, (unsigned)length));`
			`return NULL;`
			`}`

factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`hdr = (struct ctdb_req_header *)ctdb->methods->allocate_pkt(mem_ctx, size);`
			`if (hdr == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Unable to allocate transport packet for operation %u of length %u\n",`
fixed more warnings on 64 bit boxes (This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5) 2007-05-29 07:58:41 +04:00			`operation, (unsigned)length));`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`return NULL;`
			`}`
			`talloc_set_name_const(hdr, type);`
don't zero beyond packet header unnecessarily (This used to be ctdb commit 4cf88ca2ce81db8fe10b0dfedb81d99a2bd93328) 2007-05-03 07:44:27 +04:00			`memset(hdr, 0, slength);`
			`hdr->length = length;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`hdr->operation = operation;`
			`hdr->ctdb_magic = CTDB_MAGIC;`
			`hdr->ctdb_version = CTDB_VERSION;`
			`hdr->generation = ctdb->vnn_map->generation;`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`hdr->srcnode = ctdb->pnn;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00
			`return hdr;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`struct daemon_control_state {`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`struct daemon_control_state next, prev;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`struct ctdb_client *client;`
			`struct ctdb_req_control *c;`
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`uint32_t reqid;`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`struct ctdb_node *node;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`};`

			`/*`
			`callback when a control reply comes in`
			`*/`
			`static void daemon_control_callback(struct ctdb_context *ctdb,`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`int32_t status, TDB_DATA data,`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`const char *errormsg,`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`void *private_data)`
			`{`
			`struct daemon_control_state *state = talloc_get_type(private_data,`
			`struct daemon_control_state);`
			`struct ctdb_client *client = state->client;`
			`struct ctdb_reply_control *r;`
			`size_t len;`
ctdb: when we fill the client packet queue we need to drop the client We can't just drop packets to the list, as those packets could be part of the core protocol the client is using. This happens (for example) when Samba is doing a traverse. If we drop a traverse packet then Samba hangs indefinately. We are better off dropping the ctdb socket to Samba. (This used to be ctdb commit a7a86dafa4d88a6bbc6a71b77ed79a178fd802a6) 2010-02-04 06:36:14 +03:00			`int ret;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
			`/* construct a message to send to the client containing the data */`
got rid of the getdbpath call (This used to be ctdb commit 736ce5c00a1d1b47abb44c4b262b14bfba5202b1) 2007-04-27 01:10:35 +04:00			`len = offsetof(struct ctdb_reply_control, data) + data.dsize;`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`if (errormsg) {`
			`len += strlen(errormsg);`
			`}`
fixed a memory leak in the ctdb_control code (This used to be ctdb commit 70aa77a66bb5f16c93ecb122b92a6e63f6343ab1) 2007-05-02 23:51:46 +04:00			`r = ctdbd_allocate_pkt(ctdb, state, CTDB_REPLY_CONTROL, len,`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`struct ctdb_reply_control);`
			`CTDB_NO_MEMORY_VOID(ctdb, r);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`r->hdr.reqid = state->reqid;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`r->status = status;`
			`r->datalen = data.dsize;`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`r->errorlen = 0;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`memcpy(&r->data[0], data.dptr, data.dsize);`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`if (errormsg) {`
			`r->errorlen = strlen(errormsg);`
			`memcpy(&r->data[r->datalen], errormsg, r->errorlen);`
			`}`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
ctdb: when we fill the client packet queue we need to drop the client We can't just drop packets to the list, as those packets could be part of the core protocol the client is using. This happens (for example) when Samba is doing a traverse. If we drop a traverse packet then Samba hangs indefinately. We are better off dropping the ctdb socket to Samba. (This used to be ctdb commit a7a86dafa4d88a6bbc6a71b77ed79a178fd802a6) 2010-02-04 06:36:14 +03:00			`ret = daemon_queue_send(client, &r->hdr);`
			`if (ret != -1) {`
			`talloc_free(state);`
			`}`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`}`

timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`/*`
			`fail all pending controls to a disconnected node`
			`*/`
			`void ctdb_daemon_cancel_controls(struct ctdb_context ctdb, struct ctdb_node node)`
			`{`
			`struct daemon_control_state *state;`
			`while ((state = node->pending_controls)) {`
			`DLIST_REMOVE(node->pending_controls, state);`
			`daemon_control_callback(ctdb, (uint32_t)-1, tdb_null,`
			`"node is disconnected", state);`
			`}`
			`}`

			`/*`
			`destroy a daemon_control_state`
			`*/`
			`static int daemon_control_destructor(struct daemon_control_state *state)`
			`{`
			`if (state->node) {`
			`DLIST_REMOVE(state->node->pending_controls, state);`
			`}`
			`return 0;`
			`}`

added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`/*`
			`this is called when the ctdb daemon received a ctdb request control`
			`from a local client over the unix domain socket`
			`*/`
			`static void daemon_request_control_from_client(struct ctdb_client *client,`
			`struct ctdb_req_control *c)`
			`{`
			`TDB_DATA data;`
			`int res;`
			`struct daemon_control_state *state;`
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00			`TALLOC_CTX *tmp_ctx = talloc_new(client);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`if (c->hdr.destnode == CTDB_CURRENT_NODE) {`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`c->hdr.destnode = client->ctdb->pnn;`
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`}`

added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`state = talloc(client, struct daemon_control_state);`
			`CTDB_NO_MEMORY_VOID(client->ctdb, state);`

			`state->client = client;`
			`state->c = talloc_steal(state, c);`
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`state->reqid = c->hdr.reqid;`
change ctdb_validate_vnn to ctdb_validate_pnn (This used to be ctdb commit a4a1f41b69475b9dc16d8fd7f8965c32e96c32f0) 2007-09-04 04:09:58 +04:00			`if (ctdb_validate_pnn(client->ctdb, c->hdr.destnode)) {`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`state->node = client->ctdb->nodes[c->hdr.destnode];`
			`DLIST_ADD(state->node->pending_controls, state);`
			`} else {`
			`state->node = NULL;`
			`}`

			`talloc_set_destructor(state, daemon_control_destructor);`
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00
			`if (c->flags & CTDB_CTRL_FLAG_NOREPLY) {`
			`talloc_steal(tmp_ctx, state);`
			`}`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
			`data.dptr = &c->data[0];`
			`data.dsize = c->datalen;`
			`res = ctdb_daemon_send_control(client->ctdb, c->hdr.destnode,`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`c->srvid, c->opcode, client->client_id,`
			`c->flags,`
changed the way set_call and attach are done so that you can safely attach to databases after the protocol has started. The daemon broadcasts information on new databases to the other daemons. This also eliminates the need for the client to know about the hash between db name and db_id. (This used to be ctdb commit 3bad91a9d987d4c09fe3322eac23c2733660ad08) 2007-04-30 17:31:40 +04:00			`data, daemon_control_callback,`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`state);`
			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to send control to remote node %u\n",`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`c->hdr.destnode));`
			`}`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00			`talloc_free(tmp_ctx);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`}`
changed the way set_call and attach are done so that you can safely attach to databases after the protocol has started. The daemon broadcasts information on new databases to the other daemons. This also eliminates the need for the client to know about the hash between db name and db_id. (This used to be ctdb commit 3bad91a9d987d4c09fe3322eac23c2733660ad08) 2007-04-30 17:31:40 +04:00
			`/*`
			`register a call function`
			`*/`
			`int ctdb_daemon_set_call(struct ctdb_context *ctdb, uint32_t db_id,`
			`ctdb_fn_t fn, int id)`
			`{`
			`struct ctdb_registered_call *call;`
			`struct ctdb_db_context *ctdb_db;`

			`ctdb_db = find_ctdb_db(ctdb, db_id);`
			`if (ctdb_db == NULL) {`
			`return -1;`
			`}`

			`call = talloc(ctdb_db, struct ctdb_registered_call);`
			`call->fn = fn;`
			`call->id = id;`

			`DLIST_ADD(ctdb_db->calls, call);`
			`return 0;`
			`}`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00


			`/*`
			`this local messaging handler is ugly, but is needed to prevent`
			`recursion in ctdb_send_message() when the destination node is the`
			`same as the source node`
			`*/`
			`struct ctdb_local_message {`
			`struct ctdb_context *ctdb;`
			`uint64_t srvid;`
			`TDB_DATA data;`
			`};`

			`static void ctdb_local_message_trigger(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_local_message *m = talloc_get_type(private_data,`
			`struct ctdb_local_message);`
			`int res;`

			`res = ctdb_dispatch_message(m->ctdb, m->srvid, m->data);`
			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Failed to dispatch message for srvid=%llu\n",`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`(unsigned long long)m->srvid));`
			`}`
			`talloc_free(m);`
			`}`

			`static int ctdb_local_message(struct ctdb_context *ctdb, uint64_t srvid, TDB_DATA data)`
			`{`
			`struct ctdb_local_message *m;`
			`m = talloc(ctdb, struct ctdb_local_message);`
			`CTDB_NO_MEMORY(ctdb, m);`

			`m->ctdb = ctdb;`
			`m->srvid = srvid;`
			`m->data = data;`
			`m->data.dptr = talloc_memdup(m, m->data.dptr, m->data.dsize);`
			`if (m->data.dptr == NULL) {`
			`talloc_free(m);`
			`return -1;`
			`}`

			`/* this needs to be done as an event to prevent recursion */`
			`event_add_timed(ctdb->ev, m, timeval_zero(), ctdb_local_message_trigger, m);`
			`return 0;`
			`}`

			`/*`
			`send a ctdb message`
			`*/`
change debug output from vnn to pnn change ctdb_daemon_send_message to take pnn as parameter isntead of vnn (This used to be ctdb commit e352a2bbf9bb9a0b2c4f8329e8a529cf02414097) 2007-09-04 04:45:41 +04:00			`int ctdb_daemon_send_message(struct ctdb_context *ctdb, uint32_t pnn,`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`uint64_t srvid, TDB_DATA data)`
			`{`
			`struct ctdb_req_message *r;`
			`int len;`

dont even try to send a message from the main daemon if the transport is down (This used to be ctdb commit 9a2c4c3ed09ac9ea781d06999d11e5c3b5b4a97a) 2009-06-30 06:09:28 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed to send message. Transport is DOWN\n"));`
dont even try to send a message from the main daemon if the transport is down (This used to be ctdb commit 9a2c4c3ed09ac9ea781d06999d11e5c3b5b4a97a) 2009-06-30 06:09:28 +04:00			`return -1;`
			`}`

start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`/* see if this is a message to ourselves */`
change debug output from vnn to pnn change ctdb_daemon_send_message to take pnn as parameter isntead of vnn (This used to be ctdb commit e352a2bbf9bb9a0b2c4f8329e8a529cf02414097) 2007-09-04 04:45:41 +04:00			`if (pnn == ctdb->pnn) {`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`return ctdb_local_message(ctdb, srvid, data);`
			`}`

			`len = offsetof(struct ctdb_req_message, data) + data.dsize;`
			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REQ_MESSAGE, len,`
			`struct ctdb_req_message);`
			`CTDB_NO_MEMORY(ctdb, r);`

change debug output from vnn to pnn change ctdb_daemon_send_message to take pnn as parameter isntead of vnn (This used to be ctdb commit e352a2bbf9bb9a0b2c4f8329e8a529cf02414097) 2007-09-04 04:45:41 +04:00			`r->hdr.destnode = pnn;`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`r->srvid = srvid;`
			`r->datalen = data.dsize;`
			`memcpy(&r->data[0], data.dptr, data.dsize);`

			`ctdb_queue_packet(ctdb, &r->hdr);`

			`talloc_free(r);`
			`return 0;`
			`}`

Add a mechanism where we can register notifications to be sent out to a SRVID when the client disconnects. The way to use this is from a client to : 1, first create a message handle and bind it to a SRVID A special prefix for the srvid space has been set aside for samba : Only samba is allowed to use srvid's with the top 32 bits set like this. The lower 32 bits are for samba to use internally. 2, register a "notification" using the new control : CTDB_CONTROL_REGISTER_NOTIFY = 114, This control takes as indata a structure like this : struct ctdb_client_notify_register { uint64_t srvid; uint32_t len; uint8_t notify_data[1]; }; srvid is the srvid used in the space set aside above. len and notify_data is an arbitrary blob. When notifications are later sent out to all clients, this is the payload of that notification message. If a client has registered with control 114 and then disconnects from ctdbd, ctdbd will broadcast a message to that srvid to all nodes/listeners in the cluster. A client can resister itself with as many different srvid's it want, but this is handled through a linked list from the client structure so it mainly designed for "few notifications per client". 3, a client that no longer wants to have a notification set up can deregister using control CTDB_CONTROL_DEREGISTER_NOTIFY = 115, which takes this as arguments : struct ctdb_client_notify_deregister { uint64_t srvid; }; When a client deregisters, there will no longer be sent a message to all other clients when this client disconnects from ctdbd. (This used to be ctdb commit f1b6ee4a55cdca60f93d992f0431d91bf301af2c) 2009-10-23 08:24:51 +04:00

			`struct ctdb_client_notify_list {`
			`struct ctdb_client_notify_list next, prev;`
			`struct ctdb_context *ctdb;`
			`uint64_t srvid;`
			`TDB_DATA data;`
			`};`


			`static int ctdb_client_notify_destructor(struct ctdb_client_notify_list *nl)`
			`{`
			`int ret;`

			`DEBUG(DEBUG_ERR,("Sending client notify message for srvid:%llu\n", (unsigned long long)nl->srvid));`

			`ret = ctdb_daemon_send_message(nl->ctdb, CTDB_BROADCAST_CONNECTED, (unsigned long long)nl->srvid, nl->data);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,("Failed to send client notify message\n"));`
			`}`

			`return 0;`
			`}`

			`int32_t ctdb_control_register_notify(struct ctdb_context *ctdb, uint32_t client_id, TDB_DATA indata)`
			`{`
			`struct ctdb_client_notify_register notify = (struct ctdb_client_notify_register )indata.dptr;`
			`struct ctdb_client *client = ctdb_reqid_find(ctdb, client_id, struct ctdb_client);`
			`struct ctdb_client_notify_list *nl;`

Reduce the loglevel for two log messages for Registering and Deregistering server ids. BZ61890 (This used to be ctdb commit 944434eb6420774e42e58984c6ddaa326a6853bd) 2010-03-30 04:57:25 +04:00			`DEBUG(DEBUG_INFO,("Register srvid %llu for client %d\n", (unsigned long long)notify->srvid, client_id));`
Add a mechanism where we can register notifications to be sent out to a SRVID when the client disconnects. The way to use this is from a client to : 1, first create a message handle and bind it to a SRVID A special prefix for the srvid space has been set aside for samba : Only samba is allowed to use srvid's with the top 32 bits set like this. The lower 32 bits are for samba to use internally. 2, register a "notification" using the new control : CTDB_CONTROL_REGISTER_NOTIFY = 114, This control takes as indata a structure like this : struct ctdb_client_notify_register { uint64_t srvid; uint32_t len; uint8_t notify_data[1]; }; srvid is the srvid used in the space set aside above. len and notify_data is an arbitrary blob. When notifications are later sent out to all clients, this is the payload of that notification message. If a client has registered with control 114 and then disconnects from ctdbd, ctdbd will broadcast a message to that srvid to all nodes/listeners in the cluster. A client can resister itself with as many different srvid's it want, but this is handled through a linked list from the client structure so it mainly designed for "few notifications per client". 3, a client that no longer wants to have a notification set up can deregister using control CTDB_CONTROL_DEREGISTER_NOTIFY = 115, which takes this as arguments : struct ctdb_client_notify_deregister { uint64_t srvid; }; When a client deregisters, there will no longer be sent a message to all other clients when this client disconnects from ctdbd. (This used to be ctdb commit f1b6ee4a55cdca60f93d992f0431d91bf301af2c) 2009-10-23 08:24:51 +04:00
			`if (indata.dsize < offsetof(struct ctdb_client_notify_register, notify_data)) {`
			`DEBUG(DEBUG_ERR,(__location__ " Too little data in control : %d\n", (int)indata.dsize));`
			`return -1;`
			`}`

			`if (indata.dsize != (notify->len + offsetof(struct ctdb_client_notify_register, notify_data))) {`
			`DEBUG(DEBUG_ERR,(__location__ " Wrong amount of data in control. Got %d, expected %d\n", (int)indata.dsize, (int)(notify->len + offsetof(struct ctdb_client_notify_register, notify_data))));`
			`return -1;`
			`}`


			`if (client == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Could not find client parent structure. You can not send this control to a remote node\n"));`
			`return -1;`
			`}`

			`for(nl=client->notify; nl; nl=nl->next) {`
			`if (nl->srvid == notify->srvid) {`
			`break;`
			`}`
			`}`
			`if (nl != NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Notification for srvid:%llu already exists for this client\n", (unsigned long long)notify->srvid));`
			`return -1;`
			`}`

			`nl = talloc(client, struct ctdb_client_notify_list);`
			`CTDB_NO_MEMORY(ctdb, nl);`
			`nl->ctdb = ctdb;`
			`nl->srvid = notify->srvid;`
			`nl->data.dsize = notify->len;`
			`nl->data.dptr = talloc_size(nl, nl->data.dsize);`
			`CTDB_NO_MEMORY(ctdb, nl->data.dptr);`
			`memcpy(nl->data.dptr, notify->notify_data, nl->data.dsize);`

			`DLIST_ADD(client->notify, nl);`
			`talloc_set_destructor(nl, ctdb_client_notify_destructor);`

			`return 0;`
			`}`

			`int32_t ctdb_control_deregister_notify(struct ctdb_context *ctdb, uint32_t client_id, TDB_DATA indata)`
			`{`
			`struct ctdb_client_notify_deregister notify = (struct ctdb_client_notify_deregister )indata.dptr;`
			`struct ctdb_client *client = ctdb_reqid_find(ctdb, client_id, struct ctdb_client);`
			`struct ctdb_client_notify_list *nl;`

Reduce the loglevel for two log messages for Registering and Deregistering server ids. BZ61890 (This used to be ctdb commit 944434eb6420774e42e58984c6ddaa326a6853bd) 2010-03-30 04:57:25 +04:00			`DEBUG(DEBUG_INFO,("Deregister srvid %llu for client %d\n", (unsigned long long)notify->srvid, client_id));`
Add a mechanism where we can register notifications to be sent out to a SRVID when the client disconnects. The way to use this is from a client to : 1, first create a message handle and bind it to a SRVID A special prefix for the srvid space has been set aside for samba : Only samba is allowed to use srvid's with the top 32 bits set like this. The lower 32 bits are for samba to use internally. 2, register a "notification" using the new control : CTDB_CONTROL_REGISTER_NOTIFY = 114, This control takes as indata a structure like this : struct ctdb_client_notify_register { uint64_t srvid; uint32_t len; uint8_t notify_data[1]; }; srvid is the srvid used in the space set aside above. len and notify_data is an arbitrary blob. When notifications are later sent out to all clients, this is the payload of that notification message. If a client has registered with control 114 and then disconnects from ctdbd, ctdbd will broadcast a message to that srvid to all nodes/listeners in the cluster. A client can resister itself with as many different srvid's it want, but this is handled through a linked list from the client structure so it mainly designed for "few notifications per client". 3, a client that no longer wants to have a notification set up can deregister using control CTDB_CONTROL_DEREGISTER_NOTIFY = 115, which takes this as arguments : struct ctdb_client_notify_deregister { uint64_t srvid; }; When a client deregisters, there will no longer be sent a message to all other clients when this client disconnects from ctdbd. (This used to be ctdb commit f1b6ee4a55cdca60f93d992f0431d91bf301af2c) 2009-10-23 08:24:51 +04:00
			`if (client == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Could not find client parent structure. You can not send this control to a remote node\n"));`
			`return -1;`
			`}`

			`for(nl=client->notify; nl; nl=nl->next) {`
			`if (nl->srvid == notify->srvid) {`
			`break;`
			`}`
			`}`
			`if (nl == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " No notification for srvid:%llu found for this client\n", (unsigned long long)notify->srvid));`
			`return -1;`
			`}`

			`DLIST_REMOVE(client->notify, nl);`
			`talloc_set_destructor(nl, NULL);`
			`talloc_free(nl);`

			`return 0;`
			`}`
Use the PID we pick up from the domain socket when a client connects and store this in the client structure. There is no need to rely on the hack that samba sends some special message handle registrations that encodes the pid in the srvid any more. This might not work on AIX since I recall some issues to get the pid in this way on that platform. (This used to be ctdb commit b4a7efa7e53e060a91dea0e8e57b116e2aeacebf) 2009-12-02 05:17:12 +03:00
Add a proper function to process a process-exist control in the daemon. This controls is only used by samba when samba wants to check if a subrecord held by a <node-id>:<smbd-pid> is still valid or if it can be reclaimed. If the node is banned or stopped, we kill the smbd process and return that the process does not exist to the caller. This allows us to recover subrecords from stopped/banned nodes where smbd is hung waiting for the databases to thaw. bz58185 (This used to be ctdb commit 157807af72ed4f7314afbc9c19756f9787b92c15) 2009-12-02 05:58:27 +03:00			`struct ctdb_client ctdb_find_client_by_pid(struct ctdb_context ctdb, pid_t pid)`
			`{`
			`struct ctdb_client_pid_list *client_pid;`

			`for (client_pid = ctdb->client_pids; client_pid; client_pid=client_pid->next) {`
			`if (client_pid->pid == pid) {`
			`return client_pid->client;`
			`}`
			`}`
			`return NULL;`
			`}`


			`/* This control is used by samba when probing if a process (of a samba daemon)`
			`exists on the node.`
			`Samba does this when it needs/wants to check if a subrecord in one of the`
			`databases is still valied, or if it is stale and can be removed.`
			`If the node is in unhealthy or stopped state we just kill of the samba`
			`process holding htis sub-record and return to the calling samba that`
			`the process does not exist.`
			`This allows us to forcefully recall subrecords registered by samba processes`
			`on banned and stopped nodes.`
			`*/`
			`int32_t ctdb_control_process_exists(struct ctdb_context *ctdb, pid_t pid)`
			`{`
			`struct ctdb_client *client;`

			`if (ctdb->nodes[ctdb->pnn]->flags & (NODE_FLAGS_BANNED\|NODE_FLAGS_STOPPED)) {`
			`client = ctdb_find_client_by_pid(ctdb, pid);`
			`if (client != NULL) {`
			`DEBUG(DEBUG_NOTICE,(__location__ " Killing client with pid:%d on banned/stopped node\n", (int)pid));`
			`talloc_free(client);`
			`}`
			`return -1;`
			`}`

			`return kill(pid, 0);`
			`}`

1321 lines 36 KiB C Raw Normal View History Unescape Escape

1321 lines

36 KiB

C

Raw Normal View History