samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-01-11 05:18:09 +03:00

1038 lines

28 KiB

C

Raw Normal View History

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`/*`
			`ctdb daemon code`

			`Copyright (C) Andrew Tridgell 2006`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`

			`This program is distributed in the hope that it will be useful,`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`

			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`*/`

			`#include "includes.h"`
			`#include "db_wrap.h"`
			`#include "lib/tdb/include/tdb.h"`
			`#include "lib/events/events.h"`
			`#include "lib/util/dlinklist.h"`
			`#include "system/network.h"`
			`#include "system/filesys.h"`
block SIGPIPE in the daemon to prevent a SIGPIPE on write to a dead socket (This used to be ctdb commit 02c09dc07c9bed57ca3692b14e41ac8cca0a29f4) 2007-04-17 09:33:20 +04:00			`#include "system/wait.h"`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`#include "../include/ctdb.h"`
			`#include "../include/ctdb_private.h"`
add improvements to tracking memory usage in ctdbd adn the recovery daemon and a ctdb command to pull the talloc memory map from a recovery daemon ctdb rddumpmemory (This used to be ctdb commit d23950be7406cf288f48b660c0f57a9b8d7bdd05) 2008-04-01 08:34:54 +04:00			`#include <sys/socket.h>`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`static void daemon_incoming_packet(void , struct ctdb_req_header );`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00
update flags in parent daemon too (This used to be ctdb commit 8995246d95e670753ab8c61d724d284cac2b414d) 2007-06-06 15:34:36 +04:00
no longer wait at startup for services to become available, instead set the node initially unhealthy and let the status monitoring bring the node online. This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup and thus frozen (This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc) 2007-09-24 04:00:14 +04:00			`static void print_exit_message(void)`
			`{`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("CTDB daemon shutting down\n"));`
no longer wait at startup for services to become available, instead set the node initially unhealthy and let the status monitoring bring the node online. This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup and thus frozen (This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc) 2007-09-24 04:00:14 +04:00			`}`


don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`/* called when the "startup" event script has finished */`
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`static void ctdb_start_transport(struct ctdb_context *ctdb)`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`{`
ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_ALERT,(__location__ " startup event finished but transport is DOWN.\n"));`
			`ctdb_fatal(ctdb, "transport is not initialized but startup completed");`
			`}`

don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`/* start the transport running */`
			`if (ctdb->methods->start(ctdb) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("transport failed to start!\n"));`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`ctdb_fatal(ctdb, "transport failed to start");`
			`}`

			`/* start the recovery daemon process */`
			`if (ctdb_start_recoverd(ctdb) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("Failed to start recovery daemon\n"));`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`exit(11);`
			`}`
- use a CTDB_BROADCAST_ALL for the attach message so it goes to currently disconnected nodes - start node monitoring only after transport starts - check if a node is already disconnected in the node dead function (This used to be ctdb commit b81ab6d507797282237768380c6f0e5a4c6519a5) 2007-05-30 08:35:22 +04:00
no longer wait at startup for services to become available, instead set the node initially unhealthy and let the status monitoring bring the node online. This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup and thus frozen (This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc) 2007-09-24 04:00:14 +04:00			`/* Make sure we log something when the daemon terminates */`
			`atexit(print_exit_message);`

split node health monitoring and checking for connected/disconnected nodes into two separate files. move the monitoring of keepalives for detecting connected/disconnected remote nodes into ctdb_keepalive.c (This used to be ctdb commit 23a57b20c314d5f11a433cf251eb9d9de743849a) 2008-01-15 00:42:12 +03:00			`/* start monitoring for connected/disconnected nodes */`
			`ctdb_start_keepalive(ctdb);`

			`/* start monitoring for node health */`
- use a CTDB_BROADCAST_ALL for the attach message so it goes to currently disconnected nodes - start node monitoring only after transport starts - check if a node is already disconnected in the node dead function (This used to be ctdb commit b81ab6d507797282237768380c6f0e5a4c6519a5) 2007-05-30 08:35:22 +04:00			`ctdb_start_monitoring(ctdb);`
updated ctdb tickle management there is an array for each node/public address that contains tcp tickles we send a TCP_ADD as a broadcast to all nodes when a client is added if tcp tickles are removed, they are only removed immediately from the local node. once every 20 seconds a node will push/broadcast out the tickle list for all public addresses it manages. this will remove any deleted tickles from the remote nodes (This used to be ctdb commit e3c432a915222e1392d91835bc7a73a96ab61ac9) 2007-07-20 09:05:55 +04:00
			`/* start periodic update of tcp tickle lists */`
			`ctdb_start_tcp_tickle_update(ctdb);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00
			`/* start listening for recovery daemon pings */`
			`ctdb_control_recd_ping(ctdb);`
don't start the transport connecting to the other nodes until after the startup event script has run (This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7) 2007-05-30 07:26:50 +04:00			`}`

block SIGPIPE in the daemon to prevent a SIGPIPE on write to a dead socket (This used to be ctdb commit 02c09dc07c9bed57ca3692b14e41ac8cca0a29f4) 2007-04-17 09:33:20 +04:00			`static void block_signal(int signum)`
			`{`
			`struct sigaction act;`

			`memset(&act, 0, sizeof(act));`

			`act.sa_handler = SIG_IGN;`
			`sigemptyset(&act.sa_mask);`
			`sigaddset(&act.sa_mask, signum);`
			`sigaction(signum, &act, NULL);`
			`}`

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`/*`
			`send a packet to a client`
			`*/`
			`static int daemon_queue_send(struct ctdb_client client, struct ctdb_req_header hdr)`
			`{`
- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`client->ctdb->statistics.client_packets_sent++;`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`return ctdb_queue_send(client->queue, (uint8_t *)hdr, hdr->length);`
			`}`

partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`/*`
			`message handler for when we are in daemon mode. This redirects the message`
			`to the right client`
			`*/`
make srvid 64 bits instead of 32 bits (This used to be ctdb commit 723bcfbba1d5aa711496d37b9658190b78a2d66b) 2007-04-27 18:31:45 +04:00			`static void daemon_message_handler(struct ctdb_context *ctdb, uint64_t srvid,`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`TDB_DATA data, void *private_data)`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`{`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`struct ctdb_client *client = talloc_get_type(private_data, struct ctdb_client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`struct ctdb_req_message *r;`
			`int len;`

			`/* construct a message to send to the client containing the data */`
			`len = offsetof(struct ctdb_req_message, data) + data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdbd_allocate_pkt(ctdb, ctdb, CTDB_REQ_MESSAGE,`
			`len, struct ctdb_req_message);`
			`CTDB_NO_MEMORY_VOID(ctdb, r);`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`talloc_set_name_const(r, "req_message packet");`

			`r->srvid = srvid;`
			`r->datalen = data.dsize;`
			`memcpy(&r->data[0], data.dptr, data.dsize);`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`daemon_queue_send(client, &r->hdr);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00
			`talloc_free(r);`
			`}`


			`/*`
			`this is called when the ctdb daemon received a ctdb request to`
			`set the srvid from the client`
			`*/`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`int daemon_register_message_handler(struct ctdb_context *ctdb, uint32_t client_id, uint64_t srvid)`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`{`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`struct ctdb_client *client = ctdb_reqid_find(ctdb, client_id, struct ctdb_client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`int res;`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`if (client == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Bad client_id in daemon_request_register_message_handler\n"));`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`return -1;`
			`}`
			`res = ctdb_register_message_handler(ctdb, client, srvid, daemon_message_handler, client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to register handler %llu in daemon\n",`
fixed more warnings on 64 bit boxes (This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5) 2007-05-29 07:58:41 +04:00			`(unsigned long long)srvid));`
minor debug changes (This used to be ctdb commit 1950d96458238782c3bfd8e41a053c4be8330ef9) 2007-04-20 01:47:37 +04:00			`} else {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " Registered message handler for srvid=%llu\n",`
fixed more warnings on 64 bit boxes (This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5) 2007-05-29 07:58:41 +04:00			`(unsigned long long)srvid));`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`}`
added code to kill registered clients on a IP release (This used to be ctdb commit ca0243b544987ce0618a99ac87b4abf598991e93) 2007-06-18 21:54:06 +04:00
			`/* this is a hack for Samba - we now know the pid of the Samba client */`
			`if ((srvid & 0xFFFFFFFF) == srvid &&`
			`kill(srvid, 0) == 0) {`
			`client->pid = srvid;`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " Registered PID %u for client %u\n",`
added code to kill registered clients on a IP release (This used to be ctdb commit ca0243b544987ce0618a99ac87b4abf598991e93) 2007-06-18 21:54:06 +04:00			`(unsigned)client->pid, client_id));`
			`}`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`return res;`
			`}`

			`/*`
			`this is called when the ctdb daemon received a ctdb request to`
			`remove a srvid from the client`
			`*/`
			`int daemon_deregister_message_handler(struct ctdb_context *ctdb, uint32_t client_id, uint64_t srvid)`
			`{`
			`struct ctdb_client *client = ctdb_reqid_find(ctdb, client_id, struct ctdb_client);`
			`if (client == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Bad client_id in daemon_request_deregister_message_handler\n"));`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`return -1;`
			`}`
			`return ctdb_deregister_message_handler(ctdb, srvid, client);`
partially completed work towards full messaging system which will work in both daemon and standalone mode. Does not compile\! committing so ronnie can continue while I'm out (This used to be ctdb commit 1b5e65a700e2bd0a5c913d7866024b25600a14c9) 2007-04-11 05:58:28 +04:00			`}`


make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`/*`
			`destroy a ctdb_client`
			`*/`
			`static int ctdb_client_destructor(struct ctdb_client *client)`
			`{`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`struct ctdb_db_context *ctdb_db;`

added code to ctdb to send a tcp 'tickle' ack when we takeover an IP. A raw tcp ack is sent for each tcp connection held by clients before the IP takeover. These acks have a deliberately incorrect sequence number, and should cause the windows client to send its own ack which will in turn cause a tcp reset and thus cause windows clients to much more quickly reconnect to the new node. (This used to be ctdb commit eef38bfe8461b47489d169c61895d6bb8a8f79a1) 2007-05-27 09:26:29 +04:00			`ctdb_takeover_client_destructor_hook(client);`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`ctdb_reqid_remove(client->ctdb, client->client_id);`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.num_clients) {`
			`client->ctdb->statistics.num_clients--;`
			`}`
Add two new controls to start and cancel a persistent update. This allows ctdb to automatically start a new full blown recovery if a client has started updating the local tdb for a persistent database but is kill -9ed before it has ensured the update is distributed clusterwide. (This used to be ctdb commit 1ffccb3e0b3b5bd376c5302304029af393709518) 2008-07-17 07:50:55 +04:00
			`if (client->num_persistent_updates != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Client disconnecting with %u persistent updates in flight. Starting recovery\n", client->num_persistent_updates));`
			`client->ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`}`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`ctdb_db = find_ctdb_db(client->ctdb, client->db_id);`
			`if (ctdb_db) {`
			`DEBUG(DEBUG_ERR, (__location__ " client exit while transaction "`
			`"commit active. Forcing recovery.\n"));`
			`client->ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`ctdb_db->transaction_active = false;`
			`}`
Add two new controls to start and cancel a persistent update. This allows ctdb to automatically start a new full blown recovery if a client has started updating the local tdb for a persistent database but is kill -9ed before it has ensured the update is distributed clusterwide. (This used to be ctdb commit 1ffccb3e0b3b5bd376c5302304029af393709518) 2008-07-17 07:50:55 +04:00
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return 0;`
			`}`


add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`/*`
			`this is called when the ctdb daemon received a ctdb request message`
			`from a local client over the unix domain socket`
			`*/`
			`static void daemon_request_message_from_client(struct ctdb_client *client,`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`struct ctdb_req_message *c)`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`{`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`TDB_DATA data;`
			`int res;`

			`/* maybe the message is for another client on this node */`
change ctdb_get_vnn to ctdb_get_pnn (This used to be ctdb commit 1e19930198c2bcc7ccb755e0ee51555fb823029a) 2007-09-04 04:18:44 +04:00			`if (ctdb_get_pnn(client->ctdb)==c->hdr.destnode) {`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`ctdb_request_message(client->ctdb, (struct ctdb_req_header *)c);`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`return;`
			`}`
add a special VNN that means "all" nodes so that a message can be broadcasted to all daemons in the cluster change the message dispatch routine for sending messages so that it allows several clients to use the same srvid messages are then passed on to all clients that have that srvid (This used to be ctdb commit 05d7ebb3556785f0f17a87d808f31ffe8dac288a) 2007-04-27 17:16:17 +04:00
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`/* its for a remote node */`
			`data.dptr = &c->data[0];`
			`data.dsize = c->datalen;`
			`res = ctdb_daemon_send_message(client->ctdb, c->hdr.destnode,`
			`c->srvid, data);`
			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to send message to remote node %u\n",`
more DEBUG() calls (This used to be ctdb commit 79f3d63eec5652d87f13875c76e90ead81a26ad9) 2007-04-17 16:27:17 +04:00			`c->hdr.destnode));`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`}`
			`}`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
			`struct daemon_call_state {`
			`struct ctdb_client *client;`
			`uint32_t reqid;`
			`struct ctdb_call *call;`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`struct timeval start_time;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`};`

			`/*`
			`complete a call from a client`
			`*/`
			`static void daemon_call_from_client_callback(struct ctdb_call_state *state)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`struct daemon_call_state *dstate = talloc_get_type(state->async.private_data,`
			`struct daemon_call_state);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`struct ctdb_reply_call *r;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`int res;`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`uint32_t length;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`struct ctdb_client *client = dstate->client;`
we actually need a ctdb_db variable (This used to be ctdb commit aba984f1b85f5a2d370b093061cf15843ee53758) 2008-11-03 13:54:52 +03:00			`struct ctdb_db_context *ctdb_db = state->ctdb_db;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`talloc_steal(client, dstate);`
			`talloc_steal(dstate, dstate->call);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`res = ctdb_daemon_call_recv(state, dstate->call);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " ctdbd_call_recv() returned error\n"));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`client->ctdb->statistics.pending_calls--;`
			`}`
latency is measured in us, not ms use an explicit ctdb_db variable instead of dereferencing state (This used to be ctdb commit 8c6a02fb423a8cbcbfc706767e3d353cd48073c3) 2008-10-30 05:34:10 +03:00			`ctdb_latency(ctdb_db, "call_from_client_cb 1", &client->ctdb->statistics.max_call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`length = offsetof(struct ctdb_reply_call, data) + dstate->call->reply_data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdbd_allocate_pkt(client->ctdb, dstate, CTDB_REPLY_CALL,`
			`length, struct ctdb_reply_call);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`if (r == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Failed to allocate reply_call in ctdb daemon\n"));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`client->ctdb->statistics.pending_calls--;`
			`}`
latency is measured in us, not ms use an explicit ctdb_db variable instead of dereferencing state (This used to be ctdb commit 8c6a02fb423a8cbcbfc706767e3d353cd48073c3) 2008-10-30 05:34:10 +03:00			`ctdb_latency(ctdb_db, "call_from_client_cb 2", &client->ctdb->statistics.max_call_latency, dstate->start_time);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`return;`
			`}`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`r->hdr.reqid = dstate->reqid;`
			`r->datalen = dstate->call->reply_data.dsize;`
			`memcpy(&r->data[0], dstate->call->reply_data.dptr, r->datalen);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`res = daemon_queue_send(client, &r->hdr);`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Failed to queue packet from daemon to client\n"));`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`
latency is measured in us, not ms use an explicit ctdb_db variable instead of dereferencing state (This used to be ctdb commit 8c6a02fb423a8cbcbfc706767e3d353cd48073c3) 2008-10-30 05:34:10 +03:00			`ctdb_latency(ctdb_db, "call_from_client_cb 3", &client->ctdb->statistics.max_call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`talloc_free(dstate);`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`client->ctdb->statistics.pending_calls--;`
			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`struct ctdb_daemon_packet_wrap {`
			`struct ctdb_context *ctdb;`
			`uint32_t client_id;`
			`};`

			`/*`
			`a wrapper to catch disconnected clients`
			`*/`
			`static void daemon_incoming_packet_wrap(void p, struct ctdb_req_header hdr)`
			`{`
			`struct ctdb_client *client;`
			`struct ctdb_daemon_packet_wrap *w = talloc_get_type(p,`
			`struct ctdb_daemon_packet_wrap);`
			`if (w == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,(__location__ " Bad packet type '%s'\n", talloc_get_name(p)));`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`return;`
			`}`

			`client = ctdb_reqid_find(w->ctdb, w->client_id, struct ctdb_client);`
			`if (client == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Packet for disconnected client %u\n",`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`w->client_id));`
			`talloc_free(w);`
			`return;`
			`}`
			`talloc_free(w);`

			`/* process it */`
			`daemon_incoming_packet(client, hdr);`
			`}`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00
merged ronnies code to delay client requests when in recovery mode (This used to be ctdb commit dfca37076d642f3407c63dfe3b685287d27c8f8d) 2007-05-10 01:43:18 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`/*`
			`this is called when the ctdb daemon received a ctdb request call`
			`from a local client over the unix domain socket`
			`*/`
			`static void daemon_request_call_from_client(struct ctdb_client *client,`
			`struct ctdb_req_call *c)`
			`{`
			`struct ctdb_call_state *state;`
			`struct ctdb_db_context *ctdb_db;`
			`struct daemon_call_state *dstate;`
			`struct ctdb_call *call;`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`struct ctdb_ltdb_header header;`
			`TDB_DATA key, data;`
			`int ret;`
			`struct ctdb_context *ctdb = client->ctdb;`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`struct ctdb_daemon_packet_wrap *w;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`ctdb->statistics.total_calls++;`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls++;`
			`}`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`ctdb_db = find_ctdb_db(client->ctdb, c->db_id);`
			`if (!ctdb_db) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Unknown database in request. db_id==0x%08x",`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`c->db_id));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls--;`
			`}`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`key.dptr = c->data;`
			`key.dsize = c->keylen;`

- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`w = talloc(ctdb, struct ctdb_daemon_packet_wrap);`
			`CTDB_NO_MEMORY_VOID(ctdb, w);`

			`w->ctdb = ctdb;`
			`w->client_id = client->client_id;`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`ret = ctdb_ltdb_lock_fetch_requeue(ctdb_db, key, &header,`
			`(struct ctdb_req_header *)c, &data,`
- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`daemon_incoming_packet_wrap, w, True);`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`if (ret == -2) {`
			`/* will retry later */`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls--;`
			`}`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`return;`
			`}`

- catch a case where the client disconnects during a call - track all talloc memory, using NULL context (This used to be ctdb commit bf89c56002f5311520e91cb367753bc46e5dddc9) 2008-01-16 01:44:48 +03:00			`talloc_free(w);`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to fetch record\n"));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls--;`
			`}`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`return;`
			`}`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`dstate = talloc(client, struct daemon_call_state);`
			`if (dstate == NULL) {`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`ctdb_ltdb_unlock(ctdb_db, key);`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to allocate dstate\n"));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls--;`
			`}`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`dstate->start_time = timeval_current();`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`dstate->client = client;`
			`dstate->reqid = c->hdr.reqid;`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`talloc_steal(dstate, data.dptr);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
			`call = dstate->call = talloc_zero(dstate, struct ctdb_call);`
			`if (call == NULL) {`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`ctdb_ltdb_unlock(ctdb_db, key);`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to allocate call\n"));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls--;`
			`}`
add control and logging of very high latencies. log the type of operation and the database name for all latencies higher than a treshold (This used to be ctdb commit 1d581dcd507e8e13d7ae085ff4d6a9f3e2aaeba5) 2008-10-30 04:49:53 +03:00			`ctdb_latency(ctdb_db, "call_from_client 1", &ctdb->statistics.max_call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`

			`call->call_id = c->callid;`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`call->key = key;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`call->call_data.dptr = c->data + c->keylen;`
			`call->call_data.dsize = c->calldatalen;`
The remote node needs to get the IMMEDIATE_MIGRATION flag to actually send the record (This used to be ctdb commit 9159434b1eef39b7de58b30626039f1e45a97306) 2007-04-19 19:44:45 +04:00			`call->flags = c->flags;`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (header.dmaster == ctdb->pnn) {`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`state = ctdb_call_local_send(ctdb_db, call, &header, &data);`
			`} else {`
			`state = ctdb_daemon_call_send_remote(ctdb_db, call, &header);`
			`}`

			`ctdb_ltdb_unlock(ctdb_db, key);`

- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Unable to setup call send\n"));`
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0 Otherwise there is the chance that we will reset the statistics after the counter has been incremented (client connects) to zero and when the client disconnects we decrement it to a negative number. this is a pure cosmetic patch with no operational impact to ctdb (This used to be ctdb commit 72f1c696ee77899f7973878f2568a60d199d4fea) 2009-05-01 06:30:26 +04:00			`if (client->ctdb->statistics.pending_calls > 0) {`
			`ctdb->statistics.pending_calls--;`
			`}`
add control and logging of very high latencies. log the type of operation and the database name for all latencies higher than a treshold (This used to be ctdb commit 1d581dcd507e8e13d7ae085ff4d6a9f3e2aaeba5) 2008-10-30 04:49:53 +03:00			`ctdb_latency(ctdb_db, "call_from_client 2", &ctdb->statistics.max_call_latency, dstate->start_time);`
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`return;`
			`}`
			`talloc_steal(state, dstate);`
			`talloc_steal(client, state);`

			`state->async.fn = daemon_call_from_client_callback;`
			`state->async.private_data = dstate;`
			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
			`static void daemon_request_control_from_client(struct ctdb_client *client,`
			`struct ctdb_req_control *c);`

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`/* data contains a packet from the client */`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`static void daemon_incoming_packet(void p, struct ctdb_req_header hdr)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`struct ctdb_client *client = talloc_get_type(p, struct ctdb_client);`
- merge volkers debug changes - fixed memory leaks in the 3 packet receive routines. The problem was that the ctdb_call logic would occasionally complete and free a incoming packet, which would then be freed again in the packet receive routine. The solution is to make the packet a child of a temporary context in the receive routine then free that temporary context. That allows other routines to keep or free the packet if they want to, while allowing us to safely free it (via a free of the temporary context) in the receive function (This used to be ctdb commit 304aaaa7235febbe97ff9ecb43875b7265ac48cd) 2007-04-18 05:20:24 +04:00			`TALLOC_CTX *tmp_ctx;`
- expanded status to include count of each call type - added lockwait latency (This used to be ctdb commit 0b5d196147e644cf8b172cb4b593fd46b1caa386) 2007-04-20 15:02:53 +04:00			`struct ctdb_context *ctdb = client->ctdb;`
- merge volkers debug changes - fixed memory leaks in the 3 packet receive routines. The problem was that the ctdb_call logic would occasionally complete and free a incoming packet, which would then be freed again in the packet receive routine. The solution is to make the packet a child of a temporary context in the receive routine then free that temporary context. That allows other routines to keep or free the packet if they want to, while allowing us to safely free it (via a free of the temporary context) in the receive function (This used to be ctdb commit 304aaaa7235febbe97ff9ecb43875b7265ac48cd) 2007-04-18 05:20:24 +04:00
			`/* place the packet as a child of a tmp_ctx. We then use`
			`talloc_free() below to free it. If any of the calls want`
			`to keep it, then they will steal it somewhere else, and the`
			`talloc_free() will be a no-op */`
			`tmp_ctx = talloc_new(client);`
			`talloc_steal(tmp_ctx, hdr);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`if (hdr->ctdb_magic != CTDB_MAGIC) {`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00			`ctdb_set_error(client->ctdb, "Non CTDB packet rejected in daemon\n");`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`goto done;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

			`if (hdr->ctdb_version != CTDB_VERSION) {`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00			`ctdb_set_error(client->ctdb, "Bad CTDB version 0x%x rejected in daemon\n", hdr->ctdb_version);`
merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`goto done;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

			`switch (hdr->operation) {`
			`case CTDB_REQ_CALL:`
- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`ctdb->statistics.client.req_call++;`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`daemon_request_call_from_client(client, (struct ctdb_req_call *)hdr);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`break;`

add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`case CTDB_REQ_MESSAGE:`
- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`ctdb->statistics.client.req_message++;`
add a test that sends messages between clients connected to the same ctdb add code to actually pass the messages between clients and ctdb (This used to be ctdb commit 6d5b55d7b9c611fb5e98765906757a7d82e4bf6b) 2007-04-11 07:43:15 +04:00			`daemon_request_message_from_client(client, (struct ctdb_req_message *)hdr);`
			`break;`
add proper support for ctdb_connect_wait in daemon mode (This used to be ctdb commit 8d110df5939b3e6a6341909956453887f4eb6b0d) 2007-04-11 08:54:47 +04:00
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`case CTDB_REQ_CONTROL:`
- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`ctdb->statistics.client.req_control++;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`daemon_request_control_from_client(client, (struct ctdb_req_control *)hdr);`
			`break;`

add store_unlock pdu's for the domain socket. note that the store_unlock does not actually do anything yet apart from passing the pdu from client to daemon and daemon responds. next is to make sure the daemon actually stores the data in a database (This used to be ctdb commit 167d6993e78f6a1d0f6607ef66925a14993ae6a1) 2007-04-13 03:41:15 +04:00			`default:`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,(__location__ " daemon: unrecognized operation %u\n",`
more DEBUG() calls (This used to be ctdb commit 79f3d63eec5652d87f13875c76e90ead81a26ad9) 2007-04-17 16:27:17 +04:00			`hdr->operation));`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

merge from ronnie, plus complete the client side of inter-node messaging (This used to be ctdb commit e605417436855d22343462acae4cbb79a374977e) 2007-04-11 08:05:01 +04:00			`done:`
- merge volkers debug changes - fixed memory leaks in the 3 packet receive routines. The problem was that the ctdb_call logic would occasionally complete and free a incoming packet, which would then be freed again in the packet receive routine. The solution is to make the packet a child of a temporary context in the receive routine then free that temporary context. That allows other routines to keep or free the packet if they want to, while allowing us to safely free it (via a free of the temporary context) in the receive function (This used to be ctdb commit 304aaaa7235febbe97ff9ecb43875b7265ac48cd) 2007-04-18 05:20:24 +04:00			`talloc_free(tmp_ctx);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`/*`
			`called when the daemon gets a incoming packet`
			`*/`
- removed the non-daemon mode from ctdb, in order to simplify the code. It may be added back later once everything is working nicely, or simulated using a in-process pipe instead of a unix domain socket - rewrote the ctdb_fetch_lock() code to follow the new design (This used to be ctdb commit 5024dd1f305fe1ecc262db2240c56f773b4f28f0) 2007-04-17 08:52:51 +04:00			`static void ctdb_daemon_read_cb(uint8_t data, size_t cnt, void args)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`struct ctdb_client *client = talloc_get_type(args, struct ctdb_client);`
			`struct ctdb_req_header *hdr;`

Handle a client that exited correctly: We need to ignore SIGPIPE and when the read returns 0 bytes this means the client has exited. Close the connection then. (This used to be ctdb commit bd10f4e62146493848258df8a3dc3b9222337a12) 2007-04-11 15:17:36 +04:00			`if (cnt == 0) {`
			`talloc_free(client);`
			`return;`
			`}`

- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`client->ctdb->statistics.client_packets_recv++;`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`if (cnt < sizeof(*hdr)) {`
fixed some warnings (This used to be ctdb commit b5434a40cf2db008eb1e681fcd2ceeff331324fa) 2007-04-28 13:35:49 +04:00			`ctdb_set_error(client->ctdb, "Bad packet length %u in daemon\n",`
			`(unsigned)cnt);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`hdr = (struct ctdb_req_header *)data;`
			`if (cnt != hdr->length) {`
removed some bogus debug lines (This used to be ctdb commit 25aa579058ecd2a33b13b4c1d6c7c75427bbdafa) 2007-04-26 20:31:13 +04:00			`ctdb_set_error(client->ctdb, "Bad header length %u expected %u\n in daemon",`
fixed some warnings (This used to be ctdb commit b5434a40cf2db008eb1e681fcd2ceeff331324fa) 2007-04-28 13:35:49 +04:00			`(unsigned)hdr->length, (unsigned)cnt);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`

change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`if (hdr->ctdb_magic != CTDB_MAGIC) {`
			`ctdb_set_error(client->ctdb, "Non CTDB packet rejected\n");`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`

change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`if (hdr->ctdb_version != CTDB_VERSION) {`
change some error printouts to make it easier to determine whether the error occured in the client or in the daemon (This used to be ctdb commit a7e42c2c56e38b4b58ede0ad45767695d704dac4) 2007-04-17 04:15:44 +04:00			`ctdb_set_error(client->ctdb, "Bad CTDB version 0x%x rejected in daemon\n", hdr->ctdb_version);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`return;`
			`}`

added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_DEBUG,(__location__ " client request %u of type %u length %u from "`
fixed %d which should be %u (This used to be ctdb commit 2792cf718ff1e66fe99f870f683a13baa160f629) 2007-05-23 14:15:09 +04:00			`"node %u to %u\n", hdr->reqid, hdr->operation, hdr->length,`
added ctdb_status tool (This used to be ctdb commit 908d6c6a936e21f70f05827ce302e966cca0132a) 2007-04-20 14:07:47 +04:00			`hdr->srcnode, hdr->destnode));`

change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`/* it is the responsibility of the incoming packet function to free 'data' */`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`daemon_incoming_packet(client, hdr);`
change ctdb_client_read() to use the ctdb_read_pdu() helper (This used to be ctdb commit d476aa8533b394af6aced9c80fffaf0eefae1dd0) 2007-04-10 02:38:29 +04:00			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`static void ctdb_accept_client(struct event_context ev, struct fd_event fde,`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`uint16_t flags, void *private_data)`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`{`
initial ipv6 patch Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com> (This used to be ctdb commit 1f131f21386f428bbbbb29098d56c2f64596583b) 2008-08-19 08:58:29 +04:00			`struct sockaddr_un addr;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`socklen_t len;`
			`int fd;`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`struct ctdb_client *client;`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#ifdef _AIX`
			`struct peercred_struct cr;`
			`socklen_t crl = sizeof(struct peercred_struct);`
			`#else`
decorate the memdump output with a nice field for ctdb_client structures to show the pid of the client that attached (This used to be ctdb commit 0d9314302d0b988b6ab5d533deef40c5b343c249) 2008-04-01 10:17:21 +04:00			`struct ucred cr;`
			`socklen_t crl = sizeof(struct ucred);`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#endif`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`memset(&addr, 0, sizeof(addr));`
			`len = sizeof(addr);`
			`fd = accept(ctdb->daemon.sd, (struct sockaddr *)&addr, &len);`
			`if (fd == -1) {`
			`return;`
			`}`
close sockets when we exec scripts (This used to be ctdb commit 0fac2164db4279db2d7d376a34be05b890304087) 2007-05-30 09:43:25 +04:00
			`set_nonblocking(fd);`
			`set_close_on_exec(fd);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`client = talloc_zero(ctdb, struct ctdb_client);`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#ifdef _AIX`
			`if (getsockopt(fd, SOL_SOCKET, SO_PEERID, &cr, &crl) == 0) {`
			`#else`
decorate the memdump output with a nice field for ctdb_client structures to show the pid of the client that attached (This used to be ctdb commit 0d9314302d0b988b6ab5d533deef40c5b343c249) 2008-04-01 10:17:21 +04:00			`if (getsockopt(fd, SOL_SOCKET, SO_PEERCRED, &cr, &crl) == 0) {`
From Chris Cowan Add support in AIX to track the PID of a client that connects to the unix domain socket (This used to be ctdb commit 4c006c675d577d4a45f4db2929af6d50bc28dd9e) 2008-04-03 03:58:51 +04:00			`#endif`
decorate the memdump output with a nice field for ctdb_client structures to show the pid of the client that attached (This used to be ctdb commit 0d9314302d0b988b6ab5d533deef40c5b343c249) 2008-04-01 10:17:21 +04:00			`talloc_asprintf(client, "struct ctdb_client: pid:%u", (unsigned)cr.pid);`
add improvements to tracking memory usage in ctdbd adn the recovery daemon and a ctdb command to pull the talloc memory map from a recovery daemon ctdb rddumpmemory (This used to be ctdb commit d23950be7406cf288f48b660c0f57a9b8d7bdd05) 2008-04-01 08:34:54 +04:00			`}`

make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`client->ctdb = ctdb;`
			`client->fd = fd;`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`client->client_id = ctdb_reqid_new(ctdb, client);`
- renamed ctdb_control utility to ctdb - use -n to specify node number in ctdb utility - change 'ctdb status' to 'ctdb statistics' - added 'ctdb status' which shows status - added netmask to public IPs, so you don't try a takeover on a foreign network - cleaned up tools/ctdb_control.c a lot - generate usage message at runtime (This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656) 2007-05-29 06:16:59 +04:00			`ctdb->statistics.num_clients++;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
made all sockets handle partial IO abstract IO via ctdb_queue_*() functions (This used to be ctdb commit 636ae76f4632b29231db87be32c9114f58b37840) 2007-04-10 13:33:21 +04:00			`client->queue = ctdb_queue_setup(ctdb, client, fd, CTDB_DS_ALIGNMENT,`
- removed the non-daemon mode from ctdb, in order to simplify the code. It may be added back later once everything is working nicely, or simulated using a in-process pipe instead of a unix domain socket - rewrote the ctdb_fetch_lock() code to follow the new design (This used to be ctdb commit 5024dd1f305fe1ecc262db2240c56f773b4f28f0) 2007-04-17 08:52:51 +04:00			`ctdb_daemon_read_cb, client);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`talloc_set_destructor(client, ctdb_client_destructor);`
			`}`



			`/*`
			`create a unix domain socket and bind it`
			`return a file descriptor open on the socket`
			`*/`
			`static int ux_socket_bind(struct ctdb_context *ctdb)`
			`{`
			`struct sockaddr_un addr;`

			`ctdb->daemon.sd = socket(AF_UNIX, SOCK_STREAM, 0);`
			`if (ctdb->daemon.sd == -1) {`
			`return -1;`
			`}`

close sockets when we exec scripts (This used to be ctdb commit 0fac2164db4279db2d7d376a34be05b890304087) 2007-05-30 09:43:25 +04:00			`set_close_on_exec(ctdb->daemon.sd);`
			`set_nonblocking(ctdb->daemon.sd);`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`memset(&addr, 0, sizeof(addr));`
			`addr.sun_family = AF_UNIX;`
			`strncpy(addr.sun_path, ctdb->daemon.name, sizeof(addr.sun_path));`

			`if (bind(ctdb->daemon.sd, (struct sockaddr *)&addr, sizeof(addr)) == -1) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,("Unable to bind on ctdb socket '%s'\n", ctdb->daemon.name));`
make sure the ctdb control socket is secure (This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861) 2007-05-13 03:20:16 +04:00			`goto failed;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`
From Chris Cowan secure the domain socket and set permissions properly (This used to be ctdb commit ac6a362fc2fc4a56b4c310478a96eb12daace176) 2008-04-10 00:51:53 +04:00
			`if (chown(ctdb->daemon.name, geteuid(), getegid()) != 0 \|\|`
			`chmod(ctdb->daemon.name, 0700) != 0) {`
fix compiler warning during a fatal error failing to lock down the socket (This used to be ctdb commit 0ad22de1a614dc2d1926546027be5f5eea3381ed) 2008-04-10 03:56:49 +04:00			`DEBUG(DEBUG_CRIT,("Unable to secure ctdb socket '%s', ctdb->daemon.name\n", ctdb->daemon.name));`
From Chris Cowan secure the domain socket and set permissions properly (This used to be ctdb commit ac6a362fc2fc4a56b4c310478a96eb12daace176) 2008-04-10 00:51:53 +04:00			`goto failed;`
			`}`


increase the listen queue. Now that the eventscripts may become clients and connect back to the server we do get a lot more concurrent connection attempts (takepip/teleaseip are performed in parallell) (This used to be ctdb commit 018f8b0b1823ef59b46f1a671aec5309d10628f4) 2009-04-06 08:00:41 +04:00			`if (listen(ctdb->daemon.sd, 100) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,("Unable to listen on ctdb socket '%s'\n", ctdb->daemon.name));`
make sure the ctdb control socket is secure (This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861) 2007-05-13 03:20:16 +04:00			`goto failed;`
			`}`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00
			`return 0;`
make sure the ctdb control socket is secure (This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861) 2007-05-13 03:20:16 +04:00
			`failed:`
			`close(ctdb->daemon.sd);`
			`ctdb->daemon.sd = -1;`
			`return -1;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00			`static void sig_child_handler(struct event_context *ev,`
			`struct signal_event *se, int signum, int count,`
			`void *dont_care,`
			`void *private_data)`
			`{`
			`// struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`
			`int status;`
			`pid_t pid = -1;`

			`while (pid != 0) {`
			`pid = waitpid(-1, &status, WNOHANG);`
			`if (pid == -1) {`
			`DEBUG(DEBUG_ERR, (__location__ " waitpid() returned error. errno:%d\n", errno));`
			`return;`
			`}`
			`if (pid > 0) {`
			`DEBUG(DEBUG_DEBUG, ("SIGCHLD from %d\n", (int)pid));`
			`}`
			`}`
			`}`

yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`/*`
			`start the protocol going as a daemon`
			`*/`
added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`int ctdb_start_daemon(struct ctdb_context *ctdb, bool do_fork)`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`{`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`int res, ret = -1;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`struct fd_event *fde;`
			`const char *domain_socket_name;`
proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00			`struct signal_event *se;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`/* get rid of any old sockets */`
			`unlink(ctdb->daemon.name);`

			`/* create a unix domain stream socket to listen to */`
			`res = ux_socket_bind(ctdb);`
			`if (res!=0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,(__location__ " Failed to open CTDB unix domain socket\n"));`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`exit(10);`
			`}`

added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`if (do_fork && fork()) {`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`return 0;`
			`}`

			`tdb_reopen_all(False);`

added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`if (do_fork) {`
			`setsid();`
fixed the bug that make "onnode N service ctdb start" hang (This used to be ctdb commit b50dcb16f30a60abce42f491f9b0aae7948b8206) 2008-01-05 04:09:29 +03:00			`close(0);`
			`if (open("/dev/null", O_RDONLY) != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,(__location__ " Failed to setup stdin on /dev/null\n"));`
fixed the bug that make "onnode N service ctdb start" hang (This used to be ctdb commit b50dcb16f30a60abce42f491f9b0aae7948b8206) 2008-01-05 04:09:29 +03:00			`exit(11);`
			`}`
added a -i switch to run ctdbd without forking (This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4) 2007-05-15 03:44:33 +04:00			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`block_signal(SIGPIPE);`

added --nosetsched option to ctdbd (This used to be ctdb commit 4cbbb88c1735c7d112e751e22da1c1c69e09bf4a) 2007-07-13 02:47:02 +04:00			`if (ctdb->do_setsched) {`
			`/* try to set us up as realtime */`
fully save/restore scheduler parameters (This used to be ctdb commit 59408eabe7515d49a6eef3b6fb2590a1cd1df956) 2007-07-13 03:35:46 +04:00			`ctdb_set_scheduler(ctdb);`
added --nosetsched option to ctdbd (This used to be ctdb commit 4cbbb88c1735c7d112e751e22da1c1c69e09bf4a) 2007-07-13 02:47:02 +04:00			`}`
make ctdbd realtime if possible (This used to be ctdb commit 8852f6cca52b64a5239c83ab7c6a99ae4edb2597) 2007-05-24 08:52:10 +04:00
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`/* ensure the socket is deleted on exit of the daemon */`
			`domain_socket_name = talloc_strdup(talloc_autofree_context(), ctdb->daemon.name);`
add missing checks on so far ignored return values Most of these were found during a review by Jim Meyering <meyering@redhat.com> (This used to be ctdb commit 3aee5ee1deb4a19be3bd3a4ce3abbe09de763344) 2009-05-20 14:08:13 +04:00			`if (domain_socket_name == NULL) {`
			`DEBUG(DEBUG_ALERT,(__location__ " talloc_strdup failed.\n"));`
			`exit(12);`
			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`ctdb->ev = event_context_init(NULL);`
add a command line flag to ctdbd to start a recovery daemon. update the recovery test script to start all ctdb daemons with a recovery daemon (This used to be ctdb commit 47794e16df285cacefc30208d892d931a6e46b96) 2007-05-09 03:59:23 +04:00
added syslog support, and use a pipe to catch logging from child processes to the ctdbd logging functions (This used to be ctdb commit 1306b04cd01e996fd1aa1159a9521f2ff7b06165) 2008-01-16 14:03:01 +03:00			`ctdb_set_child_logging(ctdb);`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* force initial recovery for election */`
			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`

			`if (strcmp(ctdb->transport, "tcp") == 0) {`
			`int ctdb_tcp_init(struct ctdb_context *);`
			`ret = ctdb_tcp_init(ctdb);`
			`}`
			`#ifdef USE_INFINIBAND`
			`if (strcmp(ctdb->transport, "ib") == 0) {`
			`int ctdb_ibw_init(struct ctdb_context *);`
			`ret = ctdb_ibw_init(ctdb);`
			`}`
			`#endif`
			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Failed to initialise transport '%s'\n", ctdb->transport));`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`return -1;`
			`}`

ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_ALERT,(__location__ " Can not initialize transport. ctdb->methods is NULL\n"));`
			`ctdb_fatal(ctdb, "transport is unavailable. can not initialize.");`
			`}`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* initialise the transport */`
			`if (ctdb->methods->initialise(ctdb) != 0) {`
			`ctdb_fatal(ctdb, "transport failed to initialise");`
			`}`

			`/* attach to any existing persistent databases */`
			`if (ctdb_attach_persistent(ctdb) != 0) {`
			`ctdb_fatal(ctdb, "Failed to attach to persistent databases\n");`
			`}`

start ctdb frozen, and let the election sort things out. This prevents a race on startup (This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd) 2007-05-23 06:23:07 +04:00			`/* start frozen, then let the first election sort things out */`
initial attempt at freezing databases in priority order (This used to be ctdb commit e8d692590da1070c87a4144031e3306d190ebed2) 2009-10-12 05:08:39 +04:00			`if (ctdb_blocking_freeze(ctdb)) {`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`ctdb_fatal(ctdb, "Failed to get initial freeze\n");`
start ctdb frozen, and let the election sort things out. This prevents a race on startup (This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd) 2007-05-23 06:23:07 +04:00			`}`

			`/* now start accepting clients, only can do this once frozen */`
			`fde = event_add_fd(ctdb->ev, ctdb, ctdb->daemon.sd,`
			`EVENT_FD_READ\|EVENT_FD_AUTOCLOSE,`
			`ctdb_accept_client, ctdb);`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* tell all other nodes we've just started up */`
			`ctdb_daemon_send_control(ctdb, CTDB_BROADCAST_ALL,`
			`0, CTDB_CONTROL_STARTUP, 0,`
			`CTDB_CTRL_FLAG_NOREPLY,`
			`tdb_null, NULL, NULL);`

			`/* release any IPs we hold from previous runs of the daemon */`
			`ctdb_release_all_ips(ctdb);`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
prevent a deadly embrace between smbd and ctdbd by moving the calling of the startup event scripts after the point where recovery has started and the node is in normal operation This makes the 'startup' script just a special type of the 'monitor' script which is called first (This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8) 2007-11-12 02:53:11 +03:00			`/* start the transport going */`
			`ctdb_start_transport(ctdb);`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00
proper waitpid() fix. remove all waitpid() calls and use the event system to trap sigchld (This used to be ctdb commit 77458b2b6b51b2970c12b0e5b097088d3fb9d358) 2008-07-09 08:02:54 +04:00			`/* set up a handler to pick up sigchld */`
			`se = event_add_signal(ctdb->ev, ctdb,`
			`SIGCHLD, 0,`
			`sig_child_handler,`
			`ctdb);`
			`if (se == NULL) {`
			`DEBUG(DEBUG_CRIT,("Failed to set up signal handler for SIGCHLD\n"));`
			`exit(1);`
			`}`

added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`/* go into a wait loop to allow other nodes to complete */`
			`event_loop_wait(ctdb->ev);`

merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_CRIT,("event_loop_wait() returned. this should not happen\n"));`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`exit(1);`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`}`

factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`/*`
			`allocate a packet for use in daemon<->daemon communication`
			`*/`
			`struct ctdb_req_header _ctdb_transport_allocate(struct ctdb_context ctdb,`
			`TALLOC_CTX *mem_ctx,`
			`enum ctdb_operation operation,`
			`size_t length, size_t slength,`
			`const char *type)`
			`{`
			`int size;`
			`struct ctdb_req_header *hdr;`
don't zero beyond packet header unnecessarily (This used to be ctdb commit 4cf88ca2ce81db8fe10b0dfedb81d99a2bd93328) 2007-05-03 07:44:27 +04:00
			`length = MAX(length, slength);`
			`size = (length+(CTDB_DS_ALIGNMENT-1)) & ~(CTDB_DS_ALIGNMENT-1);`

ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Unable to allocate transport packet for operation %u of length %u. Transport is DOWN.\n",`
			`operation, (unsigned)length));`
			`return NULL;`
			`}`

factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`hdr = (struct ctdb_req_header *)ctdb->methods->allocate_pkt(mem_ctx, size);`
			`if (hdr == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Unable to allocate transport packet for operation %u of length %u\n",`
fixed more warnings on 64 bit boxes (This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5) 2007-05-29 07:58:41 +04:00			`operation, (unsigned)length));`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`return NULL;`
			`}`
			`talloc_set_name_const(hdr, type);`
don't zero beyond packet header unnecessarily (This used to be ctdb commit 4cf88ca2ce81db8fe10b0dfedb81d99a2bd93328) 2007-05-03 07:44:27 +04:00			`memset(hdr, 0, slength);`
			`hdr->length = length;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`hdr->operation = operation;`
			`hdr->ctdb_magic = CTDB_MAGIC;`
			`hdr->ctdb_version = CTDB_VERSION;`
			`hdr->generation = ctdb->vnn_map->generation;`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`hdr->srcnode = ctdb->pnn;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00
			`return hdr;`
make normal/deamon mode controllable by a ctdb flag so that the api looks the same in both modes to a client. send the correct structure back to a client assorted other cleanups (tests/test1.sh now works in daemon mode) (This used to be ctdb commit f4593754cab750dfdb9384884502e2e1b8fde1f0) 2007-04-10 00:03:39 +04:00			`}`

added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`struct daemon_control_state {`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`struct daemon_control_state next, prev;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`struct ctdb_client *client;`
			`struct ctdb_req_control *c;`
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`uint32_t reqid;`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`struct ctdb_node *node;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`};`

			`/*`
			`callback when a control reply comes in`
			`*/`
			`static void daemon_control_callback(struct ctdb_context *ctdb,`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`int32_t status, TDB_DATA data,`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`const char *errormsg,`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`void *private_data)`
			`{`
			`struct daemon_control_state *state = talloc_get_type(private_data,`
			`struct daemon_control_state);`
			`struct ctdb_client *client = state->client;`
			`struct ctdb_reply_control *r;`
			`size_t len;`

			`/* construct a message to send to the client containing the data */`
got rid of the getdbpath call (This used to be ctdb commit 736ce5c00a1d1b47abb44c4b262b14bfba5202b1) 2007-04-27 01:10:35 +04:00			`len = offsetof(struct ctdb_reply_control, data) + data.dsize;`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`if (errormsg) {`
			`len += strlen(errormsg);`
			`}`
fixed a memory leak in the ctdb_control code (This used to be ctdb commit 70aa77a66bb5f16c93ecb122b92a6e63f6343ab1) 2007-05-02 23:51:46 +04:00			`r = ctdbd_allocate_pkt(ctdb, state, CTDB_REPLY_CONTROL, len,`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`struct ctdb_reply_control);`
			`CTDB_NO_MEMORY_VOID(ctdb, r);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`r->hdr.reqid = state->reqid;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`r->status = status;`
			`r->datalen = data.dsize;`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`r->errorlen = 0;`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`memcpy(&r->data[0], data.dptr, data.dsize);`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`if (errormsg) {`
			`r->errorlen = strlen(errormsg);`
			`memcpy(&r->data[r->datalen], errormsg, r->errorlen);`
			`}`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
			`daemon_queue_send(client, &r->hdr);`

			`talloc_free(state);`
			`}`

timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`/*`
			`fail all pending controls to a disconnected node`
			`*/`
			`void ctdb_daemon_cancel_controls(struct ctdb_context ctdb, struct ctdb_node node)`
			`{`
			`struct daemon_control_state *state;`
			`while ((state = node->pending_controls)) {`
			`DLIST_REMOVE(node->pending_controls, state);`
			`daemon_control_callback(ctdb, (uint32_t)-1, tdb_null,`
			`"node is disconnected", state);`
			`}`
			`}`

			`/*`
			`destroy a daemon_control_state`
			`*/`
			`static int daemon_control_destructor(struct daemon_control_state *state)`
			`{`
			`if (state->node) {`
			`DLIST_REMOVE(state->node->pending_controls, state);`
			`}`
			`return 0;`
			`}`

added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`/*`
			`this is called when the ctdb daemon received a ctdb request control`
			`from a local client over the unix domain socket`
			`*/`
			`static void daemon_request_control_from_client(struct ctdb_client *client,`
			`struct ctdb_req_control *c)`
			`{`
			`TDB_DATA data;`
			`int res;`
			`struct daemon_control_state *state;`
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00			`TALLOC_CTX *tmp_ctx = talloc_new(client);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`if (c->hdr.destnode == CTDB_CURRENT_NODE) {`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`c->hdr.destnode = client->ctdb->pnn;`
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`}`

added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`state = talloc(client, struct daemon_control_state);`
			`CTDB_NO_MEMORY_VOID(client->ctdb, state);`

			`state->client = client;`
			`state->c = talloc_steal(state, c);`
added a ctdb_get_config call added a ctdb ping control (This used to be ctdb commit 7d17378b6e6076a922cffe98239e20dfbbae3bf7) 2007-04-26 21:27:07 +04:00			`state->reqid = c->hdr.reqid;`
change ctdb_validate_vnn to ctdb_validate_pnn (This used to be ctdb commit a4a1f41b69475b9dc16d8fd7f8965c32e96c32f0) 2007-09-04 04:09:58 +04:00			`if (ctdb_validate_pnn(client->ctdb, c->hdr.destnode)) {`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00			`state->node = client->ctdb->nodes[c->hdr.destnode];`
			`DLIST_ADD(state->node->pending_controls, state);`
			`} else {`
			`state->node = NULL;`
			`}`

			`talloc_set_destructor(state, daemon_control_destructor);`
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00
			`if (c->flags & CTDB_CTRL_FLAG_NOREPLY) {`
			`talloc_steal(tmp_ctx, state);`
			`}`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00
			`data.dptr = &c->data[0];`
			`data.dsize = c->datalen;`
			`res = ctdb_daemon_send_control(client->ctdb, c->hdr.destnode,`
- changed the REQ_REGISTER PDU to be a control - allow controls to know which client invoked them - added a client_id to clients, so they can be identified remotely - added the ability to remove registered srvids - in the list_keys code, register a temp srvid, then remove it afterwards (This used to be ctdb commit 29603c51cc6d81362532cd8e50f75c8360c5f5ef) 2007-05-04 05:41:29 +04:00			`c->srvid, c->opcode, client->client_id,`
			`c->flags,`
changed the way set_call and attach are done so that you can safely attach to databases after the protocol has started. The daemon broadcasts information on new databases to the other daemons. This also eliminates the need for the client to know about the hash between db name and db_id. (This used to be ctdb commit 3bad91a9d987d4c09fe3322eac23c2733660ad08) 2007-04-30 17:31:40 +04:00			`data, daemon_control_callback,`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`state);`
			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to send control to remote node %u\n",`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`c->hdr.destnode));`
			`}`
timeout pending controls immediately when a node becomes disconnected (This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0) 2007-05-18 17:48:29 +04:00
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00			`talloc_free(tmp_ctx);`
added a ctdb control message, and tool (This used to be ctdb commit 0d7a71f35bb8ce95231f8ca1e8e3e4024fe657e5) 2007-04-26 16:27:49 +04:00			`}`
changed the way set_call and attach are done so that you can safely attach to databases after the protocol has started. The daemon broadcasts information on new databases to the other daemons. This also eliminates the need for the client to know about the hash between db name and db_id. (This used to be ctdb commit 3bad91a9d987d4c09fe3322eac23c2733660ad08) 2007-04-30 17:31:40 +04:00
			`/*`
			`register a call function`
			`*/`
			`int ctdb_daemon_set_call(struct ctdb_context *ctdb, uint32_t db_id,`
			`ctdb_fn_t fn, int id)`
			`{`
			`struct ctdb_registered_call *call;`
			`struct ctdb_db_context *ctdb_db;`

			`ctdb_db = find_ctdb_db(ctdb, db_id);`
			`if (ctdb_db == NULL) {`
			`return -1;`
			`}`

			`call = talloc(ctdb_db, struct ctdb_registered_call);`
			`call->fn = fn;`
			`call->id = id;`

			`DLIST_ADD(ctdb_db->calls, call);`
			`return 0;`
			`}`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00


			`/*`
			`this local messaging handler is ugly, but is needed to prevent`
			`recursion in ctdb_send_message() when the destination node is the`
			`same as the source node`
			`*/`
			`struct ctdb_local_message {`
			`struct ctdb_context *ctdb;`
			`uint64_t srvid;`
			`TDB_DATA data;`
			`};`

			`static void ctdb_local_message_trigger(struct event_context ev, struct timed_event te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_local_message *m = talloc_get_type(private_data,`
			`struct ctdb_local_message);`
			`int res;`

			`res = ctdb_dispatch_message(m->ctdb, m->srvid, m->data);`
			`if (res != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " Failed to dispatch message for srvid=%llu\n",`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`(unsigned long long)m->srvid));`
			`}`
			`talloc_free(m);`
			`}`

			`static int ctdb_local_message(struct ctdb_context *ctdb, uint64_t srvid, TDB_DATA data)`
			`{`
			`struct ctdb_local_message *m;`
			`m = talloc(ctdb, struct ctdb_local_message);`
			`CTDB_NO_MEMORY(ctdb, m);`

			`m->ctdb = ctdb;`
			`m->srvid = srvid;`
			`m->data = data;`
			`m->data.dptr = talloc_memdup(m, m->data.dptr, m->data.dsize);`
			`if (m->data.dptr == NULL) {`
			`talloc_free(m);`
			`return -1;`
			`}`

			`/* this needs to be done as an event to prevent recursion */`
			`event_add_timed(ctdb->ev, m, timeval_zero(), ctdb_local_message_trigger, m);`
			`return 0;`
			`}`

			`/*`
			`send a ctdb message`
			`*/`
change debug output from vnn to pnn change ctdb_daemon_send_message to take pnn as parameter isntead of vnn (This used to be ctdb commit e352a2bbf9bb9a0b2c4f8329e8a529cf02414097) 2007-09-04 04:45:41 +04:00			`int ctdb_daemon_send_message(struct ctdb_context *ctdb, uint32_t pnn,`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`uint64_t srvid, TDB_DATA data)`
			`{`
			`struct ctdb_req_message *r;`
			`int len;`

dont even try to send a message from the main daemon if the transport is down (This used to be ctdb commit 9a2c4c3ed09ac9ea781d06999d11e5c3b5b4a97a) 2009-06-30 06:09:28 +04:00			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Failed to send message. Transport is DOWN\n"));`
			`return -1;`
			`}`

start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`/* see if this is a message to ourselves */`
change debug output from vnn to pnn change ctdb_daemon_send_message to take pnn as parameter isntead of vnn (This used to be ctdb commit e352a2bbf9bb9a0b2c4f8329e8a529cf02414097) 2007-09-04 04:45:41 +04:00			`if (pnn == ctdb->pnn) {`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`return ctdb_local_message(ctdb, srvid, data);`
			`}`

			`len = offsetof(struct ctdb_req_message, data) + data.dsize;`
			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REQ_MESSAGE, len,`
			`struct ctdb_req_message);`
			`CTDB_NO_MEMORY(ctdb, r);`

change debug output from vnn to pnn change ctdb_daemon_send_message to take pnn as parameter isntead of vnn (This used to be ctdb commit e352a2bbf9bb9a0b2c4f8329e8a529cf02414097) 2007-09-04 04:45:41 +04:00			`r->hdr.destnode = pnn;`
start splitting the code into separate client and server pieces (This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059) 2007-06-07 16:06:19 +04:00			`r->srvid = srvid;`
			`r->datalen = data.dsize;`
			`memcpy(&r->data[0], data.dptr, data.dsize);`

			`ctdb_queue_packet(ctdb, &r->hdr);`

			`talloc_free(r);`
			`return 0;`
			`}`

1038 lines 28 KiB C Raw Normal View History Unescape Escape

1038 lines

28 KiB

C

Raw Normal View History