samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-25 23:21:54 +03:00

858 lines

25 KiB

C

Raw Normal View History

- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`/*`
			`ctdb_call protocol code`

			`Copyright (C) Andrew Tridgell 2006`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`

			`This program is distributed in the hope that it will be useful,`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`

			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`*/`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`/*`
			`see http://wiki.samba.org/index.php/Samba_%26_Clustering for`
			`protocol design and packet details`
			`*/`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`#include "includes.h"`
event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version 7f29f817fa939ef1bbb740584f09e76e2ecd5b06. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726) 2010-08-18 03:46:31 +04:00			`#include "lib/tevent/tevent.h"`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`#include "lib/tdb/include/tdb.h"`
some #include cleanups (This used to be ctdb commit 1a07d87122d51a40cd8ad5fe13533298c26857cb) 2007-06-07 16:26:27 +04:00			`#include "lib/util/dlinklist.h"`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`#include "system/network.h"`
			`#include "system/filesys.h"`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`#include "../include/ctdb_private.h"`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`/*`
			`find the ctdb_db from a db index`
			`*/`
			`struct ctdb_db_context find_ctdb_db(struct ctdb_context ctdb, uint32_t id)`
			`{`
			`struct ctdb_db_context *ctdb_db;`

			`for (ctdb_db=ctdb->db_list; ctdb_db; ctdb_db=ctdb_db->next) {`
			`if (ctdb_db->db_id == id) {`
			`break;`
			`}`
			`}`
			`return ctdb_db;`
			`}`


make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`/*`
			`a varient of input packet that can be used in lock requeue`
			`*/`
clean out some more cruft (This used to be ctdb commit ad16c5fe2748b48a6f6c79976359d56d9bed33f4) 2007-06-05 11:57:07 +04:00			`static void ctdb_call_input_pkt(void p, struct ctdb_req_header hdr)`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`{`
			`struct ctdb_context *ctdb = talloc_get_type(p, struct ctdb_context);`
			`ctdb_input_pkt(ctdb, hdr);`
			`}`


next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`/*`
			`send an error reply`
			`*/`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`static void ctdb_send_error(struct ctdb_context *ctdb,`
			`struct ctdb_req_header *hdr, uint32_t status,`
			`const char *fmt, ...) PRINTF_ATTRIBUTE(4,5);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`static void ctdb_send_error(struct ctdb_context *ctdb,`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`struct ctdb_req_header *hdr, uint32_t status,`
			`const char *fmt, ...)`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`{`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`va_list ap;`
			`struct ctdb_reply_error *r;`
			`char *msg;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`int msglen, len;`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
dont try to send error packets if the transport is down (This used to be ctdb commit 65b94d280731df3245b26d69f39acfaf5bccf0d8) 2009-06-30 06:10:27 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed to send error. Transport is DOWN\n"));`
if we fail a dmaster migration due to the transport being down, then that is a fatal condition. (This used to be ctdb commit 75dea671f68ac6649095357c36b3697a927721e9) 2009-06-30 06:13:15 +04:00			`return;`
dont try to send error packets if the transport is down (This used to be ctdb commit 65b94d280731df3245b26d69f39acfaf5bccf0d8) 2009-06-30 06:10:27 +04:00			`}`

added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`va_start(ap, fmt);`
			`msg = talloc_vasprintf(ctdb, fmt, ap);`
			`if (msg == NULL) {`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_fatal(ctdb, "Unable to allocate error in ctdb_send_error\n");`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`}`
			`va_end(ap);`

merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`msglen = strlen(msg)+1;`
			`len = offsetof(struct ctdb_reply_error, msg);`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, msg, CTDB_REPLY_ERROR, len + msglen,`
			`struct ctdb_reply_error);`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`r->hdr.destnode = hdr->srcnode;`
			`r->hdr.reqid = hdr->reqid;`
			`r->status = status;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`r->msglen = msglen;`
			`memcpy(&r->msg[0], msg, msglen);`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_queue_packet(ctdb, &r->hdr);`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`talloc_free(msg);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`}`

added redirect handling (This used to be ctdb commit 3c1dc8b98c8e843c44a172ac15e67f4ab8c47500) 2006-12-18 06:44:06 +03:00
server: add a comment explaining the call redirect logic in ctdb_call_send_redirect(). (This used to be ctdb commit 81663b81687c0ba681500cca6aa8174bb9587ad2) 2010-11-24 10:01:01 +03:00			`/**`
			`* send a redirect reply`
			`*`
			`* The logic behind this function is this:`
			`*`
			`* A client wants to grab a record and sends a CTDB_REQ_CALL packet`
			`* to its local ctdb (ctdb_request_call). If the node is not itself`
			`* the record's DMASTER, it first redirects the packet to the`
			`* record's LMASTER. The LMASTER then redirects the call packet to`
			`* the current DMASTER. But there is a race: The record may have`
			`* been migrated off the DMASTER while the redirected packet is`
			`* on the wire (or in the local queue). So in case the record has`
			`* migrated off the new destinaton of the call packet, instead of`
			`* going back to the LMASTER to get the new DMASTER, we try to`
			`* reduce rountrips by fist chasing the record a couple of times`
			`* before giving up the direct chase and finally going back to the`
			`* LMASTER (again). Note that this works because auf this: When`
			`* a record is migrated off a node, then the new DMASTER is stored`
			`* in the record's copy on the former DMASTER.`
			`*`
			`* The maxiumum number of attempts for direct chase to make before`
			`* going back to the LMASTER is configurable by the tunable`
			`* "MaxRedirectCount".`
			`*/`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`static void ctdb_call_send_redirect(struct ctdb_context *ctdb,`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`TDB_DATA key,`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`struct ctdb_req_call *c,`
			`struct ctdb_ltdb_header *header)`
			`{`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00
			`uint32_t lmaster = ctdb_lmaster(ctdb, &key);`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (ctdb->pnn == lmaster) {`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`c->hdr.destnode = header->dmaster;`
- start moving tunable variables into their own structure - fixed the test scripts to use a separate dbdir (This used to be ctdb commit 396752e8908c48373564e915e2d49cfc9ff61eba) 2007-06-04 11:46:37 +04:00			`} else if ((c->hopcount % ctdb->tunable.max_redirect_count) == 0) {`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`c->hdr.destnode = lmaster;`
added a hopcount in ctdb_call (This used to be ctdb commit 36d838801a2a2008c50322cdbfff65a308b1cd1a) 2007-05-01 07:25:02 +04:00			`} else {`
			`c->hdr.destnode = header->dmaster;`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`}`
added a hopcount in ctdb_call (This used to be ctdb commit 36d838801a2a2008c50322cdbfff65a308b1cd1a) 2007-05-01 07:25:02 +04:00			`c->hopcount++;`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`ctdb_queue_packet(ctdb, &c->hdr);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`}`

- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00
			`/*`
			`send a dmaster reply`

			`caller must have the chainlock before calling this routine. Caller must be`
			`the lmaster`
			`*/`
			`static void ctdb_send_dmaster_reply(struct ctdb_db_context *ctdb_db,`
			`struct ctdb_ltdb_header *header,`
			`TDB_DATA key, TDB_DATA data,`
			`uint32_t new_dmaster,`
			`uint32_t reqid)`
			`{`
			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
			`struct ctdb_reply_dmaster *r;`
			`int ret, len;`
			`TALLOC_CTX *tmp_ctx;`

change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (ctdb->pnn != ctdb_lmaster(ctdb, &key)) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,(__location__ " Caller is not lmaster!\n"));`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`return;`
			`}`

			`header->dmaster = new_dmaster;`
			`ret = ctdb_ltdb_store(ctdb_db, key, header, data);`
			`if (ret != 0) {`
if we fail a dmaster migration due to the transport being down, then that is a fatal condition. (This used to be ctdb commit 75dea671f68ac6649095357c36b3697a927721e9) 2009-06-30 06:13:15 +04:00			`ctdb_fatal(ctdb, "ctdb_send_dmaster_reply unable to update dmaster");`
			`return;`
			`}`

			`if (ctdb->methods == NULL) {`
server:ctdb_send_dmaster_reply: fix a message typo. Michael (This used to be ctdb commit aa63f728152c37e31cecf2258efcdc8cf5ac0092) 2010-01-06 16:59:23 +03:00			`ctdb_fatal(ctdb, "ctdb_send_dmaster_reply cant update dmaster since transport is down");`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`return;`
			`}`

			`/* put the packet on a temporary context, allowing us to safely free`
			`it below even if ctdb_reply_dmaster() has freed it already */`
			`tmp_ctx = talloc_new(ctdb);`

			`/* send the CTDB_REPLY_DMASTER */`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`len = offsetof(struct ctdb_reply_dmaster, data) + key.dsize + data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, tmp_ctx, CTDB_REPLY_DMASTER, len,`
			`struct ctdb_reply_dmaster);`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`

			`r->hdr.destnode = new_dmaster;`
			`r->hdr.reqid = reqid;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`r->rsn = header->rsn;`
			`r->keylen = key.dsize;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`r->datalen = data.dsize;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`r->db_id = ctdb_db->db_id;`
			`memcpy(&r->data[0], key.dptr, key.dsize);`
			`memcpy(&r->data[key.dsize], data.dptr, data.dsize);`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00
			`ctdb_queue_packet(ctdb, &r->hdr);`

			`talloc_free(tmp_ctx);`
			`}`

added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`/*`
			`send a dmaster request (give another node the dmaster for a record)`

			`This is always sent to the lmaster, which ensures that the lmaster`
			`always knows who the dmaster is. The lmaster will then send a`
			`CTDB_REPLY_DMASTER to the new dmaster`
			`*/`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`static void ctdb_call_send_dmaster(struct ctdb_db_context *ctdb_db,`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`struct ctdb_req_call *c,`
			`struct ctdb_ltdb_header *header,`
			`TDB_DATA key, TDB_DATA data)`
			`{`
			`struct ctdb_req_dmaster *r;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`int len;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`uint32_t lmaster = ctdb_lmaster(ctdb, key);`

failing a dmaster send due to the transport being down is fatal (This used to be ctdb commit c17dafc79bec25bbb796478c33f503503d382a20) 2009-06-30 06:14:58 +04:00			`if (ctdb->methods == NULL) {`
			`ctdb_fatal(ctdb, "Failed ctdb_call_send_dmaster since transport is down");`
			`return;`
			`}`

server: when we migrate off a record with data, set the MIGRATED_WITH_DATA flag (This used to be ctdb commit f5fb232117886186066ab3430fdd2307cba94960) 2010-12-03 17:21:51 +03:00			`if (data->dsize != 0) {`
			`header->flags \|= CTDB_REC_FLAG_MIGRATED_WITH_DATA;`
			`}`

change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (lmaster == ctdb->pnn) {`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`ctdb_send_dmaster_reply(ctdb_db, header, key, data,`
			`c->hdr.srcnode, c->hdr.reqid);`
			`return;`
			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`len = offsetof(struct ctdb_req_dmaster, data) + key->dsize + data->dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REQ_DMASTER, len,`
			`struct ctdb_req_dmaster);`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`r->hdr.destnode = lmaster;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`r->hdr.reqid = c->hdr.reqid;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`r->db_id = c->db_id;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`r->rsn = header->rsn;`
merge fetch code from ronnie, and add a simple fetch test (This used to be ctdb commit 83b794befd8d34b3da544a483f9d39a3fa140655) 2007-04-05 07:18:31 +04:00			`r->dmaster = c->hdr.srcnode;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`r->keylen = key->dsize;`
			`r->datalen = data->dsize;`
			`memcpy(&r->data[0], key->dptr, key->dsize);`
			`memcpy(&r->data[key->dsize], data->dptr, data->dsize);`

- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`header->dmaster = c->hdr.srcnode;`
Revert "Add a new header flag for "migrated with data" and set this to 1" This reverts commit a8cc35191df1cd4b866897df71d317ce5f198cb5. (This used to be ctdb commit 7c37435fb517a621c45b21a21b4eb15f8bbd3c83) 2010-12-13 06:23:32 +03:00			`if (ctdb_ltdb_store(ctdb_db, key, header, data) != 0) {`
			`ctdb_fatal(ctdb, "Failed to store record in ctdb_call_send_dmaster");`
check for error on ctdb_ltdb_store (This used to be ctdb commit c4a34bac4ad4d2f9699e08074668d25586e3c0da) 2007-05-15 04:16:59 +04:00			`}`
Revert "Add a new header flag for "migrated with data" and set this to 1" This reverts commit a8cc35191df1cd4b866897df71d317ce5f198cb5. (This used to be ctdb commit 7c37435fb517a621c45b21a21b4eb15f8bbd3c83) 2010-12-13 06:23:32 +03:00
- fixed a problem with packets to ourselves. The packets were being processed immediately, but the input routines indirectly assumed they were being called as a new event (for example, a calling routine might queue the packet, then afterwards modify the ltdb record). The solution was to make self packets queue via a zero timeout. - fixed unlinking of the socket in a exit in the lockwait code. Needed an _exit instead of exit so atexit() doesn't trigger - print latency of lockwait delays (This used to be ctdb commit 1b0684b4f6a976f4c5fe54394ac54d121810b298) 2007-04-20 11:58:37 +04:00			`ctdb_queue_packet(ctdb, &r->hdr);`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
			`talloc_free(r);`
			`}`

fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`/*`
			`called when a CTDB_REPLY_DMASTER packet comes in, or when the lmaster`
			`gets a CTDB_REQUEST_DMASTER for itself. We become the dmaster.`

			`must be called with the chainlock held. This function releases the chainlock`
			`*/`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`static void ctdb_become_dmaster(struct ctdb_db_context *ctdb_db,`
pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`struct ctdb_req_header *hdr,`
			`TDB_DATA key, TDB_DATA data,`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`uint64_t rsn)`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`{`
			`struct ctdb_call_state *state;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
			`struct ctdb_ltdb_header header;`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`int ret;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
Reduce the log level for two debug messages DEBUG(DEBUG_DEBUG,("pnn %u dmaster response %08x\n", ctdb->pnn, ctdb_has DEBUG(DEBUG_DEBUG,("pnn %u dmaster request on %08x for %u from %u\n", (This used to be ctdb commit a3473e7a445b14520a49585c460429dfbfe1fce0) 2010-02-11 03:49:48 +03:00			`DEBUG(DEBUG_DEBUG,("pnn %u dmaster response %08x\n", ctdb->pnn, ctdb_hash(&key)));`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`ZERO_STRUCT(header);`
- merge from ronnie - increment rsn only in become_dmaster - add torture check for rsn regression in ctdb_ltdb_store (This used to be ctdb commit 8047506a08bb53ee01aa64f25c9f72839e1e2d68) 2007-05-11 04:33:43 +04:00			`header.rsn = rsn + 1;`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`header.dmaster = ctdb->pnn;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`if (ctdb_ltdb_store(ctdb_db, key, &header, data) != 0) {`
			`ctdb_fatal(ctdb, "ctdb_reply_dmaster store failed\n");`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`return;`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`state = ctdb_reqid_find(ctdb, hdr->reqid, struct ctdb_call_state);`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("pnn %u Invalid reqid %u in ctdb_become_dmaster from node %u\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, hdr->reqid, hdr->srcnode));`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`return;`
			`}`

idr can timeout and wrap/be reused quite quickly. If a noremote node hangs for an extended period, it is possible that we might have a DMASTER request in flight for record A to that node. Eventually we will reuse the idr, and may reuse it for a DMASTER request to a different node for a different record B. If while the request for B is in flight, the first tnode un-hangs and responds back we would receive a dmaster reply for the wrong record. This would cause a record to become perpetually locked, since inside the daemon we would tdb_chainlock(dmaster_reply->pdu->key) but once the migration would complete we would chainunlock idr->state->call->key Adding code to verify that when we receive a dmaster reply packet that it does in fact match the exact same key that the state variable we have for the idr in flight. (This used to be ctdb commit 2f6a870d7ff02ceb61fde242f752dccbfcb4cb37) 2010-06-09 10:12:36 +04:00			`if (key.dsize != state->call->key.dsize \|\| memcmp(key.dptr, state->call->key.dptr, key.dsize)) {`
fix a debug message (This used to be ctdb commit 856bd6de6218d9b70baed0e6443be4253ea31afe) 2010-06-09 10:22:01 +04:00			`DEBUG(DEBUG_ERR, ("Got bogus DMASTER packet reqid:%u from node %u. Key does not match key held in matching idr.\n", hdr->reqid, hdr->srcnode));`
idr can timeout and wrap/be reused quite quickly. If a noremote node hangs for an extended period, it is possible that we might have a DMASTER request in flight for record A to that node. Eventually we will reuse the idr, and may reuse it for a DMASTER request to a different node for a different record B. If while the request for B is in flight, the first tnode un-hangs and responds back we would receive a dmaster reply for the wrong record. This would cause a record to become perpetually locked, since inside the daemon we would tdb_chainlock(dmaster_reply->pdu->key) but once the migration would complete we would chainunlock idr->state->call->key Adding code to verify that when we receive a dmaster reply packet that it does in fact match the exact same key that the state variable we have for the idr in flight. (This used to be ctdb commit 2f6a870d7ff02ceb61fde242f752dccbfcb4cb37) 2010-06-09 10:12:36 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
			`return;`
			`}`

pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`if (hdr->reqid != state->reqid) {`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`/* we found a record but it was the wrong one */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, ("Dropped orphan in ctdb_become_dmaster with reqid:%u\n from node %u", hdr->reqid, hdr->srcnode));`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`return;`
			`}`

Remove LACOUNT and LACCESSOR and migrate the records immediately. This concept didnt work out and it is really just as expensive as a full migration anyway, without the benefit of caching the data for subsequence accesses. Now, migrate the records immediately on first access. This will be combined with a "cheap vacuum-lite" for special empty records to prevent growth of databases. Later extensions to mimic read-only behaviour of records will include proper shared read-only locking of database records, making the laccessor/lacount read-only access to the data obsolete anyway. By removing this special case and handling of lacount laccessor makes the codapath where shared read-only locking will be be implemented simpler, and frees up space in the ctdb_ltdb header for use by vacuuming flags as well as read-only locking flags. (This used to be ctdb commit 155dd1f4885fe142c6f8bd09430f65daf8a17e51) 2010-11-29 05:07:59 +03:00			`ctdb_call_local(ctdb_db, state->call, &header, state, &data);`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
idr can timeout and wrap/be reused quite quickly. If a noremote node hangs for an extended period, it is possible that we might have a DMASTER request in flight for record A to that node. Eventually we will reuse the idr, and may reuse it for a DMASTER request to a different node for a different record B. If while the request for B is in flight, the first tnode un-hangs and responds back we would receive a dmaster reply for the wrong record. This would cause a record to become perpetually locked, since inside the daemon we would tdb_chainlock(dmaster_reply->pdu->key) but once the migration would complete we would chainunlock idr->state->call->key Adding code to verify that when we receive a dmaster reply packet that it does in fact match the exact same key that the state variable we have for the idr in flight. (This used to be ctdb commit 2f6a870d7ff02ceb61fde242f752dccbfcb4cb37) 2010-06-09 10:12:36 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, state->call->key);`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
			`state->state = CTDB_CALL_DONE;`
			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
			`}`


added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
			`/*`
			`called when a CTDB_REQ_DMASTER packet comes in`

			`this comes into the lmaster for a record when the current dmaster`
			`wants to give up the dmaster role and give it to someone else`
			`*/`
			`void ctdb_request_dmaster(struct ctdb_context ctdb, struct ctdb_req_header hdr)`
			`{`
			`struct ctdb_req_dmaster c = (struct ctdb_req_dmaster )hdr;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`TDB_DATA key, data, data2;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`struct ctdb_ltdb_header header;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_db_context *ctdb_db;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`int ret;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
			`key.dptr = c->data;`
			`key.dsize = c->keylen;`
			`data.dptr = c->data + c->keylen;`
			`data.dsize = c->datalen;`

initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`ctdb_db = find_ctdb_db(ctdb, c->db_id);`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`if (!ctdb_db) {`
Fix uninitialized variable warnings (This used to be ctdb commit b84f97adfd25b2fbfab1c7964b68931643e8029c) 2007-04-11 14:49:10 +04:00			`ctdb_send_error(ctdb, hdr, -1,`
			`"Unknown database in request. db_id==0x%08x",`
			`c->db_id);`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`return;`
			`}`
merge fetch code from ronnie, and add a simple fetch test (This used to be ctdb commit 83b794befd8d34b3da544a483f9d39a3fa140655) 2007-04-05 07:18:31 +04:00
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`/* fetch the current record */`
			`ret = ctdb_ltdb_lock_fetch_requeue(ctdb_db, key, &header, hdr, &data2,`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`ctdb_call_input_pkt, ctdb, False);`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`if (ret == -1) {`
			`ctdb_fatal(ctdb, "ctdb_req_dmaster failed to fetch record");`
			`return;`
			`}`
			`if (ret == -2) {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " deferring ctdb_request_dmaster\n"));`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`return;`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (ctdb_lmaster(ctdb, &key) != ctdb->pnn) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("pnn %u dmaster request to non-lmaster lmaster=%u gen=%u curgen=%u\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, ctdb_lmaster(ctdb, &key),`
log the generation numbers to give a hint about this bug (This used to be ctdb commit 12018494baa33c5f6c52e6eae94ac77a56d3e5a0) 2007-07-08 13:36:55 +04:00			`hdr->generation, ctdb->vnn_map->generation));`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`ctdb_fatal(ctdb, "ctdb_req_dmaster to non-lmaster");`
			`}`

Reduce the log level for two debug messages DEBUG(DEBUG_DEBUG,("pnn %u dmaster response %08x\n", ctdb->pnn, ctdb_has DEBUG(DEBUG_DEBUG,("pnn %u dmaster request on %08x for %u from %u\n", (This used to be ctdb commit a3473e7a445b14520a49585c460429dfbfe1fce0) 2010-02-11 03:49:48 +03:00			`DEBUG(DEBUG_DEBUG,("pnn %u dmaster request on %08x for %u from %u\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, ctdb_hash(&key), c->dmaster, c->hdr.srcnode));`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`/* its a protocol error if the sending node is not the current dmaster */`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`if (header.dmaster != hdr->srcnode) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("pnn %u dmaster request for new-dmaster %u from non-master %u real-dmaster=%u key %08x dbid 0x%08x gen=%u curgen=%u c->rsn=%llu header.rsn=%llu reqid=%u keyval=0x%08x\n",`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`ctdb->pnn, c->dmaster, hdr->srcnode, header.dmaster, ctdb_hash(&key),`
			`ctdb_db->db_id, hdr->generation, ctdb->vnn_map->generation,`
			`(unsigned long long)c->rsn, (unsigned long long)header.rsn, c->hdr.reqid,`
			`(key.dsize >= 4)?((uint32_t )key.dptr):0));`
Revert "From Wolfgang M." This reverts commit 5b70fa8cfd5916d3c212823ad5cc1b251ae175ed. (This used to be ctdb commit 363e7e939ad46b3f75c83c30d4163d63876c2456) 2009-10-29 05:44:12 +03:00			`if (header.rsn != 0 \|\| header.dmaster != ctdb->pnn) {`
ctdb_req_dmaster from non-master If we find a situatior where we get a stray packet with the wrong dmaster, dont suicide with ctdb_fatal() since this is too disruptive. Just drop the stray packet and force a recovery to make sure all is good again. CQ S1022004 (This used to be ctdb commit 62b7fe853db37c0a90e48a0332a3426a8dcb4ed8) 2011-02-18 03:21:19 +03:00			`DEBUG(DEBUG_ERR,("ctdb_req_dmaster from non-master. Force a recovery.\n"));`

			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
Revert "From Wolfgang M." This reverts commit 5b70fa8cfd5916d3c212823ad5cc1b251ae175ed. (This used to be ctdb commit 363e7e939ad46b3f75c83c30d4163d63876c2456) 2009-10-29 05:44:12 +03:00			`return;`
			`}`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`}`

			`if (header.rsn > c->rsn) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("pnn %u dmaster request with older RSN new-dmaster %u from %u real-dmaster=%u key %08x dbid 0x%08x gen=%u curgen=%u c->rsn=%llu header.rsn=%llu reqid=%u\n",`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`ctdb->pnn, c->dmaster, hdr->srcnode, header.dmaster, ctdb_hash(&key),`
			`ctdb_db->db_id, hdr->generation, ctdb->vnn_map->generation,`
			`(unsigned long long)c->rsn, (unsigned long long)header.rsn, c->hdr.reqid));`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`

ensure we propogate the correct rsn for a request dmaster (This used to be ctdb commit 70c1c67db865db8a49b56e8e3e8fd56ec5063208) 2007-05-12 13:55:18 +04:00			`/* use the rsn from the sending node */`
			`header.rsn = c->rsn;`

fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`/* check if the new dmaster is the lmaster, in which case we`
			`skip the dmaster reply */`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (c->dmaster == ctdb->pnn) {`
pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`ctdb_become_dmaster(ctdb_db, hdr, key, data, c->rsn);`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`} else {`
			`ctdb_send_dmaster_reply(ctdb_db, &header, key, data, c->dmaster, hdr->reqid);`
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`


- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`/*`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`called when a CTDB_REQ_CALL packet comes in`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`*/`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`void ctdb_request_call(struct ctdb_context ctdb, struct ctdb_req_header hdr)`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`{`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`struct ctdb_req_call c = (struct ctdb_req_call )hdr;`
simplified ctdb_call() interface, and made it easier to expand with more parameters later (This used to be ctdb commit 6c816fe85e84faad167101bcf26850966c3044e5) 2007-01-25 08:13:17 +03:00			`TDB_DATA data;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`struct ctdb_reply_call *r;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`int ret, len;`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`struct ctdb_ltdb_header header;`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`struct ctdb_call *call;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_db_context *ctdb_db;`

Dont even try allocating and sending a CALL packet if the transport is down (This used to be ctdb commit cb8dd896914d4e44ad7b8bb000176a7c78f394ae) 2009-06-30 06:16:13 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed ctdb_request_call. Transport is DOWN\n"));`
Dont even try allocating and sending a CALL packet if the transport is down (This used to be ctdb commit cb8dd896914d4e44ad7b8bb000176a7c78f394ae) 2009-06-30 06:16:13 +04:00			`return;`
			`}`


initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`ctdb_db = find_ctdb_db(ctdb, c->db_id);`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`if (!ctdb_db) {`
Fix uninitialized variable warnings (This used to be ctdb commit b84f97adfd25b2fbfab1c7964b68931643e8029c) 2007-04-11 14:49:10 +04:00			`ctdb_send_error(ctdb, hdr, -1,`
			`"Unknown database in request. db_id==0x%08x",`
			`c->db_id);`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`return;`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`call = talloc(hdr, struct ctdb_call);`
			`CTDB_NO_MEMORY_FATAL(ctdb, call);`

			`call->call_id = c->callid;`
			`call->key.dptr = c->data;`
			`call->key.dsize = c->keylen;`
			`call->call_data.dptr = c->data + c->keylen;`
			`call->call_data.dsize = c->calldatalen;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`/* determine if we are the dmaster for this key. This also`
			`fetches the record data (if any), thus avoiding a 2nd fetch of the data`
			`if the call will be answered locally */`
fix a bug in new structure handling (This used to be ctdb commit 5f248d82717c8094f260ea16292996bb712df947) 2007-01-29 14:11:16 +03:00
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`ret = ctdb_ltdb_lock_fetch_requeue(ctdb_db, call->key, &header, hdr, &data,`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`ctdb_call_input_pkt, ctdb, False);`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00			`if (ret == -1) {`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`ctdb_send_error(ctdb, hdr, ret, "ltdb fetch failed in ctdb_request_call");`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`return;`
			`}`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00			`if (ret == -2) {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " deferred ctdb_request_call\n"));`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00			`return;`
			`}`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00
			`/* if we are not the dmaster, then send a redirect to the`
			`requesting node */`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (header.dmaster != ctdb->pnn) {`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`talloc_free(data.dptr);`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`ctdb_call_send_redirect(ctdb, call->key, c, &header);`
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`return;`
			`}`

Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_UPDATE_STAT(ctdb, max_hop_count, c->hopcount);`
added a hopcount in ctdb_call (This used to be ctdb commit 36d838801a2a2008c50322cdbfff65a308b1cd1a) 2007-05-01 07:25:02 +04:00
Remove LACOUNT and LACCESSOR and migrate the records immediately. This concept didnt work out and it is really just as expensive as a full migration anyway, without the benefit of caching the data for subsequence accesses. Now, migrate the records immediately on first access. This will be combined with a "cheap vacuum-lite" for special empty records to prevent growth of databases. Later extensions to mimic read-only behaviour of records will include proper shared read-only locking of database records, making the laccessor/lacount read-only access to the data obsolete anyway. By removing this special case and handling of lacount laccessor makes the codapath where shared read-only locking will be be implemented simpler, and frees up space in the ctdb_ltdb header for use by vacuuming flags as well as read-only locking flags. (This used to be ctdb commit 155dd1f4885fe142c6f8bd09430f65daf8a17e51) 2010-11-29 05:07:59 +03:00			`/* Try if possible to migrate the record off to the caller node.`
			`* From the clients perspective a fetch of the data is just as`
			`* expensive as a migration.`
			`*/`
			`if (c->hdr.srcnode != ctdb->pnn) {`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`if (ctdb_db->transaction_active) {`
call: lower the debug message "refusing migration while transction" to lvl INFO This gets just too noisy on a busy system. And it is purley informational anyways... Michael (This used to be ctdb commit 7f64a00c76203fdf6673c3f862a4bfd17fb848d7) 2009-12-09 15:43:38 +03:00			`DEBUG(DEBUG_INFO, (__location__ " refusing migration"`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`" of key %s while transaction is active\n",`
			`(char *)call->key.dptr));`
			`} else {`
Reducing the log level for a debug message DEBUG(DEBUG_DEBUG,("pnn %u starting migration of %08x t\ (This used to be ctdb commit 6ce4b21b00cce1530aff022584bf695c257a5d55) 2010-02-11 03:54:46 +03:00			`DEBUG(DEBUG_DEBUG,("pnn %u starting migration of %08x to %u\n",`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`ctdb->pnn, ctdb_hash(&(call->key)), c->hdr.srcnode));`
			`ctdb_call_send_dmaster(ctdb_db, c, &header, &(call->key), &data);`
			`talloc_free(data.dptr);`
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`return;`
			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`

Remove LACOUNT and LACCESSOR and migrate the records immediately. This concept didnt work out and it is really just as expensive as a full migration anyway, without the benefit of caching the data for subsequence accesses. Now, migrate the records immediately on first access. This will be combined with a "cheap vacuum-lite" for special empty records to prevent growth of databases. Later extensions to mimic read-only behaviour of records will include proper shared read-only locking of database records, making the laccessor/lacount read-only access to the data obsolete anyway. By removing this special case and handling of lacount laccessor makes the codapath where shared read-only locking will be be implemented simpler, and frees up space in the ctdb_ltdb header for use by vacuuming flags as well as read-only locking flags. (This used to be ctdb commit 155dd1f4885fe142c6f8bd09430f65daf8a17e51) 2010-11-29 05:07:59 +03:00			`ctdb_call_local(ctdb_db, call, &header, hdr, &data);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`len = offsetof(struct ctdb_reply_call, data) + call->reply_data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REPLY_CALL, len,`
			`struct ctdb_reply_call);`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`r->hdr.destnode = hdr->srcnode;`
			`r->hdr.reqid = hdr->reqid;`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`r->status = call->status;`
			`r->datalen = call->reply_data.dsize;`
			`if (call->reply_data.dsize) {`
			`memcpy(&r->data[0], call->reply_data.dptr, call->reply_data.dsize);`
merge status code changes from samba4 ctdb (This used to be ctdb commit 705a9f8e5238976aa5c8cd4a5371459650d8b553) 2007-01-29 14:30:06 +03:00			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_queue_packet(ctdb, &r->hdr);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
			`talloc_free(r);`
			`}`

			`/*`
			`called when a CTDB_REPLY_CALL packet comes in`
expanded some comments (This used to be ctdb commit cf544e986d5837cc878792af571bdb23cc487882) 2006-12-18 06:49:32 +03:00
			`This packet comes in response to a CTDB_REQ_CALL request packet. It`
typo (This used to be ctdb commit bf2799504498ae452bb7244ae3eb6a51797afe9b) 2007-04-17 23:23:22 +04:00			`contains any reply data from the call`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`*/`
			`void ctdb_reply_call(struct ctdb_context ctdb, struct ctdb_req_header hdr)`
			`{`
			`struct ctdb_reply_call c = (struct ctdb_reply_call )hdr;`
			`struct ctdb_call_state *state;`

split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`state = ctdb_reqid_find(ctdb, hdr->reqid, struct ctdb_call_state);`
Some more debug and two memleak fixes (This used to be ctdb commit 1e2802422794956827263265306952df5e69b377) 2007-04-18 01:03:30 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " reqid %u not found\n", hdr->reqid));`
Some more debug and two memleak fixes (This used to be ctdb commit 1e2802422794956827263265306952df5e69b377) 2007-04-18 01:03:30 +04:00			`return;`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`if (hdr->reqid != state->reqid) {`
			`/* we found a record but it was the wrong one */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, ("Dropped orphaned call reply with reqid:%u\n",hdr->reqid));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`

in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`state->call->reply_data.dptr = c->data;`
			`state->call->reply_data.dsize = c->datalen;`
			`state->call->status = c->status;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
			`talloc_steal(state, c);`

			`state->state = CTDB_CALL_DONE;`
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`}`

fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`/*`
			`called when a CTDB_REPLY_DMASTER packet comes in`

			`This packet comes in from the lmaster response to a CTDB_REQ_CALL`
			`request packet. It means that the current dmaster wants to give us`
			`the dmaster role`
			`*/`
			`void ctdb_reply_dmaster(struct ctdb_context ctdb, struct ctdb_req_header hdr)`
			`{`
			`struct ctdb_reply_dmaster c = (struct ctdb_reply_dmaster )hdr;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_db_context *ctdb_db;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`TDB_DATA key, data;`
- split out ctdb_ltdb_lock_fetch_requeue() into a simpler ctdb_ltdb_lock_requeue() and a small wrapper - use ctdb_ltdb_lock_requeue() to fix a possible hang in ctdb_reply_dmaster(), where the ctdb_ltdb_store() could hang waiting for a client. We now requeue the reply_dmaster packet until we have the lock (This used to be ctdb commit 97cd7aa09ce3abbb5e3e965c5c81668e0c0133a5) 2007-04-19 11:43:27 +04:00			`int ret;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`ctdb_db = find_ctdb_db(ctdb, c->db_id);`
			`if (ctdb_db == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Unknown db_id 0x%x in ctdb_reply_dmaster\n", c->db_id));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`key.dptr = c->data;`
			`key.dsize = c->keylen;`
			`data.dptr = &c->data[key.dsize];`
			`data.dsize = c->datalen;`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`ret = ctdb_ltdb_lock_requeue(ctdb_db, key, hdr,`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`ctdb_call_input_pkt, ctdb, False);`
- split out ctdb_ltdb_lock_fetch_requeue() into a simpler ctdb_ltdb_lock_requeue() and a small wrapper - use ctdb_ltdb_lock_requeue() to fix a possible hang in ctdb_reply_dmaster(), where the ctdb_ltdb_store() could hang waiting for a client. We now requeue the reply_dmaster packet until we have the lock (This used to be ctdb commit 97cd7aa09ce3abbb5e3e965c5c81668e0c0133a5) 2007-04-19 11:43:27 +04:00			`if (ret == -2) {`
			`return;`
			`}`
			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to get lock in ctdb_reply_dmaster\n"));`
- split out ctdb_ltdb_lock_fetch_requeue() into a simpler ctdb_ltdb_lock_requeue() and a small wrapper - use ctdb_ltdb_lock_requeue() to fix a possible hang in ctdb_reply_dmaster(), where the ctdb_ltdb_store() could hang waiting for a client. We now requeue the reply_dmaster packet until we have the lock (This used to be ctdb commit 97cd7aa09ce3abbb5e3e965c5c81668e0c0133a5) 2007-04-19 11:43:27 +04:00			`return;`
			`}`

pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`ctdb_become_dmaster(ctdb_db, hdr, key, data, c->rsn);`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`

added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
			`/*`
			`called when a CTDB_REPLY_ERROR packet comes in`
			`*/`
			`void ctdb_reply_error(struct ctdb_context ctdb, struct ctdb_req_header hdr)`
			`{`
			`struct ctdb_reply_error c = (struct ctdb_reply_error )hdr;`
			`struct ctdb_call_state *state;`

split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`state = ctdb_reqid_find(ctdb, hdr->reqid, struct ctdb_call_state);`
			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("pnn %u Invalid reqid %u in ctdb_reply_error\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, hdr->reqid));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`

			`if (hdr->reqid != state->reqid) {`
			`/* we found a record but it was the wrong one */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, ("Dropped orphaned error reply with reqid:%u\n",hdr->reqid));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
			`talloc_steal(state, c);`

			`state->state = CTDB_CALL_ERROR;`
			`state->errmsg = (char *)c->msg;`
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`}`

added redirect handling (This used to be ctdb commit 3c1dc8b98c8e843c44a172ac15e67f4ab8c47500) 2006-12-18 06:44:06 +03:00
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`/*`
			`destroy a ctdb_call`
			`*/`
			`static int ctdb_call_destructor(struct ctdb_call_state *state)`
			`{`
a better way to resend calls after recovery (This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4) 2007-05-18 18:56:49 +04:00			`DLIST_REMOVE(state->ctdb_db->ctdb->pending_calls, state);`
removed unnecessary variable (This used to be ctdb commit ef0027faa631b00c7fc1a7c4538fbf3080248f0b) 2007-04-28 20:55:37 +04:00			`ctdb_reqid_remove(state->ctdb_db->ctdb, state->reqid);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`return 0;`
			`}`

expanded some comments (This used to be ctdb commit cf544e986d5837cc878792af571bdb23cc487882) 2006-12-18 06:49:32 +03:00
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`/*`
a better way to resend calls after recovery (This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4) 2007-05-18 18:56:49 +04:00			`called when a ctdb_call needs to be resent after a reconfigure event`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`*/`
a better way to resend calls after recovery (This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4) 2007-05-18 18:56:49 +04:00			`static void ctdb_call_resend(struct ctdb_call_state *state)`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`{`
better timeout handling for calls, controls and traverses (This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe) 2007-05-10 08:06:48 +04:00			`struct ctdb_context *ctdb = state->ctdb_db->ctdb;`

			`state->generation = ctdb->vnn_map->generation;`

			`/* use a new reqid, in case the old reply does eventually come in */`
			`ctdb_reqid_remove(ctdb, state->reqid);`
			`state->reqid = ctdb_reqid_new(ctdb, state);`
			`state->c->hdr.reqid = state->reqid;`

- got rid of the complex hand marshalling in the recovery controls - fixed the re-send of ctdb calls after a generation change - fixed a reqid idr leak in controls - removed the write_record test code - use the new nonblock lockall code to prevent ctdbd from ever doing a blocking lock that could deadlock with smbd - moved more of the recovery controls into ctdb_recover.c (This used to be ctdb commit 565a21aa4f1e842309986ab97d6244801153deec) 2007-05-10 11:43:45 +04:00			`/* update the generation count for this request, so its valid with the new vnn_map */`
			`state->c->hdr.generation = state->generation;`

better timeout handling for calls, controls and traverses (This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe) 2007-05-10 08:06:48 +04:00			`/* send the packet to ourselves, it will be redirected appropriately */`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`state->c->hdr.destnode = ctdb->pnn;`
better timeout handling for calls, controls and traverses (This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe) 2007-05-10 08:06:48 +04:00
			`ctdb_queue_packet(ctdb, &state->c->hdr);`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_NOTICE,("resent ctdb_call\n"));`
a better way to resend calls after recovery (This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4) 2007-05-18 18:56:49 +04:00			`}`

			`/*`
			`resend all pending calls on recovery`
			`*/`
			`void ctdb_call_resend_all(struct ctdb_context *ctdb)`
			`{`
			`struct ctdb_call_state state, next;`
			`for (state=ctdb->pending_calls;state;state=next) {`
			`next = state->next;`
			`ctdb_call_resend(state);`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`}`

initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`/*`
			`this allows the caller to setup a async.fn`
			`*/`
			`static void call_local_trigger(struct event_context ev, struct timed_event te,`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`struct timeval t, void *private_data)`
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`{`
private -> private_data for samba3 (This used to be ctdb commit 080b6901173afb2ad618dd0621876ff478c7d6e5) 2007-04-13 14:38:24 +04:00			`struct ctdb_call_state *state = talloc_get_type(private_data, struct ctdb_call_state);`
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
			`}`


- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`/*`
expanded some comments (This used to be ctdb commit cf544e986d5837cc878792af571bdb23cc487882) 2006-12-18 06:49:32 +03:00			`construct an event driven local ctdb_call`

			`this is used so that locally processed ctdb_call requests are processed`
			`in an event driven manner`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`*/`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_call_state ctdb_call_local_send(struct ctdb_db_context ctdb_db,`
simplified ctdb_call() interface, and made it easier to expand with more parameters later (This used to be ctdb commit 6c816fe85e84faad167101bcf26850966c3044e5) 2007-01-25 08:13:17 +03:00			`struct ctdb_call *call,`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`struct ctdb_ltdb_header *header,`
			`TDB_DATA *data)`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`{`
			`struct ctdb_call_state *state;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`int ret;`

Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`state = talloc_zero(ctdb_db, struct ctdb_call_state);`
Provide an alternative CTDB_NO_MEMORY_NULL() for functions which return a pointer (This used to be ctdb commit 51c79e19df777fb53a5c210efc1c9d3159059de3) 2006-12-01 12:26:21 +03:00			`CTDB_NO_MEMORY_NULL(ctdb, state);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
another memory leak (This used to be ctdb commit 10466fe11da71c93fa764bea2b3e1e741c113f9c) 2007-04-07 04:58:14 +04:00			`talloc_steal(state, data->dptr);`

- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`state->state = CTDB_CALL_DONE;`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`state->call = talloc(state, struct ctdb_call);`
			`CTDB_NO_MEMORY_NULL(ctdb, state->call);`
			`(state->call) = call;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`state->ctdb_db = ctdb_db;`
fix a bug in new structure handling (This used to be ctdb commit 5f248d82717c8094f260ea16292996bb712df947) 2007-01-29 14:11:16 +03:00
Remove LACOUNT and LACCESSOR and migrate the records immediately. This concept didnt work out and it is really just as expensive as a full migration anyway, without the benefit of caching the data for subsequence accesses. Now, migrate the records immediately on first access. This will be combined with a "cheap vacuum-lite" for special empty records to prevent growth of databases. Later extensions to mimic read-only behaviour of records will include proper shared read-only locking of database records, making the laccessor/lacount read-only access to the data obsolete anyway. By removing this special case and handling of lacount laccessor makes the codapath where shared read-only locking will be be implemented simpler, and frees up space in the ctdb_ltdb header for use by vacuuming flags as well as read-only locking flags. (This used to be ctdb commit 155dd1f4885fe142c6f8bd09430f65daf8a17e51) 2010-11-29 05:07:59 +03:00			`ret = ctdb_call_local(ctdb_db, state->call, header, state, data);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`event_add_timed(ctdb->ev, state, timeval_zero(), call_local_trigger, state);`

- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`return state;`
			`}`


			`/*`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`make a remote ctdb call - async send. Called in daemon context.`
expanded some comments (This used to be ctdb commit cf544e986d5837cc878792af571bdb23cc487882) 2006-12-18 06:49:32 +03:00
			`This constructs a ctdb_call request and queues it for processing.`
			`This call never blocks.`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`*/`
- send the record header from the client to the daemon when doing a fetch, to avoid the daemon re-reading it - suffix the database name with the node name so that testing on loopback doesn't result in a name collision in the database open (This used to be ctdb commit ad30a4db75450643ff146c40faa306a021de3dd2) 2007-04-17 10:20:32 +04:00			`struct ctdb_call_state ctdb_daemon_call_send_remote(struct ctdb_db_context ctdb_db,`
			`struct ctdb_call *call,`
			`struct ctdb_ltdb_header *header)`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`{`
			`uint32_t len;`
			`struct ctdb_call_state *state;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00
dont even try to allocate a packet if the transport is down since it will fail (This used to be ctdb commit a73f316cb9cec877dc0bc3f7baa21be1b1454273) 2009-06-30 05:55:42 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed send packet. Transport is down\n"));`
dont even try to allocate a packet if the transport is down since it will fail (This used to be ctdb commit a73f316cb9cec877dc0bc3f7baa21be1b1454273) 2009-06-30 05:55:42 +04:00			`return NULL;`
			`}`

Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`state = talloc_zero(ctdb_db, struct ctdb_call_state);`
Provide an alternative CTDB_NO_MEMORY_NULL() for functions which return a pointer (This used to be ctdb commit 51c79e19df777fb53a5c210efc1c9d3159059de3) 2006-12-01 12:26:21 +03:00			`CTDB_NO_MEMORY_NULL(ctdb, state);`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`state->call = talloc(state, struct ctdb_call);`
			`CTDB_NO_MEMORY_NULL(ctdb, state->call);`

split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`state->reqid = ctdb_reqid_new(ctdb, state);`
- got rid of the complex hand marshalling in the recovery controls - fixed the re-send of ctdb calls after a generation change - fixed a reqid idr leak in controls - removed the write_record test code - use the new nonblock lockall code to prevent ctdbd from ever doing a blocking lock that could deadlock with smbd - moved more of the recovery controls into ctdb_recover.c (This used to be ctdb commit 565a21aa4f1e842309986ab97d6244801153deec) 2007-05-10 11:43:45 +04:00			`state->ctdb_db = ctdb_db;`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`talloc_set_destructor(state, ctdb_call_destructor);`
- added in idtree for efficient reqid handling - started adding ctdb_call() code - added ctdb_call_local() implementation (This used to be ctdb commit 97b1fdf7fa0e230f36add3f1770ecb3a9faee0a1) 2006-11-28 12:48:34 +03:00
simplified ctdb_call() interface, and made it easier to expand with more parameters later (This used to be ctdb commit 6c816fe85e84faad167101bcf26850966c3044e5) 2007-01-25 08:13:17 +03:00			`len = offsetof(struct ctdb_req_call, data) + call->key.dsize + call->call_data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`state->c = ctdb_transport_allocate(ctdb, state, CTDB_REQ_CALL, len,`
			`struct ctdb_req_call);`
Provide an alternative CTDB_NO_MEMORY_NULL() for functions which return a pointer (This used to be ctdb commit 51c79e19df777fb53a5c210efc1c9d3159059de3) 2006-12-01 12:26:21 +03:00			`CTDB_NO_MEMORY_NULL(ctdb, state->c);`
- send the record header from the client to the daemon when doing a fetch, to avoid the daemon re-reading it - suffix the database name with the node name so that testing on loopback doesn't result in a name collision in the database open (This used to be ctdb commit ad30a4db75450643ff146c40faa306a021de3dd2) 2007-04-17 10:20:32 +04:00			`state->c->hdr.destnode = header->dmaster;`
added status all and debug all control operations (This used to be ctdb commit 7f902f6c4270adc0543096c78415d335b88d6232) 2007-04-28 19:13:30 +04:00
- added in idtree for efficient reqid handling - started adding ctdb_call() code - added ctdb_call_local() implementation (This used to be ctdb commit 97b1fdf7fa0e230f36add3f1770ecb3a9faee0a1) 2006-11-28 12:48:34 +03:00			`/* this limits us to 16k outstanding messages - not unreasonable */`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`state->c->hdr.reqid = state->reqid;`
first test of forced migration of records. compiles but not tested. (This used to be ctdb commit ac6ac290e79446f52caf31f429b4c38668c27eda) 2007-04-04 15:15:56 +04:00			`state->c->flags = call->flags;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`state->c->db_id = ctdb_db->db_id;`
simplified ctdb_call() interface, and made it easier to expand with more parameters later (This used to be ctdb commit 6c816fe85e84faad167101bcf26850966c3044e5) 2007-01-25 08:13:17 +03:00			`state->c->callid = call->call_id;`
added a hopcount in ctdb_call (This used to be ctdb commit 36d838801a2a2008c50322cdbfff65a308b1cd1a) 2007-05-01 07:25:02 +04:00			`state->c->hopcount = 0;`
simplified ctdb_call() interface, and made it easier to expand with more parameters later (This used to be ctdb commit 6c816fe85e84faad167101bcf26850966c3044e5) 2007-01-25 08:13:17 +03:00			`state->c->keylen = call->key.dsize;`
			`state->c->calldatalen = call->call_data.dsize;`
			`memcpy(&state->c->data[0], call->key.dptr, call->key.dsize);`
			`memcpy(&state->c->data[call->key.dsize],`
			`call->call_data.dptr, call->call_data.dsize);`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`(state->call) = call;`
			`state->call->call_data.dptr = &state->c->data[call->key.dsize];`
			`state->call->key.dptr = &state->c->data[0];`
- added in idtree for efficient reqid handling - started adding ctdb_call() code - added ctdb_call_local() implementation (This used to be ctdb commit 97b1fdf7fa0e230f36add3f1770ecb3a9faee0a1) 2006-11-28 12:48:34 +03:00
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`state->state = CTDB_CALL_WAIT;`
better timeout handling for calls, controls and traverses (This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe) 2007-05-10 08:06:48 +04:00			`state->generation = ctdb->vnn_map->generation;`
- added in idtree for efficient reqid handling - started adding ctdb_call() code - added ctdb_call_local() implementation (This used to be ctdb commit 97b1fdf7fa0e230f36add3f1770ecb3a9faee0a1) 2006-11-28 12:48:34 +03:00
a better way to resend calls after recovery (This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4) 2007-05-18 18:56:49 +04:00			`DLIST_ADD(ctdb->pending_calls, state);`

wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_queue_packet(ctdb, &state->c->hdr);`
- added in idtree for efficient reqid handling - started adding ctdb_call() code - added ctdb_call_local() implementation (This used to be ctdb commit 97b1fdf7fa0e230f36add3f1770ecb3a9faee0a1) 2006-11-28 12:48:34 +03:00
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`return state;`
			`}`

			`/*`
- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00			`make a remote ctdb call - async recv - called in daemon context`
expanded some comments (This used to be ctdb commit cf544e986d5837cc878792af571bdb23cc487882) 2006-12-18 06:49:32 +03:00
			`This is called when the program wants to wait for a ctdb_call to complete and get the`
			`results. This call will block unless the call has already completed.`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`*/`
- removed the non-daemon mode from ctdb, in order to simplify the code. It may be added back later once everything is working nicely, or simulated using a in-process pipe instead of a unix domain socket - rewrote the ctdb_fetch_lock() code to follow the new design (This used to be ctdb commit 5024dd1f305fe1ecc262db2240c56f773b4f28f0) 2007-04-17 08:52:51 +04:00			`int ctdb_daemon_call_recv(struct ctdb_call_state state, struct ctdb_call call)`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`{`
merge fetch code from ronnie, and add a simple fetch test (This used to be ctdb commit 83b794befd8d34b3da544a483f9d39a3fa140655) 2007-04-05 07:18:31 +04:00			`while (state->state < CTDB_CALL_DONE) {`
removed unnecessary variable (This used to be ctdb commit ef0027faa631b00c7fc1a7c4538fbf3080248f0b) 2007-04-28 20:55:37 +04:00			`event_loop_once(state->ctdb_db->ctdb->ev);`
merge fetch code from ronnie, and add a simple fetch test (This used to be ctdb commit 83b794befd8d34b3da544a483f9d39a3fa140655) 2007-04-05 07:18:31 +04:00			`}`
			`if (state->state != CTDB_CALL_DONE) {`
removed unnecessary variable (This used to be ctdb commit ef0027faa631b00c7fc1a7c4538fbf3080248f0b) 2007-04-28 20:55:37 +04:00			`ctdb_set_error(state->ctdb_db->ctdb, "%s", state->errmsg);`
merge fetch code from ronnie, and add a simple fetch test (This used to be ctdb commit 83b794befd8d34b3da544a483f9d39a3fa140655) 2007-04-05 07:18:31 +04:00			`talloc_free(state);`
			`return -1;`
			`}`

in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`if (state->call->reply_data.dsize) {`
fix a memory leak allocate the memory to the 'call' context and not off the 'ctdb' context (This used to be ctdb commit be89005bd5d13409e377d425db2aad1c0d5b3826) 2008-03-25 03:11:13 +03:00			`call->reply_data.dptr = talloc_memdup(call,`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`state->call->reply_data.dptr,`
			`state->call->reply_data.dsize);`
			`call->reply_data.dsize = state->call->reply_data.dsize;`
merge status code changes from samba4 ctdb (This used to be ctdb commit 705a9f8e5238976aa5c8cd4a5371459650d8b553) 2007-01-29 14:30:06 +03:00			`} else {`
			`call->reply_data.dptr = NULL;`
			`call->reply_data.dsize = 0;`
			`}`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`call->status = state->call->status;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`talloc_free(state);`
			`return 0;`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`}`

- add --daemon flag to ctdb_fetch test code - split client specific routines out of ctdb_daemon.c - use ctdb_queue code in message send from client to daemon - use clearer names in client/daemon functions - use talloc autofree context to avoid global for unlink of socket on exit - start on API change for message handler, to allow ctdb messaging to handle daemon mode with multiple clients (This used to be ctdb commit 53555db45f3583ae4a32cc3aa9e07fb8ef2a77e3) 2007-04-11 05:01:42 +04:00
add dead node detection so that if a node does not generate any keepalive traffic for x seconds it is deemed dead this triggers a recovery after a while if a ctdbd has been STOPPED but it doesnt recover automatically when the node reappears (This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a) 2007-05-18 13:19:35 +04:00			`/*`
			`send a keepalive packet to the other node`
			`*/`
- up rx_cnt on all packet types - notice when a node becomes available again (This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a) 2007-05-18 17:23:36 +04:00			`void ctdb_send_keepalive(struct ctdb_context *ctdb, uint32_t destnode)`
add dead node detection so that if a node does not generate any keepalive traffic for x seconds it is deemed dead this triggers a recovery after a while if a ctdbd has been STOPPED but it doesnt recover automatically when the node reappears (This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a) 2007-05-18 13:19:35 +04:00			`{`
			`struct ctdb_req_keepalive *r;`

dont try sending a keepalive if the transport is down (This used to be ctdb commit 5cdc04669db8c2ddbbff5af82307a16e8d807b83) 2009-06-30 06:17:05 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed to send keepalive. Transport is DOWN\n"));`
dont try sending a keepalive if the transport is down (This used to be ctdb commit 5cdc04669db8c2ddbbff5af82307a16e8d807b83) 2009-06-30 06:17:05 +04:00			`return;`
			`}`

- up rx_cnt on all packet types - notice when a node becomes available again (This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a) 2007-05-18 17:23:36 +04:00			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REQ_KEEPALIVE,`
add dead node detection so that if a node does not generate any keepalive traffic for x seconds it is deemed dead this triggers a recovery after a while if a ctdbd has been STOPPED but it doesnt recover automatically when the node reappears (This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a) 2007-05-18 13:19:35 +04:00			`sizeof(struct ctdb_req_keepalive),`
			`struct ctdb_req_keepalive);`
			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
			`r->hdr.destnode = destnode;`
			`r->hdr.reqid = 0;`

Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, keepalive_packets_sent);`
add dead node detection so that if a node does not generate any keepalive traffic for x seconds it is deemed dead this triggers a recovery after a while if a ctdbd has been STOPPED but it doesnt recover automatically when the node reappears (This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a) 2007-05-18 13:19:35 +04:00
			`ctdb_queue_packet(ctdb, &r->hdr);`

			`talloc_free(r);`
			`}`

858 lines 25 KiB C Raw Normal View History Unescape Escape

858 lines

25 KiB

C

Raw Normal View History