samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-23 17:34:34 +03:00

2075 lines

56 KiB

C

- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`/*`
			`ctdb_call protocol code`

			`Copyright (C) Andrew Tridgell 2006`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`

			`This program is distributed in the hope that it will be useful,`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`

			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`*/`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`/*`
			`see http://wiki.samba.org/index.php/Samba_%26_Clustering for`
			`protocol design and packet details`
			`*/`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00			`#include "replace.h"`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`#include "system/network.h"`
			`#include "system/filesys.h"`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00
			`#include <talloc.h>`
			`#include <tevent.h>`

			`#include "lib/util/dlinklist.h"`
			`#include "lib/util/debug.h"`
			`#include "lib/util/samba_util.h"`
ctdb-common: Drop CTDB's copy of sys_read() and sys_write() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Nov 29 11:22:40 CET 2016 on sn-devel-144 2016-11-29 04:55:06 +03:00			`#include "lib/util/sys_rw.h"`
ctdb: Use prctl_set_comment from lib/util Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-09-24 02:10:59 +03:00			`#include "lib/util/util_process.h"`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00
			`#include "ctdb_private.h"`
			`#include "ctdb_client.h"`

			`#include "common/rb_tree.h"`
ctdb-daemon: Use reqid abstraction Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-03-17 06:30:18 +03:00			`#include "common/reqid.h"`
ctdb-daemon: Separate prototypes for system specific functions This groups function prototypes for system specific functions in common/system.h and removes them from ctdb_private.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-23 06:11:53 +03:00			`#include "common/system.h"`
ctdb-daemon: Separate prototypes for common client/server functions This groups function prototypes for common client/server functions in common/common.h and removes them from ctdb_private.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-23 06:17:34 +03:00			`#include "common/common.h"`
ctdb-server: Replace ctdb_logging.h with common/logging.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Michael Adam <obnox@samba.org> 2015-11-11 07:41:10 +03:00			`#include "common/logging.h"`
ctdb-daemon: Add tracking of migration records Instead of using hopcount as a metric for hot records, use the number of migrations per second as a metric. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Apr 5 08:35:45 CEST 2017 on sn-devel-144 2017-03-21 08:48:45 +03:00			`#include "common/hash_count.h"`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
			`struct ctdb_sticky_record {`
			`struct ctdb_context *ctdb;`
			`struct ctdb_db_context *ctdb_db;`
			`TDB_CONTEXT *pindown;`
			`};`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`/*`
			`find the ctdb_db from a db index`
			`*/`
			`struct ctdb_db_context *find_ctdb_db(struct ctdb_context *ctdb, uint32_t id)`
			`{`
			`struct ctdb_db_context *ctdb_db;`

			`for (ctdb_db=ctdb->db_list; ctdb_db; ctdb_db=ctdb_db->next) {`
			`if (ctdb_db->db_id == id) {`
			`break;`
			`}`
			`}`
			`return ctdb_db;`
			`}`
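
The lookup above relies on the loop variable being NULL once the list is exhausted, so "not found" needs no separate return path. A minimal sketch of that idiom, assuming a hypothetical `struct demo_db` that stands in for `struct ctdb_db_context` and keeps only the two fields the loop touches:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for struct ctdb_db_context (illustration only). */
struct demo_db {
	struct demo_db *next;
	uint32_t db_id;
};

/*
 * Walk a singly linked list looking for a matching id.
 * When no element matches, the loop ends with db == NULL,
 * so the same return statement covers both outcomes.
 */
static struct demo_db *demo_find_db(struct demo_db *list, uint32_t id)
{
	struct demo_db *db;

	for (db = list; db != NULL; db = db->next) {
		if (db->db_id == id) {
			break;
		}
	}
	return db;
}
```

Callers of such a helper have to check the result for NULL before dereferencing it.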

make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`/*`
ctdb/server/ctdb_call.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> 2019-10-26 03:41:08 +03:00			`a variant of input packet that can be used in lock requeue`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`*/`
clean out some more cruft (This used to be ctdb commit ad16c5fe2748b48a6f6c79976359d56d9bed33f4) 2007-06-05 11:57:07 +04:00			`static void ctdb_call_input_pkt(void *p, struct ctdb_req_header *hdr)`
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait (This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520) 2007-05-19 07:45:24 +04:00			`{`
			`struct ctdb_context *ctdb = talloc_get_type(p, struct ctdb_context);`
			`ctdb_input_pkt(ctdb, hdr);`
			`}`


next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`/*`
			`send an error reply`
			`*/`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`static void ctdb_send_error(struct ctdb_context *ctdb,`
			`struct ctdb_req_header *hdr, uint32_t status,`
			`const char *fmt, ...) PRINTF_ATTRIBUTE(4,5);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`static void ctdb_send_error(struct ctdb_context *ctdb,`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`struct ctdb_req_header *hdr, uint32_t status,`
			`const char *fmt, ...)`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`{`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`va_list ap;`
ctdb-daemon: Rename struct ctdb_reply_error to ctdb_reply_error_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:30:31 +03:00			`struct ctdb_reply_error_old *r;`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`char *msg;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`int msglen, len;`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
dont try to send error packets if the transport is down (This used to be ctdb commit 65b94d280731df3245b26d69f39acfaf5bccf0d8) 2009-06-30 06:10:27 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed to send error. Transport is DOWN\n"));`
if we fail a dmaster migration due to the transport being down, then that is a fatal condition. (This used to be ctdb commit 75dea671f68ac6649095357c36b3697a927721e9) 2009-06-30 06:13:15 +04:00			`return;`
dont try to send error packets if the transport is down (This used to be ctdb commit 65b94d280731df3245b26d69f39acfaf5bccf0d8) 2009-06-30 06:10:27 +04:00			`}`

added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`va_start(ap, fmt);`
			`msg = talloc_vasprintf(ctdb, fmt, ap);`
			`if (msg == NULL) {`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_fatal(ctdb, "Unable to allocate error in ctdb_send_error\n");`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`}`
			`va_end(ap);`

merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`msglen = strlen(msg)+1;`
ctdb-daemon: Rename struct ctdb_reply_error to ctdb_reply_error_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:30:31 +03:00			`len = offsetof(struct ctdb_reply_error_old, msg);`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, msg, CTDB_REPLY_ERROR, len + msglen,`
ctdb-daemon: Rename struct ctdb_reply_error to ctdb_reply_error_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:30:31 +03:00			`struct ctdb_reply_error_old);`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`r->hdr.destnode = hdr->srcnode;`
			`r->hdr.reqid = hdr->reqid;`
			`r->status = status;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`r->msglen = msglen;`
			`memcpy(&r->msg[0], msg, msglen);`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_queue_packet(ctdb, &r->hdr);`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
- make he packet allocation routines take a mem_ctx, which allows us to put memory directly in the right context, avoiding quite a few talloc_steal calls, and simplifying the code - make the fetch lock code in the daemon fully async (This used to be ctdb commit d98b4b4fcadad614861c0d44a3854d97b01d0f74) 2007-04-19 04:37:44 +04:00			`talloc_free(msg);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`}`
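
The error reply above is sized as `offsetof(..., msg)` plus the message length including its trailing NUL, and the text is then copied into the variable-length tail. The following sketch shows that sizing pattern in isolation. The struct `demo_reply_error` and the helper `demo_pack_error` are hypothetical simplifications (the real packet carries a full `ctdb_req_header` and is allocated with talloc through `ctdb_transport_allocate`), and plain `calloc` is used here only to keep the example free of dependencies.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, simplified error packet (not the real wire format). */
struct demo_reply_error {
	uint32_t length;  /* total packet size in bytes */
	uint32_t status;
	uint32_t msglen;  /* message length including the trailing NUL */
	uint8_t  msg[];   /* variable-length tail */
};

/* Build an error packet; the caller releases it with free(). */
static struct demo_reply_error *demo_pack_error(uint32_t status,
						const char *text)
{
	size_t msglen = strlen(text) + 1;
	size_t len = offsetof(struct demo_reply_error, msg) + msglen;
	struct demo_reply_error *r = calloc(1, len);

	if (r == NULL) {
		return NULL;
	}
	r->length = (uint32_t)len;
	r->status = status;
	r->msglen = (uint32_t)msglen;
	memcpy(r->msg, text, msglen);
	return r;
}
```

Using `offsetof` rather than `sizeof` for the fixed part means the size calculation stays correct even if the compiler adds padding after the last fixed field.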

added redirect handling (This used to be ctdb commit 3c1dc8b98c8e843c44a172ac15e67f4ab8c47500) 2006-12-18 06:44:06 +03:00
server: add a comment explaining the call redirect logic in ctdb_call_send_redirect(). (This used to be ctdb commit 81663b81687c0ba681500cca6aa8174bb9587ad2) 2010-11-24 10:01:01 +03:00			`/**`
			`* send a redirect reply`
			`*`
			`* The logic behind this function is this:`
			`*`
			`* A client wants to grab a record and sends a CTDB_REQ_CALL packet`
			`* to its local ctdb (ctdb_request_call). If the node is not itself`
			`* the record's DMASTER, it first redirects the packet to the`
			`* record's LMASTER. The LMASTER then redirects the call packet to`
ctdbd: update comment describing ctdb_call_send_redirect() Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 9a21d417c51fb9cad8f2e87e00ca54d379aef860) 2013-05-17 13:00:32 +04:00			`* the current DMASTER. Note that this works because of this: When`
server: add a comment explaining the call redirect logic in ctdb_call_send_redirect(). (This used to be ctdb commit 81663b81687c0ba681500cca6aa8174bb9587ad2) 2010-11-24 10:01:01 +03:00			`* a record is migrated off a node, then the new DMASTER is stored`
			`* in the record's copy on the former DMASTER.`
			`*/`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`static void ctdb_call_send_redirect(struct ctdb_context *ctdb,`
			`struct ctdb_db_context *ctdb_db,`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`TDB_DATA key,`
ctdb-daemon: Rename struct ctdb_req_call to ctdb_req_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:26:29 +03:00			`struct ctdb_req_call_old *c,`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`struct ctdb_ltdb_header *header)`
			`{`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`uint32_t lmaster = ctdb_lmaster(ctdb, &key);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
			`c->hdr.destnode = lmaster;`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (ctdb->pnn == lmaster) {`
much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`c->hdr.destnode = header->dmaster;`
			`}`
added a hopcount in ctdb_call (This used to be ctdb commit 36d838801a2a2008c50322cdbfff65a308b1cd1a) 2007-05-01 07:25:02 +04:00			`c->hopcount++;`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
ctdbd: Improve high hopcount log messages when request is redirected Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 9cde47e1a5bf1b9ca3b4da8c2db94caac2b1aa5e) 2013-07-15 11:34:31 +04:00			`if (c->hopcount%100 > 95) {`
			`DEBUG(DEBUG_WARNING,("High hopcount %d dbid:%s "`
			`"key:0x%08x reqid=%08x pnn:%d src:%d lmaster:%d "`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`"header->dmaster:%d dst:%d\n",`
ctdbd: Improve high hopcount log messages when request is redirected Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 9cde47e1a5bf1b9ca3b4da8c2db94caac2b1aa5e) 2013-07-15 11:34:31 +04:00			`c->hopcount, ctdb_db->db_name, ctdb_hash(&key),`
			`c->hdr.reqid, ctdb->pnn, c->hdr.srcnode, lmaster,`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`header->dmaster, c->hdr.destnode));`
			`}`

much simpler redirect logic (This used to be ctdb commit 95db9afa7dd039e1700e2f3078782f6ac66e9f51) 2007-04-28 20:18:33 +04:00			`ctdb_queue_packet(ctdb, &c->hdr);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`}`
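
The redirect rule described in the comment above reduces to a two-way choice: a node that is not the LMASTER forwards to the LMASTER, and the LMASTER forwards to the DMASTER recorded in its copy of the record. A minimal sketch of that decision and of the hop-count warning condition follows. The helper names `demo_redirect_dest` and `demo_should_warn` are hypothetical, and node numbers are plain integers rather than real PNNs taken from a cluster.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Pick the next hop for a call that cannot be served locally.
 * my_pnn:           this node's number
 * lmaster:          node that tracks the record's location
 * recorded_dmaster: dmaster stored in this node's copy of the record
 */
static uint32_t demo_redirect_dest(uint32_t my_pnn, uint32_t lmaster,
				   uint32_t recorded_dmaster)
{
	if (my_pnn == lmaster) {
		return recorded_dmaster;
	}
	return lmaster;
}

/*
 * Mirror of the hopcount%100 > 95 test in the function above:
 * true for the last four hops of every block of 100, so the
 * log is not flooded on every single redirect.
 */
static int demo_should_warn(uint32_t hopcount)
{
	return (hopcount % 100) > 95;
}
```

With three nodes 0, 1 and 2, where node 2 is the LMASTER and its copy names node 1 as DMASTER, a call arriving at node 0 is sent to node 2 first and from there to node 1.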

- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00
			`/*`
			`send a dmaster reply`

			`caller must have the chainlock before calling this routine. Caller must be`
			`the lmaster`
			`*/`
			`static void ctdb_send_dmaster_reply(struct ctdb_db_context *ctdb_db,`
			`struct ctdb_ltdb_header *header,`
			`TDB_DATA key, TDB_DATA data,`
			`uint32_t new_dmaster,`
			`uint32_t reqid)`
			`{`
			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
ctdb-daemon: Rename struct ctdb_reply_dmaster to ctdb_reply_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:34:01 +03:00			`struct ctdb_reply_dmaster_old *r;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`int ret, len;`
			`TALLOC_CTX *tmp_ctx;`

change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (ctdb->pnn != ctdb_lmaster(ctdb, &key)) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,(__location__ " Caller is not lmaster!\n"));`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`return;`
			`}`

			`header->dmaster = new_dmaster;`
			`ret = ctdb_ltdb_store(ctdb_db, key, header, data);`
			`if (ret != 0) {`
if we fail a dmaster migration due to the transport being down, then that is a fatal condition. (This used to be ctdb commit 75dea671f68ac6649095357c36b3697a927721e9) 2009-06-30 06:13:15 +04:00			`ctdb_fatal(ctdb, "ctdb_send_dmaster_reply unable to update dmaster");`
			`return;`
			`}`

			`if (ctdb->methods == NULL) {`
server:ctdb_send_dmaster_reply: fix a message typo. Michael (This used to be ctdb commit aa63f728152c37e31cecf2258efcdc8cf5ac0092) 2010-01-06 16:59:23 +03:00			`ctdb_fatal(ctdb, "ctdb_send_dmaster_reply cant update dmaster since transport is down");`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`return;`
			`}`

			`/* put the packet on a temporary context, allowing us to safely free`
			`it below even if ctdb_reply_dmaster() has freed it already */`
			`tmp_ctx = talloc_new(ctdb);`

			`/* send the CTDB_REPLY_DMASTER */`
ctdb-daemon: Rename struct ctdb_reply_dmaster to ctdb_reply_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:34:01 +03:00			`len = offsetof(struct ctdb_reply_dmaster_old, data) + key.dsize + data.dsize + sizeof(uint32_t);`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, tmp_ctx, CTDB_REPLY_DMASTER, len,`
ctdb-daemon: Rename struct ctdb_reply_dmaster to ctdb_reply_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:34:01 +03:00			`struct ctdb_reply_dmaster_old);`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`

			`r->hdr.destnode = new_dmaster;`
			`r->hdr.reqid = reqid;`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`r->hdr.generation = ctdb_db->generation;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`r->rsn = header->rsn;`
			`r->keylen = key.dsize;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`r->datalen = data.dsize;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`r->db_id = ctdb_db->db_id;`
			`memcpy(&r->data[0], key.dptr, key.dsize);`
			`memcpy(&r->data[key.dsize], data.dptr, data.dsize);`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`memcpy(&r->data[key.dsize+data.dsize], &header->flags, sizeof(uint32_t));`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00
			`ctdb_queue_packet(ctdb, &r->hdr);`

			`talloc_free(tmp_ctx);`
			`}`
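
The three `memcpy` calls above lay out the reply payload as key bytes, then data bytes, then the 32-bit record flags. The sketch below reproduces that layout on a hypothetical reduced struct, `demo_reply_dmaster`, and adds a matching reader to show how a receiver could locate the flags. Both helpers are illustrations written for this example, not functions from ctdb, and `calloc` replaces the talloc-based allocator.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical reduced packet: only the fields that describe the payload. */
struct demo_reply_dmaster {
	uint32_t keylen;
	uint32_t datalen;
	uint8_t  data[];  /* key bytes | data bytes | uint32_t flags */
};

/* Pack key, data and flags back to back; the caller frees the result. */
static struct demo_reply_dmaster *demo_pack_dmaster(const void *key,
						    uint32_t keylen,
						    const void *data,
						    uint32_t datalen,
						    uint32_t flags)
{
	size_t len = offsetof(struct demo_reply_dmaster, data)
		     + keylen + datalen + sizeof(uint32_t);
	struct demo_reply_dmaster *r = calloc(1, len);

	if (r == NULL) {
		return NULL;
	}
	r->keylen = keylen;
	r->datalen = datalen;
	memcpy(&r->data[0], key, keylen);
	memcpy(&r->data[keylen], data, datalen);
	memcpy(&r->data[keylen + datalen], &flags, sizeof(uint32_t));
	return r;
}

/* Read the flags back; memcpy avoids an unaligned uint32_t load. */
static uint32_t demo_unpack_flags(const struct demo_reply_dmaster *r)
{
	uint32_t flags;

	memcpy(&flags, &r->data[r->keylen + r->datalen], sizeof(flags));
	return flags;
}
```

Because the flags follow a variable number of key and data bytes, their offset is generally not 4-byte aligned, which is why both directions copy them with `memcpy` instead of casting the pointer.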

added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`/*`
			`send a dmaster request (give another node the dmaster for a record)`

			`This is always sent to the lmaster, which ensures that the lmaster`
			`always knows who the dmaster is. The lmaster will then send a`
			`CTDB_REPLY_DMASTER to the new dmaster`
			`*/`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`static void ctdb_call_send_dmaster(struct ctdb_db_context *ctdb_db,`
ctdb-daemon: Rename struct ctdb_req_call to ctdb_req_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:26:29 +03:00			`struct ctdb_req_call_old *c,`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`struct ctdb_ltdb_header *header,`
			`TDB_DATA *key, TDB_DATA *data)`
			`{`
ctdb-daemon: Rename struct ctdb_req_dmaster to ctdb_req_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:32:09 +03:00			`struct ctdb_req_dmaster_old *r;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`int len;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`uint32_t lmaster = ctdb_lmaster(ctdb, key);`

failing a dmaster send due to the transport being down is fatal (This used to be ctdb commit c17dafc79bec25bbb796478c33f503503d382a20) 2009-06-30 06:14:58 +04:00			`if (ctdb->methods == NULL) {`
			`ctdb_fatal(ctdb, "Failed ctdb_call_send_dmaster since transport is down");`
			`return;`
			`}`

server: when we migrate off a record with data, set the MIGRATED_WITH_DATA flag (This used to be ctdb commit f5fb232117886186066ab3430fdd2307cba94960) 2010-12-03 17:21:51 +03:00			`if (data->dsize != 0) {`
			`header->flags |= CTDB_REC_FLAG_MIGRATED_WITH_DATA;`
			`}`

change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (lmaster == ctdb->pnn) {`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`ctdb_send_dmaster_reply(ctdb_db, header, *key, *data,`
			`c->hdr.srcnode, c->hdr.reqid);`
			`return;`
			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
ctdb-daemon: Rename struct ctdb_req_dmaster to ctdb_req_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:32:09 +03:00			`len = offsetof(struct ctdb_req_dmaster_old, data) + key->dsize + data->dsize`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`+ sizeof(uint32_t);`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REQ_DMASTER, len,`
ctdb-daemon: Rename struct ctdb_req_dmaster to ctdb_req_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:32:09 +03:00			`struct ctdb_req_dmaster_old);`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`r->hdr.destnode = lmaster;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`r->hdr.reqid = c->hdr.reqid;`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`r->hdr.generation = ctdb_db->generation;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`r->db_id = c->db_id;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`r->rsn = header->rsn;`
merge fetch code from ronnie, and add a simple fetch test (This used to be ctdb commit 83b794befd8d34b3da544a483f9d39a3fa140655) 2007-04-05 07:18:31 +04:00			`r->dmaster = c->hdr.srcnode;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`r->keylen = key->dsize;`
			`r->datalen = data->dsize;`
			`memcpy(&r->data[0], key->dptr, key->dsize);`
			`memcpy(&r->data[key->dsize], data->dptr, data->dsize);`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`memcpy(&r->data[key->dsize + data->dsize], &header->flags, sizeof(uint32_t));`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`header->dmaster = c->hdr.srcnode;`
Revert "Add a new header flag for "migrated with data" and set this to 1" This reverts commit a8cc35191df1cd4b866897df71d317ce5f198cb5. (This used to be ctdb commit 7c37435fb517a621c45b21a21b4eb15f8bbd3c83) 2010-12-13 06:23:32 +03:00			`if (ctdb_ltdb_store(ctdb_db, key, header, data) != 0) {`
			`ctdb_fatal(ctdb, "Failed to store record in ctdb_call_send_dmaster");`
check for error on ctdb_ltdb_store (This used to be ctdb commit c4a34bac4ad4d2f9699e08074668d25586e3c0da) 2007-05-15 04:16:59 +04:00			`}`
Revert "Add a new header flag for "migrated with data" and set this to 1" This reverts commit a8cc35191df1cd4b866897df71d317ce5f198cb5. (This used to be ctdb commit 7c37435fb517a621c45b21a21b4eb15f8bbd3c83) 2010-12-13 06:23:32 +03:00
- fixed a problem with packets to ourselves. The packets were being processed immediately, but the input routines indirectly assumed they were being called as a new event (for example, a calling routine might queue the packet, then afterwards modify the ltdb record). The solution was to make self packets queue via a zero timeout. - fixed unlinking of the socket in a exit in the lockwait code. Needed an _exit instead of exit so atexit() doesn't trigger - print latency of lockwait delays (This used to be ctdb commit 1b0684b4f6a976f4c5fe54394ac54d121810b298) 2007-04-20 11:58:37 +04:00			`ctdb_queue_packet(ctdb, &r->hdr);`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
			`talloc_free(r);`
			`}`
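The request built above carries a variable-length tail: the key bytes, then the record data, then the 32-bit record flags. The following is a minimal sketch of that layout, not the ctdb code itself. The struct `demo_dmaster_pkt` and the `demo_pkt_*` helpers are hypothetical names introduced only for illustration, and the real packet has additional header fields.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a dmaster packet; not the real ctdb struct. */
struct demo_dmaster_pkt {
	uint32_t keylen;
	uint32_t datalen;
	uint8_t data[];		/* key bytes, then data bytes, then 32-bit flags */
};

/* Total size: fixed part up to data[], plus key, data and trailing flags. */
size_t demo_pkt_len(size_t keylen, size_t datalen)
{
	return offsetof(struct demo_dmaster_pkt, data)
		+ keylen + datalen + sizeof(uint32_t);
}

/* Pack key, data and flags back to back; p must hold demo_pkt_len() bytes. */
void demo_pkt_pack(struct demo_dmaster_pkt *p,
		   const void *key, size_t keylen,
		   const void *data, size_t datalen,
		   uint32_t flags)
{
	p->keylen = (uint32_t)keylen;
	p->datalen = (uint32_t)datalen;
	memcpy(&p->data[0], key, keylen);
	memcpy(&p->data[keylen], data, datalen);
	memcpy(&p->data[keylen + datalen], &flags, sizeof(flags));
}

/* Read the flags back from just past the key and data. */
uint32_t demo_pkt_flags(const struct demo_dmaster_pkt *p)
{
	uint32_t flags;

	memcpy(&flags, &p->data[p->keylen + p->datalen], sizeof(flags));
	return flags;
}
```

Copying the flags with `memcpy` rather than through a cast avoids an unaligned 32-bit access when the key and data lengths do not add up to a multiple of four.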

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`static void ctdb_sticky_pindown_timeout(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval t, void *private_data)`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`{`
			`struct ctdb_sticky_record *sr = talloc_get_type(private_data,`
			`struct ctdb_sticky_record);`

			`DEBUG(DEBUG_ERR,("Pindown timeout db:%s unstick record\n", sr->ctdb_db->db_name));`
			`if (sr->pindown != NULL) {`
			`talloc_free(sr->pindown);`
			`sr->pindown = NULL;`
			`}`
			`}`

			`static int`
			`ctdb_set_sticky_pindown(struct ctdb_context *ctdb, struct ctdb_db_context *ctdb_db, TDB_DATA key)`
			`{`
			`TALLOC_CTX *tmp_ctx = talloc_new(NULL);`
			`uint32_t *k;`
			`struct ctdb_sticky_record *sr;`

ctdb-daemon: Remove duplicate code with refactored function Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 07:33:24 +04:00			`k = ctdb_key_to_idkey(tmp_ctx, key);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`if (k == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate key for sticky record\n"));`
			`talloc_free(tmp_ctx);`
			`return -1;`
			`}`

			`sr = trbt_lookuparray32(ctdb_db->sticky_records, k[0], &k[0]);`
			`if (sr == NULL) {`
			`talloc_free(tmp_ctx);`
			`return 0;`
			`}`

			`talloc_free(tmp_ctx);`

			`if (sr->pindown == NULL) {`
			`DEBUG(DEBUG_ERR,("Pinning down record in %s for %d ms\n", ctdb_db->db_name, ctdb->tunable.sticky_pindown));`
			`sr->pindown = talloc_new(sr);`
			`if (sr->pindown == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate pindown context for sticky record\n"));`
			`return -1;`
			`}`
ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, sr->pindown,`
			`timeval_current_ofs(ctdb->tunable.sticky_pindown / 1000,`
			`(ctdb->tunable.sticky_pindown * 1000) % 1000000),`
			`ctdb_sticky_pindown_timeout, sr);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`}`

			`return 0;`
			`}`
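The timer above splits a pindown interval given in milliseconds into a seconds part and a microseconds part. A minimal sketch of that arithmetic follows, assuming the tunable is a 32-bit unsigned millisecond count. `demo_ofs` and `demo_ms_to_ofs` are hypothetical names. The sketch uses `(ms % 1000) * 1000`, which gives the same result as `(ms * 1000) % 1000000` whenever the multiplication does not overflow, and stays correct for larger values.

```c
#include <stdint.h>

/* Hypothetical holder for a relative timer offset. */
struct demo_ofs {
	uint32_t secs;
	uint32_t usecs;
};

/* Split a millisecond interval into whole seconds and leftover microseconds. */
struct demo_ofs demo_ms_to_ofs(uint32_t ms)
{
	struct demo_ofs o;

	o.secs = ms / 1000;
	o.usecs = (ms % 1000) * 1000;
	return o;
}
```

For example, an interval of 1500 ms becomes 1 second plus 500000 microseconds.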

fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`/*`
			`called when a CTDB_REPLY_DMASTER packet comes in, or when the lmaster`
			`gets a CTDB_REQ_DMASTER for itself. We become the dmaster.`

			`must be called with the chainlock held. This function releases the chainlock`
			`*/`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`static void ctdb_become_dmaster(struct ctdb_db_context *ctdb_db,`
pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`struct ctdb_req_header *hdr,`
			`TDB_DATA key, TDB_DATA data,`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`uint64_t rsn, uint32_t record_flags)`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`{`
			`struct ctdb_call_state *state;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
			`struct ctdb_ltdb_header header;`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`int ret;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
Reduce the log level for two debug messages DEBUG(DEBUG_DEBUG,("pnn %u dmaster response %08x\n", ctdb->pnn, ctdb_has DEBUG(DEBUG_DEBUG,("pnn %u dmaster request on %08x for %u from %u\n", (This used to be ctdb commit a3473e7a445b14520a49585c460429dfbfe1fce0) 2010-02-11 03:49:48 +03:00			`DEBUG(DEBUG_DEBUG,("pnn %u dmaster response %08x\n", ctdb->pnn, ctdb_hash(&key)));`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
			`ZERO_STRUCT(header);`
ctdb_call: don't bump the rsn in ctdb_become_dmaster() any more This is now done in ctdb_ltdb_store_server(), so this extra bump can be spared. Signed-off-by: Michael Adam <obnox@samba.org> Reviewed-By: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit cad3107b12e8392f786f9a758ee38cf3a3d58538) 2013-04-03 14:02:59 +04:00			`header.rsn = rsn;`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`header.dmaster = ctdb->pnn;`
call: hand the submitted record_flags to local record storage function. (This used to be ctdb commit 4079b8bf7a57a27a45d29784a1b0a414c778e552) 2010-12-10 16:07:21 +03:00			`header.flags = record_flags;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00
ctdb-daemon: Use reqid abstraction Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-03-17 06:30:18 +03:00			`state = reqid_find(ctdb->idr, hdr->reqid, struct ctdb_call_state);`
call: becoming dmaster in VACUUM_MIGRATION, set the VACUUM_MIGRATED record flag This temporary flag is used for the local record storage function to decide whether to delete an empty record which has never been migrated with data as part of the fast-path vacuuming process or, or to store the record. (This used to be ctdb commit c11ca778ee90444c44dee0a629cd2eefa3a1f75e) 2010-12-10 16:11:38 +03:00
			`if (state) {`
			`if (state->call->flags & CTDB_CALL_FLAG_VACUUM_MIGRATION) {`
			`/*`
			`* We temporarily add the VACUUM_MIGRATED flag to`
			`* the record flags, so that ctdb_ltdb_store can`
			`* decide whether the record should be stored or`
			`* deleted.`
			`*/`
			`header.flags |= CTDB_REC_FLAG_VACUUM_MIGRATED;`
			`}`
			`}`

yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`if (ctdb_ltdb_store(ctdb_db, key, &header, data) != 0) {`
			`ctdb_fatal(ctdb, "ctdb_reply_dmaster store failed\n");`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`return;`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`/* we just became DMASTER and this database is "sticky",`
			`see if the record is flagged as "hot" and set up a pin-down`
			`context to stop migrations for a little while if so`
			`*/`
ctdb-daemon: Add accessors for CTDB_DB_FLAGS_STICKY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:47:46 +03:00			`if (ctdb_db_sticky(ctdb_db)) {`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`ctdb_set_sticky_pindown(ctdb, ctdb_db, key);`
			`}`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("pnn %u Invalid reqid %u in ctdb_become_dmaster from node %u\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, hdr->reqid, hdr->srcnode));`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`return;`
			`}`

idr can timeout and wrap/be reused quite quickly. If a noremote node hangs for an extended period, it is possible that we might have a DMASTER request in flight for record A to that node. Eventually we will reuse the idr, and may reuse it for a DMASTER request to a different node for a different record B. If while the request for B is in flight, the first tnode un-hangs and responds back we would receive a dmaster reply for the wrong record. This would cause a record to become perpetually locked, since inside the daemon we would tdb_chainlock(dmaster_reply->pdu->key) but once the migration would complete we would chainunlock idr->state->call->key Adding code to verify that when we receive a dmaster reply packet that it does in fact match the exact same key that the state variable we have for the idr in flight. (This used to be ctdb commit 2f6a870d7ff02ceb61fde242f752dccbfcb4cb37) 2010-06-09 10:12:36 +04:00			`if (key.dsize != state->call->key.dsize || memcmp(key.dptr, state->call->key.dptr, key.dsize)) {`
fix a debug message (This used to be ctdb commit 856bd6de6218d9b70baed0e6443be4253ea31afe) 2010-06-09 10:22:01 +04:00			`DEBUG(DEBUG_ERR, ("Got bogus DMASTER packet reqid:%u from node %u. Key does not match key held in matching idr.\n", hdr->reqid, hdr->srcnode));`
idr can timeout and wrap/be reused quite quickly. If a noremote node hangs for an extended period, it is possible that we might have a DMASTER request in flight for record A to that node. Eventually we will reuse the idr, and may reuse it for a DMASTER request to a different node for a different record B. If while the request for B is in flight, the first tnode un-hangs and responds back we would receive a dmaster reply for the wrong record. This would cause a record to become perpetually locked, since inside the daemon we would tdb_chainlock(dmaster_reply->pdu->key) but once the migration would complete we would chainunlock idr->state->call->key Adding code to verify that when we receive a dmaster reply packet that it does in fact match the exact same key that the state variable we have for the idr in flight. (This used to be ctdb commit 2f6a870d7ff02ceb61fde242f752dccbfcb4cb37) 2010-06-09 10:12:36 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
			`return;`
			`}`

pass the header to ctdb_become_dmaster instead of just the reqid this allows us to print from which node Invalid or Dropped orphan become dmaster packets came from (This used to be ctdb commit 88efd1bf4c796cd2b184156b72296587bc38bb40) 2007-07-11 03:44:52 +04:00			`if (hdr->reqid != state->reqid) {`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`/* we found a record but it was the wrong one */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, ("Dropped orphan in ctdb_become_dmaster with reqid:%u from node %u\n", hdr->reqid, hdr->srcnode));`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`return;`
			`}`

ctdb-daemon: Add tracking of migration records Instead of using hopcount as a metric for hot records, use the number of migrations per second as a metric. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Apr 5 08:35:45 CEST 2017 on sn-devel-144 2017-03-21 08:48:45 +03:00			`(void) hash_count_increment(ctdb_db->migratedb, key);`

Revert "LACOUNT: Add back lacount mechanism to defer migrating a fetched/read copy until after default of 20 consecutive requests from the same node" This reverts commit 035c0d981bde8c0eee8b3f24ba8e2dc817e5b504. This is a premature optimization. Record can bounce between nodes very quickly if it is a contended record. There is no need to hold a record on a node unnecessarily. In case record contention becomes bad, enabling sticky records on a database is a better idea. Conflicts: include/ctdb_private.h server/ctdb_tunables.c Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ac417b0003f0116f116834ad2ac51482d25cfa0d) 2013-08-19 09:04:46 +04:00			`ctdb_call_local(ctdb_db, state->call, &header, state, &data, true);`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
idr can timeout and wrap/be reused quite quickly. If a noremote node hangs for an extended period, it is possible that we might have a DMASTER request in flight for record A to that node. Eventually we will reuse the idr, and may reuse it for a DMASTER request to a different node for a different record B. If while the request for B is in flight, the first tnode un-hangs and responds back we would receive a dmaster reply for the wrong record. This would cause a record to become perpetually locked, since inside the daemon we would tdb_chainlock(dmaster_reply->pdu->key) but once the migration would complete we would chainunlock idr->state->call->key Adding code to verify that when we receive a dmaster reply packet that it does in fact match the exact same key that the state variable we have for the idr in flight. (This used to be ctdb commit 2f6a870d7ff02ceb61fde242f752dccbfcb4cb37) 2010-06-09 10:12:36 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, state->call->key);`
add additional logging when tdb_chainunlock() fails so we can see where it was called from when it fails (This used to be ctdb commit 0c091b3db6bdefd371787d87bc749593ea8e3c76) 2010-06-09 08:17:35 +04:00			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
			`state->state = CTDB_CALL_DONE;`
			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
			`}`
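The key comparison in the function above guards against a reply that arrives after its request ID has been reused for a different record. A minimal sketch of such a check follows. `demo_key` and `demo_key_equal` are hypothetical names that only mirror the pointer-plus-size shape of `TDB_DATA`.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical key type: a byte pointer plus a length. */
struct demo_key {
	const unsigned char *dptr;
	size_t dsize;
};

/* Two keys match only if the lengths agree and every byte is identical. */
bool demo_key_equal(struct demo_key a, struct demo_key b)
{
	if (a.dsize != b.dsize) {
		return false;
	}
	if (a.dsize == 0) {
		return true;	/* avoid passing possibly NULL pointers to memcmp */
	}
	return memcmp(a.dptr, b.dptr, a.dsize) == 0;
}
```

A caller would compare the key in the incoming reply with the key stored in the pending request state and drop the reply when they differ.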

ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`struct dmaster_defer_call {`
			`struct dmaster_defer_call *next, *prev;`
			`struct ctdb_context *ctdb;`
			`struct ctdb_req_header *hdr;`
			`};`

			`struct dmaster_defer_queue {`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`struct ctdb_db_context *ctdb_db;`
ctdb-call: Drop all deferred requests from older generation Deferring packets has a nasty interaction with recovery. All deferred packets must be dropped when recovery happens, since those packets are tracked as pending requests and will be re-sent with new generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Sep 5 09:30:50 CEST 2014 on sn-devel-104 2014-09-02 10:10:20 +04:00			`uint32_t generation;`
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`struct dmaster_defer_call *deferred_calls;`
			`};`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`static void dmaster_defer_reprocess(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval t,`
			`void *private_data)`
			`{`
			`struct dmaster_defer_call *call = talloc_get_type(`
			`private_data, struct dmaster_defer_call);`

			`ctdb_input_pkt(call->ctdb, call->hdr);`
			`talloc_free(call);`
			`}`

			`static int dmaster_defer_queue_destructor(struct dmaster_defer_queue *ddq)`
			`{`
ctdb-call: Drop all deferred requests from older generation Deferring packets has a nasty interaction with recovery. All deferred packets must be dropped when recovery happens, since those packets are tracked as pending requests and will be re-sent with new generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Sep 5 09:30:50 CEST 2014 on sn-devel-104 2014-09-02 10:10:20 +04:00			`/* Ignore requests, if database recovery happens in-between. */`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`if (ddq->generation != ddq->ctdb_db->generation) {`
ctdb-call: Drop all deferred requests from older generation Deferring packets has a nasty interaction with recovery. All deferred packets must be dropped when recovery happens, since those packets are tracked as pending requests and will be re-sent with new generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Sep 5 09:30:50 CEST 2014 on sn-devel-104 2014-09-02 10:10:20 +04:00			`return 0;`
			`}`

ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`while (ddq->deferred_calls != NULL) {`
			`struct dmaster_defer_call *call = ddq->deferred_calls;`

			`DLIST_REMOVE(ddq->deferred_calls, call);`

			`talloc_steal(call->ctdb, call);`
			`tevent_add_timer(call->ctdb->ev, call, timeval_zero(),`
			`dmaster_defer_reprocess, call);`
			`}`
			`return 0;`
			`}`

			`static void *insert_ddq_callback(void *parm, void *data)`
			`{`
			`if (data) {`
			`talloc_free(data);`
			`}`
			`return parm;`
			`}`

			`/**`
ctdb/server/ctdb_call.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> 2019-10-26 03:41:08 +03:00			`* This function is used to register a key in database that needs to be updated.`
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`* Any requests for that key should get deferred till this is completed.`
			`*/`
			`static int dmaster_defer_setup(struct ctdb_db_context *ctdb_db,`
			`struct ctdb_req_header *hdr,`
			`TDB_DATA key)`
			`{`
			`uint32_t *k;`
			`struct dmaster_defer_queue *ddq;`

			`k = ctdb_key_to_idkey(hdr, key);`
			`if (k == NULL) {`
			`DEBUG(DEBUG_ERR, ("Failed to allocate key for dmaster defer setup\n"));`
			`return -1;`
			`}`

			`/* Already exists */`
			`ddq = trbt_lookuparray32(ctdb_db->defer_dmaster, k[0], k);`
			`if (ddq != NULL) {`
ctdb-call: Delete old defer queue if recovery occurs Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-23 07:47:58 +03:00			`if (ddq->generation == ctdb_db->generation) {`
			`talloc_free(k);`
			`return 0;`
			`}`

ctdb/server/ctdb_call.c: typo fixes Signed-off-by: Bjoern Jacke <bjacke@samba.org> Reviewed-by: Martin Schwenke <martin@meltin.net> 2019-10-26 03:41:08 +03:00			`/* Recovery occurred - get rid of old queue. All the deferred`
ctdb-call: Delete old defer queue if recovery occurs Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-23 07:47:58 +03:00			`* requests will be resent anyway from ctdb_call_resend_db.`
			`*/`
			`talloc_free(ddq);`
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`}`

			`ddq = talloc(hdr, struct dmaster_defer_queue);`
			`if (ddq == NULL) {`
			`DEBUG(DEBUG_ERR, ("Failed to allocate dmaster defer queue\n"));`
			`talloc_free(k);`
			`return -1;`
			`}`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`ddq->ctdb_db = ctdb_db;`
ctdb-call: Drop all deferred requests from older generation Deferring packets has a nasty interaction with recovery. All deferred packets must be dropped when recovery happens, since those packets are tracked as pending requests and will be re-sent with new generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Sep 5 09:30:50 CEST 2014 on sn-devel-104 2014-09-02 10:10:20 +04:00			`ddq->generation = hdr->generation;`
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`ddq->deferred_calls = NULL;`

			`trbt_insertarray32_callback(ctdb_db->defer_dmaster, k[0], k,`
			`insert_ddq_callback, ddq);`
			`talloc_set_destructor(ddq, dmaster_defer_queue_destructor);`

			`talloc_free(k);`
			`return 0;`
			`}`

			`static int dmaster_defer_add(struct ctdb_db_context *ctdb_db,`
			`struct ctdb_req_header *hdr,`
			`TDB_DATA key)`
			`{`
			`struct dmaster_defer_queue *ddq;`
			`struct dmaster_defer_call *call;`
			`uint32_t *k;`

			`k = ctdb_key_to_idkey(hdr, key);`
			`if (k == NULL) {`
			`DEBUG(DEBUG_ERR, ("Failed to allocate key for dmaster defer add\n"));`
			`return -1;`
			`}`

			`ddq = trbt_lookuparray32(ctdb_db->defer_dmaster, k[0], k);`
			`if (ddq == NULL) {`
			`talloc_free(k);`
			`return -1;`
			`}`

			`talloc_free(k);`

ctdb-call: Drop all deferred requests from older generation Deferring packets has a nasty interaction with recovery. All deferred packets must be dropped when recovery happens, since those packets are tracked as pending requests and will be re-sent with new generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Fri Sep 5 09:30:50 CEST 2014 on sn-devel-104 2014-09-02 10:10:20 +04:00			`if (ddq->generation != hdr->generation) {`
			`talloc_set_destructor(ddq, NULL);`
			`talloc_free(ddq);`
			`return -1;`
			`}`

ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`call = talloc(ddq, struct dmaster_defer_call);`
			`if (call == NULL) {`
			`DEBUG(DEBUG_ERR, ("Failed to allocate dmaster defer call\n"));`
			`return -1;`
			`}`

			`call->ctdb = ctdb_db->ctdb;`
			`call->hdr = talloc_steal(call, hdr);`

dlist: remove unneeded type argument from DLIST_ADD_END() Signed-off-by: Michael Adam <obnox@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org> 2016-02-05 13:32:18 +03:00			`DLIST_ADD_END(ddq->deferred_calls, call);`
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00
			`return 0;`
			`}`
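The defer-queue behaviour above can be sketched without talloc or tevent. This is a minimal illustration, not the daemon's code: `demo_call`, `demo_queue`, `demo_defer`, `demo_flush` and `demo_replay_count` are hypothetical names, and "replaying" a call is reduced to counting it. It assumes the same rule as the destructor above: deferred calls are replayed only when the queue's generation still matches the database generation, and are otherwise dropped.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical stand-ins for illustration; not the CTDB structures. */
struct demo_call {
	struct demo_call *next;
	int id;
};

struct demo_queue {
	uint32_t generation;     /* generation recorded when the queue was set up */
	struct demo_call *head;  /* deferred calls, oldest first */
	struct demo_call *tail;
};

/* Append a call to the queue. Returns 0 on success, -1 on allocation failure. */
int demo_defer(struct demo_queue *q, int id)
{
	struct demo_call *c;

	assert(q != NULL);
	c = calloc(1, sizeof(*c));
	if (c == NULL) {
		return -1;
	}
	c->id = id;
	if (q->tail == NULL) {
		q->head = c;
	} else {
		q->tail->next = c;
	}
	q->tail = c;
	return 0;
}

/*
 * Tear down the queue. Calls are replayed (here: only counted) if the
 * database generation is unchanged; after a recovery they are dropped,
 * on the assumption that the sender will re-send them.
 */
int demo_flush(struct demo_queue *q, uint32_t db_generation)
{
	int replayed = 0;

	assert(q != NULL);
	while (q->head != NULL) {
		struct demo_call *c = q->head;

		q->head = c->next;
		if (q->generation == db_generation) {
			replayed++;
		}
		free(c);
	}
	q->tail = NULL;
	return replayed;
}

/* Convenience driver: defer ncalls calls, then flush against db_gen. */
int demo_replay_count(uint32_t queue_gen, uint32_t db_gen, int ncalls)
{
	struct demo_queue q = { .generation = queue_gen };
	int i;

	for (i = 0; i < ncalls; i++) {
		if (demo_defer(&q, i) != 0) {
			break;
		}
	}
	return demo_flush(&q, db_gen);
}
```

Calling `demo_replay_count(7, 7, 3)` replays all three calls, while `demo_replay_count(7, 8, 3)` models a recovery in between and replays none.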
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
			`/*`
			`called when a CTDB_REQ_DMASTER packet comes in`

			`this comes into the lmaster for a record when the current dmaster`
			`wants to give up the dmaster role and give it to someone else`
			`*/`
			`void ctdb_request_dmaster(struct ctdb_context *ctdb, struct ctdb_req_header *hdr)`
			`{`
ctdb-daemon: Rename struct ctdb_req_dmaster to ctdb_req_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:32:09 +03:00			`struct ctdb_req_dmaster_old *c = (struct ctdb_req_dmaster_old *)hdr;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`TDB_DATA key, data, data2;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`struct ctdb_ltdb_header header;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_db_context *ctdb_db;`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`uint32_t record_flags = 0;`
			`size_t len;`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`int ret;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
			`key.dptr = c->data;`
			`key.dsize = c->keylen;`
			`data.dptr = c->data + c->keylen;`
			`data.dsize = c->datalen;`
ctdb-daemon: Rename struct ctdb_req_dmaster to ctdb_req_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:32:09 +03:00			`len = offsetof(struct ctdb_req_dmaster_old, data) + key.dsize + data.dsize`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`+ sizeof(uint32_t);`
			`if (len <= c->hdr.length) {`
ctdb-daemon: Fix some strict-aliasing warnings Seeing these with -Wall: ../server/ctdb_call.c:1117:3: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] record_flags = *(uint32_t *)&c->data[c->keylen + c->datalen]; ^ memcpy() seems to be the easiest way to get fix these. The alternative would be to use unmarshalling functions. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2014-08-04 08:50:17 +04:00			`memcpy(&record_flags, &c->data[c->keylen + c->datalen],`
			`sizeof(record_flags));`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
Revert "ctdb-daemon: Check packet generation against database generation" This reverts commit 0ff90f4fac74e61192aff100b168e38ce0adfabb. BUG: https://bugzilla.samba.org/show_bug.cgi?id=11707 The checks against database generation are not required since the global generation is updated as part of updating vnnmap before the actual database recovery. This change was done in 5aab31a39a3589b910a78b96071d6aa5e6547696. Checking only against the database generation is incomplete. It can cause CTDB to abort if the following sequence of events happen. - CTDB gets REQ_DMASTER packet (gen1) This packet processing gets deferred to get a record lock - CTDB goes into recovery, marks RECOVERY_ACTIVE CTDB recovery helper updates vnnmap (gen2) - CTDB processes REQ_DMASTER packet (gen1) The check against database generation (gen1) succeeds. The check for lmaster is now invalid because VNNMAP has changed. This will cause CTDB to abort due to protocol error. Reverting the patch stops processing packets of older generation before they get into call processing. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Feb 9 12:39:24 CET 2016 on sn-devel-144 2016-02-02 07:58:37 +03:00			`ctdb_db = find_ctdb_db(ctdb, c->db_id);`
			`if (!ctdb_db) {`
			`ctdb_send_error(ctdb, hdr, -1,`
			`"Unknown database in request. db_id==0x%08x",`
			`c->db_id);`
			`return;`
			`}`

ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`dmaster_defer_setup(ctdb_db, hdr, key);`

- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`/* fetch the current record */`
			`ret = ctdb_ltdb_lock_fetch_requeue(ctdb_db, key, &header, hdr, &data2,`
server: Replace BOOL datatype with bool, True/False with true/false Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 6e5cbe8fff71985e5a2fc16b7e9f2b868011ff5d) 2012-05-17 10:08:37 +04:00			`ctdb_call_input_pkt, ctdb, false);`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`if (ret == -1) {`
			`ctdb_fatal(ctdb, "ctdb_req_dmaster failed to fetch record");`
			`return;`
			`}`
			`if (ret == -2) {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " deferring ctdb_request_dmaster\n"));`
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`return;`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (ctdb_lmaster(ctdb, &key) != ctdb->pnn) {`
ctdb-daemon: Improve log message when REQ_DMASTER is received on non-lmaster Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-12-03 09:43:44 +03:00			`DEBUG(DEBUG_ERR, ("dmaster request to non-lmaster "`
			`"db=%s lmaster=%u gen=%u curgen=%u\n",`
			`ctdb_db->db_name, ctdb_lmaster(ctdb, &key),`
			`hdr->generation, ctdb_db->generation));`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`ctdb_fatal(ctdb, "ctdb_req_dmaster to non-lmaster");`
			`}`

Reduce the log level for two debug messages DEBUG(DEBUG_DEBUG,("pnn %u dmaster response %08x\n", ctdb->pnn, ctdb_has DEBUG(DEBUG_DEBUG,("pnn %u dmaster request on %08x for %u from %u\n", (This used to be ctdb commit a3473e7a445b14520a49585c460429dfbfe1fce0) 2010-02-11 03:49:48 +03:00			`DEBUG(DEBUG_DEBUG,("pnn %u dmaster request on %08x for %u from %u\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, ctdb_hash(&key), c->dmaster, c->hdr.srcnode));`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
- added a --torture option to all ctdb tools. This sets CTDB_FLAG_TORTURE, which forces some race conditions to be much more likely. For example a 20% chance of not getting the lock on the first try in the daemon - abstraced the ctdb_ltdb_lock_fetch_requeue() code to allow it to work with both inter-node packets and client->daemon packets - fixed a bug left over in ctdb_call from when the client updated the header on a call reply - removed CTDB_FLAG_CONNECT_WAIT flag (not needed any more) (This used to be ctdb commit 7559dcd184666c3853127e3c8f5baef4fea327c4) 2007-04-19 10:27:56 +04:00			`/* it's a protocol error if the sending node is not the current dmaster */`
- when handling a record migration in the lmaster, bypass the usual dmaster request stage, and instead directly send a dmaster reply. This avoids a race condition where a new call comes in for the same record while processing the dmaster request - don't keep any redirect records during a ctdb call. This prevents a memory leak in case of a redirect storm (This used to be ctdb commit 59889ca0fd606c7d2156839383a09dfc5a2e4853) 2007-04-22 16:26:45 +04:00			`if (header.dmaster != hdr->srcnode) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("pnn %u dmaster request for new-dmaster %u from non-master %u real-dmaster=%u key %08x dbid 0x%08x gen=%u curgen=%u c->rsn=%llu header.rsn=%llu reqid=%u keyval=0x%08x\n",`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`ctdb->pnn, c->dmaster, hdr->srcnode, header.dmaster, ctdb_hash(&key),`
Revert "ctdb-daemon: Check packet generation against database generation" This reverts commit 0ff90f4fac74e61192aff100b168e38ce0adfabb. BUG: https://bugzilla.samba.org/show_bug.cgi?id=11707 The checks against database generation are not required since the global generation is updated as part of updating vnnmap before the actual database recovery. This change was done in 5aab31a39a3589b910a78b96071d6aa5e6547696. Checking only against the database generation is incomplete. It can cause CTDB to abort if the following sequence of events happen. - CTDB gets REQ_DMASTER packet (gen1) This packet processing gets deferred to get a record lock - CTDB goes into recovery, marks RECOVERY_ACTIVE CTDB recovery helper updates vnnmap (gen2) - CTDB processes REQ_DMASTER packet (gen1) The check against database generation (gen1) succeeds. The check for lmaster is now invalid because VNNMAP has changed. This will cause CTDB to abort due to protocol error. Reverting the patch stops processing packets of older generation before they get into call processing. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Feb 9 12:39:24 CET 2016 on sn-devel-144 2016-02-02 07:58:37 +03:00			`ctdb_db->db_id, hdr->generation, ctdb->vnn_map->generation,`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`(unsigned long long)c->rsn, (unsigned long long)header.rsn, c->hdr.reqid,`
			`(key.dsize >= 4)?(*(uint32_t *)key.dptr):0));`
Revert "From Wolfgang M." This reverts commit 5b70fa8cfd5916d3c212823ad5cc1b251ae175ed. (This used to be ctdb commit 363e7e939ad46b3f75c83c30d4163d63876c2456) 2009-10-29 05:44:12 +03:00			`if (header.rsn != 0 || header.dmaster != ctdb->pnn) {`
ctdb_req_dmaster from non-master If we find a situatior where we get a stray packet with the wrong dmaster, dont suicide with ctdb_fatal() since this is too disruptive. Just drop the stray packet and force a recovery to make sure all is good again. CQ S1022004 (This used to be ctdb commit 62b7fe853db37c0a90e48a0332a3426a8dcb4ed8) 2011-02-18 03:21:19 +03:00			`DEBUG(DEBUG_ERR,("ctdb_req_dmaster from non-master. Force a recovery.\n"));`

			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
ctdbd: fix lock held on error ("ctdb_req_dmaster from non-master.") We should release the lock on the record before returning; otherwise the recovery (which tries to freeze the database) will fail. Symptoms are as follows: ctdbd: pnn 15 dmaster request for new-dmaster 19 from non-master 1 real-dmaster=5 key f049c3c8 dbid 0x6cf2837d gen=1148812532 curgen=1148812532 c->rsn=2 header.rsn=15 reqid=2147483585 keyval=0x4f464e49 ctdbd: ctdb_req_dmaster from non-master. Force a recovery. ... ctdbd: freeze_lock-1:server/ctdb_freeze.c:55 Failed to lock database registry.tdb CQ:1022545 (This used to be ctdb commit 38b2dbe0605816742e74e2b8a811eaba99c7e12d) 2011-03-21 05:33:01 +03:00			`ctdb_ltdb_unlock(ctdb_db, key);`
Revert "From Wolfgang M." This reverts commit 5b70fa8cfd5916d3c212823ad5cc1b251ae175ed. (This used to be ctdb commit 363e7e939ad46b3f75c83c30d4163d63876c2456) 2009-10-29 05:44:12 +03:00			`return;`
			`}`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`}`

			`if (header.rsn > c->rsn) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT,("pnn %u dmaster request with older RSN new-dmaster %u from %u real-dmaster=%u key %08x dbid 0x%08x gen=%u curgen=%u c->rsn=%llu header.rsn=%llu reqid=%u\n",`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`ctdb->pnn, c->dmaster, hdr->srcnode, header.dmaster, ctdb_hash(&key),`
Revert "ctdb-daemon: Check packet generation against database generation" This reverts commit 0ff90f4fac74e61192aff100b168e38ce0adfabb. BUG: https://bugzilla.samba.org/show_bug.cgi?id=11707 The checks against database generation are not required since the global generation is updated as part of updating vnnmap before the actual database recovery. This change was done in 5aab31a39a3589b910a78b96071d6aa5e6547696. Checking only against the database generation is incomplete. It can cause CTDB to abort if the following sequence of events happen. - CTDB gets REQ_DMASTER packet (gen1) This packet processing gets deferred to get a record lock - CTDB goes into recovery, marks RECOVERY_ACTIVE CTDB recovery helper updates vnnmap (gen2) - CTDB processes REQ_DMASTER packet (gen1) The check against database generation (gen1) succeeds. The check for lmaster is now invalid because VNNMAP has changed. This will cause CTDB to abort due to protocol error. Reverting the patch stops processing packets of older generation before they get into call processing. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Feb 9 12:39:24 CET 2016 on sn-devel-144 2016-02-02 07:58:37 +03:00			`ctdb_db->db_id, hdr->generation, ctdb->vnn_map->generation,`
make some specific cases of the non-dmaster bug non-fatal (This used to be ctdb commit 7b516ab06c7ba7ffe9ecf3f76720df5360176b2c) 2008-01-05 01:32:29 +03:00			`(unsigned long long)c->rsn, (unsigned long long)header.rsn, c->hdr.reqid));`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`

ensure we propogate the correct rsn for a request dmaster (This used to be ctdb commit 70c1c67db865db8a49b56e8e3e8fd56ec5063208) 2007-05-12 13:55:18 +04:00			`/* use the rsn from the sending node */`
			`header.rsn = c->rsn;`

call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`/* store the record flags from the sending node */`
			`header.flags = record_flags;`

fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`/* check if the new dmaster is the lmaster, in which case we`
			`skip the dmaster reply */`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`if (c->dmaster == ctdb->pnn) {`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`ctdb_become_dmaster(ctdb_db, hdr, key, data, c->rsn, record_flags);`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`} else {`
			`ctdb_send_dmaster_reply(ctdb_db, &header, key, data, c->dmaster, hdr->reqid);`
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`
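The optional trailing record flags read at the top of `ctdb_request_dmaster()` follow a bounds-check-then-`memcpy()` pattern. The sketch below shows that pattern in isolation. The `demo_pkt` layout and the `demo_record_flags` and `demo_roundtrip` helpers are assumptions made for illustration and do not reproduce `struct ctdb_req_dmaster_old`; only the check (header offset plus key, data and one `uint32_t` must fit within the stated length) mirrors the code above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical packet layout for illustration only. */
struct demo_pkt {
	uint32_t length;   /* total packet length in bytes */
	uint32_t keylen;
	uint32_t datalen;
	uint8_t data[];    /* key, then data, then optional uint32_t flags */
};

/*
 * Return the trailing flags if the packet is long enough to carry them,
 * otherwise 0. memcpy() avoids dereferencing a type-punned, possibly
 * unaligned pointer.
 */
uint32_t demo_record_flags(const struct demo_pkt *p)
{
	uint32_t flags = 0;
	size_t len;

	assert(p != NULL);
	len = offsetof(struct demo_pkt, data)
		+ (size_t)p->keylen + (size_t)p->datalen + sizeof(uint32_t);
	if (len <= p->length) {
		memcpy(&flags, &p->data[p->keylen + p->datalen], sizeof(flags));
	}
	return flags;
}

/* Build a small packet, with or without trailing flags, and parse it back. */
uint32_t demo_roundtrip(uint32_t flags, int with_flags)
{
	const uint8_t key[2] = { 'k', '1' };
	const uint8_t val[3] = { 'a', 'b', 'c' };
	size_t body = sizeof(key) + sizeof(val);
	size_t total = offsetof(struct demo_pkt, data) + body
		+ (with_flags ? sizeof(flags) : 0);
	struct demo_pkt *p = calloc(1, total);
	uint32_t out;

	if (p == NULL) {
		return 0;
	}
	p->length = (uint32_t)total;
	p->keylen = (uint32_t)sizeof(key);
	p->datalen = (uint32_t)sizeof(val);
	memcpy(p->data, key, sizeof(key));
	memcpy(p->data + sizeof(key), val, sizeof(val));
	if (with_flags) {
		memcpy(p->data + body, &flags, sizeof(flags));
	}
	out = demo_record_flags(p);
	free(p);
	return out;
}
```

A packet built without the trailing word is too short to pass the length check, so the flags default to 0 instead of reading past the end of the buffer.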

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`static void ctdb_sticky_record_timeout(struct tevent_context *ev,`
			`struct tevent_timer *te,`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_sticky_record *sr = talloc_get_type(private_data,`
			`struct ctdb_sticky_record);`
			`talloc_free(sr);`
			`}`

			`static void *ctdb_make_sticky_record_callback(void *parm, void *data)`
			`{`
			`if (data) {`
			`DEBUG(DEBUG_ERR,("Already have sticky record registered. Free old %p and create new %p\n", data, parm));`
			`talloc_free(data);`
			`}`
			`return parm;`
			`}`

			`static int`
			`ctdb_make_record_sticky(struct ctdb_context *ctdb, struct ctdb_db_context *ctdb_db, TDB_DATA key)`
			`{`
			`TALLOC_CTX *tmp_ctx = talloc_new(NULL);`
			`uint32_t *k;`
			`struct ctdb_sticky_record *sr;`

ctdb-daemon: Remove duplicate code with refactored function Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 07:33:24 +04:00			`k = ctdb_key_to_idkey(tmp_ctx, key);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`if (k == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate key for sticky record\n"));`
			`talloc_free(tmp_ctx);`
			`return -1;`
			`}`

			`sr = trbt_lookuparray32(ctdb_db->sticky_records, k[0], &k[0]);`
			`if (sr != NULL) {`
			`talloc_free(tmp_ctx);`
			`return 0;`
			`}`

			`sr = talloc(ctdb_db->sticky_records, struct ctdb_sticky_record);`
			`if (sr == NULL) {`
			`talloc_free(tmp_ctx);`
			`DEBUG(DEBUG_ERR,("Failed to allocate sticky record structure\n"));`
			`return -1;`
			`}`

			`sr->ctdb = ctdb;`
			`sr->ctdb_db = ctdb_db;`
			`sr->pindown = NULL;`

ctdbd: When a record is made sticky, log only once Instead of logging from ctdb_request_call(), log the message from ctdb_make_record_sticky(). That way if the record is already sticky, the message is not repeated unnecessarily. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 44a64d1c388bfe3c3388b191edfaedecfb7bb831) 2013-07-31 09:59:11 +04:00			`DEBUG(DEBUG_ERR,("Make record sticky for %d seconds in db %s key:0x%08x.\n",`
			`ctdb->tunable.sticky_duration,`
			`ctdb_db->db_name, ctdb_hash(&key)));`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
			`trbt_insertarray32_callback(ctdb_db->sticky_records, k[0], &k[0], ctdb_make_sticky_record_callback, sr);`

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, sr,`
			`timeval_current_ofs(ctdb->tunable.sticky_duration, 0),`
			`ctdb_sticky_record_timeout, sr);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
			`talloc_free(tmp_ctx);`
			`return 0;`
			`}`

			`struct pinned_down_requeue_handle {`
			`struct ctdb_context *ctdb;`
			`struct ctdb_req_header *hdr;`
			`};`

			`struct pinned_down_deferred_call {`
			`struct ctdb_context *ctdb;`
			`struct ctdb_req_header *hdr;`
			`};`

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`static void pinned_down_requeue(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval t, void *private_data)`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`{`
			`struct pinned_down_requeue_handle *handle = talloc_get_type(private_data, struct pinned_down_requeue_handle);`
			`struct ctdb_context *ctdb = handle->ctdb;`

			`talloc_steal(ctdb, handle->hdr);`
			`ctdb_call_input_pkt(ctdb, handle->hdr);`

			`talloc_free(handle);`
			`}`

			`static int pinned_down_destructor(struct pinned_down_deferred_call *pinned_down)`
			`{`
			`struct ctdb_context *ctdb = pinned_down->ctdb;`
			`struct pinned_down_requeue_handle *handle = talloc(ctdb, struct pinned_down_requeue_handle);`

			`handle->ctdb = pinned_down->ctdb;`
			`handle->hdr = pinned_down->hdr;`
			`talloc_steal(handle, handle->hdr);`

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, handle, timeval_zero(),`
			`pinned_down_requeue, handle);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
			`return 0;`
			`}`

			`static int`
			`ctdb_defer_pinned_down_request(struct ctdb_context *ctdb, struct ctdb_db_context *ctdb_db, TDB_DATA key, struct ctdb_req_header *hdr)`
			`{`
			`TALLOC_CTX *tmp_ctx = talloc_new(NULL);`
			`uint32_t *k;`
			`struct ctdb_sticky_record *sr;`
			`struct pinned_down_deferred_call *pinned_down;`

ctdb-daemon: Remove duplicate code with refactored function Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 07:33:24 +04:00			`k = ctdb_key_to_idkey(tmp_ctx, key);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`if (k == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate key for sticky record\n"));`
			`talloc_free(tmp_ctx);`
			`return -1;`
			`}`

			`sr = trbt_lookuparray32(ctdb_db->sticky_records, k[0], &k[0]);`
			`if (sr == NULL) {`
			`talloc_free(tmp_ctx);`
			`return -1;`
			`}`

			`talloc_free(tmp_ctx);`

			`if (sr->pindown == NULL) {`
			`return -1;`
			`}`

			`pinned_down = talloc(sr->pindown, struct pinned_down_deferred_call);`
			`if (pinned_down == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate structure for deferred pinned down request\n"));`
			`return -1;`
			`}`

			`pinned_down->ctdb = ctdb;`
			`pinned_down->hdr = hdr;`

			`talloc_set_destructor(pinned_down, pinned_down_destructor);`
			`talloc_steal(pinned_down, hdr);`

			`return 0;`
			`}`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
ctdb-daemon: Fix sorting of hot keys The current code only ever swaps with slot 0. This will only ever happen with slots 0 and 1, so probably never sorts. Replace with qsort(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:59:47 +03:00			`static int hot_key_cmp(const void *a, const void *b)`
			`{`
			`const struct ctdb_db_hot_key *ka = (const struct ctdb_db_hot_key *)a;`
			`const struct ctdb_db_hot_key *kb = (const struct ctdb_db_hot_key *)b;`

			`if (ka->count < kb->count) {`
			`return -1;`
			`}`
			`if (ka->count > kb->count) {`
			`return 1;`
			`}`

			`return 0;`
			`}`

STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`static void`
ctdb-daemon: For hot records, use count instead of hopcount This avoids tying hopcounts to hot records. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-04-03 10:32:32 +03:00			`ctdb_update_db_stat_hot_keys(struct ctdb_db_context *ctdb_db, TDB_DATA key,`
ctdb-daemon: Avoid signed/unsigned comparison by declaring as unsigned Compiling with -Wsign-compare complains: ctdb/server/ctdb_call.c:831:12: warning: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Wsign-compare] 831 \| if (count <= ctdb_db->statistics.hot_keys[0].count) { \| ^~ and ctdb/server/ctdb_call.c:844:13: warning: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Wsign-compare] 844 \| if (count <= ctdb_db->statistics.hot_keys[i].count) { \| ^~ Found by cs-build. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2019-08-01 03:55:39 +03:00			`unsigned int count)`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`{`
ctdb-daemon: Switch some variables to unsigned These should be unsigned but luck is currently on our side. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:59:24 +03:00			`unsigned int i, id;`
ctdb: Print key as hex string instead of just the hash in hot record message Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Volker Lendecke <vl@samba.org> 2017-02-17 02:23:39 +03:00			`char *keystr;`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00
ctdb-daemon: Fix bug in slot 0 comparison optimisation This is only valid if all slots are in use. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-05-14 13:25:22 +03:00			`/*`
			`* If all slots are being used then only need to compare`
			`* against the count in the 0th slot, since it contains the`
			`* smallest count.`
			`*/`
			`if (ctdb_db->statistics.num_hot_keys == MAX_HOT_KEYS &&`
			`count <= ctdb_db->hot_keys[0].count) {`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`return;`
			`}`

			`/* see if we already know this key */`
			`for (i = 0; i < MAX_HOT_KEYS; i++) {`
ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:51:40 +03:00			`if (key.dsize != ctdb_db->hot_keys[i].key.dsize) {`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`continue;`
			`}`
ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:51:40 +03:00			`if (memcmp(key.dptr, ctdb_db->hot_keys[i].key.dptr, key.dsize)) {`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`continue;`
			`}`
			`/* found an entry for this key */`
ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:51:40 +03:00			`if (count <= ctdb_db->hot_keys[i].count) {`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`return;`
			`}`
ctdb-daemon: Add extra logging of hot keys ctdbd currently only logs when a new hot key is added. If a key gets hotter then nothing new is logged. Log hot key updates when the number of migrations has doubled since the last time that key was logged. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-05-01 09:44:22 +03:00			`if (count >= (2 * ctdb_db->hot_keys[i].last_logged_count)) {`
			`keystr = hex_encode_talloc(ctdb_db,`
			`(unsigned char *)key.dptr,`
			`key.dsize);`
			`D_NOTICE("Updated hot key database=%s key=%s count=%d\n",`
			`ctdb_db->db_name,`
			`keystr ? keystr : "" ,`
			`count);`
			`TALLOC_FREE(keystr);`
			`ctdb_db->hot_keys[i].last_logged_count = count;`
			`}`
ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:51:40 +03:00			`ctdb_db->hot_keys[i].count = count;`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`goto sort_keys;`
			`}`

ctdbd: Fix updating of hot keys in database statistics Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit fde4b4db5a57f75c5efa5647c309f33e0d5a68f3) 2013-07-12 11:33:13 +04:00			`if (ctdb_db->statistics.num_hot_keys < MAX_HOT_KEYS) {`
			`id = ctdb_db->statistics.num_hot_keys;`
			`ctdb_db->statistics.num_hot_keys++;`
			`} else {`
			`id = 0;`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`}`

ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:51:40 +03:00			`if (ctdb_db->hot_keys[id].key.dptr != NULL) {`
			`talloc_free(ctdb_db->hot_keys[id].key.dptr);`
ctdbd: Fix updating of hot keys in database statistics Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit fde4b4db5a57f75c5efa5647c309f33e0d5a68f3) 2013-07-12 11:33:13 +04:00			`}`
ctdb-daemon: Add separate hot keys array for database statistics There are 2 reasons for this. Sorting of hot keys is broken and will be changed to an implementation that needs a named (i.e. not anonymous) structure. Also, at least one non-protocol field will be added to facilitate more useful logging. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:51:40 +03:00			`ctdb_db->hot_keys[id].key.dsize = key.dsize;`
			`ctdb_db->hot_keys[id].key.dptr = talloc_memdup(ctdb_db,`
			`key.dptr,`
			`key.dsize);`
			`ctdb_db->hot_keys[id].count = count;`
ctdb: Print key as hex string instead of just the hash in hot record message Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Volker Lendecke <vl@samba.org> 2017-02-17 02:23:39 +03:00
			`keystr = hex_encode_talloc(ctdb_db,`
			`(unsigned char *)key.dptr, key.dsize);`
ctdb-daemon: Update hot key logging This message indicates that a hot key was added, so say that. After all the hot key slots have been filled the id will always be 0, so don't bother logging it. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-05-01 09:24:27 +03:00			`D_NOTICE("Added hot key database=%s key=%s count=%d\n",`
			`ctdb_db->db_name,`
			`keystr ? keystr : "" ,`
			`count);`
ctdb: Print key as hex string instead of just the hash in hot record message Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Volker Lendecke <vl@samba.org> 2017-02-17 02:23:39 +03:00			`talloc_free(keystr);`
ctdb-daemon: Add extra logging of hot keys ctdbd currently only logs when a new hot key is added. If a key gets hotter then nothing new is logged. Log hot key updates when the number of migrations has doubled since the last time that key was logged. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-05-01 09:44:22 +03:00			`ctdb_db->hot_keys[id].last_logged_count = count;`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00
			`sort_keys:`
ctdb-daemon: Fix sorting of hot keys The current code only ever swaps with slot 0. This will only ever happen with slots 0 and 1, so probably never sorts. Replace with qsort(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-04-23 11:59:47 +03:00			`qsort(&ctdb_db->hot_keys[0],`
			`ctdb_db->statistics.num_hot_keys,`
			`sizeof(struct ctdb_db_hot_key),`
			`hot_key_cmp);`
STATISTICS: Add tracking of the 10 hottest keys per database measured in hopcount and add mechanisms to dump it using the ctdb dbstatistics command (This used to be ctdb commit 8307c70ed98996b430c470e9641a09fdeeb81bd8) 2012-06-13 10:17:18 +04:00			`}`
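
The hot-key bookkeeping above can be illustrated in isolation. The following is a minimal sketch, not the ctdb implementation: it assumes fixed-size string keys instead of `TDB_DATA`, omits logging and talloc, and scans only the slots in use. All `sketch_*` names and `SKETCH_MAX_HOT_KEYS` are hypothetical. It shows why keeping the array sorted in ascending order lets slot 0 serve both as a quick rejection test and as the eviction victim.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

#define SKETCH_MAX_HOT_KEYS 3	/* hypothetical; kept small for illustration */

struct sketch_hot_key {
	char key[16];
	unsigned int count;
};

struct sketch_db {
	struct sketch_hot_key hot_keys[SKETCH_MAX_HOT_KEYS];
	unsigned int num_hot_keys;
};

/* Ascending order by count, so slot 0 always holds the coolest key */
static int sketch_hot_key_cmp(const void *a, const void *b)
{
	const struct sketch_hot_key *ka = a;
	const struct sketch_hot_key *kb = b;

	if (ka->count < kb->count) {
		return -1;
	}
	if (ka->count > kb->count) {
		return 1;
	}
	return 0;
}

static void sketch_update_hot_keys(struct sketch_db *db, const char *key,
				   unsigned int count)
{
	unsigned int i, id;

	assert(strlen(key) < sizeof(db->hot_keys[0].key));

	/* Table full and not hotter than the coolest entry: nothing to do */
	if (db->num_hot_keys == SKETCH_MAX_HOT_KEYS &&
	    count <= db->hot_keys[0].count) {
		return;
	}

	/* Known key: only ever raise its count (simplification: scan used slots) */
	for (i = 0; i < db->num_hot_keys; i++) {
		if (strcmp(key, db->hot_keys[i].key) != 0) {
			continue;
		}
		if (count <= db->hot_keys[i].count) {
			return;
		}
		db->hot_keys[i].count = count;
		goto sort_keys;
	}

	/* New key: take a free slot, or evict the coolest key in slot 0 */
	if (db->num_hot_keys < SKETCH_MAX_HOT_KEYS) {
		id = db->num_hot_keys++;
	} else {
		id = 0;
	}
	strcpy(db->hot_keys[id].key, key);
	db->hot_keys[id].count = count;

sort_keys:
	qsort(db->hot_keys, db->num_hot_keys,
	      sizeof(db->hot_keys[0]), sketch_hot_key_cmp);
}
```

Re-sorting on every change costs little for a table this small and keeps the "smallest count in slot 0" invariant that the early return relies on.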

- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`/*`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`called when a CTDB_REQ_CALL packet comes in`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`*/`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`void ctdb_request_call(struct ctdb_context *ctdb, struct ctdb_req_header *hdr)`
- added simple (fake) vnn system - split up ctdb layer code into 3 modules - added a simple test suite - added packet structures for ctdb_call - switched to an array for ctdb_node to make vnn lookup easy and fast (This used to be ctdb commit 8a17460a816a5970f2df8244a06aec55d814f186) 2006-11-28 09:56:10 +03:00			`{`
ctdb-daemon: Rename struct ctdb_req_call to ctdb_req_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:26:29 +03:00			`struct ctdb_req_call_old *c = (struct ctdb_req_call_old *)hdr;`
simplified ctdb_call() interface, and made it easier to expand with more parameters later (This used to be ctdb commit 6c816fe85e84faad167101bcf26850966c3044e5) 2007-01-25 08:13:17 +03:00			`TDB_DATA data;`
ctdb-daemon: Rename struct ctdb_reply_call to ctdb_reply_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:29:01 +03:00			`struct ctdb_reply_call_old *r;`
merge fixes from samba4 (This used to be ctdb commit fb90a5424348d0b6ed9a1b8da4ceadcc4d1a1cb1) 2007-01-23 03:38:45 +03:00			`int ret, len;`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`struct ctdb_ltdb_header header;`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`struct ctdb_call *call;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_db_context *ctdb_db;`
add max hop count buckets to see how bad hopcounts are (This used to be ctdb commit 7d3931298e6477d92f43652c3006b0c426cb1307) 2012-03-07 10:02:41 +04:00			`int tmp_count, bucket;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00
Dont even try allocating and sending a CALL packet if the transport is down (This used to be ctdb commit cb8dd896914d4e44ad7b8bb000176a7c78f394ae) 2009-06-30 06:16:13 +04:00			`if (ctdb->methods == NULL) {`
during shutdown there is a window after we have stopped TCP and disconnected from all other nodes but before we have stopped all processing. During this window we may still hit asynchronous events that will fail because we can not send/receive packets from other nodes. These messages are logged as ... Transport is DOWN. To help indicate that they are benign messages related to the process of shutting down. These messages spam the syslog during normal shutdown, so this patch will drop the loglevel of these messages to DEBUG, so that they will not appear in or spam the syslog. (This used to be ctdb commit 8275d265d2ae19b765e30ecf18f6b6319b6e6453) 2010-10-28 06:38:34 +04:00			`DEBUG(DEBUG_INFO,(__location__ " Failed ctdb_request_call. Transport is DOWN\n"));`
Dont even try allocating and sending a CALL packet if the transport is down (This used to be ctdb commit cb8dd896914d4e44ad7b8bb000176a7c78f394ae) 2009-06-30 06:16:13 +04:00			`return;`
			`}`

Revert "ctdb-daemon: Check packet generation against database generation" This reverts commit 0ff90f4fac74e61192aff100b168e38ce0adfabb. BUG: https://bugzilla.samba.org/show_bug.cgi?id=11707 The checks against database generation are not required since the global generation is updated as part of updating vnnmap before the actual database recovery. This change was done in 5aab31a39a3589b910a78b96071d6aa5e6547696. Checking only against the database generation is incomplete. It can cause CTDB to abort if the following sequence of events happen. - CTDB gets REQ_DMASTER packet (gen1) This packet processing gets deferred to get a record lock - CTDB goes into recovery, marks RECOVERY_ACTIVE CTDB recovery helper updates vnnmap (gen2) - CTDB processes REQ_DMASTER packet (gen1) The check against database generation (gen1) succeeds. The check for lmaster is now invalid because VNNMAP has changed. This will cause CTDB to abort due to protocol error. Reverting the patch stops processing packets of older generation before they get into call processing. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Feb 9 12:39:24 CET 2016 on sn-devel-144 2016-02-02 07:58:37 +03:00
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`ctdb_db = find_ctdb_db(ctdb, c->db_id);`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`if (!ctdb_db) {`
Fix uninitialized variable warnings (This used to be ctdb commit b84f97adfd25b2fbfab1c7964b68931643e8029c) 2007-04-11 14:49:10 +04:00			`ctdb_send_error(ctdb, hdr, -1,`
			`"Unknown database in request. db_id==0x%08x",`
			`c->db_id);`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`return;`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`call = talloc(hdr, struct ctdb_call);`
			`CTDB_NO_MEMORY_FATAL(ctdb, call);`

			`call->call_id = c->callid;`
			`call->key.dptr = c->data;`
			`call->key.dsize = c->keylen;`
			`call->call_data.dptr = c->data + c->keylen;`
			`call->call_data.dsize = c->calldatalen;`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`call->reply_data.dptr = NULL;`
			`call->reply_data.dsize = 0;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
			`/* If this record is pinned down we should defer the`
			`request until the pindown times out`
			`*/`
ctdb-daemon: Add accessors for CTDB_DB_FLAGS_STICKY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:47:46 +03:00			`if (ctdb_db_sticky(ctdb_db)) {`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`if (ctdb_defer_pinned_down_request(ctdb, ctdb_db, call->key, hdr) == 0) {`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`DEBUG(DEBUG_WARNING,`
			`("Defer request for pinned down record in %s\n", ctdb_db->db_name));`
			`talloc_free(call);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`return;`
			`}`
			`}`

ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`if (dmaster_defer_add(ctdb_db, hdr, call->key) == 0) {`
			`talloc_free(call);`
			`return;`
			`}`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`/* determine if we are the dmaster for this key. This also`
			`fetches the record data (if any), thus avoiding a 2nd fetch of the data`
			`if the call will be answered locally */`
fix a bug in new structure handling (This used to be ctdb commit 5f248d82717c8094f260ea16292996bb712df947) 2007-01-29 14:11:16 +03:00
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`ret = ctdb_ltdb_lock_fetch_requeue(ctdb_db, call->key, &header, hdr, &data,`
server: Replace BOOL datatype with bool, True/False with true/false Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 6e5cbe8fff71985e5a2fc16b7e9f2b868011ff5d) 2012-05-17 10:08:37 +04:00			`ctdb_call_input_pkt, ctdb, false);`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00			`if (ret == -1) {`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`ctdb_send_error(ctdb, hdr, ret, "ltdb fetch failed in ctdb_request_call");`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`talloc_free(call);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`return;`
			`}`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00			`if (ret == -2) {`
added debug constants to allow for better mapping to syslog levels (This used to be ctdb commit 7ba8f1dde318eab03f4257e5a89fd23e7281e502) 2008-02-04 09:44:24 +03:00			`DEBUG(DEBUG_INFO,(__location__ " deferred ctdb_request_call\n"));`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`talloc_free(call);`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00			`return;`
			`}`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00
Fix various spelling errors Reviewed-by: Andrew Bartlett <abartlet@samba.org> Reviewed-by: Michael Adam <obnox@samba.org> Autobuild-User(master): Andrew Bartlett <abartlet@samba.org> Autobuild-Date(master): Fri Nov 6 13:43:45 CET 2015 on sn-devel-104 2015-07-27 00:02:57 +03:00			`/* Don't do READONLY if we don't have a tracking database */`
ctdb-daemon: Add accessors for CTDB_DB_FLAGS_READONLY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:44:48 +03:00			`if ((c->flags & CTDB_WANT_READONLY) && !ctdb_db_readonly(ctdb_db)) {`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`c->flags &= ~CTDB_WANT_READONLY;`
			`}`

			`if (header.flags & CTDB_REC_RO_REVOKE_COMPLETE) {`
ctdb_call: use CTDB_REC_RO_FLAGS where appropriate Signed-off-by: Michael Adam <obnox@samba.org> Reviewed-By: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f99eb2f56d8ca27110a45ae0e1c4bff40ac7a60e) 2013-04-19 18:22:49 +04:00			`header.flags &= ~CTDB_REC_RO_FLAGS;`
STATISTICS: add total counts for number of delegations and number of revokes Everytime we give a delegation to another node we count this as one delegation. If the same record is delegated to several nodes we count one for each node. Everytime a record has all its delegations revoked we count this as one revoke. (This used to be ctdb commit b098bcf8007be63889aaed640a951b0eeaa9d191) 2012-02-08 06:42:30 +04:00			`CTDB_INCREMENT_STAT(ctdb, total_ro_revokes);`
ReadOnly: add per-database statistics to view how much delegations/revokes we have (This used to be ctdb commit 751ed46197661eb841042ab6a02855a51dd0b17c) 2012-02-08 08:29:27 +04:00			`CTDB_INCREMENT_DB_STAT(ctdb_db, db_ro_revokes);`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`if (ctdb_ltdb_store(ctdb_db, call->key, &header, data) != 0) {`
			`ctdb_fatal(ctdb, "Failed to write header with cleared REVOKE flag");`
			`}`
ReadOnly: clear out the tracking record once a revoke is completed (This used to be ctdb commit 7af255551f058d1f6bfdd38ca603e7a19d1bb7ba) 2011-08-17 10:14:57 +04:00			`/* and clear out the tracking data */`
			`if (tdb_delete(ctdb_db->rottdb, call->key) != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Failed to clear out trackingdb record\n"));`
			`}`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`}`

			`/* if we are revoking, we must defer all other calls until the revoke`
			`* has completed.`
			`*/`
			`if (header.flags & CTDB_REC_RO_REVOKING_READONLY) {`
			`talloc_free(data.dptr);`
			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`

			`if (ctdb_add_revoke_deferred_call(ctdb, ctdb_db, call->key, hdr, ctdb_call_input_pkt, ctdb) != 0) {`
			`ctdb_fatal(ctdb, "Failed to add deferred call for revoke child");`
			`}`
			`talloc_free(call);`
			`return;`
			`}`

ctdbd: fix comment explaining redirection of CTDB_REQ_CALL redirection. Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit b697625b184227dad1be31a41b7a3fd9bd312e29) 2013-05-17 13:05:44 +04:00			`/*`
			`* If we are not the dmaster and are not hosting any delegations,`
			`* then we redirect the request to the node that can answer it`
			`* (the lmaster or the dmaster).`
			`*/`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`if ((header.dmaster != ctdb->pnn)`
			`&& (!(header.flags & CTDB_REC_RO_HAVE_DELEGATIONS)) ) {`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`talloc_free(data.dptr);`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`ctdb_call_send_redirect(ctdb, ctdb_db, call->key, c, &header);`
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`talloc_free(call);`
next step towards dmaster/lmaster code (This used to be ctdb commit 95e7be8d1aaafafb574c406fe778093606a28be8) 2006-12-18 06:05:49 +03:00			`return;`
			`}`

ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`if ( (!(c->flags & CTDB_WANT_READONLY))`
			`&& (header.flags & (CTDB_REC_RO_HAVE_DELEGATIONS|CTDB_REC_RO_HAVE_READONLY)) ) {`
			`header.flags |= CTDB_REC_RO_REVOKING_READONLY;`
			`if (ctdb_ltdb_store(ctdb_db, call->key, &header, data) != 0) {`
			`ctdb_fatal(ctdb, "Failed to store record with REVOKING_READONLY set");`
			`}`
			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`

			`if (ctdb_start_revoke_ro_record(ctdb, ctdb_db, call->key, &header, data) != 0) {`
			`ctdb_fatal(ctdb, "Failed to start record revoke");`
			`}`
			`talloc_free(data.dptr);`

			`if (ctdb_add_revoke_deferred_call(ctdb, ctdb_db, call->key, hdr, ctdb_call_input_pkt, ctdb) != 0) {`
			`ctdb_fatal(ctdb, "Failed to add deferred call for revoke child");`
			`}`
			`talloc_free(call);`

			`return;`
			`}`

			`/* If this is the first request for delegation, bump rsn and set`
			`* the delegations flag`
			`*/`
			`if ((c->flags & CTDB_WANT_READONLY)`
			`&& (c->callid == CTDB_FETCH_WITH_HEADER_FUNC)`
			`&& (!(header.flags & CTDB_REC_RO_HAVE_DELEGATIONS))) {`
			`header.rsn += 3;`
			`header.flags |= CTDB_REC_RO_HAVE_DELEGATIONS;`
			`if (ctdb_ltdb_store(ctdb_db, call->key, &header, data) != 0) {`
			`ctdb_fatal(ctdb, "Failed to store record with HAVE_DELEGATIONS set");`
			`}`
			`}`
			`if ((c->flags & CTDB_WANT_READONLY)`
ctdb-daemon: Avoid signed/unsigned comparison by casting Compiling with -Wsign-compare complains: 1047 | && (call->call_id == CTDB_FETCH_WITH_HEADER_FUNC)) { | ^~ struct ctdb_call is a protocol element, so we can't simply change it. Found by csbuild. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Aug 14 10:29:59 UTC 2019 on sn-devel-184 2019-08-01 03:58:42 +03:00			`&& ((unsigned int)call->call_id == CTDB_FETCH_WITH_HEADER_FUNC)) {`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`TDB_DATA tdata;`

			`tdata = tdb_fetch(ctdb_db->rottdb, call->key);`
			`if (ctdb_trackingdb_add_pnn(ctdb, &tdata, c->hdr.srcnode) != 0) {`
			`ctdb_fatal(ctdb, "Failed to add node to trackingdb");`
			`}`
			`if (tdb_store(ctdb_db->rottdb, call->key, tdata, TDB_REPLACE) != 0) {`
			`ctdb_fatal(ctdb, "Failed to store trackingdb data");`
			`}`
			`free(tdata.dptr);`

			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`

ctdb-daemon: Rename struct ctdb_reply_call to ctdb_reply_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:29:01 +03:00			`len = offsetof(struct ctdb_reply_call_old, data) + data.dsize + sizeof(struct ctdb_ltdb_header);`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REPLY_CALL, len,`
ctdb-daemon: Rename struct ctdb_reply_call to ctdb_reply_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:29:01 +03:00			`struct ctdb_reply_call_old);`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
			`r->hdr.destnode = c->hdr.srcnode;`
			`r->hdr.reqid = c->hdr.reqid;`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`r->hdr.generation = ctdb_db->generation;`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`r->status = 0;`
			`r->datalen = data.dsize + sizeof(struct ctdb_ltdb_header);`
			`header.rsn -= 2;`
			`header.flags |= CTDB_REC_RO_HAVE_READONLY;`
			`header.flags &= ~CTDB_REC_RO_HAVE_DELEGATIONS;`
			`memcpy(&r->data[0], &header, sizeof(struct ctdb_ltdb_header));`

			`if (data.dsize) {`
			`memcpy(&r->data[sizeof(struct ctdb_ltdb_header)], data.dptr, data.dsize);`
			`}`

			`ctdb_queue_packet(ctdb, &r->hdr);`
STATISTICS: add total counts for number of delegations and number of revokes Everytime we give a delegation to another node we count this as one delegation. If the same record is delegated to several nodes we count one for each node. Everytime a record has all its delegations revoked we count this as one revoke. (This used to be ctdb commit b098bcf8007be63889aaed640a951b0eeaa9d191) 2012-02-08 06:42:30 +04:00			`CTDB_INCREMENT_STAT(ctdb, total_ro_delegations);`
ReadOnly: add per-database statistics to view how much delegations/revokes we have (This used to be ctdb commit 751ed46197661eb841042ab6a02855a51dd0b17c) 2012-02-08 08:29:27 +04:00			`CTDB_INCREMENT_DB_STAT(ctdb_db, db_ro_delegations);`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00
			`talloc_free(r);`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`talloc_free(call);`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`return;`
			`}`

Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_UPDATE_STAT(ctdb, max_hop_count, c->hopcount);`
add max hop count buckets to see how bad hopcounts are (This used to be ctdb commit 7d3931298e6477d92f43652c3006b0c426cb1307) 2012-03-07 10:02:41 +04:00			`tmp_count = c->hopcount;`
			`/* bucket is the bit length of the hop count: 0 -> 0, 1 -> 1, 2-3 -> 2, 4-7 -> 3, ... */`
			`bucket = 0;`
			`while (tmp_count) {`
ctdb-daemon: Divide by 2 when calculating hop count bucket This provides finer resolution while still maintaining a reasonable maximum. In this case the top bucket contains any hop counts >= 16384, compared to the current situation where the top bucket contains hop counts >= 268435456. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2018-11-15 05:58:53 +03:00			`tmp_count >>= 1;`
add max hop count buckets to see how bad hopcounts are (This used to be ctdb commit 7d3931298e6477d92f43652c3006b0c426cb1307) 2012-03-07 10:02:41 +04:00			`bucket++;`
			`}`
STATISTICS: add per-db hop count statistics (This used to be ctdb commit 1c976d83b1d7dac6f0ef81306774998e4c8b56a1) 2012-03-20 05:08:12 +04:00			`if (bucket >= MAX_COUNT_BUCKETS) {`
			`bucket = MAX_COUNT_BUCKETS - 1;`
add max hop count buckets to see how bad hopcounts are (This used to be ctdb commit 7d3931298e6477d92f43652c3006b0c426cb1307) 2012-03-07 10:02:41 +04:00			`}`
			`CTDB_INCREMENT_STAT(ctdb, hop_count_bucket[bucket]);`
STATISTICS: add per-db hop count statistics (This used to be ctdb commit 1c976d83b1d7dac6f0ef81306774998e4c8b56a1) 2012-03-20 05:08:12 +04:00			`CTDB_INCREMENT_DB_STAT(ctdb_db, hop_count_bucket[bucket]);`
added a hopcount in ctdb_call (This used to be ctdb commit 36d838801a2a2008c50322cdbfff65a308b1cd1a) 2007-05-01 07:25:02 +04:00
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`/* If this database supports sticky records, then check if the`
			`hopcount is big. If it is, it means the record is hot and we`
			`should make it sticky.`
			`*/`
ctdb-daemon: Add accessors for CTDB_DB_FLAGS_STICKY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:47:46 +03:00			`if (ctdb_db_sticky(ctdb_db) &&`
			`c->hopcount >= ctdb->tunable.hopcount_make_sticky) {`
STICKY: add prototype code to make records stick to a node to "calm" down if they are found to be very hot and accessed by a lot of clients. This can improve performance and stop clients from having to chase a rapidly migrating/bouncing record (This used to be ctdb commit d0d98f7e45e5084b81335b004d50bddc80cdc219) 2012-03-20 09:58:35 +04:00			`ctdb_make_record_sticky(ctdb, ctdb_db, call->key);`
			`}`


Revert "LACOUNT: Add back lacount mechanism to defer migrating a fetched/read copy until after default of 20 consecutive requests from the same node" This reverts commit 035c0d981bde8c0eee8b3f24ba8e2dc817e5b504. This is a premature optimization. Record can bounce between nodes very quickly if it is a contended record. There is no need to hold a record on a node unnecessarily. In case record contention becomes bad, enabling sticky records on a database is a better idea. Conflicts: include/ctdb_private.h server/ctdb_tunables.c Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ac417b0003f0116f116834ad2ac51482d25cfa0d) 2013-08-19 09:04:46 +04:00			`/* Try if possible to migrate the record off to the caller node.`
			`* From the client's perspective a fetch of the data is just as`
			`* expensive as a migration.`
			`*/`
			`if (c->hdr.srcnode != ctdb->pnn) {`
ctdbd: Remove transaction code related to TRANS2 commits This removes data types and structure elements related to TRANS2 persistent transaction code. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 22a253b7ccf1ff854cddf0b67969dc84d7d6a654) 2013-09-12 10:43:43 +04:00			`if (ctdb_db->persistent_state) {`
call: lower the debug message "refusing migration while transction" to lvl INFO This gets just too noisy on a busy system. And it is purley informational anyways... Michael (This used to be ctdb commit 7f64a00c76203fdf6673c3f862a4bfd17fb848d7) 2009-12-09 15:43:38 +03:00			`DEBUG(DEBUG_INFO, (__location__ " refusing migration"`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`" of key %s while transaction is active\n",`
			`(char *)call->key.dptr));`
			`} else {`
Reducing the log level for a debug message DEBUG(DEBUG_DEBUG,("pnn %u starting migration of %08x t\ (This used to be ctdb commit 6ce4b21b00cce1530aff022584bf695c257a5d55) 2010-02-11 03:54:46 +03:00			`DEBUG(DEBUG_DEBUG,("pnn %u starting migration of %08x to %u\n",`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`ctdb->pnn, ctdb_hash(&(call->key)), c->hdr.srcnode));`
			`ctdb_call_send_dmaster(ctdb_db, c, &header, &(call->key), &data);`
			`talloc_free(data.dptr);`
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00
			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
Fix persistent transaction commit race condition. In ctdb_client.c:ctdb_transaction_commit(), after a failed TRANS2_COMMIT control call (for instance due to the 1-second being exceeded waiting for a busy node's reply), there is a 1-second gap between the transaction_cancel() and replay_transaction() calls in which there is no lock on the persistent db. And due to the lack of global state indicating that a transaction is in progress in ctdbd, other nodes may succeed to start transactions on the db in this gap and even worse work on top of the possibly already pushed changes. So the data diverges on the several nodes. This change fixes this by introducing global state for a transaction commit being active in the ctdb_db_context struct and in a db_id field in the client so that a client keeps track of _which_ tdb it as transaction commit running on. These data are set by ctdb upon entering the trans2_commit control and they are cleared in the trans2_error or trans2_finished controls. This makes it impossible to start a nother transaction or migrate a record to a different node while a transaction is active on a persistent tdb, including the retry loop. This approach is dead lock free and still allows recovery process to be started in the retry-gap between cancel and replay. Also note, that this solution does not require any change in the client side. This was debugged and developed together with Stefan Metzmacher <metze@samba.org> - thanks! Michael (This used to be ctdb commit f88103516e5ad723062fb95fcb07a128f1069d69) 2009-07-21 13:30:38 +04:00			`}`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`talloc_free(call);`
			`return;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`

Revert "LACOUNT: Add back lacount mechanism to defer migrating a fetched/read copy until after default of 20 consecutive requests from the same node" This reverts commit 035c0d981bde8c0eee8b3f24ba8e2dc817e5b504. This is a premature optimization. Record can bounce between nodes very quickly if it is a contended record. There is no need to hold a record on a node unnecessarily. In case record contention becomes bad, enabling sticky records on a database is a better idea. Conflicts: include/ctdb_private.h server/ctdb_tunables.c Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ac417b0003f0116f116834ad2ac51482d25cfa0d) 2013-08-19 09:04:46 +04:00			`ret = ctdb_call_local(ctdb_db, call, &header, hdr, &data, true);`
ReadOnly: Add an extra flag to ctdb_call_local to specify whether we want to write the record and header back to the tdb (for example we do when performing dmaster migrations) (This used to be ctdb commit b935e83255aeb3754b2fd37cf5611e02f7283514) 2011-07-20 07:30:12 +04:00			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_call_local failed\n"));`
			`call->status = -1;`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
add extra logging for failed ctdb_ltdb_unlock() for a few more places it is called from (This used to be ctdb commit 5c0fea90c6474a51992a9c4aeb6af7dfeb213ee0) 2010-06-09 08:31:05 +04:00			`ret = ctdb_ltdb_unlock(ctdb_db, call->key);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " ctdb_ltdb_unlock() failed with error %d\n", ret));`
			`}`
start using ctdb_ltdb_lock_fetch_requeue() (This used to be ctdb commit f89ab3a06b4677f56c92768c3a8ae5ec9f5abbc2) 2007-04-17 10:54:03 +04:00
ctdb-daemon: Rename struct ctdb_reply_call to ctdb_reply_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:29:01 +03:00			`len = offsetof(struct ctdb_reply_call_old, data) + call->reply_data.dsize;`
factor out the packet allocation code (This used to be ctdb commit 4cbaf22cc4bede8c95dd9b87e21bbe886307e23b) 2007-04-28 12:50:32 +04:00			`r = ctdb_transport_allocate(ctdb, ctdb, CTDB_REPLY_CALL, len,`
ctdb-daemon: Rename struct ctdb_reply_call to ctdb_reply_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:29:01 +03:00			`struct ctdb_reply_call_old);`
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`CTDB_NO_MEMORY_FATAL(ctdb, r);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`r->hdr.destnode = hdr->srcnode;`
			`r->hdr.reqid = hdr->reqid;`
ctdb-daemon: Use database generation in packet headers for database requests Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-15 09:50:19 +03:00			`r->hdr.generation = ctdb_db->generation;`
in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`r->status = call->status;`
			`r->datalen = call->reply_data.dsize;`
			`if (call->reply_data.dsize) {`
			`memcpy(&r->data[0], call->reply_data.dptr, call->reply_data.dsize);`
merge status code changes from samba4 ctdb (This used to be ctdb commit 705a9f8e5238976aa5c8cd4a5371459650d8b553) 2007-01-29 14:30:06 +03:00			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
wrap the packet queue call (This used to be ctdb commit 4dd8ffd5752aedcaf0b8ad1941a5f84ec1ca2868) 2006-12-18 08:26:57 +03:00			`ctdb_queue_packet(ctdb, &r->hdr);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
			`talloc_free(r);`
ctdbd: Make sure call data is freed if doing an early return This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69) 2013-07-23 10:00:15 +04:00			`talloc_free(call);`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`}`
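The reply-building code above sizes the packet as `offsetof(..., data)` plus the payload length, then copies the payload only when it is non-empty. A minimal standalone sketch of that pattern follows. The names `demo_hdr`, `demo_reply` and `demo_reply_build` are hypothetical stand-ins, not ctdb types, and plain `calloc()` is assumed in place of `ctdb_transport_allocate()`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-ins; the real ctdb structs carry more fields. */
struct demo_hdr {
	uint32_t length;   /* total packet length in bytes */
	uint32_t reqid;    /* copied from the request so the sender can match it */
};

struct demo_reply {
	struct demo_hdr hdr;
	uint32_t status;
	uint32_t datalen;
	uint8_t data[];    /* datalen bytes of reply payload */
};

/* Build a reply for request `reqid`; returns NULL on allocation failure. */
static struct demo_reply *demo_reply_build(uint32_t reqid, uint32_t status,
					   const void *payload,
					   uint32_t payload_len)
{
	/* header and fixed fields, plus exactly payload_len bytes of data */
	size_t len = offsetof(struct demo_reply, data) + payload_len;
	struct demo_reply *r;

	/* a NULL payload is only acceptable when there is nothing to copy */
	assert(payload != NULL || payload_len == 0);

	r = calloc(1, len);
	if (r == NULL) {
		return NULL;
	}
	r->hdr.length = (uint32_t)len;
	r->hdr.reqid = reqid;
	r->status = status;
	r->datalen = payload_len;
	if (payload_len != 0) {
		memcpy(r->data, payload, payload_len);
	}
	return r;
}
```

The caller owns the returned buffer and releases it with `free()`, which mirrors the `talloc_free(r)` after the packet has been queued.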

server: standardize formatting of comment block for ctdb_reply_dmaster() while I'm at it.. Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 00d3bf092e2f72eda330978c75ec85f17e870553) 2013-08-19 19:07:19 +04:00			`/**`
			`* called when a CTDB_REPLY_CALL packet comes in`
			`*`
			`* This packet comes in response to a CTDB_REQ_CALL request packet. It`
			`* contains any reply data from the call`
			`*/`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`void ctdb_reply_call(struct ctdb_context *ctdb, struct ctdb_req_header *hdr)`
			`{`
ctdb-daemon: Rename struct ctdb_reply_call to ctdb_reply_call_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:29:01 +03:00			`struct ctdb_reply_call_old *c = (struct ctdb_reply_call_old *)hdr;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`struct ctdb_call_state *state;`

ctdb-daemon: Use reqid abstraction Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-03-17 06:30:18 +03:00			`state = reqid_find(ctdb->idr, hdr->reqid, struct ctdb_call_state);`
Some more debug and two memleak fixes (This used to be ctdb commit 1e2802422794956827263265306952df5e69b377) 2007-04-18 01:03:30 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, (__location__ " reqid %u not found\n", hdr->reqid));`
Some more debug and two memleak fixes (This used to be ctdb commit 1e2802422794956827263265306952df5e69b377) 2007-04-18 01:03:30 +04:00			`return;`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`if (hdr->reqid != state->reqid) {`
			`/* we found a record but it was the wrong one */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, ("Dropped orphaned call reply with reqid:%u\n",hdr->reqid));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`

ReadOnly: Dont update the record header from the calling client. While it is convenient since it avoids having to create a child process from the main dameon for writing the updated record it makes the cleitn more complex. Remove the code in the example client code that writes the record to the local tdb. Add code to the local ctdbd processing of replies to check if this reply contain a ro delegation and if so, spawn a child process to lock the tdb and then write the data. (This used to be ctdb commit bf1d429227dc4f5818263cc39401d0a22663cdba) 2011-10-24 06:14:26 +04:00
			`/* read only delegation processing */`
			`/* If we got a FETCH_WITH_HEADER we should check if this is a ro`
			`* delegation since we may need to update the record header`
			`*/`
			`if (state->c->callid == CTDB_FETCH_WITH_HEADER_FUNC) {`
			`struct ctdb_db_context *ctdb_db = state->ctdb_db;`
			`struct ctdb_ltdb_header *header = (struct ctdb_ltdb_header *)&c->data[0];`
			`struct ctdb_ltdb_header oldheader;`
			`TDB_DATA key, data, olddata;`
			`int ret;`

			`if (!(header->flags & CTDB_REC_RO_HAVE_READONLY)) {`
			`goto finished_ro;`
			`return;`
			`}`

			`key.dsize = state->c->keylen;`
			`key.dptr = state->c->data;`
			`ret = ctdb_ltdb_lock_requeue(ctdb_db, key, hdr,`
server: Replace BOOL datatype with bool, True/False with true/false Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 6e5cbe8fff71985e5a2fc16b7e9f2b868011ff5d) 2012-05-17 10:08:37 +04:00			`ctdb_call_input_pkt, ctdb, false);`
ReadOnly: Dont update the record header from the calling client. While it is convenient since it avoids having to create a child process from the main dameon for writing the updated record it makes the cleitn more complex. Remove the code in the example client code that writes the record to the local tdb. Add code to the local ctdbd processing of replies to check if this reply contain a ro delegation and if so, spawn a child process to lock the tdb and then write the data. (This used to be ctdb commit bf1d429227dc4f5818263cc39401d0a22663cdba) 2011-10-24 06:14:26 +04:00			`if (ret == -2) {`
			`return;`
			`}`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,(__location__ " Failed to get lock in ctdb_reply_call\n"));`
			`return;`
			`}`

			`ret = ctdb_ltdb_fetch(ctdb_db, key, &oldheader, state, &olddata);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR, ("Failed to fetch old record in ctdb_reply_call\n"));`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`goto finished_ro;`
			`}`

			`if (header->rsn <= oldheader.rsn) {`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`goto finished_ro;`
			`}`

ReadOnly: fix bug writing incorrect amount of data in delegated record Fix bug when ctdbd updates the local copy of a delegated record to write the correct amount of data to the record. (This used to be ctdb commit 8814d8bc159a5e368afaa236ac7d865165db04b2) 2011-10-28 04:44:19 +04:00			`if (c->datalen < sizeof(struct ctdb_ltdb_header)) {`
			`DEBUG(DEBUG_ERR,(__location__ " Got FETCH_WITH_HEADER reply with too little data: %d bytes\n", c->datalen));`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`goto finished_ro;`
			`}`

			`data.dsize = c->datalen - sizeof(struct ctdb_ltdb_header);`
ReadOnly: Dont update the record header from the calling client. While it is convenient since it avoids having to create a child process from the main dameon for writing the updated record it makes the cleitn more complex. Remove the code in the example client code that writes the record to the local tdb. Add code to the local ctdbd processing of replies to check if this reply contain a ro delegation and if so, spawn a child process to lock the tdb and then write the data. (This used to be ctdb commit bf1d429227dc4f5818263cc39401d0a22663cdba) 2011-10-24 06:14:26 +04:00			`data.dptr = &c->data[sizeof(struct ctdb_ltdb_header)];`
			`ret = ctdb_ltdb_store(ctdb_db, key, header, data);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR, ("Failed to store new record in ctdb_reply_call\n"));`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`goto finished_ro;`
			`}`

			`ctdb_ltdb_unlock(ctdb_db, key);`
			`}`
			`finished_ro:`

in ctdb_call_local() we can not talloc_steal() the returned data and hang it off ctdb. This can cause a memory leak if the call is terminated before we have managed to respond to the client. (and the call is talloc_free()d but the data is still hanging off ctdb) instead we must talloc_steal() the data and hang it off the call structure to avoid the memory leak. In order to do this we must also change the call structure that is passed into ctdb_call_local() to be allocated through talloc(). This structure was previously either a static variable, or an element of a larger talloc()ed structure (ctdb_call_state or ctdb_client_call_state) so we must change all creations of a ctdb_call into explicitely creating it through talloc() (This used to be ctdb commit 4becf32aea088a25686e8bc330eb47d85ae0ef8f) 2008-03-19 05:54:17 +03:00			`state->call->reply_data.dptr = c->data;`
			`state->call->reply_data.dsize = c->datalen;`
			`state->call->status = c->status;`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00
			`talloc_steal(state, c);`

			`state->state = CTDB_CALL_DONE;`
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
- added ctdb_set_flags() call - added --self-connect option to ctdb_test, allowing testing when a node connects to itself. not as efficient as local bypass, but very useful for testing purposes (easier to work with 1 task in gdb than 2) - split the ctdb_call() into an async triple, in the style of Samba4 async functions. So we now have ctdb_call_send(), ctdb_call_recv() and ctdb_call(). - added the main ctdb_call protocol logic. No error checking yet, but seems to work for simple cases - ensure we initialise the length argument to getsockopt() (This used to be ctdb commit 95fad717ef5ab93be3603aa11d2878876fe868d3) 2006-12-01 07:45:24 +03:00			`}`

fixed the reverse of the last bug - handle the case when the new dmaster is the lmaster (This used to be ctdb commit b2599834d2ace7369a1b36f85fdf6eb62f047e30) 2007-04-22 20:19:49 +04:00
server: standardize formatting of comment block for ctdb_reply_dmaster() while I'm at it.. This was the comment block I was touching and meant to adapt in commit 00d3bf092e2f72eda330978c75ec85f17e870553. My search was apparently not unique... Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 09940255011b119dc6af3304f5d3e9568e6006fd) 2013-08-22 18:17:09 +04:00			`/**`
			`* called when a CTDB_REPLY_DMASTER packet comes in`
			`*`
			`* This packet comes in from the lmaster in response to a CTDB_REQ_CALL`
			`* request packet. It means that the current dmaster wants to give us`
			`* the dmaster role.`
			`*/`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`void ctdb_reply_dmaster(struct ctdb_context *ctdb, struct ctdb_req_header *hdr)`
			`{`
ctdb-daemon: Rename struct ctdb_reply_dmaster to ctdb_reply_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:34:01 +03:00			`struct ctdb_reply_dmaster_old *c = (struct ctdb_reply_dmaster_old *)hdr;`
Split CTDB into sub contexts to handle multiple concurrent databases within the same context. (This used to be ctdb commit d995103143f6f13f59118549d93ab4b29c27ec89) 2007-04-03 13:41:00 +04:00			`struct ctdb_db_context *ctdb_db;`
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`TDB_DATA key, data;`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`uint32_t record_flags = 0;`
			`size_t len;`
- split out ctdb_ltdb_lock_fetch_requeue() into a simpler ctdb_ltdb_lock_requeue() and a small wrapper - use ctdb_ltdb_lock_requeue() to fix a possible hang in ctdb_reply_dmaster(), where the ctdb_ltdb_store() could hang waiting for a client. We now requeue the reply_dmaster packet until we have the lock (This used to be ctdb commit 97cd7aa09ce3abbb5e3e965c5c81668e0c0133a5) 2007-04-19 11:43:27 +04:00			`int ret;`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`ctdb_db = find_ctdb_db(ctdb, c->db_id);`
			`if (ctdb_db == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("Unknown db_id 0x%x in ctdb_reply_dmaster\n", c->db_id));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`
Revert "ctdb-daemon: Check packet generation against database generation" This reverts commit 0ff90f4fac74e61192aff100b168e38ce0adfabb. BUG: https://bugzilla.samba.org/show_bug.cgi?id=11707 The checks against database generation are not required since the global generation is updated as part of updating vnnmap before the actual database recovery. This change was done in 5aab31a39a3589b910a78b96071d6aa5e6547696. Checking only against the database generation is incomplete. It can cause CTDB to abort if the following sequence of events happen. - CTDB gets REQ_DMASTER packet (gen1) This packet processing gets deferred to get a record lock - CTDB goes into recovery, marks RECOVERY_ACTIVE CTDB recovery helper updates vnnmap (gen2) - CTDB processes REQ_DMASTER packet (gen1) The check against database generation (gen1) succeeds. The check for lmaster is now invalid because VNNMAP has changed. This will cause CTDB to abort due to protocol error. Reverting the patch stops processing packets of older generation before they get into call processing. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Signed-off-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Feb 9 12:39:24 CET 2016 on sn-devel-144 2016-02-02 07:58:37 +03:00
yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`key.dptr = c->data;`
			`key.dsize = c->keylen;`
			`data.dptr = &c->data[key.dsize];`
			`data.dsize = c->datalen;`
ctdb-daemon: Rename struct ctdb_reply_dmaster to ctdb_reply_dmaster_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:34:01 +03:00			`len = offsetof(struct ctdb_reply_dmaster_old, data) + key.dsize + data.dsize`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`+ sizeof(uint32_t);`
			`if (len <= c->hdr.length) {`
ctdb-daemon: Fix some strict-aliasing warnings Seeing these with -Wall: ../server/ctdb_call.c:1117:3: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] record_flags = *(uint32_t *)&c->data[c->keylen + c->datalen]; ^ memcpy() seems to be the easiest way to get fix these. The alternative would be to use unmarshalling functions. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2014-08-04 08:50:17 +04:00			`memcpy(&record_flags, &c->data[c->keylen + c->datalen],`
			`sizeof(record_flags));`
call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`}`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00
ctdb-daemon: Defer all calls when processing dmaster packets When CTDB receives DMASTER_REQUEST or DMASTER_REPLY packet, the specified record needs to be updated as soon as possible to avoid inconsistent dmaster information between nodes. During this time, queue up all calls for that record and process them only after dmaster request/reply has been processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-15 09:20:36 +04:00			`dmaster_defer_setup(ctdb_db, hdr, key);`

yay! finally fixed the bug that volker, ronnie and I have been chasing for 2 days. The main bug was in smbd, but there was a secondary (and more subtle) bug in ctdb that the bug in smbd exposed. When we get send a dmaster reply, we have to correctly update the dmaster in the recipient even if the original requst has timed out, otherwise ctdbd can get into a loop fighting over who will handle a key. This patch also cleans up the packet allocation, and makes ctdbd become a real daemon. (This used to be ctdb commit 59405e59ef522b97d8e20e4b14310a217141ac7c) 2007-04-29 18:19:40 +04:00			`ret = ctdb_ltdb_lock_requeue(ctdb_db, key, hdr,`
server: Replace BOOL datatype with bool, True/False with true/false Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 6e5cbe8fff71985e5a2fc16b7e9f2b868011ff5d) 2012-05-17 10:08:37 +04:00			`ctdb_call_input_pkt, ctdb, false);`
- split out ctdb_ltdb_lock_fetch_requeue() into a simpler ctdb_ltdb_lock_requeue() and a small wrapper - use ctdb_ltdb_lock_requeue() to fix a possible hang in ctdb_reply_dmaster(), where the ctdb_ltdb_store() could hang waiting for a client. We now requeue the reply_dmaster packet until we have the lock (This used to be ctdb commit 97cd7aa09ce3abbb5e3e965c5c81668e0c0133a5) 2007-04-19 11:43:27 +04:00			`if (ret == -2) {`
			`return;`
			`}`
			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to get lock in ctdb_reply_dmaster\n"));`
- split out ctdb_ltdb_lock_fetch_requeue() into a simpler ctdb_ltdb_lock_requeue() and a small wrapper - use ctdb_ltdb_lock_requeue() to fix a possible hang in ctdb_reply_dmaster(), where the ctdb_ltdb_store() could hang waiting for a client. We now requeue the reply_dmaster packet until we have the lock (This used to be ctdb commit 97cd7aa09ce3abbb5e3e965c5c81668e0c0133a5) 2007-04-19 11:43:27 +04:00			`return;`
			`}`

call: transfer the record flags in the ctdb call packets. This way, the MIGRATED_WITH_DATA information can be transported along with the records. This is important for vacuuming to function properly. The record flags are appended to the data section of the ctdb_req_dmaster and ctdb_reply_dmaster structs. Pair-Programmed-With: Stefan Metzmacher <metze@samba.org> (This used to be ctdb commit 945187d64cfc7bd30a0c3b0d548cbe582d95dde3) 2010-12-10 16:02:33 +03:00			`ctdb_become_dmaster(ctdb_db, hdr, key, data, c->rsn, record_flags);`
added request_dmaster and reply_dmaster logic ctdb will now move the dmaster role between nodes after CTDB_MAX_LACOUNT consecutive accesses by the same node. (This used to be ctdb commit af87f587d8f70192ecac0125054bf9583a4849a7) 2006-12-18 08:01:11 +03:00			`}`
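In `ctdb_reply_dmaster()` the record flags are an optional trailing `uint32_t` after the key and data. They are read only when the packet is long enough, and `memcpy()` is used so the read does not break strict-aliasing rules. The sketch below reproduces that layout check in isolation. `demo_dmaster`, `demo_dmaster_build` and `demo_record_flags` are hypothetical names for illustration, and the struct is a cut-down assumption rather than the real `ctdb_reply_dmaster_old`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical cut-down packet: key, then data, then optional flags. */
struct demo_dmaster {
	uint32_t length;   /* total packet length in bytes */
	uint32_t keylen;
	uint32_t datalen;
	uint8_t data[];    /* keylen + datalen bytes, maybe 4 more for flags */
};

/* Build a packet; pass flags == NULL to emulate a sender without flags. */
static struct demo_dmaster *demo_dmaster_build(const void *key, uint32_t keylen,
					       const void *data, uint32_t datalen,
					       const uint32_t *flags)
{
	size_t len = offsetof(struct demo_dmaster, data) + keylen + datalen +
		     (flags != NULL ? sizeof(*flags) : 0);
	struct demo_dmaster *c;

	assert(key != NULL && data != NULL);

	c = calloc(1, len);
	if (c == NULL) {
		return NULL;
	}
	c->length = (uint32_t)len;
	c->keylen = keylen;
	c->datalen = datalen;
	memcpy(c->data, key, keylen);
	memcpy(&c->data[keylen], data, datalen);
	if (flags != NULL) {
		memcpy(&c->data[keylen + datalen], flags, sizeof(*flags));
	}
	return c;
}

/* Return the trailing flags, or 0 when the packet is too short for them. */
static uint32_t demo_record_flags(const struct demo_dmaster *c)
{
	uint32_t flags = 0;
	size_t len = offsetof(struct demo_dmaster, data) +
		     (size_t)c->keylen + c->datalen + sizeof(uint32_t);

	if (len <= c->length) {
		/* memcpy avoids dereferencing a type-punned pointer */
		memcpy(&flags, &c->data[(size_t)c->keylen + c->datalen],
		       sizeof(flags));
	}
	return flags;
}
```

Defaulting to 0 when the trailer is absent keeps the receiver compatible with senders that do not append flags, which appears to be the intent of the `len <= c->hdr.length` test in the original.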

added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
			`/*`
			`called when a CTDB_REPLY_ERROR packet comes in`
			`*/`
			`void ctdb_reply_error(struct ctdb_context *ctdb, struct ctdb_req_header *hdr)`
			`{`
ctdb-daemon: Rename struct ctdb_reply_error to ctdb_reply_error_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:30:31 +03:00			`struct ctdb_reply_error_old *c = (struct ctdb_reply_error_old *)hdr;`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`struct ctdb_call_state *state;`

ctdb-daemon: Use reqid abstraction Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-03-17 06:30:18 +03:00			`state = reqid_find(ctdb->idr, hdr->reqid, struct ctdb_call_state);`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`if (state == NULL) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,("pnn %u Invalid reqid %u in ctdb_reply_error\n",`
change ctdb->vnn to ctdb->pnn (This used to be ctdb commit 8c776e5707e503ec6586aae39ac6b3ea5a2fd2bc) 2007-09-04 04:06:36 +04:00			`ctdb->pnn, hdr->reqid));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`

			`if (hdr->reqid != state->reqid) {`
			`/* we found a record but it was the wrong one */`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR, ("Dropped orphaned error reply with reqid:%u\n",hdr->reqid));`
split the 32bit idr field into two. store the idr as the high 16 bits and use a rotating counter for the low 16 bits. (This used to be ctdb commit 7c763b7b5e6ca54a6df4586893ddaf1b508b4c22) 2007-04-23 12:19:50 +04:00			`return;`
			`}`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00
			`talloc_steal(state, c);`

			`state->state = CTDB_CALL_ERROR;`
			`state->errmsg = (char *)c->msg;`
initial support for two new pdus for the domain socket to do fetch_lock no locking is yet done and the store_unlock call is still missing the ./tests/fetch.sh --daemon test fails with parent process dying which needs to be investigated. (This used to be ctdb commit 7d7141c968950a8856f1be79871932b688bfb07f) 2007-04-12 09:46:50 +04:00			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
added error reply packets (This used to be ctdb commit 49ee165808985ce0fa174dd6e05292871d3f3130) 2006-12-18 06:27:20 +03:00			`}`
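Both `ctdb_reply_call()` and `ctdb_reply_error()` look up the state by `reqid` and then compare `hdr->reqid` with `state->reqid`, dropping the packet as orphaned on a mismatch. The blame note on that check describes the scheme: the slot index sits in the high 16 bits and a rotating counter in the low 16 bits. The sketch below illustrates why the second comparison is needed under that scheme. It is an assumption-laden toy: `demo_table`, `demo_reqid_new`, `demo_reqid_find`, `demo_reqid_remove` and `demo_reply_matches` are hypothetical and do not reflect the real `reqid_*` API.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define DEMO_SLOTS 4

struct demo_state {
	uint32_t reqid;    /* full id: slot index << 16 | counter */
	int in_use;
};

struct demo_table {
	struct demo_state slots[DEMO_SLOTS];
	uint16_t counter;  /* rotating counter, wraps at 65536 */
};

/* Allocate a slot and compose a reqid; returns 0 on success, -1 if full. */
static int demo_reqid_new(struct demo_table *t, uint32_t *reqid)
{
	assert(t != NULL && reqid != NULL);

	for (uint32_t i = 0; i < DEMO_SLOTS; i++) {
		if (!t->slots[i].in_use) {
			t->counter++;
			t->slots[i].in_use = 1;
			t->slots[i].reqid = (i << 16) | t->counter;
			*reqid = t->slots[i].reqid;
			return 0;
		}
	}
	return -1;
}

/* Look up by slot index only, so a stale reqid can hit a reused slot. */
static struct demo_state *demo_reqid_find(struct demo_table *t, uint32_t reqid)
{
	uint32_t slot = reqid >> 16;

	if (slot >= DEMO_SLOTS || !t->slots[slot].in_use) {
		return NULL;
	}
	return &t->slots[slot];
}

static void demo_reqid_remove(struct demo_table *t, uint32_t reqid)
{
	struct demo_state *s = demo_reqid_find(t, reqid);

	if (s != NULL && s->reqid == reqid) {
		s->in_use = 0;
	}
}

/* A reply is accepted only if the slot exists and the full reqid matches. */
static int demo_reply_matches(struct demo_table *t, uint32_t reqid)
{
	struct demo_state *s = demo_reqid_find(t, reqid);

	return s != NULL && s->reqid == reqid;
}
```

With this layout, a late reply for a finished request can still find a live state in the reused slot, and only the full-value comparison identifies it as orphaned.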

added redirect handling (This used to be ctdb commit 3c1dc8b98c8e843c44a172ac15e67f4ab8c47500) 2006-12-18 06:44:06 +03:00
			`/*`
			`destroy a ctdb_call`
			`*/`
			`static int ctdb_call_destructor(struct ctdb_call_state *state)`
			`{`
			`DLIST_REMOVE(state->ctdb_db->pending_calls, state);`
			`reqid_remove(state->ctdb_db->ctdb->idr, state->reqid);`
			`return 0;`
			`}`

			`/*`
			`called when a ctdb_call needs to be resent after a reconfigure event`
			`*/`
			`static void ctdb_call_resend(struct ctdb_call_state *state)`
			`{`
			`struct ctdb_context *ctdb = state->ctdb_db->ctdb;`

			`state->generation = state->ctdb_db->generation;`

			`/* use a new reqid, in case the old reply does eventually come in */`
			`reqid_remove(ctdb->idr, state->reqid);`
			`state->reqid = reqid_new(ctdb->idr, state);`
			`state->c->hdr.reqid = state->reqid;`

			`/* update the generation count for this request, so it's valid with the new vnn_map */`
			`state->c->hdr.generation = state->generation;`

			`/* send the packet to ourselves, it will be redirected appropriately */`
			`state->c->hdr.destnode = ctdb->pnn;`

			`ctdb_queue_packet(ctdb, &state->c->hdr);`
			`DEBUG(DEBUG_NOTICE,("resent ctdb_call for db %s reqid %u generation %u\n",`
			`state->ctdb_db->db_name, state->reqid, state->generation));`
			`}`

			`/*`
			`resend all pending calls on recovery`
			`*/`
			`void ctdb_call_resend_db(struct ctdb_db_context *ctdb_db)`
			`{`
			`struct ctdb_call_state *state, *next;`

			`for (state = ctdb_db->pending_calls; state; state = next) {`
			`next = state->next;`
			`ctdb_call_resend(state);`
			`}`
			`}`

			`void ctdb_call_resend_all(struct ctdb_context *ctdb)`
			`{`
			`struct ctdb_db_context *ctdb_db;`

			`for (ctdb_db = ctdb->db_list; ctdb_db; ctdb_db = ctdb_db->next) {`
			`ctdb_call_resend_db(ctdb_db);`
			`}`
			`}`

			`/*`
			`this allows the caller to set up an async.fn`
			`*/`
			`static void call_local_trigger(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct ctdb_call_state *state = talloc_get_type(private_data, struct ctdb_call_state);`
			`if (state->async.fn) {`
			`state->async.fn(state);`
			`}`
			`}`


			`/*`
			`construct an event driven local ctdb_call`

			`this is used so that locally processed ctdb_call requests are processed`
			`in an event driven manner`
			`*/`
			`struct ctdb_call_state *ctdb_call_local_send(struct ctdb_db_context *ctdb_db,`
			`struct ctdb_call *call,`
			`struct ctdb_ltdb_header *header,`
			`TDB_DATA *data)`
			`{`
			`struct ctdb_call_state *state;`
			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
			`int ret;`

			`state = talloc_zero(ctdb_db, struct ctdb_call_state);`
			`CTDB_NO_MEMORY_NULL(ctdb, state);`

			`talloc_steal(state, data->dptr);`

			`state->state = CTDB_CALL_DONE;`
			`state->call = talloc(state, struct ctdb_call);`
			`CTDB_NO_MEMORY_NULL(ctdb, state->call);`
			`*(state->call) = *call;`
			`state->ctdb_db = ctdb_db;`

			`ret = ctdb_call_local(ctdb_db, state->call, header, state, data, true);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_DEBUG,("ctdb_call_local() failed, ignoring return code %d\n", ret));`
			`}`

			`tevent_add_timer(ctdb->ev, state, timeval_zero(),`
			`call_local_trigger, state);`

			`return state;`
			`}`


			`/*`
			`make a remote ctdb call - async send. Called in daemon context.`

			`This constructs a ctdb_call request and queues it for processing.`
			`This call never blocks.`
			`*/`
			`struct ctdb_call_state *ctdb_daemon_call_send_remote(struct ctdb_db_context *ctdb_db,`
			`struct ctdb_call *call,`
			`struct ctdb_ltdb_header *header)`
			`{`
			`uint32_t len;`
			`struct ctdb_call_state *state;`
			`struct ctdb_context *ctdb = ctdb_db->ctdb;`
			`struct ctdb_req_call_old *c;`

			`if (ctdb->methods == NULL) {`
			`DEBUG(DEBUG_INFO,(__location__ " Failed send packet. Transport is down\n"));`
			`return NULL;`
			`}`

			`state = talloc_zero(ctdb_db, struct ctdb_call_state);`
			`CTDB_NO_MEMORY_NULL(ctdb, state);`
			`state->call = talloc(state, struct ctdb_call);`
			`CTDB_NO_MEMORY_NULL(ctdb, state->call);`

			`state->reqid = reqid_new(ctdb->idr, state);`
			`state->ctdb_db = ctdb_db;`
			`state->state = CTDB_CALL_WAIT;`
			`state->generation = ctdb_db->generation;`

			`len = offsetof(struct ctdb_req_call_old, data) + call->key.dsize +`
			`call->call_data.dsize;`

			`c = ctdb_transport_allocate(ctdb,`
			`state,`
			`CTDB_REQ_CALL,`
			`len,`
			`struct ctdb_req_call_old);`

			`CTDB_NO_MEMORY_NULL(ctdb, c);`
			`state->c = c;`

			`c->hdr.destnode = header->dmaster;`
			`c->hdr.reqid = state->reqid;`
			`c->hdr.generation = ctdb_db->generation;`
			`c->flags = call->flags;`
			`c->db_id = ctdb_db->db_id;`
			`c->callid = call->call_id;`
			`c->hopcount = 0;`
			`c->keylen = call->key.dsize;`
			`c->calldatalen = call->call_data.dsize;`

			`memcpy(&c->data[0], call->key.dptr, call->key.dsize);`
			`memcpy(&c->data[call->key.dsize],`
			`call->call_data.dptr,`
			`call->call_data.dsize);`

			`*(state->call) = *call;`
			`state->call->call_data.dptr = &c->data[call->key.dsize];`
			`state->call->key.dptr = &c->data[0];`

			`DLIST_ADD(ctdb_db->pending_calls, state);`

			`talloc_set_destructor(state, ctdb_call_destructor);`
			`ctdb_queue_packet(ctdb, &state->c->hdr);`

			`return state;`
			`}`

			`/*`
			`make a remote ctdb call - async recv - called in daemon context`

			`This is called when the program wants to wait for a ctdb_call to complete and get the`
			`results. This call will block unless the call has already completed.`
			`*/`
			`int ctdb_daemon_call_recv(struct ctdb_call_state *state, struct ctdb_call *call)`
			`{`
			`while (state->state < CTDB_CALL_DONE) {`
			`tevent_loop_once(state->ctdb_db->ctdb->ev);`
			`}`
			`if (state->state != CTDB_CALL_DONE) {`
			`ctdb_set_error(state->ctdb_db->ctdb, "%s", state->errmsg);`
			`talloc_free(state);`
			`return -1;`
			`}`

			`if (state->call->reply_data.dsize) {`
			`call->reply_data.dptr = talloc_memdup(call,`
			`state->call->reply_data.dptr,`
			`state->call->reply_data.dsize);`
			`call->reply_data.dsize = state->call->reply_data.dsize;`
			`} else {`
			`call->reply_data.dptr = NULL;`
			`call->reply_data.dsize = 0;`
			`}`
			`call->status = state->call->status;`
			`talloc_free(state);`
			`return 0;`
			`}`
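
The blocking receive above follows a common pattern: spin the event loop until the request leaves its waiting state, fail if it ended in anything other than "done", otherwise copy the reply into memory owned by the caller and release the request state. The following is a minimal sketch of that pattern in plain C, not the ctdb implementation. Every `demo_*` name is hypothetical, `demo_loop_once()` stands in for `tevent_loop_once()`, and `malloc`/`memcpy`/`free` stand in for `talloc_memdup()` and `talloc_free()`.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-ins; none of these names exist in ctdb. */
enum demo_call_status { DEMO_CALL_WAIT, DEMO_CALL_DONE, DEMO_CALL_ERROR };

struct demo_reply {
	unsigned char *dptr;
	size_t dsize;
};

struct demo_call_state {
	enum demo_call_status state;
	struct demo_reply reply;  /* owned by the state until copied out */
	int steps_left;           /* simulated loop iterations before completion */
};

/* Create a pending call whose reply becomes available after `steps` iterations. */
static struct demo_call_state *demo_call_new(const char *payload, int steps)
{
	struct demo_call_state *st = calloc(1, sizeof(*st));
	if (st == NULL) {
		return NULL;
	}
	st->state = DEMO_CALL_WAIT;
	st->steps_left = steps;
	st->reply.dsize = strlen(payload);
	if (st->reply.dsize != 0) {
		st->reply.dptr = malloc(st->reply.dsize);
		if (st->reply.dptr == NULL) {
			free(st);
			return NULL;
		}
		memcpy(st->reply.dptr, payload, st->reply.dsize);
	}
	return st;
}

static void demo_call_free(struct demo_call_state *st)
{
	free(st->reply.dptr);
	free(st);
}

/* Stand-in for one event-loop iteration. */
static void demo_loop_once(struct demo_call_state *st)
{
	if (st->steps_left > 0) {
		st->steps_left--;
	}
	if (st->steps_left == 0 && st->state == DEMO_CALL_WAIT) {
		st->state = DEMO_CALL_DONE;
	}
}

/*
 * Block until the call completes, copy the reply into caller-owned
 * memory, and release the state. Returns 0 on success, -1 on failure.
 * The state is consumed in both cases.
 */
static int demo_call_recv(struct demo_call_state *st, struct demo_reply *out)
{
	while (st->state < DEMO_CALL_DONE) {
		demo_loop_once(st);
	}
	if (st->state != DEMO_CALL_DONE) {
		demo_call_free(st);
		return -1;
	}

	out->dptr = NULL;
	out->dsize = 0;
	if (st->reply.dsize != 0) {
		out->dptr = malloc(st->reply.dsize);
		if (out->dptr == NULL) {
			demo_call_free(st);
			return -1;
		}
		memcpy(out->dptr, st->reply.dptr, st->reply.dsize);
		out->dsize = st->reply.dsize;
	}
	demo_call_free(st);
	return 0;
}
```

A caller would create a state with `demo_call_new("pong", 3)`, pass it to `demo_call_recv()`, and later `free()` the returned `dptr`. Copying before freeing the state is what lets the reply outlive the request.
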

			`struct revokechild_deferred_call {`
			`struct revokechild_deferred_call *prev, *next;`
			`struct ctdb_context *ctdb;`
			`struct ctdb_req_header *hdr;`
			`deferred_requeue_fn fn;`
			`void *ctx;`
			`struct revokechild_handle *rev_hdl;`
			`};`

			`struct revokechild_handle {`
			`struct revokechild_handle *next, *prev;`
			`struct ctdb_context *ctdb;`
			`struct ctdb_db_context *ctdb_db;`
			`struct tevent_fd *fde;`
			`int status;`
			`int fd[2];`
			`pid_t child;`
			`TDB_DATA key;`
			`struct revokechild_deferred_call *deferred_call_list;`
			`};`

			`static void deferred_call_requeue(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval t, void *private_data)`
			`{`
			`struct revokechild_deferred_call *dlist = talloc_get_type_abort(`
			`private_data, struct revokechild_deferred_call);`

			`while (dlist != NULL) {`
			`struct revokechild_deferred_call *dcall = dlist;`

			`talloc_set_destructor(dcall, NULL);`
			`DLIST_REMOVE(dlist, dcall);`
			`dcall->fn(dcall->ctx, dcall->hdr);`
			`talloc_free(dcall);`
			`}`
			`}`
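
The requeue callback above drains a doubly linked list: it takes the head entry, unlinks it, hands its packet back to the stored callback, and frees the entry, repeating until the list is empty. Below is a small sketch of that drain loop in plain C, under stated assumptions. The `demo_*` names are hypothetical, the hand-written link and unlink helpers replace Samba's `DLIST_ADD`/`DLIST_REMOVE` macros, head insertion is a choice of this sketch, and an integer `packet_id` replaces the real request header.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical callback type: re-submit one deferred packet. */
typedef void (*demo_requeue_fn)(void *ctx, int packet_id);

struct demo_deferred {
	struct demo_deferred *prev, *next;
	demo_requeue_fn fn;
	void *ctx;
	int packet_id;
};

/* Push a new deferred entry at the head of the list. */
static int demo_defer(struct demo_deferred **list, demo_requeue_fn fn,
		      void *ctx, int packet_id)
{
	struct demo_deferred *d = calloc(1, sizeof(*d));
	if (d == NULL) {
		return -1;
	}
	d->fn = fn;
	d->ctx = ctx;
	d->packet_id = packet_id;
	d->next = *list;
	if (*list != NULL) {
		(*list)->prev = d;
	}
	*list = d;
	return 0;
}

/* Unlink one entry, fixing up the head pointer when needed. */
static void demo_unlink(struct demo_deferred **list, struct demo_deferred *d)
{
	if (d->prev != NULL) {
		d->prev->next = d->next;
	} else {
		*list = d->next;
	}
	if (d->next != NULL) {
		d->next->prev = d->prev;
	}
	d->prev = NULL;
	d->next = NULL;
}

/* Drain the list: unlink first, then call back, then free. Returns the count. */
static int demo_requeue_all(struct demo_deferred **list)
{
	int handled = 0;

	while (*list != NULL) {
		struct demo_deferred *d = *list;

		demo_unlink(list, d);
		d->fn(d->ctx, d->packet_id);
		free(d);
		handled++;
	}
	return handled;
}

/* Example callback: record the order in which packets are re-submitted. */
struct demo_log {
	int ids[8];
	int count;
};

static void demo_record(void *ctx, int packet_id)
{
	struct demo_log *log = ctx;

	if (log->count < 8) {
		log->ids[log->count++] = packet_id;
	}
}
```

Unlinking before invoking the callback means the callback may safely defer new work without touching an entry that is about to be freed. With head insertion, entries deferred as 1, 2, 3 are re-submitted as 3, 2, 1.
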

			`static int deferred_call_destructor(struct revokechild_deferred_call *dcall)`
			`{`
			`struct revokechild_handle *rev_hdl = dcall->rev_hdl;`

			`DLIST_REMOVE(rev_hdl->deferred_call_list, dcall);`
			`return 0;`
			`}`
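
When a revoke finishes, `revokechild_destructor` sorts the deferred calls into two groups: calls to re-issue immediately and calls to delay. On success, only requests that need write access run immediately and read-only requests are delayed; on failure, everything is delayed so the database can be recovered first. The sketch below models only that decision rule, assuming hypothetical `demo_*` names, a made-up flag value for `DEMO_WANT_READONLY`, and fixed-size arrays instead of linked lists. It keeps input order within each group, which the real list handling may not.

```c
#include <assert.h>
#include <stddef.h>

#define DEMO_WANT_READONLY 0x1u  /* hypothetical flag value */
#define DEMO_MAX_REQS 8

struct demo_req {
	int id;
	unsigned int flags;
};

struct demo_split {
	int now[DEMO_MAX_REQS];
	size_t n_now;
	int delayed[DEMO_MAX_REQS];
	size_t n_delayed;
};

/*
 * revoke_status == 0 means the revoke succeeded.
 * Success: write requests run now, read-only requests are delayed.
 * Failure: every request is delayed.
 */
static struct demo_split demo_split_deferred(int revoke_status,
					     const struct demo_req *reqs,
					     size_t n)
{
	struct demo_split out = { {0}, 0, {0}, 0 };
	size_t i;

	assert(n <= DEMO_MAX_REQS);
	for (i = 0; i < n; i++) {
		int readonly = (reqs[i].flags & DEMO_WANT_READONLY) != 0;

		if (revoke_status == 0 && !readonly) {
			out.now[out.n_now++] = reqs[i].id;
		} else {
			out.delayed[out.n_delayed++] = reqs[i].id;
		}
	}
	return out;
}
```

Running writers ahead of readers after a successful revoke keeps a steady stream of read-only requests from starving the write that triggered the revoke.
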

			`static int revokechild_destructor(struct revokechild_handle *rev_hdl)`
			`{`
			`struct revokechild_deferred_call *now_list = NULL;`
			`struct revokechild_deferred_call *delay_list = NULL;`

			`if (rev_hdl->fde != NULL) {`
			`talloc_free(rev_hdl->fde);`
			`}`

			`if (rev_hdl->fd[0] != -1) {`
			`close(rev_hdl->fd[0]);`
			`}`
			`if (rev_hdl->fd[1] != -1) {`
			`close(rev_hdl->fd[1]);`
			`}`
			`ctdb_kill(rev_hdl->ctdb, rev_hdl->child, SIGKILL);`

			`DLIST_REMOVE(rev_hdl->ctdb_db->revokechild_active, rev_hdl);`

			`while (rev_hdl->deferred_call_list != NULL) {`
			`struct revokechild_deferred_call *dcall;`

			`dcall = rev_hdl->deferred_call_list;`
			`DLIST_REMOVE(rev_hdl->deferred_call_list, dcall);`

			`/* If revoke is successful, then first process all the calls`
			`* that need write access, and delay readonly requests by 1`
			`* second grace.`
			`*`
			`* If revoke is unsuccessful, most likely because of node`
			`* failure, delay all the pending requests, so database can`
			`* be recovered.`
			`*/`

			`if (rev_hdl->status == 0) {`
			`struct ctdb_req_call_old *c;`

			`c = (struct ctdb_req_call_old *)dcall->hdr;`
			`if (c->flags & CTDB_WANT_READONLY) {`
			`DLIST_ADD(delay_list, dcall);`
			`} else {`
			`DLIST_ADD(now_list, dcall);`
			`}`
			`} else {`
			`DLIST_ADD(delay_list, dcall);`
			`}`
			`}`

			`if (now_list != NULL) {`
			`tevent_add_timer(rev_hdl->ctdb->ev,`
			`rev_hdl->ctdb_db,`
			`tevent_timeval_current_ofs(0, 0),`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`deferred_call_requeue,`
			`now_list);`
ctdb-readonly: Avoid a tight loop waiting for revoke to complete BUG: https://bugzilla.samba.org/show_bug.cgi?id=12697 During revoking readonly delegations, if one of the nodes disappears, then there is no point re-trying revoking readonly delegation immedately. The database needs to be recovered before the revoke operation can succeed. However, if the revoke is successful, then all the write requests need to be processed immediately before the read-only requests. This avoids starving write requests, in case there are read-only requests coming from other nodes. In deferred_call_destructor, the result of revoke is not available and deferred calls cannot be correctly ordered. To correctly order the deferred calls, process them in revokechild_destructor where the result of revoke is known. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-05-18 04:50:09 +03:00			`}`

			`if (delay_list != NULL) {`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`tevent_add_timer(rev_hdl->ctdb->ev,`
			`rev_hdl->ctdb_db,`
ctdb-readonly: Avoid a tight loop waiting for revoke to complete BUG: https://bugzilla.samba.org/show_bug.cgi?id=12697 During revoking readonly delegations, if one of the nodes disappears, then there is no point re-trying revoking readonly delegation immedately. The database needs to be recovered before the revoke operation can succeed. However, if the revoke is successful, then all the write requests need to be processed immediately before the read-only requests. This avoids starving write requests, in case there are read-only requests coming from other nodes. In deferred_call_destructor, the result of revoke is not available and deferred calls cannot be correctly ordered. To correctly order the deferred calls, process them in revokechild_destructor where the result of revoke is known. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-05-18 04:50:09 +03:00			`tevent_timeval_current_ofs(1, 0),`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`deferred_call_requeue,`
			`delay_list);`
ctdb-readonly: Avoid a tight loop waiting for revoke to complete BUG: https://bugzilla.samba.org/show_bug.cgi?id=12697 During revoking readonly delegations, if one of the nodes disappears, then there is no point re-trying revoking readonly delegation immedately. The database needs to be recovered before the revoke operation can succeed. However, if the revoke is successful, then all the write requests need to be processed immediately before the read-only requests. This avoids starving write requests, in case there are read-only requests coming from other nodes. In deferred_call_destructor, the result of revoke is not available and deferred calls cannot be correctly ordered. To correctly order the deferred calls, process them in revokechild_destructor where the result of revoke is known. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-05-18 04:50:09 +03:00			`}`

ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`return 0;`
			`}`
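
The ordering rule described in the comment above (writers first, readonly requests and failed revokes delayed) can be sketched in isolation. This is a minimal illustration, not the ctdb implementation: `sketch_call`, `sketch_queues`, `sketch_push`, `sketch_partition` and `sketch_length` are hypothetical names, and a singly linked list stands in for the `DLIST_*` macros.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a deferred call: only the readonly bit is modelled. */
struct sketch_call {
	bool want_readonly;
	struct sketch_call *next;
};

/* The two queues that get built before the requeue timers are scheduled. */
struct sketch_queues {
	struct sketch_call *now_list;   /* would be requeued with a 0 second offset */
	struct sketch_call *delay_list; /* would be requeued with a 1 second offset */
};

/* Push onto the head of a singly linked list (a simplification of DLIST_ADD). */
static void sketch_push(struct sketch_call **list, struct sketch_call *c)
{
	assert(list != NULL && c != NULL);
	c->next = *list;
	*list = c;
}

/* Drain 'pending' and sort each call into now_list or delay_list. */
static struct sketch_queues sketch_partition(struct sketch_call *pending,
					     int revoke_status)
{
	struct sketch_queues q = { NULL, NULL };

	while (pending != NULL) {
		struct sketch_call *c = pending;
		pending = c->next;

		if (revoke_status == 0 && !c->want_readonly) {
			/* Revoke succeeded and the call needs write access. */
			sketch_push(&q.now_list, c);
		} else {
			/* Readonly request, or the revoke failed. */
			sketch_push(&q.delay_list, c);
		}
	}
	return q;
}

/* Count the entries in a list. */
static size_t sketch_length(const struct sketch_call *list)
{
	size_t n = 0;

	for (; list != NULL; list = list->next) {
		n++;
	}
	return n;
}
```

With a successful revoke (`revoke_status == 0`) only calls without the readonly bit land in `now_list`; with any other status every call lands in `delay_list`, which corresponds to waiting for database recovery.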

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`static void revokechild_handler(struct tevent_context *ev,`
			`struct tevent_fd *fde,`
			`uint16_t flags, void *private_data)`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`{`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`struct revokechild_handle *rev_hdl =`
			`talloc_get_type(private_data, struct revokechild_handle);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`int ret;`
			`char c;`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`ret = sys_read(rev_hdl->fd[0], &c, 1);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`if (ret != 1) {`
			`DEBUG(DEBUG_ERR,("Failed to read status from revokechild. errno:%d\n", errno));`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`rev_hdl->status = -1;`
			`talloc_free(rev_hdl);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`return;`
			`}`
			`if (c != 0) {`
			`DEBUG(DEBUG_ERR,("revokechild returned failure. status:%d\n", c));`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`rev_hdl->status = -1;`
			`talloc_free(rev_hdl);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`return;`
			`}`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`talloc_free(rev_hdl);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`
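
The handler above treats anything other than a single zero byte from the child as failure. A minimal sketch of that status-byte convention, using plain POSIX `read()`; `sketch_read_status` is a hypothetical helper, and the EINTR retry is an assumption about what `sys_read` does rather than something shown in this file.

```c
#include <assert.h>
#include <errno.h>
#include <unistd.h>

/*
 * Read one status byte from the read end of a pipe.
 * Returns 0 only if exactly one byte was read and it is zero;
 * EOF, a read error or a non-zero byte all map to -1.
 */
static int sketch_read_status(int fd)
{
	char c;
	ssize_t n;

	assert(fd >= 0);

	/* Retry on EINTR; this is an assumption about what sys_read does. */
	do {
		n = read(fd, &c, 1);
	} while (n == -1 && errno == EINTR);

	if (n != 1) {
		return -1;
	}
	return (c == 0) ? 0 : -1;
}
```

A child that exits without writing anything closes its end of the pipe, so the parent sees EOF and the sketch reports failure, which matches the `ret != 1` branch in the handler.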

			`struct ctdb_revoke_state {`
			`struct ctdb_db_context *ctdb_db;`
			`TDB_DATA key;`
			`struct ctdb_ltdb_header *header;`
			`TDB_DATA data;`
			`int count;`
			`int status;`
			`int finished;`
			`};`

			`static void update_record_cb(struct ctdb_client_control_state *state)`
			`{`
			`struct ctdb_revoke_state *revoke_state;`
			`int ret;`
			`int32_t res;`

			`if (state == NULL) {`
			`return;`
			`}`
			`revoke_state = state->async.private_data;`

			`state->async.fn = NULL;`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00			`ret = ctdb_control_recv(state->ctdb, state, state, NULL, &res, NULL);`
			`if ((ret != 0) || (res != 0)) {`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`DEBUG(DEBUG_ERR,("Recv for revoke update record failed ret:%d res:%d\n", ret, res));`
			`revoke_state->status = -1;`
			`}`

			`revoke_state->count--;`
			`if (revoke_state->count <= 0) {`
			`revoke_state->finished = 1;`
			`}`
			`}`

			`static void revoke_send_cb(struct ctdb_context *ctdb, uint32_t pnn, void *private_data)`
			`{`
			`struct ctdb_revoke_state *revoke_state = private_data;`
			`struct ctdb_client_control_state *state;`

ctdb-readonly: Do not use hard-coded value for readonly revoke timeout In case of control timeouts, readonly revoke code currently aborts. This needs to be fixed. Meanwhile, using control_timeout instead of 5 seconds, increases the timeout to 60 seconds. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Mon Mar 31 07:20:48 CEST 2014 on sn-devel-104 2014-03-28 06:44:34 +04:00			`state = ctdb_ctrl_updaterecord_send(ctdb, revoke_state, timeval_current_ofs(ctdb->tunable.control_timeout,0), pnn, revoke_state->ctdb_db, revoke_state->key, revoke_state->header, revoke_state->data);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`if (state == NULL) {`
			`DEBUG(DEBUG_ERR,("Failure to send update record to revoke readonly delegation\n"));`
			`revoke_state->status = -1;`
			`return;`
			`}`
			`state->async.fn = update_record_cb;`
			`state->async.private_data = revoke_state;`

			`revoke_state->count++;`

			`}`

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`static void ctdb_revoke_timeout_handler(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval yt, void *private_data)`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`{`
			`struct ctdb_revoke_state *state = private_data;`

			`DEBUG(DEBUG_ERR,("Timed out waiting for revoke to finish\n"));`
			`state->finished = 1;`
			`state->status = -1;`
			`}`

			`static int ctdb_revoke_all_delegations(struct ctdb_context *ctdb, struct ctdb_db_context *ctdb_db, TDB_DATA tdata, TDB_DATA key, struct ctdb_ltdb_header *header, TDB_DATA data)`
			`{`
			`struct ctdb_revoke_state *state = talloc_zero(ctdb, struct ctdb_revoke_state);`
ctdb-readonly: Add an early return to simplify code This patch makes the subsequent logic change small and easier to understand. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-12 17:54:39 +04:00			`struct ctdb_ltdb_header new_header;`
			`TDB_DATA new_data;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
			`state->ctdb_db = ctdb_db;`
			`state->key = key;`
			`state->header = header;`
			`state->data = data;`
ReadOnly: Add processing for ReadOnly delegation requests and revoke requests to the processing loop for CALL packets we receive from different nodes. This implements the ReadOnly and ReadWrite request processing, delegation and revoking of delegations for all requests coming in across the network from a remote node. (This used to be ctdb commit 78f2c2ea70e6270cec59db7c3f174a511bf608a9) 2011-07-20 09:13:47 +04:00
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`ctdb_trackingdb_traverse(ctdb, tdata, revoke_send_cb, state);`

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, state,`
			`timeval_current_ofs(ctdb->tunable.control_timeout, 0),`
			`ctdb_revoke_timeout_handler, state);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
			`while (state->finished == 0) {`
ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_loop_once(ctdb->ev);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`

ctdb-readonly: Add an early return to simplify code This patch makes the subsequent logic change small and easier to understand. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-12 17:54:39 +04:00			`if (ctdb_ltdb_lock(ctdb_db, key) != 0) {`
			`DEBUG(DEBUG_ERR,("Failed to chainlock the database in revokechild\n"));`
			`talloc_free(state);`
			`return -1;`
			`}`
			`if (ctdb_ltdb_fetch(ctdb_db, key, &new_header, state, &new_data) != 0) {`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`ctdb_ltdb_unlock(ctdb_db, key);`
ctdb-readonly: Add an early return to simplify code This patch makes the subsequent logic change small and easier to understand. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-12 17:54:39 +04:00			`DEBUG(DEBUG_ERR,("Failed to fetch tdb record in revokechild\n"));`
			`talloc_free(state);`
			`return -1;`
			`}`
			`header->rsn++;`
			`if (new_header.rsn > header->rsn) {`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`DEBUG(DEBUG_ERR,("RSN too high in tdb record in revokechild\n"));`
			`talloc_free(state);`
			`return -1;`
			`}`
			`if ( (new_header.flags & (CTDB_REC_RO_REVOKING_READONLY|CTDB_REC_RO_HAVE_DELEGATIONS)) != (CTDB_REC_RO_REVOKING_READONLY|CTDB_REC_RO_HAVE_DELEGATIONS) ) {`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`DEBUG(DEBUG_ERR,("Flags are wrong in tdb record in revokechild\n"));`
			`talloc_free(state);`
			`return -1;`
			`}`
ctdb-readonly: Do not abort if revoke of readonly record fails on a node Revoking readonly record involves first marking the record on dmaster as RO_REVOKING_READONLY. Then all the other nodes are sent update_record control to get rid of RO_DELEGATION. Once that succeeds, the record is marked RO_REVOKING_COMPLETE. Currently, revoking of readonly delegations on the nodes is tried only once. If a node goes in recovery, it can fail update_record control and revoke code will abort ctdb. Since database recovery would revoke all readonly delegations anyway, there is no reason to abort. Simply undo the start of revoke process by resetting RO_REVOKING_READONLY flag. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Aug 13 11:24:09 CEST 2014 on sn-devel-104 2014-08-12 17:58:00 +04:00
			`/*`
			`* If revoke on all nodes succeeds, revoke is complete. Otherwise,`
			`* remove CTDB_REC_RO_REVOKING_READONLY flag and retry.`
			`*/`
			`if (state->status == 0) {`
			`new_header.rsn++;`
			`new_header.flags |= CTDB_REC_RO_REVOKE_COMPLETE;`
			`} else {`
			`DEBUG(DEBUG_NOTICE, ("Revoke all delegations failed, retrying.\n"));`
			`new_header.flags &= ~CTDB_REC_RO_REVOKING_READONLY;`
			`}`
ctdb-readonly: Add an early return to simplify code This patch makes the subsequent logic change small and easier to understand. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-12 17:54:39 +04:00			`if (ctdb_ltdb_store(ctdb_db, key, &new_header, new_data) != 0) {`
			`ctdb_ltdb_unlock(ctdb_db, key);`
			`DEBUG(DEBUG_ERR,("Failed to write new record in revokechild\n"));`
			`talloc_free(state);`
			`return -1;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`
ctdb-readonly: Add an early return to simplify code This patch makes the subsequent logic change small and easier to understand. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-12 17:54:39 +04:00			`ctdb_ltdb_unlock(ctdb_db, key);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
			`talloc_free(state);`
ctdb-readonly: Do not abort if revoke of readonly record fails on a node Revoking readonly record involves first marking the record on dmaster as RO_REVOKING_READONLY. Then all the other nodes are sent update_record control to get rid of RO_DELEGATION. Once that succeeds, the record is marked RO_REVOKING_COMPLETE. Currently, revoking of readonly delegations on the nodes is tried only once. If a node goes in recovery, it can fail update_record control and revoke code will abort ctdb. Since database recovery would revoke all readonly delegations anyway, there is no reason to abort. Simply undo the start of revoke process by resetting RO_REVOKING_READONLY flag. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Aug 13 11:24:09 CEST 2014 on sn-devel-104 2014-08-12 17:58:00 +04:00			`return 0;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`


ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`int ctdb_start_revoke_ro_record(struct ctdb_context *ctdb,`
			`struct ctdb_db_context *ctdb_db,`
			`TDB_DATA key,`
			`struct ctdb_ltdb_header *header,`
			`TDB_DATA data)`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`{`
			`TDB_DATA tdata;`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`struct revokechild_handle *rev_hdl;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`pid_t parent = getpid();`
			`int ret;`

ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`header->flags &= ~(CTDB_REC_RO_REVOKING_READONLY |`
			`CTDB_REC_RO_HAVE_DELEGATIONS |`
			`CTDB_REC_RO_HAVE_READONLY);`

READONLY: when updating a remote node to revoke a delegation, make sure we dont create the record if it doesnt already exist (This used to be ctdb commit fb00e1290fcea3386132a46c883994019a43799a) 2012-03-02 05:57:23 +04:00			`header->flags |= CTDB_REC_FLAG_MIGRATED_WITH_DATA;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`header->rsn -= 1;`

ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`rev_hdl = talloc_zero(ctdb_db, struct revokechild_handle);`
			`if (rev_hdl == NULL) {`
			`D_ERR("Failed to allocate revokechild_handle\n");`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`return -1;`
			`}`

			`tdata = tdb_fetch(ctdb_db->rottdb, key);`
			`if (tdata.dsize > 0) {`
			`uint8_t *tmp;`

			`tmp = tdata.dptr;`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`tdata.dptr = talloc_memdup(rev_hdl, tdata.dptr, tdata.dsize);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`free(tmp);`
			`}`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`rev_hdl->status = 0;`
			`rev_hdl->ctdb = ctdb;`
			`rev_hdl->ctdb_db = ctdb_db;`
			`rev_hdl->fd[0] = -1;`
			`rev_hdl->fd[1] = -1;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`rev_hdl->key.dsize = key.dsize;`
			`rev_hdl->key.dptr = talloc_memdup(rev_hdl, key.dptr, key.dsize);`
			`if (rev_hdl->key.dptr == NULL) {`
ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`D_ERR("Failed to allocate key for revokechild_handle\n");`
ctdb-server: Add goto tag avoiding code duplication Introduced err_out goto tag to prevent code duplication. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 14:08:45 +03:00			`goto err_out;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`ret = pipe(rev_hdl->fd);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`if (ret != 0) {`
ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`D_ERR("Failed to create pipe for revokechild_handle\n");`
ctdb-server: Add goto tag avoiding code duplication Introduced err_out goto tag to prevent code duplication. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 14:08:45 +03:00			`goto err_out;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`


ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`rev_hdl->child = ctdb_fork(ctdb);`
			`if (rev_hdl->child == (pid_t)-1) {`
ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`D_ERR("Failed to fork child for revokechild\n");`
ctdb-server: Add goto tag avoiding code duplication Introduced err_out goto tag to prevent code duplication. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 14:08:45 +03:00			`goto err_out;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`if (rev_hdl->child == 0) {`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`char c = 0;`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`close(rev_hdl->fd[0]);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
ctdb: Use prctl_set_comment from lib/util Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-09-24 02:10:59 +03:00			`prctl_set_comment("ctdb_revokechild");`
ctdb-daemon: Remove setting of debug_extra from switch_from_server_to_client() Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-11-25 06:44:10 +03:00			`if (switch_from_server_to_client(ctdb) != 0) {`
ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`D_ERR("Failed to switch from server to client "`
			`"for revokechild process\n");`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`c = 1;`
			`goto child_finished;`
			`}`

ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`c = ctdb_revoke_all_delegations(ctdb,`
			`ctdb_db,`
			`tdata,`
			`key,`
			`header,`
			`data);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
			`child_finished:`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`sys_write(rev_hdl->fd[1], &c, 1);`
ctdb: Use ctdb_wait_for_process_to_exit() Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-12-08 06:20:59 +03:00			`ctdb_wait_for_process_to_exit(parent);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`_exit(0);`
			`}`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`close(rev_hdl->fd[1]);`
			`rev_hdl->fd[1] = -1;`
			`set_close_on_exec(rev_hdl->fd[0]);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`rev_hdl->fde = tevent_add_fd(ctdb->ev,`
			`rev_hdl,`
			`rev_hdl->fd[0],`
			`TEVENT_FD_READ,`
			`revokechild_handler,`
			`(void *)rev_hdl);`

			`if (rev_hdl->fde == NULL) {`
ctdb-server: Minor code cleanup Cleanup ctdb_start_revoke_ro_record to improve readability. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 13:57:23 +03:00			`D_ERR("Failed to set up fd event for revokechild process\n");`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`goto err_out;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`tevent_fd_set_auto_close(rev_hdl->fde);`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00
ctdb-server: Only set destructor if required Set the detructor in ctdb_start_revoke_ro_record after the revokechild_handle was added to the list. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> Autobuild-User(master): Jeremy Allison <jra@samba.org> Autobuild-Date(master): Sat Mar 31 03:45:51 CEST 2018 on sn-devel-144 2018-02-08 14:19:09 +03:00			`/* This is an active revokechild child process */`
			`DLIST_ADD_END(ctdb_db->revokechild_active, rev_hdl);`
			`talloc_set_destructor(rev_hdl, revokechild_destructor);`

ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`return 0;`
ctdb-server: Add goto tag avoiding code duplication Introduced err_out goto tag to prevent code duplication. Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-02-08 14:08:45 +03:00			`err_out:`
			`talloc_free(rev_hdl);`
			`return -1;`
ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a) 2011-08-23 04:27:31 +04:00			`}`
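
The function above hands the potentially blocking revoke to a forked child, which reports back through a pipe with a single status byte. The following is a minimal standalone sketch of that pattern, not the ctdb implementation: `run_in_child`, `blocking_work_fn`, `work_ok` and `work_fail` are hypothetical names, and the parent here blocks in `read()` where the code above instead registers the read end with the event loop.

```c
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical stand-in for the blocking work done in the child. */
typedef char (*blocking_work_fn)(void);

/* Two sample workers, assumed for illustration only. */
static char work_ok(void) { return 0; }
static char work_fail(void) { return 1; }

/*
 * Run work() in a forked child and return the one-byte status the
 * child writes to the pipe, or -1 on a local failure.
 */
static int run_in_child(blocking_work_fn work)
{
	int fd[2];
	pid_t child;
	char c = 0;
	ssize_t n;

	if (pipe(fd) != 0) {
		return -1;
	}

	child = fork();
	if (child == (pid_t)-1) {
		close(fd[0]);
		close(fd[1]);
		return -1;
	}

	if (child == 0) {
		/* Child: keep only the write end and report one byte. */
		close(fd[0]);
		c = work();
		if (write(fd[1], &c, 1) != 1) {
			_exit(1);
		}
		_exit(0);
	}

	/* Parent: keep only the read end and wait for the status. */
	close(fd[1]);
	n = read(fd[0], &c, 1);
	close(fd[0]);
	waitpid(child, NULL, 0);

	return (n == 1) ? (int)c : -1;
}
```

In this sketch a status of 0 means success and any other byte means failure, mirroring how `c` is used in the child branch above.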

ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`int ctdb_add_revoke_deferred_call(struct ctdb_context *ctdb, struct ctdb_db_context *ctdb_db, TDB_DATA key, struct ctdb_req_header *hdr, deferred_requeue_fn fn, void *call_context)`
			`{`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`struct revokechild_handle *rev_hdl;`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`struct revokechild_deferred_call *deferred_call;`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`for (rev_hdl = ctdb_db->revokechild_active;`
			`rev_hdl;`
			`rev_hdl = rev_hdl->next) {`
			`if (rev_hdl->key.dsize == 0) {`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`continue;`
			`}`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`if (rev_hdl->key.dsize != key.dsize) {`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`continue;`
			`}`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`if (!memcmp(rev_hdl->key.dptr, key.dptr, key.dsize)) {`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`break;`
			`}`
			`}`

ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`if (rev_hdl == NULL) {`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`DEBUG(DEBUG_ERR,("Failed to add deferred call to revoke list. revoke structure not found\n"));`
			`return -1;`
			`}`

ctdb-daemon: Allocate deferred calls off calling context BUG: https://bugzilla.samba.org/show_bug.cgi?id=13152 This makes sure that if a client disconnects, all the deferred calls from the client are correctly freed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-10-19 06:58:18 +03:00			`deferred_call = talloc(call_context, struct revokechild_deferred_call);`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`if (deferred_call == NULL) {`
			`DEBUG(DEBUG_ERR,("Failed to allocate deferred call structure for revoking record\n"));`
			`return -1;`
			`}`

			`deferred_call->ctdb = ctdb;`
ctdb-readonly: Avoid a tight loop waiting for revoke to complete BUG: https://bugzilla.samba.org/show_bug.cgi?id=12697 During revoking readonly delegations, if one of the nodes disappears, then there is no point re-trying revoking readonly delegation immedately. The database needs to be recovered before the revoke operation can succeed. However, if the revoke is successful, then all the write requests need to be processed immediately before the read-only requests. This avoids starving write requests, in case there are read-only requests coming from other nodes. In deferred_call_destructor, the result of revoke is not available and deferred calls cannot be correctly ordered. To correctly order the deferred calls, process them in revokechild_destructor where the result of revoke is known. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-05-18 04:50:09 +03:00			`deferred_call->hdr = talloc_steal(deferred_call, hdr);`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00			`deferred_call->fn = fn;`
			`deferred_call->ctx = call_context;`
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`deferred_call->rev_hdl = rev_hdl;`
ctdb-daemon: Allocate deferred calls off calling context BUG: https://bugzilla.samba.org/show_bug.cgi?id=13152 This makes sure that if a client disconnects, all the deferred calls from the client are correctly freed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-10-19 06:58:18 +03:00
			`talloc_set_destructor(deferred_call, deferred_call_destructor);`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00
ctdb-server: Replace the variable rc by something meaningful Replace the varibale name "rc" in ctdb_start_revoke_ro_record to prevent a mix-up with the common meaning of rc (return code). Signed-off-by: Swen Schillig <swen@vnet.ibm.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Jeremy Allison <jra@samba.org> 2018-01-31 15:06:30 +03:00			`DLIST_ADD(rev_hdl->deferred_call_list, deferred_call);`
ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce) 2011-07-20 07:49:17 +04:00
			`return 0;`
			`}`
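
The lookup loop in the function above finds the active revoke handle for a record by comparing key lengths first and key bytes second. A minimal sketch of that search, under the assumption of simplified stand-in types (`struct blob`, `struct handle` and `find_handle` are hypothetical and are not the ctdb or tdb definitions):

```c
#include <stddef.h>
#include <string.h>

/* Simplified stand-in for a key: pointer plus length. */
struct blob {
	const unsigned char *dptr;
	size_t dsize;
};

/* Simplified stand-in for a list entry that owns a key. */
struct handle {
	struct handle *next;
	struct blob key;
};

/* Return the first entry whose key equals 'key', or NULL if none does. */
static struct handle *find_handle(struct handle *list, struct blob key)
{
	struct handle *h;

	for (h = list; h != NULL; h = h->next) {
		/* Skip entries without a key and keys of a different length. */
		if (h->key.dsize == 0 || h->key.dsize != key.dsize) {
			continue;
		}
		if (memcmp(h->key.dptr, key.dptr, key.dsize) == 0) {
			return h;
		}
	}

	return NULL;
}
```

Checking the length before calling `memcmp` keeps the comparison from reading past the end of the shorter key.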
ctdb-daemon: Add tracking of migration records Instead of using hopcount as a metric for hot records, use the number of migrations per second as a metric. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Apr 5 08:35:45 CEST 2017 on sn-devel-144 2017-03-21 08:48:45 +03:00
			`static void ctdb_migration_count_handler(TDB_DATA key, uint64_t counter,`
			`void *private_data)`
			`{`
			`struct ctdb_db_context *ctdb_db = talloc_get_type_abort(`
			`private_data, struct ctdb_db_context);`
ctdb-daemon: Avoid signed/unsigned comparison by declaring as unsigned Compiling with -Wsign-compare complains: ctdb/server/ctdb_call.c:831:12: warning: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Wsign-compare] 831 \| if (count <= ctdb_db->statistics.hot_keys[0].count) { \| ^~ and ctdb/server/ctdb_call.c:844:13: warning: comparison of integer expressions of different signedness: ‘int’ and ‘uint32_t’ {aka ‘unsigned int’} [-Wsign-compare] 844 \| if (count <= ctdb_db->statistics.hot_keys[i].count) { \| ^~ Found by cs-build. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2019-08-01 03:55:39 +03:00			`unsigned int value;`
ctdb-daemon: Add tracking of migration records Instead of using hopcount as a metric for hot records, use the number of migrations per second as a metric. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Apr 5 08:35:45 CEST 2017 on sn-devel-144 2017-03-21 08:48:45 +03:00
			`value = (counter < INT_MAX ? counter : INT_MAX);`
			`ctdb_update_db_stat_hot_keys(ctdb_db, key, value);`
			`}`
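
The handler above narrows a 64-bit migration counter to an `unsigned int`, saturating at `INT_MAX` instead of letting the value wrap. A minimal sketch of that clamp in isolation (`clamp_counter` is a hypothetical helper name, not part of ctdb):

```c
#include <limits.h>
#include <stdint.h>

/* Narrow a 64-bit counter to unsigned int, saturating at INT_MAX. */
static unsigned int clamp_counter(uint64_t counter)
{
	return (counter < INT_MAX) ? (unsigned int)counter
				   : (unsigned int)INT_MAX;
}
```

Small values pass through unchanged, while anything at or above `INT_MAX` is reported as `INT_MAX`.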

			`static void ctdb_migration_cleandb_event(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval current_time,`
			`void *private_data)`
			`{`
			`struct ctdb_db_context *ctdb_db = talloc_get_type_abort(`
			`private_data, struct ctdb_db_context);`

			`if (ctdb_db->migratedb == NULL) {`
			`return;`
			`}`

			`hash_count_expire(ctdb_db->migratedb, NULL);`

			`te = tevent_add_timer(ctdb_db->ctdb->ev, ctdb_db->migratedb,`
			`tevent_timeval_current_ofs(10, 0),`
			`ctdb_migration_cleandb_event, ctdb_db);`
			`if (te == NULL) {`
			`DEBUG(DEBUG_ERR,`
			`("Memory error in migration cleandb event for %s\n",`
			`ctdb_db->db_name));`
			`TALLOC_FREE(ctdb_db->migratedb);`
			`}`
			`}`

			`int ctdb_migration_init(struct ctdb_db_context *ctdb_db)`
			`{`
			`struct timeval one_second = { 1, 0 };`
			`struct tevent_timer *te;`
			`int ret;`

ctdb-daemon: Add accessors for CTDB_DB_FLAGS_PERSISTENT flag This allows to differentiate between the two database models. ctdb_db_persistent() - replicated and permanent ctdb_db_volatile() - distributed and temporary Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:39:29 +03:00			`if (! ctdb_db_volatile(ctdb_db)) {`
ctdb-daemon: Add tracking of migration records Instead of using hopcount as a metric for hot records, use the number of migrations per second as a metric. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Wed Apr 5 08:35:45 CEST 2017 on sn-devel-144 2017-03-21 08:48:45 +03:00			`return 0;`
			`}`

			`ret = hash_count_init(ctdb_db, one_second,`
			`ctdb_migration_count_handler, ctdb_db,`
			`&ctdb_db->migratedb);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,`
			`("Memory error in migration init for %s\n",`
			`ctdb_db->db_name));`
			`return -1;`
			`}`

			`te = tevent_add_timer(ctdb_db->ctdb->ev, ctdb_db->migratedb,`
			`tevent_timeval_current_ofs(10, 0),`
			`ctdb_migration_cleandb_event, ctdb_db);`
			`if (te == NULL) {`
			`DEBUG(DEBUG_ERR,`
			`("Memory error in migration init for %s\n",`
			`ctdb_db->db_name));`
			`TALLOC_FREE(ctdb_db->migratedb);`
			`return -1;`
			`}`

			`return 0;`
			`}`
