samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-22 13:34:15 +03:00

Ignoring revisions in .git-blame-ignore-revs. Click here to bypass and see the normal blame view.

1244 lines

32 KiB

C

Raw Normal View History

ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`/*`
separate out the freeze/thaw handling from recovery (This used to be ctdb commit 0b0640bd8b8334961f240e0cf276ac112cd6e616) 2007-05-12 09:15:27 +04:00			`ctdb recovery code`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00
			`Copyright (C) Andrew Tridgell 2007`
			`Copyright (C) Ronnie Sahlberg 2007`

ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is free software; you can redistribute it and/or modify`
			`it under the terms of the GNU General Public License as published by`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`the Free Software Foundation; either version 3 of the License, or`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`(at your option) any later version.`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`This program is distributed in the hope that it will be useful,`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00			`but WITHOUT ANY WARRANTY; without even the implied warranty of`
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the`
			`GNU General Public License for more details.`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00
ctdb is GPL not LGPL (This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960) 2007-05-31 07:50:53 +04:00			`You should have received a copy of the GNU General Public License`
update lib/replace from samba4 (This used to be ctdb commit f0555484105668c01c21f56322992e752e831109) 2007-07-10 09:29:31 +04:00			`along with this program; if not, see <http://www.gnu.org/licenses/>.`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00			`*/`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00			`#include "replace.h"`
add a ctdb uptime command that prints when ctdb was started and when the last recovery occured (This used to be ctdb commit b86e8ccbdac044bb949c4fc2ebb27635126272a9) 2008-01-17 03:33:23 +03:00			`#include "system/time.h"`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00			`#include "system/network.h"`
			`#include "system/filesys.h"`
			`#include "system/wait.h"`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00
			`#include <talloc.h>`
			`#include <tevent.h>`
			`#include <tdb.h>`

ctdb-util: Rename db_wrap to tdb_wrap and make it a build subsystem This makes it consistent with Samba, to ease transition. Update unit test code to link to with tdb_wrap instead of including db_wrap.c. There are some potential whitespace fixes in this commit that have been ignored. CTDB's lib/tdb_wrap will be deleted after the transition to Samba's lib/tdb_wrap, so there's no point polishing it too much. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2014-08-15 09:46:33 +04:00			`#include "lib/tdb_wrap/tdb_wrap.h"`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00			`#include "lib/util/dlinklist.h"`
			`#include "lib/util/debug.h"`
ctdb-recovery: Include lib/util/time.h instead of samba_util.h Less is more... Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-02-16 05:41:21 +03:00			`#include "lib/util/time.h"`
ctdb: Use prctl_set_comment from lib/util Signed-off-by: Christof Schmitt <cs@samba.org> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-09-24 02:10:59 +03:00			`#include "lib/util/util_process.h"`
ctdb-daemon: Remove dependency on includes.h Instead of includes.h, include the required header files explicitly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:46 +03:00
			`#include "ctdb_private.h"`
			`#include "ctdb_client.h"`

ctdb-daemon: Separate prototypes for system specific functions This groups function prototypes for system specific functions in common/system.h and removes them from ctdb_private.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-23 06:11:53 +03:00			`#include "common/system.h"`
ctdb-daemon: Separate prototypes for common client/server functions This groups function prototypes for common client/server functions in common/common.h and removes them from ctdb_private.h. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-23 06:17:34 +03:00			`#include "common/common.h"`
ctdb-server: Replace ctdb_logging.h with common/logging.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Michael Adam <obnox@samba.org> 2015-11-11 07:41:10 +03:00			`#include "common/logging.h"`
more robust freeze/thaw logic (This used to be ctdb commit 51c1e51aeb7dfac1683584df7ef1bef98c092f76) 2007-05-12 09:29:06 +04:00
ctdb-cluster-mutex: Factor out cluster mutex code Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-02-17 06:32:03 +03:00			`#include "ctdb_cluster_mutex.h"`

ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`int`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00			`ctdb_control_getvnnmap(struct ctdb_context ctdb, uint32_t opcode, TDB_DATA indata, TDB_DATA outdata)`
			`{`
separate the wire format and internal format for the vnn_map (This used to be ctdb commit 9a71718d87c5162f1423d85c2e86a01f6771925e) 2007-05-10 02:13:19 +04:00			`struct ctdb_vnn_map_wire *map;`
			`size_t len;`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00
ctdb: Fix some "declarations after code" problems Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2014-09-04 05:21:24 +04:00			`CHECK_CONTROL_DATA_SIZE(0);`

separate the wire format and internal format for the vnn_map (This used to be ctdb commit 9a71718d87c5162f1423d85c2e86a01f6771925e) 2007-05-10 02:13:19 +04:00			`len = offsetof(struct ctdb_vnn_map_wire, map) + sizeof(uint32_t)*ctdb->vnn_map->size;`
			`map = talloc_size(outdata, len);`
fixed some incorrect CTDB_NO_MEMORY*() calls found after fixing the _VOID varient (This used to be ctdb commit 07c9133aedecaee3607ad3b6fa94e5c56417a9de) 2008-07-04 11:04:26 +04:00			`CTDB_NO_MEMORY(ctdb, map);`
separate the wire format and internal format for the vnn_map (This used to be ctdb commit 9a71718d87c5162f1423d85c2e86a01f6771925e) 2007-05-10 02:13:19 +04:00
			`map->generation = ctdb->vnn_map->generation;`
			`map->size = ctdb->vnn_map->size;`
			`memcpy(map->map, ctdb->vnn_map->map, sizeof(uint32_t)*map->size);`

			`outdata->dsize = len;`
			`outdata->dptr = (uint8_t *)map;`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00
			`return 0;`
			`}`

ctdb-daemon: Remove freeze requirement for updating vnnmap In the parallel database recovery model, all the database will not remain frozen at the same time. So relax the condition to check if recovery is active. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-14 06:49:05 +03:00			`int`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00			`ctdb_control_setvnnmap(struct ctdb_context ctdb, uint32_t opcode, TDB_DATA indata, TDB_DATA outdata)`
			`{`
fixed setvnnmap to use wire structures too (This used to be ctdb commit 1208e4219d220b80e2f74974cac8ed2b8956d3ef) 2007-05-10 02:22:26 +04:00			`struct ctdb_vnn_map_wire map = (struct ctdb_vnn_map_wire )indata.dptr;`

ctdb-daemon: Remove freeze requirement for updating vnnmap In the parallel database recovery model, all the database will not remain frozen at the same time. So relax the condition to check if recovery is active. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-14 06:49:05 +03:00			`if (ctdb->recovery_mode != CTDB_RECOVERY_ACTIVE) {`
			`DEBUG(DEBUG_ERR, ("Attempt to set vnnmap when not in recovery\n"));`
ctdb-daemon: Avoid the use of ctdb->freeze_mode variable Use ctdb->freeze_mode only in ctdb_freeze.c and use the functions to check if databases are frozen everywhere else. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2014-08-21 06:32:02 +04:00			`return -1;`
don't allow setvnnmap while not frozen (This used to be ctdb commit a73f47f565894cc7e346177d87f2e6813837e1c6) 2007-05-14 07:48:40 +04:00			`}`

fixed setvnnmap to use wire structures too (This used to be ctdb commit 1208e4219d220b80e2f74974cac8ed2b8956d3ef) 2007-05-10 02:22:26 +04:00			`talloc_free(ctdb->vnn_map);`

			`ctdb->vnn_map = talloc(ctdb, struct ctdb_vnn_map);`
			`CTDB_NO_MEMORY(ctdb, ctdb->vnn_map);`

			`ctdb->vnn_map->generation = map->generation;`
			`ctdb->vnn_map->size = map->size;`
			`ctdb->vnn_map->map = talloc_array(ctdb->vnn_map, uint32_t, map->size);`
			`CTDB_NO_MEMORY(ctdb, ctdb->vnn_map->map);`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00
fixed setvnnmap to use wire structures too (This used to be ctdb commit 1208e4219d220b80e2f74974cac8ed2b8956d3ef) 2007-05-10 02:22:26 +04:00			`memcpy(ctdb->vnn_map->map, map->map, sizeof(uint32_t)*map->size);`
break set/get vnn map out from ctdb_control and put it in ctdb_recover.c for the time being remove all the [de]marshalling and just pass a structure around instead (This used to be ctdb commit b1169555ab7015976c0135ff51121cc238f5887c) 2007-05-03 05:06:24 +04:00
			`return 0;`
			`}`

ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`int`
fixup getdbmap control so it looks a bit nicer (This used to be ctdb commit 78a4d61cb78da20af5210488e685c91bc3023e90) 2007-05-03 07:07:34 +04:00			`ctdb_control_getdbmap(struct ctdb_context ctdb, uint32_t opcode, TDB_DATA indata, TDB_DATA outdata)`
			`{`
			`uint32_t i, len;`
			`struct ctdb_db_context *ctdb_db;`
ctdb-daemon: Rename struct ctdb_dbid_map to ctdb_dbid_map_old Match struct ctdb_dbid as per protocol/protocol.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:46:05 +03:00			`struct ctdb_dbid_map_old *dbid_map;`
fixup getdbmap control so it looks a bit nicer (This used to be ctdb commit 78a4d61cb78da20af5210488e685c91bc3023e90) 2007-05-03 07:07:34 +04:00
			`CHECK_CONTROL_DATA_SIZE(0);`

			`len = 0;`
			`for(ctdb_db=ctdb->db_list;ctdb_db;ctdb_db=ctdb_db->next){`
			`len++;`
			`}`


ctdb-daemon: Rename struct ctdb_dbid_map to ctdb_dbid_map_old Match struct ctdb_dbid as per protocol/protocol.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:46:05 +03:00			`outdata->dsize = offsetof(struct ctdb_dbid_map_old, dbs) + sizeof(dbid_map->dbs[0])*len;`
fixup getdbmap control so it looks a bit nicer (This used to be ctdb commit 78a4d61cb78da20af5210488e685c91bc3023e90) 2007-05-03 07:07:34 +04:00			`outdata->dptr = (unsigned char *)talloc_zero_size(outdata, outdata->dsize);`
			`if (!outdata->dptr) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ALERT, (__location__ " Failed to allocate dbmap array\n"));`
fixup getdbmap control so it looks a bit nicer (This used to be ctdb commit 78a4d61cb78da20af5210488e685c91bc3023e90) 2007-05-03 07:07:34 +04:00			`exit(1);`
			`}`

ctdb-daemon: Rename struct ctdb_dbid_map to ctdb_dbid_map_old Match struct ctdb_dbid as per protocol/protocol.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:46:05 +03:00			`dbid_map = (struct ctdb_dbid_map_old *)outdata->dptr;`
fixup getdbmap control so it looks a bit nicer (This used to be ctdb commit 78a4d61cb78da20af5210488e685c91bc3023e90) 2007-05-03 07:07:34 +04:00			`dbid_map->num = len;`
added support for persistent databases in ctdbd (This used to be ctdb commit 3115090a0d882beca9d70761130b74bb0821f201) 2007-09-21 06:24:02 +04:00			`for (i=0,ctdb_db=ctdb->db_list;ctdb_db;i++,ctdb_db=ctdb_db->next){`
ctdb-daemon: Rename struct ctdb_dbid_map to ctdb_dbid_map_old Match struct ctdb_dbid as per protocol/protocol.h Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:46:05 +03:00			`dbid_map->dbs[i].db_id = ctdb_db->db_id;`
ctdb-daemon: Store db_flags instead of individual boolean flags Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:53:17 +03:00			`dbid_map->dbs[i].flags = ctdb_db->db_flags;`
fixup getdbmap control so it looks a bit nicer (This used to be ctdb commit 78a4d61cb78da20af5210488e685c91bc3023e90) 2007-05-03 07:07:34 +04:00			`}`

			`return 0;`
			`}`
cleanup getnodemap (This used to be ctdb commit 3867ccf71a167fb82dbc5a3f03f968a325a0c70b) 2007-05-03 07:30:38 +04:00
ctdb-daemon: Factor out new function ctdb_node_list_to_map() Change ctdb_control_getnodemap() to use this. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-20 04:31:37 +03:00			`int`
			`ctdb_control_getnodemap(struct ctdb_context ctdb, uint32_t opcode, TDB_DATA indata, TDB_DATA outdata)`
			`{`
			`CHECK_CONTROL_DATA_SIZE(0);`

			`outdata->dptr = (unsigned char *)ctdb_node_list_to_map(ctdb->nodes,`
			`ctdb->num_nodes,`
			`outdata);`
			`if (outdata->dptr == NULL) {`
			`return -1;`
			`}`

			`outdata->dsize = talloc_get_size(outdata->dptr);`

update TAKEIP/RELEASEIP/GETPUBLICIP/GETNODEMAP controls so we retain an older ipv4-only version of these controls. We need this so that we are backwardcompatible with old versions of ctdb and so that we can interoperate with a ipv4-only recmaster during a rolling upgrade. (This used to be ctdb commit 6b76c520f97127099bd9fbaa0fa7af1c61947fb7) 2008-10-14 03:40:29 +04:00			`return 0;`
			`}`

ctdb-daemon: Don't delay reloading the nodes file Presumably this was done to minimise the chance of a recovery occurring while the nodemaps are inconsistent across nodes. Another potential theory is that the forced recovery in the ctdb.c:control_reload_nodes_file() stops another recovery occurring for ReRecoveryTimeout seconds, so this delay causes the reloads to occur during that period. This is no longer necessary because recoveries are now explicitly disabled while node files are reloaded. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-10 07:43:03 +03:00			`/*`
			`reload the nodes file`
			`*/`
			`int`
			`ctdb_control_reload_nodes_file(struct ctdb_context *ctdb, uint32_t opcode)`
to make it easier/less disruptive to add nodes to a running cluster add a new control that causes the node to drop the current nodes list and reread it from the nodes file. During this operation, the node will also drop the tcp layer and restart it. When we drop the tcp layer, by talloc_free()ing the ctcp structure add a destructor to ctcp so that we also can clean up and remove the references in the ctdb structure to the transport layer add two new commands for the ctdb tool. one to list all nodes in the nodesfile and the second a command to trigger a node to drop the transport and reinitialize it with the nde nodes file (This used to be ctdb commit 4bc20ac73e9fa94ffd43cccb6eeb438eeff9963c) 2008-02-19 06:44:48 +03:00			`{`
ctdb-recovery: Fix signed/unsigned comparisons by declaring as unsigned Simple cases where variables need to be declared as an unsigned type instead of an int. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2019-05-23 01:43:58 +03:00			`unsigned int i, num_nodes;`
redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00			`TALLOC_CTX *tmp_ctx;`
ctdb-daemon: Don't delay reloading the nodes file Presumably this was done to minimise the chance of a recovery occurring while the nodemaps are inconsistent across nodes. Another potential theory is that the forced recovery in the ctdb.c:control_reload_nodes_file() stops another recovery occurring for ReRecoveryTimeout seconds, so this delay causes the reloads to occur during that period. This is no longer necessary because recoveries are now explicitly disabled while node files are reloaded. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-10 07:43:03 +03:00			`struct ctdb_node **nodes;`
redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00
			`tmp_ctx = talloc_new(ctdb);`

			`/* steal the old nodes file for a while */`
			`talloc_steal(tmp_ctx, ctdb->nodes);`
			`nodes = ctdb->nodes;`
			`ctdb->nodes = NULL;`
			`num_nodes = ctdb->num_nodes;`
			`ctdb->num_nodes = 0;`
to make it easier/less disruptive to add nodes to a running cluster add a new control that causes the node to drop the current nodes list and reread it from the nodes file. During this operation, the node will also drop the tcp layer and restart it. When we drop the tcp layer, by talloc_free()ing the ctcp structure add a destructor to ctcp so that we also can clean up and remove the references in the ctdb structure to the transport layer add two new commands for the ctdb tool. one to list all nodes in the nodesfile and the second a command to trigger a node to drop the transport and reinitialize it with the nde nodes file (This used to be ctdb commit 4bc20ac73e9fa94ffd43cccb6eeb438eeff9963c) 2008-02-19 06:44:48 +03:00
redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00			`/* load the new nodes file */`
to make it easier/less disruptive to add nodes to a running cluster add a new control that causes the node to drop the current nodes list and reread it from the nodes file. During this operation, the node will also drop the tcp layer and restart it. When we drop the tcp layer, by talloc_free()ing the ctcp structure add a destructor to ctcp so that we also can clean up and remove the references in the ctdb structure to the transport layer add two new commands for the ctdb tool. one to list all nodes in the nodesfile and the second a command to trigger a node to drop the transport and reinitialize it with the nde nodes file (This used to be ctdb commit 4bc20ac73e9fa94ffd43cccb6eeb438eeff9963c) 2008-02-19 06:44:48 +03:00			`ctdb_load_nodes_file(ctdb);`
ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00
When we reload the nodes file instead of shutting down/restarting the entire tcp layer just bounce all outgoing connections and reconnect (This used to be ctdb commit e701a531868149f16561011e65794a4a46ee6596) 2008-10-07 11:12:54 +04:00			`for (i=0; i<ctdb->num_nodes; i++) {`
redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00			`/* keep any identical pre-existing nodes and connections */`
			`if ((i < num_nodes) && ctdb_same_address(&ctdb->nodes[i]->address, &nodes[i]->address)) {`
			`talloc_free(ctdb->nodes[i]);`
			`ctdb->nodes[i] = talloc_steal(ctdb->nodes, nodes[i]);`
			`continue;`
			`}`

add a new node state : DELETED. This is used to mark nodes as being DELETED internally in ctdb so that nodes are not renumbered if / when they are removed from the nodes file. This is used to be able to do "ctdb reloadnodes" at runtime without causing nodes to be renumbered. To do this, instead of deleting a node from the nodes file, just comment it out like 1.0.0.1 #1.0.0.2 1.0.0.3 After removing 1.0.0.2 from the cluster, the remaining nodes retain their pnn's from prior to the deletion, namely 0 and 2 Any line in the nodes file that is commented out represents a DELETED pnn (This used to be ctdb commit 6a5e4fd7fa391206b463bb4e976502f3ac5bd343) 2009-06-01 08:18:34 +04:00			`if (ctdb->nodes[i]->flags & NODE_FLAGS_DELETED) {`
			`continue;`
			`}`

redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00			`/* any new or different nodes must be added */`
When we reload the nodes file instead of shutting down/restarting the entire tcp layer just bounce all outgoing connections and reconnect (This used to be ctdb commit e701a531868149f16561011e65794a4a46ee6596) 2008-10-07 11:12:54 +04:00			`if (ctdb->methods->add_node(ctdb->nodes[i]) != 0) {`
			`DEBUG(DEBUG_CRIT, (__location__ " methods->add_node failed at %d\n", i));`
			`ctdb_fatal(ctdb, "failed to add node. shutting down\n");`
			`}`
redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00			`if (ctdb->methods->connect_node(ctdb->nodes[i]) != 0) {`
			`DEBUG(DEBUG_CRIT, (__location__ " methods->add_connect failed at %d\n", i));`
			`ctdb_fatal(ctdb, "failed to connect to node. shutting down\n");`
			`}`
ctdb->methods becomes NULL when we shutdown the transport. If we shutdown the transport and CTDB later decides to send a command out for queueing, the call to ctdb->methods->allocate_pkt() will SEGV. This could trigger for example when we are in the process of shuttind down CTDBD and have already shutdown the transport but we are still waiting for the "shutdown" eventscripts to finish. If the event scripts now take much much longer to execute for some reason, this race condition becomes much more probable. Decorate all dereferencing of ctdb->methods-> with a check that ctdb->menthods is non-NULL (This used to be ctdb commit c4c2c53918da6fb566d6e9cbd6b02e61ae2921e7) 2008-05-11 08:28:33 +04:00			`}`
to make it easier/less disruptive to add nodes to a running cluster add a new control that causes the node to drop the current nodes list and reread it from the nodes file. During this operation, the node will also drop the tcp layer and restart it. When we drop the tcp layer, by talloc_free()ing the ctcp structure add a destructor to ctcp so that we also can clean up and remove the references in the ctdb structure to the transport layer add two new commands for the ctdb tool. one to list all nodes in the nodesfile and the second a command to trigger a node to drop the transport and reinitialize it with the nde nodes file (This used to be ctdb commit 4bc20ac73e9fa94ffd43cccb6eeb438eeff9963c) 2008-02-19 06:44:48 +03:00
ctdb: Fix a typo Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Jeremy Allison <jra@samba.org> 2021-03-03 11:58:50 +03:00			`/* tell the recovery daemon to reload the nodes file too */`
add a new node state : DELETED. This is used to mark nodes as being DELETED internally in ctdb so that nodes are not renumbered if / when they are removed from the nodes file. This is used to be able to do "ctdb reloadnodes" at runtime without causing nodes to be renumbered. To do this, instead of deleting a node from the nodes file, just comment it out like 1.0.0.1 #1.0.0.2 1.0.0.3 After removing 1.0.0.2 from the cluster, the remaining nodes retain their pnn's from prior to the deletion, namely 0 and 2 Any line in the nodes file that is commented out represents a DELETED pnn (This used to be ctdb commit 6a5e4fd7fa391206b463bb4e976502f3ac5bd343) 2009-06-01 08:18:34 +04:00			`ctdb_daemon_send_message(ctdb, ctdb->pnn, CTDB_SRVID_RELOAD_NODES, tdb_null);`

redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b) 2008-12-02 05:26:30 +03:00			`talloc_free(tmp_ctx);`
to make it easier/less disruptive to add nodes to a running cluster add a new control that causes the node to drop the current nodes list and reread it from the nodes file. During this operation, the node will also drop the tcp layer and restart it. When we drop the tcp layer, by talloc_free()ing the ctcp structure add a destructor to ctcp so that we also can clean up and remove the references in the ctdb structure to the transport layer add two new commands for the ctdb tool. one to list all nodes in the nodesfile and the second a command to trigger a node to drop the transport and reinitialize it with the nde nodes file (This used to be ctdb commit 4bc20ac73e9fa94ffd43cccb6eeb438eeff9963c) 2008-02-19 06:44:48 +03:00
cleanup getnodemap (This used to be ctdb commit 3867ccf71a167fb82dbc5a3f03f968a325a0c70b) 2007-05-03 07:30:38 +04:00			`return 0;`
			`}`
cleanup the control "write record" (This used to be ctdb commit 4dd5c26a21a5dc2b2f76eb23cfeb4df82ba4e956) 2007-05-03 10:18:03 +04:00
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`struct db_pull_state {`
			`struct ctdb_context *ctdb;`
			`struct ctdb_db_context *ctdb_db;`
			`struct ctdb_marshall_buffer *recs;`
			`uint32_t pnn;`
			`uint64_t srvid;`
			`uint32_t num_records;`
			`};`

			`static int traverse_db_pull(struct tdb_context *tdb, TDB_DATA key,`
			`TDB_DATA data, void *private_data)`
			`{`
			`struct db_pull_state state = (struct db_pull_state )private_data;`
			`struct ctdb_marshall_buffer *recs;`

			`recs = ctdb_marshall_add(state->ctdb, state->recs,`
			`state->ctdb_db->db_id, 0, key, NULL, data);`
			`if (recs == NULL) {`
			`TALLOC_FREE(state->recs);`
			`return -1;`
			`}`
			`state->recs = recs;`

			`if (talloc_get_size(state->recs) >=`
			`state->ctdb->tunable.rec_buffer_size_limit) {`
			`TDB_DATA buffer;`
			`int ret;`

			`buffer = ctdb_marshall_finish(state->recs);`
			`ret = ctdb_daemon_send_message(state->ctdb, state->pnn,`
			`state->srvid, buffer);`
			`if (ret != 0) {`
			`TALLOC_FREE(state->recs);`
			`return -1;`
			`}`

			`state->num_records += state->recs->count;`
			`TALLOC_FREE(state->recs);`
			`}`

			`return 0;`
			`}`

			`int32_t ctdb_control_db_pull(struct ctdb_context *ctdb,`
			`struct ctdb_req_control_old *c,`
			`TDB_DATA indata, TDB_DATA *outdata)`
			`{`
			`struct ctdb_pulldb_ext *pulldb_ext;`
			`struct ctdb_db_context *ctdb_db;`
			`struct db_pull_state state;`
			`int ret;`

			`pulldb_ext = (struct ctdb_pulldb_ext *)indata.dptr;`

			`ctdb_db = find_ctdb_db(ctdb, pulldb_ext->db_id);`
			`if (ctdb_db == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Unknown db 0x%08x\n",`
			`pulldb_ext->db_id));`
			`return -1;`
			`}`

			`if (!ctdb_db_frozen(ctdb_db)) {`
			`DEBUG(DEBUG_ERR,`
			`("rejecting ctdb_control_pull_db when not frozen\n"));`
			`return -1;`
			`}`

			`if (ctdb_db->unhealthy_reason) {`
			`/* this is just a warning, as the tdb should be empty anyway */`
			`DEBUG(DEBUG_WARNING,`
			`("db(%s) unhealty in ctdb_control_db_pull: %s\n",`
			`ctdb_db->db_name, ctdb_db->unhealthy_reason));`
			`}`

			`state.ctdb = ctdb;`
			`state.ctdb_db = ctdb_db;`
			`state.recs = NULL;`
			`state.pnn = c->hdr.srcnode;`
			`state.srvid = pulldb_ext->srvid;`
			`state.num_records = 0;`

ctdb-daemon: Don't pull any records if records are invalidated This avoids unnecessary work during recovery to pull records from nodes that were INACTIVE just before the recovery. BUG: https://bugzilla.samba.org/show_bug.cgi?id=13641 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2018-02-14 06:27:32 +03:00			`/* If the records are invalid, we are done */`
			`if (ctdb_db->invalid_records) {`
			`goto done;`
			`}`

ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`if (ctdb_lockdb_mark(ctdb_db) != 0) {`
			`DEBUG(DEBUG_ERR,`
			`(__location__ " Failed to get lock on entire db - failing\n"));`
			`return -1;`
			`}`

			`ret = tdb_traverse_read(ctdb_db->ltdb->tdb, traverse_db_pull, &state);`
			`if (ret == -1) {`
			`DEBUG(DEBUG_ERR,`
			`(__location__ " Failed to get traverse db '%s'\n",`
			`ctdb_db->db_name));`
			`ctdb_lockdb_unmark(ctdb_db);`
			`return -1;`
			`}`

			`/* Last few records */`
			`if (state.recs != NULL) {`
			`TDB_DATA buffer;`

			`buffer = ctdb_marshall_finish(state.recs);`
			`ret = ctdb_daemon_send_message(state.ctdb, state.pnn,`
			`state.srvid, buffer);`
			`if (ret != 0) {`
			`TALLOC_FREE(state.recs);`
			`ctdb_lockdb_unmark(ctdb_db);`
			`return -1;`
			`}`

			`state.num_records += state.recs->count;`
			`TALLOC_FREE(state.recs);`
			`}`

			`ctdb_lockdb_unmark(ctdb_db);`

ctdb-daemon: Don't pull any records if records are invalidated This avoids unnecessary work during recovery to pull records from nodes that were INACTIVE just before the recovery. BUG: https://bugzilla.samba.org/show_bug.cgi?id=13641 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2018-02-14 06:27:32 +03:00			`done:`
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`outdata->dptr = talloc_size(outdata, sizeof(uint32_t));`
			`if (outdata->dptr == NULL) {`
			`DEBUG(DEBUG_ERR, (__location__ " Memory allocation error\n"));`
			`return -1;`
			`}`

			`memcpy(outdata->dptr, (uint8_t *)&state.num_records, sizeof(uint32_t));`
			`outdata->dsize = sizeof(uint32_t);`

			`return 0;`
			`}`

			`struct db_push_state {`
			`struct ctdb_context *ctdb;`
			`struct ctdb_db_context *ctdb_db;`
			`uint64_t srvid;`
			`uint32_t num_records;`
			`bool failed;`
			`};`

			`static void db_push_msg_handler(uint64_t srvid, TDB_DATA indata,`
			`void *private_data)`
			`{`
			`struct db_push_state *state = talloc_get_type(`
			`private_data, struct db_push_state);`
			`struct ctdb_marshall_buffer *recs;`
			`struct ctdb_rec_data_old *rec;`
ctdb-recovery: Fix signed/unsigned comparisons by declaring as unsigned Simple cases where variables need to be declared as an unsigned type instead of an int. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2019-05-23 01:43:58 +03:00			`unsigned int i;`
			`int ret;`
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00
			`if (state->failed) {`
			`return;`
			`}`

			`recs = (struct ctdb_marshall_buffer *)indata.dptr;`
			`rec = (struct ctdb_rec_data_old *)&recs->data[0];`

			`DEBUG(DEBUG_INFO, ("starting push of %u records for dbid 0x%x\n",`
			`recs->count, recs->db_id));`

			`for (i=0; i<recs->count; i++) {`
			`TDB_DATA key, data;`
			`struct ctdb_ltdb_header *hdr;`

			`key.dptr = &rec->data[0];`
			`key.dsize = rec->keylen;`
			`data.dptr = &rec->data[key.dsize];`
			`data.dsize = rec->datalen;`

			`if (data.dsize < sizeof(struct ctdb_ltdb_header)) {`
			`DEBUG(DEBUG_CRIT,(__location__ " bad ltdb record\n"));`
			`goto failed;`
			`}`

			`hdr = (struct ctdb_ltdb_header *)data.dptr;`
			`/* Strip off any read only record flags.`
ctdb:server: Fix code spelling Best reviewed with: `git show --word-diff` Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:36:23 +03:00			`* All readonly records are revoked implicitly by a recovery.`
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`*/`
			`hdr->flags &= ~CTDB_REC_RO_FLAGS;`

			`data.dptr += sizeof(*hdr);`
			`data.dsize -= sizeof(*hdr);`

			`ret = ctdb_ltdb_store(state->ctdb_db, key, hdr, data);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,`
			`(__location__ " Unable to store record\n"));`
			`goto failed;`
			`}`

			`rec = (struct ctdb_rec_data_old )(rec->length + (uint8_t )rec);`
			`}`

			`DEBUG(DEBUG_DEBUG, ("finished push of %u records for dbid 0x%x\n",`
			`recs->count, recs->db_id));`

			`state->num_records += recs->count;`
			`return;`

			`failed:`
			`state->failed = true;`
			`}`

			`int32_t ctdb_control_db_push_start(struct ctdb_context *ctdb, TDB_DATA indata)`
			`{`
			`struct ctdb_pulldb_ext *pulldb_ext;`
			`struct ctdb_db_context *ctdb_db;`
			`struct db_push_state *state;`
			`int ret;`

			`pulldb_ext = (struct ctdb_pulldb_ext *)indata.dptr;`

			`ctdb_db = find_ctdb_db(ctdb, pulldb_ext->db_id);`
			`if (ctdb_db == NULL) {`
			`DEBUG(DEBUG_ERR,`
			`(__location__ " Unknown db 0x%08x\n", pulldb_ext->db_id));`
			`return -1;`
			`}`

			`if (!ctdb_db_frozen(ctdb_db)) {`
			`DEBUG(DEBUG_ERR,`
			`("rejecting ctdb_control_db_push_start when not frozen\n"));`
			`return -1;`
			`}`

			`if (ctdb_db->push_started) {`
			`DEBUG(DEBUG_WARNING,`
			`(__location__ " DB push already started for %s\n",`
			`ctdb_db->db_name));`

			`/* De-register old state */`
			`state = (struct db_push_state *)ctdb_db->push_state;`
			`if (state != NULL) {`
			`srvid_deregister(ctdb->srv, state->srvid, state);`
			`talloc_free(state);`
			`ctdb_db->push_state = NULL;`
			`}`
			`}`

			`state = talloc_zero(ctdb_db, struct db_push_state);`
			`if (state == NULL) {`
			`DEBUG(DEBUG_ERR, (__location__ " Memory allocation error\n"));`
			`return -1;`
			`}`

			`state->ctdb = ctdb;`
			`state->ctdb_db = ctdb_db;`
			`state->srvid = pulldb_ext->srvid;`
			`state->failed = false;`

			`ret = srvid_register(ctdb->srv, state, state->srvid,`
			`db_push_msg_handler, state);`
			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR,`
			`(__location__ " Failed to register srvid for db push\n"));`
			`talloc_free(state);`
			`return -1;`
			`}`

			`if (ctdb_lockdb_mark(ctdb_db) != 0) {`
			`DEBUG(DEBUG_ERR,`
			`(__location__ " Failed to get lock on entire db - failing\n"));`
			`srvid_deregister(ctdb->srv, state->srvid, state);`
			`talloc_free(state);`
			`return -1;`
			`}`

			`ctdb_db->push_started = true;`
			`ctdb_db->push_state = state;`

			`return 0;`
			`}`

			`int32_t ctdb_control_db_push_confirm(struct ctdb_context *ctdb,`
			`TDB_DATA indata, TDB_DATA *outdata)`
			`{`
			`uint32_t db_id;`
			`struct ctdb_db_context *ctdb_db;`
			`struct db_push_state *state;`

			`db_id = (uint32_t )indata.dptr;`

			`ctdb_db = find_ctdb_db(ctdb, db_id);`
			`if (ctdb_db == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Unknown db 0x%08x\n", db_id));`
			`return -1;`
			`}`

			`if (!ctdb_db_frozen(ctdb_db)) {`
			`DEBUG(DEBUG_ERR,`
			`("rejecting ctdb_control_db_push_confirm when not frozen\n"));`
			`return -1;`
			`}`

			`if (!ctdb_db->push_started) {`
			`DEBUG(DEBUG_ERR, (__location__ " DB push not started\n"));`
			`return -1;`
			`}`

ctdb-daemon: Add accessors for CTDB_DB_FLAGS_READONLY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:44:48 +03:00			`if (ctdb_db_readonly(ctdb_db)) {`
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`DEBUG(DEBUG_ERR,`
			`("Clearing the tracking database for dbid 0x%x\n",`
			`ctdb_db->db_id));`
			`if (tdb_wipe_all(ctdb_db->rottdb) != 0) {`
			`DEBUG(DEBUG_ERR,`
			`("Failed to wipe tracking database for 0x%x."`
			`" Dropping read-only delegation support\n",`
			`ctdb_db->db_id));`
			`tdb_close(ctdb_db->rottdb);`
			`ctdb_db->rottdb = NULL;`
ctdb-daemon: Add accessors for CTDB_DB_FLAGS_READONLY flag Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-03-02 07:44:48 +03:00			`ctdb_db_reset_readonly(ctdb_db);`
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`}`

			`while (ctdb_db->revokechild_active != NULL) {`
			`talloc_free(ctdb_db->revokechild_active);`
			`}`
			`}`

			`ctdb_lockdb_unmark(ctdb_db);`

			`state = (struct db_push_state *)ctdb_db->push_state;`
			`if (state == NULL) {`
			`DEBUG(DEBUG_ERR, (__location__ " Missing push db state\n"));`
			`return -1;`
			`}`

			`srvid_deregister(ctdb->srv, state->srvid, state);`

			`outdata->dptr = talloc_size(outdata, sizeof(uint32_t));`
			`if (outdata->dptr == NULL) {`
			`DEBUG(DEBUG_ERR, (__location__ " Memory allocation error\n"));`
			`talloc_free(state);`
			`ctdb_db->push_state = NULL;`
			`return -1;`
			`}`

			`memcpy(outdata->dptr, (uint8_t *)&state->num_records, sizeof(uint32_t));`
			`outdata->dsize = sizeof(uint32_t);`

			`talloc_free(state);`
ctdb-daemon: Reset push_started flag once DB_PUSH_CONFIRM is done Once DB_PUSH_START is processed as part of recovery, push_started flag tracks if there are multiple attempts to send DB_PUSH_START. In DB_PUSH_CONFIRM, once the record count is confirmed, all information related to DB_PUSH should be reset. However, The push_started flag was not reset when the push_state was reset. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Jun 8 14:31:52 CEST 2016 on sn-devel-144 2016-06-08 08:04:52 +03:00			`ctdb_db->push_started = false;`
ctdb-daemon: Implement new controls DB_PULL and DB_PUSH_START/DB_PUSH_CONFIRM Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-02-19 09:32:09 +03:00			`ctdb_db->push_state = NULL;`

			`return 0;`
			`}`

ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`struct set_recmode_state {`
ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`struct ctdb_context *ctdb;`
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`struct ctdb_req_control_old *c;`
			`};`

ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`static void set_recmode_handler(char status,`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00			`double latency,`
ctdb-recovery: Use a configurable handler when testing cluster mutex Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-12 05:35:47 +03:00			`void *private_data)`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00			`{`
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`struct set_recmode_state *state = talloc_get_type_abort(`
			`private_data, struct set_recmode_state);`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00			`int s = 0;`
			`const char *err = NULL;`

			`switch (status) {`
			`case '0':`
			`/* Mutex taken */`
			`DEBUG(DEBUG_ERR,`
			`("ERROR: Daemon able to take recovery lock on \"%s\" during recovery\n",`
ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`state->ctdb->recovery_lock));`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00			`s = -1;`
			`err = "Took recovery lock from daemon during recovery - probably a cluster filesystem lock coherence problem";`
			`break;`

			`case '1':`
			`/* Contention */`
			`DEBUG(DEBUG_DEBUG, (__location__ " Recovery lock check OK\n"));`
ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`state->ctdb->recovery_mode = CTDB_RECOVERY_NORMAL;`
			`ctdb_process_deferred_attach(state->ctdb);`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00
			`s = 0;`

ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`CTDB_UPDATE_RECLOCK_LATENCY(state->ctdb, "daemon reclock",`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00			`reclock.ctdbd, latency);`
			`break;`

			`case '2':`
			`/* Timeout. Consider this a success, not a failure,`
			`* as we failed to set the recovery lock which is what`
			`* we wanted. This can be caused by the cluster`
			`* filesystem being very slow to arbitrate locks`
			`* immediately after a node failure. */`
			`DEBUG(DEBUG_WARNING,`
			`(__location__`
			`"Time out getting recovery lock, allowing recmode set anyway\n"));`
ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`state->ctdb->recovery_mode = CTDB_RECOVERY_NORMAL;`
			`ctdb_process_deferred_attach(state->ctdb);`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00
			`s = 0;`
			`break;`

			`default:`
			`DEBUG(DEBUG_ERR,`
			`("Unexpected error when testing recovery lock\n"));`
			`s = -1;`
			`err = "Unexpected error when testing recovery lock";`
			`}`

ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`ctdb_request_control_reply(state->ctdb, state->c, NULL, s, err);`
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`talloc_free(state);`
ctdb-recovery: Factor out new function set_recmode_handler() This is used to reply to the recmode control for all the different cases. The callers can later be generalised to use a pointer, which can then be used for recovery lock handling in different contexts. Note that the handle is now freed in set_recmode_handler() rather than the callbacks. There is one difference in behaviour. Deferred attach calls are now processed in the timeout case, where they weren't before. That's a bug fix! Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 08:35:35 +03:00			`}`

add a context and a timed event so that once we have been in recovery mode for too long we drop all public ip addresses (This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0) 2008-10-22 04:04:41 +04:00			`static void`
ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`ctdb_drop_all_ips_event(struct tevent_context ev, struct tevent_timer te,`
			`struct timeval t, void *private_data)`
add a context and a timed event so that once we have been in recovery mode for too long we drop all public ip addresses (This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0) 2008-10-22 04:04:41 +04:00			`{`
			`struct ctdb_context *ctdb = talloc_get_type(private_data, struct ctdb_context);`

increase the loglevel for the message we print when we automatically release all ips when we have been in recovery for too long (This used to be ctdb commit 7af060ded5113a49832f6a08a942523a202586b3) 2009-04-24 12:09:51 +04:00			`DEBUG(DEBUG_ERR,(__location__ " Been in recovery mode for too long. Dropping all IPS\n"));`
add a context and a timed event so that once we have been in recovery mode for too long we drop all public ip addresses (This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0) 2008-10-22 04:04:41 +04:00			`talloc_free(ctdb->release_ips_ctx);`
			`ctdb->release_ips_ctx = NULL;`

			`ctdb_release_all_ips(ctdb);`
			`}`

Add a new tunable : DisableIPFailover that when set to non 0 will stopp any ip reallocations at all from happening. (This used to be ctdb commit d8d37493478a26c5f1809a5f3df89ffd6e149281) 2010-11-09 07:19:06 +03:00			`/*`
			`* Set up an event to drop all public ips if we remain in recovery for too`
			`* long`
			`*/`
			`int ctdb_deferred_drop_all_ips(struct ctdb_context *ctdb)`
			`{`
			`if (ctdb->release_ips_ctx != NULL) {`
			`talloc_free(ctdb->release_ips_ctx);`
			`}`
			`ctdb->release_ips_ctx = talloc_new(ctdb);`
			`CTDB_NO_MEMORY(ctdb, ctdb->release_ips_ctx);`

ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, ctdb->release_ips_ctx,`
			`timeval_current_ofs(ctdb->tunable.recovery_drop_all_ips, 0),`
			`ctdb_drop_all_ips_event, ctdb);`
Add a new tunable : DisableIPFailover that when set to non 0 will stopp any ip reallocations at all from happening. (This used to be ctdb commit d8d37493478a26c5f1809a5f3df89ffd6e149281) 2010-11-09 07:19:06 +03:00			`return 0;`
			`}`

added lockwait child code for entering recovery mode. A child processes holds lockall locks for the entire recovery process (This used to be ctdb commit f892f30def75b0d964c35eae38c4cf675597dd28) 2007-05-12 08:34:21 +04:00			`/*`
			`set the recovery mode`
			`*/`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`int32_t ctdb_control_set_recmode(struct ctdb_context *ctdb,`
ctdb-daemon: Rename struct ctdb_req_control to ctdb_req_control_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:42:05 +03:00			`struct ctdb_req_control_old *c,`
- make calling of recovered event script async - shutdown sockets before calling shutdown script (This used to be ctdb commit c5e099feef94a014a77742b6cc1d0afe78ef9da9) 2007-06-02 02:41:19 +04:00			`TDB_DATA indata, bool *async_reply,`
added error messages in ctdb_control replies (This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5) 2007-05-12 15:25:26 +04:00			`const char **errormsg)`
added lockwait child code for entering recovery mode. A child processes holds lockall locks for the entire recovery process (This used to be ctdb commit f892f30def75b0d964c35eae38c4cf675597dd28) 2007-05-12 08:34:21 +04:00			`{`
separate out the freeze/thaw handling from recovery (This used to be ctdb commit 0b0640bd8b8334961f240e0cf276ac112cd6e616) 2007-05-12 09:15:27 +04:00			`uint32_t recmode = (uint32_t )indata.dptr;`
ctdb-daemon: Add a check for database generation consistency Before setting recovery mode to normal, confirm that all the databases are recovered by matching the database generation with the global generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-11 09:14:12 +03:00			`struct ctdb_db_context *ctdb_db;`
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`struct set_recmode_state *state;`
ctdb-recovery: Factor out reclock testing into ctdb_cluster_mutex() This is currently only used to check whether the recovery lock can be taken. However, name it more generally in anticipation of using it for general cluster mutex taking and testing. No functional changes. A couple of debug message simplifications and code rearrangements. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-12 06:18:27 +03:00			`struct ctdb_cluster_mutex_handle *h;`
- make specification of a recovery lock file compulsory - die if someone other than the recmaster can get the recovery lock (This used to be ctdb commit a827d0d0e430ca8ad5d521367e45097185492869) 2007-06-02 05:36:42 +04:00
ctdb-recovery: Setting up of recmode should be idempotent BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 If the recovery mode is already set to the expected value, there is nothing to do. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-06-22 07:49:02 +03:00			`if (recmode == ctdb->recovery_mode) {`
			`D_INFO("Recovery mode already set to %s\n",`
			`recmode == CTDB_RECOVERY_NORMAL ? "NORMAL" : "ACTIVE");`
			`return 0;`
			`}`

ctdb-recovery: Simplify logging of recovery mode setting BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-06-22 07:52:32 +03:00			`D_NOTICE("Recovery mode set to %s\n",`
			`recmode == CTDB_RECOVERY_NORMAL ? "NORMAL" : "ACTIVE");`

add a context and a timed event so that once we have been in recovery mode for too long we drop all public ip addresses (This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0) 2008-10-22 04:04:41 +04:00			`/* if we enter recovery but stay in recovery for too long`
			`we will eventually drop all our ip addresses`
			`*/`
ctdb-recovery: Finish processing for recovery mode ACTIVE first BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 This simplifies the code and avoids complicated conditions. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-06-22 07:09:32 +03:00			`if (recmode == CTDB_RECOVERY_ACTIVE) {`
Add a new tunable : DisableIPFailover that when set to non 0 will stopp any ip reallocations at all from happening. (This used to be ctdb commit d8d37493478a26c5f1809a5f3df89ffd6e149281) 2010-11-09 07:19:06 +03:00			`if (ctdb_deferred_drop_all_ips(ctdb) != 0) {`
ctdb-recovery: Finish processing for recovery mode ACTIVE first BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 This simplifies the code and avoids complicated conditions. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-06-22 07:09:32 +03:00			`D_ERR("Failed to set up deferred drop all ips\n");`
Add a new tunable : DisableIPFailover that when set to non 0 will stopp any ip reallocations at all from happening. (This used to be ctdb commit d8d37493478a26c5f1809a5f3df89ffd6e149281) 2010-11-09 07:19:06 +03:00			`}`
add a context and a timed event so that once we have been in recovery mode for too long we drop all public ip addresses (This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0) 2008-10-22 04:04:41 +04:00
ctdb-recovery: Finish processing for recovery mode ACTIVE first BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 This simplifies the code and avoids complicated conditions. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-06-22 07:09:32 +03:00			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
- make specification of a recovery lock file compulsory - die if someone other than the recmaster can get the recovery lock (This used to be ctdb commit a827d0d0e430ca8ad5d521367e45097185492869) 2007-06-02 05:36:42 +04:00			`return 0;`
- make calling of recovered event script async - shutdown sockets before calling shutdown script (This used to be ctdb commit c5e099feef94a014a77742b6cc1d0afe78ef9da9) 2007-06-02 02:41:19 +04:00			`}`
- make specification of a recovery lock file compulsory - die if someone other than the recmaster can get the recovery lock (This used to be ctdb commit a827d0d0e430ca8ad5d521367e45097185492869) 2007-06-02 05:36:42 +04:00
ctdb-recovery: Don't store recmode in recovery mode state The callbacks that use this value are only ever called if recovery mode is being set to NORMAL. So do not check if recmode is NORMAL either. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-11 05:58:54 +03:00			`/* From this point: recmode == CTDB_RECOVERY_NORMAL`
			`*`
			`* Therefore, what follows is special handling when setting`
			`* recovery mode back to normal */`
test (This used to be ctdb commit 4f2d722cf29175c3c207e6ebb6d4f9e370767249) 2008-06-26 08:14:37 +04:00
ctdb-recovery: Finish processing for recovery mode ACTIVE first BUG: https://bugzilla.samba.org/show_bug.cgi?id=12857 This simplifies the code and avoids complicated conditions. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-06-22 07:09:32 +03:00			`TALLOC_FREE(ctdb->release_ips_ctx);`

ctdb-daemon: Add a check for database generation consistency Before setting recovery mode to normal, confirm that all the databases are recovered by matching the database generation with the global generation. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-09-11 09:14:12 +03:00			`for (ctdb_db = ctdb->db_list; ctdb_db != NULL; ctdb_db = ctdb_db->next) {`
			`if (ctdb_db->generation != ctdb->vnn_map->generation) {`
			`DEBUG(DEBUG_ERR,`
			`("Inconsistent DB generation %u for %s\n",`
			`ctdb_db->generation, ctdb_db->db_name));`
			`DEBUG(DEBUG_ERR, ("Recovery mode set to ACTIVE\n"));`
			`return -1;`
			`}`
			`}`

initial attempt at freezing databases in priority order (This used to be ctdb commit e8d692590da1070c87a4144031e3306d190ebed2) 2009-10-12 05:08:39 +04:00			`/* force the databases to thaw */`
ctdb-daemon: Drop priorites from freeze/thaw code Parallel database recovery freezes databases in parallel and irrespective of database priority. So drop priority from freeze/thaw code. Database priority will be dropped completely soon. Now FREEZE and THAW controls operate on all the databases. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2016-07-19 09:32:16 +03:00			`if (ctdb_db_all_frozen(ctdb)) {`
			`ctdb_control_thaw(ctdb, false);`
test (This used to be ctdb commit 4f2d722cf29175c3c207e6ebb6d4f9e370767249) 2008-06-26 08:14:37 +04:00			`}`

ctdb-daemon: Rename recovery lock file to just recovery lock It isn't necessarily a file. Don't bother changing the control, since it doesn't pervade the code. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-05-17 11:28:56 +03:00			`if (ctdb->recovery_lock == NULL) {`
ctdb-daemon: Mark tunable VerifyRecoveryLock as obsolete It is pointless having a recovery lock but not sanity checking that it is working. Also, the logic that uses this tunable is confusing. In some places the recovery lock is released unnecessarily because the tunable isn't set. Simplify the logic by assuming that if a recovery lock is specified then it should be verified. Update documentation that references this tunable. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2014-12-09 05:47:42 +03:00			`/* Not using recovery lock file */`
ctdb-recover: Avoid duplicate deferred attach processing Deferred attach processing is done unconditionally at this point. It is then done again if recovery lock checking is done and completes successfuly. If the recovery lock checking fails then it should not be done at all. Move this processing so it is done with the early exit when the recovery lock is not being used. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-28 10:11:22 +03:00			`ctdb->recovery_mode = CTDB_RECOVERY_NORMAL;`
			`ctdb_process_deferred_attach(ctdb);`
Add a new variable VerifyRecoveryLock which can be used to disable the test that the recovery daemon holds the lock properly when performing a recovery (This used to be ctdb commit 329df9e47e6ca8ab5143985a999e68f37c6d88a5) 2009-04-30 19:18:27 +04:00			`return 0;`
			`}`

ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`state = talloc_zero(ctdb, struct set_recmode_state);`
			`if (state == NULL) {`
			`DEBUG(DEBUG_ERR, (__location__ " out of memory\n"));`
			`return -1;`
			`}`
ctdb-cluster-mutex: Drop cluster_mutex_handler() ctdb and handle arguments This makes the API more general. If they are needed in a handler then they can be in the private data. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:10:26 +03:00			`state->ctdb = ctdb;`
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`state->c = NULL;`

ctdb-cluster-mutex: ctdb_cluster_mutex() registers handler and private data Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 11:56:33 +03:00			`h = ctdb_cluster_mutex(state, ctdb, ctdb->recovery_lock, 5,`
ctdb-cluster-mutex: Register an extra handler for when mutex is lost Pass NULL if not needed. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 12:05:47 +03:00			`set_recmode_handler, state, NULL, NULL);`
ctdb-recovery: Factor out reclock testing into ctdb_cluster_mutex() This is currently only used to check whether the recovery lock can be taken. However, name it more generally in anticipation of using it for general cluster mutex taking and testing. No functional changes. A couple of debug message simplifications and code rearrangements. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-01-12 06:18:27 +03:00			`if (h == NULL) {`
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`talloc_free(state);`
add back the test inside the daemon that if someone asks us to drop recovery mode back to NORMAL that we can not lock the reclock file since at this stage it MUST be locked by the recovery daemon. in order to avoid a non-blocking fnctl() lock from blocking and cause "issues" we move the 'test that we can not lock reclock file' into a child process. (This used to be ctdb commit 3af994641ec2234e37da1fa1f693441586471a7e) 2007-10-16 09:27:07 +04:00			`return -1;`
- make specification of a recovery lock file compulsory - die if someone other than the recmaster can get the recovery lock (This used to be ctdb commit a827d0d0e430ca8ad5d521367e45097185492869) 2007-06-02 05:36:42 +04:00			`}`
add back the test inside the daemon that if someone asks us to drop recovery mode back to NORMAL that we can not lock the reclock file since at this stage it MUST be locked by the recovery daemon. in order to avoid a non-blocking fnctl() lock from blocking and cause "issues" we move the 'test that we can not lock reclock file' into a child process. (This used to be ctdb commit 3af994641ec2234e37da1fa1f693441586471a7e) 2007-10-16 09:27:07 +04:00
ctdb-recovery: Wrap private data for reclock test callback This will allow a simplification of the cluster mutex API, so the private data can be registered when calling ctdb_cluster_mutex(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-01 10:45:36 +03:00			`state->c = talloc_steal(state, c);`
- make specification of a recovery lock file compulsory - die if someone other than the recmaster can get the recovery lock (This used to be ctdb commit a827d0d0e430ca8ad5d521367e45097185492869) 2007-06-02 05:36:42 +04:00			`*async_reply = true;`

separate out the freeze/thaw handling from recovery (This used to be ctdb commit 0b0640bd8b8334961f240e0cf276ac112cd6e616) 2007-05-12 09:15:27 +04:00			`return 0;`
added lockwait child code for entering recovery mode. A child processes holds lockall locks for the entire recovery process (This used to be ctdb commit f892f30def75b0d964c35eae38c4cf675597dd28) 2007-05-12 08:34:21 +04:00			`}`
merge from tridge (This used to be ctdb commit 7bca79ad6357149fd7c6b28ce4b05de3d223a7de) 2007-05-14 00:25:15 +04:00
- moved cmdline options that are only relevant to ctdbd into ctdbd.c - fixed a valgrind error on failing to send a control - don't mark node dead when already disconnected - moved node list lock code into common code (This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b) 2007-06-02 04:03:28 +04:00
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`/*`
			`delete a record as part of the vacuum process`
			`only delete if we are not lmaster or dmaster, and our rsn is <= the provided rsn`
			`use non-blocking locks`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00
			`return 0 if the record was successfully deleted (i.e. it does not exist`
			`when the function returns)`
			`or !0 is the record still exists in the tdb after returning.`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`*/`
ctdb-daemon: Rename struct ctdb_rec_data to ctdb_rec_data_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:30:30 +03:00			`static int delete_tdb_record(struct ctdb_context ctdb, struct ctdb_db_context ctdb_db, struct ctdb_rec_data_old *rec)`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`{`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`TDB_DATA key, data, data2;`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`struct ctdb_ltdb_header hdr, hdr2;`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00
ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`/* these are really internal tdb functions - but we need them here for`
			`non-blocking lock of the freelist */`
			`int tdb_lock_nonblock(struct tdb_context *tdb, int list, int ltype);`
			`int tdb_unlock(struct tdb_context *tdb, int list, int ltype);`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00

			`key.dsize = rec->keylen;`
			`key.dptr = &rec->data[0];`
			`data.dsize = rec->datalen;`
			`data.dptr = &rec->data[rec->keylen];`

			`if (ctdb_lmaster(ctdb, &key) == ctdb->pnn) {`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_INFO("Called delete on record where we are lmaster\n");`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`return -1;`
			`}`

			`if (data.dsize != sizeof(struct ctdb_ltdb_header)) {`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_ERR("Bad record size\n");`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`return -1;`
			`}`

			`hdr = (struct ctdb_ltdb_header *)data.dptr;`

			`/* use a non-blocking lock */`
			`if (tdb_chainlock_nonblock(ctdb_db->ltdb->tdb, key) != 0) {`
ctdb-daemon Add extra debug during record deletion for vacuuming It isn't currently possible to distinguish these 2 cases. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2018-10-15 13:21:25 +03:00			`DBG_INFO("Failed to get non-blocking chain lock\n");`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`return -1;`
			`}`

vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`data2 = tdb_fetch(ctdb_db->ltdb->tdb, key);`
			`if (data2.dptr == NULL) {`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
			`return 0;`
			`}`

vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`if (data2.dsize < sizeof(struct ctdb_ltdb_header)) {`
ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`if (tdb_lock_nonblock(ctdb_db->ltdb->tdb, -1, F_WRLCK) == 0) {`
Check return value of tdb_delete() Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 5cdcc3d45d358ddbcd7e864898eed9cbd9935429) 2012-11-19 14:20:31 +04:00			`if (tdb_delete(ctdb_db->ltdb->tdb, key) != 0) {`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_ERR("Failed to delete corrupt record\n");`
Check return value of tdb_delete() Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 5cdcc3d45d358ddbcd7e864898eed9cbd9935429) 2012-11-19 14:20:31 +04:00			`}`
ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`tdb_unlock(ctdb_db->ltdb->tdb, -1, F_WRLCK);`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_ERR("Deleted corrupt record\n");`
ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`}`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`return 0;`
			`}`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`hdr2 = (struct ctdb_ltdb_header *)data2.dptr;`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00
			`if (hdr2->rsn > hdr->rsn) {`
			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_INFO("Skipping record with rsn=%llu - called with rsn=%llu\n",`
			`(unsigned long long)hdr2->rsn,`
			`(unsigned long long)hdr->rsn);`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return -1;`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`}`

READONLY: skip vacuuming or deleting records with readonly delegations. they are hot. wait until they have been revoked before we recall them. (This used to be ctdb commit 7417d994c2a159f71d27d4bcd2f53684862eece3) 2012-02-29 09:09:24 +04:00			`/* do not allow deleting record that have readonly flags set. */`
recover: use CTDB_REC_RO_FLAGS where appropriate Signed-off-by: Michael Adam <obnox@samba.org> Reviewed-By: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit b5a8791268e938d7e017056e0e2bd2cbec1fa690) 2013-04-19 18:24:32 +04:00			`if (hdr->flags & CTDB_REC_RO_FLAGS) {`
READONLY: skip vacuuming or deleting records with readonly delegations. they are hot. wait until they have been revoked before we recall them. (This used to be ctdb commit 7417d994c2a159f71d27d4bcd2f53684862eece3) 2012-02-29 09:09:24 +04:00			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_INFO("Skipping record with readonly flags set\n");`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return -1;`
READONLY: skip vacuuming or deleting records with readonly delegations. they are hot. wait until they have been revoked before we recall them. (This used to be ctdb commit 7417d994c2a159f71d27d4bcd2f53684862eece3) 2012-02-29 09:09:24 +04:00			`}`
recover: use CTDB_REC_RO_FLAGS where appropriate Signed-off-by: Michael Adam <obnox@samba.org> Reviewed-By: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit b5a8791268e938d7e017056e0e2bd2cbec1fa690) 2013-04-19 18:24:32 +04:00			`if (hdr2->flags & CTDB_REC_RO_FLAGS) {`
READONLY: skip vacuuming or deleting records with readonly delegations. they are hot. wait until they have been revoked before we recall them. (This used to be ctdb commit 7417d994c2a159f71d27d4bcd2f53684862eece3) 2012-02-29 09:09:24 +04:00			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_INFO("Skipping record with readonly flags set locally\n");`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return -1;`
READONLY: skip vacuuming or deleting records with readonly delegations. they are hot. wait until they have been revoked before we recall them. (This used to be ctdb commit 7417d994c2a159f71d27d4bcd2f53684862eece3) 2012-02-29 09:09:24 +04:00			`}`

added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`if (hdr2->dmaster == ctdb->pnn) {`
			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_INFO("Attempted delete record where we are the dmaster\n");`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return -1;`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`}`

ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`if (tdb_lock_nonblock(ctdb_db->ltdb->tdb, -1, F_WRLCK) != 0) {`
			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
ctdb-daemon Add extra debug during record deletion for vacuuming It isn't currently possible to distinguish these 2 cases. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2018-10-15 13:21:25 +03:00			`DBG_INFO("Failed to get non-blocking freelist lock\n");`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return -1;`
ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`}`

added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`if (tdb_delete(ctdb_db->ltdb->tdb, key) != 0) {`
ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`tdb_unlock(ctdb_db->ltdb->tdb, -1, F_WRLCK);`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
ctdb-daemon: Modernise debug during record deletion for vacuuming Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Dec 18 10:13:50 CET 2018 on sn-devel-144 2018-10-24 04:29:54 +03:00			`DBG_INFO("Failed to delete record\n");`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return -1;`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`}`

ensure the main daemon doesn't use a blocking lock on the freelist (This used to be ctdb commit 73f8257906b09e6516f675883d8e7a3c455ad869) 2008-01-08 14:31:48 +03:00			`tdb_unlock(ctdb_db->ltdb->tdb, -1, F_WRLCK);`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`tdb_chainunlock(ctdb_db->ltdb->tdb, key);`
vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2) This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6) 2013-08-12 09:50:30 +04:00			`free(data2.dptr);`
			`return 0;`
added two new ctdb commands: ctdb vacuum : vacuums all the databases, deleting any zero length ctdb records ctdb repack : repacks all the databases, resulting in a perfectly packed database with no freelist entries (This used to be ctdb commit 3532119c84ab3247051ed6ba21ba3243ae2f6bf4) 2008-01-08 09:23:27 +03:00			`}`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00

Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`struct recovery_callback_state {`
ctdb-daemon: Rename struct ctdb_req_control to ctdb_req_control_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:42:05 +03:00			`struct ctdb_req_control_old *c;`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`};`


			`/*`
			`called when the 'recovered' event script has finished`
			`*/`
			`static void ctdb_end_recovery_callback(struct ctdb_context ctdb, int status, void p)`
			`{`
			`struct recovery_callback_state *state = talloc_get_type(p, struct recovery_callback_state);`

Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7) 2010-09-29 04:38:41 +04:00			`CTDB_INCREMENT_STAT(ctdb, num_recoveries);`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00
			`if (status != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " recovered event script failed (status %d)\n", status));`
ctdb-daemon: Switch to using ETIMEDOUT instead of ETIME BUG: https://bugzilla.samba.org/show_bug.cgi?id=13520 Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2018-07-10 11:18:33 +03:00			`if (status == -ETIMEDOUT) {`
eventscript: handle banning within the callbacks Currently the timeout handler in eventscript.c does the banning if a timeout happens. However, because monitor events are different, it has to special case them. As we call the callback anyway in this case, we should make that handle -ETIME as it sees fit: for everyone but the monitor event, we simply ban ourselves. The more complicated monitor event banning logic is now in ctdb_monitor.c where it belongs. Note: I wrapped the other bans in "if (status == -ETIME)", though they should probably ban themselves on any error. This change should be a noop. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ecee127e19a9e7cae114a66f3514ee7a75276c5) 2009-12-07 16:18:57 +03:00			`ctdb_ban_self(ctdb);`
			`}`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`}`

			`ctdb_request_control_reply(ctdb, state->c, NULL, status, NULL);`
			`talloc_free(state);`

track both when we last started and ended a recovery. make ctdb uptime print how long the recovery took in the recovery daemon when we check that the public ip address allocation on the local node is correct (we have the ips we should have and we dont have any we shouldnt have) use ctdb uptime and check the recovery start/stop times and make sure we dont check for ip allocation inconsistencies during a recovery where the ip address allocation is in flux. (This used to be ctdb commit f86551580349b7f662f9a07e4eb0c1189e38e429) 2008-07-02 07:55:59 +04:00			`gettimeofday(&ctdb->last_recovery_finished, NULL);`
ctdbd: Add new runstate CTDB_RUNSTATE_FIRST_RECOVERY This adds more serialisation to the startup, ensuring that the "startup" event runs after everything to do with the first recovery (including the "recovered" event). Given that it now takes longer to get to the "startup" state, the initscript needs to wait until ctdbd gets to "first_recovery". Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ed6814ff0a59ddbb1c1b3128b505380f60d7aeb7) 2013-04-18 14:30:14 +04:00
			`if (ctdb->runstate == CTDB_RUNSTATE_FIRST_RECOVERY) {`
			`ctdb_set_runstate(ctdb, CTDB_RUNSTATE_STARTUP);`
			`}`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`}`

			`/*`
			`recovery has finished`
			`*/`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`int32_t ctdb_control_end_recovery(struct ctdb_context *ctdb,`
ctdb-daemon: Rename struct ctdb_req_control to ctdb_req_control_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 08:42:05 +03:00			`struct ctdb_req_control_old *c,`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`bool *async_reply)`
			`{`
			`int ret;`
			`struct recovery_callback_state *state;`

ctdb-daemon: Increase priority of logs when recovery happens Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-07-04 08:50:12 +03:00			`DEBUG(DEBUG_ERR,("Recovery has finished\n"));`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00
recover: finish pending trans3 commits when a recovery is finished. When the end_recovery control is received, pending trans3 commits are finished. During the recovery, all the actions like persistent_callback and persistent_store_timeout had been disabled to let the recovery do its job. After the recover is completed, send the reply to the waiting clients. (This used to be ctdb commit f7dfeb7143f574c2434f7dd16917380dfd1f4f64) 2011-02-23 19:39:57 +03:00			`ctdb_persistent_finish_trans3_commits(ctdb);`

merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`state = talloc(ctdb, struct recovery_callback_state);`
			`CTDB_NO_MEMORY(ctdb, state);`

In ctdb_control_end_recovery, We used to talloc_steal c (the command packet) and make it a child of the "event script state context". If we failed to create a eventscript child context for some reason, this would have talloc freed state, but at the same time it would also implicitely have freed c. Once ctdb_control_end_recovery() returns the error back to the caller, the caller would dereference both c, and also outdata which is a child of c and we would either read garbage data or segv. Change the ordering so we only talloc_steal c as a child of state IFF we have successfully created a child context for the script. BZ61068 (This used to be ctdb commit 259054c3632e42bbaa614ee7e888e6e850733d60) 2010-02-23 04:43:49 +03:00			`state->c = c;`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00
eventscript: put timeout inside ctdb_event_script_callback_v Everyone uses the same timeout value, so just remove it from the API. If we ever need variable timeouts, that might as well be central too. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 533c3e053293941d2a9484b495e78d45f478bb08) 2009-11-24 03:39:46 +03:00			`ret = ctdb_event_script_callback(ctdb, state,`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`ctdb_end_recovery_callback,`
			`state,`
Add flag to ctdb_event_script_callback indicating when called by client. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a1d654a982ca56fade82552f4e6b5586236d3233) 2009-11-26 07:49:49 +03:00			`CTDB_EVENT_RECOVERED, "%s", "");`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00
			`if (ret != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " Failed to end recovery\n"));`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`talloc_free(state);`
			`return -1;`
			`}`

			`/* tell the control that we will be reply asynchronously */`
In ctdb_control_end_recovery, We used to talloc_steal c (the command packet) and make it a child of the "event script state context". If we failed to create a eventscript child context for some reason, this would have talloc freed state, but at the same time it would also implicitely have freed c. Once ctdb_control_end_recovery() returns the error back to the caller, the caller would dereference both c, and also outdata which is a child of c and we would either read garbage data or segv. Change the ordering so we only talloc_steal c as a child of state IFF we have successfully created a child context for the script. BZ61068 (This used to be ctdb commit 259054c3632e42bbaa614ee7e888e6e850733d60) 2010-02-23 04:43:49 +03:00			`state->c = talloc_steal(state, c);`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`*async_reply = true;`
			`return 0;`
			`}`

			`/*`
			`called when the 'startrecovery' event script has finished`
			`*/`
			`static void ctdb_start_recovery_callback(struct ctdb_context ctdb, int status, void p)`
			`{`
			`struct recovery_callback_state *state = talloc_get_type(p, struct recovery_callback_state);`

			`if (status != 0) {`
merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c) 2008-02-04 12:07:15 +03:00			`DEBUG(DEBUG_ERR,(__location__ " startrecovery event script failed (status %d)\n", status));`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`}`

			`ctdb_request_control_reply(ctdb, state->c, NULL, status, NULL);`
			`talloc_free(state);`
			`}`

ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`static void run_start_recovery_event(struct ctdb_context *ctdb,`
			`struct recovery_callback_state *state)`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`{`
			`int ret;`

eventscript: put timeout inside ctdb_event_script_callback_v Everyone uses the same timeout value, so just remove it from the API. If we ever need variable timeouts, that might as well be central too. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 533c3e053293941d2a9484b495e78d45f478bb08) 2009-11-24 03:39:46 +03:00			`ret = ctdb_event_script_callback(ctdb, state,`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`ctdb_start_recovery_callback,`
ctdb-daemon: No need to call event scripts with CTDB_CALLED_BY_USER This was added to support external monitoring using CTDB event scripts. However, it was never used. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2013-12-16 08:57:42 +04:00			`state,`
Add flag to ctdb_event_script_callback indicating when called by client. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a1d654a982ca56fade82552f4e6b5586236d3233) 2009-11-26 07:49:49 +03:00			`CTDB_EVENT_START_RECOVERY,`
eventscript: introduce enum for different event script calls. Rather than doing strcmp everywhere, pass an explicit enum around. This also subtly documents what options are available. The "options" arg is now used for extra arguments only. Unfortunately, gcc complains on empty format strings, so we make ctdb_event_script() take no varargs, and add ctdb_event_script_args(). We leave ctdb_event_script_callback() taking varargs, which means callers have to do "%s", "". For the moment, we have CTDB_EVENT_UNKNOWN for handling forced scripts from the ctdb tool. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 8001488be4f2beb25e943fe01b2afc2e8779930d) 2009-11-24 03:46:49 +03:00			`"%s", "");`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00
			`if (ret != 0) {`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`DEBUG(DEBUG_ERR,("Unable to run startrecovery event\n"));`
			`ctdb_request_control_reply(ctdb, state->c, NULL, -1, NULL);`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`talloc_free(state);`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`return;`
			`}`

			`return;`
			`}`

			`static bool reclock_strings_equal(const char a, const char b)`
			`{`
			`return (a == NULL && b == NULL) \|\|`
			`(a != NULL && b != NULL && strcmp(a, b) == 0);`
			`}`

			`static void start_recovery_reclock_callback(struct ctdb_context *ctdb,`
			`int32_t status,`
			`TDB_DATA data,`
			`const char *errormsg,`
			`void *private_data)`
			`{`
			`struct recovery_callback_state *state = talloc_get_type_abort(`
			`private_data, struct recovery_callback_state);`
ctdb-daemon: Rename recovery lock file to just recovery lock It isn't necessarily a file. Don't bother changing the control, since it doesn't pervade the code. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-05-17 11:28:56 +03:00			`const char *local = ctdb->recovery_lock;`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`const char *remote = NULL;`

			`if (status != 0) {`
			`DEBUG(DEBUG_ERR, (__location__ " GET_RECLOCK failed\n"));`
			`ctdb_request_control_reply(ctdb, state->c, NULL,`
			`status, errormsg);`
			`talloc_free(state);`
			`return;`
			`}`

			`/* Check reclock consistency */`
			`if (data.dsize > 0) {`
			`/* Ensure NUL-termination */`
			`data.dptr[data.dsize-1] = '\0';`
			`remote = (const char *)data.dptr;`
			`}`
			`if (! reclock_strings_equal(local, remote)) {`
			`/* Inconsistent */`
			`ctdb_request_control_reply(ctdb, state->c, NULL, -1, NULL);`
			`DEBUG(DEBUG_ERR,`
			`("Recovery lock configuration inconsistent: "`
			`"recmaster has %s, this node has %s, shutting down\n",`
			`remote == NULL ? "NULL" : remote,`
			`local == NULL ? "NULL" : local));`
			`talloc_free(state);`
			`ctdb_shutdown_sequence(ctdb, 1);`
			`}`
			`DEBUG(DEBUG_INFO,`
			`("Recovery lock consistency check successful\n"));`

			`run_start_recovery_event(ctdb, state);`
			`}`

			`/* Check recovery lock consistency and run eventscripts for the`
			`* "startrecovery" event */`
			`int32_t ctdb_control_start_recovery(struct ctdb_context *ctdb,`
			`struct ctdb_req_control_old *c,`
			`bool *async_reply)`
			`{`
			`int ret;`
			`struct recovery_callback_state *state;`
			`uint32_t recmaster = c->hdr.srcnode;`

ctdb-daemon: Increase priority of logs when recovery happens Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-07-04 08:50:12 +03:00			`DEBUG(DEBUG_ERR, ("Recovery has started\n"));`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`gettimeofday(&ctdb->last_recovery_started, NULL);`

			`state = talloc(ctdb, struct recovery_callback_state);`
			`CTDB_NO_MEMORY(ctdb, state);`

			`state->c = c;`

			`/* Although the recovery master sent this node a start`
			`* recovery control, this node might still think the recovery`
			`* master is disconnected. In this case defer the recovery`
			`* lock consistency check. */`
			`if (ctdb->nodes[recmaster]->flags & NODE_FLAGS_DISCONNECTED) {`
			`run_start_recovery_event(ctdb, state);`
			`} else {`
			`/* Ask the recovery master about its reclock setting */`
			`ret = ctdb_daemon_send_control(ctdb,`
			`recmaster,`
			`0,`
			`CTDB_CONTROL_GET_RECLOCK_FILE,`
			`0, 0,`
			`tdb_null,`
			`start_recovery_reclock_callback,`
			`state);`

			`if (ret != 0) {`
			`DEBUG(DEBUG_ERR, (__location__ " GET_RECLOCK failed\n"));`
			`talloc_free(state);`
			`return -1;`
			`}`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`}`

			`/* tell the control that we will be reply asynchronously */`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00			`state->c = talloc_steal(state, c);`
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`*async_reply = true;`
ctdb-recovery: Consistency check reclock in start recovery control If the recovery lock setting is not consistent with that of the recovery master then abort. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-04-05 08:26:22 +03:00
merge async recovery changes from Ronnie (This used to be ctdb commit 576e317640d25f8059114f15c6f1ebcee5e5b6e2) 2008-01-29 05:59:28 +03:00			`return 0;`
			`}`

Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`/*`
			`try to delete all these records as part of the vacuuming process`
			`and return the records we failed to delete`
			`*/`
			`int32_t ctdb_control_try_delete_records(struct ctdb_context ctdb, TDB_DATA indata, TDB_DATA outdata)`
			`{`
rename the structure we use for marshalling multiple records (This used to be ctdb commit 4d205476d286570a6e1f52b59af42858ce051106) 2008-07-30 08:24:56 +04:00			`struct ctdb_marshall_buffer reply = (struct ctdb_marshall_buffer )indata.dptr;`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`struct ctdb_db_context *ctdb_db;`
ctdb-recovery: Fix signed/unsigned comparisons by declaring as unsigned Simple cases where variables need to be declared as an unsigned type instead of an int. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2019-05-23 01:43:58 +03:00			`unsigned int i;`
ctdb-daemon: Rename struct ctdb_rec_data to ctdb_rec_data_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:30:30 +03:00			`struct ctdb_rec_data_old *rec;`
rename the structure we use for marshalling multiple records (This used to be ctdb commit 4d205476d286570a6e1f52b59af42858ce051106) 2008-07-30 08:24:56 +04:00			`struct ctdb_marshall_buffer *records;`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00
rename the structure we use for marshalling multiple records (This used to be ctdb commit 4d205476d286570a6e1f52b59af42858ce051106) 2008-07-30 08:24:56 +04:00			`if (indata.dsize < offsetof(struct ctdb_marshall_buffer, data)) {`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`DEBUG(DEBUG_ERR,(__location__ " invalid data in try_delete_records\n"));`
			`return -1;`
			`}`

			`ctdb_db = find_ctdb_db(ctdb, reply->db_id);`
			`if (!ctdb_db) {`
			`DEBUG(DEBUG_ERR,(__location__ " Unknown db 0x%08x\n", reply->db_id));`
			`return -1;`
			`}`


			`DEBUG(DEBUG_DEBUG,("starting try_delete_records of %u records for dbid 0x%x\n",`
			`reply->count, reply->db_id));`


ctdb:server: Fix code spelling Best reviewed with: `git show --word-diff` Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:36:23 +03:00			`/* create a blob to send back the records we couldn't delete */`
rename the structure we use for marshalling multiple records (This used to be ctdb commit 4d205476d286570a6e1f52b59af42858ce051106) 2008-07-30 08:24:56 +04:00			`records = (struct ctdb_marshall_buffer *)`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`talloc_zero_size(outdata,`
rename the structure we use for marshalling multiple records (This used to be ctdb commit 4d205476d286570a6e1f52b59af42858ce051106) 2008-07-30 08:24:56 +04:00			`offsetof(struct ctdb_marshall_buffer, data));`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`if (records == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Out of memory\n"));`
			`return -1;`
			`}`
			`records->db_id = ctdb_db->db_id;`


ctdb-daemon: Rename struct ctdb_rec_data to ctdb_rec_data_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:30:30 +03:00			`rec = (struct ctdb_rec_data_old *)&reply->data[0];`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`for (i=0;i<reply->count;i++) {`
			`TDB_DATA key, data;`

			`key.dptr = &rec->data[0];`
			`key.dsize = rec->keylen;`
			`data.dptr = &rec->data[key.dsize];`
			`data.dsize = rec->datalen;`

			`if (data.dsize < sizeof(struct ctdb_ltdb_header)) {`
			`DEBUG(DEBUG_CRIT,(__location__ " bad ltdb record in indata\n"));`
ctdb-daemon: Fix CID 1363233 Resource leak (RESOURCE_LEAK) Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-07-28 05:00:27 +03:00			`talloc_free(records);`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`return -1;`
			`}`

ctdb:server: Fix code spelling Best reviewed with: `git show --word-diff` Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:36:23 +03:00			`/* If we can't delete the record we must add it to the reply`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00			`so the lmaster knows it may not purge this record`
			`*/`
			`if (delete_tdb_record(ctdb, ctdb_db, rec) != 0) {`
			`size_t old_size;`
			`struct ctdb_ltdb_header *hdr;`

			`hdr = (struct ctdb_ltdb_header *)data.dptr;`
			`data.dptr += sizeof(*hdr);`
			`data.dsize -= sizeof(*hdr);`

			`DEBUG(DEBUG_INFO, (__location__ " Failed to vacuum delete record with hash 0x%08x\n", ctdb_hash(&key)));`

			`old_size = talloc_get_size(records);`
			`records = talloc_realloc_size(outdata, records, old_size + rec->length);`
			`if (records == NULL) {`
			`DEBUG(DEBUG_ERR,(__location__ " Failed to expand\n"));`
			`return -1;`
			`}`
			`records->count++;`
			`memcpy(old_size+(uint8_t *)records, rec, rec->length);`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`}`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00
ctdb-daemon: Rename struct ctdb_rec_data to ctdb_rec_data_old Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-29 09:30:30 +03:00			`rec = (struct ctdb_rec_data_old )(rec->length + (uint8_t )rec);`
ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`}`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00

ctdb-vacuum: Use existing function ctdb_marshall_finish Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Volker Lendecke <vl@samba.org> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Wed Jul 23 09:44:00 CEST 2014 on sn-devel-104 2014-05-06 12:52:54 +04:00			`*outdata = ctdb_marshall_finish(records);`
Redo the vacukming process to mkake it scalable. Vacumming used to delete one record at a time on all nodes, that was m*n behaviour and would require a huge storm of ctdb->ctdb controls and just wouldnt scale at all. The new vacuming process collects all records to be deleted locally and then only sends 1 control to the other nodes. This control contains a list of all records to be deleted. (This used to be ctdb commit 9e625ece19a91f362c9539fa73b6b2108f0d9c53) 2008-03-12 23:53:29 +03:00
			`return 0;`
			`}`
Expand the client async framework so that it can take a callback function. This allows us to use the async framework also for controls that return outdata. Add a "capabilities" field to the ctdb_node structure. This field is only initialized and kept valid inside the recovery daemon context and not inside the main ctdb daemon. change the GET_CAPABILITIES control to return the capabilities in outdata instead of in the res return variable. When performing a recovery inside the recovery daemon, read the capabilities from all connected nodes and update the ctdb->nodes list of nodes. when building the new vnnmap after the database rebuild in recovery, do not include any nodes which lack the LMASTER capability in the new vnnmap. Unless there are no available connected node that sports the LMASTER capability in which case we let the local node (recmaster) take on the lmaster role temporarily (i.e. become a member of the vnnmap list) (This used to be ctdb commit 0f1883c69c689b28b0c04148774840b2c4081df6) 2008-05-06 09:42:59 +04:00
			`/*`
			`report capabilities`
			`*/`
			`int32_t ctdb_control_get_capabilities(struct ctdb_context ctdb, TDB_DATA outdata)`
			`{`
			`uint32_t *capabilities = NULL;`

			`capabilities = talloc(outdata, uint32_t);`
			`CTDB_NO_MEMORY(ctdb, capabilities);`
			`*capabilities = ctdb->capabilities;`

			`outdata->dsize = sizeof(uint32_t);`
			`outdata->dptr = (uint8_t *)capabilities;`

ctdb:server: Remove trailing whitespaces in ctdb_recover.c Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:33:04 +03:00			`return 0;`
Expand the client async framework so that it can take a callback function. This allows us to use the async framework also for controls that return outdata. Add a "capabilities" field to the ctdb_node structure. This field is only initialized and kept valid inside the recovery daemon context and not inside the main ctdb daemon. change the GET_CAPABILITIES control to return the capabilities in outdata instead of in the res return variable. When performing a recovery inside the recovery daemon, read the capabilities from all connected nodes and update the ctdb->nodes list of nodes. when building the new vnnmap after the database rebuild in recovery, do not include any nodes which lack the LMASTER capability in the new vnnmap. Unless there are no available connected node that sports the LMASTER capability in which case we let the local node (recmaster) take on the lmaster role temporarily (i.e. become a member of the vnnmap list) (This used to be ctdb commit 0f1883c69c689b28b0c04148774840b2c4081df6) 2008-05-06 09:42:59 +04:00			`}`

daemon: On shutdown, destroy timed events that check if recoverd is active When CTDB is shutting down, recovery daemon is stopped, but the event that checks if recovery daemon is still alive is not destroyed. So recovery master is restarted during shutdown if CTDB daemon takes longer to shutdown. There are two processes that check if recovery daemon is working. 1. ctdb_check_recd() - which checks every 30 seconds if the recovery daemon process exists. 2. ctdb_recd_ping_timeout() - which is triggered when recovery daemon fails to ping CTDB daemon. Both the events are periodic and need to be destroyed when shutting down. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 746168df2e691058e601016110fae818c6a265c3) 2012-12-04 08:05:44 +04:00			`/* The recovery daemon will ping us at regular intervals.`
ctdb:server: Fix code spelling Best reviewed with: `git show --word-diff` Signed-off-by: Andreas Schneider <asn@samba.org> Reviewed-by: Martin Schwenke <mschwenke@ddn.com> 2023-03-22 11:36:23 +03:00			`If we haven't been pinged for a while we assume the recovery`
daemon: On shutdown, destroy timed events that check if recoverd is active When CTDB is shutting down, recovery daemon is stopped, but the event that checks if recovery daemon is still alive is not destroyed. So recovery master is restarted during shutdown if CTDB daemon takes longer to shutdown. There are two processes that check if recovery daemon is working. 1. ctdb_check_recd() - which checks every 30 seconds if the recovery daemon process exists. 2. ctdb_recd_ping_timeout() - which is triggered when recovery daemon fails to ping CTDB daemon. Both the events are periodic and need to be destroyed when shutting down. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 746168df2e691058e601016110fae818c6a265c3) 2012-12-04 08:05:44 +04:00			`daemon is inoperable and we restart.`
			`*/`
ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`static void ctdb_recd_ping_timeout(struct tevent_context *ev,`
			`struct tevent_timer *te,`
			`struct timeval t, void *p)`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00			`{`
			`struct ctdb_context *ctdb = talloc_get_type(p, struct ctdb_context);`
The ctdb daemon keeps track of whether the recovery process is running correctly by measuring how long it was since the last successful communication with the recovery daemon was recorded. After a certain timeout the ctdb daemon would deem the recovery daemon as inoperable and shut down. If the system clock is suddenly changed forward by many (60 or more) seconds this could cause the timeout to trigger prematurely/immediately where ctdb would incorrectly think that more than 60 seconds had passed since last successful communications and thus abort. Instead of cehcking for one timeout occuring, only deem the recovery daemon to be "down" and trigger a shutdown if communications have timedout for three intervals in a row. (This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d) 2008-09-17 08:17:41 +04:00			`uint32_t *count = talloc_get_type(ctdb->recd_ping_count, uint32_t);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00
add extra debug statements to the log to make it easier to see when a recovery dameon has hung due to the underlying filesystem hanging. (This used to be ctdb commit 5b0067a4e335cbbf6e606646e612d4bfcfdb7441) 2009-05-12 12:39:34 +04:00			`DEBUG(DEBUG_ERR, ("Recovery daemon ping timeout. Count : %u\n", *count));`
The ctdb daemon keeps track of whether the recovery process is running correctly by measuring how long it was since the last successful communication with the recovery daemon was recorded. After a certain timeout the ctdb daemon would deem the recovery daemon as inoperable and shut down. If the system clock is suddenly changed forward by many (60 or more) seconds this could cause the timeout to trigger prematurely/immediately where ctdb would incorrectly think that more than 60 seconds had passed since last successful communications and thus abort. Instead of cehcking for one timeout occuring, only deem the recovery daemon to be "down" and trigger a shutdown if communications have timedout for three intervals in a row. (This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d) 2008-09-17 08:17:41 +04:00
use the correct tunable failcount not timeout (This used to be ctdb commit 475cfada33b4c13aaaca773d5485bbe26bffbf46) 2008-09-17 08:24:12 +04:00			`if (*count < ctdb->tunable.recd_ping_failcount) {`
The ctdb daemon keeps track of whether the recovery process is running correctly by measuring how long it was since the last successful communication with the recovery daemon was recorded. After a certain timeout the ctdb daemon would deem the recovery daemon as inoperable and shut down. If the system clock is suddenly changed forward by many (60 or more) seconds this could cause the timeout to trigger prematurely/immediately where ctdb would incorrectly think that more than 60 seconds had passed since last successful communications and thus abort. Instead of cehcking for one timeout occuring, only deem the recovery daemon to be "down" and trigger a shutdown if communications have timedout for three intervals in a row. (This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d) 2008-09-17 08:17:41 +04:00			`(*count)++;`
ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, ctdb->recd_ping_count,`
			`timeval_current_ofs(ctdb->tunable.recd_ping_timeout, 0),`
			`ctdb_recd_ping_timeout, ctdb);`
The ctdb daemon keeps track of whether the recovery process is running correctly by measuring how long it was since the last successful communication with the recovery daemon was recorded. After a certain timeout the ctdb daemon would deem the recovery daemon as inoperable and shut down. If the system clock is suddenly changed forward by many (60 or more) seconds this could cause the timeout to trigger prematurely/immediately where ctdb would incorrectly think that more than 60 seconds had passed since last successful communications and thus abort. Instead of cehcking for one timeout occuring, only deem the recovery daemon to be "down" and trigger a shutdown if communications have timedout for three intervals in a row. (This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d) 2008-09-17 08:17:41 +04:00			`return;`
			`}`

Restart recovery dameon if it looks like it hung. Dont shutdown ctdbd completely, that only makes the problem worse. (This used to be ctdb commit 221ecc2509f6d267d1854c1042ff945a620510bb) 2011-03-03 22:55:24 +03:00			`DEBUG(DEBUG_ERR, ("Final timeout for recovery daemon ping. Restarting recovery daemon. (This can be caused if the cluster filesystem has hung)\n"));`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00
			`ctdb_stop_recoverd(ctdb);`
Restart recovery dameon if it looks like it hung. Dont shutdown ctdbd completely, that only makes the problem worse. (This used to be ctdb commit 221ecc2509f6d267d1854c1042ff945a620510bb) 2011-03-03 22:55:24 +03:00			`ctdb_start_recoverd(ctdb);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00			`}`

			`int32_t ctdb_control_recd_ping(struct ctdb_context *ctdb)`
			`{`
The ctdb daemon keeps track of whether the recovery process is running correctly by measuring how long it was since the last successful communication with the recovery daemon was recorded. After a certain timeout the ctdb daemon would deem the recovery daemon as inoperable and shut down. If the system clock is suddenly changed forward by many (60 or more) seconds this could cause the timeout to trigger prematurely/immediately where ctdb would incorrectly think that more than 60 seconds had passed since last successful communications and thus abort. Instead of cehcking for one timeout occuring, only deem the recovery daemon to be "down" and trigger a shutdown if communications have timedout for three intervals in a row. (This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d) 2008-09-17 08:17:41 +04:00			`talloc_free(ctdb->recd_ping_count);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00
The ctdb daemon keeps track of whether the recovery process is running correctly by measuring how long it was since the last successful communication with the recovery daemon was recorded. After a certain timeout the ctdb daemon would deem the recovery daemon as inoperable and shut down. If the system clock is suddenly changed forward by many (60 or more) seconds this could cause the timeout to trigger prematurely/immediately where ctdb would incorrectly think that more than 60 seconds had passed since last successful communications and thus abort. Instead of cehcking for one timeout occuring, only deem the recovery daemon to be "down" and trigger a shutdown if communications have timedout for three intervals in a row. (This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d) 2008-09-17 08:17:41 +04:00			`ctdb->recd_ping_count = talloc_zero(ctdb, uint32_t);`
			`CTDB_NO_MEMORY(ctdb, ctdb->recd_ping_count);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00
			`if (ctdb->tunable.recd_ping_timeout != 0) {`
ctdb-daemon: Stop using tevent compatibility definitions Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2015-10-26 08:50:09 +03:00			`tevent_add_timer(ctdb->ev, ctdb->recd_ping_count,`
			`timeval_current_ofs(ctdb->tunable.recd_ping_timeout, 0),`
			`ctdb_recd_ping_timeout, ctdb);`
additional monitoring between the two daemons. we currently only monitor that the dameons are running by kill(0, pid) and verifying the the domain socket between them is ok. this is not sufficient since we can have a situation where the recovery daemon is hung. this new code monitors that the recovery daemon is operating. if the recovery hangs, we log this and shut down the main daemon (This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c) 2008-09-09 07:44:46 +04:00			`}`

			`return 0;`
			`}`

ctdb-daemon: Factor out new function ctdb_node_become_inactive() This is a superset of ctdb_local_node_got_banned() so will replace that function, and will also be used in the NODE_STOP control. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14087 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2019-08-19 14:47:03 +03:00			`void ctdb_node_become_inactive(struct ctdb_context *ctdb)`
			`{`
			`struct ctdb_db_context *ctdb_db;`

			`D_WARNING("Making node INACTIVE\n");`

			`/*`
			`* Do not service database calls - reset generation to invalid`
			`* so this node ignores any REQ/REPLY CALL/DMASTER`
			`*/`
			`ctdb->vnn_map->generation = INVALID_GENERATION;`
			`for (ctdb_db = ctdb->db_list; ctdb_db != NULL; ctdb_db = ctdb_db->next) {`
			`ctdb_db->generation = INVALID_GENERATION;`
			`}`

			`/*`
			`* Although this bypasses the control, the only thing missing`
			`* is the deferred drop of all public IPs, which isn't`
			`* necessary because they are dropped below`
			`*/`
			`if (ctdb->recovery_mode != CTDB_RECOVERY_ACTIVE) {`
			`D_NOTICE("Recovery mode set to ACTIVE\n");`
			`ctdb->recovery_mode = CTDB_RECOVERY_ACTIVE;`
			`}`

			`/*`
			`* Initiate database freeze - this will be scheduled for`
			`* immediate execution and will be in progress long before the`
			`* calling control returns`
			`*/`
			`ctdb_daemon_send_control(ctdb,`
			`ctdb->pnn,`
			`0,`
			`CTDB_CONTROL_FREEZE,`
			`0,`
			`CTDB_CTRL_FLAG_NOREPLY,`
			`tdb_null,`
			`NULL,`
			`NULL);`

			`D_NOTICE("Dropping all public IP addresses\n");`
			`ctdb_release_all_ips(ctdb);`
			`}`
create a new event : stopped. This event is called when a node is stopped and is used by eventscripts that need to do certain cleanup and removal of configuration or ip addresses or routing ... Note that a STOPPED node is considered "inactive" and as such will not be running the "recovered" event when the rest of the cluster has recovered. (This used to be ctdb commit 65e9309564611bf937ded3c74a79abff895d7c59) 2009-07-17 06:26:16 +04:00
ctdbd: Remove the "stopped" event It isn't used, superceded by "ipreallocated". Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c2bb8596a8af6406ef50e53953884df9d6246a96) 2013-02-21 07:28:13 +04:00			`int32_t ctdb_control_stop_node(struct ctdb_context *ctdb)`
add two new controls, CTOP_NODE and CONTINUE_NODE that are used to stop/continue a node instead of using modflags messages (This used to be ctdb commit 54b4a02053a0f98f8c424e7f658890254023d39a) 2009-07-09 06:22:46 +04:00			`{`
ctdb-daemon: Increase priority of logs when node is stopped/continued Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-07-04 08:32:47 +03:00			`DEBUG(DEBUG_ERR, ("Stopping node\n"));`
add two new controls, CTOP_NODE and CONTINUE_NODE that are used to stop/continue a node instead of using modflags messages (This used to be ctdb commit 54b4a02053a0f98f8c424e7f658890254023d39a) 2009-07-09 06:22:46 +04:00			`ctdb->nodes[ctdb->pnn]->flags \|= NODE_FLAGS_STOPPED;`

ctdb-daemon: Make node inactive in the NODE_STOP control Currently some of this is supported by a periodic check in the recovery daemon's main_loop(), which notices the flag change, sets recovery mode active and freezes databases. If STOP_NODE returns immediately then the associated recovery can complete and the node can be continued before databases are actually frozen. Instead, immediately do all of the things that make a node inactive. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14087 RN: Stop "ctdb stop" from completing before freezing databases Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Amitay Isaacs <amitay@samba.org> Autobuild-Date(master): Tue Aug 20 08:32:27 UTC 2019 on sn-devel-184 2019-08-19 14:48:04 +03:00			`ctdb_node_become_inactive(ctdb);`

add two new controls, CTOP_NODE and CONTINUE_NODE that are used to stop/continue a node instead of using modflags messages (This used to be ctdb commit 54b4a02053a0f98f8c424e7f658890254023d39a) 2009-07-09 06:22:46 +04:00			`return 0;`
			`}`

			`int32_t ctdb_control_continue_node(struct ctdb_context *ctdb)`
			`{`
ctdb-daemon: Increase priority of logs when node is stopped/continued Signed-off-by: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Martin Schwenke <martin@meltin.net> 2017-07-04 08:32:47 +03:00			`DEBUG(DEBUG_ERR, ("Continue node\n"));`
add two new controls, CTOP_NODE and CONTINUE_NODE that are used to stop/continue a node instead of using modflags messages (This used to be ctdb commit 54b4a02053a0f98f8c424e7f658890254023d39a) 2009-07-09 06:22:46 +04:00			`ctdb->nodes[ctdb->pnn]->flags &= ~NODE_FLAGS_STOPPED;`

			`return 0;`
			`}`
ReadOnly: add a new control to activate readonly lock capability for a database. let all databases default to not support this until enabled through this control (This used to be ctdb commit 908a07c42e5135a3ba30a625fc4f4e4916de197a) 2011-09-01 05:08:18 +04:00

1244 lines 32 KiB C Raw Normal View History Unescape Escape

1244 lines

32 KiB

C

Raw Normal View History