samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-01-13 13:18:06 +03:00

Author	SHA1	Message	Date
Ronnie Sahlberg	4a43428440	The recent change to the recovery daemon to keep track of and verify that all nodes agree on the most recent ip address assignments broke "ctdb moveip ..." since that call would never trigger a full takeover run and thus would immediately trigger an inconsistency. Add a new message to the recovery daemon where we can tell the recovery daemon to update its assignments. BZ62782 (This used to be ctdb commit e7069082e5f0380dcddee247db8754218ce18cab)	2010-05-03 15:47:17 +10:00
Ronnie Sahlberg	05dcbed90e	ctdb regsrvids is much more useful for testing if it sleeps once it has registered its srvid. Othervise, as soon as it terminates, ctdbd will deregister the id automatically. (This used to be ctdb commit 23b059dcb8074872d7900b225790d4df7da071b6)	2010-02-22 15:34:26 +11:00
Ronnie Sahlberg	e01c8454ef	commands that relate to manual failover of ip addresses (moveip) can sometimes take long so allow for a longer timeout for the controls used. (This used to be ctdb commit 144c69b633eeb17e120f962162feed6de3dc16a6)	2010-02-09 18:34:47 +11:00
Ronnie Sahlberg	ca9386a7f4	dont just exit(0) upon successful completion of waiting for an ipreallocate to finish. return success back to the caller instead. otherwise things like 'ctdb enable -n all' will just finish after the first disabled node has become enabled. (This used to be ctdb commit f4eb41cd3a1099da8265351818fba9bd4688a188)	2010-02-09 14:35:10 +11:00
Ronnie Sahlberg	7a889c5f1d	When trying to enable/disable a node. Check if the node is already enabled/disabled and log an information message if so. (This used to be ctdb commit c3eec8f10764a647106087099eeb47b7196f7aac)	2010-02-04 10:03:21 +11:00
Ronnie Sahlberg	7a5254ae69	add two new debug controls to send and receive messages ctdb msglisten and msgsend (This used to be ctdb commit 8c89aac20260dc7f3746e29fe99f17422a77cb88)	2010-02-04 09:45:32 +11:00
Stefan Metzmacher	f2854f75c8	tools/ctdb: add PartiallyOnline state for "ctdb status" and "ctdb status -Y" This is based on the GET_IFACES control against each node. metze (This used to be ctdb commit 38cb972382a09f830673277d0a9bd5d20deafff2)	2010-01-20 11:11:00 +01:00
Stefan Metzmacher	a6437bc707	tools/ctdb: display interfaces in "ctdb ip" and "ctdb ip -Y" outputs metze (This used to be ctdb commit dffa2b05acce8b73c2fdd085311732bf57f01b7f)	2010-01-20 11:11:00 +01:00
Stefan Metzmacher	df5805d6a0	tools/ctdb: add "ctdb ipinfo <ip>" metze (This used to be ctdb commit e05e236fc019bfd3b316609a7c190e0e028a4bbc)	2010-01-20 11:11:00 +01:00
Stefan Metzmacher	a6803f42a5	tools/ctdb: add "ctdb setifacelink <iface> <status>" metze (This used to be ctdb commit 8d0c00b60db69bd10f12da4c676e1142dc37af7a)	2010-01-20 11:11:00 +01:00
Stefan Metzmacher	0ceef7036b	tools/ctdb: add "ctdb ifaces" metze (This used to be ctdb commit 80053d09eed967fb76898f4a53437bed2b43a02f)	2010-01-20 11:11:00 +01:00
Stefan Metzmacher	a23c409e73	tools/ctdb: display INACTIVE status in "ctdb status" and "ctdb status -Y" metze (This used to be ctdb commit 18af37e99ef8ff5623161495be432abfe5e3407f)	2010-01-20 09:44:36 +01:00
Stefan Metzmacher	a03cf0040b	ctdb: print out some hints how to debug a "ctdb catdb" failure metze (This used to be ctdb commit 504cf78d00d1120b556124340b9312f890b8b8b9)	2009-12-16 08:08:33 +01:00
Stefan Metzmacher	965c000c6e	ctdb: add machinereadable output fot "ctdb -Y getdbmap" metze (This used to be ctdb commit 45cfcd44093c7d2681e2ffd5cfb402823e8809f4)	2009-12-16 08:08:33 +01:00
Stefan Metzmacher	aa07a46bf5	ctdb: disallow "ctdb backupdb" on unhealthy databases metze (This used to be ctdb commit ecf799093c1989f5499c9d61ce8cc8a98d759160)	2009-12-16 08:08:33 +01:00
Stefan Metzmacher	c4bc231267	client: add "ctdb dumpdbbackup <filename>" metze (This used to be ctdb commit c63a0368d9d4b526ac1e49d891d3a1b7b8d20320)	2009-12-16 08:08:33 +01:00
Stefan Metzmacher	fb50e08942	tools/ctdb: let "ctdb restoredb" and "ctdb wipedb" mark the db as healthy on all nodes metze (This used to be ctdb commit d1b10b0c0c323c39742a18e98a1dab7e82ddc7be)	2009-12-16 08:08:32 +01:00
Stefan Metzmacher	c56ce3d2f2	tools/ctdb: add "ctdb getdbstatus <dbname>" metze (This used to be ctdb commit 910c19f12448d293a755d1eb46d20f9591f8da7a)	2009-12-16 08:08:32 +01:00
Stefan Metzmacher	927dd3d9e5	tools/ctdb: display db health in "ctdb getdbmap" metze (This used to be ctdb commit c34535ff4dc6a44909283641596e0ed7c2316fbd)	2009-12-16 08:08:32 +01:00
Rusty Russell	cab8da8dc4	ctdb: don't print OUTPUT: for DISABLED scripts In other news, did you know ctime() returns a \n-terminated string? Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 1b4e7bb548976b99f122142b040494b6f9911962)	2009-12-14 15:46:49 +11:00
Rusty Russell	a46c3b4f2a	ctdb: scriptstatus can now query non-monitor events We also no longer return an error before scripts have been run; a special zero-length data means we have never run the scripts. "ctdb scriptstatus all" returns all event script results. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9b90d671581e390e2892d3a68f3ca98d58bef4df)	2009-12-08 01:50:55 +10:30
Rusty Russell	9e87377e7a	ctdb: support --machinereadable (-Y) for scriptstatus Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 47ffe75848f216568ce3db0a60ca88cfe3d6903a)	2009-12-08 01:31:53 +10:30
Rusty Russell	9753b7e793	eventscript: rename ctdb_monitoring_wire to ctdb_scripts_wire We're going to allow fetching status of all script runs, so this name is no longer appropriate. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit f5cb41ecf3fa986b8af243e8546eb3b985cd902a)	2009-12-08 00:51:24 +10:30
Rusty Russell	c70afe0cd4	eventscript: handle and report generic stat/execution errors Rather than ignoring deleted event scripts (or pretending that they were "OK"), and discarding other stat errors, we save the errno and turn it into a negative status. This gives us a bit more information if we can't execute a script (eg. too many symlinks or other weird errors). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 5d894e1ae5228df6bbe4fc305ccba19803fa3798)	2009-12-07 23:12:19 +10:30
Rusty Russell	b9b75bd065	eventscript: use -ENOEXEC for disabled status value This unifies code paths and simplifies things: we just hand -ENOEXEC to ctdb_ctrl_event_script_stop(). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit eadf5e44ef97d7703a7d3bce0e7ea0f21cb11f14)	2009-12-07 23:11:47 +10:30
Rusty Russell	066a791770	eventscript: use -ETIME for timeout status value This starts the move toward more expressive encoding of return values: positive values mean the script ran, negative means we had a problem with the script (and the value is the errno). This does timeout, but changes the ctdb tool to recognize it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 0eb1d0aa14e68b598d9e281c8a02b8f94a042fd9)	2009-12-07 23:09:42 +10:30
Michael Adam	92c5d9eefc	ctdb: add command "ctdb wipedb" to wipe the contents of an attached tdb Michael (This used to be ctdb commit 5a7c1e7f15693522bbf1c39a53be2304ece9a134)	2009-12-04 11:30:20 +01:00
Ronnie Sahlberg	cc2d81a77c	make the ringbuffer logging more efficient and marshall the data by writing to a tmpfile instead of continously talloc resizing a blob (This used to be ctdb commit 6427f0b68d60b556a023f64e15e156000ba6f943)	2009-11-18 19:10:50 +11:00
Ronnie Sahlberg	bc2675119d	add an in memory ringbuffer where we store the last 500000 log entries regardless of log level. add commandt to extract this in memory buffer and to clear it (This used to be ctdb commit 29d2ee8d9c6c6f36b2334480f646d6db209f370e)	2009-11-18 12:44:18 +11:00
Ronnie Sahlberg	f88fbb5f1e	suggestion from Christian, dont allow UNHEALTHY nodes to become natgw master, unless all nodes are unhealthy (This used to be ctdb commit e8e7129ff1371065fbd75e1aea844d6d04a96fa9)	2009-11-06 08:19:32 +11:00
Ronnie Sahlberg	fcd2ebc32b	update the uptime command to indicate that time since last is either from alst recovery or from last failover (This used to be ctdb commit 467da12a785ba3367ed9cbdf79440394e9703289)	2009-10-29 10:58:14 +11:00
Ronnie Sahlberg	023d09cd38	Revert "update the "uptime" command to indicate the "time since last" is the time since the last recovery OR failover." This reverts commit 3b0d44497800a16400d05a30bdaf6e6c285d4b36. (This used to be ctdb commit cb36bbb5418290e8e5b770d2d836285b15da2a6f)	2009-10-29 10:49:00 +11:00
Ronnie Sahlberg	279b7ca564	update the "uptime" command to indicate the "time since last" is the time since the last recovery OR failover. (This used to be ctdb commit 3b0d44497800a16400d05a30bdaf6e6c285d4b36)	2009-10-29 10:37:10 +11:00
Ronnie Sahlberg	4d40b86805	for debugging add a global variable holding the pid of the main daemon. change the tracking of time() in the event loop to only check/warn when called from the main daemon (This used to be ctdb commit a10fc51f4c30e85ada6d4b7347b0f9a8ebc76637)	2009-10-27 13:18:52 +11:00
Ronnie Sahlberg	80be59d35e	when we change state between healthy/unhealthy, make sure we ask the recovery master to perform an explicit ip reallocation. This is more reliable and faster than having the recovery dameon track these changes, and since we now have an explicit method to ask the recovery daemon to perform an explicit ip reallocation, we should use this. (This used to be ctdb commit 3807681e74f4bfe92befdae6ed616ff5f1a99880)	2009-10-14 11:59:16 +11:00
Ronnie Sahlberg	98b5caf003	we must break the loop as soon as we find a suitable recmaster does exist otherwise "tdb ipreallocate" will silently fail to update the addresses. (This used to be ctdb commit 346fa055f4106497b87df97da5ebd6e51fa1ef8c)	2009-10-13 09:49:05 +11:00
Ronnie Sahlberg	771802b212	allow setting the recmode even when not completely frozen. we sometimes have to do this when we want to trigger a recovery (This used to be ctdb commit 46194e87e189521375b39b4ef33da2b493429fd8)	2009-10-12 13:06:16 +11:00
Ronnie Sahlberg	d4c98516a2	uptade the freeze/thaw commands to be able to send the requested database priority to freeze/thaw to the daemon. this is encoded in the srvid field of the request header (This used to be ctdb commit 0cb3d33caa42ed783e03bc825b181dde4cf63616)	2009-10-12 09:22:17 +11:00
Ronnie Sahlberg	3219f81710	add a control to read the db priority from a database (This used to be ctdb commit ca6d045e419f308f57e74d4c978907afb05ddb85)	2009-10-10 15:04:18 +11:00
Ronnie Sahlberg	6cf7d8e131	add a control to set a database priority. Let newly created databases default to priority 1. database priorities will be used to control in which order databases are locked during recovery in. (This used to be ctdb commit 67741c0ee01916d94cace8e9462ef02507e06078)	2009-10-10 14:26:09 +11:00
Ronnie Sahlberg	134ed842fa	always send the release/take ip controls to make sure all nodes are updated (This used to be ctdb commit 789703ea684717781c176fd3a2a24d96abde220b)	2009-10-06 12:25:44 +11:00
Ronnie Sahlberg	166b1c97b4	add a new message to ask the recovery daemon to temporarily disable checking ip address consistency. This is useful when we are moving addresses using moveip in the cluster since otherwise if we collide with the recovery daemons own check we could cause a recovery (This used to be ctdb commit 9c63858c0b22c81eaccb9865a414af0bbb2833d4)	2009-10-06 12:11:32 +11:00
Ronnie Sahlberg	617e393f6b	update addip/moveip/delip to make it less likely to trigger an accidental recovery (This used to be ctdb commit 3befe5526e147d49451fddc930aaafc3dbe2e9c1)	2009-10-06 11:41:18 +11:00
Ronnie Sahlberg	709fc77878	When adding a public ip to a node, make sure to push the assignment of ip addresses out to all nodes so all nodes become aware who currently holds the ip. (This used to be ctdb commit e8df6fc301fb7faf72c72eb39ea68d44d1526b00)	2009-10-06 08:19:25 +11:00
Ronnie Sahlberg	22dde50be3	add machinereadable output for the ctdb getreclock command (This used to be ctdb commit 5e7dc36f1649824db2f9dab34bede8b388502a57)	2009-09-28 13:39:54 +10:00
Ronnie Sahlberg	cda5f02c7c	new prototype banning code (This used to be ctdb commit 0c4c2240267af183d54ffd4c0aacda208f6eff6a)	2009-09-04 02:20:39 +10:00
Ronnie Sahlberg	ef9db0efc3	reduce the loglevel for the message that we switch to a different recmaster while waiting for ipreallocate to finish (This used to be ctdb commit e5b25e1386294b1f800c32fb01c69c3c3ce85c26)	2009-08-17 10:56:12 +10:00
Ronnie Sahlberg	486bdd8ca1	if no timeout at all is specified to the ctdb tool, neither using -T nor by setting CGTDB_TIMEOUT, then use 120 seconds as a default timepout before the ctdb command will exit with an error. (This used to be ctdb commit d8d21884736a9610d48cf532e1c6778e511fb7a8)	2009-08-17 10:54:45 +10:00
Ronnie Sahlberg	1cc79905ad	add new controls to make it possible to enable/disable individual eventscripts update scriptstatus output so it lists disabled scripts (This used to be ctdb commit 7e799b7523c9699bd65a8a8207f7e03d668b0b81)	2009-08-13 13:04:08 +10:00
Ronnie Sahlberg	0e09e52824	update STOP/CONTINUE to better handle when we stop the last node (This used to be ctdb commit 9a251078f22aea15b9ca37393e0b5e2740aa21fb)	2009-08-03 12:51:55 +10:00
Ronnie Sahlberg	62c4a841d2	When processing the stop node control reply in the client code we should also check the returned status code in case the _stop() command failed due to the eventscripts failing. If this happens, make "ctdb stop" log an error to the console and try the operation again. (This used to be ctdb commit 20e82e0c48e07d1012549f5277f1f5a3f4bd10d1)	2009-07-29 09:58:40 +10:00
Ronnie Sahlberg	37d68c58b8	add two commands : setlmasterrole and setrecmasterrole to enable/disable these capabilities at runtime (This used to be ctdb commit 51aaed0e9e42e901451292e8dd545297ab725a62)	2009-07-28 13:45:13 +10:00
Ronnie Sahlberg	72e2380e92	add a command "setnatgwstate {on\|off}" that can be used to indicate if this node is using natgw functionality or not. (This used to be ctdb commit 89a9bb29a60a6fb1fba55987e6cf0a4baa695e50)	2009-07-28 09:58:11 +10:00
Ronnie Sahlberg	9c6aa4e420	update the eventscript to ensure that stopped nodes can not become the natgw master also verify that we actually do have a natgw master available if this is configured and make the node unhealthy if not. (This used to be ctdb commit 7f273ee769d671d8c8be87c9187302fb77e814f3)	2009-07-17 09:45:05 +10:00
Ronnie Sahlberg	5ce69e2fa3	if all nodes are STOPPED, pick one of the STOPPED nodes as natgw master (This used to be ctdb commit 8bbd96cfbbe98f3fc19e432797cbf4478f753a0b)	2009-07-17 09:36:22 +10:00
Ronnie Sahlberg	bf9ad9c934	Do not allow STOPPED or DELETED nodes to become the NATGW master (This used to be ctdb commit 4505ea15408ad40dd8deb4041fd75a65a0ad9336)	2009-07-17 09:29:58 +10:00
Ronnie Sahlberg	88f3c40d9c	add two new controls, CTOP_NODE and CONTINUE_NODE that are used to stop/continue a node instead of using modflags messages (This used to be ctdb commit 54b4a02053a0f98f8c424e7f658890254023d39a)	2009-07-09 12:22:46 +10:00
Ronnie Sahlberg	d6a5fd5c9d	remove the header printed for the machinereadable output for natgwlist (This used to be ctdb commit 049271c83a09afb8d6c3e5212cf9ca782956b0c6)	2009-07-09 11:43:37 +10:00
Ronnie Sahlberg	9f0dc4b93b	Add a new node flag : STOPPED This node flag means the node is DISABLED and that all its public ip addresses are failed over, but also that it has been removed from the VNNmap. A STOPPED node should be in recovery mode active untill restarted using the continue command. Adding two new commands "ctdb stop" "ctdb continue" (This used to be ctdb commit d47dab1026deba0554f21282a59bd172209ea066)	2009-07-09 11:38:18 +10:00
Ronnie Sahlberg	20887a15ad	Perform an ipreallocate efter each enable/disable. This will force a wait until the ip addresses have been reallocated after a disable/enable command and will make scripting of enable/disable more predictable. This will cause the command enable/disable to wait until the ip realocation that normally follows shortly after a enable/disable to finish before the command returns to the prompt. (This used to be ctdb commit 6e1f60d8d780c1240aaabb78ecc8550d0480cd7e)	2009-07-06 11:49:55 +10:00
Ronnie Sahlberg	289c58e9b6	add a new command "ctdb ipreallocate", this command will force the recovery master to perform a full ip reallocation process. the ctdb command will block until the ip reallocation has comleted (This used to be ctdb commit abad7b97fe0c066b33f6e75d0953bbed892a3216)	2009-07-02 13:00:26 +10:00
Ronnie Sahlberg	8e435c0605	update enable/disable (This used to be ctdb commit b99afc98bedf1a51d315e311f27c3fc55fd940e7)	2009-07-01 09:33:08 +10:00
Ronnie Sahlberg	2770cb4397	show the valid debuglevels that can be used in the error text when an invalid level was specified to ctdb setdebug (This used to be ctdb commit 421c0566094b91221fab2ea68f2c9bd35d5dfbcb)	2009-07-01 09:21:07 +10:00
Ronnie Sahlberg	93026f4cbf	update the handling of debug levels so that we always can use a literal instead of a numeric value. validate the input values used and refuse setting the debug level to an unknown value (This used to be ctdb commit daec49cea1790bcc64599959faf2159dec2c5929)	2009-07-01 09:17:13 +10:00
Ronnie Sahlberg	9802a0c2f6	when no debuglevel is specified, make 'ctdb setdebug' show the available options (This used to be ctdb commit f4b0825d9da34578b9f90dc9bd7f99fcc2519ddf)	2009-07-01 08:26:00 +10:00
Ronnie Sahlberg	5b235c3999	add a control to set the reclock file (This used to be ctdb commit 36cc2e586f03fa497ee9b06f3e6afc80219c4aaa)	2009-06-25 14:25:18 +10:00
Ronnie Sahlberg	2b253c094c	add a control to read the current reclock file from a node (This used to be ctdb commit ed6a4cbcdcbb4e0df83bec8be67c30288bf9bd41)	2009-06-25 12:17:19 +10:00
Ronnie Sahlberg	2bb687c4cd	remove unused variable (This used to be ctdb commit 2a52336ec021dfe8d56ba72726feb7b2dbd41f68)	2009-06-09 10:58:46 +10:00
Ronnie Sahlberg	ac931b1371	dont require particular values for NoIPFailback and DeterministicIPs when using ctdb moveip (This used to be ctdb commit d350c631850377c09968d2978ef57d2bd0d50116)	2009-06-09 10:57:46 +10:00
Ronnie Sahlberg	f135684766	improve ctdb moveip so that it does not always trigger a recovery. (This used to be ctdb commit 0ca28d7336463ecd2ff65620d8dbcbb496991531)	2009-06-09 10:56:50 +10:00
Ronnie Sahlberg	f6ccf96898	try avoiding to cause a recovery when deleting a public ip from a node (This used to be ctdb commit 6318ea13464e2fe630084c40802d8e697c2cb999)	2009-06-05 17:57:14 +10:00
Ronnie Sahlberg	b046f5e3aa	when adding an ip, try manually adding and takingover the ip instead of triggering a full recovery to do the same thing (This used to be ctdb commit 4d5d22e64270cfb31be6acd71f4f97ec43df5b2c)	2009-06-05 17:00:47 +10:00
Ronnie Sahlberg	79eef7f2b5	dont list DELETED nodes in the ctdb listnodes output (This used to be ctdb commit 7eb137aa4c24c69bd93b98fb3c7108e5f3288ebd)	2009-06-04 13:25:58 +10:00
Ronnie Sahlberg	f691b96d84	make it possible to run 'ctdb listnodes' also if the daemon is not running. in this case, read the nodes file directly instead of asking the local daemon for the list. add an option -Y to provide machinereadable output to listnodes (This used to be ctdb commit 4a55cacc4f5526abd2124460b669e633deeda408)	2009-06-04 13:21:25 +10:00
Ronnie Sahlberg	1dee7a2401	hide all DELETED nodes from the ctdb command output (This used to be ctdb commit 91fdfee371d6be83af60cd38ac34afb295b9987a)	2009-06-01 15:43:30 +10:00
Ronnie Sahlberg	e6170b5389	add a new node state : DELETED. This is used to mark nodes as being DELETED internally in ctdb so that nodes are not renumbered if / when they are removed from the nodes file. This is used to be able to do "ctdb reloadnodes" at runtime without causing nodes to be renumbered. To do this, instead of deleting a node from the nodes file, just comment it out like 1.0.0.1 #1.0.0.2 1.0.0.3 After removing 1.0.0.2 from the cluster, the remaining nodes retain their pnn's from prior to the deletion, namely 0 and 2 Any line in the nodes file that is commented out represents a DELETED pnn (This used to be ctdb commit 6a5e4fd7fa391206b463bb4e976502f3ac5bd343)	2009-06-01 14:18:34 +10:00
Sumit Bose	2fcedf6dac	add missing checks on so far ignored return values Most of these were found during a review by Jim Meyering <meyering@redhat.com> (This used to be ctdb commit 3aee5ee1deb4a19be3bd3a4ce3abbe09de763344)	2009-05-21 11:22:21 +10:00
Ronnie Sahlberg	98a54c4675	Track how long it takes to take out the recovery lock from both the main dameon and also from the recovery daemon. Log this in "ctdb statistics". Also add a varaible "RecLockLatencyMs" that will log an error everytime it takes longer than this to access the reclock file. (This used to be ctdb commit 042377ed803bb8f7ca9d6ea1a387427b7b8ba45a)	2009-05-14 10:33:25 +10:00
Ronnie Sahlberg	93a2829e94	check that a node is banned before trying to unban it. (This used to be ctdb commit 4467b5f88d749d455854512f60a5d313cafa828b)	2009-05-12 18:32:41 +10:00
Ronnie Sahlberg	54a5e6c0c8	Add a -Y machinereadable flag to "lvsmaster" (This used to be ctdb commit bbae698656d5da9a4a5b0fbfc3003844f246d54b)	2009-05-11 14:44:59 +10:00
Ronnie Sahlberg	1ee122e165	in the "lvsmaster" command, return -1 if there is no lvsmaster (This used to be ctdb commit ce6afbdef36e3c386b75709f73ef55efe0bd1987)	2009-05-11 13:56:28 +10:00
Ronnie Sahlberg	6721546b53	change the ctdb command table to allow us to describe commands which can be run independtly of the ctdb daemon. create a new debugging command xpnn which discovers the pnn of the local node and which works even if the local daemon is not running (This used to be ctdb commit cd78765f9400d7abce7929a2dd199f65226e7664)	2009-03-25 14:46:05 +11:00
Ronnie Sahlberg	d7ff332896	update how the NATGW configuration works. allow the cluster to be partitioned into multiple disjoint natgw subsets (This used to be ctdb commit 1046885cd22b5001e0251de2e536b5f6793459be)	2009-03-25 13:37:57 +11:00
Ronnie Sahlberg	7265c713db	we need to set the port properly in the parse_ip helper (This used to be ctdb commit 43fe18d86995744ba61c7a6405b70edcb265930a)	2009-03-24 13:45:11 +11:00
root	629d5ee1fa	add a new command "ctdb scriptstatus" this command shows which eventscripts were executed during the last monitoring cycle and the status from each eventscript. If an eventscript timedout or returned an error we also show the output from the eventscript. Example : [root@rcn1 ctdb-git]# ./bin/ctdb scriptstatus 6 scripts were executed last monitoring cycle 00.ctdb Status:OK Duration:0.021 Mon Mar 23 19:04:32 2009 10.interface Status:OK Duration:0.048 Mon Mar 23 19:04:32 2009 20.multipathd Status:OK Duration:0.011 Mon Mar 23 19:04:33 2009 40.vsftpd Status:OK Duration:0.011 Mon Mar 23 19:04:33 2009 41.httpd Status:OK Duration:0.011 Mon Mar 23 19:04:33 2009 50.samba Status:ERROR Duration:0.057 Mon Mar 23 19:04:33 2009 OUTPUT:ERROR: Samba tcp port 445 is not responding Add a new helper function "switch_from_server_to_client()" which both the recovery daemon can use as well as in the child process we start for running the actual eventscripts. Create several new controls, both for the eventscript child process to inform the master daemon of the current status of the scripts as well as for the ctdb tool to extract this information from the runninc daemon. (This used to be ctdb commit c98f90ad61c9b1e679116fbed948ddca4111968d)	2009-03-23 19:07:45 +11:00
Michael Adam	3cca0f75e4	Fix treatment of link local ipv6 addresses: set the scope id. metze / Michael Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 9d12de1ca6107801dada927729e755c0949d73bf)	2009-01-19 22:50:53 +01:00
root	6c1359ab0d	add better errorchecking that nodes we try to talk to using the "ctdb" tool actually exist and that it is connected. two new dedicated ctdb error codes 21: node does not exist 22: node is disconnected (This used to be ctdb commit 7ee6db06162ad5a554058bb6160ad37b24fe42e0)	2008-12-17 14:26:01 +11:00
root	1bf3006665	update the "ctdb recover" command. block and wait until the clustered has completed the recovery before returning. this makes it easier to script since it avoids the common need for ctdb recover ... complex loop to wait for recovery to complete ... script continues (This used to be ctdb commit 8a0df9324a03b0f17772c64a9331236126c22124)	2008-12-10 12:06:51 +11:00
root	1209079672	add a CTDB_TIMEOUT variable for the ctdb tool. If set this specified the maximum runtime for the ctdb tool before it will terminate with status == 20 Just like the -T ... option would. (This used to be ctdb commit c404d57afb2adda039e676877838927d3073df11)	2008-12-10 12:01:19 +11:00
root	58bf3804f0	make sure we return an errorcode when the ctdb command has hung and is timeodout by the -T <timeout> setting (This used to be ctdb commit 993f626e603b9bbc02942bb55096d63b9a4f456b)	2008-12-10 11:49:51 +11:00
root	762d4be8f9	add a helper that waits until the clueter is no longe rin recovery mode and return the generation number. change the ban/unban logic to wait until we are not in recovery before it bans/unbans the node. also wait until after the cluster has recovered from the ban/unban before returning so that the cluster is in recpovery mode == normal when the command returns. this makes it much easier to script things ... (This used to be ctdb commit 39c77371a2f995025a584691fe61af12dc6ed5d7)	2008-12-09 12:03:42 +11:00
root	e4722f8ce4	return -1 if ctdb ping failed (This used to be ctdb commit 691b9c0f1771afa564a5959405f2e7a54c334d45)	2008-12-08 12:57:40 +11:00
root	e54347fa4e	redo and update how we synchronize flags across the cluster. this simplifies the code and should close a race condition between the local recovery daemon and a remote node when flags are changing. (This used to be ctdb commit 32d460b8469eb53145f04161a5d01166f9b5f09e)	2008-12-05 16:32:30 +11:00
Ronnie Sahlberg	539f044aa3	print the list of valid debug level literals when an invalid debug level is specified in 'ctdb setdebug' (This used to be ctdb commit 979e78cfd96d74686af6f55f726c395a75275803)	2008-12-02 14:08:10 +11:00
Ronnie Sahlberg	edb7241c05	redesign how reloadnodes is implemented. modify the transport methods to allow to restart individual connections and set up destructors properly. only tear down/set-up tcp connections to nodes removed from the cluster or nodes added to the cluster. Leave tcp connections to unchanged nodes connected. make "ctdb reloadnodes" explicitely cause a recovery of the cluster once the files have been realoaded (This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b)	2008-12-02 13:26:30 +11:00
root	7592a97d16	debuglevel is a signed int, not usnigned. (This used to be ctdb commit e577a276900854622f4e9da9d1ccd7b484d0d1ec)	2008-11-28 11:29:43 +11:00
Ronnie Sahlberg	51cc8b4df8	make it possible to delete an ip from all nodes at once using "ctdb delip x.x.x.x -n all" This is not as straightforward as one might think since during the delete process we don not want the ip to be bouncing from one node to another as node by node deletes it. Thus we first delete the ip from all connected nodes which are not currently hosting it. After this we delete the ip from the node which is hosting it. (This used to be ctdb commit bbd46f341e9aa32d8dbd49f7a9a07cb3f1f92ea3)	2008-11-28 09:52:26 +11:00
Ronnie Sahlberg	b9bd20ce55	add a context and a timed event so that once we have been in recovery mode for too long we drop all public ip addresses (This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0)	2008-10-22 11:04:41 +11:00
Ronnie Sahlberg	6e490e8cce	verify that the nodes we try to ban/unban are operational and print an error to the user othervise. (This used to be ctdb commit 5747dd2d80af29d6252afb6aeb3e66328ee20de5)	2008-10-15 01:23:57 +11:00
Ronnie Sahlberg	156662e257	Check that a database exists first before we dump its content (and implicitely also create it) using 'ctdb catdb' (This used to be ctdb commit 647003da975d4823abe8ed2bfb46153d68ea0fb0)	2008-09-23 01:38:28 +10:00
Ronnie Sahlberg	54ffdfd50a	i add a new ctdb command "ctdb recmaster" this shows the node id of hte current recmaster (This used to be ctdb commit 3ff0711fd3b288c153218ad33e8462a94b8d3275)	2008-09-12 12:06:53 +10:00
Ronnie Sahlberg	d83fc7e389	when we collect all ip addresses and sort them for the "ctdb ip -n all" output we must look at more than just the first 4 bytes of the sockaddr address or ipv6 wont work (This used to be ctdb commit 4dfbfb4618433d9ed79ca1bdb1e2e51d96d4ee62)	2008-08-22 09:09:08 +10:00
Ronnie Sahlberg	ef997d344f	initial ipv6 patch Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com> (This used to be ctdb commit 1f131f21386f428bbbbb29098d56c2f64596583b)	2008-08-19 14:58:29 +10:00
Ronnie Sahlberg	ed6ca6a84d	use a local tdb_traverse instead of a ctdb_pulldb to lessen the impact of the system while performing a database backup (This used to be ctdb commit 48fad9c06185a1f2580473cac02b3722e35c2023)	2008-08-14 10:57:08 +10:00
Ronnie Sahlberg	d793154dbb	only freeze the local node when doing a backup and not the entire cluster (This used to be ctdb commit ff413beb4bb31e277e843235a1ce5e5ad7b92c71)	2008-08-14 09:52:23 +10:00
Ronnie Sahlberg	748cf6f274	store the database name, not the backup filename in the database header (This used to be ctdb commit 0674b33a7492cc1a194833f5ca87d8b30457faee)	2008-08-14 08:36:39 +10:00
Ronnie Sahlberg	6f5ee6b5cc	Encode a file version number in the database backup header Encode the database name in the header so we dont need to provide the database name when doing a restore Encode a timestamp in the header telling us when the backup was created (This used to be ctdb commit 77762170ad1dbc4620565bb898af5d493fac117d)	2008-08-14 08:35:19 +10:00
Ronnie Sahlberg	65ae40d4a9	Add two new ctdb commands : ctdb backupdb : which will copy a database out from ctdb and write it to a file ctdb restoredb : which will read a database backup from a file and write it into ctdb (This used to be ctdb commit b567e215f5c58d646a392408b9cc1df8ef029b33)	2008-08-13 22:03:29 +10:00
Ronnie Sahlberg	b9d8bb23af	remove the reclock file we store pnn counts in. This file creates additional locking stress on the backend filesystem and we may not need it anyway. (This used to be ctdb commit 84236e03e40bcf46fa634d106903277c149a734f)	2008-08-06 11:52:26 +10:00
Ronnie Sahlberg	efd840276b	Add three mode commands to the CTDB tool. lvs: which shows which nodes are active LVS servers lvsmaster: which shows which node is the lvs master multiplex node pnn: which prints the pnn of the local node (This used to be ctdb commit 00025eef662b867293829228c681df491cd6f371)	2008-07-10 11:12:58 +10:00
Ronnie Sahlberg	ab8535eaa5	make LVS a capability so that we can see which nodes are configured with LVS and which are not using LVS. "ctdb getcapabilities" (This used to be ctdb commit 172d01fb34f032e098b1c77a7b0f17bf11301640)	2008-07-10 10:37:22 +10:00
Ronnie Sahlberg	ef769e7237	track both when we last started and ended a recovery. make ctdb uptime print how long the recovery took in the recovery daemon when we check that the public ip address allocation on the local node is correct (we have the ips we should have and we dont have any we shouldnt have) use ctdb uptime and check the recovery start/stop times and make sure we dont check for ip allocation inconsistencies during a recovery where the ip address allocation is in flux. (This used to be ctdb commit f86551580349b7f662f9a07e4eb0c1189e38e429)	2008-07-02 13:55:59 +10:00
Ronnie Sahlberg	4b6b094860	add a callback for failed nodes to the async control helper. this callback is called for every node where the control failed (or timed out) when we issue the start recovery control from recovery master, set any node that fails as a culprit so it will eventually be banned (This used to be ctdb commit 72f89bac13cbe8c3ca3e7a942469cd2ff25abba2)	2008-06-12 16:53:36 +10:00
Ronnie Sahlberg	d8433cacb2	first cut to convert takeover_callback_state{} to use ctdb_sock_addr instead of sockaddr_in (This used to be ctdb commit 5444ebd0815e335a75ef4857546e23f490a22338)	2008-06-04 17:12:57 +10:00
Ronnie Sahlberg	7d39ac131b	convert handling of gratious arps and their controls and helpers to use the ctdb_sock_addr structure so tehy work for both ipv4 and ipv6 (This used to be ctdb commit 86d6f53512d358ff68b58dac737ffa7576c3cce6)	2008-06-04 15:13:00 +10:00
Ronnie Sahlberg	1c88f422d5	add a parameter for the tdb-flags to the client function ctdb_attach() so that we can pass TDB_NOSYNC when we attach to a persistent database and want fast unsafe writes instead of slow but safe tdb_transaction writes. enhance the ctdb_persistent test suite to test both safe and unsafe writes (This used to be ctdb commit 4948574f5a290434f3edd0c052cf13f3645deec4)	2008-06-04 10:46:20 +10:00
Ronnie Sahlberg	2d06861247	debugleves can now be negative so print their value using %d instead of %u (This used to be ctdb commit f0b55adae450cac3cf925e111e1dc9628cff4525)	2008-05-29 08:19:35 +10:00
Ronnie Sahlberg	ceaf488f05	do persistent writes in a child process (This used to be ctdb commit 2da3d1f876f5d654f849af8a3e588f5a61300c3d)	2008-05-28 13:04:25 +10:00
Ronnie Sahlberg	9da30f488d	add "machinereadable output" support to "ctdb getmonmode" (This used to be ctdb commit 9aa09aee618fa71787c5d0e7c885e83f4d82236c)	2008-05-16 09:21:44 +10:00
Ronnie Sahlberg	909ff219e0	Start implementing support for ipv6. This enhances the framework for sending tcp tickles to be able to send ipv6 tickles as well. Since we can not use one single RAW socket to send both handcrafted ipv4 and ipv6 packets, instead of always opening TWO sockets, one ipv4 and one ipv6 we get rid of the helper ctdb_sys_open_sending_socket() and just open (and close) a raw socket of the appropriate type inside ctdb_sys_send_tcp(). We know which type of socket v4/v6 to use based on the sin_family of the destination address. Since ctdb_sys_send_tcp() opens its own socket we no longer nede to pass a socket descriptor as a parameter. Get rid of this redundant parameter and fixup all callers. (This used to be ctdb commit 406a2a1e364cf71eb15e5aeec3b87c62f825da92)	2008-05-14 15:47:47 +10:00
Ronnie Sahlberg	123d0b3b1e	fix merge corruption (This used to be ctdb commit 17b1e3b2d72c453a0b2f5a783c28f9dd17334620)	2008-05-08 19:52:27 +10:00
Andrew Tridgell	e8a62cdca4	Merge branch 'master' of git://git.samba.org/sahlberg/ctdb (This used to be ctdb commit cb2c05d5d3f8908eecdad1ae6a1dc8efa1ffcb1e)	2008-05-08 16:58:34 +10:00
Ronnie Sahlberg	92b61cd7d5	Expand the client async framework so that it can take a callback function. This allows us to use the async framework also for controls that return outdata. Add a "capabilities" field to the ctdb_node structure. This field is only initialized and kept valid inside the recovery daemon context and not inside the main ctdb daemon. change the GET_CAPABILITIES control to return the capabilities in outdata instead of in the res return variable. When performing a recovery inside the recovery daemon, read the capabilities from all connected nodes and update the ctdb->nodes list of nodes. when building the new vnnmap after the database rebuild in recovery, do not include any nodes which lack the LMASTER capability in the new vnnmap. Unless there are no available connected node that sports the LMASTER capability in which case we let the local node (recmaster) take on the lmaster role temporarily (i.e. become a member of the vnnmap list) (This used to be ctdb commit 0f1883c69c689b28b0c04148774840b2c4081df6)	2008-05-06 15:42:59 +10:00
Ronnie Sahlberg	a9c45f9513	Add a capabilities field to the ctdb structure Define two capabilities : can be recmaster can be lmaster Default both capabilities to YES Update the ctdb tool to read capabilities off a node (This used to be ctdb commit 50f1255ea9ed15bb8fa11cf838b29afa77e857fd)	2008-05-06 10:02:27 +10:00
Ronnie Sahlberg	6fd1c60741	when deleting a public ip from a node that is currently hosting this ip, try to move the ip address to a different node first (This used to be ctdb commit 9395a05de669c69396e701fb36409ec49d3ebef6)	2008-04-24 21:51:08 +10:00
Ronnie Sahlberg	60583c70bc	when adding a new public ip address to a running node using the 'ctdb addip' command, If no other node is hosting this public ip at the moment, then assign it immediately to the current node. (This used to be ctdb commit a63825e32658b36e0964584758b9a276c18056b8)	2008-04-23 21:05:36 +10:00
Ronnie Sahlberg	b1b8aeb414	make ctdb eventrscipt accept the -n all argument to run the event script on all connected nodes (This used to be ctdb commit 772052e071718f20a19d24d5e06a5a2ef87549f2)	2008-04-22 22:23:57 +02:00
Ronnie Sahlberg	bb237ab5ec	add support for -n all in "ctdb -n all ip" this collects all public addresses from all nodes and presents the public ips for the entire cluster (This used to be ctdb commit cbf79b2158ab21a58aef967e89f0bd60890a7972)	2008-04-22 22:18:54 +02:00
Ronnie Sahlberg	a6cbe34c62	add support for -n all in "ctdb -n all ip" this collects all public addresses from all nodes and presents the public ips for the entire cluster (This used to be ctdb commit 0a4e667f42c6fb23be13651f7b0d0a545a49900b)	2008-04-23 00:55:57 +10:00
Ronnie Sahlberg	f153a45e56	make ctdb eventrscipt accept the -n all argument to run the event script on all connected nodes (This used to be ctdb commit 3fad7d67f2c66ac3a65cfd821fd6db6342f4a3f0)	2008-04-15 18:24:48 +10:00
Ronnie Sahlberg	416409d31b	add a ctdb command to print the ctdb version (This used to be ctdb commit 401fb01f8cb06886e2c5c277a9a70512a9b68579)	2008-04-03 17:07:00 +11:00
Ronnie Sahlberg	d6736b3720	we allocated one byte too little in the blob we need to send as the control to the server. (This used to be ctdb commit 10e585413c217d9b9c32ff3d2fb3d8f24183c458)	2008-04-03 16:35:23 +11:00
Ronnie Sahlberg	e8e67ef576	add a mechanism to force a node to run the eventscripts with arbitrary arguments ctdb eventscript "command argument argument ..." (This used to be ctdb commit 118a16e763d8332c6ce4d8b8e194775fb874c8c8)	2008-04-02 11:13:30 +11:00
Ronnie Sahlberg	27a7f854f5	add improvements to tracking memory usage in ctdbd adn the recovery daemon and a ctdb command to pull the talloc memory map from a recovery daemon ctdb rddumpmemory (This used to be ctdb commit d23950be7406cf288f48b660c0f57a9b8d7bdd05)	2008-04-01 15:34:54 +11:00
Ronnie Sahlberg	a1334246cf	make sure the iface string is nullterminated in the addip control packet (This used to be ctdb commit 983490556bc12fe03de4c22b5fdc12d15c11d43c)	2008-03-31 12:49:39 +11:00
Ronnie Sahlberg	0d7b34c9e5	Add two new controls to add/delete public ip address from a node at runtime. The controls only modify the runtime setting of which public addresses a node can server and does not modify /etc/ctdb/public_addresses. To make the change permanent you also need to edit /etc/ctdb/public_addresses manually. After ip addresses have been added/deleted you need to invoke a recovery for the ip addresses to be redistributed. (This used to be ctdb commit f8294d103fdd8a720d0b0c337d3973c7fdf76b5c)	2008-03-27 09:23:27 +11:00
Ronnie Sahlberg	2863d2cfd1	From M Dietz, Add back the controls to enable/disable monitoring we used to have for debugging but removed a while ago (This used to be ctdb commit 8477f6a079e2beb8c09c19702733c4e17f5032fe)	2008-03-25 08:27:38 +11:00
Ronnie Sahlberg	b1cf2b5653	Update ctdb uptime to provide machinereadable output (This used to be ctdb commit 4f7f8aa6f178115b551ac35f7df2ec5aad054fe2)	2008-03-04 13:29:48 +11:00
Ronnie Sahlberg	61b52e0e64	provide machinereadble -Y output for 'ctdb getdebug' (This used to be ctdb commit 646f4d9a01637685e967fb3ecc042fc97c0b7529)	2008-03-04 13:23:06 +11:00
Ronnie Sahlberg	212fbb42d5	make 'ctdb ip' provide machinereadble output using '-Y' (This used to be ctdb commit 446e2f4e650b12d6fce5677a6841006462c23dba)	2008-03-04 13:18:27 +11:00
Ronnie Sahlberg	d9b534b59d	A new command to 'ctdb' ctdb moveip <IPADDRESS> <NODE> which can be used to manually fail an ip address over to a specific node. This can only be used if DeteministicIPs are disabled and also only if NoIPFailback is enabled. (This used to be ctdb commit ffee062b7e26a6aa6ad254edb58399040ecaa542)	2008-03-04 12:20:23 +11:00
Ronnie Sahlberg	e0036942bc	add a new file <reclock>.pnn where each recovery daemon can lock that byte at offset==pnn to offer an alternative way to detect which nodes are active instead of relying on CONNECTED being accurate. (This used to be ctdb commit 21d3319eaf463e2a00637d440ee2d4d15f53bf09)	2008-02-29 12:37:42 +11:00
Ronnie Sahlberg	4adeafef11	add a control to get the name of the reclock file from the daemon (This used to be ctdb commit 9effb22cc1616d684352d7ebabb359e69adb0f52)	2008-02-29 10:03:39 +11:00
Ronnie Sahlberg	16c4e9c4aa	make the ctdb reloadnodes reload the nodes file on all nodes and restart the transport (This used to be ctdb commit 6272ad33b4af6ea9d6fd0ac877df3f75be45d665)	2008-02-21 08:25:01 +11:00
Ronnie Sahlberg	9f99b44fd1	to make it easier/less disruptive to add nodes to a running cluster add a new control that causes the node to drop the current nodes list and reread it from the nodes file. During this operation, the node will also drop the tcp layer and restart it. When we drop the tcp layer, by talloc_free()ing the ctcp structure add a destructor to ctcp so that we also can clean up and remove the references in the ctdb structure to the transport layer add two new commands for the ctdb tool. one to list all nodes in the nodesfile and the second a command to trigger a node to drop the transport and reinitialize it with the nde nodes file (This used to be ctdb commit 4bc20ac73e9fa94ffd43cccb6eeb438eeff9963c)	2008-02-19 14:44:48 +11:00
Andrew Tridgell	275cd68867	nicer use of structures and use isalpha() (This used to be ctdb commit 19b5fbcd16596a4b6c22056585dd4bd988db3db7)	2008-02-05 10:36:06 +11:00
Ronnie Sahlberg	3f56526037	Specify and print debuglevels by name and not by number (This used to be ctdb commit 79ad830294b8b677fbd0c5ad7ed6fbde71f74f8d)	2008-02-05 10:26:23 +11:00
Andrew Tridgell	f6e53f433b	merge from ronnie (This used to be ctdb commit e7b57d38cf7255be823a223cf15b7526285b4f1c)	2008-02-04 20:07:15 +11:00
Andrew Tridgell	3777346629	partial merge from ronnie (This used to be ctdb commit fd316deb8a9e0545c8efa1bfc8ad83962b310405)	2008-01-29 11:39:06 +11:00
Andrew Tridgell	eb044bb1d6	make ctdb dumpmemory work remotely, and dump the talloc memory tree to stdout. This is much more useful than putting it in the log, and also fixes a bug where the pipe would overflow internally and cause ctdbd to lockup (This used to be ctdb commit e236979e2162d9bd7a495086342168a696cf76c5)	2008-01-22 14:22:41 +11:00

1 2 3 4 5 ...

293 Commits