Andrew Tridgell
76b2822340
- startup frozen, and do an initial recovery
...
- fixed a bug in traverse
- get a lock on the node list file in the recmaster recovery daemon
(This used to be ctdb commit 162a5647535ad1cb3e8e5d4042a2784365fb1913)
2007-05-23 14:35:19 +10:00
Andrew Tridgell
9f7a70657f
start ctdb frozen, and let the election sort things out. This prevents a race on startup
...
(This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd)
2007-05-23 12:23:07 +10:00
Andrew Tridgell
edf03e88c8
global lock should imply the transaction lock
...
(This used to be ctdb commit 765a7e27344476ea8835565839b69872ea6ab6f0)
2007-05-22 13:14:51 +10:00
Ronnie Sahlberg
e989a1bac8
add controls to enable/disable the monitoring of dead nodes
...
(This used to be ctdb commit 79d29c39bb81feb069db3fc6d3d392c1e75a4d13)
2007-05-21 09:24:34 +10:00
Andrew Tridgell
c02fee6ed3
show ctdb control timeout
...
(This used to be ctdb commit 03921eacb942c7aa6b81236c6603dc640cefe2bf)
2007-05-19 21:11:06 +10:00
Andrew Tridgell
d549f1e1a3
merge from ronnie
...
(This used to be ctdb commit 985d718e03510398b9a5cfdf6a4d559a90738a11)
2007-05-19 17:21:58 +10:00
Ronnie Sahlberg
02a9f1b0a0
use ctdb_dead_node() instead of reimplementing the same code again
...
this leaves only one single function where a node is marked as dead
instead of two places
(This used to be ctdb commit aa764ea26cc26d5c1ae188105236da603576f45b)
2007-05-19 16:59:10 +10:00
Andrew Tridgell
ab66fb840e
removed obsolete ctdb_dump tool
...
(This used to be ctdb commit e3ed6fd65896f07fc76405acb2e16f50f04a0a3c)
2007-05-19 14:07:01 +10:00
Andrew Tridgell
45d665ab20
don't count packets received from before the transport told us the node was dead
...
(This used to be ctdb commit 46b5319963afa9da6cf6d83f2a5dc1fa664e14b4)
2007-05-19 14:04:48 +10:00
Andrew Tridgell
13392414b1
nicer date formatting
...
(This used to be ctdb commit 78c256caa738593e335de65df752ff015f3a38e1)
2007-05-19 14:01:02 +10:00
Andrew Tridgell
a14fd9d29c
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait
...
(This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520)
2007-05-19 13:45:24 +10:00
Ronnie Sahlberg
5f4eed84e0
increase the tx_cnt everytime we send a packet to a node
...
(This used to be ctdb commit d3fea0d87268e552f4ed3a90d10b3af209413f73)
2007-05-19 10:27:17 +10:00
Ronnie Sahlberg
9f7b9faf64
add a node->tx_cnt counter
...
only send keepalive packets if the count is zero
(This used to be ctdb commit 2cbd424231caccf0a531cf6501761115efe68f3e)
2007-05-19 10:20:19 +10:00
Andrew Tridgell
28f2fc669b
a better way to resend calls after recovery
...
(This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4)
2007-05-19 00:56:49 +10:00
Andrew Tridgell
049e1504ee
timeout pending controls immediately when a node becomes disconnected
...
(This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0)
2007-05-18 23:48:29 +10:00
Andrew Tridgell
346dfc1bef
- up rx_cnt on all packet types
...
- notice when a node becomes available again
(This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a)
2007-05-18 23:23:36 +10:00
Ronnie Sahlberg
334a294481
add a missing file :-)
...
(This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5)
2007-05-18 20:06:29 +10:00
Ronnie Sahlberg
db4c479568
add dead node detection so that if a node does not generate any
...
keepalive traffic for x seconds it is deemed dead
this triggers a recovery after a while if a ctdbd has been STOPPED
but it doesnt recover automatically when the node reappears
(This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a)
2007-05-18 19:19:35 +10:00
Andrew Tridgell
2d9f0ee606
merge from samba4
...
(This used to be ctdb commit 3028a1b5c21b981c4487b55f026e6645bbeada89)
2007-05-18 11:24:45 +10:00
Andrew Tridgell
51f5bd2b31
merge from samba4
...
(This used to be ctdb commit b280bb01fc9afea5a9168c63d92907dac725b22b)
2007-05-18 08:33:10 +10:00
Andrew Tridgell
49fe66713f
- don't try to send controls to dead nodes
...
- use only connected nodes in a traverse
(This used to be ctdb commit 9a676dd5d331022d946a56c52c42fc6985b93dbc)
2007-05-17 23:23:41 +10:00
Andrew Tridgell
0b11bd99dd
merged from samba4
...
(This used to be ctdb commit 23b1c00f8bf60739aee935bac44981b68477ee1f)
2007-05-17 17:54:02 +10:00
Andrew Tridgell
874fd5c2f7
removed the CTDB_CTRL_FLAG_NOREQUEUE flag
...
(This used to be ctdb commit 366e849f6f350eda78d79cf1ea55c2637e605c86)
2007-05-17 14:10:38 +10:00
Andrew Tridgell
fb77fd0bed
merge from ronnie
...
(This used to be ctdb commit f80c4b02ffb474a4710fa411123448e3ce2d004a)
2007-05-17 12:46:50 +10:00
Andrew Tridgell
6d32a6dd50
merged debug changes from samba4
...
(This used to be ctdb commit f01180148635b9fcc0f50beddb87be0a2071dfd4)
2007-05-17 12:43:09 +10:00
Andrew Tridgell
241ce32f1d
merged events changes from samba4
...
(This used to be ctdb commit cd9280556663bc8929672ada6ea556e097c4b2b3)
2007-05-17 12:39:53 +10:00
Ronnie Sahlberg
f4738f9c41
we no longer pass lmaster across during pulldb so dont print it from
...
catdb either
(This used to be ctdb commit b57d60f4789ea7f0dd69c93f6629d8742e182576)
2007-05-17 12:07:29 +10:00
Ronnie Sahlberg
cc760cf13a
add a control to shutdown/kill a node
...
(This used to be ctdb commit 3802f7304fd59d56062c855987e2561753e85a69)
2007-05-17 10:45:31 +10:00
Ronnie Sahlberg
d6ed77468d
merge from tridge
...
(This used to be ctdb commit 0c6dc471e33e80db00a2b006262c4107f39fa023)
2007-05-16 18:44:51 +10:00
Andrew Tridgell
c105f6d789
- merge from ronnie
...
- fixed a memory leak found by dmitry
(This used to be ctdb commit ae87bf0005666b50850161c3843d6bc7cb5c8971)
2007-05-16 18:10:26 +10:00
Ronnie Sahlberg
f4056d2e28
remove a prototype we no longer need
...
(This used to be ctdb commit 4a11373ec5e8196cf430f18f6171915f790f794b)
2007-05-16 14:45:43 +10:00
Ronnie Sahlberg
a4ebb6d5ef
if a caller specifies a timeout when calling a control, it makes no
...
sense to have the daemon requeue the packets if they timeout or fail to
deliver to the remote node
(This used to be ctdb commit 9fb753046787190970654aeb937e96685ac53184)
2007-05-16 12:34:30 +10:00
Ronnie Sahlberg
4b8ddfccad
merge from tridge
...
(This used to be ctdb commit 8d424b41d6cf2973b28a749d1b8e6a028dad9ffe)
2007-05-16 11:12:28 +10:00
Andrew Tridgell
20d96ad5c5
enable TCP keepalives
...
(This used to be ctdb commit a44f760f6260359201d8431d2f1267af2bc6b1b1)
2007-05-15 18:40:56 +10:00
Andrew Tridgell
a5198559c9
moved the recovery daemon into the main ctdbd and enable it by default
...
(This used to be ctdb commit 2a7d42124731f43d013cb76a798525eab4cc1ee0)
2007-05-15 15:13:36 +10:00
Andrew Tridgell
527b2352ac
fixed two more places where we don't correctly handle write errors on sockets
...
(This used to be ctdb commit f4a71bb63e7f75d21b66f9eaeac997c2029cd146)
2007-05-15 14:08:58 +10:00
Ronnie Sahlberg
5d82f89a55
merge from tridge
...
(This used to be ctdb commit eb64cde53ec5ed6949df1684e5c148f2294b1da7)
2007-05-15 10:34:14 +10:00
Andrew Tridgell
67f5601bef
fixed a fd close error on reconnect
...
(This used to be ctdb commit 240651a6f67f914b06e273696cef6180d788221e)
2007-05-15 10:33:28 +10:00
Ronnie Sahlberg
0d71b6d1e6
merge from tridge
...
(This used to be ctdb commit 0697f59a044deeab126a39bff97bcd5c1101298e)
2007-05-15 10:28:41 +10:00
Andrew Tridgell
c6afe22b92
added a control to get the local vnn
...
(This used to be ctdb commit 0b109f574b710f290372512d0694290ea7cd4368)
2007-05-15 10:17:16 +10:00
Andrew Tridgell
38491de84f
check for error on ctdb_ltdb_store
...
(This used to be ctdb commit c4a34bac4ad4d2f9699e08074668d25586e3c0da)
2007-05-15 10:16:59 +10:00
Andrew Tridgell
cf1056df94
added a -i switch to run ctdbd without forking
...
(This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4)
2007-05-15 09:44:33 +10:00
Andrew Tridgell
db4fa58b07
reading on the write side of a pipe isn't allowed - this caused us to run without locking in the lockwait code
...
(This used to be ctdb commit 2ac67ce737f30258915cc25bde531d361092ae14)
2007-05-15 09:44:03 +10:00
Andrew Tridgell
7d3870d41f
AIX needs sin_len field for bind()
...
(This used to be ctdb commit cd6c35d4aa4f4a4cfeedf6902cda84e43d7aeba4)
2007-05-15 09:42:52 +10:00
Ronnie Sahlberg
5d0e7e354f
merge from tridge
...
(This used to be ctdb commit d1dae4fc8f4c2d16d313a27968d67c5825a133d1)
2007-05-14 14:07:19 +10:00
Ronnie Sahlberg
4b6d949cd3
merge from tridge
...
(This used to be ctdb commit 65f4415e618dbbac0260f6a4e51e051e6df64a61)
2007-05-14 14:05:49 +10:00
Andrew Tridgell
3f3eb4f636
we must not free the fde until after we no longer need the lock child
...
(This used to be ctdb commit e06776c7c37b63f5c3165c7043d665e0c1a95337)
2007-05-14 14:01:33 +10:00
Andrew Tridgell
d655a64b5b
kill the lockwait child if the pipe goes away
...
(This used to be ctdb commit bdfa8ba9932fade074a05a6cb6bc14ae3b84618c)
2007-05-14 13:49:01 +10:00
Andrew Tridgell
9e70b95159
don't allow setvnnmap while not frozen
...
(This used to be ctdb commit a73f47f565894cc7e346177d87f2e6813837e1c6)
2007-05-14 13:48:40 +10:00
Andrew Tridgell
5d71cc78bb
don't allow setrecmaster while not frozen
...
(This used to be ctdb commit e84b05ba6062ffc45b7f3c23e88feef1d39069c4)
2007-05-14 13:48:14 +10:00