Andrew Tridgell
ccf4d78e04
- renamed ctdb_control utility to ctdb
...
- use -n to specify node number in ctdb utility
- change 'ctdb status' to 'ctdb statistics'
- added 'ctdb status' which shows status
- added netmask to public IPs, so you don't try a takeover on a
foreign network
- cleaned up tools/ctdb_control.c a lot
- generate usage message at runtime
(This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656)
2007-05-29 12:16:59 +10:00
Andrew Tridgell
dcd23d47a7
another place where we could send a partial packet
...
(This used to be ctdb commit 4e0c57718a0c9f51b5b2e02bbf516c63c5be0a35)
2007-05-28 08:37:54 +10:00
Andrew Tridgell
9cc3ce8554
automatic cleanup of tcp tickle records
...
(This used to be ctdb commit ede79b571bf89b89f1b8394f262ca0689f8c65f3)
2007-05-28 00:34:40 +10:00
Andrew Tridgell
d41290fbae
added code to ctdb to send a tcp 'tickle' ack when we takeover an
...
IP. A raw tcp ack is sent for each tcp connection held by clients
before the IP takeover.
These acks have a deliberately incorrect sequence number, and should
cause the windows client to send its own ack which will in turn cause
a tcp reset and thus cause windows clients to much more quickly
reconnect to the new node.
(This used to be ctdb commit eef38bfe8461b47489d169c61895d6bb8a8f79a1)
2007-05-27 15:26:29 +10:00
Andrew Tridgell
647540253e
tweak timeouts
...
(This used to be ctdb commit 54a90797469f56d796efd82e9294efff3c5dabcc)
2007-05-27 09:43:25 +10:00
Andrew Tridgell
35955c1476
drop any partialialy send packets when we get a socket write error
...
(This used to be ctdb commit 4f7b97af8e0075de56059fffc57a86cb0c16df46)
2007-05-26 16:41:32 +10:00
Andrew Tridgell
9aa692669b
paranoid checks for bad packets in tcp layer. Close the socket if it gets a bad packet
...
(This used to be ctdb commit 1277089e5c6e1036517c63ee8c8e4ff98cb76cf8)
2007-05-26 16:32:32 +10:00
Andrew Tridgell
2b86216b66
handle corrupt ctdb packets better
...
(This used to be ctdb commit aea4d78645a99eb22860048350b5f20841c17704)
2007-05-26 14:46:12 +10:00
Andrew Tridgell
cc4d8102cd
moved system specific ip code to system.c
...
(This used to be ctdb commit 9de9e4ccda9665108baac12a8716b189d26340b1)
2007-05-26 14:01:08 +10:00
Andrew Tridgell
9e61a5bd77
send a message to clients when an IP has been released
...
(This used to be ctdb commit 8b7ab0b00253462593d368052c2cb10a385b4e63)
2007-05-26 00:05:30 +10:00
Andrew Tridgell
3c3ed0989e
paraoid check for empty db on attach
...
(This used to be ctdb commit f46ecc15d8d6520a7bce721d915897deba84a54f)
2007-05-25 18:48:49 +10:00
Andrew Tridgell
56e3eed3d1
added IP takeover logic for public IPs to ctdb
...
(This used to be ctdb commit 374adb729472670f35cef41269b8719f49c0de0e)
2007-05-25 17:04:13 +10:00
Ronnie Sahlberg
2b6c39a0af
add controls to take over and release an ip address
...
add sending of grat arp both normal grat arp (request) and also
unsolicited grat arp replies
(This used to be ctdb commit 7305c00c21c30bdbafc3722a018513378bd307e6)
2007-05-25 13:05:25 +10:00
Andrew Tridgell
7596347844
make ctdbd realtime if possible
...
(This used to be ctdb commit 8852f6cca52b64a5239c83ab7c6a99ae4edb2597)
2007-05-24 14:52:10 +10:00
Andrew Tridgell
6ba5885f07
raise the control timeout in recovery
...
(This used to be ctdb commit 43424ff66daf28c202c12982f20a9f662b6fb125)
2007-05-24 13:49:27 +10:00
Andrew Tridgell
5dc81da87e
fixed %d which should be %u
...
(This used to be ctdb commit 2792cf718ff1e66fe99f870f683a13baa160f629)
2007-05-23 20:15:09 +10:00
Andrew Tridgell
296e15c9d4
fixed some memory leaks on the traverse code
...
(This used to be ctdb commit 2781cbb7d00c5448449216c8c0c1b37bdc74a6c0)
2007-05-23 20:06:37 +10:00
Andrew Tridgell
70912e2b0c
added automatic vacuuming of empty records during recovery
...
(This used to be ctdb commit f9181a784ac7009df5e9c996f4e0c3e99098b59a)
2007-05-23 17:21:14 +10:00
Andrew Tridgell
3736943ead
- get rid of ctdb_ctrl_get_config
...
- cope with zero timeout in ctdb_control
(This used to be ctdb commit 9f24689eb0ad6813f1ac78e033cdffe5d8afb128)
2007-05-23 15:18:30 +10:00
Andrew Tridgell
74bf76ca10
merge from ronnie
...
(This used to be ctdb commit 267481b67152bc5885884d223085aa9ef5fe73bd)
2007-05-23 14:50:41 +10:00
Andrew Tridgell
76b2822340
- startup frozen, and do an initial recovery
...
- fixed a bug in traverse
- get a lock on the node list file in the recmaster recovery daemon
(This used to be ctdb commit 162a5647535ad1cb3e8e5d4042a2784365fb1913)
2007-05-23 14:35:19 +10:00
Andrew Tridgell
9f7a70657f
start ctdb frozen, and let the election sort things out. This prevents a race on startup
...
(This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd)
2007-05-23 12:23:07 +10:00
Ronnie Sahlberg
e989a1bac8
add controls to enable/disable the monitoring of dead nodes
...
(This used to be ctdb commit 79d29c39bb81feb069db3fc6d3d392c1e75a4d13)
2007-05-21 09:24:34 +10:00
Andrew Tridgell
c02fee6ed3
show ctdb control timeout
...
(This used to be ctdb commit 03921eacb942c7aa6b81236c6603dc640cefe2bf)
2007-05-19 21:11:06 +10:00
Andrew Tridgell
d549f1e1a3
merge from ronnie
...
(This used to be ctdb commit 985d718e03510398b9a5cfdf6a4d559a90738a11)
2007-05-19 17:21:58 +10:00
Ronnie Sahlberg
02a9f1b0a0
use ctdb_dead_node() instead of reimplementing the same code again
...
this leaves only one single function where a node is marked as dead
instead of two places
(This used to be ctdb commit aa764ea26cc26d5c1ae188105236da603576f45b)
2007-05-19 16:59:10 +10:00
Andrew Tridgell
45d665ab20
don't count packets received from before the transport told us the node was dead
...
(This used to be ctdb commit 46b5319963afa9da6cf6d83f2a5dc1fa664e14b4)
2007-05-19 14:04:48 +10:00
Andrew Tridgell
a14fd9d29c
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait
...
(This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520)
2007-05-19 13:45:24 +10:00
Ronnie Sahlberg
5f4eed84e0
increase the tx_cnt everytime we send a packet to a node
...
(This used to be ctdb commit d3fea0d87268e552f4ed3a90d10b3af209413f73)
2007-05-19 10:27:17 +10:00
Ronnie Sahlberg
9f7b9faf64
add a node->tx_cnt counter
...
only send keepalive packets if the count is zero
(This used to be ctdb commit 2cbd424231caccf0a531cf6501761115efe68f3e)
2007-05-19 10:20:19 +10:00
Andrew Tridgell
28f2fc669b
a better way to resend calls after recovery
...
(This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4)
2007-05-19 00:56:49 +10:00
Andrew Tridgell
049e1504ee
timeout pending controls immediately when a node becomes disconnected
...
(This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0)
2007-05-18 23:48:29 +10:00
Andrew Tridgell
346dfc1bef
- up rx_cnt on all packet types
...
- notice when a node becomes available again
(This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a)
2007-05-18 23:23:36 +10:00
Ronnie Sahlberg
334a294481
add a missing file :-)
...
(This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5)
2007-05-18 20:06:29 +10:00
Ronnie Sahlberg
db4c479568
add dead node detection so that if a node does not generate any
...
keepalive traffic for x seconds it is deemed dead
this triggers a recovery after a while if a ctdbd has been STOPPED
but it doesnt recover automatically when the node reappears
(This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a)
2007-05-18 19:19:35 +10:00
Andrew Tridgell
49fe66713f
- don't try to send controls to dead nodes
...
- use only connected nodes in a traverse
(This used to be ctdb commit 9a676dd5d331022d946a56c52c42fc6985b93dbc)
2007-05-17 23:23:41 +10:00
Andrew Tridgell
874fd5c2f7
removed the CTDB_CTRL_FLAG_NOREQUEUE flag
...
(This used to be ctdb commit 366e849f6f350eda78d79cf1ea55c2637e605c86)
2007-05-17 14:10:38 +10:00
Ronnie Sahlberg
f4738f9c41
we no longer pass lmaster across during pulldb so dont print it from
...
catdb either
(This used to be ctdb commit b57d60f4789ea7f0dd69c93f6629d8742e182576)
2007-05-17 12:07:29 +10:00
Ronnie Sahlberg
cc760cf13a
add a control to shutdown/kill a node
...
(This used to be ctdb commit 3802f7304fd59d56062c855987e2561753e85a69)
2007-05-17 10:45:31 +10:00
Andrew Tridgell
c105f6d789
- merge from ronnie
...
- fixed a memory leak found by dmitry
(This used to be ctdb commit ae87bf0005666b50850161c3843d6bc7cb5c8971)
2007-05-16 18:10:26 +10:00
Ronnie Sahlberg
a4ebb6d5ef
if a caller specifies a timeout when calling a control, it makes no
...
sense to have the daemon requeue the packets if they timeout or fail to
deliver to the remote node
(This used to be ctdb commit 9fb753046787190970654aeb937e96685ac53184)
2007-05-16 12:34:30 +10:00
Ronnie Sahlberg
4b8ddfccad
merge from tridge
...
(This used to be ctdb commit 8d424b41d6cf2973b28a749d1b8e6a028dad9ffe)
2007-05-16 11:12:28 +10:00
Andrew Tridgell
a5198559c9
moved the recovery daemon into the main ctdbd and enable it by default
...
(This used to be ctdb commit 2a7d42124731f43d013cb76a798525eab4cc1ee0)
2007-05-15 15:13:36 +10:00
Andrew Tridgell
527b2352ac
fixed two more places where we don't correctly handle write errors on sockets
...
(This used to be ctdb commit f4a71bb63e7f75d21b66f9eaeac997c2029cd146)
2007-05-15 14:08:58 +10:00
Ronnie Sahlberg
0d71b6d1e6
merge from tridge
...
(This used to be ctdb commit 0697f59a044deeab126a39bff97bcd5c1101298e)
2007-05-15 10:28:41 +10:00
Andrew Tridgell
c6afe22b92
added a control to get the local vnn
...
(This used to be ctdb commit 0b109f574b710f290372512d0694290ea7cd4368)
2007-05-15 10:17:16 +10:00
Andrew Tridgell
38491de84f
check for error on ctdb_ltdb_store
...
(This used to be ctdb commit c4a34bac4ad4d2f9699e08074668d25586e3c0da)
2007-05-15 10:16:59 +10:00
Andrew Tridgell
cf1056df94
added a -i switch to run ctdbd without forking
...
(This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4)
2007-05-15 09:44:33 +10:00
Andrew Tridgell
db4fa58b07
reading on the write side of a pipe isn't allowed - this caused us to run without locking in the lockwait code
...
(This used to be ctdb commit 2ac67ce737f30258915cc25bde531d361092ae14)
2007-05-15 09:44:03 +10:00
Ronnie Sahlberg
5d0e7e354f
merge from tridge
...
(This used to be ctdb commit d1dae4fc8f4c2d16d313a27968d67c5825a133d1)
2007-05-14 14:07:19 +10:00