Ronnie Sahlberg
906d3435ee
add instructions on how to set up HA-FTP using vsftpd and ctdb
...
(This used to be ctdb commit 908ab29272ff88dde5a3964c704b97c329f2a75c)
2007-06-09 13:36:33 +10:00
Ronnie Sahlberg
e59def72e5
add links to how to configure samba/nfs in the samba/nfs sections
...
(This used to be ctdb commit 6db9f913d9ebce3f5d9b471528376ac26e223b2c)
2007-06-09 13:24:42 +10:00
Ronnie Sahlberg
2c5d3b0456
fix typo
...
(This used to be ctdb commit de2413fa7d1d544cf76a05a65ce7b68977c845c6)
2007-06-09 13:17:24 +10:00
Ronnie Sahlberg
1789fecf0c
add the generated manpage for ctdb so that it is available also for
...
people that do not intend to rebuild the docs
(This used to be ctdb commit 867caaa712dab6402fdaf94d4ac1b154b0f0842c)
2007-06-09 10:40:35 +10:00
Ronnie Sahlberg
365019ab80
add a page on how to configure clustered nfs
...
(This used to be ctdb commit c7a76f94d965b902f3378699adcf14cacd40e3c5)
2007-06-09 09:44:05 +10:00
Ronnie Sahlberg
8279eab9d8
add a page for configuring samba
...
(This used to be ctdb commit d615d09e174af0b680ad9039f29169dbb07b6f9e)
2007-06-09 09:31:56 +10:00
Ronnie Sahlberg
f19fed3bfa
put the text in "generation" inside a para block
...
(This used to be ctdb commit 5261d3203e8471e370bf59a89e1dc1c792254f75)
2007-06-09 09:23:52 +10:00
Ronnie Sahlberg
cd68b9aa8d
add an initial manpage for the ctdb tool
...
(This used to be ctdb commit 3d6fe2aae4584f53a2fcc11a3f3bc7ead4e9c3f3)
2007-06-09 08:41:36 +10:00
Ronnie Sahlberg
6613396ad5
update the blurb for the setmonmode control it takes 0 or 1 as a
...
parameter depending on whether one wants to disabel or enable monitoring
(This used to be ctdb commit 849a1cce6cc3e145925dd4a8a38b2698be0ce8d5)
2007-06-09 07:54:37 +10:00
Andrew Tridgell
4eda38ed21
more web tweaks
...
(This used to be ctdb commit d98a504c4194d3428a00a09ced493797222cc3be)
2007-06-08 14:40:28 +10:00
Andrew Tridgell
cd913ba129
balance the layout
...
(This used to be ctdb commit dba5d9cdfa05076fddc10ee045d3c6ce5522ff4f)
2007-06-08 14:37:11 +10:00
Andrew Tridgell
b925e9e668
convert rest of pages to new format
...
(This used to be ctdb commit c111fb4fdfbc657bc715337003febbd0313870d1)
2007-06-08 14:35:26 +10:00
Andrew Tridgell
d523c48d6d
crop logo
...
(This used to be ctdb commit 844a768c30530261b0b7ddb7e55d8ae029c566b1)
2007-06-08 14:28:31 +10:00
Andrew Tridgell
f82961f777
added logo
...
(This used to be ctdb commit b7a1e9f996a7fcb148dc003a082214984bd339bf)
2007-06-08 14:25:44 +10:00
Andrew Tridgell
044f1adf6d
web tweaks
...
(This used to be ctdb commit 2e77cf049cfe9b1b4db994f87dbc85aa08136d8f)
2007-06-08 14:09:50 +10:00
Andrew Tridgell
bf1df8e2fa
web tweaks
...
(This used to be ctdb commit 68f037a8deeb5302e9d5c0d9ab7e08cc69026ce5)
2007-06-08 14:08:57 +10:00
Andrew Tridgell
8e64c1085d
new web page layout
...
(This used to be ctdb commit 2d863a9152fae3823dcf08268b9f10dc815e198b)
2007-06-08 13:58:53 +10:00
Ronnie Sahlberg
ca88e271c0
add a page for starting and (basic) testing of ctdb based on the
...
wiki
(This used to be ctdb commit e7cdb57ac8ba454421748455c67ddc238f7c0a5d)
2007-06-08 12:57:30 +10:00
Ronnie Sahlberg
ea29fb20ff
remove CTDB_MANAGES_SAMBA from the config page. this should be in
...
the samba-config page instead
(This used to be ctdb commit a52842e52aeaae32daa2fed24711f4684dc679ec)
2007-06-08 12:28:17 +10:00
Ronnie Sahlberg
3c66405164
update the names of envvars to use the CTDB_ prefix
...
(This used to be ctdb commit 79365f5a83fc355bb4e3068600231a4bffe5cf7a)
2007-06-08 12:22:33 +10:00
Ronnie Sahlberg
ecc21cb84c
add a page on how to configure CTDB based on the wiki
...
(This used to be ctdb commit 7c17ded610948893d04fc4b915a21303d202bae4)
2007-06-08 12:17:01 +10:00
Ronnie Sahlberg
5471c6ebf6
add a page (based on the wiki) on how to build samba3 and ctdb
...
(This used to be ctdb commit 4abbf1b75f3c3d7b5c220078d5adb5b4fec43db1)
2007-06-08 11:08:39 +10:00
Ronnie Sahlberg
c0a34e287c
add a webpage for hot to get the code. based on the wikipage
...
(This used to be ctdb commit fbf93efd154112957a17f16351858dc370486b35)
2007-06-08 10:59:58 +10:00
Andrew Tridgell
08ac2fafe1
remove the test commit
...
(This used to be ctdb commit 2a29694e62fab4a57dafcfbe825bdec709a7ef89)
2007-06-07 22:40:56 +10:00
Andrew Tridgell
42d4a4a241
update configure.ac for new code layout
...
(This used to be ctdb commit fe819a241f3bf841e72d5df9cbaba6fc306d5abb)
2007-06-07 22:35:14 +10:00
Andrew Tridgell
14c788f3cb
move more util code to lib/util
...
(This used to be ctdb commit de5ab0584c978a6be4afeacd80c84015b206a3c6)
2007-06-07 22:30:29 +10:00
Andrew Tridgell
06a71762a4
some #include cleanups
...
(This used to be ctdb commit 1a07d87122d51a40cd8ad5fe13533298c26857cb)
2007-06-07 22:26:27 +10:00
Andrew Tridgell
b50096c835
more code rearrangement
...
(This used to be ctdb commit 2bcf3b16163041f03add2e5bf9f1f5fb3599ec24)
2007-06-07 22:16:48 +10:00
Andrew Tridgell
ae3d54094b
start splitting the code into separate client and server pieces
...
(This used to be ctdb commit 603cd77988c181525946cd5eb0f4d0d646b58059)
2007-06-07 22:06:19 +10:00
Andrew Tridgell
3d75c9a51d
later times are a lower priority, not a higher priority
...
(This used to be ctdb commit e96424e7d366df29767c4eeaccdcc0cc975cb8ae)
2007-06-07 19:21:55 +10:00
Andrew Tridgell
dbb803e6af
choose the most connected node first
...
(This used to be ctdb commit c7c17a79fa4f28509e34b6f635fa62517dc458c2)
2007-06-07 19:17:27 +10:00
Andrew Tridgell
df6439d796
formatting fixes
...
(This used to be ctdb commit ed63a2057698aed3931762605b2ea2368681af2b)
2007-06-07 18:39:37 +10:00
Andrew Tridgell
d774192737
use a priority time for the election data, not just the vnn
...
(This used to be ctdb commit a691f9c5cd77194005f0d98483da94b07a48d57d)
2007-06-07 18:37:27 +10:00
Andrew Tridgell
c42ddcda23
validate vnn on node flags change
...
(This used to be ctdb commit 5628ebbcc2aa61b63c761783c70fe4d8a0070607)
2007-06-07 18:13:14 +10:00
Andrew Tridgell
96861466b7
there are now far too many controls for the controls statistics fields to be useful
...
(This used to be ctdb commit f5e188fc7e13b55b6b4081dcc74ea9614a76f9bb)
2007-06-07 18:07:38 +10:00
Andrew Tridgell
3e4d7bef23
get all the tunables at once in recovery daemon
...
(This used to be ctdb commit 8e60be6c22aab145e68b16ede5f32f4430c2af93)
2007-06-07 18:05:25 +10:00
Andrew Tridgell
cb4c33cc68
handle CTDB_CURRENT_NODE in ban commands
...
(This used to be ctdb commit fefb53f1d22c5458a1e107f8352818aee87983de)
2007-06-07 16:48:31 +10:00
Andrew Tridgell
23bf62fe30
added admin commands to ban/unban nodes
...
(This used to be ctdb commit 4dad04172e7e4955b5bf6444a85b19901c9683ad)
2007-06-07 16:34:33 +10:00
Andrew Tridgell
59e74ac6eb
increase rpm release number
...
(This used to be ctdb commit e8d321186b383e60782694a980b0d339b9df88fa)
2007-06-07 15:25:36 +10:00
Andrew Tridgell
2ed57a9ae1
implement a scheme where nodes are banned if they continuously caused the cluster
...
to start a recovery session. The node is banned from the cluster for the RecoveryBanPeriod (default of 5 minutes)
(This used to be ctdb commit 4ad43dd07f526b6002477177fbf55483246c2c0c)
2007-06-07 15:18:55 +10:00
Andrew Tridgell
9754d16d48
merged admin enable/disable change from ronnie
...
(This used to be ctdb commit df17b69dfd83a98f9c711994c7dd51ad2cc0ab8a)
2007-06-07 11:15:22 +10:00
Ronnie Sahlberg
e0fbd04697
distribute the takenover nodes more evenly among the surviving nodes
...
(This used to be ctdb commit 25d18b8bab399cc5d9def081086925896f8de3e9)
2007-06-07 10:19:24 +10:00
Ronnie Sahlberg
d93c6f8db2
show the disabled/permanently disabled status in the machinereadble
...
output for 'ctdb status'
(This used to be ctdb commit a9e920a492e1e91d205ee8b9cd704a7cf85c1e01)
2007-06-07 09:27:51 +10:00
Ronnie Sahlberg
9ff733c784
add a control to permanently enable/disable a node
...
(This used to be ctdb commit d66fdba16ca22f62ddac6882a17614879b08a798)
2007-06-07 09:16:17 +10:00
Andrew Tridgell
341d715f1a
formatting fix for wider variable names
...
(This used to be ctdb commit 195bde145f62221a7bb1b12014ada98ad5df4e9e)
2007-06-06 22:17:46 +10:00
Andrew Tridgell
8fbca613d4
get parents idea of recmode and recmaster when deciding if we should do a takeover run
...
(This used to be ctdb commit 0e8124acd2f1a9b34292c1ee13c7e4cd6fe49876)
2007-06-06 21:56:54 +10:00
Andrew Tridgell
4a7f116746
update flags in parent daemon too
...
(This used to be ctdb commit 8995246d95e670753ab8c61d724d284cac2b414d)
2007-06-06 21:34:36 +10:00
Andrew Tridgell
ae56096b0b
ensure all nodes display disabled nodes correctly
...
(This used to be ctdb commit 959f82cfe926994658f5826007caccb0409003e1)
2007-06-06 21:27:09 +10:00
Andrew Tridgell
d1c225a0b9
disable a node if testparm thinks there is a error, or warning, or an unrecognised option
...
(This used to be ctdb commit ded80c83002a267996b4616e3702988b821cd422)
2007-06-06 19:46:25 +10:00
Andrew Tridgell
22aa4ffed2
handle the case of all nodes being sick for one service
...
(This used to be ctdb commit eca2590e60b4522b13c34e2c6cb8013aa55be323)
2007-06-06 16:49:59 +10:00
Andrew Tridgell
81fad8636f
added timeouts in all event scripts
...
(This used to be ctdb commit d986c91a607ed7c7d4869ea786b5cdf80e7862f1)
2007-06-06 13:45:12 +10:00
Andrew Tridgell
0edb079ea4
increase release number of ctdb
...
(This used to be ctdb commit 630a3cf9a05a315caafccafe539ba6675cdfb4ac)
2007-06-06 12:10:42 +10:00
Andrew Tridgell
76b7361c7e
- added monitoring of rpc ports for nfs, and of Samba ports and directories
...
- added monitoring of the ethernet link state
When monitoring detects an error, the node loses its public IP address
(This used to be ctdb commit 0af57aead8c983511d25774b4ffe09fa5ff26501)
2007-06-06 12:08:42 +10:00
Andrew Tridgell
b4f764c269
fixed error handling in event scripts
...
(This used to be ctdb commit d645c8b0480e7d2765614a226d78510e100016de)
2007-06-06 11:27:06 +10:00
Andrew Tridgell
cafddf76dc
- fixed flags display in logs
...
- added monitor handler to test event script
(This used to be ctdb commit a4c18dddee169df49e5d77d9a94ce9329f169319)
2007-06-06 11:13:24 +10:00
Andrew Tridgell
eaf701fbda
send the right sort of message on monitoring failure
...
(This used to be ctdb commit 9db537d9b11d48a36346db721ed8936ff5ecacb2)
2007-06-06 11:12:45 +10:00
Andrew Tridgell
d5449fa1fe
fixed exit code in makerpms.sh
...
(This used to be ctdb commit e308b75f6e9fae69441531f462351fe09e792cb9)
2007-06-06 11:12:14 +10:00
Andrew Tridgell
e16717eb01
added 40.vsftpd to Makefile.in
...
(This used to be ctdb commit 90260d0d362c6e2b58755724056fbe5e63bd29c9)
2007-06-06 10:41:13 +10:00
Andrew Tridgell
b130540102
merged vsftpd event script from ronnie
...
(This used to be ctdb commit c0b686c43524c6a93c52d85b0079ed820983133e)
2007-06-06 10:29:27 +10:00
Andrew Tridgell
af8834dd02
added health monitoring logic to ctdb, so a node loses its public IP address if one of the sybsystem event scripts reports a problem
...
(This used to be ctdb commit c7a089256d86cec21097453bce5acbccee87413f)
2007-06-06 10:25:46 +10:00
Ronnie Sahlberg
83e1d488eb
ctdb is only a ha solution when combined with a cluster filesystem
...
(This used to be ctdb commit cea43b5530da5adb06146f3ffa18eac113e838cc)
2007-06-05 18:42:49 +10:00
Ronnie Sahlberg
ab55af7ae2
add some text about CTDB and in which scenarios it would be a good
...
candidate to solve the users needs for a HA solution
(This used to be ctdb commit 77d58ba6fc6fa34678aa0201311139201d4679c8)
2007-06-05 18:40:43 +10:00
Ronnie Sahlberg
91a97fea03
provide machinereadable output for ctdb ip
...
(This used to be ctdb commit 86348de0bfdc4f91ff6f5a8eeff06044d512ee43)
2007-06-05 18:32:06 +10:00
Ronnie Sahlberg
317dec2f9e
merge from tridge
...
(This used to be ctdb commit 5f1f889e0e124c5275463795c004ae971945e1ae)
2007-06-05 18:16:45 +10:00
Ronnie Sahlberg
96a12cc4ab
add a simple events script to manage vsftpd
...
(This used to be ctdb commit 413efc7af529e4ebda6f7ea6e36f79ba72a2d1d9)
2007-06-05 18:14:01 +10:00
Andrew Tridgell
be3a00bd73
clean out some more cruft
...
(This used to be ctdb commit ad16c5fe2748b48a6f6c79976359d56d9bed33f4)
2007-06-05 17:57:07 +10:00
Andrew Tridgell
ac55bc4166
first step in health monitoring of cluster nodes. When not healthy they will be marked disabled
...
(This used to be ctdb commit d3dbd9fc4db21632075b56fc52cf95435c63374a)
2007-06-05 17:43:19 +10:00
Andrew Tridgell
ee747b5bd6
set close on exec on pipe in event scripts, so long running scripts don't hold the pipe
...
(This used to be ctdb commit 22662614b4091a4e4282e63d6876097cbf3e3d6e)
2007-06-05 15:18:37 +10:00
Andrew Tridgell
a3048a8942
more unused code
...
(This used to be ctdb commit b01f226949965942c1d64ff3b4ecc0b835d4fecc)
2007-06-05 15:17:53 +10:00
Andrew Tridgell
efcacd76b7
remove an unused function
...
(This used to be ctdb commit 9a36d0e0c110c66fe72dce530318b9bc0ac1ce0b)
2007-06-05 15:17:24 +10:00
Ronnie Sahlberg
32d19d3791
dont use CTDB_MANAGES_NFS for controlling the lockmanager
...
use a dedicated variable CTDB_MANAGES_NFSLOCK since some might want to
use nfs but no lockmanager
(This used to be ctdb commit 1e8cec86617ffb188bd49c70f074a4b350d3fe3d)
2007-06-05 12:43:35 +10:00
Andrew Tridgell
e95215850b
use the right IP from the passed structure in takeip/releaseip calls
...
(This used to be ctdb commit 3db5ba681b11d9ce6474938f221fe68d55a6e1ad)
2007-06-04 23:54:56 +10:00
Andrew Tridgell
7498d3c55d
explain event types
...
(This used to be ctdb commit 551472b78b025d9446ee58420dcec70c600555d0)
2007-06-04 23:54:46 +10:00
Andrew Tridgell
bd81cc521d
ignore commented out entries in /etc/exports
...
(This used to be ctdb commit d316b49ba46e819359f045adfd87da92860fd1b5)
2007-06-04 23:54:22 +10:00
Andrew Tridgell
49a0b4ebdc
use the right IP from the passed structure in takeip/releaseip calls
...
(This used to be ctdb commit 92ab9cbf6ef00de2987c16b145e17ac09dd99ea3)
2007-06-04 23:52:12 +10:00
Andrew Tridgell
0c939990ca
don't crash doing ctdb ip when not doing takeover
...
(This used to be ctdb commit f341a97c13a77711a2daaa9a223123cc4c81e75e)
2007-06-04 22:28:52 +10:00
Andrew Tridgell
ee546dec81
merge from ronnie
...
(This used to be ctdb commit 531d7ea7aca3116e78a4502a1c8b75a3fb764a4f)
2007-06-04 22:13:59 +10:00
Ronnie Sahlberg
4be9a44ba7
add a control that lists all public ip addresses and which node that
...
currently serves it
(This used to be ctdb commit db9b89dc423b31079e5502323e5fd2bbaf82e1e9)
2007-06-04 21:11:51 +10:00
Andrew Tridgell
39ced972ae
make recovery daemon values tunable
...
(This used to be ctdb commit ec29dbf2f5110428df8b97801443ba7addf61353)
2007-06-04 20:22:44 +10:00
Ronnie Sahlberg
1ee8989bd4
merge from tridge
...
(This used to be ctdb commit 3bfede5d46dba5a3654dad9205534391bc339461)
2007-06-04 20:10:53 +10:00
Ronnie Sahlberg
79b54a624e
change the takoverip/releaseip controls to pass a structure containing
...
both the nodenumber and the id of the node that has taken over that
address in addition to the public address itself so that all nodes
can learn which node is currently hosting each of the public addresses
(This used to be ctdb commit 53e9ff790387b85a36fa9c3c44cd4c95cbdf35da)
2007-06-04 20:07:37 +10:00
Andrew Tridgell
fcce534f23
allow setting of variables at startup in config file
...
(This used to be ctdb commit db39ca7c0ee1441113ac3279cb75b3cb38eecd1b)
2007-06-04 20:05:31 +10:00
Andrew Tridgell
dbb2ec43dd
added tunables settable using ctdb command line tool
...
(This used to be ctdb commit 73d440f8cb19373cfad7a2f0f0ca4f963c57ff29)
2007-06-04 19:53:19 +10:00
Andrew Tridgell
f1d81386e6
- start moving tunable variables into their own structure
...
- fixed the test scripts to use a separate dbdir
(This used to be ctdb commit 396752e8908c48373564e915e2d49cfc9ff61eba)
2007-06-04 17:46:37 +10:00
Andrew Tridgell
a57991c0eb
remove some cruft thats not needed any more
...
(This used to be ctdb commit c4308805b997740b77e058c1a14b84cb400a7c30)
2007-06-04 17:23:55 +10:00
Andrew Tridgell
62b30e478d
make sure we don't have any namespace collision problems with config variables
...
(This used to be ctdb commit dde9024b25fe12cf25c059e5accb3ca21838b130)
2007-06-04 15:44:52 +10:00
Ronnie Sahlberg
464ed12991
merge from tridge
...
(This used to be ctdb commit 948b449748a126386f49ef9e763cfffd8b651516)
2007-06-04 15:44:13 +10:00
Andrew Tridgell
cc9f6d30d8
split out the basic interface handling, and run event scripts in a deterministic order
...
(This used to be ctdb commit 399e993a4a233a5953e1e7264141e5c7c8c8c711)
2007-06-04 15:09:03 +10:00
Andrew Tridgell
73e626bc6b
automatically bring up interfaces that we manage. This allows ctdb to work without requiring two IPs per public interface
...
(This used to be ctdb commit 221850dcf9c28698eb3a1baf33cbf7f9137ac502)
2007-06-04 14:16:51 +10:00
Ronnie Sahlberg
8a53a6aa29
show the second column in the machinereadable output for ctdb status as
...
IP
(This used to be ctdb commit 9ee38e8cfc4b602f6769549a83a1302138e055a1)
2007-06-04 13:31:58 +10:00
Ronnie Sahlberg
a3e4e204dc
add the ip address to the nodemap structure we pull from a server and
...
display the physical address of a node when we do a ctdb status
(This used to be ctdb commit 660bf30db713f0680acd3f74275ad603b35a0c24)
2007-06-04 13:26:07 +10:00
Andrew Tridgell
837fb236b9
handle NETWORKING var not existing
...
(This used to be ctdb commit f8cf9f81e8f81818dc141eda5419c2749a0652a4)
2007-06-03 22:11:48 +10:00
Andrew Tridgell
e763874872
make the init scripts more portable about location of system config files
...
(This used to be ctdb commit 65f3e2bc722e314b2c51c3bfdc544b408a8a64cf)
2007-06-03 22:07:07 +10:00
Ronnie Sahlberg
5dde7e27e0
add a -Y option to generate machine readable output.
...
print 'ctdb status' in machinereadable form as
:VNN:0|1:
(This used to be ctdb commit 1aa6a632ec59d854fc5579fedad0d66b1b46ae8c)
2007-06-03 19:50:51 +10:00
Ronnie Sahlberg
dac3f7d23c
ubuntu uses a different style of init scripts than redhat and suse
...
(This used to be ctdb commit 6d3bee5d1a7dd6718045c673cfd150e3207ea970)
2007-06-03 19:24:52 +10:00
Ronnie Sahlberg
8175804757
print an error message to stdout if we failed to open the logfile for
...
the daemon
(This used to be ctdb commit fca953b1a3f3d6bf18264ecda1c75c68b60e2008)
2007-06-03 18:59:27 +10:00
Andrew Tridgell
b9973e1d3e
more portability tweaks in the init script
...
(This used to be ctdb commit 83a1c79e95af93a9ccfe78556ac5692c0315a3e4)
2007-06-03 17:53:26 +10:00
Andrew Tridgell
bc6ec7d0cf
fixed location of init.d directory to work on SLES and RHEL
...
(This used to be ctdb commit 962fa046c9ddde509690408030896d3335609341)
2007-06-03 17:39:11 +10:00
Andrew Tridgell
2fea5e7b0e
doc updates
...
(This used to be ctdb commit 1ccb6eca3f4846c4d77d48a7a13f2d5913f8cf80)
2007-06-03 16:33:08 +10:00
Andrew Tridgell
2ea6a86cab
web page tidy ups
...
(This used to be ctdb commit 9a9a0b356329905e08e843a99fd1d82a709e8986)
2007-06-03 15:56:23 +10:00
Andrew Tridgell
996d97b145
merge from ronnie
...
(This used to be ctdb commit 3d3b2eac028dc7d54e7ef0987a5f4063c4dcbe5a)
2007-06-03 14:41:28 +10:00
Andrew Tridgell
b4542aa00a
don't start nfs services unless the relevant directories are available
...
(This used to be ctdb commit e0468d61119b6581f5ec458641568d03714a5786)
2007-06-03 14:39:27 +10:00
Andrew Tridgell
6371ee188d
test commit
...
(This used to be ctdb commit 757affaa3430723136b3f2520d3893d79c6c1bbb)
2007-06-03 13:21:07 +10:00
Ronnie Sahlberg
74c6882c36
mention that ctdb offers cross cluster messaging to applications
...
(This used to be ctdb commit 7e51fdd07760bba894578d8e07873894340eb981)
2007-06-03 11:48:13 +10:00
Ronnie Sahlberg
7cb974e2c1
merged with tridge
...
update to the web page for ctdb
(This used to be ctdb commit 1ab2c05da217d80fe782bd3c8fcd9549d688dc7d)
2007-06-03 11:40:50 +10:00
Andrew Tridgell
ee3ce951ce
do a full restart in init cron call
...
(This used to be ctdb commit ed181dce8f307bd8f36de42351d04f39b2396836)
2007-06-03 10:29:57 +10:00
Andrew Tridgell
518d410075
fixed a race condition in the handling of the recovery lock
...
(This used to be ctdb commit 3b98c5ad23662259b0eed399ab0c8037cf9b2b0b)
2007-06-03 10:29:14 +10:00
Ronnie Sahlberg
61136d0da0
when we get a dmaster error, show the database id in the log so we can
...
track which db it happens for
(This used to be ctdb commit 3aedb67f4adf769b605b43f9d39d34ef17b0085d)
2007-06-03 09:58:51 +10:00
Andrew Tridgell
a795986baa
docs on how to use statd-callout
...
(This used to be ctdb commit 4a75111b4f3f93dc42c9ced2d23f3cc933712017)
2007-06-02 19:45:06 +10:00
Andrew Tridgell
794d6dd59d
move config files to config/ directory
...
(This used to be ctdb commit f95de519b885c8e1f40df0cda70fd796e479a22a)
2007-06-02 19:40:07 +10:00
Andrew Tridgell
90158f7ad7
removed some old cruft
...
(This used to be ctdb commit 44040d5ecabcd32ee26c0285e3786dfb9b4f8126)
2007-06-02 19:23:48 +10:00
Andrew Tridgell
7166c51204
ctdb_test.c is gone
...
(This used to be ctdb commit b93b5f5d28a4f2065caa38f65d6b71868cd0e9e4)
2007-06-02 18:57:21 +10:00
Andrew Tridgell
2f5af51c53
add an easy way to setup ctdb to start/stop samba
...
(This used to be ctdb commit b0d9f427d83aff5b9a5c54b7b7c9d45d418e2352)
2007-06-02 18:51:05 +10:00
Ronnie Sahlberg
394ee9aff7
fix broken link to the CTDB setup page
...
(This used to be ctdb commit cf762c18aac8b9a137377d9ee9022c6d9a57452d)
2007-06-02 16:50:58 +10:00
Ronnie Sahlberg
ebe34b4353
update the evens scripts for nfs and nfslock to honour CTDB_MANAGES_NFS
...
which is set in /etc/sysconfig/nfs
(This used to be ctdb commit bf475269231a6129f88b37f4da69e06efcf4ed77)
2007-06-02 16:44:15 +10:00
Ronnie Sahlberg
5dc243ff93
STATD_SHARED_DIRECTORY should be define din the nfs sysconfig file and
...
not the ctdb sysconfig file since this variable has nothing at all to do
with ctdb
(This used to be ctdb commit d17073b7da5ecba1b93a5ed4fbdf86bf052fdc90)
2007-06-02 16:33:17 +10:00
Andrew Tridgell
c5e4ce360a
make test now works again
...
(This used to be ctdb commit 439d87bbb9840f82937e51aff4fe2b80160878c6)
2007-06-02 13:31:36 +10:00
Andrew Tridgell
68963d865a
first step towards fixing "make test" with the new daemon system
...
(This used to be ctdb commit f95f7e4c93dea482e6cf0614b5415229a7c9f3fb)
2007-06-02 13:16:11 +10:00
Andrew Tridgell
ebf12646cf
- make specification of a recovery lock file compulsory
...
- die if someone other than the recmaster can get the recovery lock
(This used to be ctdb commit a827d0d0e430ca8ad5d521367e45097185492869)
2007-06-02 11:36:42 +10:00
Andrew Tridgell
4f72a202d9
- moved cmdline options that are only relevant to ctdbd into ctdbd.c
...
- fixed a valgrind error on failing to send a control
- don't mark node dead when already disconnected
- moved node list lock code into common code
(This used to be ctdb commit bcc0432d0fea7ef223f82ccee81cf35c18144b1b)
2007-06-02 10:03:28 +10:00
Andrew Tridgell
69edd8e617
another place we need to cope with the strange epoll fork semantics
...
(This used to be ctdb commit cb1eeec15e61a7eb401e9e65306642343d5deb6e)
2007-06-02 10:01:17 +10:00
Andrew Tridgell
27b0e323e6
disable realtime scheduler in event scripts
...
(This used to be ctdb commit 56225ac6fdfe754289bc7d5e0fc8d21c81a7aa8e)
2007-06-02 08:46:49 +10:00
Andrew Tridgell
5e5701a7b8
- make calling of recovered event script async
...
- shutdown sockets before calling shutdown script
(This used to be ctdb commit c5e099feef94a014a77742b6cc1d0afe78ef9da9)
2007-06-02 08:41:19 +10:00
Andrew Tridgell
21f6fa5359
put nfs events in spec and Makefile.in
...
(This used to be ctdb commit 07f03c61c621d8428ecdb56ee08ba895a57ed9c0)
2007-06-02 00:11:53 +10:00
Andrew Tridgell
3a0395dffd
added nfs event script
...
(This used to be ctdb commit a708a635a1be355d2e8d382166f58f65f669c8ea)
2007-06-02 00:10:22 +10:00
Andrew Tridgell
18d8559cee
don't strictly need netcat
...
(This used to be ctdb commit 992eed38fc6c474c99ce7242e7b7b35bc1e4bd7b)
2007-06-01 23:33:01 +10:00
Andrew Tridgell
dff9a6ecd1
make the packaging much more portable - tested on SLES9 and RHEL4
...
(This used to be ctdb commit 9521e3eee42b11303a2d6e0f5c05d0c0de4292d8)
2007-06-01 23:25:33 +10:00
Andrew Tridgell
1fa2600c8b
- make symlink relative in install
...
- include ctdb functions in samba and nfslock event scripts
(This used to be ctdb commit 08e2278069346b1fc49774603aa26c68222cf67f)
2007-06-01 21:20:05 +10:00
Andrew Tridgell
02bfa10daa
tidy up the install somewhat
...
(This used to be ctdb commit 07e202ae8451ffa67c40973625996224cfd4e158)
2007-06-01 21:10:49 +10:00
Andrew Tridgell
b5890ad2c1
split out events for each subsystem separately
...
(This used to be ctdb commit 03c629a72f234dcc783fa1085e7edba09597c241)
2007-06-01 20:54:26 +10:00
Andrew Tridgell
559a8bd278
use a subdirectory for ctdb state files
...
(This used to be ctdb commit 71ebf272be42e313715f0f100be9f5567127eb73)
2007-06-01 19:16:58 +10:00
Andrew Tridgell
7db1d04d5c
make the running of the takeover and release event scripts async, to prevent outages due to slow scripts
...
(This used to be ctdb commit 4189be97eee7ab2a50335c860f2fcd9566667d01)
2007-06-01 19:05:41 +10:00
Andrew Tridgell
869d70d9c9
convert ctdbd.sh tests to use an event script
...
(This used to be ctdb commit b3eff804fe59238ed37cd692cc04785089a89dc3)
2007-06-01 19:04:51 +10:00
Andrew Tridgell
165bdba7ec
better download instructions
...
(This used to be ctdb commit f50298262e3cc2896d7d098efa6601c915d3feca)
2007-06-01 17:16:19 +10:00
Andrew Tridgell
e015e5d769
added package download
...
(This used to be ctdb commit fa0d8efe3298a38c52e6be1116bd1b4c8c959f76)
2007-06-01 16:07:13 +10:00
Andrew Tridgell
680a3ad547
merge initial web site from ronnie
...
(This used to be ctdb commit ee0b29b0625a1202e4af5a2ab3b19ca0b48e8e99)
2007-06-01 15:33:45 +10:00
Ronnie Sahlberg
0b70f9500e
add a developers section
...
(This used to be ctdb commit af90c7b3b80c808ad3047175cd09031c0282538d)
2007-06-01 15:31:23 +10:00
Ronnie Sahlberg
52291b18a5
initial webpage
...
(This used to be ctdb commit b24b48fe6e9da38036452e82a1b3d6311ebaa6f4)
2007-06-01 15:23:57 +10:00
Andrew Tridgell
f5171454b3
log dates/time in event startup messages
...
(This used to be ctdb commit 60a2f704f2e0544035778d00e91041e09351ed8f)
2007-06-01 15:23:16 +10:00
Andrew Tridgell
95ed6f8725
added CTDB_WAIT_DIRECTORIES support
...
(This used to be ctdb commit fa888e8b1715d7460f5718d3e1fe17e4caaa15c3)
2007-06-01 13:50:18 +10:00
Ronnie Sahlberg
86d0fc8e4f
it is -f not -x to check if a file exists
...
(This used to be ctdb commit 52457d5e811f91c051ce0fa32739667a1d21862a)
2007-06-01 13:26:14 +10:00
Ronnie Sahlberg
425b3c56c6
- create /etc/ctdb/taken_ips and /etc/ctdb/changed_ips analog to the
...
existing /etc/ctdb/released_ips
- only call the statd-callout script if the ips have changed and call
it with a "notify" argument. we need to restart nfslock service in
both cases
- change statd-callout to explicitely restart the lock manager and statd
when "notify" is called. copy the state directory for each held ip
from shared storage to /tmp then use sm-notify to send notifications to
all monitored clients
(This used to be ctdb commit 800f15a27af885a3f83430d3bc411cc72ac40e86)
2007-06-01 13:14:05 +10:00
Andrew Tridgell
bf3b740a1b
ctdb is GPL not LGPL
...
(This used to be ctdb commit 8624378010d1c2a1438e1e701339dfba7276f960)
2007-05-31 13:50:53 +10:00
Andrew Tridgell
7ccb338862
if there is no node available to take an IP, don't consider that an error
...
(This used to be ctdb commit fee6561626c7070418bc8f4d610a728329ac4bcc)
2007-05-31 11:22:43 +10:00
Andrew Tridgell
d86298248f
better location for statd-callout
...
(This used to be ctdb commit cc208c447b732aeeaefd6a889711d3cd83ea128e)
2007-05-31 11:14:07 +10:00
Andrew Tridgell
c6d4478fda
added hooks to make nfs statd behave correctly on failover
...
(This used to be ctdb commit a1ee84fc47892b6c18d417ccf714211fcb07952e)
2007-05-31 11:09:45 +10:00
Andrew Tridgell
cc87648933
merge lib/replace from samba4
...
(This used to be ctdb commit d1a0bcbe331b05ad7a3f5f7a33e4c9b5242c43f2)
2007-05-30 18:06:57 +10:00
Andrew Tridgell
89ed660067
tell newly connected nodes about any tcp tickle records that we have that they don't have
...
(This used to be ctdb commit 71dd59c1662d429d260c25090e0ca26db48d4075)
2007-05-30 16:37:39 +10:00
Andrew Tridgell
d510ce3281
use our own netmask when deciding if we should takeover a IP, not the other nodes
...
- check if ctdb dies while waiting for the startup event
(This used to be ctdb commit 8b59f73c527a6d0a8abe8030dc3cbbc4329657be)
2007-05-30 16:11:39 +10:00
Andrew Tridgell
1e72af9c51
close sockets when we exec scripts
...
(This used to be ctdb commit 0fac2164db4279db2d7d376a34be05b890304087)
2007-05-30 15:43:25 +10:00
Andrew Tridgell
c833b06a35
we need to listen at transport initialise stage to find our own node number
...
(This used to be ctdb commit 4a9455dfbe95e53884b46ad26dba0c33e3432ba9)
2007-05-30 14:46:14 +10:00
Andrew Tridgell
3c062bb5ae
- use a CTDB_BROADCAST_ALL for the attach message so it goes to currently disconnected nodes
...
- start node monitoring only after transport starts
- check if a node is already disconnected in the node dead function
(This used to be ctdb commit b81ab6d507797282237768380c6f0e5a4c6519a5)
2007-05-30 14:35:22 +10:00
Andrew Tridgell
8ed48aac51
don't start the transport connecting to the other nodes until after the startup event script has run
...
(This used to be ctdb commit afca3cc74211aa2e18b1f74d36b2add8dffcfdc7)
2007-05-30 13:26:50 +10:00
Andrew Tridgell
3eb96b4553
- nice messages while waiting for tcp services to come up
...
- added more comments to sysconfig file
(This used to be ctdb commit 9cbe7ad147a73cd6594fa7bcee0544fd986ad8c0)
2007-05-30 12:37:03 +10:00
Andrew Tridgell
b382fac817
wait for local tcp services like smbd to come up before allowing ctdb to start talking to other nodes
...
(This used to be ctdb commit 04eea084ebf1710ea66ccb03ac661e3b2f58d96f)
2007-05-30 12:27:58 +10:00
Andrew Tridgell
0a8b9f21d8
fixed onnode symlink install
...
(This used to be ctdb commit fd6a474fb52263512a0496ec8c9dc5443902d45a)
2007-05-30 11:17:52 +10:00
Andrew Tridgell
7cd7081beb
support ctdb status -n all
...
(This used to be ctdb commit 8ff2ea29fc60a1e9854bf0c59c360e29f35d3b69)
2007-05-30 11:12:50 +10:00
Andrew Tridgell
229846cdd2
moved onnode into ctdb from s3 examples/ctdb
...
(This used to be ctdb commit a3fdaebf1a90ff3c2843a592f6c657a8eae42975)
2007-05-30 11:00:43 +10:00
Andrew Tridgell
5747a5a358
auto-restart NFS if its running when we release an IP
...
(This used to be ctdb commit 2e1e1e8e34bf4c15decbbc8f0ca88004a2ed67df)
2007-05-30 10:21:16 +10:00
Andrew Tridgell
71bb8a9e41
samba3 needs ctdb_private.h installed to build
...
(This used to be ctdb commit 3df7a4b3cd08efbf5d33379b8c1c5f5bee14954b)
2007-05-29 22:53:28 +10:00
Andrew Tridgell
9891c6b975
flush any local arp entries for the given ip on add/del
...
(This used to be ctdb commit 814decd66423e955b443f0729ceec581c0d0c0e3)
2007-05-29 19:34:04 +10:00
Andrew Tridgell
3b146e7616
don't block SIGCHLD, or we lose return values from system() !
...
nicer log messages from events script
(This used to be ctdb commit 5ed2b496675a6a47d7ad87519a97bc4f293e6730)
2007-05-29 17:23:29 +10:00
Andrew Tridgell
7de87f8492
fixed system() return handling
...
(This used to be ctdb commit 0fa75d7429f6f6e58f98c3cebccf6014eb3b474e)
2007-05-29 17:00:08 +10:00
Andrew Tridgell
a7a0f99d98
fixed broadcast controls from the command line
...
(This used to be ctdb commit 54464e0b5123265780013a0a46c8b94709d227dc)
2007-05-29 16:34:50 +10:00
Andrew Tridgell
2f7fcecb59
fixed shell syntax in events script
...
(This used to be ctdb commit 629435807e7927a0e1524cd3e4b2aa216a651e2c)
2007-05-29 16:28:18 +10:00
Andrew Tridgell
bf3f0f4b2f
- ignore blank lines at end of lists
...
- rpm tweaks
(This used to be ctdb commit 3506464fa914c5aad10fe22283563d021ca45ca0)
2007-05-29 16:23:47 +10:00
Andrew Tridgell
578b2a585d
- make more options configurable
...
- fixed some warnings
(This used to be ctdb commit e08bb371827b14a80a131ce8e83145cd468e7e1f)
2007-05-29 16:02:02 +10:00
Andrew Tridgell
0f605ccdb2
more build tweaks
...
(This used to be ctdb commit d4c3380c7a9d332311c5d03ba151e7f2f90796a3)
2007-05-29 15:52:03 +10:00
Andrew Tridgell
0e089d11e0
update packaging for new defaults
...
(This used to be ctdb commit 68f167d74fdc3b55c416ed1f7054817b51f69899)
2007-05-29 15:36:42 +10:00
Andrew Tridgell
5a4c3b0b24
default log file to reasonable location
...
(This used to be ctdb commit 36b0a43c5d58d8171c1340603486e64051d696ac)
2007-05-29 15:26:38 +10:00
Andrew Tridgell
873c3a5934
use autoconf for more paths
...
(This used to be ctdb commit b765a391632621dfe3b129b85782e87f586ae2eb)
2007-05-29 15:20:41 +10:00
Andrew Tridgell
2d9e0ad56a
use /etc/services for ctdb
...
(This used to be ctdb commit 64bf6964ff33320c5351337c7f8ed4da5bd71275)
2007-05-29 15:15:00 +10:00
Andrew Tridgell
db5862279b
merge from jim
...
(This used to be ctdb commit a41acd1835af91176a2de1c251a8709dc42b65ac)
2007-05-29 14:46:29 +10:00
Andrew Tridgell
1140d5a20a
fixed more warnings on 64 bit boxes
...
(This used to be ctdb commit 2f6eae476203f8a8b28e083553204c01f224c8a5)
2007-05-29 13:58:41 +10:00
Andrew Tridgell
bc891232b6
fixed some debug messages
...
(This used to be ctdb commit 037f0149c0c0e65af0a1669b9a52586129e4b48f)
2007-05-29 13:48:30 +10:00
Andrew Tridgell
edcaa0d6a0
clean shutdown in ctdb - release all our IPs
...
(This used to be ctdb commit 2f196cb6a86eb85205d7de1c4cadd4e1e701c06f)
2007-05-29 13:33:59 +10:00
Andrew Tridgell
1455d7d7ad
don't need maskbits to ip addr del
...
(This used to be ctdb commit 93125b460a44934f30bb995ff3c5365ac5a263d5)
2007-05-29 13:21:37 +10:00
Andrew Tridgell
6cd49d7842
fixed syntax of /sbin/ip
...
(This used to be ctdb commit 9791901dda000fbef6e520531f39ead575531721)
2007-05-29 13:09:15 +10:00
Andrew Tridgell
1becc9f2e7
made events script executable
...
(This used to be ctdb commit 54934884ae2bfe8b7d155aa22ee90b2d0a674def)
2007-05-29 13:04:52 +10:00
Andrew Tridgell
a39eff68a8
added an example ctdb event script
...
(This used to be ctdb commit f97b75497d005306c5f893c3182f1c2a9b4dc6b7)
2007-05-29 13:01:31 +10:00
Andrew Tridgell
ead091449b
call the event script on recovery too
...
(This used to be ctdb commit 8c43a91cbd6e502c93bd6cc51df1272eae426709)
2007-05-29 12:55:24 +10:00
Andrew Tridgell
dfadb60318
- moved ctdbd specific options to ctdbd.c from cmdline.c
...
- allow a event script to be specified that will take IPs, release
IPs, and handle recovery in system specific ways
- redirect stderr in subcommands to the log
(This used to be ctdb commit de0fc9ba370db781f9c46406ed180c8211946c7a)
2007-05-29 12:49:25 +10:00
Andrew Tridgell
d35a475fd2
fix sense of inet_aton() call
...
(This used to be ctdb commit 39788dd804beee479f0cb3be8f4b984c3c7aad2e)
2007-05-29 12:26:21 +10:00
Andrew Tridgell
ccf4d78e04
- renamed ctdb_control utility to ctdb
...
- use -n to specify node number in ctdb utility
- change 'ctdb status' to 'ctdb statistics'
- added 'ctdb status' which shows status
- added netmask to public IPs, so you don't try a takeover on a
foreign network
- cleaned up tools/ctdb_control.c a lot
- generate usage message at runtime
(This used to be ctdb commit 28de71c03ace7d32a9fd9882fabbd5d668b97656)
2007-05-29 12:16:59 +10:00
Andrew Tridgell
9838027ff8
rename ctdb_control utility to ctdb
...
(This used to be ctdb commit 7fc3868a1a49aa8b49b277793eede903606b4353)
2007-05-29 10:59:12 +10:00
jmcd@samba.org
239f698b63
Next round of packaging updates:
...
- Create/delete source symlink as needed during rpm build
- init script updates:
- optional port, debuglevel, logfile
- autodetect logfile location if smbd exists
- build cluster_addresses file
- create tmp dbdir (Tridge, do you want this to be a fixed location instead?)
(This used to be ctdb commit 6695ec4845b71135db3fc5ed76afde7f20d8e24e)
2007-05-28 11:38:04 -04:00
Andrew Tridgell
116ed6f9d6
merged packaging from jim
...
(This used to be ctdb commit a66a70b9a03d9bea974748cfc9a183b206539ab7)
2007-05-28 20:11:45 +10:00
Andrew Tridgell
dcd23d47a7
another place where we could send a partial packet
...
(This used to be ctdb commit 4e0c57718a0c9f51b5b2e02bbf516c63c5be0a35)
2007-05-28 08:37:54 +10:00
jmcd@samba.org
3be6ed7d42
Inital rpm build files
...
(This used to be ctdb commit 8830592b9202d6edab16af5c717608096597c517)
2007-05-27 16:48:18 -04:00
Andrew Tridgell
53586a4a74
when handing over an IP to another node, also tell them of any tcp connections we were handling, so they can send tickle acks for those connections
...
(This used to be ctdb commit 52c225720da22a149a85ed3cfc07d630cc8b3acd)
2007-05-28 00:51:10 +10:00
Andrew Tridgell
9cc3ce8554
automatic cleanup of tcp tickle records
...
(This used to be ctdb commit ede79b571bf89b89f1b8394f262ca0689f8c65f3)
2007-05-28 00:34:40 +10:00
Andrew Tridgell
b01cdc814f
fixed error reporting in tickle ack code
...
(This used to be ctdb commit a19555a2e2f3c7ca3147f6421fa2f751a3bdfdee)
2007-05-28 00:01:51 +10:00
Andrew Tridgell
99e38ed6ab
use a window size that is obvious in sniffs
...
(This used to be ctdb commit 4d167711a600ad7bb042a5df143dd282b0e50462)
2007-05-27 17:23:35 +10:00
Andrew Tridgell
922d054bca
remove experimental code
...
(This used to be ctdb commit f1d91002247bedb2f163cc9a9515bbe2bbc2692e)
2007-05-27 16:58:43 +10:00
Andrew Tridgell
957ec5d63a
fixed tcp data offset and checksum
...
(This used to be ctdb commit 2df23e0d3df52b746e9aee8d194ad1da16b62657)
2007-05-27 16:56:12 +10:00
Andrew Tridgell
eae66741ec
send on the right socket!
...
(This used to be ctdb commit 2a5253a9da4d0144f4e96d9b8f6ac636849d146a)
2007-05-27 15:47:43 +10:00
Andrew Tridgell
d41290fbae
added code to ctdb to send a tcp 'tickle' ack when we takeover an
...
IP. A raw tcp ack is sent for each tcp connection held by clients
before the IP takeover.
These acks have a deliberately incorrect sequence number, and should
cause the windows client to send its own ack which will in turn cause
a tcp reset and thus cause windows clients to much more quickly
reconnect to the new node.
(This used to be ctdb commit eef38bfe8461b47489d169c61895d6bb8a8f79a1)
2007-05-27 15:26:29 +10:00
Andrew Tridgell
4577eb1cbc
added function to send a raw tcp ack packet
...
(This used to be ctdb commit 634847f3eb5d8183c43d39f0300c0379ef1db3bc)
2007-05-27 13:39:36 +10:00
Andrew Tridgell
647540253e
tweak timeouts
...
(This used to be ctdb commit 54a90797469f56d796efd82e9294efff3c5dabcc)
2007-05-27 09:43:25 +10:00
Andrew Tridgell
7ff6e17ca1
removed bogus alignment check
...
(This used to be ctdb commit 93fd5fd01dc61a53a91e319d5cbbe0fc8f740717)
2007-05-26 18:13:19 +10:00
Andrew Tridgell
35955c1476
drop any partialialy send packets when we get a socket write error
...
(This used to be ctdb commit 4f7b97af8e0075de56059fffc57a86cb0c16df46)
2007-05-26 16:41:32 +10:00
Andrew Tridgell
47b20f7e26
show op type of badly aligned packets in tcp layer
...
(This used to be ctdb commit 6a3e1faa2ce77ee021154d66aeaa99c51bbc8b06)
2007-05-26 16:35:41 +10:00
Andrew Tridgell
9aa692669b
paranoid checks for bad packets in tcp layer. Close the socket if it gets a bad packet
...
(This used to be ctdb commit 1277089e5c6e1036517c63ee8c8e4ff98cb76cf8)
2007-05-26 16:32:32 +10:00
Andrew Tridgell
2b86216b66
handle corrupt ctdb packets better
...
(This used to be ctdb commit aea4d78645a99eb22860048350b5f20841c17704)
2007-05-26 14:46:12 +10:00
Andrew Tridgell
cc4d8102cd
moved system specific ip code to system.c
...
(This used to be ctdb commit 9de9e4ccda9665108baac12a8716b189d26340b1)
2007-05-26 14:01:08 +10:00
Andrew Tridgell
750ae1a35b
send the message from daemon context
...
(This used to be ctdb commit 3bd7d1ebd5fc47f31caa654b5935a3cd05fb6882)
2007-05-26 00:21:07 +10:00
Andrew Tridgell
9e61a5bd77
send a message to clients when an IP has been released
...
(This used to be ctdb commit 8b7ab0b00253462593d368052c2cb10a385b4e63)
2007-05-26 00:05:30 +10:00
Andrew Tridgell
07ade57802
make sure we find out about new nodes as fast as possible
...
(This used to be ctdb commit 73f2c77166e2053625d0f76c370cf7e789a63fdf)
2007-05-25 22:07:45 +10:00
Andrew Tridgell
31053286c5
keep sending ARPs for 2 minutes, every 5 seconds
...
(This used to be ctdb commit d5223f2eed4a762b93a101c720286568578ce7ed)
2007-05-25 21:27:26 +10:00
Andrew Tridgell
7a9e40b288
consider a node dead after 6 seconds, not 15
...
(This used to be ctdb commit b055907f0bd2fa0e83bd84e49039fa868905b941)
2007-05-25 20:00:06 +10:00
Andrew Tridgell
3c3ed0989e
paraoid check for empty db on attach
...
(This used to be ctdb commit f46ecc15d8d6520a7bce721d915897deba84a54f)
2007-05-25 18:48:49 +10:00
Andrew Tridgell
613c6b9c4c
new files for IP takeover
...
(This used to be ctdb commit 9232501a6bb9ee5d67a3b7cc87752d82ede162a6)
2007-05-25 17:16:50 +10:00
Andrew Tridgell
56e3eed3d1
added IP takeover logic for public IPs to ctdb
...
(This used to be ctdb commit 374adb729472670f35cef41269b8719f49c0de0e)
2007-05-25 17:04:13 +10:00
Ronnie Sahlberg
2b6c39a0af
add controls to take over and release an ip address
...
add sending of grat arp both normal grat arp (request) and also
unsolicited grat arp replies
(This used to be ctdb commit 7305c00c21c30bdbafc3722a018513378bd307e6)
2007-05-25 13:05:25 +10:00
Andrew Tridgell
7596347844
make ctdbd realtime if possible
...
(This used to be ctdb commit 8852f6cca52b64a5239c83ab7c6a99ae4edb2597)
2007-05-24 14:52:10 +10:00
Andrew Tridgell
6ba5885f07
raise the control timeout in recovery
...
(This used to be ctdb commit 43424ff66daf28c202c12982f20a9f662b6fb125)
2007-05-24 13:49:27 +10:00
Andrew Tridgell
3909d83eef
merge from ronnie
...
(This used to be ctdb commit f31e750760bd98e2214ea76f359824aca6f5826f)
2007-05-24 10:10:02 +10:00
Ronnie Sahlberg
2aface246e
add a new command for ctdb_control to trigger a recovery
...
(This used to be ctdb commit 6da2a4ab1b9c955d55a1c6817506a74539623892)
2007-05-24 08:08:45 +10:00
Andrew Tridgell
5dc81da87e
fixed %d which should be %u
...
(This used to be ctdb commit 2792cf718ff1e66fe99f870f683a13baa160f629)
2007-05-23 20:15:09 +10:00
Andrew Tridgell
296e15c9d4
fixed some memory leaks on the traverse code
...
(This used to be ctdb commit 2781cbb7d00c5448449216c8c0c1b37bdc74a6c0)
2007-05-23 20:06:37 +10:00
Andrew Tridgell
70912e2b0c
added automatic vacuuming of empty records during recovery
...
(This used to be ctdb commit f9181a784ac7009df5e9c996f4e0c3e99098b59a)
2007-05-23 17:21:14 +10:00
Andrew Tridgell
3736943ead
- get rid of ctdb_ctrl_get_config
...
- cope with zero timeout in ctdb_control
(This used to be ctdb commit 9f24689eb0ad6813f1ac78e033cdffe5d8afb128)
2007-05-23 15:18:30 +10:00
Andrew Tridgell
74bf76ca10
merge from ronnie
...
(This used to be ctdb commit 267481b67152bc5885884d223085aa9ef5fe73bd)
2007-05-23 14:50:41 +10:00
Andrew Tridgell
76b2822340
- startup frozen, and do an initial recovery
...
- fixed a bug in traverse
- get a lock on the node list file in the recmaster recovery daemon
(This used to be ctdb commit 162a5647535ad1cb3e8e5d4042a2784365fb1913)
2007-05-23 14:35:19 +10:00
Andrew Tridgell
9f7a70657f
start ctdb frozen, and let the election sort things out. This prevents a race on startup
...
(This used to be ctdb commit b788ed3fa64e31e517b4e602e8bd3ae7201ecddd)
2007-05-23 12:23:07 +10:00
Andrew Tridgell
edf03e88c8
global lock should imply the transaction lock
...
(This used to be ctdb commit 765a7e27344476ea8835565839b69872ea6ab6f0)
2007-05-22 13:14:51 +10:00
Ronnie Sahlberg
e989a1bac8
add controls to enable/disable the monitoring of dead nodes
...
(This used to be ctdb commit 79d29c39bb81feb069db3fc6d3d392c1e75a4d13)
2007-05-21 09:24:34 +10:00
Andrew Tridgell
c02fee6ed3
show ctdb control timeout
...
(This used to be ctdb commit 03921eacb942c7aa6b81236c6603dc640cefe2bf)
2007-05-19 21:11:06 +10:00
Andrew Tridgell
d549f1e1a3
merge from ronnie
...
(This used to be ctdb commit 985d718e03510398b9a5cfdf6a4d559a90738a11)
2007-05-19 17:21:58 +10:00
Ronnie Sahlberg
02a9f1b0a0
use ctdb_dead_node() instead of reimplementing the same code again
...
this leaves only one single function where a node is marked as dead
instead of two places
(This used to be ctdb commit aa764ea26cc26d5c1ae188105236da603576f45b)
2007-05-19 16:59:10 +10:00
Andrew Tridgell
ab66fb840e
removed obsolete ctdb_dump tool
...
(This used to be ctdb commit e3ed6fd65896f07fc76405acb2e16f50f04a0a3c)
2007-05-19 14:07:01 +10:00
Andrew Tridgell
45d665ab20
don't count packets received from before the transport told us the node was dead
...
(This used to be ctdb commit 46b5319963afa9da6cf6d83f2a5dc1fa664e14b4)
2007-05-19 14:04:48 +10:00
Andrew Tridgell
13392414b1
nicer date formatting
...
(This used to be ctdb commit 78c256caa738593e335de65df752ff015f3a38e1)
2007-05-19 14:01:02 +10:00
Andrew Tridgell
a14fd9d29c
make sure we don't increment rx_cnt for redirected packets, or for packets that have been requeued after a lockwait
...
(This used to be ctdb commit 92e5569407dba173a27e9645b4339ce3e2c00520)
2007-05-19 13:45:24 +10:00
Ronnie Sahlberg
5f4eed84e0
increase the tx_cnt everytime we send a packet to a node
...
(This used to be ctdb commit d3fea0d87268e552f4ed3a90d10b3af209413f73)
2007-05-19 10:27:17 +10:00
Ronnie Sahlberg
9f7b9faf64
add a node->tx_cnt counter
...
only send keepalive packets if the count is zero
(This used to be ctdb commit 2cbd424231caccf0a531cf6501761115efe68f3e)
2007-05-19 10:20:19 +10:00
Andrew Tridgell
28f2fc669b
a better way to resend calls after recovery
...
(This used to be ctdb commit 444f52e134fc22aaf254d05c86d8b357ded876f4)
2007-05-19 00:56:49 +10:00
Andrew Tridgell
049e1504ee
timeout pending controls immediately when a node becomes disconnected
...
(This used to be ctdb commit 93c4b16f4efef383ba8db83953019ef4821613e0)
2007-05-18 23:48:29 +10:00
Andrew Tridgell
346dfc1bef
- up rx_cnt on all packet types
...
- notice when a node becomes available again
(This used to be ctdb commit e05110dd6112e81f224937dfd7370d963ce9531a)
2007-05-18 23:23:36 +10:00
Ronnie Sahlberg
334a294481
add a missing file :-)
...
(This used to be ctdb commit 29cf1b927f2cebfdc43e22d32a270e956716e2c5)
2007-05-18 20:06:29 +10:00
Ronnie Sahlberg
db4c479568
add dead node detection so that if a node does not generate any
...
keepalive traffic for x seconds it is deemed dead
this triggers a recovery after a while if a ctdbd has been STOPPED
but it doesnt recover automatically when the node reappears
(This used to be ctdb commit d6324afe0d13b5e21d06e347caca433c6b36a32a)
2007-05-18 19:19:35 +10:00
Andrew Tridgell
2d9f0ee606
merge from samba4
...
(This used to be ctdb commit 3028a1b5c21b981c4487b55f026e6645bbeada89)
2007-05-18 11:24:45 +10:00
Andrew Tridgell
51f5bd2b31
merge from samba4
...
(This used to be ctdb commit b280bb01fc9afea5a9168c63d92907dac725b22b)
2007-05-18 08:33:10 +10:00
Andrew Tridgell
49fe66713f
- don't try to send controls to dead nodes
...
- use only connected nodes in a traverse
(This used to be ctdb commit 9a676dd5d331022d946a56c52c42fc6985b93dbc)
2007-05-17 23:23:41 +10:00
Andrew Tridgell
0b11bd99dd
merged from samba4
...
(This used to be ctdb commit 23b1c00f8bf60739aee935bac44981b68477ee1f)
2007-05-17 17:54:02 +10:00
Andrew Tridgell
874fd5c2f7
removed the CTDB_CTRL_FLAG_NOREQUEUE flag
...
(This used to be ctdb commit 366e849f6f350eda78d79cf1ea55c2637e605c86)
2007-05-17 14:10:38 +10:00
Andrew Tridgell
fb77fd0bed
merge from ronnie
...
(This used to be ctdb commit f80c4b02ffb474a4710fa411123448e3ce2d004a)
2007-05-17 12:46:50 +10:00
Andrew Tridgell
6d32a6dd50
merged debug changes from samba4
...
(This used to be ctdb commit f01180148635b9fcc0f50beddb87be0a2071dfd4)
2007-05-17 12:43:09 +10:00
Andrew Tridgell
241ce32f1d
merged events changes from samba4
...
(This used to be ctdb commit cd9280556663bc8929672ada6ea556e097c4b2b3)
2007-05-17 12:39:53 +10:00
Ronnie Sahlberg
f4738f9c41
we no longer pass lmaster across during pulldb so dont print it from
...
catdb either
(This used to be ctdb commit b57d60f4789ea7f0dd69c93f6629d8742e182576)
2007-05-17 12:07:29 +10:00
Ronnie Sahlberg
cc760cf13a
add a control to shutdown/kill a node
...
(This used to be ctdb commit 3802f7304fd59d56062c855987e2561753e85a69)
2007-05-17 10:45:31 +10:00
Ronnie Sahlberg
d6ed77468d
merge from tridge
...
(This used to be ctdb commit 0c6dc471e33e80db00a2b006262c4107f39fa023)
2007-05-16 18:44:51 +10:00
Andrew Tridgell
c105f6d789
- merge from ronnie
...
- fixed a memory leak found by dmitry
(This used to be ctdb commit ae87bf0005666b50850161c3843d6bc7cb5c8971)
2007-05-16 18:10:26 +10:00
Ronnie Sahlberg
f4056d2e28
remove a prototype we no longer need
...
(This used to be ctdb commit 4a11373ec5e8196cf430f18f6171915f790f794b)
2007-05-16 14:45:43 +10:00
Ronnie Sahlberg
a4ebb6d5ef
if a caller specifies a timeout when calling a control, it makes no
...
sense to have the daemon requeue the packets if they timeout or fail to
deliver to the remote node
(This used to be ctdb commit 9fb753046787190970654aeb937e96685ac53184)
2007-05-16 12:34:30 +10:00
Ronnie Sahlberg
4b8ddfccad
merge from tridge
...
(This used to be ctdb commit 8d424b41d6cf2973b28a749d1b8e6a028dad9ffe)
2007-05-16 11:12:28 +10:00
Andrew Tridgell
20d96ad5c5
enable TCP keepalives
...
(This used to be ctdb commit a44f760f6260359201d8431d2f1267af2bc6b1b1)
2007-05-15 18:40:56 +10:00
Andrew Tridgell
a5198559c9
moved the recovery daemon into the main ctdbd and enable it by default
...
(This used to be ctdb commit 2a7d42124731f43d013cb76a798525eab4cc1ee0)
2007-05-15 15:13:36 +10:00
Andrew Tridgell
527b2352ac
fixed two more places where we don't correctly handle write errors on sockets
...
(This used to be ctdb commit f4a71bb63e7f75d21b66f9eaeac997c2029cd146)
2007-05-15 14:08:58 +10:00
Ronnie Sahlberg
5d82f89a55
merge from tridge
...
(This used to be ctdb commit eb64cde53ec5ed6949df1684e5c148f2294b1da7)
2007-05-15 10:34:14 +10:00
Andrew Tridgell
67f5601bef
fixed a fd close error on reconnect
...
(This used to be ctdb commit 240651a6f67f914b06e273696cef6180d788221e)
2007-05-15 10:33:28 +10:00
Ronnie Sahlberg
0d71b6d1e6
merge from tridge
...
(This used to be ctdb commit 0697f59a044deeab126a39bff97bcd5c1101298e)
2007-05-15 10:28:41 +10:00
Andrew Tridgell
c6afe22b92
added a control to get the local vnn
...
(This used to be ctdb commit 0b109f574b710f290372512d0694290ea7cd4368)
2007-05-15 10:17:16 +10:00
Andrew Tridgell
38491de84f
check for error on ctdb_ltdb_store
...
(This used to be ctdb commit c4a34bac4ad4d2f9699e08074668d25586e3c0da)
2007-05-15 10:16:59 +10:00
Andrew Tridgell
cf1056df94
added a -i switch to run ctdbd without forking
...
(This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4)
2007-05-15 09:44:33 +10:00
Andrew Tridgell
db4fa58b07
reading on the write side of a pipe isn't allowed - this caused us to run without locking in the lockwait code
...
(This used to be ctdb commit 2ac67ce737f30258915cc25bde531d361092ae14)
2007-05-15 09:44:03 +10:00
Andrew Tridgell
7d3870d41f
AIX needs sin_len field for bind()
...
(This used to be ctdb commit cd6c35d4aa4f4a4cfeedf6902cda84e43d7aeba4)
2007-05-15 09:42:52 +10:00
Ronnie Sahlberg
5d0e7e354f
merge from tridge
...
(This used to be ctdb commit d1dae4fc8f4c2d16d313a27968d67c5825a133d1)
2007-05-14 14:07:19 +10:00
Ronnie Sahlberg
4b6d949cd3
merge from tridge
...
(This used to be ctdb commit 65f4415e618dbbac0260f6a4e51e051e6df64a61)
2007-05-14 14:05:49 +10:00
Andrew Tridgell
3f3eb4f636
we must not free the fde until after we no longer need the lock child
...
(This used to be ctdb commit e06776c7c37b63f5c3165c7043d665e0c1a95337)
2007-05-14 14:01:33 +10:00
Andrew Tridgell
d655a64b5b
kill the lockwait child if the pipe goes away
...
(This used to be ctdb commit bdfa8ba9932fade074a05a6cb6bc14ae3b84618c)
2007-05-14 13:49:01 +10:00
Andrew Tridgell
9e70b95159
don't allow setvnnmap while not frozen
...
(This used to be ctdb commit a73f47f565894cc7e346177d87f2e6813837e1c6)
2007-05-14 13:48:40 +10:00
Andrew Tridgell
5d71cc78bb
don't allow setrecmaster while not frozen
...
(This used to be ctdb commit e84b05ba6062ffc45b7f3c23e88feef1d39069c4)
2007-05-14 13:48:14 +10:00
Ronnie Sahlberg
ed466e20b6
remove the control to bump the rsn since we dont need it anymore
...
(This used to be ctdb commit a646b6d77bd8adf6c986259c534a05400c4bde11)
2007-05-14 08:03:48 +10:00
Ronnie Sahlberg
4bacd385ef
add a mising parameter to the new signature for ctdb_control
...
(This used to be ctdb commit 3a3304cd48d644c758f416ec283faf3ba9690c04)
2007-05-14 06:50:24 +10:00
Ronnie Sahlberg
4f7fc688f7
merge from tridge
...
(This used to be ctdb commit 7bca79ad6357149fd7c6b28ce4b05de3d223a7de)
2007-05-14 06:25:15 +10:00
Andrew Tridgell
5839eb1e80
make sure the ctdb control socket is secure
...
(This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861)
2007-05-13 09:20:16 +10:00
Andrew Tridgell
81826da2df
added error messages in ctdb_control replies
...
(This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5)
2007-05-12 21:25:26 +10:00
Andrew Tridgell
58dc8d4e19
prioritise the dmaster in case of matching rsn
...
(This used to be ctdb commit 4996a12174aa0d215a5b14cb970bdf83eed34a39)
2007-05-12 19:57:12 +10:00
Andrew Tridgell
7f570322c3
the invalid dmaster is no longer needed in recovery
...
(This used to be ctdb commit bd638ea63d11485bc3a8c50d923262a48095c2f3)
2007-05-12 19:56:31 +10:00
Andrew Tridgell
68fc85d162
the retry client code is no longer needed now that we use a freeze on recovery
...
(This used to be ctdb commit 4213475a2db93b149705bfbb578c78936124c608)
2007-05-12 19:55:55 +10:00
Andrew Tridgell
df49a66de4
ensure we propogate the correct rsn for a request dmaster
...
(This used to be ctdb commit 70c1c67db865db8a49b56e8e3e8fd56ec5063208)
2007-05-12 19:55:18 +10:00
Andrew Tridgell
78eb2f96af
simplify the generation checking on incoming call packets
...
(This used to be ctdb commit 87ee47f7fbbf71228bc9cc16faff86b4c59333a2)
2007-05-12 19:54:40 +10:00
Andrew Tridgell
36ccc10389
make sure we ignore requeued ctdb_call packets of older generations except for packets from the client
...
(This used to be ctdb commit facab105fbd7fe50f96bdd763ae50ddc54fbdacc)
2007-05-12 18:08:50 +10:00
Andrew Tridgell
5bd0e50086
added -t option to ctdb_control
...
(This used to be ctdb commit 658141280eeb121a570d71c4b0af36d03004f320)
2007-05-12 16:04:56 +10:00
Andrew Tridgell
7dcf82cda4
- nicer message if freeze child dies
...
- change local generation count after recovery/freeze started
(This used to be ctdb commit d9768142797f083a8c09b55d6a8a93cc12089348)
2007-05-12 15:59:49 +10:00
Andrew Tridgell
2c90d9e794
show total frozen/recoving in status
...
(This used to be ctdb commit 0d0eb66a63fe6912edb85bf7387ac76acb70babd)
2007-05-12 15:51:08 +10:00
Andrew Tridgell
b327cd872d
report number of frozen/thawed nodes
...
(This used to be ctdb commit 997720bc0e15d882aefed3464fe285674beed691)
2007-05-12 15:44:56 +10:00
Andrew Tridgell
cb81a2eca8
watch for the freeze child exiting
...
(This used to be ctdb commit 7f350eca8598022ebd198b2476d1f2c2a8f03a8d)
2007-05-12 15:44:35 +10:00
Andrew Tridgell
f7e3004f0a
more robust freeze/thaw logic
...
(This used to be ctdb commit 51c1e51aeb7dfac1683584df7ef1bef98c092f76)
2007-05-12 15:29:06 +10:00
Andrew Tridgell
9cf77dd23f
separate out the freeze/thaw handling from recovery
...
(This used to be ctdb commit 0b0640bd8b8334961f240e0cf276ac112cd6e616)
2007-05-12 15:15:27 +10:00
Andrew Tridgell
74a799a83b
added lockwait child code for entering recovery mode. A child processes holds lockall locks for the entire recovery process
...
(This used to be ctdb commit f892f30def75b0d964c35eae38c4cf675597dd28)
2007-05-12 14:34:21 +10:00
Andrew Tridgell
ae55e4181d
added _mark calls for tdb_lockall
...
(This used to be ctdb commit e59134fd2af67c746b907c23fdcde2eccbbe17cf)
2007-05-12 14:33:10 +10:00
Andrew Tridgell
85aff64ed8
fixed debug message
...
(This used to be ctdb commit 9802bf1ef9104b31977020e803b0f81da71c7169)
2007-05-11 17:29:21 +10:00
Ronnie Sahlberg
0c9bb4bb44
we have to get a NEW generation id after completing recovery
...
to solve a race condition with the logic to retransmit in
ctdb_call.c/ctdb_call_timeout()
(This used to be ctdb commit 1044ddca9ff5c434816de35d3f659aa182704e97)
2007-05-11 12:03:19 +10:00
Ronnie Sahlberg
7769a2d45e
merge from tridge
...
(This used to be ctdb commit 826058b547b8e836f0a7066e9479e481ad9c472e)
2007-05-11 10:37:42 +10:00
Ronnie Sahlberg
9ec3024287
add a control to bump the rsn number for all records in a database
...
use this control from the recovery daemon to ensure that the recmaster
always have a higher rsn than andy other node for the records after
recovery completes
(This used to be ctdb commit 6fb6a8b981a804bfcc460c4481c51c7c647230f6)
2007-05-11 10:36:47 +10:00
Andrew Tridgell
63acf8ab95
- merge from ronnie
...
- increment rsn only in become_dmaster
- add torture check for rsn regression in ctdb_ltdb_store
(This used to be ctdb commit 8047506a08bb53ee01aa64f25c9f72839e1e2d68)
2007-05-11 10:33:43 +10:00
Ronnie Sahlberg
9eeb4f1a51
we must bump the rsn everytime we do a REQ_DMASTER or a REPLY_DMASTER
...
to make sure that the "merge records based on rsn during recovery" will
merge correctly.
this is extra important since samba3 never bumps the record when it
writes new data to it !
(This used to be ctdb commit 857e67204065603592c2dbbadbd8667ebba9ccdb)
2007-05-11 06:08:17 +10:00
Ronnie Sahlberg
325713dfeb
make ctdb_control catdb work again
...
(This used to be ctdb commit 40a8fb68c71be0b9f54ae88bf8aa39a4c71f3b5a)
2007-05-11 05:40:11 +10:00
Andrew Tridgell
f8765b19bf
- got rid of the complex hand marshalling in the recovery controls
...
- fixed the re-send of ctdb calls after a generation change
- fixed a reqid idr leak in controls
- removed the write_record test code
- use the new nonblock lockall code to prevent ctdbd from ever doing a
blocking lock that could deadlock with smbd
- moved more of the recovery controls into ctdb_recover.c
(This used to be ctdb commit 565a21aa4f1e842309986ab97d6244801153deec)
2007-05-10 17:43:45 +10:00
Andrew Tridgell
698d2a6af4
added nonblocking varients of the two lockall functions to tdb
...
(This used to be ctdb commit 2e99fa41ce01fa282bc0f3244ca42a78173743ed)
2007-05-10 17:43:08 +10:00
Andrew Tridgell
15bc97cdaa
better timeout handling for calls, controls and traverses
...
(This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe)
2007-05-10 14:06:48 +10:00
Andrew Tridgell
31cd92dc7e
merge from ronnie
...
(This used to be ctdb commit 92b7a849565730744c75a7fb776173554e9f57bf)
2007-05-10 13:15:58 +10:00
Andrew Tridgell
50390bcb18
setup the random number generator a bit better
...
(This used to be ctdb commit 708585eb0ed31b0df6543a1d7a20b82e751877c2)
2007-05-10 13:10:23 +10:00
Ronnie Sahlberg
a54390197a
create a correct vnnmap structure to prevent a segv
...
(This used to be ctdb commit 17777bb5e6208e97a82a171243c6c406f53ee02e)
2007-05-10 10:10:58 +10:00
Ronnie Sahlberg
82e37a9886
update ctdb_control to create a correct ctdb_vnn_map->map array
...
(This used to be ctdb commit e510cc89068557881688d6cada38915b3e51f8cd)
2007-05-10 10:03:21 +10:00
Ronnie Sahlberg
a56a2501ac
when starting a new election, also force all nodes into recovery mode so
...
there is no internode traffic to interfere with our election
(This used to be ctdb commit ccfb67a076c72a0e7f2b6dc5fce9c19f652ba2ad)
2007-05-10 09:48:14 +10:00
Ronnie Sahlberg
4370dc1e75
when starting recovery repoint dmaster to an invalid node and not the
...
current vnn
(This used to be ctdb commit 3c2dcc7448b335cf42e8f7edffba21229dccbd79)
2007-05-10 09:46:10 +10:00
Ronnie Sahlberg
325f321409
merge from tridge
...
(This used to be ctdb commit 8c5e6836280499243c0cd247093844a891f00da3)
2007-05-10 09:44:28 +10:00
Ronnie Sahlberg
639e4374e5
actually check the remote nodes and not just the local node
...
(This used to be ctdb commit 09df21be6361743d320fafc120718211eece85c3)
2007-05-10 09:43:01 +10:00
Andrew Tridgell
1e38ae491f
remove old s3 recovery code
...
fixed vnnmap wire format in recover daemon
(This used to be ctdb commit e03fab7bfe0cf43f40c49a3d63e75dc44001d8d8)
2007-05-10 08:49:57 +10:00
Andrew Tridgell
2a82665532
fixed setvnnmap to use wire structures too
...
(This used to be ctdb commit 1208e4219d220b80e2f74974cac8ed2b8956d3ef)
2007-05-10 08:22:26 +10:00
Andrew Tridgell
682df74d59
separate the wire format and internal format for the vnn_map
...
(This used to be ctdb commit 9a71718d87c5162f1423d85c2e86a01f6771925e)
2007-05-10 08:13:19 +10:00
Andrew Tridgell
a8f83423f4
moved the vnn_map initialisation out of the cmdline code
...
(This used to be ctdb commit 81492b840d608dc724d5a25ddef6eb0ce12b95fb)
2007-05-10 07:55:46 +10:00
Andrew Tridgell
ba47b43c6b
merged ronnies code to delay client requests when in recovery mode
...
(This used to be ctdb commit dfca37076d642f3407c63dfe3b685287d27c8f8d)
2007-05-10 07:43:18 +10:00
Ronnie Sahlberg
cbb6f99f41
merge from tridge
...
(This used to be ctdb commit 190cca8488dff982062ae7b1a82cb33cc1cdfaf7)
2007-05-10 06:55:28 +10:00
Ronnie Sahlberg
bbaaf2bbf4
hang the event from the retry structure instead of the hdr structure
...
(This used to be ctdb commit 8536c8c3a30a986ba4945d02aef82b47495ce3f8)
2007-05-09 14:08:11 +10:00
Ronnie Sahlberg
c938c1b5de
when we are in recovery mode and we get a REQ_CALL from a client,
...
defer it for one second and try again
(This used to be ctdb commit 606fb6414b97d1813056982cda7c0fe84d746e67)
2007-05-09 14:06:47 +10:00
Andrew Tridgell
d2a90cc5a5
merge from ronnie
...
(This used to be ctdb commit f67a4842e7b1efb2ad61c41e4895c7698e564bf3)
2007-05-09 11:54:37 +10:00
Ronnie Sahlberg
6929739b7f
add a command line flag to ctdbd to start a recovery daemon.
...
update the recovery test script to start all ctdb daemons with a
recovery daemon
(This used to be ctdb commit 47794e16df285cacefc30208d892d931a6e46b96)
2007-05-09 09:59:23 +10:00
Ronnie Sahlberg
92333fce03
change the name of the recovery daemon to ctdb_recoverd
...
(This used to be ctdb commit b0cf919e4f38961e5cf4e1e79a0cfe4bb4a96d76)
2007-05-09 09:31:53 +10:00
Ronnie Sahlberg
2befe18e29
add a small tool to monitor recovery
...
(This used to be ctdb commit b45936828713c31ee670e2106b49c2351234f310)
2007-05-09 08:05:53 +10:00
Andrew Tridgell
fdb8144e62
fixed a problem with the number of timed events growing without bound with the new seqnum code
...
(This used to be ctdb commit 6109ae3dae8d93c93a2dc76cc561ea6e21458aa6)
2007-05-08 21:16:29 +10:00
Ronnie Sahlberg
5efa3d88c5
we must repoint dmaster to an invalid node during recovery to stop the
...
shortcut from working
(This used to be ctdb commit 5e18930be8c0efb87aa9e2780d9457634b24e156)
2007-05-08 14:51:55 +10:00
Ronnie Sahlberg
e11eebd070
fix alignment bug for pulldb
...
(This used to be ctdb commit f1188289c18805c2c5f8bae61d73df3fc762faee)
2007-05-08 14:42:00 +10:00
Ronnie Sahlberg
a1866c6eeb
hang the timeout event off state and thus we dont need to explicitely
...
free it and also we wont accidentally return from the function without
killing the event first
(This used to be ctdb commit e3d72d024ef7342a808e5c488fd646a39e5fac78)
2007-05-07 07:54:17 +10:00
Ronnie Sahlberg
6bfb5f61ca
it now works to talloc_free() the timed event if we no longer want it to
...
trigger
this must have been a sideeffect of a different bug in the recoverd.c
code that has now been fixed
(This used to be ctdb commit 676446fd1083c371ad0ff72dd8c636ec8e6d1423)
2007-05-07 07:47:16 +10:00
Ronnie Sahlberg
39d81cffb1
recovery daemon with recovery master election
...
election is primitive, it elects the lowest vnn as the recovery master
two new controls, to get/set recovery master for a node
to use recovery daemon, start one
./bin/recoverd --socket=ctdb.socket*
for each ctdb daemon
it has been briefly tested by deleting and adding nodes to a 4 node
cluster but needs more testing
(This used to be ctdb commit 541d1cc49d46d44042a31a8404d521412ef2fdb3)
2007-05-07 06:51:58 +10:00
Ronnie Sahlberg
a9657f6aa5
add new controls to get and set the recovery master node of a daemon
...
i.e. which node is "elected" to check for and drive recovery
(This used to be ctdb commit d577093eb4b619392c71ab5ce81e8c02565d93f0)
2007-05-07 05:02:48 +10:00
Ronnie Sahlberg
97bc457321
add a test in the function that checks whether the cluster needs
...
recovery or not that all active nodes are in normal mode.
If we discover that some node is still in recoverymode it may indicate
that a previous recovery ended prematurely and thus we should start a
new recovery
(This used to be ctdb commit c15517872e6c98c8c425a8d47d2b348ecb0620b0)
2007-05-07 04:41:12 +10:00
Ronnie Sahlberg
1c438a7256
update a comment to be more desciptive
...
(This used to be ctdb commit 96082c54d830974bf9a4d5bad33ad60379a85798)
2007-05-06 12:46:56 +10:00
Ronnie Sahlberg
1fa2bf831a
change a lot of printf into debug statements
...
(This used to be ctdb commit 6edb9149c7eb36da47e4e6a9dd3ede22263ce3f9)
2007-05-06 10:51:25 +10:00
Ronnie Sahlberg
8a12672992
break out the code to update all nodes to the new vnnmap into a helper
...
function
(This used to be ctdb commit 81d39177949b54715710907d14ddc888dc09b064)
2007-05-06 10:42:18 +10:00
Ronnie Sahlberg
ee83202da6
create a helper function for recovery to push all local databases out
...
onto the remote nodes
(This used to be ctdb commit 1ba76d374652cfa29e56fb77c7190349e42d3bcc)
2007-05-06 10:38:44 +10:00
Ronnie Sahlberg
5fb41f4c3b
add an extra blank line
...
(This used to be ctdb commit 75096dde58df6532abbf5b9ebd771e8810156483)
2007-05-06 10:30:18 +10:00
Ronnie Sahlberg
9281cb192c
break the code that repoints dmaster for all local and remote records
...
into a separate helper function
(This used to be ctdb commit d5ab30d0ac21e736eb34eaa19bccfee5f0ce7cfb)
2007-05-06 10:22:13 +10:00
Ronnie Sahlberg
d51a19f2ba
create a helper function for recovery that pulls and merges all remote
...
databases onto the local node
(This used to be ctdb commit 5cecc47449c369f91e83389a94b987ac32b1e3f4)
2007-05-06 10:16:48 +10:00
Ronnie Sahlberg
d6ce023c68
create a helper function to make sure the local node that does recovery
...
has all the databases that exist on any other remote node
(This used to be ctdb commit 0f436e3d40fea6e6a146019b0c664e80e81e88b4)
2007-05-06 10:12:42 +10:00
Ronnie Sahlberg
0e436f5058
add a helper function to create all missing remote databases detected
...
during recovery
(This used to be ctdb commit 04758c6f7d8f61260be6d2472380cb7904984427)
2007-05-06 10:04:37 +10:00
Ronnie Sahlberg
cadfb24b41
break out the setting/clearing of recovery mode into a dedicated helper
...
function
(This used to be ctdb commit dba4e4f8aa4f2fde1e9f8d93bdf3a33f7de8ce18)
2007-05-06 09:53:12 +10:00
Ronnie Sahlberg
c9aafae5ce
dont allocate arrays where we can just return a single integer
...
(This used to be ctdb commit 07bc338e490e0f7018808a2450bc54863eb88c94)
2007-05-06 08:05:22 +10:00
Ronnie Sahlberg
dceab7ff3e
dont use arrays where a uint32_t works just as well
...
(This used to be ctdb commit 843e974b29c93df891ae7cf13323ee960a334f60)
2007-05-06 07:52:20 +10:00
Ronnie Sahlberg
ad41dff7bf
add a ifdeffed out block to the call.
...
we really should kill the event in case the call completed before the
timeout so that we can also make timed_out non-static
(This used to be ctdb commit f297eed589b1d4e188f77f195683365cf91d0e62)
2007-05-06 07:32:16 +10:00
Ronnie Sahlberg
4f2cdc2d8b
hte timed_out variable needs to be static and can not be on the stack
...
since if the command times out and we return from ctdb_control we may
have events that can trigger later which will overwrite data that is no
longer in our stackframe
(This used to be ctdb commit 93942543092be618c0bd8ef68b470b0789bad7ad)
2007-05-06 07:07:47 +10:00
Ronnie Sahlberg
c6bd23ee11
update to rhe recovery daemon
...
ctdb_ctrl_ calls are timedout due to nodes arriving or leaving the
cluster it crashes the recovery daemon afterwards with a SEGV but no
useful stack backtrace
(This used to be ctdb commit cd3abc7349e86555ccd87cd47a1dcc2adad2f46c)
2007-05-06 06:58:01 +10:00
Ronnie Sahlberg
60d4b0e8b4
in the recover test
...
start the daemons with explicit socketnames and explicit ip address/port
remove all --socket= from all ctdb_control calls since they are not
needed anymore
(This used to be ctdb commit 593a959d428f5b4a913117a9b5c8fe65a3eb950e)
2007-05-06 06:06:39 +10:00
Ronnie Sahlberg
7bbcc964f2
add support in catdb to dump the content of a specific nodes tdb instead
...
of traversing the full cluster.
this makes it easier to debug recovery
update the test script for recovery to reflect the newish signatures to
ctdb_control
the catdb control does still segfault however when there are missing
nodes in the cluster as there are toward the end of the recovery test
(This used to be ctdb commit 8de2a97c14a444f817ceb36461314f10c9601ecc)
2007-05-06 05:53:15 +10:00
Ronnie Sahlberg
0f6d9c73d8
merge from tridge
...
(This used to be ctdb commit 08173e3ab77178b9841db0081a51b93291d9e8dc)
2007-05-06 04:38:41 +10:00
Ronnie Sahlberg
25edbc9a50
add a control to get the pid of a daemon.
...
this makes it possible to kill a specific daemon in the recover test
script
(This used to be ctdb commit 2fa394b4c80988cb1a6d04b236ec64cc9d9e8a40)
2007-05-06 04:31:22 +10:00