Ronnie Sahlberg
f4738f9c41
we no longer pass lmaster across during pulldb so dont print it from
...
catdb either
(This used to be ctdb commit b57d60f4789ea7f0dd69c93f6629d8742e182576)
2007-05-17 12:07:29 +10:00
Ronnie Sahlberg
cc760cf13a
add a control to shutdown/kill a node
...
(This used to be ctdb commit 3802f7304fd59d56062c855987e2561753e85a69)
2007-05-17 10:45:31 +10:00
Andrew Tridgell
c105f6d789
- merge from ronnie
...
- fixed a memory leak found by dmitry
(This used to be ctdb commit ae87bf0005666b50850161c3843d6bc7cb5c8971)
2007-05-16 18:10:26 +10:00
Ronnie Sahlberg
a4ebb6d5ef
if a caller specifies a timeout when calling a control, it makes no
...
sense to have the daemon requeue the packets if they timeout or fail to
deliver to the remote node
(This used to be ctdb commit 9fb753046787190970654aeb937e96685ac53184)
2007-05-16 12:34:30 +10:00
Ronnie Sahlberg
4b8ddfccad
merge from tridge
...
(This used to be ctdb commit 8d424b41d6cf2973b28a749d1b8e6a028dad9ffe)
2007-05-16 11:12:28 +10:00
Andrew Tridgell
a5198559c9
moved the recovery daemon into the main ctdbd and enable it by default
...
(This used to be ctdb commit 2a7d42124731f43d013cb76a798525eab4cc1ee0)
2007-05-15 15:13:36 +10:00
Andrew Tridgell
527b2352ac
fixed two more places where we don't correctly handle write errors on sockets
...
(This used to be ctdb commit f4a71bb63e7f75d21b66f9eaeac997c2029cd146)
2007-05-15 14:08:58 +10:00
Ronnie Sahlberg
0d71b6d1e6
merge from tridge
...
(This used to be ctdb commit 0697f59a044deeab126a39bff97bcd5c1101298e)
2007-05-15 10:28:41 +10:00
Andrew Tridgell
c6afe22b92
added a control to get the local vnn
...
(This used to be ctdb commit 0b109f574b710f290372512d0694290ea7cd4368)
2007-05-15 10:17:16 +10:00
Andrew Tridgell
38491de84f
check for error on ctdb_ltdb_store
...
(This used to be ctdb commit c4a34bac4ad4d2f9699e08074668d25586e3c0da)
2007-05-15 10:16:59 +10:00
Andrew Tridgell
cf1056df94
added a -i switch to run ctdbd without forking
...
(This used to be ctdb commit 327df14ecd58f405fbe8b38afa2ee54a8dd0a2e4)
2007-05-15 09:44:33 +10:00
Andrew Tridgell
db4fa58b07
reading on the write side of a pipe isn't allowed - this caused us to run without locking in the lockwait code
...
(This used to be ctdb commit 2ac67ce737f30258915cc25bde531d361092ae14)
2007-05-15 09:44:03 +10:00
Ronnie Sahlberg
5d0e7e354f
merge from tridge
...
(This used to be ctdb commit d1dae4fc8f4c2d16d313a27968d67c5825a133d1)
2007-05-14 14:07:19 +10:00
Ronnie Sahlberg
4b6d949cd3
merge from tridge
...
(This used to be ctdb commit 65f4415e618dbbac0260f6a4e51e051e6df64a61)
2007-05-14 14:05:49 +10:00
Andrew Tridgell
3f3eb4f636
we must not free the fde until after we no longer need the lock child
...
(This used to be ctdb commit e06776c7c37b63f5c3165c7043d665e0c1a95337)
2007-05-14 14:01:33 +10:00
Andrew Tridgell
d655a64b5b
kill the lockwait child if the pipe goes away
...
(This used to be ctdb commit bdfa8ba9932fade074a05a6cb6bc14ae3b84618c)
2007-05-14 13:49:01 +10:00
Andrew Tridgell
9e70b95159
don't allow setvnnmap while not frozen
...
(This used to be ctdb commit a73f47f565894cc7e346177d87f2e6813837e1c6)
2007-05-14 13:48:40 +10:00
Andrew Tridgell
5d71cc78bb
don't allow setrecmaster while not frozen
...
(This used to be ctdb commit e84b05ba6062ffc45b7f3c23e88feef1d39069c4)
2007-05-14 13:48:14 +10:00
Ronnie Sahlberg
ed466e20b6
remove the control to bump the rsn since we dont need it anymore
...
(This used to be ctdb commit a646b6d77bd8adf6c986259c534a05400c4bde11)
2007-05-14 08:03:48 +10:00
Ronnie Sahlberg
4bacd385ef
add a mising parameter to the new signature for ctdb_control
...
(This used to be ctdb commit 3a3304cd48d644c758f416ec283faf3ba9690c04)
2007-05-14 06:50:24 +10:00
Ronnie Sahlberg
4f7fc688f7
merge from tridge
...
(This used to be ctdb commit 7bca79ad6357149fd7c6b28ce4b05de3d223a7de)
2007-05-14 06:25:15 +10:00
Andrew Tridgell
5839eb1e80
make sure the ctdb control socket is secure
...
(This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861)
2007-05-13 09:20:16 +10:00
Andrew Tridgell
81826da2df
added error messages in ctdb_control replies
...
(This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5)
2007-05-12 21:25:26 +10:00
Andrew Tridgell
58dc8d4e19
prioritise the dmaster in case of matching rsn
...
(This used to be ctdb commit 4996a12174aa0d215a5b14cb970bdf83eed34a39)
2007-05-12 19:57:12 +10:00
Andrew Tridgell
68fc85d162
the retry client code is no longer needed now that we use a freeze on recovery
...
(This used to be ctdb commit 4213475a2db93b149705bfbb578c78936124c608)
2007-05-12 19:55:55 +10:00
Andrew Tridgell
df49a66de4
ensure we propogate the correct rsn for a request dmaster
...
(This used to be ctdb commit 70c1c67db865db8a49b56e8e3e8fd56ec5063208)
2007-05-12 19:55:18 +10:00
Andrew Tridgell
78eb2f96af
simplify the generation checking on incoming call packets
...
(This used to be ctdb commit 87ee47f7fbbf71228bc9cc16faff86b4c59333a2)
2007-05-12 19:54:40 +10:00
Andrew Tridgell
36ccc10389
make sure we ignore requeued ctdb_call packets of older generations except for packets from the client
...
(This used to be ctdb commit facab105fbd7fe50f96bdd763ae50ddc54fbdacc)
2007-05-12 18:08:50 +10:00
Andrew Tridgell
7dcf82cda4
- nicer message if freeze child dies
...
- change local generation count after recovery/freeze started
(This used to be ctdb commit d9768142797f083a8c09b55d6a8a93cc12089348)
2007-05-12 15:59:49 +10:00
Andrew Tridgell
2c90d9e794
show total frozen/recoving in status
...
(This used to be ctdb commit 0d0eb66a63fe6912edb85bf7387ac76acb70babd)
2007-05-12 15:51:08 +10:00
Andrew Tridgell
cb81a2eca8
watch for the freeze child exiting
...
(This used to be ctdb commit 7f350eca8598022ebd198b2476d1f2c2a8f03a8d)
2007-05-12 15:44:35 +10:00
Andrew Tridgell
f7e3004f0a
more robust freeze/thaw logic
...
(This used to be ctdb commit 51c1e51aeb7dfac1683584df7ef1bef98c092f76)
2007-05-12 15:29:06 +10:00
Andrew Tridgell
9cf77dd23f
separate out the freeze/thaw handling from recovery
...
(This used to be ctdb commit 0b0640bd8b8334961f240e0cf276ac112cd6e616)
2007-05-12 15:15:27 +10:00
Andrew Tridgell
74a799a83b
added lockwait child code for entering recovery mode. A child processes holds lockall locks for the entire recovery process
...
(This used to be ctdb commit f892f30def75b0d964c35eae38c4cf675597dd28)
2007-05-12 14:34:21 +10:00
Andrew Tridgell
85aff64ed8
fixed debug message
...
(This used to be ctdb commit 9802bf1ef9104b31977020e803b0f81da71c7169)
2007-05-11 17:29:21 +10:00
Ronnie Sahlberg
0c9bb4bb44
we have to get a NEW generation id after completing recovery
...
to solve a race condition with the logic to retransmit in
ctdb_call.c/ctdb_call_timeout()
(This used to be ctdb commit 1044ddca9ff5c434816de35d3f659aa182704e97)
2007-05-11 12:03:19 +10:00
Ronnie Sahlberg
7769a2d45e
merge from tridge
...
(This used to be ctdb commit 826058b547b8e836f0a7066e9479e481ad9c472e)
2007-05-11 10:37:42 +10:00
Ronnie Sahlberg
9ec3024287
add a control to bump the rsn number for all records in a database
...
use this control from the recovery daemon to ensure that the recmaster
always have a higher rsn than andy other node for the records after
recovery completes
(This used to be ctdb commit 6fb6a8b981a804bfcc460c4481c51c7c647230f6)
2007-05-11 10:36:47 +10:00
Andrew Tridgell
63acf8ab95
- merge from ronnie
...
- increment rsn only in become_dmaster
- add torture check for rsn regression in ctdb_ltdb_store
(This used to be ctdb commit 8047506a08bb53ee01aa64f25c9f72839e1e2d68)
2007-05-11 10:33:43 +10:00
Ronnie Sahlberg
9eeb4f1a51
we must bump the rsn everytime we do a REQ_DMASTER or a REPLY_DMASTER
...
to make sure that the "merge records based on rsn during recovery" will
merge correctly.
this is extra important since samba3 never bumps the record when it
writes new data to it !
(This used to be ctdb commit 857e67204065603592c2dbbadbd8667ebba9ccdb)
2007-05-11 06:08:17 +10:00
Ronnie Sahlberg
325713dfeb
make ctdb_control catdb work again
...
(This used to be ctdb commit 40a8fb68c71be0b9f54ae88bf8aa39a4c71f3b5a)
2007-05-11 05:40:11 +10:00
Andrew Tridgell
f8765b19bf
- got rid of the complex hand marshalling in the recovery controls
...
- fixed the re-send of ctdb calls after a generation change
- fixed a reqid idr leak in controls
- removed the write_record test code
- use the new nonblock lockall code to prevent ctdbd from ever doing a
blocking lock that could deadlock with smbd
- moved more of the recovery controls into ctdb_recover.c
(This used to be ctdb commit 565a21aa4f1e842309986ab97d6244801153deec)
2007-05-10 17:43:45 +10:00
Andrew Tridgell
15bc97cdaa
better timeout handling for calls, controls and traverses
...
(This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe)
2007-05-10 14:06:48 +10:00
Andrew Tridgell
2a82665532
fixed setvnnmap to use wire structures too
...
(This used to be ctdb commit 1208e4219d220b80e2f74974cac8ed2b8956d3ef)
2007-05-10 08:22:26 +10:00
Andrew Tridgell
682df74d59
separate the wire format and internal format for the vnn_map
...
(This used to be ctdb commit 9a71718d87c5162f1423d85c2e86a01f6771925e)
2007-05-10 08:13:19 +10:00
Andrew Tridgell
a8f83423f4
moved the vnn_map initialisation out of the cmdline code
...
(This used to be ctdb commit 81492b840d608dc724d5a25ddef6eb0ce12b95fb)
2007-05-10 07:55:46 +10:00
Andrew Tridgell
ba47b43c6b
merged ronnies code to delay client requests when in recovery mode
...
(This used to be ctdb commit dfca37076d642f3407c63dfe3b685287d27c8f8d)
2007-05-10 07:43:18 +10:00
Ronnie Sahlberg
bbaaf2bbf4
hang the event from the retry structure instead of the hdr structure
...
(This used to be ctdb commit 8536c8c3a30a986ba4945d02aef82b47495ce3f8)
2007-05-09 14:08:11 +10:00
Ronnie Sahlberg
c938c1b5de
when we are in recovery mode and we get a REQ_CALL from a client,
...
defer it for one second and try again
(This used to be ctdb commit 606fb6414b97d1813056982cda7c0fe84d746e67)
2007-05-09 14:06:47 +10:00
Andrew Tridgell
d2a90cc5a5
merge from ronnie
...
(This used to be ctdb commit f67a4842e7b1efb2ad61c41e4895c7698e564bf3)
2007-05-09 11:54:37 +10:00