Andrew Tridgell
db4fa58b07
reading on the write side of a pipe isn't allowed - this caused us to run without locking in the lockwait code
...
(This used to be ctdb commit 2ac67ce737f30258915cc25bde531d361092ae14)
2007-05-15 09:44:03 +10:00
Andrew Tridgell
7d3870d41f
AIX needs sin_len field for bind()
...
(This used to be ctdb commit cd6c35d4aa4f4a4cfeedf6902cda84e43d7aeba4)
2007-05-15 09:42:52 +10:00
Andrew Tridgell
3f3eb4f636
we must not free the fde until after we no longer need the lock child
...
(This used to be ctdb commit e06776c7c37b63f5c3165c7043d665e0c1a95337)
2007-05-14 14:01:33 +10:00
Andrew Tridgell
d655a64b5b
kill the lockwait child if the pipe goes away
...
(This used to be ctdb commit bdfa8ba9932fade074a05a6cb6bc14ae3b84618c)
2007-05-14 13:49:01 +10:00
Andrew Tridgell
9e70b95159
don't allow setvnnmap while not frozen
...
(This used to be ctdb commit a73f47f565894cc7e346177d87f2e6813837e1c6)
2007-05-14 13:48:40 +10:00
Andrew Tridgell
5d71cc78bb
don't allow setrecmaster while not frozen
...
(This used to be ctdb commit e84b05ba6062ffc45b7f3c23e88feef1d39069c4)
2007-05-14 13:48:14 +10:00
Andrew Tridgell
5839eb1e80
make sure the ctdb control socket is secure
...
(This used to be ctdb commit 2954f2e501a418af578e75e8705b0b39a77c1861)
2007-05-13 09:20:16 +10:00
Andrew Tridgell
81826da2df
added error messages in ctdb_control replies
...
(This used to be ctdb commit bd848f5b760e6b2a73ebfc67fd8adb3c31479fb5)
2007-05-12 21:25:26 +10:00
Andrew Tridgell
58dc8d4e19
prioritise the dmaster in case of matching rsn
...
(This used to be ctdb commit 4996a12174aa0d215a5b14cb970bdf83eed34a39)
2007-05-12 19:57:12 +10:00
Andrew Tridgell
7f570322c3
the invalid dmaster is no longer needed in recovery
...
(This used to be ctdb commit bd638ea63d11485bc3a8c50d923262a48095c2f3)
2007-05-12 19:56:31 +10:00
Andrew Tridgell
68fc85d162
the retry client code is no longer needed now that we use a freeze on recovery
...
(This used to be ctdb commit 4213475a2db93b149705bfbb578c78936124c608)
2007-05-12 19:55:55 +10:00
Andrew Tridgell
df49a66de4
ensure we propogate the correct rsn for a request dmaster
...
(This used to be ctdb commit 70c1c67db865db8a49b56e8e3e8fd56ec5063208)
2007-05-12 19:55:18 +10:00
Andrew Tridgell
78eb2f96af
simplify the generation checking on incoming call packets
...
(This used to be ctdb commit 87ee47f7fbbf71228bc9cc16faff86b4c59333a2)
2007-05-12 19:54:40 +10:00
Andrew Tridgell
36ccc10389
make sure we ignore requeued ctdb_call packets of older generations except for packets from the client
...
(This used to be ctdb commit facab105fbd7fe50f96bdd763ae50ddc54fbdacc)
2007-05-12 18:08:50 +10:00
Andrew Tridgell
5bd0e50086
added -t option to ctdb_control
...
(This used to be ctdb commit 658141280eeb121a570d71c4b0af36d03004f320)
2007-05-12 16:04:56 +10:00
Andrew Tridgell
7dcf82cda4
- nicer message if freeze child dies
...
- change local generation count after recovery/freeze started
(This used to be ctdb commit d9768142797f083a8c09b55d6a8a93cc12089348)
2007-05-12 15:59:49 +10:00
Andrew Tridgell
2c90d9e794
show total frozen/recoving in status
...
(This used to be ctdb commit 0d0eb66a63fe6912edb85bf7387ac76acb70babd)
2007-05-12 15:51:08 +10:00
Andrew Tridgell
b327cd872d
report number of frozen/thawed nodes
...
(This used to be ctdb commit 997720bc0e15d882aefed3464fe285674beed691)
2007-05-12 15:44:56 +10:00
Andrew Tridgell
cb81a2eca8
watch for the freeze child exiting
...
(This used to be ctdb commit 7f350eca8598022ebd198b2476d1f2c2a8f03a8d)
2007-05-12 15:44:35 +10:00
Andrew Tridgell
f7e3004f0a
more robust freeze/thaw logic
...
(This used to be ctdb commit 51c1e51aeb7dfac1683584df7ef1bef98c092f76)
2007-05-12 15:29:06 +10:00
Andrew Tridgell
9cf77dd23f
separate out the freeze/thaw handling from recovery
...
(This used to be ctdb commit 0b0640bd8b8334961f240e0cf276ac112cd6e616)
2007-05-12 15:15:27 +10:00
Andrew Tridgell
74a799a83b
added lockwait child code for entering recovery mode. A child processes holds lockall locks for the entire recovery process
...
(This used to be ctdb commit f892f30def75b0d964c35eae38c4cf675597dd28)
2007-05-12 14:34:21 +10:00
Andrew Tridgell
ae55e4181d
added _mark calls for tdb_lockall
...
(This used to be ctdb commit e59134fd2af67c746b907c23fdcde2eccbbe17cf)
2007-05-12 14:33:10 +10:00
Andrew Tridgell
85aff64ed8
fixed debug message
...
(This used to be ctdb commit 9802bf1ef9104b31977020e803b0f81da71c7169)
2007-05-11 17:29:21 +10:00
Andrew Tridgell
63acf8ab95
- merge from ronnie
...
- increment rsn only in become_dmaster
- add torture check for rsn regression in ctdb_ltdb_store
(This used to be ctdb commit 8047506a08bb53ee01aa64f25c9f72839e1e2d68)
2007-05-11 10:33:43 +10:00
Ronnie Sahlberg
9eeb4f1a51
we must bump the rsn everytime we do a REQ_DMASTER or a REPLY_DMASTER
...
to make sure that the "merge records based on rsn during recovery" will
merge correctly.
this is extra important since samba3 never bumps the record when it
writes new data to it !
(This used to be ctdb commit 857e67204065603592c2dbbadbd8667ebba9ccdb)
2007-05-11 06:08:17 +10:00
Ronnie Sahlberg
325713dfeb
make ctdb_control catdb work again
...
(This used to be ctdb commit 40a8fb68c71be0b9f54ae88bf8aa39a4c71f3b5a)
2007-05-11 05:40:11 +10:00
Ronnie Sahlberg
343e44918c
merge from tridge
...
(This used to be ctdb commit f261f554ccf5d85a90f504cc20fc6f1f8b3f14d6)
2007-05-10 17:59:51 +10:00
Andrew Tridgell
f8765b19bf
- got rid of the complex hand marshalling in the recovery controls
...
- fixed the re-send of ctdb calls after a generation change
- fixed a reqid idr leak in controls
- removed the write_record test code
- use the new nonblock lockall code to prevent ctdbd from ever doing a
blocking lock that could deadlock with smbd
- moved more of the recovery controls into ctdb_recover.c
(This used to be ctdb commit 565a21aa4f1e842309986ab97d6244801153deec)
2007-05-10 17:43:45 +10:00
Andrew Tridgell
698d2a6af4
added nonblocking varients of the two lockall functions to tdb
...
(This used to be ctdb commit 2e99fa41ce01fa282bc0f3244ca42a78173743ed)
2007-05-10 17:43:08 +10:00
Andrew Tridgell
15bc97cdaa
better timeout handling for calls, controls and traverses
...
(This used to be ctdb commit 63346a6c59d4821b4c443939b5d88db8cd20f5fe)
2007-05-10 14:06:48 +10:00
Andrew Tridgell
31cd92dc7e
merge from ronnie
...
(This used to be ctdb commit 92b7a849565730744c75a7fb776173554e9f57bf)
2007-05-10 13:15:58 +10:00
Andrew Tridgell
50390bcb18
setup the random number generator a bit better
...
(This used to be ctdb commit 708585eb0ed31b0df6543a1d7a20b82e751877c2)
2007-05-10 13:10:23 +10:00
Ronnie Sahlberg
a54390197a
create a correct vnnmap structure to prevent a segv
...
(This used to be ctdb commit 17777bb5e6208e97a82a171243c6c406f53ee02e)
2007-05-10 10:10:58 +10:00
Ronnie Sahlberg
82e37a9886
update ctdb_control to create a correct ctdb_vnn_map->map array
...
(This used to be ctdb commit e510cc89068557881688d6cada38915b3e51f8cd)
2007-05-10 10:03:21 +10:00
Ronnie Sahlberg
a56a2501ac
when starting a new election, also force all nodes into recovery mode so
...
there is no internode traffic to interfere with our election
(This used to be ctdb commit ccfb67a076c72a0e7f2b6dc5fce9c19f652ba2ad)
2007-05-10 09:48:14 +10:00
Ronnie Sahlberg
4370dc1e75
when starting recovery repoint dmaster to an invalid node and not the
...
current vnn
(This used to be ctdb commit 3c2dcc7448b335cf42e8f7edffba21229dccbd79)
2007-05-10 09:46:10 +10:00
Ronnie Sahlberg
325f321409
merge from tridge
...
(This used to be ctdb commit 8c5e6836280499243c0cd247093844a891f00da3)
2007-05-10 09:44:28 +10:00
Ronnie Sahlberg
639e4374e5
actually check the remote nodes and not just the local node
...
(This used to be ctdb commit 09df21be6361743d320fafc120718211eece85c3)
2007-05-10 09:43:01 +10:00
Andrew Tridgell
1e38ae491f
remove old s3 recovery code
...
fixed vnnmap wire format in recover daemon
(This used to be ctdb commit e03fab7bfe0cf43f40c49a3d63e75dc44001d8d8)
2007-05-10 08:49:57 +10:00
Andrew Tridgell
2a82665532
fixed setvnnmap to use wire structures too
...
(This used to be ctdb commit 1208e4219d220b80e2f74974cac8ed2b8956d3ef)
2007-05-10 08:22:26 +10:00
Andrew Tridgell
682df74d59
separate the wire format and internal format for the vnn_map
...
(This used to be ctdb commit 9a71718d87c5162f1423d85c2e86a01f6771925e)
2007-05-10 08:13:19 +10:00
Andrew Tridgell
a8f83423f4
moved the vnn_map initialisation out of the cmdline code
...
(This used to be ctdb commit 81492b840d608dc724d5a25ddef6eb0ce12b95fb)
2007-05-10 07:55:46 +10:00
Andrew Tridgell
ba47b43c6b
merged ronnies code to delay client requests when in recovery mode
...
(This used to be ctdb commit dfca37076d642f3407c63dfe3b685287d27c8f8d)
2007-05-10 07:43:18 +10:00
Ronnie Sahlberg
cbb6f99f41
merge from tridge
...
(This used to be ctdb commit 190cca8488dff982062ae7b1a82cb33cc1cdfaf7)
2007-05-10 06:55:28 +10:00
Ronnie Sahlberg
bbaaf2bbf4
hang the event from the retry structure instead of the hdr structure
...
(This used to be ctdb commit 8536c8c3a30a986ba4945d02aef82b47495ce3f8)
2007-05-09 14:08:11 +10:00
Ronnie Sahlberg
c938c1b5de
when we are in recovery mode and we get a REQ_CALL from a client,
...
defer it for one second and try again
(This used to be ctdb commit 606fb6414b97d1813056982cda7c0fe84d746e67)
2007-05-09 14:06:47 +10:00
Andrew Tridgell
d2a90cc5a5
merge from ronnie
...
(This used to be ctdb commit f67a4842e7b1efb2ad61c41e4895c7698e564bf3)
2007-05-09 11:54:37 +10:00
Ronnie Sahlberg
6929739b7f
add a command line flag to ctdbd to start a recovery daemon.
...
update the recovery test script to start all ctdb daemons with a
recovery daemon
(This used to be ctdb commit 47794e16df285cacefc30208d892d931a6e46b96)
2007-05-09 09:59:23 +10:00
Ronnie Sahlberg
92333fce03
change the name of the recovery daemon to ctdb_recoverd
...
(This used to be ctdb commit b0cf919e4f38961e5cf4e1e79a0cfe4bb4a96d76)
2007-05-09 09:31:53 +10:00