Ronnie Sahlberg
9f4b0dab03
only check port 21 when monitoring vsftpd
...
(This used to be ctdb commit 41b0d71aaee186138eddc97d49503841fa26f234)
2007-11-15 06:56:02 +11:00
Ronnie Sahlberg
dfa6829621
add CTDB_MANAGES_WINBIND to /etc/sysconfig/ctdb to allow ctdb to be used
...
in environments where samba is used without winbind
(This used to be ctdb commit 1ae5af14f90fd81a20b14c02c0c5ad355a609134)
2007-11-14 16:17:52 +11:00
Ronnie Sahlberg
04ff06d135
merge from tridge
...
(This used to be ctdb commit 8e5d488ae2f7cae2a7a6386ed85a3d26f7d39261)
2007-11-13 13:27:00 +11:00
Andrew Tridgell
0943fba979
make it easier to test starting large numbers of virtual nodes
...
(This used to be ctdb commit cf61bf8b8806d29772985c904d5ee15c24d4d767)
2007-11-13 10:28:06 +11:00
Andrew Tridgell
45f0fdfc20
make election handling much more scalable
...
(This used to be ctdb commit 05938d462b92bd9ecb8e35f53651bded47c48675)
2007-11-13 10:27:44 +11:00
Ronnie Sahlberg
3d98766334
merge from tridge
...
(This used to be ctdb commit f2c8ef106e41c38f73adfc196b6cc328174fbd58)
2007-11-13 07:38:58 +11:00
Andrew Tridgell
3427793f01
don't do the first startup event until we are out of recovery
...
(This used to be ctdb commit 689940eb6e23f16ee063331caf3986613a8963ea)
2007-11-12 13:10:15 +11:00
Ronnie Sahlberg
71dfab2e31
merge from tridge
...
(This used to be ctdb commit 6ccf7a2545c57545111a6236c9b4b493b8464060)
2007-11-12 12:28:20 +11:00
Andrew Tridgell
bde886988b
prevent a deadly embrace between smbd and ctdbd by moving the calling
...
of the startup event scripts after the point where recovery has
started and the node is in normal operation
This makes the 'startup' script just a special type of the 'monitor'
script which is called first
(This used to be ctdb commit 7424c30a5fd04aea0137c466b4318c3f185280d8)
2007-11-12 10:53:11 +11:00
Ronnie Sahlberg
3c1f9882a8
revert 773
...
(This used to be ctdb commit 5a1c8f458ddc9b0ff532afda6007e32db10a71c8)
2007-11-12 10:23:35 +11:00
Ronnie Sahlberg
df5dd43e7c
add a new tunable "CheckNodesFile" that when set to 0 will disable the
...
check in the recovery daemon that all nodes are using the same
/etc/ctdb/nodes file.
Also add some more missing checks that the pnn used is a valid pnn
before using it to dereferencing the ctdb->nodes array
This is useful since it allows us to add more physical nodes to a an
existing cluster without having to bring down the entire cluster.
The to add an additional node to an existing cluster would then be
1, on all nodes set CheckNodesFile=0 using 'ctdb setvar'
2, on all nodes add CTDB_SET_CheckNodesFile=0 to /etc/sysconfig/ctdb
For each each node, one at a time :
3, use 'ctdb disable' to stop the hosted services
4, service ctdb stop
5, service ctdb start
Once all nodes have been restarted
6, on all nodes remove CTDB_SET_CheckNodesFile=0 from
/etc/sysconfig/ctdb
7, on all nodes set CheckNodesFile=0 using 'ctdb setvar'
8, configure and start up the new node
During this procedure, only one node at a time was brought
down/restarted and was so only for a short period.
(This used to be ctdb commit 462501a32143e943ce350bd904a47c0955414a51)
2007-11-05 13:36:11 +11:00
Andrew Tridgell
82bd652749
patch from michael adam
...
(This used to be ctdb commit a7a3bef90f033bab5cb110a6ef77a8bef48f2588)
2007-11-02 13:20:29 +11:00
Ronnie Sahlberg
8b1ad1073b
merge from tridge
...
(This used to be ctdb commit 10302eeecc36c4ce94a4e2e0e57864be790325da)
2007-11-01 09:00:14 +11:00
Andrew Tridgell
29e48fe54a
increase release number
...
(This used to be ctdb commit dc648b1bb6becc52dcf900add97418a5634367eb)
2007-10-30 10:19:43 +11:00
Andrew Tridgell
684282f7a1
added bonding info to ctdb_diagnostics
...
(This used to be ctdb commit 71b5fc434bc5d88eb0669ee29aa932ba12737e07)
2007-10-30 10:18:52 +11:00
Andrew Tridgell
87bfa7d61e
merge from ronnie
...
(This used to be ctdb commit 22b110549ff35f2560043abd5d85bed4b35295ee)
2007-10-29 13:43:12 +11:00
root
2a70ac8801
the while loop in the startup event runs as a subshell so we need an extra || exit 1 at the end
...
to propagate the error code back to the caller of the script
(This used to be ctdb commit c30d5c328784059949f5e82a07008e9632234f20)
2007-10-29 12:34:45 +11:00
Ronnie Sahlberg
8599f2008d
if bond* interfaces are used as public interfaces we can not rely on ethtool but
...
have to check /proc for the status instead
(This used to be ctdb commit 4ed7747267aea265b7a71c651abf6d5db4f4718b)
2007-10-29 10:51:16 +11:00
Ronnie Sahlberg
ba6f9ae4a7
merge from tridge
...
(This used to be ctdb commit c7777b966f6a6e0f4126c03300338fdc822ac6c9)
2007-10-29 08:50:51 +11:00
Ronnie Sahlberg
bd73497a18
merge from tridge
...
(This used to be ctdb commit 919ba610c61cfaf5ecc1ab64ad8be34a80d928f4)
2007-10-29 08:40:46 +11:00
Andrew Tridgell
6d75f0703e
added monitoring of ftp ports
...
(This used to be ctdb commit 4780e078fb55d69053f78a4bbc7c67e569bb5dae)
2007-10-26 14:53:09 +10:00
Ronnie Sahlberg
533a530177
since service nfs stop/start sometimes fail to bring up the mount daemon on rhel5
...
check if mountd is running during monitoring and if it is not, try to restart it
(This used to be ctdb commit 3d4b74669164b519398aeeacd59714f1e3884eff)
2007-10-23 12:35:43 +10:00
Andrew Tridgell
1d6b4f418d
update release number
...
(This used to be ctdb commit fe6766940b2cf8a84ed51824158c956362a5806d)
2007-10-23 11:56:52 +10:00
Andrew Tridgell
2cea351f45
merge from ronnie
...
(This used to be ctdb commit cc70a2cc5f5400d6480cb609e1fa203236917976)
2007-10-23 11:45:36 +10:00
Ronnie Sahlberg
44ab81763d
merge from tridge
...
(This used to be ctdb commit 938e375a80ce2f1827117c38554f576f73a5c71e)
2007-10-23 06:42:45 +10:00
Andrew Tridgell
6e6de1e4b7
fixed a problem with backgrounding onnnode
...
(This used to be ctdb commit 4e23630224bb219cfbbf129c4562da5a4c2d601a)
2007-10-22 21:11:02 +10:00
Andrew Tridgell
8e22bca5ca
fixed a double close of a socket, leading to an EPOLL error
...
(This used to be ctdb commit bbe8ad842bdfedd37ef14a6be07ad939113fe9b1)
2007-10-22 16:41:11 +10:00
Ronnie Sahlberg
6a32af60b8
nfs may take a while to stop so do it in hte background
...
(This used to be ctdb commit 2ccaeaf6a65731c17173a4945e3e00e230e67d35)
2007-10-22 15:14:49 +10:00
Andrew Tridgell
2d8afd85d5
another place where we need to mark connect_fde as freed
...
(This used to be ctdb commit d047fbeafebe4b150602f9a91802795659058b16)
2007-10-22 15:13:32 +10:00
Andrew Tridgell
2931ed5d17
fixed a valgrind uninitialised memory error due to pad bytes
...
(This used to be ctdb commit aea9b0c8d467fe19815c046969e9c1049a3a20ac)
2007-10-22 15:13:08 +10:00
Andrew Tridgell
f09537e7f1
prevent a double free
...
(This used to be ctdb commit 5a1b923abb36c6deb99ae178fdd54f12235dc309)
2007-10-22 14:07:35 +10:00
Ronnie Sahlberg
1d6a74f943
when shutting down, we should stop monitoring
...
(This used to be ctdb commit 325683ef8f326f0565a827ff2c493adcab6e0d64)
2007-10-22 12:34:51 +10:00
Ronnie Sahlberg
4a97876fb7
when we are shutting down, we should first shut down the recovery daemon
...
(This used to be ctdb commit 39ade6b329adcd3234124d6a8daaa6181abf739b)
2007-10-22 12:34:08 +10:00
Andrew Tridgell
3507664562
merge from ronnie
...
(This used to be ctdb commit b47fdc1fc86431c9159b595047faa76ba31f6829)
2007-10-22 10:26:25 +10:00
Ronnie Sahlberg
f022df1d40
dont set parameters in statd-callout if they should be set they
...
bshould be set from 10.interfaces
(This used to be ctdb commit 0c7c2dae0a976922de58793d576855bc37cd38e1)
2007-10-22 10:18:38 +10:00
Ronnie Sahlberg
caad5dc38d
dont set some of the sysctl variables in statd-callout. these are
...
mainly useful for avoiding ack-storms when doing very rapid
failover/failback during testing but should not be required in
real-world.
this gets rid of a lof of annoying messages from the messages file
(This used to be ctdb commit 50d289dcce2caa7c7be9b6faa3b38b69c2237038)
2007-10-21 06:42:33 +10:00
Ronnie Sahlberg
34f06575f8
merge from tridge
...
(This used to be ctdb commit a45cfb29d9a0babccddc6aa26e71c00524da1d97)
2007-10-19 15:19:25 +10:00
Andrew Tridgell
1a8338e443
increase release number
...
(This used to be ctdb commit 747ff96f1d93c52ba7548d0540266b0277d88ac1)
2007-10-19 12:22:24 +10:00
Ronnie Sahlberg
600264ac02
dont close the file, just set the fd to -1
...
(This used to be ctdb commit 04b26aa09e69b3c9fa1db245b5123c3cc02db8af)
2007-10-19 11:03:12 +10:00
Andrew Tridgell
f47f758fe8
merge from ronnie
...
(This used to be ctdb commit d444fdc7782496abe4b27003b647ac49fb52e6be)
2007-10-19 09:39:07 +10:00
Andrew Tridgell
623e216dcf
remove a incorrectly added file
...
(This used to be ctdb commit ff01a32db81b6c04d42634f5660181c270988264)
2007-10-19 09:30:55 +10:00
Ronnie Sahlberg
e81e008a36
add missing ) in the IB transport (which i dont compile for)
...
(This used to be ctdb commit 7f7a184bae87d46bd589d11068b6443b007366b4)
2007-10-19 09:05:37 +10:00
Ronnie Sahlberg
fe7b5b4d85
add a stub restart method for IB
...
(This used to be ctdb commit d318504ad5a49dbdfa307be39ae88df839e6308d)
2007-10-19 09:04:52 +10:00
Ronnie Sahlberg
d1ba047b7f
add a new transport method so that when a node is marked as dead, we
...
shut down and restart the transport
othervise, if we use the tcp transport the tcp connection might try to
retransmit the queued data during the time the node is unavailable.
this together with the exponential backoff for tcp means that the tcp
connection quickly reaches the maximum backoff rto which is often 60 or
120 seconds. this would mean that it could take up to 60/120 seconds
before the tcp layer detects that the connection is dead and it has to
be reestablished.
(This used to be ctdb commit 0256db470879ce556b0f00070f7ebeaf37e529ab)
2007-10-19 08:58:30 +10:00
Ronnie Sahlberg
755511d28d
set the flags explicitely isnstead of masking them in
...
(This used to be ctdb commit 27a5f9dead44890683f9dbc4f07cda11264aa03b)
2007-10-18 16:54:00 +10:00
Andrew Tridgell
b814462c38
added some debug lines to help track down a problem
...
(This used to be ctdb commit 2ca31e9de179f76e392a26cc8305e2473357c760)
2007-10-18 16:27:36 +10:00
Ronnie Sahlberg
06cdb1ff31
merge from tridge
...
(This used to be ctdb commit ad03e63906270c9c076ffdb1f62f912bb414ea10)
2007-10-18 15:53:50 +10:00
Andrew Tridgell
5e3d5b1314
merge from ronnie
...
(This used to be ctdb commit a6b094fdede0ae850e87877fad0b9dd1f3a26869)
2007-10-18 15:51:15 +10:00
Andrew Tridgell
d939a2901b
merge from ronnie
...
(This used to be ctdb commit 75d4b386293e186a6bb8532515585ab72670d663)
2007-10-18 15:44:02 +10:00
Ronnie Sahlberg
e4ec6e9d6b
flush the route cache when we have added the single public ip to the
...
node
cleanup and remove everything when we do a shutdown event
(This used to be ctdb commit 221432f45073bc7624803058c8bbf18838e7ceeb)
2007-10-18 14:13:48 +10:00