samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-02-04 17:47:26 +03:00

Author	SHA1	Message	Date
Martin Schwenke	96b3517356	Test suite: better debug info when the cluster is unexpectedly unhealthy. cluster_is_healthy() is now run locally in tests and internally causes _cluster_is_healthy() to be run on node 0. When it detects that the cluster is unhealthy and $ctdb_test_restart_scheduled is not true, debug information is printed. This replaces the previous use of $CTDB_TEST_CLEANING_UP. To avoid spurious debug on expected restarts, added scheduled restarts to several tests. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ee7caae3a55a64fb50cd28fa2fd4663c5dd83b4f)	2009-07-06 17:52:11 +10:00
Martin Schwenke	d90d54ea3e	Make ctdbd restarts in tests more reliable. This works around potential race conditions in the init script where the restart operation is not necessarily reliable. It just wraps the actual restart in a loop and tries for a successful restart up to 5 times. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 1cac8a0ad429f29d1508158c7f7c42a2f1a22945)	2009-07-06 16:40:31 +10:00
Martin Schwenke	35f998346e	When testing make the time taken for some operations more obvious. If wait_until() does not timeout, print the time taken for the command to succeed. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit bdb856ee22816ae1f6b8d15856555f488054f489)	2009-07-06 16:39:08 +10:00
Martin Schwenke	5d67aa2332	New tests for different aspects of failover. 3 separate tests: * Check that gratuitous ARPs are received and take effect. * Check that ping still works after failover. * Check, via SSH, that the hostname changes after failover. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 92011cc05bbdb517ec6a4573f5cb9f6f21c3059e)	2009-07-03 20:55:02 +10:00
Martin Schwenke	613341d150	Updates to TCP tickle tests and supporting functions. * Removed a race from tpcdump_start(). It seems impossible to tell when tcpdump is actually ready to capture packets. So this function now generates some dummy ping packets and waits until it sees them in the output file. * tcpdump_start() sets $tcpdump_filter. This is the default filter for tcpdump_wait() and tcpdump_show(), but other filters may be passed to those functions. * New functions tcptickle_sniff_start() and tcptickle_sniff_wait_show() handle capturing TCP tickle packets. These are used by complex/31_nfs_tickle.sh and complex/32_cifs_tickle.sh. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 8e2a89935a969340bfead8ed040d74703947cb81)	2009-07-03 20:44:55 +10:00
Martin Schwenke	7b3abce684	Add an extra ctdb recovery to test function restart_ctdb(). There are still very rare cases where IPs haven't been reallocated before the beginning of the next test, so this adds a sleep and an extra call to "ctdb recover" to restart_ctdb(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c2bdb77d91761c003e2f0e6918a27c54150f6030)	2009-07-03 18:01:29 +10:00
Martin Schwenke	dba6c1ca77	Fix the run_tests script so that the number of columns is never 0. Sometimes "stty size" reports 0, for example when running in a shell under Emacs. In this case, we just change it to 80. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e309cb3f95efcf6cff7d7c19713d7b161a138383)	2009-07-03 17:58:38 +10:00
Martin Schwenke	0d425a07d4	Separate test cleanup code in output and clean up ctdb restart code. * ctdb_restart_when_done() now schedules a restart by setting an explicit variable that is respected in ctdb_test_exit(), rather than adding a restart to $ctdb_test_exit_hook. This means that restarts are all done in one place. * ctdb_test_exit() turns off "set -e" to make sure that all cleanup happens. * ctdb_test_exit() now prints a clear message indicating where the test ends and the cleanup begins. This message also includes the return code of the test. * Add debug in cluster_is_healthy to try to capture information about unexpected unhealthiness when a test starts. * Simplify simple/07_ctdb_process_exists.sh so that the exit code is generated more obviously. * Remove redundant calls to ctdb_test_exit at the end of tests, since they're done automatically via a trap. Also remove any preceding warnings of restarts or final hints about test success/failure. * Allow multi-digit debug levels in simple/12_ctdb_getdebug.sh and simple/13_ctdb_setdebug.sh. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b6fa044a1364cbb3008085041453ee4885f7ced1)	2009-07-03 17:40:16 +10:00
Martin Schwenke	4697829e7c	Fix minor onnode bugs relating to local daemons. Commit a0f5148ac749758e2dfbd6099e829c5bf1d900e6 caused a subtle regression. Due to the subtlety, this description is much longer than the 1 line patch that fixes it! The regression, where a process that invokes onnode is unexpectedly blocked, is only apparent if the following conditions are met: 1. $CTDB_NODES_SOCKETS is set; 2. The command passed to onnode attempts to background a process; and 3. onnode is run in certain types of subshell (e.g. foo=$(onnode ...)). In particular, when testing against local daemons (i.e. condition (1) is met), tests/simple/07_ctdb_process_exists.sh would fail (because it does both (2), (3)). The problem is caused by the use of file descriptor 3 in the code that allows separate filtering of stdout and stderr. A backgrounded process will have this descriptor open and the $(...) construct appears to wait for all file descriptors to be closed. This only happens with local daemons because SSH is replaced by a shell and file descriptor 3 leaks into that shell. It does not occur when SSH is used because the file descriptor does not leak into the remote shell where the process is backgrounded. The fix is simply to redirect file descriptor 3 to /dev/null in the fakessh function, which is used when $CTDB_NODES_SOCKETS is set. Also fixed is another minor bug when the -o option and $CTDB_NODES_SOCKETS are used in combination. The code uses the node name as a suffix for the output filename(s). Usually this is an IP address. However, when $CTDB_NODES_SOCKETS is in use the node name is the socket name, which might be a path several directories deep. Each output file is created via a simple redirection and this would fail if unexpected directories appear in the filename. 3 possible fixes were considered: 1. Replace all '/'s in the node name by '_'s. Nice and simple. 2. Use the basename of the node name. However, sockets may be in different directories but have the same basename. 3. Create all required directories before redirecting. This is a little more complex and probably doesn't meet the user's expectations. Option (1) is implemented here. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c97d56d93d9c1007a4e85affb19ed0c2d0e11b6d)	2009-06-19 12:12:39 +10:00
Martin Schwenke	62871fbcd5	Clean up handling the of CTDB restarts in testcases. Glitches during restarts of the CTDB cluster have been causing some tests to fail. This is because restarts are initiated in the body of many tests. This adds a simple function ctdb_restart_when_done, which schedules a restart using an existing hook in the test exit code. This function is now used in tests that need to restart CTDB. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d440e83bb4f0c19c085915d0f0e87cc0dabbc569)	2009-06-19 11:40:09 +10:00
Martin Schwenke	1f3a602b88	Merge commit 'origin/master' (This used to be ctdb commit 8ddd5165f573fc6beaae589b86a6afa4bc17f32a)	2009-06-16 12:56:55 +10:00
Martin Schwenke	9467e39c82	Merge branch 'new_tests' (This used to be ctdb commit 10531b50e2d306a5e62b8d488a1acc9e75b0ad4b)	2009-06-16 12:52:10 +10:00
Martin Schwenke	ffff61c13b	New tests for NFS and CIFS tickles. New tests/complex/ subdirectory contains 2 new tests to ensure that NFS and CIFS connections are tracked by CTDB and that tickle resets are sent when a node is disabled. Changes to ctdb_test_functions.bash to support these tests. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 31cc46eb157ca1301312f14879e4fb4da7d81088)	2009-06-16 12:47:59 +10:00
Martin Schwenke	22d4b40b6d	Increase threshold in 51_ctdb_bench from 2% to 5%. The threshold for the difference in the number messages sent in either direction around the ring of nodes was set to 2%. Something environmental is causing this different to sometimes be as high as 3%. We're confident it isn't a CTDB issue so we're increasing the threshold to 5%. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d5ca4ab325fce1f81361a4d79810cb543979ce57)	2009-06-16 12:42:29 +10:00
Martin Schwenke	ad3c89095e	Make 51_ctdb_bench.sh more tolerant. Limit the allowable difference in message counts in either direction around the ring to 5% (up from 2%). There is something environmental making this blow out to 3% very occasionally when there's no obvious problem with ctdb. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d6e6909ac629212b3028e13b958e1a17c64bee8c)	2009-06-10 16:15:09 +10:00
Ronnie Sahlberg	d1c40424f6	When we ban a node, only drop the IPs on the node being banned, not on every node (This used to be ctdb commit 46e8c3737e6ff54fc80de8e962e922924c27bc35)	2009-06-10 10:35:20 +10:00
Ronnie Sahlberg	2bb687c4cd	remove unused variable (This used to be ctdb commit 2a52336ec021dfe8d56ba72726feb7b2dbd41f68)	2009-06-09 10:58:46 +10:00
Ronnie Sahlberg	ac931b1371	dont require particular values for NoIPFailback and DeterministicIPs when using ctdb moveip (This used to be ctdb commit d350c631850377c09968d2978ef57d2bd0d50116)	2009-06-09 10:57:46 +10:00
Ronnie Sahlberg	f135684766	improve ctdb moveip so that it does not always trigger a recovery. (This used to be ctdb commit 0ca28d7336463ecd2ff65620d8dbcbb496991531)	2009-06-09 10:56:50 +10:00
Ronnie Sahlberg	f6ccf96898	try avoiding to cause a recovery when deleting a public ip from a node (This used to be ctdb commit 6318ea13464e2fe630084c40802d8e697c2cb999)	2009-06-05 17:57:14 +10:00
Ronnie Sahlberg	b046f5e3aa	when adding an ip, try manually adding and takingover the ip instead of triggering a full recovery to do the same thing (This used to be ctdb commit 4d5d22e64270cfb31be6acd71f4f97ec43df5b2c)	2009-06-05 17:00:47 +10:00
Ronnie Sahlberg	79eef7f2b5	dont list DELETED nodes in the ctdb listnodes output (This used to be ctdb commit 7eb137aa4c24c69bd93b98fb3c7108e5f3288ebd)	2009-06-04 13:25:58 +10:00
Ronnie Sahlberg	f691b96d84	make it possible to run 'ctdb listnodes' also if the daemon is not running. in this case, read the nodes file directly instead of asking the local daemon for the list. add an option -Y to provide machinereadable output to listnodes (This used to be ctdb commit 4a55cacc4f5526abd2124460b669e633deeda408)	2009-06-04 13:21:25 +10:00
Ronnie Sahlberg	85d67197fe	From William Jojo <w.jojo[AT]hvcc.edu> AIX dont have getopt.h by default. Dont try including this file when building on AIX (This used to be ctdb commit 06b33a826e71e1dd2f9e02ad614be55535d42045)	2009-06-04 09:41:05 +10:00
Martin Schwenke	0219d12fd4	Merge branch 'init_rewrite' Conflicts: config/ctdb.init Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 92be87b5bfed7882b48f4034c82dfdb031f3afdc)	2009-06-02 16:40:01 +10:00
Martin Schwenke	1c2e7871eb	Merge commit 'origin/master' (This used to be ctdb commit 135b72828fc76856fa8f6d7f9c820120de05596b)	2009-06-02 16:29:25 +10:00
Martin Schwenke	b1b1cbb274	Initscript cleanups. * Move building of CTDB_OPTIONS to new function build_ctdb_options() and have it use a helper function for readability. * New functions check_persistent_databases() and set_ctdb_variables(). * Remove valgrind-specific stop code, since the general pkill should kill ctdbd when running under valgrind. * Remove some bash-isms (e.g. >& /dev/null) since the script is /bin/sh. * Make indentation consistent. * Minor clean-ups. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 951dbcb29fd53cf51a08958efe185db4954d24f3)	2009-06-02 16:07:08 +10:00
Martin Schwenke	1f9ef465e3	Fix minor problem in previous initscript commit. The valgrind start case should not use daemon, since this is specific to Red Hat. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 1ea6af7007fe3b5a48d48440a0924c71d7a6000a)	2009-06-02 15:54:04 +10:00
Ronnie Sahlberg	e2810c0cb4	new version 1.0.84 (This used to be ctdb commit af1b3de978089a9819716b33c13c941b5558cb17)	2009-06-02 15:05:41 +10:00
Ronnie Sahlberg	45aa542064	teach ONNODE about deleted nodes (This used to be ctdb commit 03d304e72a5839dc8d8d2e2312b346c21dca5774)	2009-06-02 15:03:44 +10:00
Ronnie Sahlberg	f49c71fa4f	new version 1.0.83 (This used to be ctdb commit f236fa289f3115b1f4eb108eb668392dc520f61a)	2009-06-02 13:13:03 +10:00
Ronnie Sahlberg	676f7e0206	idocument how to remove a node from an existing cluster using 'ctdb reloadnodes' (This used to be ctdb commit e3d9722e332f132bd47dc41621d4e1d2b5c9c62a)	2009-06-02 12:43:11 +10:00
Martin Schwenke	4a09cc639b	Initscript fixes, mostly for "stop" action. Use a local variable $ctdbd so that we always run ctdbd from the the same place and so that we know what to kill. This variable respects the $CTDBD environment variable, which may be used to specify an alternative location for the daemon. In the important cases use "pkill -0 -f" to check if ctdbd is running. Also, remove the special case for killing ctdbd when running under valgrind. The regular case will handle this just fine. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ee5d49324155e3e51371f6f8e5ed9eef4179f08d)	2009-06-02 10:01:50 +10:00
Ronnie Sahlberg	1dee7a2401	hide all DELETED nodes from the ctdb command output (This used to be ctdb commit 91fdfee371d6be83af60cd38ac34afb295b9987a)	2009-06-01 15:43:30 +10:00
Ronnie Sahlberg	5371e3a793	lower the loglevel when we long that we skip an evenscript because it is not executable (This used to be ctdb commit c265df3c7950aab51b8b6ef17040229b97345c35)	2009-06-01 15:29:36 +10:00
Ronnie Sahlberg	6c0c3577f8	dont try to queue packets for sending to (recently) deleted nodes since these nodes do not have a queue. (This used to be ctdb commit 1b7c88ae7643f9bcc52b1d33095f97de88fc2316)	2009-06-01 14:56:19 +10:00
Ronnie Sahlberg	8a0880c843	when building the initial vnnmap, make sure to skip any deleted nodes (This used to be ctdb commit 0cd66c744cd9533ce8d4c4374bcee3bf49b66dae)	2009-06-01 14:44:15 +10:00
Ronnie Sahlberg	dc5e4906cc	use num_nodes and the nodes array instead of walking the vnnmap when counting the number of active nodes (This used to be ctdb commit df20cd9b05ad9ca72e32ccc42354eafc12b68c04)	2009-06-01 14:39:34 +10:00
Ronnie Sahlberg	e6170b5389	add a new node state : DELETED. This is used to mark nodes as being DELETED internally in ctdb so that nodes are not renumbered if / when they are removed from the nodes file. This is used to be able to do "ctdb reloadnodes" at runtime without causing nodes to be renumbered. To do this, instead of deleting a node from the nodes file, just comment it out like 1.0.0.1 #1.0.0.2 1.0.0.3 After removing 1.0.0.2 from the cluster, the remaining nodes retain their pnn's from prior to the deletion, namely 0 and 2 Any line in the nodes file that is commented out represents a DELETED pnn (This used to be ctdb commit 6a5e4fd7fa391206b463bb4e976502f3ac5bd343)	2009-06-01 14:18:34 +10:00
Ronnie Sahlberg	4259156050	dont remove the socket when the dameon stops. This can race if the service is immediately restarted (This used to be ctdb commit b18356764cd49d934eab901e596bb75c6e3ecdf8)	2009-05-29 18:16:13 +10:00
Ronnie Sahlberg	6feb7a1bf8	New attempt at TDB transaction nesting allow/disallow. Make the default be that transaction is not allowed and any attempt to create a nested transaction will fail with TDB_ERR_NESTING. If an application can cope with transaction nesting and the implicit semantics of tdb_transaction_commit(), it can enable transaction nesting by using the TDB_ALLOW_NESTING flag. (This used to be ctdb commit 3e49e41c21eb8c53084aa8cc7fd3557bdd8eb7b6)	2009-05-25 17:04:42 +10:00
Ronnie Sahlberg	96340bd166	Revert "we only need to have transaction nesting disabled when we start the new transaction for the recovery" This reverts commit bf8dae63d10498e6b6179bbacdd72f1ff0fc60be. (This used to be ctdb commit 87292029cb444ffab130ff7dae47a629c2d15787)	2009-05-25 16:55:27 +10:00
Ronnie Sahlberg	270907faec	Revert "set the TDB_NO_NESTING flag for the tdb before we start a transaction from within recovery" This reverts commit 1b2029dbb055ff07367ebc1f307f5241320227b2. (This used to be ctdb commit 9762a3408f10409b629637d237ec513a825a6059)	2009-05-25 16:55:02 +10:00
Ronnie Sahlberg	c429ca114d	Revert "add TDB_NO_NESTING. When this flag is set tdb will not allow any nested transactions and tdb_transaction_start() will implicitely _cancel() any pending transactions before starting any new ones." This reverts commit 459e4ee135bd1cd24c15e5325906eb4ecfd550ec. (This used to be ctdb commit f1c6f7dd47bb1081781c0a0d567a92bbbc0aa5d5)	2009-05-25 16:54:25 +10:00
Ronnie Sahlberg	caf0e863a4	remove the obsolete ipmux component. this is replaced by LVS since a long time (This used to be ctdb commit dca41ec04788922ce5f4c52d346872b3e35f8cbb)	2009-05-25 12:33:52 +10:00
Ronnie Sahlberg	7b163bca18	fix the git path to the repository (This used to be ctdb commit b0c32a96f4176747ca772be664888f5c3c483b98)	2009-05-25 12:15:13 +10:00
Ronnie Sahlberg	e85fb3d9c5	install the 31.clamd script as 644 by default (This used to be ctdb commit e57c47b75fa501223c57040eac73392b42ae549d)	2009-05-25 12:11:07 +10:00
Ronnie Sahlberg	f62b433946	add 31.clamd to the install and the rpm (This used to be ctdb commit bfc6ac07f8b7b326e75d8c9bf73051a440ee0011)	2009-05-25 12:11:01 +10:00
Ronnie Sahlberg	e999ade7bb	From Flavio Carmo Junior <carmo.flavio@gmail.com> Add an eventscript to manage ClamAV (This used to be ctdb commit bb4ef6c4d2bc3578bdf4432517e98f85ec94e3b6)	2009-05-25 12:10:29 +10:00
Ronnie Sahlberg	691379b13d	From Flavio Carmo Junior <carmo.flavio@gmail.com> (with modifications) Add a webpage about CLAMAV support in CTDB (This used to be ctdb commit 5fc14f98902ae98abed35eaab3b3495226dcac38)	2009-05-25 12:08:50 +10:00

1 2 3 4 5 ...

2208 Commits