samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-01-26 10:04:02 +03:00

Author	SHA1	Message	Date
Ronnie Sahlberg	a8db1adcd6	add a command to write a record to a persistent database "ctdb pstore <db> <key> <file containing possibly binary data>" (This used to be ctdb commit 14184ab7c80a3ef16c54b4ab168fd635b7add445)	2010-08-24 14:00:18 +10:00
Ronnie Sahlberg	4da818504a	get rid of two compiler warnings (This used to be ctdb commit 0865f0e6ef671396aa862f6a79a48a4891d72122)	2010-08-24 14:00:10 +10:00
Ronnie Sahlberg	401732a56b	Add a command "ctdb pfetch <db> <record>" to read a record from a persistent database. (This used to be ctdb commit 3bef831b96ce8b40457ed4de527f0d62fa6a5b00)	2010-08-24 14:00:02 +10:00
Ronnie Sahlberg	ccdb91a169	move the directives to build the devel file to the end of the specfile so that the dependencies are right or else the dependencies all end up in the devel package and not the main ctdb package (This used to be ctdb commit 6e4347eb8e62c28987820f6e58626271c900b011)	2010-08-23 16:00:19 +10:00
Ronnie Sahlberg	e040a966af	Don't set next_interval to 0. This can cause ctdbd to spin at 100% in the eventsystem, creating a timed event that will immediately trigger again and again. On uniprocessors this causes the eventscript we are actually waiting for to basically become cpu starved and never complete. (This used to be ctdb commit 92c8408fba957a8ded13f7e285da290502735234)	2010-08-20 15:00:45 +10:00
Ronnie Sahlberg	1ef66379d7	ctdb ip is very busy. revert the default case back to only showing the ip and node and only display the extra info if -v verbose output is requested (This used to be ctdb commit 6488651aa7e105c57324f4a300760a010d098fbb)	2010-08-20 11:38:34 +10:00
Ronnie Sahlberg	08a5b0c7c5	add a new commandline flag -v to enable verbose output (This used to be ctdb commit 96dd9f40f9464c3d9de98f1323568724a1e31dc9)	2010-08-20 11:28:24 +10:00
Ronnie Sahlberg	388d18cc93	make it possible to "ctdb gettickle" to only list tickles for a certain port. Default is to continue to show all tickles, but if a second argument is given, only tickles for that port will be shown. (This used to be ctdb commit 5b985eb2cbbb92bf6ccfcacd633d793bcd4e3ec1)	2010-08-20 11:25:12 +10:00
Ronnie Sahlberg	7229922d97	Don't use the deprecated talloc_append_string(). Use talloc_strdup_append() instead (This used to be ctdb commit e41581347af5ef26d429d38ed48fa46244f0dbfc)	2010-08-20 11:03:17 +10:00
Ronnie Sahlberg	32a2297b20	We need the deprecated talloc_append_string() for now so set the TALLOC_DEPRECATED symbol to allow use of this call from ctdb_client.c (This used to be ctdb commit 3afa5d945a56952a7f211af068d671945de960e5)	2010-08-19 14:48:19 +10:00
Ronnie Sahlberg	2e8aac6689	Merge commit 'rusty/ports-from-1.0.112' into foo (This used to be ctdb commit 13e58d92f5f1723e850a82ae030d0ca57e89b1ee)	2010-08-19 13:17:56 +10:00
Ronnie Sahlberg	4c05f1900c	Merge commit 'rusty/vacuum-fix-master' (This used to be ctdb commit dc301b324d2c14a2425a965c076113c4fe97903e)	2010-08-19 13:16:35 +10:00
Ronnie Sahlberg	729f1ddea0	On RHEL, "service nfs stop;service nfs start" and "service nfs restart" sometimes (very rarely) fails to restart the service. Add a function to restart NFSd on SLES and RHEL-like systems. If we detect the system is unhealthy due to kNFSd not running, try to restart the service again "service nfs restart" and hope for the best. CQ1019372 (This used to be ctdb commit 25c4ce7e919f13226219f036bcffd2be76b2f06c)	2010-08-19 07:18:22 +10:00
Ronnie Sahlberg	31126b2ef0	Add machinereadable output for the "ctdb gettickles <ip>" command (This used to be ctdb commit c3eb53509331045074579468d94ed7e31101bba4)	2010-08-18 14:37:16 +10:00
Ronnie Sahlberg	5aa5f3e7bf	Remove the structure ctdb_control_tcp_vnn since this is identical to the structure ctdb_tcp_connection. Add a new "ctdb deltickle" command to delete tickles from the database. This can ONLY be used for tickles created by "ctdb addtickle". Push any "addtickle/deltickle" updates to other nodes every TickleUpdateInterval seconds. (This used to be ctdb commit acded034e2f0dcae4c2c9e54e16a001caf23caec)	2010-08-18 12:36:03 +10:00
Rusty Russell	9fbb191b78	logging: give a unique logging name to each forked child. This means we can distinguish which child is logging, esp. via syslog where we have no pid. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 68b3761a0874429b90731741f0531f76dcfbb081)	2010-08-18 11:46:32 +09:30
Rusty Russell	1a009aff73	takeover: prevent crash by avoiding free in traverse on RST timeout After 5 attempts to send a RST to a client without any response, we free "con"; this is done during a traverse. This frees the node we are walking through (the node is made a child of "con" down in rb_tree.c's trbt_create_node() (Valgrind would catch this, as Martin confirmed). So, we create a temporary parent and reparent onto that; then we free that parent after the traverse, thus deleting the unwanted nodes. CQ:S1019041 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 08f7f85477610a4916c1ec866aa467b28f1bbec3)	2010-08-18 11:40:17 +09:30
Martin Schwenke	6ce1501aa1	Move NAT gateway firewall rules to recovered\|updatenatgw events. The existing code wasn't working as designed in the start event. It should work here. BZ: 62613 Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit aeb70c7e7822854eb87873a5c7783e27e6e72318)	2010-08-18 11:40:07 +09:30
Rusty Russell	5f2d43157d	vacuum: disabling vacuuming during a freeze We shouldn't even think about vacuuming when we've frozen the database (which is earlier than when we set CTDB_RECOVERY_ACTIVE) CQ:S1018154 & S1018349 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit d8df6835a931082af232c4b94f1dede6f16169f9)	2010-08-18 11:01:52 +09:30
Rusty Russell	0b07f91d36	vacuum: fix crash on vacuum abort Martin Schwenke discovered that 517f05e42f17766b1e8db8f1f4789cbad968e304 ("freeze: abort vacuuming when we're going to freeze.") used ctdb_db for a logging message which is in fact uninitialized, causing a crash (even if it wasn't actually logged). Initialize it properly. Also fix incorrect format in another logging message introduced in that same change. CQ:S1019093 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 8e518950ba281502318d6300f7a5ec6cdf6b5674)	2010-08-18 11:00:11 +09:30
Rusty Russell	af55c910a4	freeze: abort vacuuming when we're going to freeze. There are some reports of freeze timeouts, and it looks like vacuuming might be the culprit. So we add code to tell them to abort when a freeze is going on. (This is based on the 1.0.112 branch version 517f05e42f, but far simpler since tdb is now robust against processes being killed during transaction commit) CQ:S1018154 & S1018349 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit f5d7dc679501e607c2c83a248a89d3cada9df146)	2010-08-18 10:54:28 +09:30
Ronnie Sahlberg	44ff992806	Add a new "ctdb addtickle" command to manually add tickles to ctdbd This can be used to set ctdbd up to generate a tickle for non-samba services. (samba contains code to set tickles up automatically) (This used to be ctdb commit 7ef2cddad5326fdcc26138906948342039829495)	2010-08-18 11:09:32 +10:00
Ronnie Sahlberg	0e5be63bca	update the example for the new signature of ctdb_set_message_handler_send() (This used to be ctdb commit 6aabe52d5ba629291aa630bc96a2b74dcecc5209)	2010-08-18 10:18:35 +10:00
Ronnie Sahlberg	e8ffb0d8a4	We use eventloop nesting in a couple of places, notably the sync parts of the recovery daemon. Initialize all event contexts to allow nesting (This used to be ctdb commit 5bf6bd5e7f33aabbeb7b9707716ef99cf471e590)	2010-08-18 10:11:59 +10:00
Ronnie Sahlberg	ddf3c621c1	Merge commit 'rusty/libctdb-new' into foo (This used to be ctdb commit 1566d2d23ab698896b3b6a76974a5c7452db4a62)	2010-08-18 09:53:52 +10:00
Rusty Russell	f93440c4b7	event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version 7f29f817fa939ef1bbb740584f09e76e2ecd5b06. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726)	2010-08-18 09:16:31 +09:30
Rusty Russell	532e4a7077	talloc: update to 2.0.3 version from SAMBA This is based on SAMBA as at revision 2de63aa2801a907905b3e05557074af5b896d486. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit cecd93be0a0aab868430dd43f8276bfb4e35f02e)	2010-08-18 09:11:58 +09:30
Volker Lendecke	a79168f587	Correctly set docdir (This used to be ctdb commit a69916d0687309766b0014dc9cee6a966aaa89da)	2010-08-16 11:28:05 +10:00
Rusty Russell	c27094742b	tdb: workaround starvation problem in locking entire database. (Imported from SAMBA 11ab43084b10cf53b530cdc3a6036c898b79ca38) We saw tdb_lockall() take 71 seconds under heavy load; this is because Linux (at least) doesn't prevent new small locks being obtained while we're waiting for a big lock. The workaround is to do divide and conquer using non-blocking chainlocks: if we get down to a single chain we block. Using a simple test program where children did "hold lock for 100ms, sleep for 1 second" the time to do tdb_lockall() dropped significantly. There are ln(hashsize) locks taken in the contended case, but that's slow anyway. More analysis is given in my blog at http://rusty.ozlabs.org/?p=120 This may also help transactions, though in that case it's the initial read lock which uses this gradual locking routine; the update-to-write-lock code is separate and still tries to update in one go. Even though ABI doesn't change, minor version bumped so behavior change can be easily detected. CQ:S1018154 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9ec0009443a0ac4187ce5212a5143689daa58a02)	2010-08-16 10:22:21 +09:30
Rusty Russell	546eff9c93	tdb: Fix tdb_check() to work with read-only tdb databases. (Import from SAMBA bc1c82ea137e1bf6cb55139a666c56ebb2226b23) The function tdb_lockall() uses F_WRLCK internally, which doesn't work on a fd opened with O_RDONLY. Use tdb_lockall_read() instead. (This used to be ctdb commit a5db1122ec48d7e7384066848457c850c1a6cf3c)	2010-08-16 10:20:59 +09:30
Rusty Russell	fa2a32d5ef	tdb: remove unused variable in tdb_new_database(). (Imported from SAMBA 2eab1d7fdcb54f9ec27431ca4858eb64cb1bd835) (This used to be ctdb commit 52a87e608d0406aee9df99f7ac3ce16e834b520b)	2010-08-16 10:20:53 +09:30
Rusty Russell	55010cab63	tdb: fix short write logic in tdb_new_database Commit 207a213c/24fed55d purported to fix the problem of signals during tdb_new_database (which could cause a spurious short write, hence a failure). However, the code is wrong: newdb+written is not correct. Fix this by introducing a general tdb_write_all() and using it here and in the tracing code. Cc: Stefan Metzmacher <metze@samba.org> Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 27ba0e5a6681063225df7244a85aa304c51c6948)	2010-08-16 10:20:19 +09:30
Ronnie Sahlberg	8b0bbf960b	Create a new command "ctdb sync" that is just an alias for "ctdb ipreallocate" (This used to be ctdb commit eededd592c92c59b435f0046989b2327fcc280b1)	2010-08-10 09:49:55 +10:00
Ronnie Sahlberg	7139faaeac	Update a log message to reflect that this does no longer only happen when trying/failing to ban a node. (This used to be ctdb commit dc6b143c4785449e8c4ef7a46bf16adba750ab56)	2010-08-10 09:48:50 +10:00
Rusty Russell	a65cb6a9ae	libctdb: add synchronous message handling and unregister, with tests. It turns out that we do want a separate private arg for the message handler and the completion callback, so we change that. We also fix the prototypes of the remove_message functions as we implement them. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 332375246eccd95da626f434f6d49dd9458a9787)	2010-08-09 15:41:32 +09:30
Ronnie Sahlberg	f7ead50738	Merge remote branch 'martins/master' (This used to be ctdb commit 9ca09ee9129b787428a2ceac9731b12166dc8718)	2010-08-09 11:35:38 +10:00
Martin Schwenke	0f18859a6c	Add some command-line options to ctdb_diagnostics. In some contexts ctdb_diagnostics generates too many errors when it is run on heterogeneous and machine-configured clusters. In some clusters some nodes are expected to be differently configured and also machine-generated configuration files can have comments containing timestamps. This adds some command-line options that can be used to reduce the number of errors reported: -n <nodes> Comma separated list of nodes to operate on -c Ignore comment lines (starting with '#') in file comparisons -w Ignore whitespace in file comparisons --no-ads Do not use commands that assume an Active Directory Server The -n option simply allows ctdb_diagnostics to operate on a subset of nodes, avoiding file comparisons with and data collection on nodes that are differently configured. For file comparisons, instead of showing each file on the current node and then comparing other nodes to that file, the file from the first (available or requested) node is shown and then other nodes are compared to that. That has resulted in changes in output - that is, ctdb diagnostics no longer prints messages referencing the current node. -c and -w are used to weaken comparisons between configuration files. --no-ads can be used to avoid running ADS-specific commands if a cluster uses LDAP (or other non-ADS) configuration. This also fixes a number of bugs in related code: * A call to onnode was losing the >> NODE ... << lines because they now go to stderr. This was changed in onnode long ago but ctdb_diagnostics was never updated to match. * ctdb_diagnostics was counting lines in /etc/ctdb/nodes to determine what nodes to operate on. For some time the nodes file has supported syntax that makes this invalid. "ctdb listnodes -Y" is now used to list available nodes. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 36c8244a0f68c7c9bbee40982f230e9d14d3c0ea)	2010-08-06 11:10:56 +10:00
Ronnie Sahlberg	4424c115cb	update the docs that ctdb freeze is no more (This used to be ctdb commit 79ef9909dfa0904d789c69eb6b9c80e8908a1100)	2010-08-05 16:35:37 +10:00
Ronnie Sahlberg	043045dcc5	remove the "ctdb freeze" debugging command (This used to be ctdb commit bd005b987255eb65cd3826dce984281ee757daf6)	2010-08-05 16:30:47 +10:00
Martin Schwenke	b50ec65963	Test suite: remove unnecessary verbosity from enable/continue tests. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 69c95b2a42f55b80cd8d91a90ab55166f964163b)	2010-08-05 16:03:21 +10:00
Martin Schwenke	f66b5b46d6	Test suite: Fix typo in continue test. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c2bce140da7c4b118394ee77bb9d0348d27e7e95)	2010-08-05 16:01:23 +10:00
Martin Schwenke	77ad2be488	Test suite: weaken ctdb continue/enable tests for non-deterministic IPs. These tests currently wait for the old IPs to fail back to the test node. This isn't guaranteed with DeterministicIPs disabled. This changes those tests to wait until the test node gets at least 1 IP assigned. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e9b3f5b1b51d541a911a27eb4348b368f28d185e)	2010-08-05 15:58:56 +10:00
Martin Schwenke	b930c885b3	initscript: wait until we can ping ctdbd before setting tunables. Currently we do a "sleep 1" after starting and before running set_ctdb_variables to set the tunables. This is too arbitrary and might fail if the system is heavily loaded. This, for example, could result in some nodes running with DeterministicIPs and some without, in which case a different IP allocation algorithm would run depending on who is the recmaster! This makes the start function wait until "ctdb ping" succeeds (with 10 second timeout) before trying to run set_ctdb_variables. If a timeout occurs then the start function attempts to kill ctdbd before exiting with a failure. It also cleans up the status reporting code for Red Hat and SUSE so that the final status code is reported. Currently there are cases where a correct status is prematurely reported before a failure occurs. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit cdcd05662a30b51caaeeab4ac44138cac2474e0a)	2010-08-05 15:29:40 +10:00
Martin Schwenke	774582c360	Test suite - make the ctdb_fetch test cope with "Reqid wrap!" messages. Recent CTDB versions notice the wrap and print this message. The test needs to cope. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b93b60ec96d02ce4f54921e85a5c5554d1fc0c55)	2010-08-05 13:43:50 +10:00
Martin Schwenke	dff9282917	Test suite: remove thaw/freeze tests. They test debugging commands that no longer operate as expected. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d33fa4d6557aab1938049f194c2de55f2c395bd2)	2010-08-05 11:40:05 +10:00
Martin Schwenke	4817f7e4ba	Test suite - fix addip test. The test currently checks that all existing IPs plus the newly added IP are on the test node after "ctdb addip" is run. With DeterministicIPs enabled, if the new IP is "before" other IPs then the other IPs may be shuffled by the deterministic IPs modulo algorithm. This will happen on the 1st recovery after the move. Sometimes this recovery happens before we get the list of IPs to check and sometimes after, so the test is racy. The fix is to simply check for the presence of the new IP and not worry about the others. This reduces whatever value this test had... but you can't have everything. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 1ef7c8e64c7a39330be09ae4d00b70238133e0b5)	2010-08-04 16:08:12 +10:00
Martin Schwenke	9aa6a99740	Merge remote branch 'martins/master' (This used to be ctdb commit 5d9e4b6ee7d2b5290a74e7be79bdf51a43b72f43)	2010-08-04 16:05:39 +10:00
Martin Schwenke	7edcb89857	Test suite - try to make addip test more reliable and add some debugging. This test is failing in some situations. The "ctdb addip" command works but the IP never appears in the "ctdb ip" output. Try restricting the last octet to be between 101-199. At the moment addresses like 10.0.2.1 are being chosen and these are often the address of the host machine in autocluster configurations... so might cause weirdness. Also add some debugging if checking for the IP address times out. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ae52cb63756bc60de8d32e01bac5d70975a1c7a0)	2010-08-04 13:16:06 +10:00
Martin Schwenke	807567e992	Testing: IP allocation simulation - add option to change odds of a failure. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b2a2e301025d7fbfe5eeaac436693cde6d404490)	2010-08-03 11:51:14 +10:00
Martin Schwenke	4ffb6495ff	Testing: IP allocation simulation - clean up usage message. Group options better and make the language consistent between options. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit bc38c17e4115fae00c89d00537fdcfe621111b37)	2010-08-03 11:41:50 +10:00

3253 Commits