samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-02-04 17:47:26 +03:00

Author	SHA1	Message	Date
Amitay Isaacs	6e650b6ee5	eventscripts: Remove calls to "smbstatus -np" for samba cleanup This is an artifact from older versions of Samba. In the newer versions of Samba, "smbstatus -np" command does not do anything useful, but causes a traverse in CTDB which is expensive and causes CPU utilization to shoot up. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 053b89c6dbce47001505524606889334559d2ec4)	2013-02-11 11:25:49 +11:00
Martin Schwenke	689384a7b4	Logging: Fix breakage when freeing the log ringbuffer Commit a82d3ec12f0fda16d6bfa8442a07595de897c10e broke fetching from the log ringbuffer. The solution there is still generally good: there is no need to keep the ringbuffer in children created by ctdb_fork()... except for those special children that are created to fetch data from the ringbuffer! Introduce a new function ctdb_fork_no_free_ringbuffer() that does everything ctdb_fork() needs to do except free the ringbuffer (i.e. it is the old ctdb_fork() function). The new ctdb_fork() function just calls that function and then frees the ringbuffer in the child. This means all callers of ctdb_fork() have the convenience of having the ringbuffer freed. There are 3 special cases: * Forking the recovery daemon. We want to be able to fetch from the ringbuffer there. * The ringbuffer fetching code. Change the 2 calls in this code (main daemon, recovery daemon) to call ctdb_fork_no_free_ringbuffer() instead. While we're here, clear the log ringbuffer when the recovery deamon is forked, since it will contain a copy of the messages from the main daemon. Note to self: always test... even the most obvious patches... ;-) Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 00db5fa00474f8a83f1aa3b603fd756cc9b49ff4)	2013-02-07 11:26:29 +11:00
Volker Lendecke	140be1e267	Fix a comment typo Signed-off-by: Volker Lendecke <vl@samba.org> Reviewed-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit b940e3a24daa73ca9b2896b7a449240136442b53)	2013-02-06 12:35:10 +01:00
Martin Schwenke	8c9eedbce3	initscript: export CTDB_EXTERNAL_TRACE This means it can be set like any other configuration option in the configuration file, without needing to export it there. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a0ef73e197dc9147f7718e0813fe803ff0b3d54d)	2013-02-05 16:05:13 +11:00
Martin Schwenke	37632efde0	ctdbd: Don't use a fixed length buffer for the hung script command The amount of data to write into the buffer wasn't constrained anywhere... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9b0d56b16775aa16f33bdfdf831256e085fa3339)	2013-02-05 16:05:13 +11:00
Martin Schwenke	e883720461	ctdbd: Complain loudly if CTDB_DEBUG_HUNG_SCRIPT script isn't executable This is quite easy to misconfigure by failing to set the execute bit on the script. Better to complain loudly. This is a debugging facilty rather than core CTDB functionality, so it doesn't need a subtle mechanism to disable it at run-time. To disable the designated script at run-time either edit it to put an "exit 0" at the top or move it aside and symlink to /bin/true. This is implemented by actually removing the code that checks that the file exists and is executable. The output from the shell when the system() function fails is just as useful. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3400b2ed34b6eb9496eb55f1aab6f89d2952060d)	2013-02-05 16:05:13 +11:00
Martin Schwenke	bc5f0a2b65	ctdbd: Remove command-line option --debug-hung-script Use an environment variable instead. This just means that the initscript exports CTDB_DEBUG_HUNG_SCRIPT and the code checks for the environment variable. The justification for this simplification is that more debug options will be arriving soon and we want to handle them consistently without needing to add a command-line option for each. So, the convention will be to use an environment variable for each debug option. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0581f9a84e58764d194f4e04064c2c5b393c348b)	2013-02-05 16:05:13 +11:00
Martin Schwenke	f2428cadd8	ctdbd: Remove debug_hung_script_ctx The only allocation against this context is by ctdb_fork_with_logging(). This memory is freed by ctdb_log_handler() anyway. There should be no memory leak. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 501461cc3e132d4adee9e91b5d4513a26bae2846)	2013-02-05 16:05:13 +11:00
Martin Schwenke	a0c88ec816	ctdbd: Message logged at exit should be different for different processes Some subprocesses print "CTDB daemon shutting down" when they exit and this can be confusing. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f1ffe1112b7e342d7f1228ca816a8e5918f893cf)	2013-02-05 16:03:41 +11:00
Amitay Isaacs	11c75419cd	daemon: Make sure all the traverse children are terminated if traverse times out When traverse times out, callback function is called with key and data set to tdb_null. This is also the way to signal end of traverse. So if the traverse times out, callback function treats it as traverse ended and frees state without calling the destructor. Keep track if the traverse timed out, so callback function can take appropriate action for traverse timeout and traverse end. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 35da9a7c2a0f5e54e61588c3c3455f06ebc66822)	2013-02-05 14:42:19 +11:00
Martin Schwenke	8dc3219e9b	Logging: Free the ringbuffer in child processes created with ctdb_fork() At the moment the log ringbuffer is duplicated in every child process. Althought it is copy-on-write we want to see if it is contributing to out-of-memory situations when there are a lot of children. The ringbuffer isn't accessible from any of the children anyway... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a82d3ec12f0fda16d6bfa8442a07595de897c10e)	2013-02-05 12:40:30 +11:00
Martin Schwenke	f2ba0e8a65	Logging: New function ctdb_log_ringbuffer_free() Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a4f622e85168f59417c11705f1734e0352e1d44a)	2013-02-05 12:40:30 +11:00
Martin Schwenke	35074ad6d5	build: Fix a Makefile.in typo Objects are named *.o ;-) Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 25a20409fb39a94b64c13990c0eba4f75d482ecd)	2013-02-05 12:13:57 +11:00
Martin Schwenke	fe130ca85b	tools/ctdb: Fix a compiler warning Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit d1ec06d30148e6fd344625a2fbf1c22391bd908a)	2013-02-04 11:22:43 +11:00
Amitay Isaacs	385325ad90	recoverd: Fix printing of node flags from local information Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 124e2a471aeda9c900fd898178a30522d7d74221)	2013-01-23 16:56:03 +11:00
Mathieu Parent	264f847631	common: Don't lie on unimplemented gratuitous arp Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit b054193d1d19a8eef998fa690899501f79badb8a)	2013-01-22 18:04:00 +11:00
Mathieu Parent	52a99cbac6	tests: Test portability Curiously test_ctdb_sys_check_iface_exists fails on Linux Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit 109f428aa34f8f4cc0329880d2f4a5593a6cc6f3)	2013-01-22 18:03:47 +11:00
Mathieu Parent	fd8d3cfeba	common: FreeBSD+kFreeBSD: Implement get_process_name (same as in Linux) Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit 258092aaf6b7a9bdc14f0fb35e8bd7f7dc742b3f)	2013-01-22 18:03:44 +11:00
Mathieu Parent	384b9b2a7b	common: Detailed platform-specific FIXME Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit d202b2fdd4fd70172e5e44583627b57a1b7ad2ed)	2013-01-22 18:03:41 +11:00
Mathieu Parent	e6cf53635d	build: Update config.guess 2012-12-30 and config.sub to 2013-01-11 Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit 3c6a9b73364c9543366fa033c778145dc7a152a9)	2013-01-22 18:03:37 +11:00
Mathieu Parent	69afd9abc5	doc: allows to -> allows one to Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit 95fc493a7d4145f976cb3fe928d9e92faec4dd71)	2013-01-22 18:03:35 +11:00
Mathieu Parent	75625cd572	build: Add missing LDFLAGS Original Author: Simon Ruderich <simon@ruderich.org> Signed-off-by: Mathieu Parent <math.parent@gmail.com> (This used to be ctdb commit 506ecd186759675a1cf50a0a05a285fee03fc51e)	2013-01-22 18:03:31 +11:00
Srikrishan Malik	28cbe527d4	Changes for unobtrusive recovery and new method for health check. Unobtrusive recovery: Ganesha will not be restarted on failovers. Ganesha health: Use the counters in /var/lib/nfs/ganesha_local to track progress instead of the null call which can timeout if the server is too busy. Signed-off-by: Srikrishan Malik <srimalik@in.ibm.com> Signed-off-by: Lance Russell <lancerus@us.ibm.com> (This used to be ctdb commit 0e651e9da0f1f3c836b4474612ab13d0ccd272d9)	2013-01-11 17:16:46 +11:00
Amitay Isaacs	96ba396697	recoverd: Create recoverd monitoring timed events off recoverd context This ensures that when shutting down CTDB, all the timed events associated with monitoring recoverd are destroyed and recoverd is not restarted. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7393e2b290f9879ff72d5c5a9ce933034129f0e8)	2013-01-09 16:22:39 +11:00
Amitay Isaacs	4a6fa39ff9	daemon: Protect against double free of callback state while shutting down When CTDB is shut down and monitoring has been stopped, monitor_context gets freed and all the callback states hanging off it. This includes callback state for current_monitor, if the current monitor event has not yet finished. As a result, when the shutdown event is called, current_monitor->callback state is not NULL, but it's actually freed and it's a dangling reference. So before executing callback function and freeing callback state check if ctdb->monitor->monitor_context is not NULL. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7d8546ee4353851f0543d0ca2c4c67cb0cc75aea)	2013-01-09 14:39:23 +11:00
Amitay Isaacs	30299c387f	daemon: On shutdown, destroy timed events that check if recoverd is active When CTDB is shutting down, recovery daemon is stopped, but the event that checks if recovery daemon is still alive is not destroyed. So recovery master is restarted during shutdown if CTDB daemon takes longer to shutdown. There are two processes that check if recovery daemon is working. 1. ctdb_check_recd() - which checks every 30 seconds if the recovery daemon process exists. 2. ctdb_recd_ping_timeout() - which is triggered when recovery daemon fails to ping CTDB daemon. Both the events are periodic and need to be destroyed when shutting down. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 746168df2e691058e601016110fae818c6a265c3)	2013-01-09 13:20:26 +11:00
Amitay Isaacs	cad815164c	tests: Add a test for recovery of persistent databases Ensure that RSN based recovery and __db_sequence_number__ based recovery methods for persistent databases work correctly. They should not cause corruption of the database. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 45d439a1ab093b420c27b1502ef109021833c7af)	2013-01-09 13:18:34 +11:00
Amitay Isaacs	d85e40fbdb	tools/ctdb: Add setdbseqnum command to set __db_sequence_number__ Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit efaac27a9ed52ed0f436c7e194013fd06e8b02b3)	2013-01-09 13:18:34 +11:00
Amitay Isaacs	4ca50c7243	tools/ctdb: Re-factor code to check if db exists given name or id Most of the commands related to database operations can now use the common code (db_exists()) to refer to database with either name or id. In addition to return db_id for db_name, the function returns all the flags set for the database. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ca6e7eccc90f2869c220231666bf284798342bce)	2013-01-09 13:18:33 +11:00
Amitay Isaacs	d8a3ec42c1	tools/ctdb: Add pdelete command to delete a record from persistent database Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit d23adec89b69e7c6f96c8e1417ef4ca4c9edc57e)	2013-01-09 13:18:33 +11:00
Amitay Isaacs	9eeb94c5c0	daemon: Update the comment and remove redundant check in ctdb_start_transport() ctdb_start_transport() is called just before "setup" event, when CTDB is ready to process the requests. "startup" event happens much later after a successful recovery. Transport method ctdb->methods is successfully initialized before ctdb_start_transport() is called. No need to check again. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 9a70a4d23d00f6cb996c061ba3dfb7c47b4f6a4f)	2013-01-09 13:18:33 +11:00
Martin Schwenke	aca9299669	eventscripts: Fail the setup event if CTDB does not become ready Currently it silently continues without attempting to set tunables. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 735ec99b99c7bb579851ce8293011aaf1dcc552a)	2013-01-09 12:45:59 +11:00
Martin Schwenke	4f622fe9fb	scripts: Make script_log() use supplied message, stop logger from hanging When using syslog any provided message arguments are ignored and not passed to logger. This means that logger blocks waiting on stdin. That's bad. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 50abf597cefe6f8ea2a2ff7694bf84641344a9b1)	2013-01-08 15:18:47 +11:00
Martin Schwenke	095fac9491	scripts: Rework ctdb-crash-cleanup.sh so that it uses existing functions This improves maintainability. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e2aaa64925cca359c71520e01a18fc9461b0da4d)	2013-01-08 15:18:47 +11:00
Martin Schwenke	d801b02681	scripts: Make drop_all_public_ips() more robust Incorporate some of the logic from ctdb-crash-cleanup.sh that ensures IPs are deleted even if they have the wrong netmask or are on the wrong interface. Factoring out some of the code will allow it to be used elsewhere. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 03356fd5ae7a3ac35fde0289cbea7c71ecf07367)	2013-01-08 15:18:47 +11:00
Martin Schwenke	663858e5de	ctdbd: Default value for debug_hung_script should use ETCDIR That is, it should use whatever was specified in ./configure and should not hardcode /etc. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 13e5e609b262847b607e7af7e0685f44e7cb8e36)	2013-01-08 15:18:47 +11:00
Martin Schwenke	4157efdcbb	scripts: debug-hung-script.sh doesn't need functions/loadconfig Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 8507303b525d20c74e8ec4e7c4f5f275945cd3b6)	2013-01-08 15:18:47 +11:00
Martin Schwenke	f5226c9a75	scripts: statd-callout should calculate CTDB_BASE if it is not set Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 376015ba5ad6b7703ae9949a1d40a0c72dfaba0c)	2013-01-08 15:18:46 +11:00
Martin Schwenke	297b98d5b6	eventscripts: Each script should set CTDB_BASE if it is not set This makes it easier to run the scripts externally. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 740ea8ea5084149c8b552a01ee1c98c558b12384)	2013-01-08 15:18:46 +11:00
Martin Schwenke	0eb757329e	scripts: Move drop_all_public_ips() to the functions file ... so it can be improved and used elsewhere. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b23c30253cc9eb274b895cac0f8c65245ba0a200)	2013-01-08 15:18:46 +11:00
Martin Schwenke	ef7329a415	tests/simple: Add test to check recovery daemon IP verification Also update ips_are_on_nodeglob() to handle negation. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 13a5944f8a27d43006acfffba76958693cae7702)	2013-01-08 13:27:10 +11:00
Martin Schwenke	e2306dd82b	tests/eventscripts: Ratchet down debug level for ctdb_takeover_tests The default IP allocation algorithm used by ctdb_takeover_tests changed from "non-deterministic IPs" to "LCP2". The latter generates a lot more debug output. ctdb_takeover_tests is used by the ctdb tool stub to calculate IP address changes for failovers. This resulted in unexpected debug output that caused tests to fail. Since eventscript tests don't care how IP allocations are arrived at, the best solution is to turn down the debug level. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3cc596d2b459d834f9785b3a98027e46431ff2b9)	2013-01-08 10:53:28 +11:00
Martin Schwenke	53bd183683	recoverd: Separate each IP allocation algorithm into its own function This makes the code much more readable and maintainable. As a side effect, fix a memory leak in LCP2. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 6a1d88a17321f7e1dc84b4823d5e7588516a6904)	2013-01-08 10:16:11 +11:00
Martin Schwenke	2e8df43561	recoverd: New function unassign_unsuitable_ips() Move the code into a new function so it can be called from a number of places. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 8adb255e62dbe60d1e983047acd7b9c941231d11)	2013-01-08 10:16:11 +11:00
Martin Schwenke	bcefb76884	recoverd: Move failback retry loop into basic_failback() and lcp2_failback() The retry loop is currently in ctdb_takeover_run_core(). Pushing it into each function will make it possible to put each algorithm into a separate top-level function. This will make the code much clearer and more maintainable. Also keep associated test code compatible. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f6ce18d011dd9043b04256690d826deb2640cd89)	2013-01-08 10:16:11 +11:00
Martin Schwenke	443fbb9e01	recoverd: Trying to failback more IPs no longer allocates unassigned IPs Neither basic_failback() nor lcp2_failback() unassign IPs anymore, so there's no point looping back that far. Also fix a unit test that now fails because looping back to handle unassigned IPs is no longer logged. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c09aeaecad7d3232b1c07bab826b96818756f5e0)	2013-01-08 10:16:11 +11:00
Martin Schwenke	dfa7ce7b73	recoverd: basic_failback() can call find_takeover_node() directly Instead of unassigning, looping back and depending on basic_allocate_unassigned. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4dc08e37dec464c8785a2ddae15c7c69d3c81ac3)	2013-01-08 10:16:11 +11:00
Martin Schwenke	326328d520	recoverd: Don't do failback at all when deterministic IPs are in use This seems to be the right thing to do instead of calling into the failback code and continually skipping the release of an IP. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4c87e7cb3fa2cf2e034fa8454364e0a7fe0c8f81)	2013-01-08 10:16:11 +11:00
Martin Schwenke	ef403f70f2	recoverd: Move the test for both 'DeterministicIPs' and 'NoIPFailback' set If this is done earlier then some other logic can be improved. Also, this should be a warning since no error condition is set. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e06476e07197b7327b8bdac9c0b2e7281798ffec)	2013-01-08 10:16:11 +11:00
Martin Schwenke	a3911ed7bf	recoverd: Fix a memory leak in IP allocation Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit bcd5f587aff3ba536cb0b5ef00d2d802352bae25)	2013-01-08 10:16:11 +11:00

... 3 4 5 6 7 ...

4550 Commits