samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-22 13:34:15 +03:00

Author	SHA1	Message	Date
Martin Schwenke	8065366b33	eventscripts: Simplify handling of $service name in service_management Complicated argument handling was introduced to deal with multiple services per eventscript. This was a failure and we split 50.samba. This simplifies several functions to use global $service_name unconditionally instead of having an optional argument. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e24baac0d2952e86d5ff31235901f06e2f2b2449)	2013-05-06 15:45:10 +10:00
Martin Schwenke	4c9438b2a3	eventscripts: Simplify handling of $service name in reconfigure functions Complicated argument handling was introduced to deal with multiple services per eventscript. This was a failure and we split 50.samba. This simplifies several functions to use global $service_name unconditionally instead of having an optional argument. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c2ea72ff565222f9edab408638bd45dbba6e8ff7)	2013-05-06 15:45:10 +10:00
Martin Schwenke	642848b916	eventscripts: Remove unused function ctdb_check_counter_equal() Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit fd536a26b310b5bf9628da62cca0b425f4a54030)	2013-05-06 15:45:10 +10:00
Martin Schwenke	bbd0ed0e29	scripts: Fix script_log() regression 5940a2494e9e43a83f2bca098bd04dfc1a8f2e93 makes script_log() always pass a message to logger, so script_log() can no longer log stdin. Put all the tag fu in the actual tag so the message argument is empty if no message was passed. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9dee4c84273633b9ad82e94dabbf0e6f86edbcef)	2013-05-06 15:43:16 +10:00
Martin Schwenke	823edbf6fe	scripts: Ensure even external scripts get tagged in logs as "ctdbd" Our practice is to search logs for "ctdbd:". We want to make sure we find everything. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5940a2494e9e43a83f2bca098bd04dfc1a8f2e93)	2013-04-22 13:58:36 +10:00
Martin Schwenke	fb8be43d6d	eventscripts: Ensure directories are created Previous commits stopped the top level of the script from creating certain directories but some functions assume that required directories exist. Create those directories instead. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0076cfc4666e5a96eb2c8affb59585b090840e00)	2013-04-22 13:58:36 +10:00
Martin Schwenke	903f4c394c	scripts: Clean up update_tickles() and handling of associated directory Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 700cf95a1f29b4b88460a00a55d57a9e397011e0)	2013-04-19 13:13:36 +10:00
Martin Schwenke	100a0eed90	scripts: Use $CTDB_SCRIPT_DEBUGLEVEL instead of something more complex The current logic is horrible and creates an unnecessary file. Let's make the script debug level independent of ctdbd's debug level. * Have debug() use $CTDB_SCRIPT_DEBUGLEVEL directly * Remove ctdb_set_current_debuglevel() * Remove the "getdebug" command from ctdb stub in eventscript unit tests * Update relevant eventscript unit tests to use $CTDB_SCRIPT_DEBUGLEVEL Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 85efa446c7f5c5af1c3a960001aa777775ae562f)	2013-04-19 13:13:36 +10:00
Srikrishan Malik	28cbe527d4	Changes for unobtrusive recovery and new method for health check. Unobtrusive recovery: Ganesha will not be restarted on failovers. Ganesha health: Use the counters in /var/lib/nfs/ganesha_local to track progress instead of the null call which can timeout if the server is too busy. Signed-off-by: Srikrishan Malik <srimalik@in.ibm.com> Signed-off-by: Lance Russell <lancerus@us.ibm.com> (This used to be ctdb commit 0e651e9da0f1f3c836b4474612ab13d0ccd272d9)	2013-01-11 17:16:46 +11:00
Martin Schwenke	4f622fe9fb	scripts: Make script_log() use supplied message, stop logger from hanging When using syslog any provided message arguments are ignored and not passed to logger. This means that logger blocks waiting on stdin. That's bad. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 50abf597cefe6f8ea2a2ff7694bf84641344a9b1)	2013-01-08 15:18:47 +11:00
Martin Schwenke	d801b02681	scripts: Make drop_all_public_ips() more robust Incorporate some of the logic from ctdb-crash-cleanup.sh that ensures IPs are deleted even if they have the wrong netmask or are on the wrong interface. Factoring out some of the code will allow it to be used elsewhere. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 03356fd5ae7a3ac35fde0289cbea7c71ecf07367)	2013-01-08 15:18:47 +11:00
Martin Schwenke	0eb757329e	scripts: Move drop_all_public_ips() to the functions file ... so it can be improved and used elsewhere. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b23c30253cc9eb274b895cac0f8c65245ba0a200)	2013-01-08 15:18:46 +11:00
Martin Schwenke	217ad07b72	Eventscripts: Change the default reconfigure action to do nothing A default action of restarting the service doesn't obey the principle of least surprise. It causes the NFS service to be implicitly reintroduced. This allows no-op functions to be removed from some eventscripts and service restart functions to be added to others. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c75b5e5b4d000f5c7dab403df8238ceed390c1c0)	2013-01-07 10:35:39 +11:00
Martin Schwenke	9f6b30a517	scripts: Refactor logging code in initscript and functions file Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5ee242c949a98bb7397e0f7368b20d44c06fe772)	2012-10-18 20:05:43 +11:00
Michael Adam	6372592982	config/functions: fix a comment ctdb_check_counter_limits does not fail but succeeds if count >= limit Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit af540ef728303b4a0a188b17c695e9aefab34489)	2012-10-17 21:56:58 +02:00
Martin Schwenke	d33b12a1c5	Eventscripts: Add service-start and service-stop pseudo-events Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit be4ad110ede9981b181ac28f31ffd855a879d5df)	2012-10-10 14:54:53 +11:00
Martin Schwenke	2d719e5c84	eventscripts: Auto-start/stop services in background If $CTDB_SERVICE_AUTOSTARTSTOP="yes" then service start/stop is done in the background with logging. Fix some unit tests for samba and winbind. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3a3dae4cb5ec8b4b8381a4013adda25b87641f3a)	2012-10-03 08:48:23 +10:00
Martin Schwenke	835e0b6d49	Eventscripts: Modernise 60.ganesha to match 60.nfs Originally from Srikrishan Malik <srikrishan.malik@in.ibm.com> with some style changes by me. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 637cab6304dae66b85668506028c76ea1ee88980)	2012-05-16 17:24:21 +10:00
Martin Schwenke	92eb004162	Eventscript functions: add optional version to nfs_check_rpc_service() This can be optional because the 1st item of each action-triple is a test comparison that starts with '-'. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 92f74fd589467b46c758e116e97417edfe8773d7)	2012-05-16 17:05:05 +10:00
Martin Schwenke	476cf45049	Eventscript functions - no longer require interface_modify.sh Make add_ip_to_iface() and delete_ip_from_iface() do their own locking so the external script is no longer required. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 93f90caf91246074d9359bf31a39b26212cccc42)	2012-03-22 15:30:27 +11:00
Martin Schwenke	0b2c3d7d24	Eventscript functions - remove now-unused route/IP re-add script logic This is no longer used by 13.per_ip_routing or anything else. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 2a2ea6c61a05af2d0765e964abcc7ef04047431e)	2012-03-22 15:30:26 +11:00
Martin Schwenke	940efdb8e9	Eventscript functions - remove functions only used by 13.per_ip_routing The relevant functions are now in that script. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 45c3476d12bf0f52966b72d286f101fce1382cd2)	2012-03-22 15:30:26 +11:00
Martin Schwenke	0d67779c67	Eventscript functions - add new function die() Args: 1. Error message to be printed. 2. Optional exit code (default 1) Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 97b0c138cb97e30db27c40b4ee1481109ae90c78)	2012-03-22 15:30:26 +11:00
Mathieu Parent	91431262be	config/functions: CTDB_VARDIR is /var/lib/ctdb on Debian-like systems (This used to be ctdb commit 56160eccb62178f645b017b1257677a1e854b2bc)	2011-11-08 16:31:03 +11:00
Mathieu Parent	a1919fd316	apache's service name is not always httpd Solution 2 of <https://bugzilla.samba.org/show_bug.cgi?id=8317> (This used to be ctdb commit 8b9ac5cd8d867ff4866ac464c570d9293d03a91e)	2011-10-12 20:07:45 +11:00
Martin Schwenke	205c7c7663	Eventscripts - enhance ctdb_replay_monitor_status() Print useful output and return a suitable exit code. The DISABLED and TIMEDOUT statuses use fake negative return codes, and these can't be faked from the shell. So we map DISABLED to OK and TIMEDOUT to ERROR - this should avoid nearly all surprises. When we do this we add a note to the beginning of the output. The alternative is to "fix" ctdbd to use only codes that can actually be returned by shell scripts. However, the reason for using negative codes is probably to distinguish them from real ones... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit dda44d026e0c1b02feb02185b8c200a542be341a)	2011-08-31 15:34:43 +10:00
Martin Schwenke	aa64622137	Eventscripts - use ctdb scriptstatus -Y when replaying status Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5be904fb1fbd546618d25509b41ab836db62a70a)	2011-08-30 16:34:43 +10:00
Martin Schwenke	b97625acb6	Eventscripts: add a synchronous synthetic reconfigure event. In the current code services can only be reconfigured asynchronously. This means that configuration file changes can be made, an asynchronous reconfigure event can be triggered, and it always succeeds. Some time later when a service is actually reconfigured then a failure may be seen. This adds a synthetic reconfigure event that reconfigures a service synchronously so that any failure is reported on exit. ctdb_service_check_reconfigure() is essentially reimplemented. If a reconfigure event is in flight and an ipreallocated or monitor event occurs then any scheduled asynchronous reconfigure is deferred until the next monitor cycle. This is to avoid reconfigures trampling on each other. In this case a monitor event will also replay the previous status to try to avoid exposing any temporary instability. If a reconfigure event collides with another reconfigure event it will exit with status 2, indicating that the reconfigure should be retried. The reconfigure event is implemented using a subprocess to control the exit from the synthetic event. As before, if a monitor event causes a scheduled synchronous reconfigure to occur then it will replay the previous status for the service, given that a reconfigure can cause temporary instability. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 220578bfd3507152b29ba4c28942f9d5e8733886)	2011-08-30 14:29:48 +10:00
Martin Schwenke	7980a4cb44	Eventscripts - new function ctdb_check_args() Pass this "$@" to do common eventscript argument checking. For regular use putting this in 00.ctdb would be enough. However, for developer testing it can be useful to call this in other eventscripts. For example, 10.interfaces and 13.per_ip_routing currently check these by hand. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 36de7e7fd6dfeed61ef9977b8d5b568f90a9707b)	2011-08-30 09:33:47 +10:00
Martin Schwenke	63729fc35d	Eventscripts - ctdb_check_tcp_ports() bug fix. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e8d9c0b251c84d6fdf6ea7d972e5f7d1d0222f9b)	2011-08-30 09:33:47 +10:00
Martin Schwenke	194de8faf8	Eventscripts - fix debugging buglet in ctdb_check_tcp_ports_ctdb() Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 61000e38d6016e58f67e292393756d0bd5262ae5)	2011-08-30 09:33:47 +10:00
Martin Schwenke	9257b57f2c	Eventscripts: New configuration variable CTDB_SERVICE_AUTOSTARTSTOP. Some of the current auto-start/stop logic is broken, particularly for Samba. Fixing it is non-trivial. If $CTDB_SERVICE_AUTOSTARTSTOP is "yes" then auto-start/stop services when told to newly manage or no longer manage them. This defaults to "yes". However, if using a canned configuration file that doesn't set $CTDB_SERVICE_AUTOSTARTSTOP then this stops the auto-start-stop logic from working. Therefore, this works around CQ S1026685 - on the system in question another daemon controls service auto-start/stop and CTDB just gets in the way. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ef71b8290ae49117d7bcc7166598b77cb64cc8a0)	2011-08-30 09:33:47 +10:00
Martin Schwenke	6e7dbf0543	Eventscripts - new default TCP port checker using "ctdb checktcpport" New function ctdb_check_tcp_ports_ctdb(). This should be fast... and is now the default checker. If it fails in an unexpected way we fall back to the nmap and netstat checkers. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a1e16a707ce204817531a61455000361f972080a)	2011-08-17 14:02:45 +10:00
Martin Schwenke	1374327f6e	Eventscripts - generalise TCP port checking plus new nmap-based checker Split the netstat-specific parts of ctdb_check_tcp_ports() into new function ctdb_check_tcp_ports_netstat(). Implement new ctdb_check_tcp_ports_nmap() function that uses "nmap -PS" to check if the desired ports are listening. ctdb_check_tcp_ports() now uses new configuration variable CTDB_TCP_PORT_CHECKERS to decide which port checkers to try. Default value is currently "nmap netstat". If nmap is not found then this will fall back to netstat - if logging is at debug level this will also fill the logs with messages saying the nmap checker failed. This indicates that either nmap should be installed or the default value of CTDB_TCP_PORT_CHECKERS should be changed (in a configuration file) to avoid trying to use nmap. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d9651175b40b9454e7d4e98291955fcf1445085e)	2011-08-17 12:12:20 +10:00
Martin Schwenke	62f654d3d2	Eventscripts - ctdb_check_tcp_ports() only prints netstat output if debugging Use the new debug function to conditionally print the netstat output. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 44c14aeeb11080980fe07c7396d06843a4870747)	2011-08-17 10:39:54 +10:00
Martin Schwenke	86792724a2	Eventscripts - weaken TCP port check message if CTDB has just been started. Sometimes smbd and other services can take a while to start, especially when there is a lot of activity after ctdbd has just started. The TCP port check can then pollute the logs with lots of "ERROR" messages and possibly extra debug. This creates a flag file when a service is started (but not restarted) and this flag is removed the first time that TCP port checks succeed for that service. When a port check fails and the flag file still exists, a less extreme "INFO" message is printed rather than the usual "ERROR" message. This means that until the node actually becomes healthy we see more friendly messages. The subtext is that we're hearing false positive reports "recreates" of CQ S1024874 (samba stopped responding on port 445) quite often when ctdbd is started. This reduces the chances of people reporting such false recreates... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 571865eb6ef847857129d0b1e2ba5fa7254bfe8c)	2011-08-17 10:39:53 +10:00
Martin Schwenke	5c9fbb55ce	Eventscript functions: optimise ctdb_check_tcp_ports() and add debug. ctdb_check_tcp_ports() runs "netstat -a -t -n" in a loop for each port. There are 2 problems with this: * Netstat is run on each loop iteration when it need only be run once. * The -a option is used to list all connections but the function only cares about the listening ports. There may be many thousands of non-listening ports to grep through. This changes ctdb_check_tcp_ports() to run netstat with the -l option instead of the -a option. It also only runs netstat once before the main loop. When a port is found to not be listening the output of the netstat command is now dumped to help with debugging. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 830355a8b18c53cfcc3ad1e3009bbb1a7a681fa0)	2011-08-17 10:39:53 +10:00
Martin Schwenke	f0f9271301	Eventscripts: add a debug() function and call ctdb_set_current_debuglevel() The debug function passes its arguments to echo if $CTDB_CURRENT_DEBUGLEVEL is >= 4 (i.e. DEBUG). If no args are given then use stdin - this allows the function to be used with here documents. To ensure $CTDB_CURRENT_DEBUGLEVEL is set, ctdb_set_current_debuglevel() is called near the end of the functions file. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 6143483d9f87322578c00f12081e381f425226ca)	2011-08-17 10:39:35 +10:00
Martin Schwenke	171bef3d68	Eventscripts - new function ctdb_set_current_debuglevel() This function ensures that CTDB_CURRENT_DEBUGLEVEL is set. It works like this: 1. If it is already set then do nothing, since it might have been set some other way. The recommended "other way" would be to add a file in rc.local.d/. 2. If it is not set then set it by sourcing /var/ctdb/eventscript_debuglevel. 3. If this file does not exist then create it using output from "ctdb getdebug". If the optional 1st argument is set to "create" then don't source an existing file but create a new one instead - this is useful for creating the file just once in each event run in, say, 00.ctdb. If there's a problem getting the debug level from ctdb then it is silently set to 0 - no use spamming logs if our debug code is broken... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 93910921c8a25f2b029733cd938069ff7c7bdab7)	2011-08-17 09:00:46 +10:00
Martin Schwenke	32fe247e37	Eventscripts: In 60.nfs don't restart NFS when restarting rpc.lockd. This effectively reverts 953dbfbddad656a64e30a6aca115cb1479d11573 and is a policy decision. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 380c9263eb37db5a250264316e250c2160908263)	2011-08-12 16:28:09 +10:00
Martin Schwenke	398116ff29	Eventscripts: clean up 60.nfs monitor event. This adds a helper function called nfs_check_rpc_service() and uses it to make the monitor event much more readable. An example of usage is as follows: nfs_check_rpc_service "mountd" \ -ge 10 "verbose restart:b unhealthy" \ -eq 5 "restart:b" The first argument to nfs_check_rpc_service() is the name of the RPC service to be checked. The RPC service corresponding to this command is checked for availability using the rpcinfo command. If the service is available then the function succeeds and subsequent arguments are ignored. If the rpcinfo check fails then a failure counter for that particular RPC service is incremented and subsequent arguments are processed in groups of 3: 1. An integer comparison operator supported by test. 2. An integer failure limit. 3. An action string. The value of the failure counter is checked using (1) and (2) above. The first check that succeeds has its action string processed - note that this explains the somewhat curious reverse ordering of checks. In the example above: * If the counter is >= 10 then a verbose message is printed describing the failure, the service is restarted in the background and the node is marked as unhealthy (via an "exit 1" from the function). * If the counter is == 5 then the service is restarted in the background. For more action options please see the code. This also changes the ctdb_check_rpc() function so that it no longer takes a program number to check. It now just takes a real RPC program name that rpcinfo can resolve via /etc/rpc. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9b66057964756a6245bafb436eb6106fb6a2866e)	2011-08-12 14:16:14 +10:00
Martin Schwenke	3a760b09ed	Eventscripts: improvements to ctdb_service_check_reconfigure(). * Make this function applicable to "ipreallocated" event too. * Monitor event should not always succeed just because we reconfigure. If the service was unhealthy before the reconfigure and we end the reconfigure with "exit 0" then we can cause the node's health status to flip-flop. To avoid this we return the status of the service from the previous monitor event. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 21dfcbbdccd906fcd6ab7bba81418ce565bf63aa)	2011-08-11 10:46:57 +10:00
Martin Schwenke	2a14f91722	Eventscript functions: new function ctdb_check_counter(). This should eventually be able to replace ctdb_check_counter_limit() and ctdb_check_counter_equal(), although it doesn't issue warnings like the former. It takes 4 optional arguments: 1. _msg - If "error" then over limit causes an error message and an exit 1. Anything else fails silently but the function returns 1. Default is "error". 2. _op - An integer operator supported by test (e.g. -eq, -ge, -gt). Default is -ge. 3. _limit - Limit for the counter to be used in comparison. Default is $service_fail_limit. 4. _service_name - Used to identify the counter. Default is $service_name. For example: ctdb_check_counter error -ge 5 foo will print a message and exit 1 if the counter for foo is >= 5, whereas ctdb_check_counter check -ge 5 foo will just return 1 if the counter for foo is >= 5, and ctdb_check_counter will print a message and exit 1 if the counter for $service_name is >= $service_fail_limit. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5b01b7233515669e995e037205796e265643b176)	2011-08-11 10:46:56 +10:00
Martin Schwenke	219c6fd55b	Eventscripts: remove unused remove_ip() function. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 881af7c1417962b9b3ade6565b3e8eb9f9df7a97)	2011-08-11 10:46:56 +10:00
Martin Schwenke	5c948528b5	Eventscripts: startstop_nfs stop no longer redirects output to /dev/null. When stopping (as opposed to restarting) it is useful to see this information. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a9ab1937239761dc32b143c9d225447bc6f090b4)	2011-08-11 10:46:56 +10:00
Martin Schwenke	caee6f1508	Eventscripts: fix typo in _ctdb_counter_common(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f57d1722b6aa082f3f826171acc57d7d796ea95c)	2011-08-11 10:46:56 +10:00
Martin Schwenke	ab693dbcc0	Eventscripts: improve log messages in ctdb_start_stop_service(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 6da7095192fb172a06b434cfb02f4bfa6221b343)	2011-08-11 10:46:56 +10:00
Martin Schwenke	1b956b2b0a	Eventscript functions: fix counter regression. d362be7d32079ac1390d67056ce107bfbca2c937 wasn't well thought out. Subsequent commits depend on ctdb_counter_init() taking an argument, so this makes those cases work. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 05a8fcfbac3da2b5843b31e0fe258255cc761190)	2011-08-11 10:46:56 +10:00
Martin Schwenke	217edfa1c8	Eventscript functions: ctdb_service_check_reconfigure() acts only on monitor. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit beabf506a5eb68fc50fdbf8772c1d2bb0f7951e3)	2011-08-11 10:46:56 +10:00
Martin Schwenke	820d9b30ea	Eventscripts: rejig the reconfigure infrastructure. * Add an optional service name argument to existing reconfigure functions. * Use function service_reconfigure() instead of variable $service_reconfigure to specify how a service is reconfigured. * New function ctdb_service_check_reconfigure() reconfigures a service if it is flagged for reconfigure. * Remove $service_reconfigure settings from 40.vsftpd and 41.httpd - they're the defaults. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 15d4111d0761d82f57d5d4f0b1227812d14e4d7c)	2011-08-11 10:46:20 +10:00
Martin Schwenke	5b5bd3d27b	Eventscript functions: move flagging of managed services. Move flagging of managed or unmanaged services into ctdb_service_start() and ctdb_service_stop(). That way services will be correctly flagged if they are started from the startup and shutdown events. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 8675744cbd90b5a5095ed6fff7b36ae82004a457)	2011-08-11 10:46:20 +10:00
Martin Schwenke	428e32d647	Eventscript function: change service_start into a function. service_start is currently a variable. This makes passing arguments hard. We change it to be a function and put default definitions into the functions file. We use a convention that if a service name argument is passed to a redefined version of service_start() or service_stop() then it will act unconditionally. If no argument is passed then it can use internal logic to decide if services should really be started. This is useful when a single eventscript handles multiple services. This is a cherry-pick of ae38895 that needed to be reset mid-stream. There is still some breakage following this commit. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 86e4aefed9fd1028660c98e3ea758c2b75ffc1d8)	2011-08-11 10:46:20 +10:00
Martin Schwenke	f60802c776	Eventscript functions: add optional event name argument to fail count functions. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b14f18649f42aab80ce0336c15ab6159f241c9af)	2011-08-11 10:46:20 +10:00
Martin Schwenke	ea6a53e2b3	Eventscript functions - optimise is_ctdb_managed_service(). This function generates a lot of trace when running under "set -x". This is due to the backward compatibility code. This adds 3 optimisations: 1. Before invoking the backward compatibility code, is_ctdb_managed_service() returns early if the service is listed in $CTDB_MANAGED_SERVICES. 2. ctdb_compat_managed_service() actually now updates $CTDB_MANAGED_SERVICES instead of temporary variable $t. This means that a subsequent call to is_ctdb_managed_service() will short circuit due to optimisation (1). 3. ctdb_compat_managed_service() only adds a service to $CTDB_MANAGED_SERVICES if it is the service being checked by is_ctdb_managed_service(). This stops irrelevant services being added to $CTDB_MANAGED_SERVICES multiple times by multiple calls to is_ctdb_managed_service(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 758f4667c60089e09a0439c1eb74f5e426ca5e2e)	2011-08-11 10:46:20 +10:00
Martin Schwenke	ee38b9a159	Eventscript functions: new function ctdb_setup_service_state_dir(). To be used by eventscripts to create a per-service directory for their own state data. $service_state_dir is set to point to the new directory. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a273554791c2a5281aee28f8e2be0c514e14c91e)	2011-08-09 16:35:07 +10:00
Martin Schwenke	ec33c04283	Eventscript functions: new functions to remember/check if service managed. This was done ad hoc and was badly named. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9a084a121f629b2c1bcefc1e4c4a4a5cacf53987)	2011-08-09 16:20:08 +10:00
Martin Schwenke	72362e7b56	Eventscripts: source a file specified by $CTDB_RC_LOCAL in functions file. Another unit testing hook. This is easier than dropping files into rc.local.d/ and then removing them. The file has to be executable. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b13ac3bdaf326a6cdfd87da9195eb9630806c418)	2011-08-08 13:51:32 +10:00
Martin Schwenke	394bbe8454	Eventscript functions - use $CTDB_VARDIR instead of local $ctdb_spool_dir. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d0c6d9b19f0dd8946f9504b0d1cf50dd21f7a592)	2011-08-08 13:21:23 +10:00
Martin Schwenke	cfdccc5cac	Eventscripts: use set_proc() in startstop_nfs(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5a3d5c6b1ca3682bb45104e50061871dec6e9b1d)	2011-08-03 19:57:40 +10:00
Martin Schwenke	75bbc93c0b	Eventscripts: remove unnecessary absolute paths from external commands. For eventscript unit testing it will be necessary to override external commands to allow stub implementations to be used. If absolute paths aren't used then this can be done using either a fake bin/ subdirectory or by using shell functions. This removes all of the simple cases of absolute paths. Signed-off-by: Martin Schwenke <martin@meltin.net> Conflicts: config/ctdb.init config/events.d/50.samba Keep old code but remove absolute paths. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 05851d50b0078de8bf4691442d718825adca6fe8)	2011-08-03 17:19:15 +10:00
Martin Schwenke	5f4ab05766	Eventscripts: new functions set_proc() and get_proc(). These provide a thin layer around writing and reading files in /proc. They can be easily replaced by stubs for unit testing. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 637f9d8af517b73c72ed8f3cc2a2661f11eb2126)	2011-08-03 17:04:58 +10:00
Martin Schwenke	571e55ac0d	Eventscripts: remove ctdb_wait_command() and ctdb_wait_tcp_ports() functions. These haven't been used for a long time. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f5fd361cadb3ea18d29e2d7215a7853718e48d00)	2011-08-03 17:02:41 +10:00
Martin Schwenke	e3a9991e46	Eventscripts: iptables() should put lock in $CTDB_VARDIR. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3f04793f391c63b78ffb9c9851ab3f0daf3ed50a)	2011-08-03 16:55:43 +10:00
Martin Schwenke	3bbfdfcdd3	Make Emacs recognise that the eventscript functions file is a shell script. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a6dfb76cfa759f6f9409f24368111c4f85ca0fbf)	2011-08-03 16:49:38 +10:00
Martin Schwenke	3380c6ce1d	Eventscript functions: add $CTDB_ETCDIR and hook service() functions. * $CTDB_ETCDIR defaults to /etc but can be changed for testing. All hard-coded instances of /etc have been changed to $CTDB_ETCDIR. This includes references to /etc/init.d and /etc/sysconfig. * service() and nice_service() functions now call new function _service(). This makes it easier to override these functions (say, in rc.local) for testing and call most of the existing functionality using _service(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f43c9a7604b779bb6257ddb2bf3cbe266d496a63)	2011-08-03 16:45:54 +10:00
Martin Schwenke	d31fbcab4b	Set $CTDB_VARDIR in the functions file. This will be needed when eventscripts that use it are called externally. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ebd53b66b0cc66d9d04830781886234167fc2164)	2011-08-03 16:44:49 +10:00
Martin Schwenke	3efd5ef77c	Eventscripts: only autostart during a monitor event. Otherwise we might short-circuit events that are run only once and actually need to do something. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit c4f9e8a43540bc049b2771e0a2d76d37b9d17331)	2011-01-11 16:48:50 +11:00
Martin Schwenke	fb8f199651	Eventscripts: print a message when reconfiguring a service. Otherwise there can be strange error messages from services stopping/starting, without any context. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 8bcf7ab164429ddc0ae530133e114f186a8146dd)	2011-01-11 16:48:17 +11:00
Martin Schwenke	934ae76d38	Eventscripts: work around NFS restart failure under load. "service nfs restart" can fail. To stop nfsd it sends a SIGINT and nfsd might take a while to process it if the system is loaded. Starting nfsd may then fail because resources are still in use. This does some /proc magic to tell nfsd to do no more processing. It then runs service stop, kills nfsd with SIGKILL, and then runs service start. This is much less likely to fail. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a9bf4f82852975b0b627f61ceb2d23401f630805)	2011-01-11 16:47:43 +11:00
Ronnie Sahlberg	8147d29598	add a missing part of the import of the previous ganesha patch (This used to be ctdb commit 171b8855bb2feae7f7dd6a079571f3113dedd6f4)	2010-12-06 11:50:15 +11:00
Ronnie Sahlberg	ebcc866ae0	update autostart/stop to work for samba (This used to be ctdb commit 37ab57e2adaecc3f7996ea20af45a5df0cd8be76)	2010-11-22 20:42:26 +11:00
Martin Schwenke	a2af87482b	Eventscript functions - catch failures in ctdb_service_start(). ctdb_service_start() currently succeeds if ctdb_counter_init() succeeds. This changes it to fail when a service start fails. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ddb73962d72d933bf0edc28be0dbb45bea7e5ef4)	2010-11-18 12:15:05 +11:00
Martin Schwenke	3ab768e8d4	50.samba eventscript should stop/start services when they become (un)managed. When the value of $CTDB_MANAGES_SAMBA or $CTDB_MANAGES_WINBIND changes (or corresponding changes are made to $CTDB_MANAGED_VERSIONS), the associated service should be started or stopped as necessary. This adds calls to ctdb_start_stop_service() to manage starting/stopping samba and winbind. An associated cleanup is made to the initial checks that one of $CTDB_MANAGES_SAMBA or $CTDB_MANAGES_WINBIND is set, replacing them with calls to is_ctdb_managed_service(). To handle the winbind cases ctdb_start_stop_service() and is_ctdb_managed_service() are updated to take an optional service name parameter. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d98f175e8420d921a123ae9c0ce00945350b1537)	2010-11-18 12:12:30 +11:00
Ronnie Sahlberg	4fe85e5be5	add a new support function ctdb_check_counter_equal() update nfs to try to restart the service after 10 consecutive failures and to flag the node unhealthy after 15 add similar function to mountd (This used to be ctdb commit 1569a54bb82fc433895ed68f816cf48399ad9d40)	2010-11-17 13:54:57 +11:00
Martin Schwenke	8fe1ec3754	Eventscripts: make loadconfig() function hookable by the test suite. Rename loadconfig() to _loadconfig(). Add a new loadconfig() that simply calls _loadconfig(). This makes it easy for the test suite to override loadconfig(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 1d77a3adfff893b3c01b87f791e72c0d3148425c)	2010-11-17 11:46:48 +11:00
Martin Schwenke	6ab5ae2c9b	60.nfs only fails or warns after 10 consecutive nfsd/statd failures. These failures are sometimes the result of slow restarts so we want to avoid dirtying the logs or marking a node unhealthy because of them, unless they are excessive. For these 2 cases we use the existing fail counting code but hack a temporary service_name in a subshell to allow separate fail counts. We also update ctdb_check_rpc() so that it captures the error output from rpcinfo and we add a message including the service name to the beginning. The error is printed to stdout but is also stored in ctdb_check_rpc_out to allow it to be conditionally used by the caller. This function also now returns non-zero rather than exiting on failure. Other direct rpcinfo calls are replaced by calls to ctdb_check_rpc() for consistency. Option handling code for service restarts is cleaned up so that it fits in 80 columns. A more informative restart message is now used in all cases, printing the exact command being used to start a service. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 79c25fe241cf5d8f92e23d3736823ebaf4e1769d)	2010-11-17 11:43:09 +11:00
Ronnie Sahlberg	b4e3a95039	try to restart NFS LOCKD if it failed to start (This used to be ctdb commit 2913cc93a9a172caf9e0d6675cfa4de4cc957b13)	2010-10-14 08:13:09 +11:00
Ronnie Sahlberg	dc2f87737d	Dont store temporary runtime data in $CTDB_BASE/state since that will usually be /etc/ctdb/state and storing this under /etc is just wrong. Add a new variable CTDB_VARDIR that defaults to /var/ctdb and store the data there instead. (This used to be ctdb commit 516423c25afa9861d9988096efa8a4a2b12b31b1)	2010-09-03 12:43:28 +10:00
Martin Schwenke	a104d1d823	NFS tickles: use addtickle/deltickle instead of shared tickle directory. This adds a new function update_tickles() that tracks tickles for a given port using the new ctdb addtickle/deltickle commands. This function is used in events.d/60.nfs to handle NFS tickles. events.d/61.nfstickle is removed. The /proc/sys/net/ipv4/tcp_tw_recycle setup is also moved to events.d/60.nfs. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit dca4c4ebf3c35f8db3ae208efb7a83abbf726ed6)	2010-08-26 14:59:59 +10:00
Ronnie Sahlberg	729f1ddea0	On RHEL, "service nfs stop;service nfs start" and "service nfs restart" sometimes (very rarely) fails to restart the service. Add a function to restart NFSd on SLES and RHEL-like systems. If we detect the system is unhealthy due to kNFSd not running, try to restart the service again "service nfs restart" and hope for the best. CQ1019372 (This used to be ctdb commit 25c4ce7e919f13226219f036bcffd2be76b2f06c)	2010-08-19 07:18:22 +10:00
Rusty Russell	34ce8a4f02	config: wrap iptables in flock to avoid concurrency. When doing a releaseip event, we do them in parallel for all the separate IPs. This creates a problem for iptables, which isn't reentrant, giving the strange message: iptables encountered unknown error "18446744073709551615" while initializing table "filter" The worst possible symptom of this is that releaseip won't remove the rule which prevents us listening to clients during releaseip, and the node will be healthy but non-responsive. The simple workaround is to flock-wrap iptables. Better would be to rework the code so we didn't need to use iptables in these paths. CQ:S1018353 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 72d6914ee913272312d7b68f1be5ad05ad06587d)	2010-07-15 10:45:24 +09:30
Michael Adam	b40fa22239	functions: when checking for a directory also check whether it can be accessed. Thanks to "waKKu" on irc for this improvement. Michael (This used to be ctdb commit 81e1483dd0ce2cd091721e456c0c194cc58442f3)	2010-05-11 11:29:45 +02:00
Ronnie Sahlberg	d4f7a59960	Merge root@10.1.1.27:/shared/ctdb/ctdb-git (This used to be ctdb commit e59310132d8126ee3afc191b5db56e80a32986e8)	2010-03-11 18:15:41 +11:00
Wolfgang Mueller-Friedt	e26a26fd7a	ctdb_setstatus in /etc/ctdb/functions was not working correctly because it was called with a wrong parameter list (This used to be ctdb commit e1e285d9f7fa3237dbbacca52a4eb2b264fa5986)	2010-03-11 17:52:42 +11:00
Mathieu Parent	e7bca0dcfc	Correct nice_service() nice takes a binary as argument and not a function or builtin command (This used to be ctdb commit e21b40db64b314a24caa2bc611cb48b93decb5aa)	2010-03-10 17:39:56 +11:00
Stefan Metzmacher	d71c40cad7	config: make remove_ip() a wrapper of delete_ip_from_iface() metze (This used to be ctdb commit e66d6636b80e3614f183366ec92fc3c6d5c323da)	2010-02-23 10:38:48 +01:00
Stefan Metzmacher	3bd1910428	config: interface_modify states in a $CTDB_BASE/state/interface_modify directory metze (This used to be ctdb commit 756c8b953fef7132dae74b5b244baeb3108dec54)	2010-02-23 10:38:48 +01:00
Stefan Metzmacher	d8ab328ee1	config: add setup_iface_ip_readd_script() helper function This adds a generic infrastructure to register scripts which will be called when the delete_ip_from_iface() function needs to readd secondary ips to an interface. metze (This used to be ctdb commit ac97d65f44e8dc8bf2ec8f68e4db3448521755a2)	2010-02-23 10:38:47 +01:00
Martin Schwenke	56b178e1a2	eventscripts: stop loadconfig function from loading ctdb config file twice. If "$1" was empty then loadconfig would load the ctdb config twice. This stops that from happening. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0406d406da70aaee7ad6aac236114905c5d03ed2)	2010-01-22 17:19:12 +11:00
Stefan Metzmacher	2ecf8053f9	config: add some ipv4 helper shell functions Many thanks to Michael Adam <obnox@samba.org> for the basic work. metze (This used to be ctdb commit ff9c641763702ae99632bbf4d0825d578440c074)	2010-01-20 11:10:57 +01:00
Stefan Metzmacher	4493ba6ffa	config: add interface_modify.sh and call it under flock to make modification on interfaces atomic When two releaseip events run in parallel it's possible that the 2nd script readds a secondary ip that was removed by the 1st script. metze (This used to be ctdb commit e02417b2a55c45ac2c125b1b3463c9c39e7bc07a)	2010-01-20 11:10:48 +01:00
Stefan Metzmacher	d01870f138	config/functions: add tickle_tcp_connections() metze (This used to be ctdb commit 2397f13d7b5ca3847ef148187c6b179d06f6a47a)	2010-01-20 09:44:37 +01:00
Martin Schwenke	db25ca69e5	Eventscript argument cleanups and introduction of ctdb_standard_event_handler. The functions file no longer causes a side-effect by doing a shift. It also doesn't set a convenience variable for $1. All eventscripts now explicitly use "$1" in their case statement, as does the initscript. The absence of a shift means that the takeip/releaseip events now explicitly reference $2-$4 rather than $1-$3. New function ctdb_standard_event_handler handles the status and setstatus events, and exits for either of those events. It is called via a default case in each eventscript, replacing an explicit status case where applicable. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3d55408cbbb3bb71670b80f3dad5639ea0be5b5b)	2009-12-01 17:43:47 +11:00
Martin Schwenke	ad431c3520	Event scripts: functions file now intercepts status and setstatus. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit a1f37fdc5217e57d2d643d77a811afca747685e0)	2009-11-27 15:57:33 +11:00
Martin Schwenke	ece15620c0	Event scripts: use $script_name rather than $service name for status. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 517e9d9b188b18dffc712a8fecddb41540d27b8d)	2009-11-25 16:42:14 +11:00
Martin Schwenke	ee10ea202b	Event scripts: Respect CTDB_MANAGES_NFS and add function log_status_cat. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5d97c07be13a8209a81dfc8f73e49371949e4dc3)	2009-11-25 16:34:49 +11:00
Martin Schwenke	1edcb89948	More eventscript cleanups. Initial smoke testing seems OK. Apart from lots of cleanup work, this also fixes a bug where the share checks didn't used to cope with directory names containing spaces. The previous commit also loaded the config incorrectly. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3c93336ab92c2e4829ff4dc360045bfa6df21d50)	2009-11-25 16:30:47 +11:00
Martin Schwenke	a4a048b5cd	Now vaguely tested initscript updates. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f1e350f9edb74cc44b6c5be4c062fd93e98ba8c4)	2009-11-19 16:48:19 +11:00
Martin Schwenke	ee513c1ba2	More untested eventscript factorisation. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ac655b0a65b32d809d47fec9821f7f31bb2fe2a7)	2009-11-19 15:00:17 +11:00
Martin Schwenke	73cb65bf1a	Eventscripts: Untested factorisations and introduction of status event. This is the first stage of an experimental change to eventscripts. Ronnie and I did a few hours of factorisation of 40.vsftpd and applied many of the changes to 41.httpd. Other eventscripts were also modified. At this stage this is completely untested. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 364e70b763f0ccd7714d15723ad3ea4d7e2968a1)	2009-11-13 18:28:25 +11:00
Ronnie Sahlberg	3cbaf935af	suggestion from metze, use killtcp and kill both directions of the nfs connections. we used to kill only one direction since the other direction was unkillable but recent kernels allow us to kill both (This used to be ctdb commit 8001ae580bcc28d45f6026b529d7ffc247cbba34)	2009-11-06 09:54:03 +11:00
Ronnie Sahlberg	58780f4137	add a directory where multiple local scripts can be added to run when executing eventscripts (This used to be ctdb commit 27d152a918680a59c7412aec7e1772f25b72d469)	2009-10-19 16:22:15 +11:00
Martin Schwenke	ab98c1b0f1	Clean up ctdb_check_directories* eventscript functions. There are 2 problems with this code: * The loop in ctdb_check_directories_probe() breaks on filenames containing whitespace. The fix to protect them is to pass "$@" to this function and have it operate on "$@". Note that there's still a problem with whitespace in filenames in the 50.samba eventscript. To fix this ctdb_check_directories_probe should read the filenames from stdin. Another time... * The check for '%' in filenames in ctdb_check_directories_probe() ends up involving several forks. On a modern machine this can cost a couple of minutes when checking a large number of directories. The fix is to use a case statement. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit eb1fecaef9aa5cb85dff7d4f7af8a9878deabed8)	2009-10-12 16:32:49 +11:00
Martin Schwenke	78b7043411	40.vsftpd monitor event only fails after 2 failures to connect to port 21. Change the monitor event in 40.vsftpd so it only fails if there are 2 successive failures connecting to port 21. This reduces the likelihood of unhealthy nodes due to vsftpd being restarted for reconfiguration due to node failover or system reconfiguration. New eventscript functions ctdb_counter_init, ctdb_counter_incr, ctdb_counter_limit. These are used to count arbitrary things in eventscripts, depending on the eventscript name and a tag that is passed, and determine if a specified limit has been hit. They're good for counting failures! These functions are used in 40.vsftpd and also in 01.reclock - the latter used to do the counting without these functions. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit cfe63636a163730ae9ad3554b78519b3c07d8896)	2009-09-30 21:05:16 +10:00
Ronnie Sahlberg	029fd6b00f	Revert "try to restart statd everytime it fails, not just the first time" This reverts commit 4f7b39a4871af28df1c4545ec37db179fa47a7da. (This used to be ctdb commit db7b96304e4725f29b12398b7582e385daed63ed)	2009-09-15 19:33:35 +10:00
Ronnie Sahlberg	59cacded72	try to restart statd everytime it fails, not just the first time (This used to be ctdb commit 4f7b39a4871af28df1c4545ec37db179fa47a7da)	2009-09-15 13:35:58 +10:00
Michael Adam	d8f9dad26b	Rename the CTDB_INIT_STYLE "ubuntu" to "debian" - this is where it comes from. Michael (This used to be ctdb commit b060911683d8ac201806d35a505867fe3ba9519f)	2009-09-09 09:52:13 +02:00
Ronnie Sahlberg	934d8a6b5f	From : Flavio Carmo Junior <carmo.flavio@gmail.com> Add a helper function that checks whether a unix domain socket exists and there is a daemon LISTENING to it similar to the existing function to check for a daemon LISTENING to a tcp/ip socket. (This used to be ctdb commit 025a836ab3be3c078fccd8c10b10dfffbfdd94d0)	2009-05-19 08:47:19 +10:00
Ronnie Sahlberg	4be3e86405	create a function "remote_ip" which can be used from scripts to remove a single ip from an interface. use this function from the natgw eventscript (This used to be ctdb commit feab5f30b2d6cebf4dd28abc5a81f93424a4c852)	2009-04-08 12:49:28 +10:00
Ronnie Sahlberg	36ec47d610	create a variant of kill_tcp_connections that only kills off the local side of a connection (This used to be ctdb commit dc2f28f7c988364b5d45f3048be4db3e5ff113b3)	2009-03-24 14:05:31 +11:00
root	e7de72a1ac	use netstat to check first and only fall back to netcat if netstat is unavailable (This used to be ctdb commit dfb16ce9ed65048d30109851737a9075d071ecdb)	2009-02-05 14:44:46 +11:00
Michael Adam	77bd2b6c91	ctdb_check_tcp_ports: correctly detect listeners on ipv6 :::<port> w/out netcat The netstat test only grepped for the ipv4 wildcard address. Now the ipv6 wildcard listener is correctly detected as well. Michael (This used to be ctdb commit 78e7928797e239e71f96eb001460a0dbf943e18f)	2009-01-30 22:45:52 +01:00
Michael Adam	bbf36eebb9	ctdb_check_tcp_ports: fail the check if neither netstat nor netcat/nc is found Michael (This used to be ctdb commit 25d04bbe9528fafc68751f7beb22daeee3163d34)	2009-01-30 22:45:52 +01:00
Michael Adam	ba6612ec12	ctdb_check_tcp_ports: cope with multiple locations of netcat or nc This fixes tcp port monitor events on systems, where netcat or nc is not found in /usr/bin/, Debian, for instance. The patch also separates the process of finding the binaries and calling them, moving the detection outside of the loop over the ports list. Michael (This used to be ctdb commit 3adf100e7f0c04aaf2da9ae4c6984cdb708c3b57)	2009-01-30 22:45:39 +01:00
Michael Adam	a2d6abdb34	functions: make (nice_)service a noop for empty service name Michael (This used to be ctdb commit 4cac2a16b70be772e4f1520020762f63c0bf3efe)	2009-01-16 13:31:02 +01:00
Michael Adam	a6ea1b20e5	functions: add detect_init_style(). Michael (This used to be ctdb commit ab34a9480b59c649a4fc73a466c8ca0975453ed9)	2009-01-16 13:26:57 +01:00
Michael Adam	f844ca744a	skip directories containing macros (%) in ctdb_check_directories_probe This prevents the monitor action of 50.samba from failing on e.g. a typical [homes] service with "path = /home/%S" . Michael (This used to be ctdb commit 023d6c2e3017d323b5a70f987f3b4e0b8b8f0f7b)	2008-12-16 09:51:36 +11:00
Ronnie Sahlberg	9ce657b044	When we harvest all tcp connections to kill off after a takeip/releaseip event we must also harvest the ipv4 connections which may be presented in ::ff:xxxx:xxxx form by netstat (This used to be ctdb commit 293d12a40501320a21efaf592b8f20e8590a5197)	2008-08-20 12:50:50 +10:00
Ronnie Sahlberg	43536648c5	update the socketkiller in the eventscripts to be able to handle ipv6 (This used to be ctdb commit 6da7b36b7ccc4ee9b809867ea32036f09a801bb3)	2008-08-20 09:47:00 +10:00
Andrew Tridgell	d47fe5f83b	ensure we use killtcp on non-NFS/non-CIFS ports for faster failover of other protocols (This used to be ctdb commit aefcb1f817581ac8cd67712d07159fc802f96623)	2008-08-01 14:17:50 +10:00
Andrew Tridgell	4eac51341c	allow for probing of directories without raising an error (This used to be ctdb commit 8fed021d11160b137f4140ea02947347250e2959)	2008-07-23 15:35:46 +10:00
Ronnie Sahlberg	6bf597d061	mark /etc/ctdb/functions as a config file to keep rpmlint happy (This used to be ctdb commit 8f6cd88e74de24af8dde2b6cabb2348c4f914b99)	2008-07-09 10:24:19 +10:00
Ronnie Sahlberg	2d644b3fbe	Replace \s with [[:space:]] in our regexps we use for egrep. Kevin Collins noticed that RHEL5 grep-2.5.1-54.2.el5 built for x86 does not handle \s while the exact same RHEL5 package for amd64 does! [[:space:]] is more portable. Even across the same package version ( different architecture ) from the same vendor :-) (This used to be ctdb commit fd7bb21c4f9289fc34a57f9d8cb7c13a02d06096)	2008-07-09 10:03:21 +10:00
Ronnie Sahlberg	03cbb27a79	make /etc/ctdb/functions executable and add a hashbang to it so rpmlint wont complain (This used to be ctdb commit 9b8179ad043a80e0e18eeba427a7b7b15690d039)	2008-06-27 09:29:38 +10:00
Ronnie Sahlberg	ea86c31da6	shell scripts need extra spaces sometime (This used to be ctdb commit f6409b19972fa94257af9aa51def539f639bc226)	2008-04-10 07:01:22 +10:00
Ronnie Sahlberg	b902e09350	add possibility to provide site local modifications to the event system through a /etc/ctdb/rc.local script that is sourced by /etc/ctdb/functions (This used to be ctdb commit a5b7dd97e3faf0c4f289240307d0e22a67cf2353)	2008-04-10 06:50:12 +10:00
Ronnie Sahlberg	8da0e15a07	from Mathieu PARENT <math.parent@gmail.com> Simulate "nice service" on systems that do not have "service" (This used to be ctdb commit d0e6dcbadaf41745d423640e5ff5bafd9f68eb88)	2008-02-13 08:20:20 +11:00
Ronnie Sahlberg	42702fa770	add helpers to stop/start nfs lockmanager on different platforms (This used to be ctdb commit 3b797d851bd4bdb8ec2b3981061c668d2cf0f97c)	2008-02-11 09:52:09 +11:00
Ronnie Sahlberg	0e31eaed57	create a startstop_nfs function that can start/stop the nfs service of different platforms (This used to be ctdb commit f6cc6bd1f62138fbf812d1917f7341e2fa2323da)	2008-02-11 09:35:37 +11:00
Ronnie Sahlberg	81232a9e29	dont use absolute pathnames for the netstat tool it can be either in /bin or /usr/bin (This used to be ctdb commit 4ab09e90a8a81b26d2e2af168cfce3c49a98c0e5)	2008-02-07 15:41:48 +11:00
Andrew Tridgell	d815bc6f26	the event scripts no longer need to show a date, as its done by the main ctdbd logging function (This used to be ctdb commit b5e691c4c3fe80b219a9ac355d28b766cb0303f3)	2008-01-16 22:06:44 +11:00
Ronnie Sahlberg	fa5d51c238	move the kill_tcp_connections() function from 10.interfaces to functions (This used to be ctdb commit 055948530fb16bf49c42fc4489f29a21665156c0)	2007-10-11 07:27:38 +10:00
Andrew Tridgell	c62490569b	cope with non-standard install dirs in event scripts (This used to be ctdb commit 52fff5345873690a9cc86495f414343eaa3bd540)	2007-09-14 14:14:03 +10:00
Andrew Tridgell	4f261ae191	remove more cruft from the logs (This used to be ctdb commit b67f35c483b6cbb5facaa6380c7794709f44213a)	2007-09-13 10:39:05 +10:00
Andrew Tridgell	1b53ecc445	remove clutter from ctdb log file (This used to be ctdb commit 54d5dcaaee0498f40bbee5059cc72d0ca75d33b7)	2007-09-13 10:03:18 +10:00
Ronnie Sahlberg	8e89b27098	try netstat as a last attempt to check a tcp port in ctdb_check_tcp_ports() as well (This used to be ctdb commit ad0292726f9cfc8afe3733b30ac2d5621e9a48f1)	2007-07-15 09:29:08 +10:00
Ronnie Sahlberg	4c276ded1f	if we dont have nc or netcat, try using netstat as a final attempt to check for tcp ports (the check for these tools should not really use hardcoded paths) (This used to be ctdb commit 56d77082c07a519dd3804cc24cc7ba889b8469ff)	2007-07-15 09:26:54 +10:00
Ronnie Sahlberg	3890fde07f	if we dont have /etc/sysconfig and we dont have /etc/default check /etc/ctdb/sysconfig as a last option (This used to be ctdb commit 1043929ceb0cd04ab6466e9a5d7d52f9af1cb8e8)	2007-07-15 09:13:50 +10:00
Ronnie Sahlberg	82824e0680	when we have found that /etc/rc.d/init.d/SERVICE exists, then run that script and not /etc/rc.d/SERVICE (This used to be ctdb commit 7f0c3a02ef11fd19c8cd5116fd451ebd10ba5d1b)	2007-07-15 08:54:48 +10:00
Andrew Tridgell	fc73bc5c24	added --nosetsched option to ctdbd (This used to be ctdb commit 4cbbb88c1735c7d112e751e22da1c1c69e09bf4a)	2007-07-13 08:47:02 +10:00
Andrew Tridgell	9d0a595594	check winbind in monitoring event too (This used to be ctdb commit bccba656c21d0edbd9840401a3c43a76b1b3bc05)	2007-06-17 12:05:29 +10:00
Andrew Tridgell	d683080b08	- wait for winbind on samba start - use $PATH for ctdb status (This used to be ctdb commit cf8d837cead1cbcb22c71ebbc3947970d1a565a3)	2007-06-17 11:57:42 +10:00
Andrew Tridgell	76b7361c7e	- added monitoring of rpc ports for nfs, and of Samba ports and directories - added monitoring of the ethernet link state When monitoring detects an error, the node loses its public IP address (This used to be ctdb commit 0af57aead8c983511d25774b4ffe09fa5ff26501)	2007-06-06 12:08:42 +10:00
Andrew Tridgell	e763874872	make the init scripts more portable about location of system config files (This used to be ctdb commit 65f3e2bc722e314b2c51c3bfdc544b408a8a64cf)	2007-06-03 22:07:07 +10:00
Andrew Tridgell	794d6dd59d	move config files to config/ directory (This used to be ctdb commit f95de519b885c8e1f40df0cda70fd796e479a22a)	2007-06-02 19:40:07 +10:00

345 Commits