samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-01-08 21:18:16 +03:00

Ignoring revisions in .git-blame-ignore-revs. Click here to bypass and see the normal blame view.

259 lines

7.3 KiB

Plaintext

Raw Normal View History

added hooks to make nfs statd behave correctly on failover (This used to be ctdb commit a1ee84fc47892b6c18d417ccf714211fcb07952e) 2007-05-31 05:09:45 +04:00			`#!/bin/sh`

ctdb-scripts: Fix some bit-rotted comments and whitespace The top comment in the file is no longer true. The comment about notifications doesn't really apply anymore since upstream sm-notify is used and it does "the right thing". shfmt wants to remove a space before a semicolon, so do that too. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2024-11-26 03:25:09 +03:00			`# statd must be configured to use statd_callout, CTDB's binary`
			`# counterpart to this script, as its availability call-out.`
ctdb-scripts: Use nfsconf as a last resort to set NFS_HOSTNAME If nfsconf exists then use it as last resort to attempt to extract [statd]:name from /etc/nfs.conf. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14444 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-07-13 03:16:33 +03:00			`#`
ctdb-scripts: Improve documentation Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-13 03:39:37 +03:00			`# Modern NFS utils versions use /etc/nfs.conf:`
			`#`
			`# [statd]`
			`# name = mycluster`
ctdb-failover: Split statd_callout add-client/del-client rpc.statd is single-threaded and runs its HA callout synchronously. If it is too slow then latency accumulates and rpc.statd's backlog grows. Running a pair of add-client/del-client events with the current code averages ~0.030s in my test environment. This mean that 1000 clients reclaiming locks after failover can easily cause 10s of latency. This could cause rpc.statd to become unresponsive, resulting in a time out for an rpcinfo-based health check of the status service. Split the add-client/del-client events out to a standalone statd_callout executable, written in C, to be used as the HA callout for rpc.statd. All other functions move to statd_callout_helper. Now, running a pair of add-client/del-client events in my test environment averages only ~0.002s. This seems less likely to cause latency problems. The standalone statd_callout executable needs to read a configuration file, which is generated by statd_callout_helper from the "startup" event. It also needs access to a list of currently assigned public IPs. For backward compatibility, during installation a symlink is created from $CTDB_BASE/statd-callout to the new statd_callout, which is installed in the helper directory. Testing this as part of the eventscript unit tests starts to become even more of a hack than it used to be. However, the dependency on stubs and the corresponding setup of fake state makes it hard to move this elsewhere. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jun 25 04:24:57 UTC 2024 on atb-devel-224 2024-05-10 04:42:26 +03:00			`# ha-callout = /usr/local/libexec/ctdb/statd_callout`
ctdb-scripts: Improve documentation Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-13 03:39:37 +03:00			`#`
			`# Older Linux versions may use something like the following...`
ctdb-scripts: Use nfsconf as a last resort to set NFS_HOSTNAME If nfsconf exists then use it as last resort to attempt to extract [statd]:name from /etc/nfs.conf. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14444 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-07-13 03:16:33 +03:00			`#`
			`# /etc/sysconfig/nfs (Red Hat) or /etc/default/nfs-common (Debian):`
ctdb-scripts: Remove unused variable NFS_HOSTNAME This was passed to CTDB's old smnotify. This has been replaced by use of nfs-utils' sm-notify, which doesn't need this. In test, a fake NFS_HOSTNAME is still needed. Real sm-notify will get it from a reverse host lookup of the IP address. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2024-05-01 03:22:05 +03:00			`# STATD_HOSTNAME="mycluster -H /usr/local/libexec/ctdb/statd_callout"`
ctdb-scripts: Use nfsconf as a last resort to set NFS_HOSTNAME If nfsconf exists then use it as last resort to attempt to extract [statd]:name from /etc/nfs.conf. BUG: https://bugzilla.samba.org/show_bug.cgi?id=14444 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2020-07-13 03:16:33 +03:00			`#`
ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`# If using Linux kernel NFS then the following should also be set in`
			`# /etc/nfs.conf:`
			`#`
			`# [sm-notify]`
			`# lift-grace = n`
			`#`
			`# See sm-notify(8) for details. This doesn't matter when using`
			`# NFS-Ganesha because sm-notify's attempt to lift grace will fail`
			`# silently if /proc/fs/lockd/nlm_end_grace is not found.`
			`#`
docs on how to use statd-callout (This used to be ctdb commit 4a75111b4f3f93dc42c9ced2d23f3cc933712017) 2007-06-02 13:45:06 +04:00
ctdb-scripts: Fix some bit-rotted comments and whitespace The top comment in the file is no longer true. The comment about notifications doesn't really apply anymore since upstream sm-notify is used and it does "the right thing". shfmt wants to remove a space before a semicolon, so do that too. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2024-11-26 03:25:09 +03:00			`if [ -z "$CTDB_BASE" ]; then`
ctdb-failover: Split statd_callout add-client/del-client rpc.statd is single-threaded and runs its HA callout synchronously. If it is too slow then latency accumulates and rpc.statd's backlog grows. Running a pair of add-client/del-client events with the current code averages ~0.030s in my test environment. This mean that 1000 clients reclaiming locks after failover can easily cause 10s of latency. This could cause rpc.statd to become unresponsive, resulting in a time out for an rpcinfo-based health check of the status service. Split the add-client/del-client events out to a standalone statd_callout executable, written in C, to be used as the HA callout for rpc.statd. All other functions move to statd_callout_helper. Now, running a pair of add-client/del-client events in my test environment averages only ~0.002s. This seems less likely to cause latency problems. The standalone statd_callout executable needs to read a configuration file, which is generated by statd_callout_helper from the "startup" event. It also needs access to a list of currently assigned public IPs. For backward compatibility, during installation a symlink is created from $CTDB_BASE/statd-callout to the new statd_callout, which is installed in the helper directory. Testing this as part of the eventscript unit tests starts to become even more of a hack than it used to be. However, the dependency on stubs and the corresponding setup of fake state makes it hard to move this elsewhere. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jun 25 04:24:57 UTC 2024 on atb-devel-224 2024-05-10 04:42:26 +03:00			`export CTDB_BASE="/usr/local/etc/ctdb"`
			`fi`
cope with non-standard install dirs in event scripts (This used to be ctdb commit 52fff5345873690a9cc86495f414343eaa3bd540) 2007-09-14 08:14:03 +04:00
ctdb-scripts: Update script boilerplate to avoid shellcheck warnings * Assign the output of dirname to temporary variable to avoid word splitting when directory name contains whitespace * Drop export of CTDB_BASE to avoid masking broken return value - functions file does the export anyway * Quote path when including functions file Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-06-29 10:36:05 +03:00			`. "${CTDB_BASE}/functions"`
ctdb-scripts: Rewrite statd-callout to avoid 10 minute lag This is naive and assumes no performance problems when updating persistent DBs. It also does no error handling. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-08 09:41:11 +04:00
			`# Overwrite this so we get some logging`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`die()`
ctdb-scripts: Rewrite statd-callout to avoid 10 minute lag This is naive and assumes no performance problems when updating persistent DBs. It also does no error handling. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-08 09:41:11 +04:00			`{`
ctdb-failover: Split statd_callout add-client/del-client rpc.statd is single-threaded and runs its HA callout synchronously. If it is too slow then latency accumulates and rpc.statd's backlog grows. Running a pair of add-client/del-client events with the current code averages ~0.030s in my test environment. This mean that 1000 clients reclaiming locks after failover can easily cause 10s of latency. This could cause rpc.statd to become unresponsive, resulting in a time out for an rpcinfo-based health check of the status service. Split the add-client/del-client events out to a standalone statd_callout executable, written in C, to be used as the HA callout for rpc.statd. All other functions move to statd_callout_helper. Now, running a pair of add-client/del-client events in my test environment averages only ~0.002s. This seems less likely to cause latency problems. The standalone statd_callout executable needs to read a configuration file, which is generated by statd_callout_helper from the "startup" event. It also needs access to a list of currently assigned public IPs. For backward compatibility, during installation a symlink is created from $CTDB_BASE/statd-callout to the new statd_callout, which is installed in the helper directory. Testing this as part of the eventscript unit tests starts to become even more of a hack than it used to be. However, the dependency on stubs and the corresponding setup of fake state makes it hard to move this elsewhere. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jun 25 04:24:57 UTC 2024 on atb-devel-224 2024-05-10 04:42:26 +03:00			`script_log "statd_callout_helper" "$@"`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`exit 1`
ctdb-scripts: Rewrite statd-callout to avoid 10 minute lag This is naive and assumes no performance problems when updating persistent DBs. It also does no error handling. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-08 09:41:11 +04:00			`}`

ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`############################################################`
60.nfs: we must always restart the lockmanager when the cluster has been reconfigured and ip addresses has changed. This is to make sure we get a clusterwide grace period for nfs locking. if we dont do this and only restart locking on the nodes that were direclty affected, a different client can take out a conflicting lock from a different node before affected clients has had a chance to reclaim all the locks lost during reconfigure. grace period on rhel5 kernel has bene increased to 90 seconds! statd-callout: we must restart lockmanager to ensure a clusterwide grace period for nfs. this makes locking "more correct" for nfs clients and prevents other clients/nodes from taking out a conflicting lock while a different client/node tries to reclaim lost locks. This makes it "almost consistent" for NFS clients but there is still the possibility that a cifs client can take out a conflicting lock before an nfs client has had a chance to reclaim an existing lock. This can not be solved with anything less than making the kernel nfs lock manager "samba aware" and making samba aware of the internal state of the kernel lock manager so that they can cooperate. we can not just stop/start the lockmanager back to back in rhel5 since if they are stopped/started too close to eachother then when the new lockmanager upon starting up sends out statd notifications two things can happen: 1, new lockmanager sends out notification BEFORE it has registered with portmapper leading to lockmanager starts lockmanager sends notification to the client client tries to recover the lock and tries to portmap the lockmanager port on the server. server is not (yet) registered with portmapper and server responds "no such program" to hte clients request to discover where lockmanager is. client then just completely gives up reclaiming the lock and doesnt even reattempt the portmapper call after some timeout. ==> lock reclaim failed. 2, if they are started back to back, and a client tries to reclaim the lock the lockmanager sometimes sends two responses back to back to the client. one with status NLM_GRANTED (==you got the lock reclaimed) and one with status NLM_DENIED (==you could not get the lock reclaimed) This confuses the client and leads to the server thinking that the client does have the lock and the client thinking it has not got the lock and orphaned locks result. We also send out additional notification messages of different formats to allow more legacy clients to interoperate with locking. (This used to be ctdb commit 13208c1aab2942e28dff87e38e6794bf0c026033) 2007-09-07 02:52:56 +04:00
ctdb-scripts: Use ctdb_setup_state_dir() Replace all uses of ctdb_setup_service_state_dir() by ctdb_setup_state_dir(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2018-03-07 03:12:29 +03:00			`ctdb_setup_state_dir "service" "nfs"`
ctdb-scripts: Change statd-callout to be more scalable Updating ctdb.tdb on each add-client, del-client and each delete during notify was too ambitious. Persistent transactions do not perform well enough to do this. Revert to having add-client and del-client create touch files. Each monitor event calls "statd-callout update" to convert touch files into ctdb.tdb records. Update testcases to do the "update" and add an extra test. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-13 12:55:43 +03:00
ctdb-scripts: Set ownership of statd-callout state directory For add-client and del-client, statd-callout is called by rpc.statd, which runs as rpcuser, statd or some other non-root system user. This means that add-client and del-client can't write in the statd-callout state directory if it is only writable by root. rpc.statd must be able to write to its own local system statd state directory, so find this directory and use it as a reference to set the ownership of CTDB's statd-callout state directory. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-19 05:17:44 +03:00			`find_statd_sm_dir()`
			`{`
			`if [ -n "$CTDB_TEST_MODE" ]; then`
			`_f="${CTDB_TEST_TMP_DIR}/sm"`
			`mkdir -p "$_f" "${_f}.bak"`
			`echo "$_f"`
			`return`
			`fi`

			`for _sm_dir in /var/lib/nfs/statd/sm /var/lib/nfs/sm; do`
			`if [ -d "$_sm_dir" ]; then`
			`echo "$_sm_dir"`
			`break`
			`fi`
			`done`
			`}`

			`# Ensure the state directory exists and can be written when called as`
			`# a non-root user. Assume the user to run as is the owner of the`
			`# system statd sm directory, since both rpc.statd and sm-notify run as`
			`# this directory's owner, so it can read and modify the directory.`
			`create_add_del_client_dir()`
			`{`
			`_dir="$1"`

			`if [ ! -d "$_dir" ]; then`
			`mkdir -p "$_dir" \|\| die "Failed to create directory \"${_dir}\""`
			`ref=$(find_statd_sm_dir)`
			`[ -n "$ref" ] \|\| die "Failed to find statd sm directory"`
			`chown --reference="$ref" "$_dir"`
			`fi`
			`}`

ctdb-scripts: Use ctdb_setup_state_dir() Replace all uses of ctdb_setup_service_state_dir() by ctdb_setup_state_dir(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2018-03-07 03:12:29 +03:00			`# script_state_dir set by ctdb_setup_state_dir()`
			`# shellcheck disable=SC2154`
ctdb-failover: Split statd_callout add-client/del-client rpc.statd is single-threaded and runs its HA callout synchronously. If it is too slow then latency accumulates and rpc.statd's backlog grows. Running a pair of add-client/del-client events with the current code averages ~0.030s in my test environment. This mean that 1000 clients reclaiming locks after failover can easily cause 10s of latency. This could cause rpc.statd to become unresponsive, resulting in a time out for an rpcinfo-based health check of the status service. Split the add-client/del-client events out to a standalone statd_callout executable, written in C, to be used as the HA callout for rpc.statd. All other functions move to statd_callout_helper. Now, running a pair of add-client/del-client events in my test environment averages only ~0.002s. This seems less likely to cause latency problems. The standalone statd_callout executable needs to read a configuration file, which is generated by statd_callout_helper from the "startup" event. It also needs access to a list of currently assigned public IPs. For backward compatibility, during installation a symlink is created from $CTDB_BASE/statd-callout to the new statd_callout, which is installed in the helper directory. Testing this as part of the eventscript unit tests starts to become even more of a hack than it used to be. However, the dependency on stubs and the corresponding setup of fake state makes it hard to move this elsewhere. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jun 25 04:24:57 UTC 2024 on atb-devel-224 2024-05-10 04:42:26 +03:00			`statd_callout_state_dir="${script_state_dir}/statd_callout"`
ctdb-scripts: Use ctdb_setup_state_dir() Replace all uses of ctdb_setup_service_state_dir() by ctdb_setup_state_dir(). Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2018-03-07 03:12:29 +03:00
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`statd_callout_db="ctdb.tdb"`
ctdb-scripts: Avoid globally changing to queue directory Add new variables statd_callout_state_dir and statd_callout_queue_dir - the latter is for files queued by add-client/del-client. Use $statd_callout_queue_dir to avoid a global cd to the queue directory near the top of the script. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2024-05-08 07:44:13 +03:00			`statd_callout_queue_dir="${statd_callout_state_dir}/queue"`
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`############################################################`

ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`# Read pairs of:`
			`# server-IP client-IP`
			`# from stdin and send associated SM_NOTIFY packets.`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`send_notifies()`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`{`
			`# State must monotonically increase, across the entire`
ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`# cluster. Use seconds since epoch and assume the time is in`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`# sync across nodes. Even numbers mean service is shut down,`
ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`# odd numbers mean service is up. However, sm-notify always`
			`# reads the state and converts it to odd (if necessary, by`
			`# adding 1 when it is even) because it only sends "up"`
			`# notifications. Note that there is a 2038 issue here but we`
			`# will get to that later.`
			`_state=$(date '+%s')`

			`_helper="${CTDB_HELPER_BINDIR}/ctdb_smnotify_helper"`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00
ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`_notify_dir="${statd_callout_state_dir}/sm-notify"`
			`mkdir -p "$_notify_dir"`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00
ctdb-scripts: Avoid ShellCheck warning SC2162 SC2162 read without -r will mangle backslashes. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-19 02:43:33 +03:00			`while read -r _sip _cip; do`
ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`# Create a directory per server IP containing a file`
			`# for each client IP`
			`mkdir -p \`
			`"${_notify_dir}/${_sip}/sm" \`
			`"${_notify_dir}/${_sip}/sm.bak"`

			`_out="${_notify_dir}/${_sip}/sm/${_cip}"`
			`"$_helper" "monitor" "$_cip" "$_sip" >"$_out"`
			`done`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00
ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`# Send notifications for server startup`
			`_ref=$(find_statd_sm_dir)`
			`for _sip_dir in "$_notify_dir"/*; do`
			`if [ "$_sip_dir" = "${_notify_dir}/*" ]; then`
			`break`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`fi`

ctdb-scripts: Use nfs-utils' sm-notify instead of CTDB's smnotify CTDB's smnotify does not support IPv6 and is difficult to maintain. So, create directories of files and pass them to NFS util's sm-notify. There is an implied change here, because NFS utils sm-notify stopped sending IP addresses as mon_name back in 2010: http://git.linux-nfs.org/?p=steved/nfs-utils.git;a=commitdiff;h=900df0e7c0b9006d72d8459b30dc2cd69ce495a5 This will change advice given in the wiki to use a hostname for the cluster with round-robin DNS, since this is what is best supported. Another behavioural change is that sm-notify only sends "up" notifications with an odd state. Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-03 07:44:08 +03:00			`_sip="${_sip_dir##*/}" # basename`

			`# Write the state as a host order 32-bit integer. See`
			`# note at top of function about state.`
			`_out="${_sip_dir}/state"`
			`"$_helper" "state" "$_state" >"$_out"`

			`# The ownership of the directory and contents should`
			`# match the system's statd sm directory, so that`
			`# sm-notify drops privileges and switches to run as`
			`# the directory owner.`
			`chown -R --reference="$_ref" "$_sip_dir"`
			`timeout 10 sm-notify -d -f -m 0 -n -P "$_sip_dir" -v "$_sip"`

			`rm -rf "$_sip_dir"`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`done`
			`}`

ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`delete_records()`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`{`
ctdb-scripts: Avoid ShellCheck warning SC2162 SC2162 read without -r will mangle backslashes. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-19 02:43:33 +03:00			`while read -r _sip _cip; do`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`_key="statd-state@${_sip}@${_cip}"`
			`echo "\"${_key}\" \"\""`
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`done \| $CTDB ptrans "$statd_callout_db"`
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`}`

			`############################################################`

ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`# Keep a file per server-IP/client-IP pair, to keep track of the last`
			`# "add-client" or "del-client'. These get pushed to a database during`
			`# "update", which will generally be run once each "monitor" cycle. In`
			`# this way we avoid scalability problems with flood of persistent`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`# transactions after a "notify" when all the clients re-take their`
			`# locks.`

ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`startup()`
			`{`
ctdb-scripts: Set ownership of statd-callout state directory For add-client and del-client, statd-callout is called by rpc.statd, which runs as rpcuser, statd or some other non-root system user. This means that add-client and del-client can't write in the statd-callout state directory if it is only writable by root. rpc.statd must be able to write to its own local system statd state directory, so find this directory and use it as a reference to set the ownership of CTDB's statd-callout state directory. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-19 05:17:44 +03:00			`create_add_del_client_dir "$statd_callout_queue_dir"`
ctdb-scripts: Move state directory creation to "startup" action Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:25:03 +03:00
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`$CTDB attach "$statd_callout_db" persistent`
ctdb-failover: Split statd_callout add-client/del-client rpc.statd is single-threaded and runs its HA callout synchronously. If it is too slow then latency accumulates and rpc.statd's backlog grows. Running a pair of add-client/del-client events with the current code averages ~0.030s in my test environment. This mean that 1000 clients reclaiming locks after failover can easily cause 10s of latency. This could cause rpc.statd to become unresponsive, resulting in a time out for an rpcinfo-based health check of the status service. Split the add-client/del-client events out to a standalone statd_callout executable, written in C, to be used as the HA callout for rpc.statd. All other functions move to statd_callout_helper. Now, running a pair of add-client/del-client events in my test environment averages only ~0.002s. This seems less likely to cause latency problems. The standalone statd_callout executable needs to read a configuration file, which is generated by statd_callout_helper from the "startup" event. It also needs access to a list of currently assigned public IPs. For backward compatibility, during installation a symlink is created from $CTDB_BASE/statd-callout to the new statd_callout, which is installed in the helper directory. Testing this as part of the eventscript unit tests starts to become even more of a hack than it used to be. However, the dependency on stubs and the corresponding setup of fake state makes it hard to move this elsewhere. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Tue Jun 25 04:24:57 UTC 2024 on atb-devel-224 2024-05-10 04:42:26 +03:00
			`_default="${CTDB_SCRIPT_VARDIR}/statd_callout.conf"`
			`_config_file="${CTDB_STATD_CALLOUT_CONFIG_FILE:-"${_default}"}"`
			`cat >"$_config_file" <<EOF`
			`persistent_db`
			`${statd_callout_queue_dir}`
			`${CTDB_MY_PUBLIC_IPS_CACHE}`
			`EOF`
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`}`

			`############################################################`

			`case "$1" in`
			`startup)`
			`startup`
			`;;`

ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`update)`
ctdb-scripts: Avoid globally changing to queue directory Add new variables statd_callout_state_dir and statd_callout_queue_dir - the latter is for files queued by add-client/del-client. Use $statd_callout_queue_dir to avoid a global cd to the queue directory near the top of the script. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2024-05-08 07:44:13 +03:00			`cd "$statd_callout_queue_dir" \|\|`
			`die "Failed to change directory to \"${statd_callout_queue_dir}\""`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`files=$(echo statd-state@*)`
			`if [ "$files" = "statd-state@*" ]; then`
			`# No files!`
			`exit 0`
ctdb-scripts: Change statd-callout to be more scalable Updating ctdb.tdb on each add-client, del-client and each delete during notify was too ambitious. Persistent transactions do not perform well enough to do this. Revert to having add-client and del-client create touch files. Each monitor event calls "statd-callout update" to convert touch files into ctdb.tdb records. Update testcases to do the "update" and add an extra test. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-13 12:55:43 +03:00			`fi`
ctdb-scripts: Avoid connecting to ctdbd in add-client/del-client rpc.statd runs statd-callout as a non-root user, which is currently hacked around using some sudo logic that fails to work in some contexts (e.g. in a container). Use $CTDB_MY_PUBLIC_IPS_CACHE to access the node's currently assigned public IPs, for add-client/del-client. This avoids connecting to ctdbd when called from rpc.statd. Also, use $CTDB_MY_PUBLIC_IPS_CACHE in other places where it makes sense. Connections to ctdbd are still made in the "notify" action, but this is always run as root. In the test code, set the PNN after public addresses setup so that the cache of assigned IPs correctly initialised. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 03:12:44 +03:00			`sed_expr=$(awk '{`
			`ip = $1; gsub(/\./, "\\.", ip);`
			`printf "/statd-state@%s@/p\n", ip }' "$CTDB_MY_PUBLIC_IPS_CACHE")`
ctdb-scripts: Avoid shellcheck warnings SC2046, SC2086 (double-quoting) SC2046: Quote this to prevent word splitting. SC2086: Double quote to prevent globbing and word splitting. Add some quoting where it makes sense. Use shellcheck directives for false-positives. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2016-07-06 10:31:51 +03:00			`# Intentional multi-word expansion for multiple files`
			`# shellcheck disable=SC2086`
ctdb-scripts: Avoid no-op "ctdb ptrans" call This causes unnecessary g_lock activity and overhead. This could be optimised in ctdb.c:control_ptrans(). However, that makes the code more complex. Let's only do that if we get more potentially no-op uses. Note no optimisation is needed in the "notify" case because there is already an early exit if there are no items. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-01-04 01:53:54 +03:00			`items=$(sed -n "$sed_expr" $files)`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`if [ -n "$items" ]; then`
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`if echo "$items" \| $CTDB ptrans "$statd_callout_db"; then`
ctdb-scripts: Avoid no-op "ctdb ptrans" call This causes unnecessary g_lock activity and overhead. This could be optimised in ctdb.c:control_ptrans(). However, that makes the code more complex. Let's only do that if we get more potentially no-op uses. Note no optimisation is needed in the "notify" case because there is already an early exit if there are no items. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-01-04 01:53:54 +03:00			`# shellcheck disable=SC2086`
			`rm $files`
			`fi`
ctdb-scripts: Change statd-callout to be more scalable Updating ctdb.tdb on each add-client, del-client and each delete during notify was too ambitious. Persistent transactions do not perform well enough to do this. Revert to having add-client and del-client create touch files. Each monitor event calls "statd-callout update" to convert touch files into ctdb.tdb records. Update testcases to do the "update" and add an extra test. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-13 12:55:43 +03:00			`fi`
ctdb-scripts: Avoid no-op "ctdb ptrans" call This causes unnecessary g_lock activity and overhead. This could be optimised in ctdb.c:control_ptrans(). However, that makes the code more complex. Let's only do that if we get more potentially no-op uses. Note no optimisation is needed in the "notify" case because there is already an early exit if there are no items. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-01-04 01:53:54 +03:00			`;;`
ctdb-scripts: Change statd-callout to be more scalable Updating ctdb.tdb on each add-client, del-client and each delete during notify was too ambitious. Persistent transactions do not perform well enough to do this. Revert to having add-client and del-client create touch files. Each monitor event calls "statd-callout update" to convert touch files into ctdb.tdb records. Update testcases to do the "update" and add an extra test. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-13 12:55:43 +03:00
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`notify)`
60.nfs: we must always restart the lockmanager when the cluster has been reconfigured and ip addresses has changed. This is to make sure we get a clusterwide grace period for nfs locking. if we dont do this and only restart locking on the nodes that were direclty affected, a different client can take out a conflicting lock from a different node before affected clients has had a chance to reclaim all the locks lost during reconfigure. grace period on rhel5 kernel has bene increased to 90 seconds! statd-callout: we must restart lockmanager to ensure a clusterwide grace period for nfs. this makes locking "more correct" for nfs clients and prevents other clients/nodes from taking out a conflicting lock while a different client/node tries to reclaim lost locks. This makes it "almost consistent" for NFS clients but there is still the possibility that a cifs client can take out a conflicting lock before an nfs client has had a chance to reclaim an existing lock. This can not be solved with anything less than making the kernel nfs lock manager "samba aware" and making samba aware of the internal state of the kernel lock manager so that they can cooperate. we can not just stop/start the lockmanager back to back in rhel5 since if they are stopped/started too close to eachother then when the new lockmanager upon starting up sends out statd notifications two things can happen: 1, new lockmanager sends out notification BEFORE it has registered with portmapper leading to lockmanager starts lockmanager sends notification to the client client tries to recover the lock and tries to portmap the lockmanager port on the server. server is not (yet) registered with portmapper and server responds "no such program" to hte clients request to discover where lockmanager is. client then just completely gives up reclaiming the lock and doesnt even reattempt the portmapper call after some timeout. ==> lock reclaim failed. 2, if they are started back to back, and a client tries to reclaim the lock the lockmanager sometimes sends two responses back to back to the client. one with status NLM_GRANTED (==you got the lock reclaimed) and one with status NLM_DENIED (==you could not get the lock reclaimed) This confuses the client and leads to the server thinking that the client does have the lock and the client thinking it has not got the lock and orphaned locks result. We also send out additional notification messages of different formats to allow more legacy clients to interoperate with locking. (This used to be ctdb commit 13208c1aab2942e28dff87e38e6794bf0c026033) 2007-09-07 02:52:56 +04:00			`# we must restart the lockmanager (on all nodes) so that we get`
Fix various spelling errors Reviewed-by: Andrew Bartlett <abartlet@samba.org> Reviewed-by: Michael Adam <obnox@samba.org> Autobuild-User(master): Andrew Bartlett <abartlet@samba.org> Autobuild-Date(master): Fri Nov 6 13:43:45 CET 2015 on sn-devel-104 2015-07-27 00:02:57 +03:00			`# a clusterwide grace period (so other clients don't take out`
60.nfs: we must always restart the lockmanager when the cluster has been reconfigured and ip addresses has changed. This is to make sure we get a clusterwide grace period for nfs locking. if we dont do this and only restart locking on the nodes that were direclty affected, a different client can take out a conflicting lock from a different node before affected clients has had a chance to reclaim all the locks lost during reconfigure. grace period on rhel5 kernel has bene increased to 90 seconds! statd-callout: we must restart lockmanager to ensure a clusterwide grace period for nfs. this makes locking "more correct" for nfs clients and prevents other clients/nodes from taking out a conflicting lock while a different client/node tries to reclaim lost locks. This makes it "almost consistent" for NFS clients but there is still the possibility that a cifs client can take out a conflicting lock before an nfs client has had a chance to reclaim an existing lock. This can not be solved with anything less than making the kernel nfs lock manager "samba aware" and making samba aware of the internal state of the kernel lock manager so that they can cooperate. we can not just stop/start the lockmanager back to back in rhel5 since if they are stopped/started too close to eachother then when the new lockmanager upon starting up sends out statd notifications two things can happen: 1, new lockmanager sends out notification BEFORE it has registered with portmapper leading to lockmanager starts lockmanager sends notification to the client client tries to recover the lock and tries to portmap the lockmanager port on the server. server is not (yet) registered with portmapper and server responds "no such program" to hte clients request to discover where lockmanager is. client then just completely gives up reclaiming the lock and doesnt even reattempt the portmapper call after some timeout. ==> lock reclaim failed. 2, if they are started back to back, and a client tries to reclaim the lock the lockmanager sometimes sends two responses back to back to the client. one with status NLM_GRANTED (==you got the lock reclaimed) and one with status NLM_DENIED (==you could not get the lock reclaimed) This confuses the client and leads to the server thinking that the client does have the lock and the client thinking it has not got the lock and orphaned locks result. We also send out additional notification messages of different formats to allow more legacy clients to interoperate with locking. (This used to be ctdb commit 13208c1aab2942e28dff87e38e6794bf0c026033) 2007-09-07 02:52:56 +04:00			`# conflicting locks through other nodes before all locks have been`
			`# reclaimed)`

			`# we need these settings to make sure that no tcp connections survive`
			`# across a very fast failover/failback`
dont set parameters in statd-callout if they should be set they bshould be set from 10.interfaces (This used to be ctdb commit 0c7c2dae0a976922de58793d576855bc37cd38e1) 2007-10-22 04:18:38 +04:00			`#echo 10 > /proc/sys/net/ipv4/tcp_fin_timeout`
dont set some of the sysctl variables in statd-callout. these are mainly useful for avoiding ack-storms when doing very rapid failover/failback during testing but should not be required in real-world. this gets rid of a lof of annoying messages from the messages file (This used to be ctdb commit 50d289dcce2caa7c7be9b6faa3b38b69c2237038) 2007-10-21 00:42:33 +04:00			`#echo 0 > /proc/sys/net/ipv4/tcp_max_tw_buckets`
			`#echo 0 > /proc/sys/net/ipv4/tcp_max_orphans`
60.nfs: we must always restart the lockmanager when the cluster has been reconfigured and ip addresses has changed. This is to make sure we get a clusterwide grace period for nfs locking. if we dont do this and only restart locking on the nodes that were direclty affected, a different client can take out a conflicting lock from a different node before affected clients has had a chance to reclaim all the locks lost during reconfigure. grace period on rhel5 kernel has bene increased to 90 seconds! statd-callout: we must restart lockmanager to ensure a clusterwide grace period for nfs. this makes locking "more correct" for nfs clients and prevents other clients/nodes from taking out a conflicting lock while a different client/node tries to reclaim lost locks. This makes it "almost consistent" for NFS clients but there is still the possibility that a cifs client can take out a conflicting lock before an nfs client has had a chance to reclaim an existing lock. This can not be solved with anything less than making the kernel nfs lock manager "samba aware" and making samba aware of the internal state of the kernel lock manager so that they can cooperate. we can not just stop/start the lockmanager back to back in rhel5 since if they are stopped/started too close to eachother then when the new lockmanager upon starting up sends out statd notifications two things can happen: 1, new lockmanager sends out notification BEFORE it has registered with portmapper leading to lockmanager starts lockmanager sends notification to the client client tries to recover the lock and tries to portmap the lockmanager port on the server. server is not (yet) registered with portmapper and server responds "no such program" to hte clients request to discover where lockmanager is. client then just completely gives up reclaiming the lock and doesnt even reattempt the portmapper call after some timeout. ==> lock reclaim failed. 2, if they are started back to back, and a client tries to reclaim the lock the lockmanager sometimes sends two responses back to back to the client. one with status NLM_GRANTED (==you got the lock reclaimed) and one with status NLM_DENIED (==you could not get the lock reclaimed) This confuses the client and leads to the server thinking that the client does have the lock and the client thinking it has not got the lock and orphaned locks result. We also send out additional notification messages of different formats to allow more legacy clients to interoperate with locking. (This used to be ctdb commit 13208c1aab2942e28dff87e38e6794bf0c026033) 2007-09-07 02:52:56 +04:00
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`# Delete the notification list for statd, we don't want it to`
Remove the dependency on the underlying cluster filesystem for handling the clusterwide persistent data associated with the lock manager and statd notifications. Use persistent databases to store this data instead of a shared directory. (This used to be ctdb commit fc0678d351187cfa4c71123f97c0f493aacd5d16) 2010-08-30 12:13:28 +04:00			`# ping any clients`
ctdb-scripts: Use find_statd_sm_dir() in one more place Take advantage of new function find_statd_sm_dir() when clearing the local system statd state directory, so it uses the correct directory when running on a non-RH distro. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-08-02 06:37:03 +03:00			`dir=$(find_statd_sm_dir)`
			`rm -f "${dir}/"* "${dir}.bak/"*`
add a short delay after stopping nfslock to make it less likely that "weird" things happen (This used to be ctdb commit 4934c083cbcc19714094e08a0b7da1fb6fdc8a5a) 2007-09-07 06:14:53 +04:00
ctdb-scripts: Parameterise 60.nfs with $CTDB_NFS_CALLOUT The goal is to have a single NFS eventscript. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-06-24 14:36:14 +03:00			`# We must also let some time pass between stopping and`
			`# restarting the lock manager. Otherwise there is a window`
			`# where the lock manager will respond "strangely" immediately`
			`# after restarting it, which causes clients to fail to reclaim`
			`# their locks.`
ctdb-scripts: Initialise CTDB_NFS_CALLOUT in statd-callout Some configurations may set CTDB_NFS_CALLOUT to the empty string. They may do this if they allow a choice of NFS implementations. In this case the default call-out for Linux kernel NFS should be used. However, statd-callout does not call nfs_callout_init() to set the default. Therefore, statd-callout is unable to restart the lock manager, so the grace period is never entered. statd-callout must call nfs_callout_init() before trying to restart the lock manager. BUG: https://bugzilla.samba.org/show_bug.cgi?id=12589 Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> Autobuild-User(master): Martin Schwenke <martins@samba.org> Autobuild-Date(master): Thu Feb 16 09:21:03 CET 2017 on sn-devel-144 2017-02-14 01:04:41 +03:00			`nfs_callout_init`
ctdb-scripts: Remove 60.ganesha, replace with callout for 60.nfs This isn't a straightforward move of code from 60.ganesha to the callout. Simplifications have been made to allow better interoperation with the new NFS checking logic. The following configuration variables have been removed: CTDB_GANESHA_REC_SUBDIR Edit NFS ganesha callout to change this location CTDB_NFS_SERVER_MODE, NFS_SERVER_MODE Use CTDB_NFS_CALLOUT instead CTDB_NFS_SKIP_KNFSD_ALIVE_CHECK, CTDB_SKIP_GANESHA_NFSD_CHECK Disable the corresponding .check file instead Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-07-01 11:32:35 +03:00			`"$CTDB_NFS_CALLOUT" "stop" "nlockmgr" >/dev/null 2>&1`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`sleep 2`
ctdb-scripts: Remove 60.ganesha, replace with callout for 60.nfs This isn't a straightforward move of code from 60.ganesha to the callout. Simplifications have been made to allow better interoperation with the new NFS checking logic. The following configuration variables have been removed: CTDB_GANESHA_REC_SUBDIR Edit NFS ganesha callout to change this location CTDB_NFS_SERVER_MODE, NFS_SERVER_MODE Use CTDB_NFS_CALLOUT instead CTDB_NFS_SKIP_KNFSD_ALIVE_CHECK, CTDB_SKIP_GANESHA_NFSD_CHECK Disable the corresponding .check file instead Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-07-01 11:32:35 +03:00			`"$CTDB_NFS_CALLOUT" "start" "nlockmgr" >/dev/null 2>&1`
60.nfs: we must always restart the lockmanager when the cluster has been reconfigured and ip addresses has changed. This is to make sure we get a clusterwide grace period for nfs locking. if we dont do this and only restart locking on the nodes that were direclty affected, a different client can take out a conflicting lock from a different node before affected clients has had a chance to reclaim all the locks lost during reconfigure. grace period on rhel5 kernel has bene increased to 90 seconds! statd-callout: we must restart lockmanager to ensure a clusterwide grace period for nfs. this makes locking "more correct" for nfs clients and prevents other clients/nodes from taking out a conflicting lock while a different client/node tries to reclaim lost locks. This makes it "almost consistent" for NFS clients but there is still the possibility that a cifs client can take out a conflicting lock before an nfs client has had a chance to reclaim an existing lock. This can not be solved with anything less than making the kernel nfs lock manager "samba aware" and making samba aware of the internal state of the kernel lock manager so that they can cooperate. we can not just stop/start the lockmanager back to back in rhel5 since if they are stopped/started too close to eachother then when the new lockmanager upon starting up sends out statd notifications two things can happen: 1, new lockmanager sends out notification BEFORE it has registered with portmapper leading to lockmanager starts lockmanager sends notification to the client client tries to recover the lock and tries to portmap the lockmanager port on the server. server is not (yet) registered with portmapper and server responds "no such program" to hte clients request to discover where lockmanager is. client then just completely gives up reclaiming the lock and doesnt even reattempt the portmapper call after some timeout. ==> lock reclaim failed. 2, if they are started back to back, and a client tries to reclaim the lock the lockmanager sometimes sends two responses back to back to the client. one with status NLM_GRANTED (==you got the lock reclaimed) and one with status NLM_DENIED (==you could not get the lock reclaimed) This confuses the client and leads to the server thinking that the client does have the lock and the client thinking it has not got the lock and orphaned locks result. We also send out additional notification messages of different formats to allow more legacy clients to interoperate with locking. (This used to be ctdb commit 13208c1aab2942e28dff87e38e6794bf0c026033) 2007-09-07 02:52:56 +04:00
ctdb-scripts: Rewrite statd-callout to avoid 10 minute lag This is naive and assumes no performance problems when updating persistent DBs. It also does no error handling. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-08 09:41:11 +04:00			`# Construct a sed expression to take catdb output and produce pairs of:`
			`# server-IP client-IP`
			`# but only for the server-IPs that are hosted on this node.`
ctdb-scripts: Avoid connecting to ctdbd in add-client/del-client rpc.statd runs statd-callout as a non-root user, which is currently hacked around using some sudo logic that fails to work in some contexts (e.g. in a container). Use $CTDB_MY_PUBLIC_IPS_CACHE to access the node's currently assigned public IPs, for add-client/del-client. This avoids connecting to ctdbd when called from rpc.statd. Also, use $CTDB_MY_PUBLIC_IPS_CACHE in other places where it makes sense. Connections to ctdbd are still made in the "notify" action, but this is always run as root. In the test code, set the PNN after public addresses setup so that the cache of assigned IPs correctly initialised. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 03:12:44 +03:00			`sed_expr=$(awk '{`
			`ip = $1; gsub(/\./, "\\.", ip);`
			`printf "s/^key.=.statd-state@\$%s\$@\$[^\"]\$./\\1 \\2/p\n", ip }' \`
			`"$CTDB_MY_PUBLIC_IPS_CACHE")`
ctdb-scripts: Rewrite statd-callout to avoid 10 minute lag This is naive and assumes no performance problems when updating persistent DBs. It also does no error handling. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-08 09:41:11 +04:00
ctdb-scripts: Move ctdb.tdb attach to statd-callout All of the other uses of ctdb.tdb are in statd-callout. New variable statd_callout_db makes it easy to change the database name in future, perhaps even allowing it to be configurable. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 06:11:46 +03:00			`statd_state=$($CTDB catdb "$statd_callout_db" \|`
			`sed -n "$sed_expr" \|`
			`sort)`
ctdb-scripts: Add an early exit to statd-callout's notify case If $statd_state is empty then the loop will run once and print spurious errors. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-19 08:40:08 +04:00			`[ -n "$statd_state" ] \|\| exit 0`
ctdb-scripts: Rewrite statd-callout to avoid 10 minute lag This is naive and assumes no performance problems when updating persistent DBs. It also does no error handling. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Michael Adam <obnox@samba.org> 2013-11-08 09:41:11 +04:00
ctdb-scripts: Clean up statd-callout This means there will be 2 loops reading the data but the code flow is much more obvious. Signed-off-by: Martin Schwenke <martin@meltin.net> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2017-03-02 08:43:51 +03:00			`echo "$statd_state" \| send_notifies`
			`echo "$statd_state" \| delete_records`
ctdb-scripts: Change statd-callout to be more scalable Updating ctdb.tdb on each add-client, del-client and each delete during notify was too ambitious. Persistent transactions do not perform well enough to do this. Revert to having add-client and del-client create touch files. Each monitor event calls "statd-callout update" to convert touch files into ctdb.tdb records. Update testcases to do the "update" and add an extra test. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> Reviewed-by: Amitay Isaacs <amitay@gmail.com> 2015-02-13 12:55:43 +03:00
			`# Remove any stale touch files (i.e. for IPs not currently`
			`# hosted on this node and created since the last "update").`
			`# There's nothing else we can do with them at this stage.`
ctdb-scripts: Avoid connecting to ctdbd in add-client/del-client rpc.statd runs statd-callout as a non-root user, which is currently hacked around using some sudo logic that fails to work in some contexts (e.g. in a container). Use $CTDB_MY_PUBLIC_IPS_CACHE to access the node's currently assigned public IPs, for add-client/del-client. This avoids connecting to ctdbd when called from rpc.statd. Also, use $CTDB_MY_PUBLIC_IPS_CACHE in other places where it makes sense. Connections to ctdbd are still made in the "notify" action, but this is always run as root. In the test code, set the PNN after public addresses setup so that the cache of assigned IPs correctly initialised. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-29 03:12:44 +03:00			`pnn=$(ctdb_get_pnn)`
			`$CTDB ip all \|`
			`tail -n +2 \|`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`awk -v pnn="$pnn" 'pnn != $2 { print $1 }' \|`
ctdb-scripts: Avoid ShellCheck warning SC2162 SC2162 read without -r will mangle backslashes. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-19 02:43:33 +03:00			`while read -r sip; do`
ctdb-scripts: Avoid globally changing to queue directory Add new variables statd_callout_state_dir and statd_callout_queue_dir - the latter is for files queued by add-client/del-client. Use $statd_callout_queue_dir to avoid a global cd to the queue directory near the top of the script. Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2024-05-08 07:44:13 +03:00			`rm -f "${statd_callout_queue_dir}/statd-state@${sip}@"*`
ctdb-scripts: Reformat with "shfmt -w -p -i 0 -fn" Best reviewed with "git show -w". Signed-off-by: Martin Schwenke <mschwenke@ddn.com> Reviewed-by: Volker Lendecke <vl@samba.org> 2023-06-16 04:09:02 +03:00			`done`
added hooks to make nfs statd behave correctly on failover (This used to be ctdb commit a1ee84fc47892b6c18d417ccf714211fcb07952e) 2007-05-31 05:09:45 +04:00			`;;`
			`esac`

259 lines 7.3 KiB Plaintext Raw Normal View History Unescape Escape

259 lines

7.3 KiB

Plaintext

Raw Normal View History