samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2024-12-24 21:34:56 +03:00

Author	SHA1	Message	Date
Ronnie Sahlberg	8edcd3f83f	during startup make sure to delete any public addresses from any interface (This used to be ctdb commit 18d80ea6db39e61f60e4c01de164d58bcbd8ab10)	2007-09-14 10:37:10 +10:00
Ronnie Sahlberg	6052078b53	let each node verify that they have a correct assignment of public ip addresses (i.e. htey hold those they should hold and they dont hold any of those they shouldnt hold) if an inconsistency is found, mark the local node as recovery mode active and wait for the recovery master to trigger a full blown recovery (This used to be ctdb commit 55a5bfc8244c5b9cdda3f11992f384f00566b5dc)	2007-09-14 10:16:36 +10:00
Ronnie Sahlberg	4c20141659	update the section about event scripts (This used to be ctdb commit a0744480c85a4e8648bd0ae7600f90d311b931ea)	2007-09-14 08:56:27 +10:00
Ronnie Sahlberg	528b8af87f	disable nfsv4 in etc/sysconfig/nfs (This used to be ctdb commit b71e11f0e27bb3ff908ad171aa5b1f724609ad05)	2007-09-14 08:15:24 +10:00
Ronnie Sahlberg	4186d8eaba	when a ctdb_takeover_run has failed we must make sure that need_takeover_run is set to true or else we might forget to rerun it again during the next recovery othervise, need_takeover_run is only set to true IFF the node flags for a remote node and the local nodes differ. It is possible that a takeover run fails and thus the reassignment of ip addresses is incomplete but before we get back to the test in monitor_cluster() that all the node flags of all nodes have converged and they now match each others again. and thus causing monitor_cluster() to fail to realize that a takeover run is needed. (This used to be ctdb commit ae7e866787cebd14394983ce1834387c959d1022)	2007-09-13 14:51:37 +10:00
Ronnie Sahlberg	ab1c8c074e	merge from tridge (This used to be ctdb commit eda3caa77be352967a41ff9bddda5296c94797a9)	2007-09-13 14:28:18 +10:00
Andrew Tridgell	9d50595b8a	prevent recursion in the calling of ctdb_takeover_run (This used to be ctdb commit 0fbdeb7c91b965d9bc5ecc7b24e31070378d8f1d)	2007-09-13 14:08:18 +10:00
Andrew Tridgell	6fa6101b1a	more shell scripting fixes in 10.interface (This used to be ctdb commit 4ee2230b3f2ae7437a9d0cf973eb4645d276accd)	2007-09-13 11:57:42 +10:00
Andrew Tridgell	30de14fe79	force recovery if unable to tell a node to release an IP (This used to be ctdb commit 6895788d2499344a03357e5c1103cb8383e9eaf7)	2007-09-13 11:19:49 +10:00
Andrew Tridgell	25940014c0	fixed script errors in 10.interface (This used to be ctdb commit 0c759614d27758cef3eba5942b2cccad54193cbb)	2007-09-13 11:19:30 +10:00
Andrew Tridgell	3c0f61cb92	we don't need the is_loopback logic in ctdb any more (This used to be ctdb commit 4ecf29ade0099c7180932288191de9840c8d90a9)	2007-09-13 10:45:06 +10:00
Andrew Tridgell	4f261ae191	remove more cruft from the logs (This used to be ctdb commit b67f35c483b6cbb5facaa6380c7794709f44213a)	2007-09-13 10:39:05 +10:00
Andrew Tridgell	023b885793	new approach for killing TCP connections on IP release (This used to be ctdb commit c33a0db29b5604966f582b1f8c5fd66760c72197)	2007-09-13 10:24:48 +10:00
Andrew Tridgell	1b53ecc445	remove clutter from ctdb log file (This used to be ctdb commit 54d5dcaaee0498f40bbee5059cc72d0ca75d33b7)	2007-09-13 10:03:18 +10:00
Andrew Tridgell	a919f6927a	fixed return code (This used to be ctdb commit 30165b5a19f9bd9d1f62c9c222df0711c1c6a927)	2007-09-13 10:02:56 +10:00
Andrew Tridgell	96c54c6188	handle hung or slow ctdb daemons on shutdown (This used to be ctdb commit a3089211782ab12387c1b04efa28914c94d89b30)	2007-09-12 13:26:24 +10:00
Andrew Tridgell	6c77184d96	- set arp_ignore to prevent replying to arp requests for addresses on loopback - put removed IPs on loopback with scope host - check for nul strings in ethtool call ; (This used to be ctdb commit e2df1d6d08e67a36ff05a590a34c56e900741287)	2007-09-12 13:23:36 +10:00
Andrew Tridgell	67bd64ef35	- don't allow the registration of clients with IPs we don't hold - change some debug levels to make tracking of IP release problems easier (This used to be ctdb commit 5f9aed62adaf87750f953412c55b29c58e4bb6c0)	2007-09-12 13:22:31 +10:00
Andrew Tridgell	a478c78f03	changed some debug levels (This used to be ctdb commit ed764533e1c2f8982e1577ca5e7f5f4482a15345)	2007-09-12 13:21:19 +10:00
Ronnie Sahlberg	536d393452	use the public addresses variable instead of hardcoding the path (This used to be ctdb commit 8e23f173cda8a76bbc243863bfc49fe8c7b907f4)	2007-09-12 07:28:24 +10:00
Ronnie Sahlberg	98f968d8d3	move all ip addresses onto loopback when we startup ctdb (This used to be ctdb commit 5d7500f7d93f0d36ffbf3c966c5b38f82f0376c7)	2007-09-12 07:26:30 +10:00
Andrew Tridgell	a6728e0520	fixed location of arp_filter (This used to be ctdb commit ea239c82fca2b9a648d21e5c603e632011958452)	2007-09-11 16:38:32 +10:00
Andrew Tridgell	5b65a6c7f0	get interface right (This used to be ctdb commit e0edc38d7e897f7de2850eb2cfd17fea75c16fcc)	2007-09-10 20:45:27 +10:00
Ronnie Sahlberg	a9a8ad07b4	grab the interface name from tok and not from the uninitialized array (This used to be ctdb commit 23a47ca2331a163b5fde03bd2f6f1d478633aede)	2007-09-10 16:34:11 +10:00
Ronnie Sahlberg	9c1b2f4856	merged patch from tridge (This used to be ctdb commit 90ab044093f67b656e21861ce12d6fee5794d21f)	2007-09-10 16:23:06 +10:00
Andrew Tridgell	8cd7ca149e	fixed a pointer cast warning (This used to be ctdb commit df0e7a4aa13112d613702d8ea0fb0e18510d293c)	2007-09-10 15:16:17 +10:00
Andrew Tridgell	57d8102cf8	added back --public-interface to startup script (This used to be ctdb commit 9e9cb3c0da7251f522c655366ef0868037577a9c)	2007-09-10 15:09:28 +10:00
Andrew Tridgell	f3ae1cdb02	- use struct sockaddr_in more consistently instead of string addresses - allow for public_address lines with a defaulting interface (This used to be ctdb commit 29cb760f76e639a0f2ce1d553645a9dc26ee09e5)	2007-09-10 14:27:29 +10:00
Andrew Tridgell	70ec39b1b1	add back in --public-interface as a default (This used to be ctdb commit cdf56daf69b2c8381ee673943e982ad20f19affd)	2007-09-10 14:26:35 +10:00
Andrew Tridgell	42168177ef	merge from ronnie (This used to be ctdb commit 1f21d4d563232926c35d03c4d69eb69190823dc6)	2007-09-10 13:21:11 +10:00
Andrew Tridgell	f3927719c9	add crontab and sysctl output (This used to be ctdb commit b1b59f3294ee7a5ed6d685f373bf19d3152170fa)	2007-09-10 11:27:07 +10:00
Ronnie Sahlberg	50381480eb	update a comment (This used to be ctdb commit e7d3ef4443686529299e8f293398cc0522235627)	2007-09-10 07:45:57 +10:00
Ronnie Sahlberg	4ac749bfa4	change the signature to ctdb_sys_have_ip() to also return: a bool that specifies whether the ip was held by a loopback adaptor or not the name of the interface where the ip was held when we release an ip address from an interface, move the ip address over to the loopback interface when we release an ip address after we have move it onto loopback, use 60.nfs to kill off the server side (the local part) of the tcp connection so that the tcp connections dont survive a failover/failback 61.nfstickle, since we kill hte tcp connections when we release an ip address we no longer need to restart the nfs service in 61.nfstickle update ctdb_takeover to use the new signature for ctdb_sys_have_ip when we add a tcp connection to kill in ctdb_killtcp_add_connection() check if either the srouce or destination address match a known public address (This used to be ctdb commit f9fd2a4719c50f6b8e01d0a1b3a74b76b52ecaf3)	2007-09-10 07:20:44 +10:00
Ronnie Sahlberg	0ebd7beb4b	set /proc/sys/net/ipv4/conf/all/arp_filter to 1 by default when 10.interfaces startsup this setting makes the system only respond to APR requests from the NIC where the ip address is tied to and adds to the "principle of least surprise" when using multihoming servers (This used to be ctdb commit 39ddf347dc45f599964a4c17e67e71faed00e544)	2007-09-08 08:09:02 +10:00
Ronnie Sahlberg	d91b28f8b7	ctdb ip must loop over all connected nodes to pull hte public ip list and merge into a big list since with the deassociation between a node and a public ipaddress the /etc/ctdb/public_addresses files can differ between nodes and no node know about all public addresses that a cluster can use (This used to be ctdb commit e208294fed183977cacc44b2cd1195c11d967c18)	2007-09-07 16:45:19 +10:00
Ronnie Sahlberg	3cad21d6be	remove the ctdb publicip command this command no longer makes sense when there is no on-to-one mapping between a node and its default public ip (This used to be ctdb commit 91280db7f6dd3d659edd86fae21ba347d6f9da9e)	2007-09-07 15:39:26 +10:00
Ronnie Sahlberg	d0dd8df752	update web nfs with the new NFS_HOSTNAME variable we need to be able to stat notify using the correct hostname (This used to be ctdb commit 1498e33e48a4654e02b74a00ef7473fed3225d69)	2007-09-07 12:20:48 +10:00
Ronnie Sahlberg	eb7a15730e	add a short delay after stopping nfslock to make it less likely that "weird" things happen (This used to be ctdb commit 4934c083cbcc19714094e08a0b7da1fb6fdc8a5a)	2007-09-07 12:14:53 +10:00
Ronnie Sahlberg	68c37f9b41	merge from tridge (This used to be ctdb commit 58c918b1bfe09c31049769dee266129cbad4cb20)	2007-09-07 09:21:40 +10:00
Ronnie Sahlberg	fa872de664	60.nfs: we must always restart the lockmanager when the cluster has been reconfigured and ip addresses has changed. This is to make sure we get a clusterwide grace period for nfs locking. if we dont do this and only restart locking on the nodes that were direclty affected, a different client can take out a conflicting lock from a different node before affected clients has had a chance to reclaim all the locks lost during reconfigure. grace period on rhel5 kernel has bene increased to 90 seconds! statd-callout: we must restart lockmanager to ensure a clusterwide grace period for nfs. this makes locking "more correct" for nfs clients and prevents other clients/nodes from taking out a conflicting lock while a different client/node tries to reclaim lost locks. This makes it "almost consistent" for NFS clients but there is still the possibility that a cifs client can take out a conflicting lock before an nfs client has had a chance to reclaim an existing lock. This can not be solved with anything less than making the kernel nfs lock manager "samba aware" and making samba aware of the internal state of the kernel lock manager so that they can cooperate. we can not just stop/start the lockmanager back to back in rhel5 since if they are stopped/started too close to eachother then when the new lockmanager upon starting up sends out statd notifications two things can happen: 1, new lockmanager sends out notification BEFORE it has registered with portmapper leading to lockmanager starts lockmanager sends notification to the client client tries to recover the lock and tries to portmap the lockmanager port on the server. server is not (yet) registered with portmapper and server responds "no such program" to hte clients request to discover where lockmanager is. client then just completely gives up reclaiming the lock and doesnt even reattempt the portmapper call after some timeout. ==> lock reclaim failed. 2, if they are started back to back, and a client tries to reclaim the lock the lockmanager sometimes sends two responses back to back to the client. one with status NLM_GRANTED (==you got the lock reclaimed) and one with status NLM_DENIED (==you could not get the lock reclaimed) This confuses the client and leads to the server thinking that the client does have the lock and the client thinking it has not got the lock and orphaned locks result. We also send out additional notification messages of different formats to allow more legacy clients to interoperate with locking. (This used to be ctdb commit 13208c1aab2942e28dff87e38e6794bf0c026033)	2007-09-07 08:52:56 +10:00
Ronnie Sahlberg	82984577f1	we dont need the rpc.statd on shared directory neither do we need PUBLIC_IP anymore (This used to be ctdb commit fd571ac87f65928e92dde6977745083bf381df1a)	2007-09-06 11:32:18 +10:00
Ronnie Sahlberg	00453a375a	improve the handling of hosts to notify with statd (This used to be ctdb commit cc87bda7e344bc777b9620a6211e62de4dce4e3b)	2007-09-06 11:30:49 +10:00
Ronnie Sahlberg	19546fb007	specify the additional ports for nfs (This used to be ctdb commit 1934163f0b393738615a05854082a7d488003e1c)	2007-09-06 10:26:44 +10:00
Ronnie Sahlberg	f7d193e9ce	the event scripts for nfs are called 60.nfs and 61.nfstickle (This used to be ctdb commit b15f1c25560320993b93aa3d943985dab4e47947)	2007-09-06 10:18:13 +10:00
Ronnie Sahlberg	0781616ef9	document NFS_TICKLE_SHARED_DIRECTORY on our web page (This used to be ctdb commit 40ec29f602897e9b01a6747806f502ab38423d54)	2007-09-06 08:21:11 +10:00
Ronnie Sahlberg	46eecfea27	we dont use 'sendip' any more so dont check for it and exit from the 61.nfstickles script if it is missing from the host (This used to be ctdb commit 8eac441e24f4ef33b55f9eaa4856b5c1e1c15213)	2007-09-05 15:39:51 +10:00
Ronnie Sahlberg	a9c8456ed6	we should always get data back from getnodemap (This used to be ctdb commit ff999a4b56f714c58c81baa454a2d39d04944136)	2007-09-05 14:59:29 +10:00
Ronnie Sahlberg	e4eeceaf3a	dont dereference vnn before we have assigned it a pointer value (This used to be ctdb commit 2a8fc69aea8527b22a3fe57427677e4caff57338)	2007-09-05 14:29:44 +10:00
Andrew Tridgell	c572d3c226	added a diagnostics tool for ctdb (This used to be ctdb commit 032a2238caf688656b00e06bf363182368e037e1)	2007-09-05 14:20:34 +10:00
Ronnie Sahlberg	77ec4d5248	allow different nodes in the cluster to use different public_addresses files so that we can partition the cluster into different subsets of nodes which each serve a different subset of the public addresses (This used to be ctdb commit 889e0fe69e4c88c6166282b12843b8d9727552d6)	2007-09-04 23:15:23 +10:00

1 2 3 4 5 ...

1072 Commits