samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-05-22 22:50:19 +03:00

Author	SHA1	Message	Date
Michael Adam	3cb3620acf	vacuum: check lmaster against num_nodes instead of vnn_map->size When lmaster is bigger than the biggest recorded node number, then exit the traverse with error. (This used to be ctdb commit 3930c7796b72bbf275bbca8aaeceec3e705a964b)	2011-03-14 13:35:43 +01:00
Michael Adam	dd75ae8c15	vacuum: reduce indentation of the loop sending VACUUM_FETCH controls This slightly improves the code structure in that loop. (This used to be ctdb commit bc4990e600c53433a924a0d70e3488a5a6bdc1ff)	2011-03-14 13:35:43 +01:00
Michael Adam	70fa7394fa	vacuum: correctly send TRY_DELETE_RECORDS ctrl to all active nodes Originally, the control was sent to all records in the vnn_map, but there was something still missing here: When a node can not become lmaster (via CTDB_CAPABILITY_LMASTER=no) then it will not be part of the vnn_map. So such a node would be active but never receive the TRY_DELETE_RECORDS control from a vacuuming run. This is fixed in this change by correctly building the list of active nodes first in the same way that the recovery process does it. (This used to be ctdb commit 49247df4a47a8a107fa7dd7b187e69e243e6bdbe)	2011-03-14 13:35:43 +01:00
Michael Adam	33b5d9c1a1	vacuum: in ctdb_vacuum_db, fix the length of the array of vacuum fetch lists This patch fixes segfaults in the vacuum child when at least one node has been stopped or removed from the cluster: The size of the vnn_map is only the number of active nodes (that can be lmaster). But the node numbers that are referenced by the vnn_map spread over all configured nodes. Since the array of vacuum fetch lists is referenced by the key's lmaster's node number later on, the array needs to be of size num_nodes instead of vnn_map->size. (This used to be ctdb commit 136508e3f4dd0acc210dde938ad59ef38b63d3a1)	2011-03-11 23:00:51 +01:00
Michael Adam	8d49f4878c	Fix typos in a comment in vacuum_traverse. (This used to be ctdb commit 6a4df8242ee4d095ff03229a168b83bcd84c8a7a)	2011-03-09 21:14:14 +01:00
Michael Adam	eb5a0cc1e2	tests: fix segfault in store test when connection to ctdbd failed. (This used to be ctdb commit 43a15d2906b3f9d08af234f55a3a0f614571d3a7)	2011-03-09 11:22:21 +01:00
Michael Adam	1a08df9a46	tests: fix segfault in fetch_one test when connection to ctdbd fails (This used to be ctdb commit 89c8e52de3878b8e29c6a24725eb66c18fd1f52a)	2011-03-09 11:22:14 +01:00
Michael Adam	71960f55b2	tests: fix segfault in fetch test when connection to ctdb failed. (This used to be ctdb commit f493eb31abf956fd38347c2ff77026e380a9664d)	2011-03-09 11:22:05 +01:00
Michael Adam	9936e1cf0f	tests: fix segfault in randrec test when connection to daemon fails. (This used to be ctdb commit 715999d7ffbf10f3a57de3ee08a293165333553a)	2011-03-09 11:19:28 +01:00
Michael Adam	d2d51e23ba	gitignore: add tags file (This used to be ctdb commit 1e2ff096b78d160f5c535bccfc5482830372fd56)	2011-03-09 10:51:56 +01:00
Michael Adam	43bc87eda6	gitignore: add vi swap files (This used to be ctdb commit ac4dad619111e7153fe66eb2713f3b2ce63d3999)	2011-03-09 10:51:47 +01:00
Ronnie Sahlberg	a1abcd41e0	Restart recovery daemon if it looks like it hung. Don't shutdown ctdbd completely, that only makes the problem worse. (This used to be ctdb commit 221ecc2509f6d267d1854c1042ff945a620510bb)	2011-03-07 06:39:10 +11:00
Ronnie Sahlberg	49a30783d3	If/when the recovery daemon terminates unexpectedly, try to restart it again from the main daemon instead of just shutting down the main daemon too. While it does not address the reason for recovery daemon shutting down, it reduces the impact of such issues and makes the system more robust. (This used to be ctdb commit 0566ef3d6cef809bda204877c493c80ff9eb2c40)	2011-03-01 12:13:58 +11:00
Ronnie Sahlberg	b611de93ad	ATTACH_DB: simplify the code slightly and change the semantics to only refuse a db attach during recovery IF we can associate the request from a genuine real client instead of deciding this on whether client_id is zero or This will suppress/avoid messages like these : DB Attach to database %s refused. Can not match clientid... (This used to be ctdb commit b05ccf366df985e0a3365aacc75761ebd438deaf)	2011-03-01 12:13:46 +11:00
Ronnie Sahlberg	8acb677c9c	Deferred attach : at early startup, defer any db attach calls until we are out of recovery. (This used to be ctdb commit eeaabd579841f60ab2c5b004cbbb1f5de2bfe685)	2011-03-01 12:13:34 +11:00
Ronnie Sahlberg	e00ca55fa4	Dont return error if trying to set db priority on a db that does not yet exist. Just treat as a nop. When the database is created later it will get its priority set properly. (This used to be ctdb commit 05c934b10ad2690be9d75c9033a0b849bf16455d)	2011-02-25 10:25:01 +11:00
Michael Adam	53b558a3bc	server: add a comment explaining the call redirect logic in ctdb_call_send_redirect(). (This used to be ctdb commit 81663b81687c0ba681500cca6aa8174bb9587ad2)	2011-02-24 10:35:26 +01:00
Michael Adam	40e922f4e6	recover: finish pending trans3 commits when a recovery is finished. When the end_recovery control is received, pending trans3 commits are finished. During the recovery, all the actions like persistent_callback and persistent_store_timeout had been disabled to let the recovery do its job. After the recover is completed, send the reply to the waiting clients. (This used to be ctdb commit f7dfeb7143f574c2434f7dd16917380dfd1f4f64)	2011-02-24 10:35:26 +01:00
Michael Adam	2bd04f0ff8	persistent: add ctdb_persistent_finish_trans3_commits(). This function walks all databases and checks for running trans3 commits. It sends replies to all of them (with error code) and ends them. To be called when a recovery finishes. (This used to be ctdb commit 70ba153b532528bdccea70c5ea28972257f384c1)	2011-02-24 10:35:26 +01:00
Michael Adam	ee44c23cd5	daemon: correctly end a running trans3_commit if the client disconnects. (This used to be ctdb commit 9e0898db6df52d9bc799dd87bfea8c72d5f70ba0)	2011-02-24 10:35:25 +01:00
Michael Adam	0b3d8d28f6	persistent: add a client context to the persistent_stat and track the db_id The db_id is tracked in the client context as an indication that a transaction commit is in progress. This is cleared in the persistent_state talloc destructor. This is in order to properly treat running trans3_commits if the client disconnects. (This used to be ctdb commit e886ff24f4e3e250944289db95916b948893d26c)	2011-02-24 10:35:25 +01:00
Michael Adam	65f7a44987	persistent: reject trans3_control when a commit is already active. This should actually never happen. (This used to be ctdb commit f416e76838fe2adf629d4356d1cc87054b1af164)	2011-02-24 10:35:25 +01:00
Michael Adam	01c2c0c262	persistent: allocate the persistent state in the ctdb_db struct in trans3_commit Make sure that ctdb_db->persistent_state is correctly NULL-ed when the state is freed. This way, we can use ctdb_db->persistent_state as an indication for whether a transaction commit is currently running. (This used to be ctdb commit 761cb235193564a0f337d0308f0a9e6de0ef2710)	2011-02-24 10:35:25 +01:00
Michael Adam	503b647319	persistent: add a ctdb_db context to the ctdb_persistent_state struct. (This used to be ctdb commit a14917c983c3b9bbbf38f5ddeecdbbe5bde32364)	2011-02-24 10:35:25 +01:00
Michael Adam	ace1efb878	persistent: add a ctdb_persistent_state member to the ctdb_db context. To be used for tracking running transaction commits through recoveries. (This used to be ctdb commit 1237e15df4af58a3d220eea42a4b75e21e65029f)	2011-02-24 10:35:25 +01:00
Michael Adam	76acf72bc5	persistent_callback: print "no error message given" instead of "(null)" (This used to be ctdb commit d871a38978219e004833608c11aae98fe47614b9)	2011-02-24 10:35:25 +01:00
Michael Adam	e050266690	persistent: reduce indentation for the finishing moves in ctdb_persistent_callback (This used to be ctdb commit 2c2d1646eb753ea9561f085bcb101153267b052b)	2011-02-24 10:35:24 +01:00
Michael Adam	033ba0b466	persistent: if a node failed to update_record, trigger a recovery and stop processing of the update_record replies in order to let the recovery finish the trans3_commit control. (This used to be ctdb commit cab95570dc1eefb08abbac5ae411c29f699b51cc)	2011-02-24 10:35:24 +01:00
Michael Adam	0c93a2932c	persistent_store_timeout: do not really time out the trans3_commit control in recovery If a recovery was started, then all further processing of the update_record controls sent by the trans3_commit control and timing them out is disabled. The recovery should trigger sending the reply for the update record control when finished. (This used to be ctdb commit 983c1ca2e18ecd60fca69bfe9e116125cc695857)	2011-02-24 10:35:24 +01:00
Michael Adam	c9df23ae1d	persistent_callback: ignore the update-record return code of remote node in recovery If a recovery was started, then all further processing of the update_record controls sent by the trans3_commit control is disabled. The recovery should trigger sending the reply for the update record control when finished. (This used to be ctdb commit 12cf0619255b12230843cd8bb49cbfdea376ca2f)	2011-02-24 10:35:24 +01:00
Ronnie Sahlberg	92f86534ac	ctdb_req_dmaster from non-master If we find a situation where we get a stray packet with the wrong dmaster, don't suicide with ctdb_fatal() since this is too disruptive. Just drop the stray packet and force a recovery to make sure all is good again. CQ S1022004 (This used to be ctdb commit 62b7fe853db37c0a90e48a0332a3426a8dcb4ed8)	2011-02-18 11:29:44 +11:00
Ronnie Sahlberg	a453e79050	50.samba : Tell winbind about every time we add/remove an ip from the node CQ S1021636 (This used to be ctdb commit 87b279027616cffbcedfd534ac0032cd51238dfe)	2011-02-18 11:29:35 +11:00
Ronnie Sahlberg	65f44e159f	Add two new flags for the ltdb header. One of which signals that the record has never been migrated to/from a node while containing data. This property "has never been migrated while non-zero" is important later to provide heuristics on which records we might be able to purge from the tdb files cheaply, i.e. without having to rely on the full-blown database vacuum. These records are believed to be very common and the pattern would look like this : 1, no record exists at all. 2, client opens a file 3, samba requests the record for this file 4, an empty record is created on the LMASTER 5, the empty record is migrated to the DMASTER 6, samba writes a <sharemode> to the record locally and the record grows 7, client finishes working the file and closes the file 8, samba removes the sharemode and the record becomes empty again. 9, much later : vacuuming will delete the record At stage 8, since the record has never been migrated onto a node while being non-zero it would be safe, and much more efficient to just delete the record completely from the database and hand it back to the LMASTER. The flags occupy the same uint32_t as was previously used for laccessor/lacount in the header. For now, make sure the flags only define/use the top 16 bits of this field so that we are sure we don't collide with bits set to one from previous generations of the ctdb cluster database prior to this change in semantics of this word. This is a rework of Michael's patch : commit 2af1a47cbe1a608496c8caf3eb0c990eb7259a0d Author: Michael Adam <obnox@samba.org> Date: Tue Nov 30 17:00:54 2010 +0100 add a DEFAULT record flag and a MIGRATED_WITH_DATA record flag. (This used to be ctdb commit e075670dee8e6ecaba54986f87a85be3d0528b6b)	2011-02-18 10:14:56 +11:00
Ronnie Sahlberg	d32a4dd501	remove checking for filesystems and filesystem health from the cnfs script. remove the gpfsmount and gpfsumount entry points (This used to be ctdb commit 7db5a4832a9555be53c301f198f72b9e075a8ae7)	2011-02-18 10:11:56 +11:00
Ronnie Sahlberg	ef0ab7eee1	60.nfs Don't update the statd settings that often. When we have very many nodes and very many ips, this would generate a lot of unnecessary load on the system (This used to be ctdb commit 0c030c9384500f340d8382c20e1e91b11aa377e9)	2011-02-18 10:10:34 +11:00
Ronnie Sahlberg	b57bd0f896	Remove LACOUNT and LACCESSOR and migrate the records immediately. This concept didn't work out and it is really just as expensive as a full migration anyway, without the benefit of caching the data for subsequent accesses. Now, migrate the records immediately on first access. This will be combined with a "cheap vacuum-lite" for special empty records to prevent growth of databases. Later extensions to mimic read-only behaviour of records will include proper shared read-only locking of database records, making the laccessor/lacount read-only access to the data obsolete anyway. Removing this special case and the handling of lacount/laccessor makes the codepath where shared read-only locking will be implemented simpler, and frees up space in the ctdb_ltdb header for use by vacuuming flags as well as read-only locking flags. (This used to be ctdb commit 155dd1f4885fe142c6f8bd09430f65daf8a17e51)	2011-02-18 10:08:32 +11:00
Ronnie Sahlberg	0aa2282c9c	change the hash function to use the much better Jenkins hash from the tdb library cq S1020233 (This used to be ctdb commit b86feb6fe463dfdb67b2798491df18a4c434a430)	2011-02-18 10:05:09 +11:00
Ronnie Sahlberg	c23f2e8bea	We default to non-deterministic ip now where ips are "sticky" and dont change too much. This means we can simplify the way we add ips significantly and stop trying to move them. We also check if the node already hosts the ip, in which case we used to return an error. Instead just print an error string but return 0, ok. This makes it easier to script, and works around broken scripts. CQ1021034 (This used to be ctdb commit 307e5e95548155a31682dfcb0956834d0c85838e)	2011-02-08 17:06:10 +11:00
Ronnie Sahlberg	40bd94bd5e	If the node is stopped, put a log entry in /var/log/* to indicate this is why we never become ready (This used to be ctdb commit ef1de8211f83259ea37dcd57562139a3b63d9631)	2011-02-02 14:09:56 +11:00
Ronnie Sahlberg	0f33605866	LockWait congestion. Add a dlist to track all active lockwait child processes. Every time creating a new lockwait handle, check if there is already an active lockwait process for this database/key and if so, send the new request straight to the overflow queue. This means we will only have one active lockwait child process for a certain key, even if there were thousands of fetch-lock requests for this key. When the lockwait processing finishes for the original request, the processing in d_overflow() will automagically process all remaining keys as well. Add back a --nosetsched argument to make it easier to run under gdb (This used to be ctdb commit 3e9317a2e1f687b04bf51575d47fcd4faa6e6515)	2011-01-24 12:21:58 +11:00
Ronnie Sahlberg	f91f063fe0	Compile fix (This used to be ctdb commit a81da1e67cd11734839c3fa7ae1ddaaf3459416d)	2011-01-24 12:21:53 +11:00
Rusty Russell	e57362ecf4	ctdb_lockwait: create overflow queue. Once we have more than 200 children waiting on a particular db, don't create any more. Just put them on an overflow queue, and when a child gets a lock search that queue to see if others were after the same lock (they probably were). (This used to be ctdb commit 5e614e8cfd1e9a4b13035a0e400b7a60a745b510)	2011-01-24 12:21:50 +11:00
Ronnie Sahlberg	b2d7554b32	Add a new test tool that fetch locks a record and then blocks until it receives user input to unlock the record again. (This used to be ctdb commit 1b3c5278aa1bf712606e2ec138e6be7b2e8a6ad1)	2011-01-24 12:21:46 +11:00
Ronnie Sahlberg	3f819741ad	ctdb: hold transaction locks during freeze, mark during recover. Make the ctdb parent "mark" the transaction lock once the child process has frozen/locked the entire database. This stops the ctdb daemon from using a blocking fcntl() locking on the tdb during the read traverse during recovery. CQ 1021388 (This used to be ctdb commit 52ee2b3ce822344d0f55ac040fe25f6ec5c0d7c2)	2011-01-18 14:07:44 +11:00
Rusty Russell	e68b97ffc9	tdb: expose transaction lock infrastructure for ctdb tdb_traverse_read() grabs the transaction lock. This can cause ctdbd (which uses it) to block when it should not; expose mark and normal variants of this lock, so ctdbd's child (the recovery daemon) can acquire it and the ctdbd parent can mark it was held. (This used to be ctdb commit d09fa845bd848d04507853809acf42e0471b44bf)	2011-01-18 14:07:41 +11:00
Ronnie Sahlberg	849ef2e39b	change Christian's previous patch to only perform the check/logging if we are the main ctdb daemon. Other daemons/child processes are not guaranteed to get events on a regular basis so those should not be checked. (This used to be ctdb commit ac2afe9c25753b837d5f6396020e0f3c65ef3628)	2011-01-17 12:01:28 +11:00
Christian Ambach	ad56f321c8	improve timing issue detections the original "Time jumped" messages are too coarse to interpret exactly what was going wrong inside of CTDB. This patch removes the original logs and adds two other logs that differentiate between the time it took to work on an event and the time it took to get the next event. (This used to be ctdb commit fd8d54292f10b35bc4960d64cfa6843ce9aba225)	2011-01-17 11:56:55 +11:00
Ronnie Sahlberg	fcd98a7e59	LIBCTDB: add support for traverse (This used to be ctdb commit 9463e04038ba36792583f83bd95c1af322dc283a)	2011-01-14 17:38:56 +11:00
Ronnie Sahlberg	6494574d8f	db_exists() takes 3 arguments, not two. (This used to be ctdb commit 2c02fc2d45cd7364d7bee0d6a89f1386131ef002)	2011-01-14 09:53:25 +11:00
Ronnie Sahlberg	d903473d82	We can not always rely on the recovery daemon pinging us in a timely manner so we need a "ticker" in the main ctdbd daemon too to ensure we get at least one event to process every second. This will improve the accuracy of "Time jumped" messages and remove false positives when the recovery daemon is "slow". (This used to be ctdb commit 70154e5e19e219de086b2995d41e8f6e069ee20d)	2011-01-14 09:47:44 +11:00


3252 Commits