samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-05-04 06:50:23 +03:00

Author	SHA1	Message	Date
Michael Adam	04ba95c09d	s3:dbwrap_ctdb: increase the number of commit retries 5-->100 This is to cope with timeouts when recoveries and transactions collide. Maybe 100 is too hight, but 10 or even 20 have been too low in a very busy environment. Michael	2009-12-05 17:59:36 +01:00
Michael Adam	d92d770d23	s3:dbwrap_ctdb: increase the rsn of the __transaction_lock__ when storing so that it is correctly handled by recoveries. Also set the dmaster explicitly. Michael	2009-12-05 17:59:35 +01:00
Michael Adam	25bdf27eaa	s3:dbwrap_ctdb: add debug message to transaction_fetch_start() for the case that another local process has started a transaction bewteen releasing the transaction_lock record and starting the transaction. Michael	2009-11-03 01:02:38 +01:00
Michael Adam	9fef6a6666	s3:dbwrap_ctdb: split combined check in two and add descriptive debug in db_ctdb_transaction_fetch_start() for error conditions when re-fetching the transaction_lock record inside the transaction Michael	2009-11-03 01:02:38 +01:00
Michael Adam	f37439efd2	s3:dbwrap_ctdb: fix race condition with concurrent transactions on the same node. In ctdb_transaction_commit(), when the trans2_commit control fails, there is a race condition in the 1 second sleep between the local transaction_cancel and the call to ctdb_replay_transaction(): The database is not locked, and neither is the transaction_lock record. So another client can start and possibly complete a new transaction in this gap, but only on the same node: The locking of the transaction_lock record on a different node which involves migration of the record to the other node has been disabled by introduction of the transaction_active flag on the db which closes precisely this gap from the start of the commit until the call to TRANS2_FINISH or TRANS2_ERROR. But this mechanism does not cover the case where a process on the same node tries to start a transaction: There is no obstacle to locking the transaction_lock record because the record does not need to be migrated. This commit closes this race condition in ctdb_transaction_fetch_start() by using the new ctdb_ctrl_transaction_active() call to ask the local ctdb daemon whether it has a transaction running on the database. If so, the check is repeated until the running transaction is done. This does introduce an additional call to the local ctdbd when starting transactions, but it does close the (hopefully) last race condition. Michael	2009-11-03 01:02:37 +01:00
Michael Adam	9be4d3dd4f	s3:dbwrap_ctdb: add new db_ctdb_transaction_active() that calls CTDB_CONTROL_TRANS2_COMMIT Michael	2009-11-03 01:02:37 +01:00
Michael Adam	9bd6b9d9f6	s3:dbwrap_ctdb: fix a race in starting concurrent transactions on a single node There are two races in concurrent transactions on a single node. One in starting a transaction and one with replay during commit. This commit closes the first race by storing the client pid in the transaction-lock record and comparing the stored pid against its own pid after releasing the lock and refetching the record inside the transaction. Michael	2009-11-03 01:02:36 +01:00
Michael Adam	8d61b8abbc	s3:dbwrap_ctdb: use db_ctdb_ltdb_fetch() inside db_ctdb_transaction_fetch_start Michael	2009-11-03 01:02:36 +01:00
Michael Adam	0ec476fca1	s3:dbwrap_ctdb: use db_ctdb_ltdb_fetch() inside db_ctdb_transaction_fetch() Michael	2009-11-03 01:02:36 +01:00
Michael Adam	4973ff66ac	s3:dbwrap_ctdb: add a function db_ctdb_ltdb_fetch() This fetches a record from the db and splits out the ctdb header. Michael	2009-11-03 01:02:35 +01:00
Michael Adam	6a898348fa	s3:dbrwap_ctdb: add a function db_ctdb_ltdb_store() and use it in db_ctdb_store() and db_ctdb_transaction_store(). Michael	2009-11-03 01:02:35 +01:00
Michael Adam	d5aa758482	s3:dbwrap_ctdb: reformat a comment slightly to enhance clearness. Michael	2009-11-03 01:02:35 +01:00
Michael Adam	a1cf12e1f6	s3:dbwrap_ctdb: set dmaster in ctdb_transaction_store() also when updating an existing record not only when creating a record. This matches commit e9194a130327d6b05a8ab90bd976475b0e93b06d from ctdb-master. Michael	2009-09-11 15:39:53 +02:00
Michael Adam	f5a5c6a5dc	s3:dbwrap_ctdb: fix some function header comments Michael	2009-05-25 22:16:46 +02:00
Michael Adam	bb0fb97562	s3:dbwrap_ctdb_marshall_add: don't leak the ctdb_rec_data to the outside Michael	2009-03-04 22:49:25 +01:00
Stefan Metzmacher	a83b327f1b	s3:dbwrap: add get_flags() hook to db_context metze	2009-01-19 17:06:02 +01:00
Jelmer Vernooij	f3f9446ec1	Rename hex_encode to hex_encode_talloc,for consistency with samba 4 and heimdal.	2008-10-18 16:16:57 +02:00
Andrew Tridgell	7caa8c85ac	fixed an (unlikely) memory leak	2008-09-29 14:01:01 +02:00
Andrew Tridgell	acf5f2e5b0	fixed a segfault on the ctdb destructor code	2008-09-29 14:01:00 +02:00
Volker Lendecke	4e479737f3	Fix some nonempty blank lines (This used to be commit 010c7101e59477f0d5f3bf11c17f474ec6f79cc1)	2008-08-24 12:48:30 +02:00
Volker Lendecke	3d13cdfa92	Fix some C++ warnings (This used to be commit dd9e4e6db04acf20f6ef7705955358c7ca442bbd)	2008-08-24 12:48:23 +02:00
Andrew Tridgell	11331eeae5	allow nested ctdb transactions in the same manner that they are allowed for tdb. This is needed for the registry db backend. (This used to be commit 4b04ec29c76df837a7909725bbbf4c79d5abdb4d)	2008-08-13 11:54:11 +02:00
Andrew Tridgell	65a78a6a52	drop retries to 5 (This used to be commit a2f70fc175b748ef160a998d0563c28381ea3466)	2008-08-13 11:54:11 +02:00
Andrew Tridgell	ca64c340c7	use CTDB_CONTROL_TRANS2_COMMIT_RETRY to prevent the counter getting out of sync (This used to be commit 571ec7893c8b40959c005d510c039e3f231ffc67)	2008-08-13 11:54:11 +02:00
Andrew Tridgell	fe3dd9b3e6	fixed lots of places that paniced on a failed transaction_commit, thinking it was a failure of a transaction cancel (This used to be commit 22dbe158ed62ae47bbcb41bba3db345294f75437)	2008-08-13 11:54:10 +02:00
Andrew Tridgell	0f8a6859e6	cope with the control failing completely without returning a status (This used to be commit fe6a03e7b11cd859fddae5ba924ea5e071b8ccea)	2008-08-13 11:54:10 +02:00
Andrew Tridgell	2592565bde	handle two special cases 1) when all nodes write the same value to the record, or when writing a value that is already there, we can skip the write and save ourselves a network transactions 2) when all remote nodes fail an update, and we then fail a replay, we don't need to trigger a recovery. This solves a corner case where we could get into a recovery loop (This used to be commit 2481bfce4307274806584b0d8e295cc7f638e184)	2008-08-13 11:54:10 +02:00
Andrew Tridgell	62bbcc6135	put a limit on the number of retries. I found a case where a recovery could lead to it blocking forever (This used to be commit a633390d3a7cb04a7c4e14cba9c533621793287e)	2008-08-13 11:54:09 +02:00
Andrew Tridgell	7e9229e17a	we need to commit, not cancel, on record destruction (This used to be commit ba64a757f86fb60994e12e81416083ac0fa11c21)	2008-08-13 11:54:09 +02:00
Andrew Tridgell	5031f2a6e2	all persistent databases now do all stores via automatic transactions (This used to be commit 76fbe56e827193d939676da23a580aa0f9394dd1)	2008-08-13 11:54:09 +02:00
Andrew Tridgell	ee314d6930	fixed fetch of empty records (This used to be commit 037516f1362c8d64da1d47a0cdaf83198d3eaeaf)	2008-08-13 11:54:09 +02:00
Andrew Tridgell	b3f4b7768f	cleanup debugging and fix handling of empty transaction (This used to be commit 2e85cbe88b3d1674b915f62e02be7d005fddaa39)	2008-08-13 11:54:08 +02:00
Andrew Tridgell	0f41961e4f	first cut at adding full transactions for ctdb to samba3 (This used to be commit f91a3e0f7b7737c1d0667cd961ea950e2b93e592)	2008-08-13 11:54:08 +02:00
Michael Adam	286974e35a	dbwrap ctdb: fix a DEBUG message. Michael (This used to be commit d776d8df262e1753fb428450140df94e63035af5)	2008-08-13 11:54:08 +02:00
Michael Adam	ebaf208fc3	dbwrap ctdb: don't retry when tdb_store failed in db_ctdb_persistent_store(). Only retry when ctdbd_persisten_update() failed. Michael (This used to be commit ff413a4614c8b272a34b2a9e56a329a8e8749a34)	2008-08-13 11:54:07 +02:00
Michael Adam	b45305b5d8	dbwrap ctdb: add a partial mapping from tdb_error to NTSTATUS and use it for store. Michael (This used to be commit eaf76c751f9bde2843174b400c109304831df83e)	2008-08-13 11:54:07 +02:00
Michael Adam	873e74705f	dbwrap ctdb: add db_ctdb_delete_persistent() and use it for persistent DBs as delete_rec operation from fetch_locked() Michael (This used to be commit f4aab595a0219305fbedf8890e787b690660a55a)	2008-08-13 11:54:07 +02:00
Michael Adam	dd7ac4f38d	dbwrap ctdb: call db_ctdb_store() in db_ctdb_delete(). to reduce code duplication. Michael (This used to be commit 09a197e756459877cab7b4d09f534c6a41cfdd71)	2008-08-13 11:54:07 +02:00
Michael Adam	5dcf20961e	dbwrap ctdb: add a retry loop to the persistent store operation. This is because ctdbd can fail in performing the persistent_store due to race conditions, and this does not mean it can't succeed the next time. To not loop infinitely, this makes use of a new parametric option: "dbwrap ctdb:max store retries" (integer) which defaults to 5 and sets the upper limit for the number or repeats of the fetch/store cycle. Michael (This used to be commit 2bcc9e6ecef876030e552a607d92597f60203db2)	2008-08-13 11:54:06 +02:00
Michael Adam	ed66929647	dbwrap ctdb: release the lock before calling ctdbd_persistent_store() in the persistent db_ctdb_store operation. This is to prevent deadlocks in db_ctdb_persistent_store(). There is a tradeoff: Usually, the record is still locked after db->store operation. This lock is usually released via the talloc destructor with the TALLOC_FREE to the record. So we have two choices: - Either re-lock the record after the call to persistent_store or cancel_persistent update and this way not changing any assumptions callers may have about the state, but possibly introducing new race conditions. - Or don't lock the record again but just remove the talloc_destructor. This is less racy but assumes that the lock is always released via TALLOC_FREE of the record. I choose the first variant for now since it seems less racy. We can't guarantee that we succeed in getting the lock anyways. The only real danger here is that a caller performs multiple store operations after a fetch_locked() which is currently not the case. Michael (This used to be commit d004c9a7281d2577c3ba2012c8f790cc198ea700)	2008-08-13 11:54:06 +02:00
Michael Adam	fd070dc9af	dbwrap ctdb: remove erroneously duplicated comment. Michael (This used to be commit c939c55e5182258092faceefa58a7f328f18619e)	2008-08-13 11:54:06 +02:00
Ronnie Sahlberg	fb97047a84	Use transaction start/cancel for persistent writes to avoid leaving the database in an inconsistent state if we crash during the operation Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com> (This used to be commit 09329f1f9114af44fc4e5e4f29a7315912313125)	2008-08-13 11:54:06 +02:00
Andrew Tridgell	f8534d5c78	fixed permissions on ctdb databases (This used to be commit 123fc3980a83d956bffaa689f3af81bbf81ce1c1)	2008-08-06 10:51:04 +02:00
Volker Lendecke	541b8dec4e	Add transactions to the dbwrap API Only filled in for tdb so far, for rbt it's pointless, and ctdb itself needs to be extended (This used to be commit 0a55e018dd68af06d84332d54148bbfb0b510b22)	2008-03-10 21:08:44 +01:00
Alexander Bokovoy	68694369fc	Merge CTDB-related fixes from samba-ctdb 3.0 branch (http://samba.org/~tridge/3_0-ctdb ) Signed-off-by: Alexander Bokovoy <ab@samba.org>(This used to be commit 0c8e23afbbb2d081fc23908bafcad04650bfacea)	2008-01-16 12:09:48 +03:00
Volker Lendecke	95b9e23095	Fix dbwrap debug output (This used to be commit 9f9c933c16abacb2d0aa7bc7faa5b1ddac61b0e5)	2007-11-09 15:10:14 +01:00
Volker Lendecke	a116d7c7d9	r24773: Fix a ctdb connection lockup The lockup could happen when packet_read_sync() gets two packets in a row, the first one being an async message, and the second one being the response to a ctdb request. Also add some debug msg to ctdb_conn.c, and cut off the "locking key" messages to only dump 20 hex chars at debug level 10. >10 will dump everything. (This used to be commit 0a55880a240b619810371a19144dd0a75208adfe)	2007-10-10 12:30:20 -05:00
Stefan Metzmacher	ebdfd34548	r24113: some little fixes to get the correct error message when using "clustering = yes" and ctdbd isn't running metze (This used to be commit c5f020ba1fdefe0422dd466b9c68ff67c74ceddd)	2007-10-10 12:29:08 -05:00
Andrew Tridgell	5e54558c6d	r23784: use the GPLv3 boilerplate as recommended by the FSF and the license text (This used to be commit b0132e94fc5fef936aa766fb99a306b3628e9f07)	2007-10-10 12:28:22 -05:00
Jeremy Allison	d824b98f80	r23779: Change from v2 or later to v3 or later. Jeremy. (This used to be commit 407e6e695b8366369b7c76af1ff76869b45347b3)	2007-10-10 12:28:20 -05:00

1 2

51 Commits