samba-mirror

mirror of https://github.com/samba-team/samba.git synced 2025-01-12 09:18:10 +03:00

Author	SHA1	Message	Date
Ronnie Sahlberg	37608d70fc	ReadOnly: Add clientside code to fetch readonly records (This used to be ctdb commit 6fccc902bce21fa6ff13ed08ee3341bbf8be39f2)	2011-08-23 10:34:15 +10:00
Ronnie Sahlberg	1bbd4cbf35	ReadOnly: Add a ctdb_ltdb_fetch_readonly() helper function (This used to be ctdb commit 8551420fb331dd2a897f4619278a981fcefb96e8)	2011-08-23 10:33:17 +10:00
Ronnie Sahlberg	17f0e0890c	ReadOnly: Add a new flag to call request packet to indicate that the client wants a readonly delegation (This used to be ctdb commit a3f54a556e97170eedf43708d58dd32446ca5840)	2011-08-23 10:29:40 +10:00
Ronnie Sahlberg	dda2616cf5	ReadOnly: Add a function to start a revoke of all delegations for a record. This triggers a child process to be created to perform the actual potentially blocking calls that are required. (This used to be ctdb commit 7d575ee92c95bc4aab78a33bc1aac7ff0811ab3a)	2011-08-23 10:27:31 +10:00
Ronnie Sahlberg	1bb855bd52	ReadOnly: Add functions to register CALLs to a context used to handle deferal of processing of CALL commands. Once the contexts are freed, the deferred calls are re-issued to the input packet processing functions again. This is needed when/if a CALL can not currently be processed by the main engine due to the record being locked down for revoking of all delegations. The data is passed through several layers of callbacks, and finally a timed event callback to ensure that the processing of the packet will be restarted again at the topmost eventloop, avoinding event loop nesting. (This used to be ctdb commit cc6f78efcfa3b8caeffbd68018e6dfbf81488dce)	2011-08-23 10:25:57 +10:00
Ronnie Sahlberg	3d495c48d2	ReadOnly: Add an extra flag to ctdb_call_local to specify whether we want to write the record and header back to the tdb (for example we do when performing dmaster migrations) (This used to be ctdb commit b935e83255aeb3754b2fd37cf5611e02f7283514)	2011-08-23 10:25:05 +10:00
Ronnie Sahlberg	1441b77cce	ReadOnly: Add "readonly" flag to the ctdb_db_context to indicate if this database supports readonly operations or not. Add a private lock-less tdb file to the ctdb_db_context to use for tracking delegarions for records Assume all databases will support readonly mode for now and se thte flag for all databases. At later stage we will add support to control on a per database level whether delegations will be supported or not. (This used to be ctdb commit 502f86f79944df4bac9094f716e54110c511dc24)	2011-08-23 10:24:26 +10:00
Ronnie Sahlberg	8f63a5dadd	ReadOnly: Add 4 new record flags to handle read only delegation and revoking of delegations (This used to be ctdb commit 875b0bede217547b51f02648b6a28a3c98b6b949)	2011-08-23 10:17:08 +10:00
Ronnie Sahlberg	e8127f0e0f	ReadOnly: Add clientside functions to send the UPDATE_RECORD control (This used to be ctdb commit 74a5b3d7bafd8827a4ee80095fde5798263821e4)	2011-08-23 10:11:38 +10:00
Ronnie Sahlberg	f924b3f40e	ReadOnly: Add helper functions to manipulate a TDB_DATA as a bitmap for nodes that we are tracking as having a readonly delegation (This used to be ctdb commit d10084e62d37674bb8d9e31d457fd23e050545be)	2011-08-23 10:09:42 +10:00
Ronnie Sahlberg	00a870f759	ReadOnly records: Add a new RPC function FETCH_WITH_HEADER. This function differs from the old FETCH in that this function will also fetch the record header and not just the record data (This used to be ctdb commit c7196d16e8e03bb2a64be164d15a7502300eae0e)	2011-08-23 10:06:59 +10:00
Volker Lendecke	21bb8abc93	libctdb: "ctdb_request_free" does not need the ctdb_connection parameter Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 5a5ed2a43b76bec69494b6cdc6451527f5c472e5)	2011-08-22 17:11:07 +02:00
Martin Schwenke	5ac67504ca	Tests: Initial test code for LCP2 IP allocation algorithm. Move struct ctdb_public_ip_list to ctdb_private.h and put some definitions for some functions from ctdb_takeover.c there. This allows those functions to be called from unit tests. Add ctdb_takeover_tests.c and the Makefile support to build it. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9d34be0233edf3bc022345c0494c4b2a4d7f8480)	2011-07-29 09:01:36 +10:00
Martin Schwenke	ff1a81c872	IP allocation - add LCP2 algorithm. The current non-deterministic IP allocation algorithm balances IPs across the whole cluster. It does not consider different interfaces/VLANs/subnets, so these different groups of IPs aren't generally well balanced. This adds the LCP2 algorithm for IP allocation and allows it to be enabled by setting the "LCP2PublicIPs" tunable to 1. The LCP2 algorithm calculates the imbalance of a node by totalling the squares of the distances between each IP on the node. The IP distance is defined as the length longest common prefix (LCP) of bits that is found when comparing 2 IPs. The imbalance of a cluster is the maximum imbalance for any node. At each step the algorithm selects an allocation to the IP/node combination that results in the choosing the allocation that best reduces the imbalance of the cluster. The implementation splits out the IP allocation part of ctdb_takeover_run() into new function ctdb_takeover_run_core(), and then extracts out the basic IP assignment code into new functions basic_allocate_unassigned() and basic_failback(). 3 new functions lcp2_init(), lcp2_allocate_unassigned() and lcp2_failback() implement the LCP2 algorithm, and are hooked into ctdb_takeover_run_core(). Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 61fc7fbd0235469df22deb6581c6bd47e30bc0be)	2011-07-29 09:01:17 +10:00
Michael Adam	827e871ec4	ctdb_private.h: add record flag CTDB_REC_FLAG_AUTOMATIC This is a flag that shall signa that a record has been automatically generated by ctdb and not by an explicit client store operation. This will be used in the ctdb_ltdb_fetch operation which stores an empty record with default initial header before trying to migrate the record from the dmaster when the record does not exist in the local tdb. (This used to be ctdb commit 46381a3cb58ccc11422af8f7798c80ea8d72294f)	2011-03-14 13:35:51 +01:00
Michael Adam	9e8d6b82b5	server: Use the ctdb_ltdb_store_server() in the ctdb daemon for non-persistent dbs This is realized by adding a ctdb_ltdb_store_fn function pointer to the db context and filling it in the attach procedure for non-persistent dbs. (This used to be ctdb commit df49ec44de80affa5ccc637dec12a20a26e8706e)	2011-03-14 13:35:50 +01:00
Michael Adam	a6b13b21c1	client: add accessor function ctdb_header_from_record_handle(). (This used to be ctdb commit cf57efd440ccc3db381386f4749bfcbf8ac5ecae)	2011-03-14 13:35:50 +01:00
Michael Adam	50bd249990	vacuum: add ctdb_local_schedule_for_deletion() (This used to be ctdb commit b70bc141d84f7355d2c6c901961b7366db566980)	2011-03-14 13:35:49 +01:00
Michael Adam	8569fcbc83	server: implement a new control SCHEDULE_FOR_DELETION to fill the delete_queue. (This used to be ctdb commit 680223074e992b32ccf6f42cb80c3fa93074fee7)	2011-03-14 13:35:49 +01:00
Michael Adam	46a05397a4	control: add a new control opcode CTDB_CONTROL_SCHEDULE_FOR_DELETION (This used to be ctdb commit 4cebfa33db3c7effa087f753530c52b2dd8550e6)	2011-03-14 13:35:49 +01:00
Michael Adam	77d4d156d3	control: add macro CHECK_CONTROL_MIN_DATA_SIZE. This is for the control dispatcher to check whether the input data has a required minimum size. (This used to be ctdb commit 2038e745db33cc5c3b4e2db8a00a57ede03906a2)	2011-03-14 13:35:49 +01:00
Michael Adam	9d20f76052	Add a tunable VacuumFastPathCount. This will control how many fast-path vacuuming runs wil have to be done, before a full vacuuming will be triggered, i.e. one with a db-traversal. (This used to be ctdb commit 0d997ec7e61a7bee2cb05456f9c7d5e6f7a44797)	2011-03-14 13:35:47 +01:00
Michael Adam	cd061f3dee	Add a delete_queue to the ctdb database context struct. This list will be filled by the client using a new delete control. The list will then be used to implement a fast-path vacuuming that will traverse this list instead of traversing the database. (This used to be ctdb commit 9bbedf786b26bb074f668b31f29a9032af958673)	2011-03-14 13:35:45 +01:00
Michael Adam	f7eeb42219	add a new record flag CTDB_REC_FLAG_VACUUM_MIGRATED. This is to be used internally. The purpose is to flag a record as been migrated by a VACUUM_MIGRATION, which is triggered by a VACUUM_FETCH message as part of the vacuuming. The local store routine will base its decision whether to delete or to store the record (among other things) upon the value of this flag. This flag should never be stored in the local database copies. (This used to be ctdb commit dd2449c422f323f9b5485e45107a9cc5acc09e08)	2011-03-14 13:35:44 +01:00
Michael Adam	f3fbd31d85	call: Move definition of call flags down to the definition of the flags field. (This used to be ctdb commit 86c844fb08a7fd33e94f56b8d5e43278120e1162)	2011-03-14 13:35:44 +01:00
Michael Adam	a2c11d6edc	call: add new call flag CTDB_CALL_FLAG_VACUUM_MIGRATION This is to be used when the CTDB_SRVID_VACUUM_FETCH message triggers the migration of deleted records to the lmaster. The lmaster can then delete records that have not been migrated with data instead of storing them. (This used to be ctdb commit 455cc6616e10b7f09589f9b87cb60f591bb502b0)	2011-03-14 13:35:44 +01:00
Ronnie Sahlberg	8acb677c9c	Deferred attach : at early startup, defer any db attach calls until we are out of recovery. (This used to be ctdb commit eeaabd579841f60ab2c5b004cbbb1f5de2bfe685)	2011-03-01 12:13:34 +11:00
Michael Adam	2bd04f0ff8	persistent: add ctdb_persistent_finish_trans3_commits(). This function walks all databases and checks for running trans3 commits. It sends replies to all of them (with error code) and ends them. To be called when a recovery finishes. (This used to be ctdb commit 70ba153b532528bdccea70c5ea28972257f384c1)	2011-02-24 10:35:26 +01:00
Michael Adam	ace1efb878	persistent: add a ctdb_persistent_state member to the ctdb_db context. To be used for tracking running transaction commits through recoveries. (This used to be ctdb commit 1237e15df4af58a3d220eea42a4b75e21e65029f)	2011-02-24 10:35:25 +01:00
Ronnie Sahlberg	65f44e159f	Add two new flags for the ltdb header. One of which signals that the record has never been migrated to/from a node while containing data. This property "has never been migrated while non-zero" is important later to provide heuristics on which records we might be able to purge from the tdb files cheaply, i.e. without having to rely on the full-blown database vacuum. These records are belived to be very common and the pattern would look like this : 1, no record exists at all. 2, client opens a file 3, samba requests the record for this file 4, an empty record is created on the LMASTER 5, the empty record is migrated to the DMASTER 6, samba writes a <sharemode> to the record locally and the record grows 7, client finishes working the file and closes the file 8, samba removes the sharemode and the record becomes empty again. 9, much later : vacuuming will delete the record At stage 8, since the record has never been migrated onto a node wile being non-zero it would be safe, and much more efficient to just delete the record completely from the database and hand it back to the LMASTER. The flags occupy the same uint32_t as was previously used for laccessor/lacount in the header. For now, make sure the flags only define/use the top 16 bits of this field so that we are sure we dont collide with bits set to one from previous generations of the ctdb cluster database prior to this change in semantics of this word. This is a rework of Michaels patch : commit 2af1a47cbe1a608496c8caf3eb0c990eb7259a0d Author: Michael Adam <obnox@samba.org> Date: Tue Nov 30 17:00:54 2010 +0100 add a DEFAULT record flag and a MIGRATED_WITH_DATA record flag. (This used to be ctdb commit e075670dee8e6ecaba54986f87a85be3d0528b6b)	2011-02-18 10:14:56 +11:00
Ronnie Sahlberg	b57bd0f896	Remove LACOUNT and LACCESSOR and migrate the records immediately. This concept didnt work out and it is really just as expensive as a full migration anyway, without the benefit of caching the data for subsequence accesses. Now, migrate the records immediately on first access. This will be combined with a "cheap vacuum-lite" for special empty records to prevent growth of databases. Later extensions to mimic read-only behaviour of records will include proper shared read-only locking of database records, making the laccessor/lacount read-only access to the data obsolete anyway. By removing this special case and handling of lacount laccessor makes the codapath where shared read-only locking will be be implemented simpler, and frees up space in the ctdb_ltdb header for use by vacuuming flags as well as read-only locking flags. (This used to be ctdb commit 155dd1f4885fe142c6f8bd09430f65daf8a17e51)	2011-02-18 10:08:32 +11:00
Ronnie Sahlberg	0f33605866	LockWait congestion. Add a dlist to track all active lockwait child processes. Everytime creating a new lockwait handle, check if there is already an active lockwait process for this database/key and if so, send the new request straight to the overflow queue. This means we will only have one active lockwaic child process for a certain key, even if there were thousands of fetch-lock requests for this key. When the lockwait processing finishes for the original request, the processing in d_overflow() will automagically process all remaining keys as well. Add back a --nosetsched argument to make it easier to run under gdb (This used to be ctdb commit 3e9317a2e1f687b04bf51575d47fcd4faa6e6515)	2011-01-24 12:21:58 +11:00
Rusty Russell	e57362ecf4	ctdb_lockwait: create overflow queue. Once we have more than 200 children waiting on a particular db, don't create any more. Just put them on an overflow queue, and when a child gets a lock search that queue to see if others were after the same lock (they probably were). (This used to be ctdb commit 5e614e8cfd1e9a4b13035a0e400b7a60a745b510)	2011-01-24 12:21:50 +11:00
Ronnie Sahlberg	fcd98a7e59	LIBCTDB: add support for traverse (This used to be ctdb commit 9463e04038ba36792583f83bd95c1af322dc283a)	2011-01-14 17:38:56 +11:00
Ronnie Sahlberg	c4006ce844	Add ctdb_fork(0 which will fork a child process and drop the real-time scheduler for the child. Use ctdb_fork() from callers where we dont want the child to be running at real-time privilege. (This used to be ctdb commit 58795a4c9e0624e20fa3e0023b65127053edd103)	2011-01-11 07:40:41 +11:00
Ronnie Sahlberg	ea0df6d882	Revert scheduling back to use real-time processes Revert this patch: commit 482c302d46e2162d0cf552f8456bc49573ae729d We may need to use real-time processes for the main daemon and the recovery daemon to handle the cases where systems come under very high loads. (This used to be ctdb commit 08bef9dcab6e4da15fc783f8624e5ed09aa060b5)	2011-01-11 07:40:35 +11:00
Ronnie Sahlberg	c69ada0090	add a new ctdb_ltdb function to delete a record in a normal database (This used to be ctdb commit fe9070ec9be69e6a6fcbf9899e7ced24541c9c3a)	2010-12-07 15:32:30 +11:00
Ronnie Sahlberg	83e68b62dd	delay loading the public ip address file until after we have started the transport and discovered ouw own pnn number (This used to be ctdb commit 1b57fc866fc836b5dbd3ef7b646e5a0f4280e81e)	2010-11-10 14:55:24 +11:00
Ronnie Sahlberg	5f76f3c0e2	Add a new tunable : DisableIPFailover that when set to non 0 will stopp any ip reallocations at all from happening. (This used to be ctdb commit d8d37493478a26c5f1809a5f3df89ffd6e149281)	2010-11-10 14:55:24 +11:00
Ronnie Sahlberg	5ef29f9f25	Update latency countes to show min/max and average (This used to be ctdb commit 1919e949af4641ffe919123e44b02fb87c13ab9f)	2010-10-11 15:12:24 +11:00
Ronnie Sahlberg	3ba7ac13eb	Create a tunable for how often to collect rolling statistics and initialize it to 1 second (This used to be ctdb commit cb8c779bb5d9862abbe08919aa181a1a1b2bef18)	2010-09-30 15:00:57 +10:00
Ronnie Sahlberg	9f66a93f12	Add rolling statistics that are collected across 10 second intervals. Add a new command "ctdb stats [num]" that prints the [num] most recent statistics intervals collected. (This used to be ctdb commit e6e16fcd5a45ebd3739a8160c8fb5f44494edb9e)	2010-09-29 12:14:45 +10:00
Ronnie Sahlberg	41b6e09fb1	Add a new statistics structure to keep the current running statistics (This used to be ctdb commit 09e5a2fb47c312f71f455cdbf8d9cabcca1041a4)	2010-09-29 12:14:35 +10:00
Ronnie Sahlberg	39c367a68f	Create macros to update the statistics counters and use these macros everywhere instead of manipulating the coutenrs directly. (This used to be ctdb commit 2e648df890e5713bc575965d87937827b068d0d7)	2010-09-29 12:14:24 +10:00
Ronnie Sahlberg	c6e20a06c7	set up a handler to catch and log debug messages from the tevent layer (This used to be ctdb commit fdb4c02f595fa207310a9a48da3fefd653fa9e4b)	2010-09-28 08:30:26 +10:00
Ronnie Sahlberg	22ea35f17d	adda GETPUBLICIPS control to libctdb and use this in the test example enhance the test example to show the new releaseip/takeip messages (This used to be ctdb commit 21cc57883e6c02b0e037211b26d1d866d5d7f03d)	2010-09-15 14:58:11 +10:00
Stefan Metzmacher	0b5bd411ca	server/banning: also release all ips if we're banning ourself metze (This used to be ctdb commit c386f2c62f06f1c60047b7d4b1ec7a9eec11873c)	2010-09-14 15:50:31 +10:00
Ronnie Sahlberg	d8d8b9e1d7	add a new serverid to send a message everytime an ip address is taken on the local node (This used to be ctdb commit 1261f3d9702800a4e59550c881350daf479f00ef)	2010-09-13 15:43:19 +10:00
Ronnie Sahlberg	991a6ae2a0	Update the comment for the range reserved for SAMBA and define a new symbol to represent this range similarly to NFSD and ISCSID Keep the old symbol name to be backward compatible with software using these headers. (This used to be ctdb commit 2ce34e50d057ba95249117a581658a5ad7e8eb60)	2010-09-13 15:10:36 +10:00
Ronnie Sahlberg	09a08b0da3	define and reserve a range of ctdb message ports for use by nfs and iscsi servers (This used to be ctdb commit 84a44ac8ee74dd7af15e378c6cafbedb95feec60)	2010-09-13 15:10:24 +10:00
Ronnie Sahlberg	65382a59d1	Add two new server types to the server_id structure. NFSD and ISCSID for now. (This used to be ctdb commit 4cd4bab68f0ba0305a585a2aabcb6871cdb11d96)	2010-09-13 15:10:12 +10:00
Ronnie Sahlberg	a2c874bd61	Implement a new function GETNODEMAP in libctdb. This function returns a pointer to a nodemap structure. The returned structure must later be freed by calling ctdb_free_nodemap(). Move the definition of ctdb_sock_addr from ctdb_client.h to ctdb_protocol.h Move the definition of the node flags, ctdb_node_and_flags and ctdb_node_map from ctdb_private.h to ctdb_protocol.h Add both sync and async example for ctdb_getnodemap to the test application libctdb/tst.c (This used to be ctdb commit 31c10eb2b337fd7d8a97a1f9e69b0e7570fec71d)	2010-09-13 14:32:11 +10:00
Ronnie Sahlberg	c95f4258d8	Add a new event "ipreallocated" This is called everytime a reallocation is performed. While STARTRECOVERY/RECOVERED events are only called when we do ipreallocation as part of a full database/cluster recovery, this new event can be used to trigger on when we just do a light failover due to a node becomming unhealthy. I.e. situations where we do a failover but we do not perform a full cluster recovery. Use this to trigger for natgw so we select a new natgw master node when failover happens and not just when cluster rebuilds happen. (This used to be ctdb commit 7f4c591388adae20e98984001385cba26598ec67)	2010-08-30 18:09:30 +10:00
Ronnie Sahlberg	2e8aac6689	Merge commit 'rusty/ports-from-1.0.112' into foo (This used to be ctdb commit 13e58d92f5f1723e850a82ae030d0ca57e89b1ee)	2010-08-19 13:17:56 +10:00
Ronnie Sahlberg	4c05f1900c	Merge commit 'rusty/vacuum-fix-master' (This used to be ctdb commit dc301b324d2c14a2425a965c076113c4fe97903e)	2010-08-19 13:16:35 +10:00
Ronnie Sahlberg	5aa5f3e7bf	Remove the structure ctdb_control_tcp_vnn since this is identical to the structure ctdb_tcp_connection. Add a new "ctdb deltickle" command to delete tickles from the database. This can ONLY be used for tickles created by "ctdb addtickle". Push any "addtickle/deltickle" updates to other nodes every TickleUpdateInterval seconds' (This used to be ctdb commit acded034e2f0dcae4c2c9e54e16a001caf23caec)	2010-08-18 12:36:03 +10:00
Rusty Russell	9fbb191b78	logging: give a unique logging name to each forked child. This means we can distinguish which child is logging, esp. via syslog where we have no pid. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 68b3761a0874429b90731741f0531f76dcfbb081)	2010-08-18 11:46:32 +09:30
Rusty Russell	af55c910a4	freeze: abort vacuuming when we're going to freeze. There are some reports of freeze timeouts, and it looks like vacuuming might be the culprit. So we add code to tell them to abort when a freeze is going on. (This is based on the 1.0.112 branch version 517f05e42f, but far simpler since tdb is now robust against processes being killed during transaction commit) CQ:S1018154 & S1018349 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit f5d7dc679501e607c2c83a248a89d3cada9df146)	2010-08-18 10:54:28 +09:30
Ronnie Sahlberg	ddf3c621c1	Merge commit 'rusty/libctdb-new' into foo (This used to be ctdb commit 1566d2d23ab698896b3b6a76974a5c7452db4a62)	2010-08-18 09:53:52 +10:00
Rusty Russell	f93440c4b7	event: Update events to latest Samba version 0.9.8 In Samba this is now called "tevent", and while we use the backwards compatibility wrappers they don't offer EVENT_FD_AUTOCLOSE: that is now a separate tevent_fd_set_auto_close() function. This is based on Samba version `7f29f817fa`. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 85e5e760cc91eb3157d3a88996ce474491646726)	2010-08-18 09:16:31 +09:30
Rusty Russell	a65cb6a9ae	libctdb: add synchronous message handling and unregister, with tests. It turns out that we do want a separate private arg for the message handler and the completion callback, so we change that. We also fix the prototypes of the remove_message functions as we implement them. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 332375246eccd95da626f434f6d49dd9458a9787)	2010-08-09 15:41:32 +09:30
Ronnie Sahlberg	c5de7cfb8c	Merge commit 'rusty/master' (This used to be ctdb commit b4391c00476cde74101736986dfcd2be6c959edc)	2010-07-30 16:25:40 +10:00
Evan Kinney	0557c418e3	ctdb: Fixed use of reserved word "private" in typedefs In include/ctdb.h, ctdb_callback_t and ctdb_rrl_callback_t were defined with a void private variable. The variable name was changed to void private_data to avoid issues encountered in the Samba autoconf script. Evan Kinney <evan.kinney@sas.com> (This used to be ctdb commit 1f453aa4b5e749468c7788afac09c6f0900ea18f)	2010-07-29 17:16:36 +10:00
Rusty Russell	7061ceffd8	Report client for queue errors. We've been seeing "Invalid packet of length 0" errors, but we don't know what is sending them. Add a name for each queue, and print nread. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e6cf0e8f14f4263fbd8b995418909199924827e9)	2010-07-01 23:08:49 +10:00
Rusty Russell	8946028a07	speed startup: add --sloppy-start. The extra recovery interval wait was introduced in 821333afb458 but no explanation was provided in that message. Nonetheless, if starting the entire cluster for the first time, it should be safe to skip this. We use the commandline arg --sloppy-start which should discourage people from using it outside testing. Seconds between ctdbd first log message and node healthy: BEFORE: 16.10 AFTER: 4.03 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 509e2e89ae233a0e91998d95267bf62f296a73cd)	2010-06-22 22:52:34 +09:30
Rusty Russell	cfe0edc0b9	libctdb: implement synchronous readrecordlock interface. Because this doesn't use a generic callback, it's not quite as trivial as the other sync wrappers. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 1f20b938d46d4fcd50d2b473c1ab8dc31d178d2d)	2010-06-21 14:47:34 +09:30
Rusty Russell	b93e65eaf7	libctdb: implement ctdb_disconnect and ctdb_detachdb These are important for testing, since we can easily tell if we leak memory if there are outstanding allocations after calling these. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 18a212aa40d0ff9ff59775c6fcf9dc973e991460)	2010-06-18 15:35:52 +09:30
Rusty Russell	5f9e4b60ae	Delay reusing ids to make protocol more robust Ronnie and I tracked down a bug which seems to be caused by a node running so slowly that we timed out the request and reused the request id before it responded. The result was that we unlocked the wrong record, leading to the following: ctdbd: tdb_unlock: count is 0 ctdbd: tdb_chainunlock failed smbd[1630912]: [2010/06/08 15:32:28.251716, 0] lib/util_sock.c:1491(get_peer_addr_internal) ctdbd: Could not find idr:43 ctdbd: server/ctdb_call.c:492 reqid 43 not found This exact problem is now detected, but in general we want to delay id reuse as long as possible to make our system more robust. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 9eb9c53ef29f4871ae2fe62fc5cb6145fca89eed)	2010-06-10 08:58:55 +09:30
Rusty Russell	7589b58138	libctdb: more bool conversion, and accompany lock by ctdb_db in API I missed some int->bool conversions previously, particularly the return of ctdb_writerecord(). By always handing functions ctdb_connection or ctdb_db, we keep it consistent with the rest of the API and can do extra lock consistency checks. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 3f939956ddd693cba6ea5c655288f4f5ca95f768)	2010-06-08 17:11:40 +09:30
Rusty Russell	866cca9637	libctdb: clarify logging levels Now we have more messages, it seems to make sense to document their usage and make them consistent. In particular, LOG_CRIT for internal libctdb problems, LOG_ALERT for API misuse. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit a6fed3f577c7ec51df38ed15ecb9db6ea2ae7c8f)	2010-06-08 16:53:17 +09:30
Ronnie Sahlberg	b9e5c8a47b	Split ctdb_release_lock() into a function to release the locvk and another function to free the data structures. This allows us to keep the datastructure valid after the lock has been released by the application and we can trap and warn when the application is accessing the lock after it has been released. I.e. application bugs. (This used to be ctdb commit 463a266205f145cd9c4c36b9c59d3747eeef0e2e)	2010-06-05 15:38:11 +10:00
Rusty Russell	3510980049	libctdb: documentation Full documentation for all the functions. This looks longer than it is, because it sorts them into async and sync parts, and also renames some formal parameters. Added TODO to libctdb directory to track our plans. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 108e9c2450876a9f8821aa7efd5be971eee5afd3)	2010-06-04 20:30:08 +09:30
Rusty Russell	c5b4768816	libctdb: use values from ctdb_protocol.h, don't re-declare We're best off including ctdb_protocol.h to get these, even if we document the important ones in ctdb.h. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit cdc19dc73032470d57f38bf825d8113b3a0c8cd1)	2010-06-04 20:22:03 +09:30
Rusty Russell	3a569c14bc	libctdb: use bool in API Return bool instead of -1/0; that's what the young kids are doing these days! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit e285b5d5a9d4fbc4f75dbb237d2fcdbd84f2d605)	2010-06-04 20:19:25 +09:30
Rusty Russell	379fd4e606	libctdb: add logging infrastructure This is based on Ronnie's work, merged with mine. That means errors are all my fault. Differences from Ronnie's: 1) use syslog's LOG_ levels directly. 2) typesafe arg to log function, and use it (eg stderr) in helper function. 3) store fn in ctdb context, and expose ctdb_log_level directly thru API. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 86259aa395555aaf7b2fae7326caa2ea62961092)	2010-06-04 20:27:03 +09:30
Rusty Russell	cc8435852c	libctdb: add ctdb arg to more functions. This is going to help for logging, since we want it there. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 0786152472bc43efae4c896f7c6c07c6e080b9b2)	2010-06-04 16:54:08 +09:30
Rusty Russell	94df6f322d	libctdb: change callback for ctdb_readrecordlock. After discussion with Ronnie, we decided to revisit this interface. We use the name ctdb_readrecordlock_async, as it is not always a send, and we use a specific callback to avoid the "fake request" creation on the fast path. The request itself is never exposed: this means it can't be cancelled, but we can revisit that later if need be. This makes both use and implementation simpler. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 03b5546ae45a60ab41eb4f7159a45bfdbf959888)	2010-06-04 13:33:08 +09:30
Ronnie Sahlberg	2d4b98381f	ctdb_req_control contains 4 padding bytes. Create an explicit pad variable here and set it to 0 when creating a control to keep valgrind happy. PDUs are padded to 8 byte boundary. If padding is used, memset it to 0 to keep valgrind happy. (This used to be ctdb commit 8818d5c483558c0faa6a3923ed5e675fdcfc13af)	2010-06-02 16:49:05 +10:00
Ronnie Sahlberg	f94291c37d	Make the call to free the request explicit in the callback instead of implicit (This used to be ctdb commit 573e4e2d2bd09dd9579150cce926de774a0b609c)	2010-06-02 13:49:34 +10:00
Ronnie Sahlberg	53ea238c6c	Add a variable for start/current time to ctdb statistics and print the time startistics was taken and for how long the statistics have been collected to the "ctdb statistics" output. (This used to be ctdb commit 1bdfe0cd3370a335b960ce1ef97eade93b0cd2fa)	2010-06-02 13:14:53 +10:00
Ronnie Sahlberg	8666094e92	add a function to read the current socketname from the ctdb structure (This used to be ctdb commit 112d252b2ab614eeac38e4a1658cd1e85f6eb829)	2010-06-02 10:25:31 +10:00
Ronnie Sahlberg	3c7350b8c6	rename ctdb_remove_message_handler to ctdb_client_remove_message_handler to avoid conflict with the function of the same name in libctdb (This used to be ctdb commit 636ed76d04c8c499a911eb0d72d54b71b0a73d31)	2010-06-02 10:05:58 +10:00
Ronnie Sahlberg	f1b8bd94bb	rename ctdb_message_fn_t to ctdb_msg_fn_t to avoid a conflict with the type of the same name used in libctdb (This used to be ctdb commit 49e23f8329649e4d9eefab47c9b158fcc7210d07)	2010-06-02 10:00:58 +10:00
Ronnie Sahlberg	bc208bc916	rename ctdb_set_message_handler to ctdb_client_set_message_handler to avoid a colission with the function of the same name in libctdb (This used to be ctdb commit 41dbdd4fc0ab560420fb0e24a3179ff7c94c5bb7)	2010-06-02 09:51:47 +10:00
Ronnie Sahlberg	761a075de9	rename ctdb_send_message to ctdb_client_send_message to resolve colission with the function of the same name in libctdb (This used to be ctdb commit ac3292c12832484a22715f1d46aa23f3b7c8a6f6)	2010-06-02 09:45:21 +10:00
Ronnie Sahlberg	bdbf7077e8	rename ccan/typesafe_cb.h to ctdb_typesafe_cb.h and add this file to the install/rpm (This used to be ctdb commit 96f186240a17386de1e02eb3af392d97bb55a1ae)	2010-06-02 09:18:48 +10:00
Rusty Russell	bd8d302589	libctdb: tweak interface for readrecordlock Previously we could hang in poll with the callback pending (since we fake it): explicitly call it immediately. Note: I experienced corruption using DLIST_ADD_END (ctdb->pnn was blatted when adding to the message_handler list). I switched them all to DLIST_ADD, but maybe I'm using it wrong? (This used to be ctdb commit 3727165f0d206999d2cfc2800ff8868640868c7c)	2010-05-24 13:52:17 +09:30
Rusty Russell	30f4d01df1	libctdb: uniform callbacks, _recv functions to pull out data. This is a bit tricky for those cases where we need to do multiple or zero I/Os (eg. attachdb and readrecordlock), but works well for the simple cases. (This used to be ctdb commit ebe4dd724338c156423cfdcc10a75b68c2084cde)	2010-05-24 13:17:36 +09:30
Rusty Russell	7046a1ad0a	libctdb: API changes from Ronnie's version These simplifications mostly came up due to the implementation. o Rename ctdb_context to ctdb_connection. We already have a ctdb_context internally in ctdbd; don't confuse them! o Rename ctdb_handle to struct ctdb_request. From the user POV it's a request, and it's also useful internally to avoid implicit cast to/from void *. o Rename ctdb_db_context to ctdb_db. o Introduce ctdb_lock. This provides an explicit "lock object" you get from readrecordlock and have to hand to those functions which need you to hold a lock. o status args are "int" not int32_t. Should this be a bool? o Remove last traces on generic callback. Without semi-sync API, this doesn't help anything and loses type safety. o Remove the semi-async API. We can add this later, but I think a sync and async API is enough for our poor users for the moment :) o Registering a message handler also takes a callback. This way you can tell if it failed. Not sure if this is overkill, but it's consistent. o ctdb_service() takes an revents arg Strictly not necessary for a nonblocking fd, but nice to know if a read or write is possible. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 86e1f93df856f9627182ed0e18bfcff6866c0954)	2010-05-20 16:07:30 +09:30
Rusty Russell	bbfb992f55	libctdb: ctdb.h and tst.c from Ronnie This imports ctdb.h and tst.c from Ronnie's work: it's a separate commit for now to make the changes obvious. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 09f05cbfc883e5aac33d3781b163cde178ece4cf)	2010-05-20 16:01:28 +09:30
Rusty Russell	d5f6026a22	libctdb: reorganize headers: remove ctdb.h, add ctdb_client.h and ctdb_protocol.h ctdb_client.h is the existing internal client interface (which was mainly in ctdb.h), and ctdb_protocol.h is the information needed for the wire protocol only. ctdb.h will be the new, shiny, libctdb API. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 4bba6b8cd47b352f98d41f9f06258d5ac3c9adef)	2010-05-20 15:18:30 +09:30
Ronnie Sahlberg	6f1221e9e1	Add the number of performed recoveries to the "ctdb statistics" output. (This used to be ctdb commit fa045733cb81412f0d02ab52d74eabc7efca8b3d)	2010-05-11 09:44:53 +10:00
Rusty Russell	72c275dd70	ctdb: use full range of IDR This resolves a problem with huge numbers of requests which could overflow 16 bits. Fortunately, the IDR should scale reasonably well, so we can simply hold all the requests. Although noone checks for failure, I added a constant for that. BZ: 60540 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 72efc4122e37798227c3420a65ed1f706ca9ebe7)	2010-05-11 09:44:43 +10:00
Ronnie sahlberg	46f00a2478	Merge commit 'rusty/signal-fix' (This used to be ctdb commit 221a9bb41c3a7af0cc65cda78365010893ca1430)	2010-05-03 15:57:41 +10:00
Ronnie Sahlberg	4a43428440	The recent change to the recovery daemon to keep track of and verify that all nodes agree on the most recent ip address assignments broke "ctdb moveip ..." since that call would never trigger a full takeover run and thus would immediately trigger an inconsistency. Add a new message to the recovery daemon where we can tell the recovery daemon to update its assignments. BZ62782 (This used to be ctdb commit e7069082e5f0380dcddee247db8754218ce18cab)	2010-05-03 15:47:17 +10:00
Rusty Russell	e1b59b6a47	eventscript: don't do debugging system() from inside signal handler In the case of a timeout, we dump a log of what's happening to a file in /tmp. We do it from the signal handler, which is an unreliable hack (BZ58365). Instead, create another (lower-priority) child to do the dump, then kill the timedout script. Note that this doesn't quite work as intended (the dump is often run after the script has been killed), so the next patch resolves this. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (This used to be ctdb commit 7ee5ecc8d53e78e2dec21197b74a74cc4ae1834c)	2010-04-08 15:13:29 +09:30
Ronnie Sahlberg	06885ea9a7	In the recovery daemon, keep track of which node we have assigned public ip addresses and verify that the remote nodes have/keep a consistent view of assigned addresses. If a remote node has an inconsistent view of addresses visavi the recovery master this will trigger a full ip reallocation. (This used to be ctdb commit f3bf2ab61f8dbbc806ec23a68a87aaedd458e712)	2010-04-08 14:25:26 +10:00
Stefan Metzmacher	3419e9c4dd	server: add "setup" event This is needed because the "init" event can't use 'ctdb' commands. metze (This used to be ctdb commit 1493436b6b24eb05a23b7a339071ad85f70de8f4)	2010-02-23 10:38:49 +01:00
Stefan Metzmacher	98ee69c66d	server: add updateip event metze (This used to be ctdb commit 712ed0c4c0bff1be9e96a54b62512787a4aa6259)	2010-01-20 11:11:01 +01:00
Stefan Metzmacher	32d00d0a0d	controls: add stups for GET_PUBLIC_IP_INFO, GET_IFACES and SET_IFACE_LINK_STATE metze (This used to be ctdb commit a2c9e4578e149eccb2c6183f64a6b657eb95c5e1)	2010-01-20 11:10:59 +01:00

1 2 3 4 5 ...

651 Commits