haproxy

Author	SHA1	Message	Date
Willy Tarreau	e09101e8d9	BUILD: lua: silence some compiler warnings about potential null derefs (#2 ) Here we make sure that appctx is always taken from the unchecked value since we know it's an appctx, which explains why it's immediately dereferenced. A missing test was added to ensure that task_new() does not return a NULL. This may be backported to 1.8.	2018-10-16 17:39:05 +02:00
Willy Tarreau	526aed219f	Revert "BUILD: lua: silence some compiler warnings about potential null derefs" This reverts commit f1ffb39b614b0d9654c9450ac6e8c88cfc942784. It breaks Lua causing some timeouts. Removing the __unreachable() statement from WILL_LJMP() fixes it. It's very strange and unclear whether it's an issue with WILL_LJMP() not fulfilling its promise of not returning, if the code emitted with __unreachable() gets broken, or anything else. Let's revert this for now.	2018-10-16 17:32:55 +02:00
Willy Tarreau	a9c0252b2e	BUG/MEDIUM: threads: fix thread_release() at the end of the rendez-vous point There is a bug in this function used to release other threads. It leaves the current thread marked as harmless. If after this another thread does a thread_isolate(), but before the first one reaches poll(), the second thread will believe it's alone while it's not. This must be backported to 1.8 since the rendez-vous point was merged into 1.8.14.	2018-10-16 17:03:16 +02:00
Willy Tarreau	e18db9e984	MEDIUM: pools: implement a thread-local cache for pool entries Each thread now keeps the last ~512 kB of freed objects into a local cache. There are some heuristics involved so that a specific pool cannot use more than 1/8 of the total cache in number of objects. Tests have shown that 512 kB is an optimal size on a 24-thread test running on a dual-socket machine, resulting in an overall 7.5% performance increase and a cache miss ratio reducing from 19.2 to 17.7%. Anyway it seems pointless to keep more than an L2 cache, which probably explains why sizes between 256 and 512 kB are optimal. Cached objects appear in two lists, one per pool and one LRU to help with fair eviction. Currently there is no way to check each thread's cache state nor to flush it. This cache cannot be disabled and is enabled as soon as the lockless pools are enabled (i.e.: threads are enabled, no pool debugging is in use and the CPU supports a double word CAS).	2018-10-16 13:46:08 +02:00
Willy Tarreau	146794dc4f	MINOR: pools: split pool_free() in the lockfree variant This separates the validity tests from the code committing the object to the pool, in order to ease insertion of the thread-local cache.	2018-10-16 10:29:28 +02:00
Willy Tarreau	0a93b6413f	MINOR: pools: allocate most memory pools from an array For caching it will be convenient to have indexes associated with pools, without having to dereference the pool itself. One solution could consist in replacing all pool pointers with integers but this would limit the number of allocatable pools. Instead here we allocate the 32 first pools from a pre-allocated array whose base address is known so that it's trivial to convert a pool to an index in this array. Pools that cannot fit there will be allocated normally.	2018-10-16 10:29:26 +02:00
Bertrand Jacquin	d5e4de8e5f	DOC: Fix a few typos These are mostly spelling mistakes; some of them might be candidates for backporting as well.	2018-10-15 19:38:15 +02:00
Willy Tarreau	8d8747abe0	OPTIM: tasks: group all tree roots per cache line Currently we have per-thread arrays of trees and counts, but these ones unfortunately share cache lines and are accessed very often. This patch moves the task-specific stuff into a structure taking a multiple of a cache line, and has one such per thread. Just doing this has reduced the cache miss ratio from 19.2% to 18.7% and increased the 12-thread test performance by 3%. It starts to become visible that we really need a process-wide per-thread storage area that would cover more than just these parts of the tasks. The code was arranged so that it's easy to move the pieces elsewhere if needed.	2018-10-15 19:06:13 +02:00
Willy Tarreau	b20aa9eef3	MAJOR: tasks: create per-thread wait queues Now we still have a main contention point with the timers in the main wait queue, but the vast majority of the tasks are pinned to a single thread. This patch creates a per-thread wait queue and queues a task to the local wait queue without any locking if the task is bound to a single thread (the current one) otherwise to the shared queue using locking. This significantly reduces contention on the wait queue. A test with 12 threads showed 11 ms spent in the WQ lock compared to 4.7 seconds in the same test without this change. The cache miss ratio decreased from 19.7% to 19.2% on the 12-thread test, and its performance increased by 1.5%. Another indirect benefit is that the average queue size is divided by the number of threads, which roughly removes log(nbthreads) levels in the tree and further speeds up lookups.	2018-10-15 19:04:40 +02:00
Willy Tarreau	87d54a9a6d	MEDIUM: fd/threads: only grab the fd's lock if the FD has more than one thread The vast majority of FDs are only seen by one thread. Currently the lock on FDs costs a lot because it's touched often, though there should be very little contention. This patch ensures that the lock is only grabbed if the FD is shared by more than one thread, since otherwise the situation is safe. Doing so resulted in a 15% performance boost on a 12-threads test.	2018-10-15 13:25:06 +02:00
Willy Tarreau	9504dd64c6	MINOR: config: use atleast2() instead of my_popcountl() where relevant Quite often we used my_popcountl() just to check for > 1 bit set. Now we have an easier solution, let's use it.	2018-10-15 13:25:06 +02:00
Willy Tarreau	98d334bd94	MINOR: tools: add a new function atleast2() to test masks for more than 1 bit For threads it's common to have to check if a mask contains more than one bit set. Let's have this "atleast2()" function report this.	2018-10-15 13:25:06 +02:00
Willy Tarreau	d944344f01	BUILD: peers: check allocation error during peers_init_sync() peers_init_sync() doesn't check task_new()'s return value and doesn't return any result to indicate success or failure. Let's make it return an int and check it from the caller. This can be backported as far as 1.6.	2018-10-15 13:24:43 +02:00
Willy Tarreau	848522f05d	BUILD: stick-table: make sure not to fail on task_new() during initialization Gcc reports a potential null-deref error in the stick-table init code. While not critical there, it's trivial to fix. This check has been missing since 1.4 so this fix can be backported to all supported versions.	2018-10-15 13:24:43 +02:00
Willy Tarreau	a8825520b7	BUILD: ssl: fix another null-deref warning in ssl_sock_switchctx_cbk() This null-deref cannot happen either as there necessarily is a listener where this function is called. Let's use __objt_listener() to address this. This may be backported to 1.8.	2018-10-15 13:24:43 +02:00
Willy Tarreau	b729077710	BUILD: ssl: fix null-deref warning in ssl_fc_cipherlist_str sample fetch Gcc 6.4 detects a potential null-deref warning in smp_fetch_ssl_fc_cl_str(). This one is not real since already addressed a few lines above. Let's use __objt_conn() instead of objt_conn() to avoid the extra test that confuses it. This could be backported to 1.8.	2018-10-15 13:24:43 +02:00
Willy Tarreau	f1ffb39b61	BUILD: lua: silence some compiler warnings about potential null derefs These ones are on error paths that are properly handled by luaL_error() which does a longjmp() but the compiler cannot know it. By adding an __unreachable() statement in WILL_LJMP(), there is no ambiguity anymore. This may be backported to 1.8 but the previous patch (BUILD: compiler: add a new statement "__unreachable()") is needed for this.	2018-10-15 13:24:43 +02:00
Willy Tarreau	8d26f02e69	BUILD: compiler: add a new statement "__unreachable()" This statement is used as a hint for the compiler so that it knows that the location where it's placed cannot be reached. It will mostly be used after longjmp() or equivalent statements that deal with error processing and that the compiler doesn't know will not return on certain conditions, so that it doesn't complain about null dereferences on error paths.	2018-10-15 13:24:43 +02:00
Willy Tarreau	e5f229e639	BUG/MEDIUM: stream: don't crash on out-of-memory In case pool_alloc() fails in stream_new(), we try to detach the stream from the list before it has been added, dereferencing a NULL. In order to fix it, simply move the LIST_DEL call upwards. This must be backported to 1.8.	2018-10-15 13:24:43 +02:00
William Lallemand	dd319a5b1d	BUG/MEDIUM: mworker: don't poll on LI_O_INHERITED listeners The listeners with the LI_O_INHERITED flag were deleted but not unbound, which is a problem since we have polling in the master. This patch unbinds every listener that is not required by the master, but does not close the FD of those that have the LI_O_INHERITED flag.	2018-10-12 19:30:18 +02:00
Willy Tarreau	b3fb56db10	MINOR: h2: add a new flag to quickly distinguish front vs back connection We will need to know if a mux was created for a front or a back connection and once it's established it's much harder, so let's introduce H2_CF_IS_BACK for this.	2018-10-12 16:58:41 +02:00
Willy Tarreau	a8e4954856	MINOR: h2: split h2c_stream_new() into h2s_new() + h2c_frt_stream_new() For backend connections we'll have to initialize streams but not allocate conn_streams since they'll already be there. Thus this patch splits the h2c_stream_new() function into one dedicated to allocation of a new stream and another one supposed to attach this stream to an existing frontend connection.	2018-10-12 16:58:01 +02:00
Willy Tarreau	0b37d658e6	MINOR: h2: retrieve the front proxy from the caller instead of the session Till now in order to figure the timeouts, we used to retrieve the proxy from the session's owner, but the new API provides it so it's better to simply take it from the caller at init time. We take this opportunity to store the pointer to the proxy into the h2 connection so that we can reuse it later when needed.	2018-10-12 16:58:01 +02:00
Willy Tarreau	7dc24e49cc	MINOR: h2: unify the mux init function The init function was split into the mux init and the front init, but it appears that most of the code will be common between the two sides when implementing the backend init. Thus let's simply make this a unique h2_init() function.	2018-10-12 16:58:01 +02:00
Willy Tarreau	6bf641a61d	MINOR: h2: don't try to send data before preface h2_snd_buf() must not accept to send data if the preface was not yet received nor sent. At the moment it doesn't happen but it can with server-side H2.	2018-10-12 16:58:01 +02:00
Willy Tarreau	7f0cc49645	CLEANUP: h2: rename h2c_snd_settings() to h2c_send_settings() It's the only function not called h2c_send_<something>() and it took me a while to find it.	2018-10-12 16:58:01 +02:00
Willy Tarreau	ab0e1da3a9	MEDIUM: h2: stop relying on H2_SS_IDLE / H2_SS_CLOSED At a few places we check these states to detect if a stream has valid data/errcode or is one of the two dummy streams (idle or closed). It will become problematic for outgoing streams as it will not be possible to report errors for example since the stream will switch from IDLE state only after sending a HEADERS frame. There is a safer solution consisting in checking the stream ID, which may only be zero in the dummy streams. This patch changes the test to only rely on the stream ID.	2018-10-12 16:58:01 +02:00
Willy Tarreau	c1f40b38a6	MINOR: chunk: add chunk_cpy() and chunk_cat() Sometimes we need to concatenate constant chunks to existing ones, but no function currently exists to do this easily, hence these two new ones.	2018-10-12 16:58:01 +02:00
Willy Tarreau	9fa267dada	MINOR: log: make sess_log() support sess=NULL At many places in muxes we'll have to add tests to check if the connection is front or back before deciding to log. Instead let's centralize this test in sess_log() to simply do nothing when sess=NULL.	2018-10-12 16:58:01 +02:00
Christopher Faulet	25da9e34f1	MINOR: h1: Add the flag H1_MF_NO_PHDR to not add pseudo-headers during parsing Some pseudo-headers are added during the headers parsing, mainly for the mux H2. With this flag, it is possible to not add them. This avoids some boring filtering in the mux H1.	2018-10-12 16:15:18 +02:00
Christopher Faulet	1dc2b49556	MINOR: h1: Change the union h1_sl to use indirect strings to store infos Instead of using offsets relating to the parsed buffer to store start line infos, we now use indirect strings. So now, these infos remain valid only if the origin buffer remains untouched. But it's not a real problem because this union is used during the parsing and never stored to a later use.	2018-10-12 16:14:57 +02:00
Christopher Faulet	08088e77c6	MINOR: conn-stream: Add CL_FL_NOT_FIRST flag This flag will be used by multiplexers to warn a conn-stream (and, by transitivity, a stream) that it is not the first one created by the mux. It will help mux H1 to handle keep-alive connections.	2018-10-12 16:09:26 +02:00
Christopher Faulet	ff08a92797	MINOR: h1: Add EOH marker during headers parsing When headers parsing ends, a pseudo header with an empty name and an empty value is added to the array of parsed headers to mark its end. It is convenient to loop on this array, but not really useful if we want to remove the last header or add a new one, because we don't really know where the last CRLF (the empty line ending the headers block) is. So now, the name of this pseudo header points to this last CRLF instead. Its length is still 0 and its value is still empty, so loops on the array remain unchanged.	2018-10-12 16:08:27 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	4212a30ad1	MEDIUM: http: Ignore http-tunnel option on backend This option is frontend-specific, so there is no reason to support it on backends. So now, it is ignored if it is set on a backend and a warning is emitted during the startup. The change is quite trivial, but the commit is tagged as MEDIUM because it is a small breakage with previous versions and configurations using this option could emit a warning now.	2018-10-12 16:05:53 +02:00
Christopher Faulet	98db9768e5	MEDIUM: http: Ignore http-pretend-keepalive option on frontend This option is backend-specific, so there is no reason to support it on frontends. So now, it is ignored if it is set on a frontend and a warning is emitted during the startup. The change is quite trivial, but the commit is tagged as MEDIUM because it is a small breakage with previous versions and configurations using this option could emit a warning now.	2018-10-12 16:01:26 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" has been removed. Now, "http_header_add_tail2" is always used. And the function "capture_headers" has been renamed to "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Christopher Faulet	702226c827	MINOR: stats: Add missing include "proto/stats.h" must include "types/stats.h".	2018-10-12 16:00:32 +02:00
Christopher Faulet	7e266c7936	MINOR: http: Move comment about some HTTP macros in the right header file HTTP_FLG_* and HTTP_IS_* were moved from "proto/proto_http.h" to "common/http.h" but the associated comment was forgotten during the move. This is 1.9-specific and should not be backported.	2018-10-12 16:00:24 +02:00
Olivier Houchard	4fdec7aafa	BUG/MEDIUM: stream: Make sure to unsubscribe before si_release_endpoint. Make sure we unsubscribe from events before si_release_endpoint destroys the conn_stream, or it will be never called. To do so, move the call to unsubscribe to si_release_endpoint() directly. This is 1.9-specific and shouldn't be backported.	2018-10-11 17:16:43 +02:00
Emeric Brun	c8c0ed91cb	BUG/MEDIUM: mworker: segfault receiving SIGUSR1 followed by SIGTERM. This bug appeared only if nbthread > 1. Handling the pipe with the master, multiple threads of the same worker could process the deinit(). In addition, deinit() was called while some other threads were still performing some tasks. This patch assigns the handler of the pipe with the master to only the first thread and removes the call to deinit() before exiting with an error. This patch should be backported in v1.8.	2018-10-11 16:29:38 +02:00
Olivier Houchard	dddfe31265	BUG/MEDIUM: h2: Make sure we're not in the send list on flow control. If we can't send data for a stream because of its flow control, make sure not to put it in the send_list, until the flow control lets it send again. This is specific to 1.9, and should not be backported.	2018-10-11 15:35:05 +02:00
Olivier Houchard	fa8aa867b9	MEDIUM: connections: Change struct wait_list to wait_event. When subscribing, we don't need to provide a list element, only the h2 mux needs it. So instead, Add a list element to struct h2s, and use it when a list is needed. This forces us to use the unsubscribe method, since we can't just unsubscribe by using LIST_DEL anymore. This patch is larger than it should be because it includes some renaming.	2018-10-11 15:34:39 +02:00
Olivier Houchard	83a0cd8a36	MINOR: connections: Introduce an unsubscribe method. As we don't know how subscriptions are handled, we can't just assume we can use LIST_DEL() to unsubscribe, so introduce a new method to mux and connections to do so.	2018-10-11 15:34:21 +02:00
mildis	5ab01cb011	BUG/MINOR: checks: queues null-deref queues can be null if calloc() failed. Bypass free* calls when calloc did fail.	2018-10-11 15:17:47 +02:00
mildis	cd2d7de44e	BUG/MINOR: h2: null-deref h2c can be null if pool_alloc() failed. Bypass tasklet_free and pool_free if pool_alloc did fail.	2018-10-11 15:17:27 +02:00
Willy Tarreau	27346b01aa	OPTIM: tools: optimize my_ffsl() for x86_64 This call is now used quite a bit in the fd cache, to decide which cache to add/remove the fd to/from, when waking up a task for a single thread in __task_wakeup(), in fd_cant_recv() and in fd_process_cached_events(), and we can replace it with a single instruction, removing ~30 instructions and ~80 bytes from the inner loop of some of these functions. In addition the test for zero value was replaced with a comment saying that it is illegal and leads to an undefined behaviour. The code does not make use of this useless case today.	2018-10-10 19:24:23 +02:00
Willy Tarreau	2325d8af93	BUG/MINOR: threads: move declaration of capabilities to config.h In commit f161d0f51 ("BUG/MINOR: pools/threads: don't ignore DEBUG_UAF on double-word CAS capable archs") I moved some defines and accidentally messed up lockfree pools. The problem is that the HA_HAVE_CAS_DW macro is not defined anymore where the CONFIG_HAP_LOCKLESS_POOLS macro is set, so this fix implicitly disabled lockfree pools. This patch fixes this by moving the capabilities definition to config.h (probably that we'd benefit from having an "arch.h" file to declare the capabilities offered by the architecture). In a test on a 12-core machine, we used to measure 19s spent in the pool lock for 1M requests without this patch, and 0 with it so that's definitely a net saving. No backport is required, this is only for 1.9.	2018-10-10 18:29:23 +02:00
Emeric Brun	7ad43e7928	BUG/MEDIUM: Cur/CumSslConns counters not threadsafe. CurSslConns inc/dec operations are not threadsafe. The unsigned CurSslConns counter can wrap to a negative value. So we could notice connection rejects because of MaxSslConns limit artificially exceeded. CumSslConns inc operations are also not threadsafe so we could miss some connections and show inconsistent values compared to CumConns. This fix should be backported to v1.8.	2018-10-10 18:05:33 +02:00
Willy Tarreau	0b25d5e99f	MEDIUM: task: perform a single tree lookup per run queue batch The run queue is designed to perform a single tree lookup and to use multiple passes to eb32sc_next(). The scheduler rework took a conservative approach first but this is not needed anymore and it increases the processing cost of process_runnable_tasks() and even the time during which the RQ lock is held if the global queue is heavily loaded. Let's simply move the initial lookup to the entry of the loop like the previous scheduler used to do. This has reduced by a factor of 5.5 the number of calls to eb32sc_lookup_get() there.	2018-10-10 16:42:46 +02:00
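
Several entries above (98d334bd94, 9504dd64c6, 87d54a9a6d) rely on testing whether a thread mask has more than one bit set without counting every bit. The following is a minimal sketch of that idea, not HAProxy's actual code: the function names `mask_has_at_least_two_bits()` and `popcount_more_than_one()` are hypothetical and chosen only for illustration.

```c
/* Illustrative sketch only: hypothetical names, not HAProxy's implementation. */

/* Returns 1 if more than one bit is set in mask, 0 otherwise.
 * mask & (mask - 1) clears the lowest set bit, so any remaining bit
 * means at least two bits were set. A zero mask yields 0.
 */
static inline int mask_has_at_least_two_bits(unsigned long mask)
{
    return (mask & (mask - 1)) != 0;
}

/* Equivalent but slower check that counts all set bits first,
 * shown for comparison with the popcount-based approach.
 */
static inline int popcount_more_than_one(unsigned long mask)
{
    int count = 0;

    while (mask) {
        mask &= mask - 1; /* drop the lowest set bit */
        count++;
    }
    return count > 1;
}
```

Both functions return the same result for any mask; the first needs a single subtraction and AND, which is presumably why a dedicated helper is attractive on hot paths such as deciding whether an FD is shared by several threads.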

8107 Commits