haproxy

Author	SHA1	Message	Date
Willy Tarreau	387bd4f69f	CLEANUP: global: introduce variable pid_bit to avoid shifts with relative_pid At a number of places, bitmasks are used for process affinity and to map listeners to processes. Every time 1UL<<(relative_pid-1) is used. Let's create a "pid_bit" variable corresponding to this value to clean this up.	2017-11-10 19:08:14 +01:00
Willy Tarreau	9a398beac3	BUG/MEDIUM: stream: don't ignore res.analyse_exp anymore It happens that no single analyser has ever needed to set res.analyse_exp, so that process_stream() didn't consider it when computing the next task expiration date. Since Lua actions were introduced in 1.6, this can be needed on http-response actions for example, so let's ensure it's properly handled. Thanks to Nick Dimov for reporting this bug. The fix needs to be backported to 1.7 and 1.6.	2017-11-10 17:14:23 +01:00
Willy Tarreau	5d9846f4b3	MINOR: cli: make "show fd" report the fd's thread mask This is useful to know what thread(s) an fd is scheduled to be handled on. It's worth noting that at the moment "show fd" doesn't seem totally thread-safe.	2017-11-10 16:53:09 +01:00
Willy Tarreau	28b55c6fed	CLEANUP: mux: remove the unused "release()" function In commit 53a4766 ("MEDIUM: connection: start to introduce a mux layer between xprt and data") we introduced a release() function which ends up never being used. Let's get rid of it now.	2017-11-10 16:43:05 +01:00
Willy Tarreau	7ce3f09513	BUG/MEDIUM: threads/cli: fix "show sess" locking on release The recent thread updates on the CLI broke "show sess" by unlocking the stream twice instead of lock+unlock. No backport is needed.	2017-11-10 16:24:41 +01:00
Willy Tarreau	22cf59bbba	BUG/MEDIUM: h2: support orphaned streams When a stream_interface performs a shutw() then a shutr(), the stream is marked closed. Then cs_destroy() calls h2_detach() and it cannot fail since we're on the leaving path of the caller. The problem is that in order to close streams we usually have to send either an empty DATA frame with the ES flag set or an RST_STREAM frame, and the mux buffer might already be full, forcing the stream to be queued. The forced removal of this stream causes this last message to silently disappear, and the client to wait forever for a response. This commit ensures we can detach the conn_stream from the h2 stream if the stream is blocked, effectively making the h2 stream an orphan, ensures that the mux can deal with orphaned streams after processing them, and that the demux can kill them upon receipt of GOAWAY.	2017-11-10 11:48:15 +01:00
Willy Tarreau	aa39860aef	MINOR: tools: don't use unlikely() in hex2i() This small inline function causes some pain to the compiler when used inside other functions due to its use of the unlikely() hint for non-digits. It causes the letters to be processed far away in the calling function and makes the code less efficient. Removing these unlikely() hints has increased the chunk size parsing by around 5%.	2017-11-10 11:19:54 +01:00
Willy Tarreau	b15e3fefc9	BUG/MEDIUM: h1: ensure the chunk size parser can deal with full buffers The HTTP/1 code always has the reserve left available so the buffer is never full there. But with HTTP/2 we have to deal with full buffers, and it happens that the chunk size parser cannot tell the difference between a full buffer and an empty one since it compares the start and the stop pointer. Let's change this to instead deal with the number of bytes left to process. As a side effect, this code ends up being about 10% faster than the previous one, even on HTTP/1.	2017-11-10 11:17:08 +01:00
Willy Tarreau	8c0ea7d21a	BUG/MEDIUM: h2: split the function to send RST_STREAM There is an issue with how the RST_STREAM frames are sent. Some of them are sent from the demux, either for valid or for closed streams, and some are sent from the mux always for valid streams. At the moment the demux stream ID is used, which is wrong for all streams being muxed, and sometimes results in certain bad HTTP responses causing the emission of an RST_STREAM referencing stream zero. In addition, the stream's blocked flags could be updated even if the stream was the closed or idle ones. We really need to split the function for the two distinct use cases where one is used to send an RST on a condition detected at the connection level (such as a closed stream) and the other one is used to send an RST for a condition detected at the stream level. The first one is used only in the demux, and the other one only by a valid stream.	2017-11-10 10:05:24 +01:00
Christopher Faulet	09fdf4b112	BUG/MINOR: pattern: Rely on the sample type to copy it in pattern_exec_match To be thread safe, the function pattern_exec_match copies data (the pattern and the inner sample) into thread-local variables. But when the sample is duplicated, we must check its type and not the pattern's. This is specific to threads, no backport is needed.	2017-11-09 17:19:20 +01:00
Christopher Faulet	c5a9d5bf23	BUG/MEDIUM: stream-int: Don't loss write's notifs when a stream is woken up When write activity is reported on a channel, it is important to keep this information for the stream because it takes part in triggering the analyzers. When some data are written, the flag CF_WRITE_PARTIAL is set. It participates in the task's timeout updates and in waking the stream. It is also used in the CF_MASK_ANALYSER mask to trigger channel analyzers. In the past, it was cleared by process_stream. Because of a bug (fixed in commit 95fad5ba4 ["BUG/MAJOR: stream-int: don't re-arm recv if send fails"]), it is now cleared before each send and in stream_int_notify. So it is possible to lose this information when process_stream is called, preventing analyzers from being called, and possibly leading to a stalled stream. Today, this happens in HTTP2 when you call the stat page or when you use the cache filter. In fact, this happens when the response is sent by an applet. In HTTP1, everything seems to work as expected. To fix the problem, we need to distinguish between the write activity reported to lower layers and the one reported to the stream. So the flag CF_WRITE_EVENT has been added to notify the stream of the write activity on a channel. It is set when a send succeeded and reset by process_stream. It is also used in CF_MASK_ANALYSER. Finally, it is checked in stream_int_notify to wake up a stream and in channel_check_timeouts. This bug is probably present in 1.7 but it seems to have no effect. So for now, there is no need to backport it.	2017-11-09 15:16:05 +01:00
Willy Tarreau	a87f202b49	BUG/MEDIUM: h2: reject non-3-digit status codes If the H1 parser would report a status code length not consisting in exactly 3 digits, the error case was confused with a lack of buffer room and was causing the parser to loop infinitely.	2017-11-09 11:23:00 +01:00
Willy Tarreau	1b4cf9b754	BUG/MINOR: h1: the HTTP/1 make status code parser check for digits The H1 parser used by the H2 gateway was a bit lax and could validate non-numbers in the status code. Since it computes the code on the fly it's problematic, as "30:" is read as status code 310. Let's properly check that it's a number now. No backport needed.	2017-11-09 11:15:45 +01:00
Willy Tarreau	ddfbd83780	BUILD: shctx: do not depend on openssl anymore The build breaks on a machine without openssl/crypto.h because shctx still loads openssl-compat.h while it doesn't need it anymore since the code was moved : In file included from src/shctx.c:20:0: include/proto/openssl-compat.h:3:28: fatal error: openssl/crypto.h: No such file or directory #include <openssl/crypto.h> Just remove include openssl-compat from shctx.	2017-11-08 14:33:36 +01:00
Willy Tarreau	46c9d3e6cb	BUILD: ssl: fix build of backend without ssl Commit 522eea7 ("MINOR: ssl: Handle sending early data to server.") added a dependency on SRV_SSL_O_EARLY_DATA which only exists when USE_OPENSSL is defined (which is probably not the best solution) and breaks the build when ssl is not enabled. Just add an ifdef USE_OPENSSL around the block for now.	2017-11-08 14:28:08 +01:00
Olivier Houchard	522eea7110	MINOR: ssl: Handle sending early data to server. This adds a new keyword on the "server" line, "allow-0rtt", if set, we'll try to send early data to the server, as long as the client sent early data, as in case the server rejects the early data, we no longer have them, and can't resend them, so the only option we have is to send back a 425, and we need to be sure the client knows how to interpret it correctly.	2017-11-08 14:11:10 +01:00
Olivier Houchard	cfdef2e312	MINOR: ssl: Spell 0x10101000L correctly. Issue added in 1.8-dev by c2aae74 ("MEDIUM: ssl: Handle early data with OpenSSL 1.1.1"), no impact on older versions.	2017-11-08 14:10:02 +01:00
Olivier Houchard	bd84ac8737	MINOR: ssl: Handle session resumption with TLS 1.3 With TLS 1.3, sessions aren't established until after the main handshake has completed. So we can't just rely on calling SSL_get1_session(). Instead, we now register a callback for the "new session" event. This should work for previous versions of TLS as well.	2017-11-08 14:08:07 +01:00
Olivier Houchard	35a63cc1c7	BUG/MINOR; ssl: Don't assume we have a ssl_bind_conf because a SNI is matched. We only have a ssl_bind_conf if crt-list is used, however we can still match a certificate SNI, so don't assume we have a ssl_bind_conf.	2017-11-08 14:08:07 +01:00
Willy Tarreau	9e45b33f7e	BUG/MAJOR: threads/tasks: fix the scheduler again My recent change in commit ce4e0aa ("MEDIUM: task: change the construction of the loop in process_runnable_tasks()") was bogus as it used to keep the rq_next across an unlock/lock sequence, occasionally leading to crashes for tasks that are eligible to any thread. We must use the lookup call for each new batch instead. The problem is easily triggered with such a configuration : global nbthread 4 listen check mode http bind 0.0.0.0:8080 redirect location / option httpchk GET / server s1 127.0.0.1:8080 check inter 1 server s2 127.0.0.1:8080 check inter 1 Thanks to Olivier for diagnosing this one. No backport is needed.	2017-11-08 14:05:19 +01:00
Willy Tarreau	ecd2e15919	BUG/MINOR: stream-int: don't set MSG_MORE on closed request path Commit 4ac4928 ("BUG/MINOR: stream-int: don't set MSG_MORE on SHUTW_NOW without AUTO_CLOSE") was incomplete. H2 reveals another situation where the input stream is marked closed with the request and we set MSG_MORE, causing a delay before the request leaves. Better avoid setting the flag on the request path for close cases in general.	2017-11-07 15:07:25 +01:00
Emeric Brun	11f5886e5c	BUG/MINOR: comp: fix compilation warning compiling without compression. This is specific to threads, no backport is needed.	2017-11-07 14:48:13 +01:00
Emeric Brun	d8b3b65faa	BUG/MEDIUM: splice/threads: pipe reuse list was not protected. The list is now protected using a global spinlock.	2017-11-07 14:47:28 +01:00
Willy Tarreau	926fa4c098	BUG/MINOR: h2: don't send GOAWAY on failed response As part of the detection for intentional closes, we can kill the connection if a shutw() happens before the headers. But it can also happen that an invalid response is not properly parsed, preventing any headers frame from being sent and making the function believe it was an abort. Now instead we check if any response was received from the stream, regardless of the fact that it was properly converted.	2017-11-07 14:47:04 +01:00
Willy Tarreau	c4312d3dfd	MINOR: h2: add new stream flag H2_SF_OUTGOING_DATA This one indicates whether we've received data to mux out. It helps make the difference between a clean close and an erroneous one.	2017-11-07 14:47:04 +01:00
Willy Tarreau	58e3208714	BUG/MINOR: h2: correctly check for H2_SF_ES_SENT before closing In h2_shutw() we must not send another empty frame (nor RST) after one has been sent, as the stream is already in HLOC/CLOSED state.	2017-11-07 14:47:04 +01:00
Willy Tarreau	6d8b682f9a	BUG/MEDIUM: h2: properly set H2_SF_ES_SENT when sending the final frame When sending DATA+ES, it's important to set H2_SF_ES_SENT as we don't want to emit it several times nor to send an RST afterwards.	2017-11-07 14:47:04 +01:00
Willy Tarreau	e6ae77f64f	MINOR: h2: don't re-enable the connection's task when we're closing It's pointless to requeue the task when we're closing, so swap the order of the task_queue() and h2_release(). It also matches what was written in the comment regarding re-arming the timer.	2017-11-07 14:47:04 +01:00
Willy Tarreau	83906c2f91	BUG/MEDIUM: h2: don't close the connection is there are data left h2_detach() is called after a stream was closed, and it evaluates if it's worth closing the connection. The issue there is that the connection is closed too early in case there's demand for closing after the last stream, even if some data remain in the mux. Let's change the condition to check for this.	2017-11-07 14:47:04 +01:00
Christopher Faulet	f5d79ac5a8	BUILD: enable USE_THREAD for Solaris build.	2017-11-07 11:10:35 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This removes any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Willy Tarreau	7d8e4af46a	BUG/MEDIUM: h2: fix some wrong error codes on connections When the assignment of the connection state was moved into h2c_error(), 3 of them were missed because they were wrong, using H2_SS_ERROR instead. This resulted in the connection's state being set to H2_CS_ERROR2 in fact, so the error was not properly sent.	2017-11-07 11:08:28 +01:00
Willy Tarreau	721c974e5e	MEDIUM: h2: remove the H2_SS_RESET intermediate state This one was created to maintain the knowledge that a stream was closed after having sent an RST_STREAM frame but that's not needed anymore and it confuses certain conditions on the error processing path. It's time to get rid of it.	2017-11-07 11:05:42 +01:00
Willy Tarreau	319994a2e9	BUG/MEDIUM: h2: don't try (and fail) to send non-existing data in the mux The call to xprt->snd_buf() was not conditioned on the presence of data in the buffer, resulting in snd_buf() returning 0 and never disabling the polling. It was revealed by the previous bug on error processing but must properly be handled.	2017-11-07 11:03:56 +01:00
Willy Tarreau	3eabe9b174	BUG/MEDIUM: h2: properly send the GOAWAY frame in the mux A typo on a condition prevented H2_CS_ERROR from being processed, leading to an infinite loop on connection error.	2017-11-07 11:03:01 +01:00
Willy Tarreau	c6795ca7c1	BUG/MEDIUM: h2: properly send an RST_STREAM on mux stream error Some stream errors are detected on the MUX path (eg: H1 response encoding). These ones forgot to emit an RST_STREAM frame, causing the client to wait and/or to see the connection being immediately closed. This is now fixed.	2017-11-07 09:43:06 +01:00
Willy Tarreau	6743420778	BUG/MINOR: h2: set the "HEADERS_SENT" flag on stream, not connection This flag was added after the GOAWAY flags were introduced and mistakenly placed in the connection, but that doesn't make sense as it's specific to the stream. The main impact is the risk of returning a DATA0+ES frame for an error instead of an RST_STREAM.	2017-11-06 20:20:51 +01:00
Daniel Schneller	d06f31c443	DOC: Add note about encrypted password CPU usage From first-hand experience I realized that using encrypted passwords in userlists can quickly become overwhelming for busy sites. In my case just about 100 rq/s were enough to drive (user) CPU usage from 2-3% up to >90%. While it is perfectly explicable why this is the case, having it mentioned in the relevant documentation section might spare someone some confusion in the future.	2017-11-06 18:37:35 +01:00
Olivier Houchard	283810773a	BUG/MINOR: dns: Don't lock the server lock in snr_check_ip_callback(). snr_check_ip_callback() may be called with the server lock, so don't attempt to lock it again, instead, make sure the callers always have the lock before calling it.	2017-11-06 18:34:42 +01:00
Olivier Houchard	55dcdf4c39	BUG/MINOR: dns: Don't try to get the server lock if it's already held. dns_link_resolution() can be called with the server lock already held, so don't attempt to lock it again in that case.	2017-11-06 18:34:24 +01:00
Willy Tarreau	f0c531ab55	MEDIUM: tasks: implement a lockless scheduler for single-thread usage The scheduler is complex and uses local queues to amortize the cost of locks. But all this comes with a cost that is quite observable with single-thread workloads. The purpose of this patch is to reimplement the much simpler scheduler for the case where threads are not used. The code is very small and simple. It doesn't impact the multi-threaded performance at all, and provides a nice 10% performance increase in single-thread by reaching 606kreq/s on the tests that showed 550kreq/s before.	2017-11-06 11:20:11 +01:00
Willy Tarreau	9d4b56b88e	MINOR: tasks: only visit filled task slots after processing them process_runnable_tasks() needs to requeue or wake up tasks after processing them in batches. By only refilling the existing ones, we avoid revisiting all the queue. The performance gain is measurable starting with two threads, where the request rate climbs to 657k/s compared to 644k.	2017-11-06 11:20:11 +01:00
Willy Tarreau	88ac59be4d	MINOR: threads: use faster locks for the spin locks The spin locks used to rely on W locks, which involve a loop waiting for readers to leave, and this doesn't happen here. It's more efficient to use S locks instead, which are also mutually exclusive and do not have this loop. This saves one test per spinlock and a few tens of bytes allowing certain functions to be inlined.	2017-11-06 11:20:11 +01:00
Willy Tarreau	ce4e0aa7f3	MEDIUM: task: change the construction of the loop in process_runnable_tasks() This patch slightly rearranges the loop to pack the locked code a little bit, and to try to concentrate accesses to the tree together to benefit more from the cache. It also fixes how the loop handles the right margin : now that it is guaranteed that the retrieved nodes are filtered to only match the current thread, we don't need to rewind every 16 entries. Instead we can rewind each time we reach the right margin again. With this change, we now achieve the following performance for 10 H2 conns each containing 100 streams : 1 thread : 550kreq/s 2 thread : 644kreq/s 3 thread : 598kreq/s	2017-11-06 11:20:11 +01:00
Willy Tarreau	b992ba16ef	MINOR: task: simplify wake_expired_tasks() to avoid unlocking in the loop This function is sensitive, let's make it shorter by factoring out the unlock and leave code. This reduced the function's size by a few tens of bytes and increased the overall performance by about 1%.	2017-11-06 11:20:11 +01:00
Willy Tarreau	8d38805d3d	MAJOR: task: make use of the scope-aware ebtree functions Currently the task scheduler suffers from an O(n) lookup when skipping tasks that are not for the current thread. The reason is that eb32_lookup_ge() has no information about the current thread so it always revisits many tasks for other threads before finding its own tasks. This is particularly visible with HTTP/2 since the number of concurrent streams created at once causes long series of tasks for the same stream in the scheduler. With only 10 connections and 100 streams each, by running on two threads, the performance drops from 640kreq/s to 11.2kreq/s! Lookup metrics show that for only 200000 task lookups, 430 million skips had to be performed, which means that on average, each lookup leads to 2150 nodes to be visited. This commit backports the principle of scope lookups for ebtrees from the ebtree_v7 development tree. The idea is that each node contains a mask indicating the union of the scopes for the nodes below it, which is fed during insertion, and used during lookups. Then during lookups, branches that do not contain any leaf matching the requested scope are simply ignored. This perfectly matches a thread mask, allowing a thread to only extract the tasks it cares about from the run queue, and to always find them in O(log(n)) instead of O(n). Thus the scheduler uses tid_bit and task->thread_mask as the ebtree scope here. Doing this has recovered most of the performance, as can be seen on the test below with two threads, 10 connections, 100 streams each, and 1 million requests total : Before After Gain test duration : 89.6s 4.73s x19 HTTP requests/s (DEBUG) : 11200 211300 x19 HTTP requests/s (PROD) : 15900 447000 x28 spin_lock time : 85.2s 0.46s /185 time per lookup : 13us 40ns /325 Even when going to 6 threads (on 3 hyperthreaded CPU cores), the performance stays around 284000 req/s, showing that the contention is much lower. 
A test showed that there's no benefit in using this for the wait queue though.	2017-11-06 11:20:11 +01:00
Willy Tarreau	8878b6c4cb	MINOR: ebtree: implement eb32sc_lookup_ge_or_first() In the scheduler we always have to loop back to the beginning after we don't find the last entry, so let's implement this in a new lookup function instead. The resulting code is slightly faster, mostly due to the fact that there's much less inlined code in the fast path.	2017-11-06 11:20:11 +01:00
Willy Tarreau	d1d55ac619	MEDIUM: ebtree: only consider the branches matching the scope in lookups Now when looking up a node via eb32sc_first(), eb32sc_next(), and eb32sc_lookup_ge(), we only focus on the branches matching the requested scope. The code must be careful to miss no branch. It changes a little bit from the previous one because the scope stored on the intermediary nodes is not exact (since we don't propagate upwards during deletion), so in case a lookup fails, we have to walk up and pick the next matching entry.	2017-11-06 11:20:11 +01:00
Willy Tarreau	ef8d0dcefd	MINOR: ebtree: update the eb32sc parent node's scope on delete During a delete operation, if the deleted node is above its leaf's parent, this parent will replace the node and then go up. In this case it is important to update the new parent's scope to reflect the presence of other branches. It's worth noting that in theory we should precisely recompute the exact node value, but it seems that it's not worth it for the rare cases there is a mismatch.	2017-11-06 11:20:11 +01:00
Willy Tarreau	5d19fd479a	MEDIUM: ebtree: specify the scope of every node inserted via eb32sc Here we mark each visited node with the scope bits of the node being inserted. This will allow the lookup to skip certain non-interesting nodes.	2017-11-06 11:20:11 +01:00
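
Commit 1b4cf9b754 above notes that a status code computed on the fly without a digit check reads "30:" as 310. The following is a minimal sketch of why that happens, not HAProxy's actual parser: the function names parse_status_lax() and parse_status_strict() are hypothetical, and the "exactly 3 digits" rule in the strict version is taken from the description in commit a87f202b49.

```c
#include <stddef.h>

/* Hypothetical lax parser: accumulates (c - '0') for every byte
 * without checking that c is a digit. ':' is 0x3A and '0' is 0x30,
 * so ':' contributes 10 and "30:" yields 3*100 + 0*10 + 10 = 310.
 */
static int parse_status_lax(const char *s, size_t len)
{
    int code = 0;

    for (size_t i = 0; i < len; i++)
        code = code * 10 + (s[i] - '0');
    return code;
}

/* Hypothetical strict parser: accepts exactly 3 ASCII digits and
 * returns -1 for anything else, so the caller can report a parsing
 * error instead of mistaking it for a lack of buffer room.
 */
static int parse_status_strict(const char *s, size_t len)
{
    int code = 0;

    if (len != 3)
        return -1;
    for (size_t i = 0; i < len; i++) {
        if (s[i] < '0' || s[i] > '9')
            return -1;
        code = code * 10 + (s[i] - '0');
    }
    return code;
}
```

With these definitions, parse_status_lax("30:", 3) returns 310, whereas parse_status_strict("30:", 3) returns -1 and parse_status_strict("200", 3) returns 200.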

7211 Commits