haproxy

Author	SHA1	Message	Date
Fr�d�ric L�caille	12a0317fed	MINOR: quic: Add "no-quic" global option Add "no-quic" to "global" section to disable the use of QUIC transport protocol by all configured QUIC listeners. This is listeners with QUIC addresses on their "bind" lines. Internally, the socket addresses binding is skipped by protocol_bind_all() for receivers with <proto_quic4> or <proto_quic6> as protocol (see protocol struct). Add information about "no-quic" global option to the documentation. Must be backported to 2.7.	2023-01-17 16:35:20 +01:00
Willy Tarreau	e77f4306ba	BUG/MEDIUM: stconn: also consider SE_FL_EOI to switch to SE_FL_ERROR In se_fl_set_error() we used to switch to SE_FL_ERROR only when there is already SE_FL_EOS indicating that the read side is closed. But that is not sufficient, we need to consider all cases where no more reads will be performed on the connection, and as such also include SE_FL_EOI. Without this, some aborted connections during a transfer sometimes only stop after the timeout, because the ERR_PENDING is never promoted to ERROR. This must be backported to 2.7 and requires previous patch "CLEANUP: stconn: always use se_fl_set_error() to set the pending error".	2023-01-17 16:27:35 +01:00
Christopher Faulet	2e47e3a1cf	MINOR: htx: Add an HTX value for the extra field is payload length is unknown When the payload length cannot be determined, the htx extra field is set to the magical vlaue ULLONG_MAX. It is not obvious. This a dedicated HTX value is now used. Now, HTX_UNKOWN_PAYLOAD_LENGTH must be used in this case, instead of ULLONG_MAX.	2023-01-13 11:51:11 +01:00
Christopher Faulet	4da82395d8	CLEANUP: http-ana: Remove HTTP_MSG_ERROR state This state is now unused. Thus it can be removed.	2023-01-13 11:22:13 +01:00
Christopher Faulet	71236dedb9	MINOR: http-ana: Add a function to set HTTP termination flags There is already a function to set termination flags but it is not well suited for HTTP streams. So a function, dedicated to the HTTP analysis, was added. This way, this new function will be called for HTTP analysers on error. And if the error is not caugth at this stage, the generic function will still be called from process_stream(). Here, by default a PRXCOND error is reported and depending on the stream state, the reson will be set accordingly: * If the backend SC is in INI state, SF_FINST_T is reported on tarpit and SF_FINST_R otherwise. * SF_FINST_Q is the server connection is queued * SF_FINST_C in any connection attempt state (REQ/TAR/ASS/CONN/CER/RDY). Except for applets, a SF_FINST_R is reported. * Once the server connection is established, SF_FINST_H is reported while HTTP_MSG_DATA state on the response side. * SF_FINST_L is reported if the response is in HTTP_MSG_DONE state or higher and a client error/timeout was reported. * Otherwise SF_FINST_D is reported.	2023-01-13 09:45:23 +01:00
Willy Tarreau	6be8d09a61	OPTIM: global: move byte counts out of global and per-thread During multiple tests we've already noticed that shared stats counters have become a real bottleneck under large thread counts. With QUIC it's pretty visible, with qc_snd_buf() taking 2.5% of the CPU on a 48-thread machine at only 25 Gbps, and this CPU is entirely spent in the atomic increment of the byte count and byte rate. It's also visible in H1/H2 but slightly less since we're working with larger buffers, hence less frequent updates. These counters are exclusively used to report the byte count in "show info" and the byte rate in the stats. Let's move them to the thread_ctx struct and make the stats reader just collect each thread's stats when requested. That's way more efficient than competing on a single cache line. After this, qc_snd_buf has totally disappeared from the perf profile and tests made in h1 show roughly 1% performance increase on small objects.	2023-01-12 16:37:45 +01:00
Amaury Denoyelle	0a1154afb5	MINOR: mux-quic: use send-list for STOP_SENDING/RESET_STREAM emission When a STOP_SENDING or RESET_STREAM must be send, its corresponding qcs is inserted into <qcc.send_list> via qcc_reset_stream() or qcc_abort_stream_read(). This allows to remove the iteration on full qcs tree in qc_send(). Instead, STOP_SENDING and RESET_STREAM is done in the loop over <qcc.send_list> as with STREAM frames. This should improve slightly the performance, most notably when large number of streams are opened. This must be backported up to 2.7.	2023-01-10 17:49:50 +01:00
Amaury Denoyelle	f9b03265f0	MEDIUM: h3: send SETTINGS before STREAM frames Complete qcc_send_stream() function to allow to specify if the stream should be handled in priority. Internally this will insert the qcs instance in front of <qcc.send_list> to be able to treat it before other streams. This functionality is useful when some QUIC streams should be sent before others. Most notably, this is used to guarantee that H3 SETTINGS is done first via the control stream. This must be backported up to 2.7.	2023-01-10 17:49:50 +01:00
Amaury Denoyelle	20f2a425ff	MAJOR: mux-quic: rework stream sending priorization Implement a mechanism to register streams ready to send data in new STREAM frames. Internally, this is implemented with a new list <qcc.send_list> which contains qcs instances. A qcs can be registered safely using the new function qcc_send_stream(). This is done automatically in qc_send_buf() which covers most cases. Also, application layer is free to use it for internal usage streams. This is currently the case for H3 control stream with SETTINGS sending. The main point of this patch is to handle stream sending fairly. This is in stark contrast with previous code where streams with lower ID were always prioritized. This could cause other streams to be indefinitely blocked behind a stream which has a lot of data to transfer. Now, streams are handled in an order scheduled by se_desc layer. This commit is the first one of a serie which will bring other improvments which also relied on the send_list implementation. This must be backported up to 2.7 when deemed sufficiently stable.	2023-01-10 17:49:50 +01:00
Christopher Faulet	da89e9b95b	MINOR: channel/applets: Stop to test CF_WRITE_ERROR flag if CF_SHUTW is enough In applets, we stop processing when a write error (CF_WRITE_ERROR) or a shutdown for writes (CF_SHUTW) is detected. However, any write error leads to an immediate shutdown for writes. Thus, it is enough to only test if CF_SHUTW is set.	2023-01-09 18:41:08 +01:00
Christopher Faulet	4b490b7517	MINOR: channel: Stop to test CF_READ_ERROR flag if CF_SHUTR is enough When a read error (CF_READ_ERROR) is reported, a shutdown for reads is always performed (CF_SHUTR). Thus, there is no reason to check if CF_READ_ERROR is set if CF_SHUTR is also checked.	2023-01-09 18:41:08 +01:00
Christopher Faulet	2357718217	MEDIUM: channel: Remove CF_READ_ATTACHED and report CF_READ_EVENT instead CF_READ_ATTACHED flag is only used in input events for stream analyzers, CF_MASK_ANALYSER. A read event can be reported instead and this flag can be removed. We must only take care to report a read event when the client connection is upgraded from TCP to HTTP.	2023-01-09 18:41:08 +01:00
Christopher Faulet	049fbcd36a	MINOR: channel: Remove CF_ANA_TIMEOUT and report CF_READ_EVENT instead It appears CF_ANA_TIMEOUT is flag only used in CF_MASK_ANALYSER. All analyzer timeout relies on the analysis expiration date (chn->analyse_exp). Worst, once set, this flag is never removed. Thus this flag can be removed and replaced by a read event (CF_READ_EVENT).	2023-01-09 18:41:08 +01:00
Christopher Faulet	a63f8f379f	MINOR: channel: Remove CF_WRITE_ACTIVITY Thanks to previous changes, CF_WRITE_ACTIVITY flags can be removed. Everywhere it was used, its value is now directly used (CF_WRITE_EVENT\|CF_WRITE_ERROR).	2023-01-09 18:41:08 +01:00
Christopher Faulet	33e03cec5f	MINOR: channel: Remove CF_READ_ACTIVITY Thanks to previous changes, CF_READ_ACTIVITY flags can be removed. Everywhere it was used, its value is now directly used (CF_READ_EVENT\|CF_READ_ERROR).	2023-01-09 18:41:08 +01:00
Christopher Faulet	d898841530	MEDIUM: channel: Use CF_WRITE_EVENT instead of CF_WRITE_PARTIAL Just like CF_READ_PARTIAL, CF_WRITE_PARTIAL is now merged with CF_WRITE_EVENT. There a subtlety in sc_notify(). The "connect" event (formely CF_WRITE_NULL) is now detected with (CF_WRITE_EVENT + sc->state < SC_ST_EST).	2023-01-09 18:41:08 +01:00
Christopher Faulet	285f7616ee	MEDIUM: channel: Use CF_READ_EVENT instead of CF_READ_PARTIAL CF_READ_PARTIAL flag is now merged with CF_READ_EVENT. It means CF_READ_EVENT is set when a read0 is received (formely CF_READ_NULL) or when data are received (formely CF_READ_ACTIVITY). There is nothing special here, except conditions to wake the stream up in sc_notify(). Indeed, the test was a bit changed to reflect recent change. read0 event is now formalized by (CF_READ_EVENT + CF_SHUTR).	2023-01-09 18:41:08 +01:00
Christopher Faulet	b96f2aa380	REORG: channel: Rename CF_WRITE_NULL to CF_WRITE_EVENT As for CF_READ_NULL, it appears CF_WRITE_NULL and other write events on a channel are mainly used to wake up the stream and may be replace by on write event. In this patch, we introduce CF_WRITE_EVENT flag as a replacement to CF_WRITE_EVENT_NULL. There is no breaking change for now, it is just a rename. Gradually, other write events will be merged with this one.	2023-01-09 18:41:08 +01:00
Christopher Faulet	6e1bbc446b	REORG: channel: Rename CF_READ_NULL to CF_READ_EVENT CF_READ_NULL flag is not really useful and used. It is a transient event used to wakeup the stream. As we will see, all read events on a channel may be resumed to only one and are all used to wake up the stream. In this patch, we introduce CF_READ_EVENT flag as a replacement to CF_READ_NULL. There is no breaking change for now, it is just a rename. Gradually, other read events will be merged with this one.	2023-01-09 18:41:08 +01:00
Willy Tarreau	5a72d03a58	MINOR: stick-table: implement the sc-add-gpc() action This action increments the General Purpose Counter at the index <idx> of the array associated to the sticky counter designated by <sc-id> by the value of either integer <int> or the integer evaluation of expression <expr>. Integers and expressions are limited to unsigned 32-bit values. If an error occurs, this action silently fails and the actions evaluation continues. <idx> is an integer between 0 and 99 and <sc-id> is an integer between 0 and 2. It also silently fails if the there is no GPC stored at this index. The entry in the table is refreshed even if the value is zero. The 'gpc_rate' is automatically adjusted to reflect the average growth rate of the gpc value. The main use of this action is to count scores or total volumes (e.g. estimated danger per source IP reported by the server or a WAF, total uploaded bytes, etc).	2023-01-07 09:11:22 +01:00
Willy Tarreau	6c0117168e	MEDIUM: stick-table: set the track-sc limit at boottime via tune.stick-counters The number of stick-counter entries usable by track-sc rules is currently set at build time. There is no good value for this since the vast majority of users don't need any, most need only a few and rare users need more. Adding more counters for everyone increases memory and CPU usages for no reason. This patch moves the per-session and per-stream arrays to a pool of a size defined at boot time. This way it becomes possible to set the number of entries at boot time via a new global setting "tune.stick-counters" that sets the limit for the whole process. When not set, the MAX_SESS_STR_CTR value still applies, or 3 if not set, as before. It is also possible to lower the value to 0 to save a bit of memory if not used at all. Note that a few low-level sample-fetch functions had to be protected due to the ability to use sample-fetches in the global section to set some variables.	2023-01-06 18:08:49 +01:00
Christopher Faulet	61aded057d	BUG/MAJOR: buf: Fix copy of wrapping output data when a buffer is realigned There is a bug in b_slow_realign() function when wrapping output data are copied in the swap buffer. block1 and block2 sizes are inverted. Thus blocks with a wrong size are copied. It leads to data mixin if the first block is in reality larger than the second one or to a copy of data outside the buffer is the first block is smaller than the second one. The bug was introduced when the buffer API was refactored in 1.9. It was found by a code review and seems never to have been triggered in almost 5 years. However, we cannot exclude it is responsible of some unresolved bugs. This patch should fix issue #1978. It must be backported as far as 2.0.	2023-01-05 09:34:49 +01:00
Willy Tarreau	6e70a3986c	BUILD: makefile: only consider settings from enabled options Due to the previous SSL exception we coudln't restrict the collected CFLAGS/LDFLAGS to those of enabled options, so all of them were considered if set. The problem is that it would prevent simply disabling a build option without unsetting its xxx_CFLAGS or _LDFLAGS values if those had incompatible values (e.g. -lfoo). Now that only existing options are listed in collect_opts_flags, we can safely check that the option is set and only consider its settings in this case. Thus OT_LDFLAGS will not be used if USE_OT is not set for example.	2022-12-23 17:01:55 +01:00
Willy Tarreau	6a2cd33509	BUILD: makefile: remove the special case of the SSL option By creating USE_SSL and enabling it when USE_OPENSSL is set, we can get rid of the special case that was made with it regarding cflags collect and when resetting options. The option doesn't need to be manually set, though in the future it might prove useful if other non-openssl API are supported.	2022-12-23 16:53:35 +01:00
Willy Tarreau	2b8d0978f3	BUILD: makefile: make all OpenSSL variants use the same settings It's getting complicated to configure includes and lib dirs for OpenSSL API variants such as WolfSSL, because some settings are common and others are specific but carry a prefix that doesn't match the USE_* rule scheme. This patch simplifies everything by considering that all SSL libs will use SSL_INC, SSL_LIB, SSL_CFLAGS and SSL_LDFLAGS. That's much more convenient. This works thanks to the settings collector which explicitly checks the SSL_* settings. When USE_OPENSSL_WOLFSSL is set, then USE_OPENSSL is implied, so that there's no need to duplicate maintenance effort.	2022-12-23 16:53:35 +01:00
Willy Tarreau	8fa2f49f24	BUILD: makefile: add a function to collect all options' CFLAGS/LDFLAGS The new function collect_opts_flags now scans all USE_* options defined in use_opts and appends the corresponding _CFLAGS and _LDFLAGS to OPTIONS_{C,LD}FLAGS respectively. This will be useful to get rid of all the individual concatenations to these variables.	2022-12-23 16:53:35 +01:00
Willy Tarreau	b14e89e322	BUILD: makefile: initialize all build options' variables at once A lot of _SRC, _INC, _LIB etc variables are set and expected to be initialized to an empty string by default. However, an in-depth review of all of them showed that WOLFSSL_{INC,LIB}, SSL_{INC,LIB}, LUA_{INC,LIB}, and maybe others were not always initialized and could sometimes leak from the environment and as such cause strange build issues when running from cascaded scripts that had exported them. The approach taken here consists in iterating over all USE_* options and unsetting any _SRC, _INC, _LIB, _CFLAGS and _LDFLAGS that follows the same name. For the few variable names options that don't exactly match the build option (SSL & WOLFSSL), these ones are specifically added to the list. The few that were explicitly cleared in their own sections were just removed since not needed anymore. Note that an "undefine" command appeared in GNU make 3.82 but since we support older ones we can only initialize the variables to an empty string here. It's not a problem in practice. We're now certain that these variables are empty wherever they are used, and that it is possible to just append to them, or use them as-is.	2022-12-23 16:53:35 +01:00
Willy Tarreau	848362f2d2	BUILD: makefile: sort the features list The features list that appears in -vv appears in a random order, which always makes it a pain to look for certain features. Let's sort it.	2022-12-23 16:53:35 +01:00
Willy Tarreau	69e7b7f677	BUILD: makefile: move common options-oriented macros to include/make/options.mk Some macros and functions are barely understandable and are only used to iterate over known options from the use_opts list. Better assign them a name and move them into a dedicated file to clean the makefile a little bit. Now at least "use_opts" only appears once, where it is defined. This also allowed to completely remove the BUILD_FEATURES macro that caused some confusion until previous commit.	2022-12-23 16:53:35 +01:00
Amaury Denoyelle	663e872e3a	MEDIUM: mux-quic: implement STOP_SENDING emission Implement STOP_SENDING. This is divided in two main functions : * qcc_abort_stream_read() which can be used by application protocol to request for a STOP_SENDING. This set the flag QC_SF_READ_ABORTED. * qcs_send_reset() is a static function called after the preceding one. It will send a STOP_SENDING via qcc_send(). QC_SF_READ_ABORTED flag is now properly used : if activated on a stream during qcc_recv(), <qcc.app_ops.decode_qcs> callback is skipped. Also, abort reading on unknown unidirection remote stream is now fully supported with the emission of a STOP_SENDING as specified by RFC 9000. This commit is part of implementing H3 errors at the stream level. This will allows the H3 layer to request the peer to close its endpoint for an error on a stream. This should be backported up to 2.7.	2022-12-22 16:38:16 +01:00
Amaury Denoyelle	5854fc08cc	MINOR: mux-quic: handle RESET_STREAM reception Implement RESET_STREAM reception by mux-quic. On reception, qcs instance will be mark as remotely closed and its Rx buffer released. The stream layer will be flagged on error if still attached. This commit is part of implementing H3 errors at the stream level. Indeed, on H3 stream errors, STOP_SENDING + RESET_STREAM should be emitted. The STOP_SENDING will in turn generate a RESET_STREAM by the remote peer which will be handled thanks to this patch. This should be backported up to 2.7.	2022-12-22 16:38:04 +01:00
Amaury Denoyelle	a473f196f1	MEDIUM: mux-quic: implement shutw Implement mux_ops shutw operation for QUIC mux. A RESET_STREAM is emitted unless the stream is already closed due to all data or RESET_STREAM already transmitted. This operation is notably useful when upper stream layer wants to close the connection early due to an error. This was tested by using a HTTP server which listens with PROXY protocol support. The corresponding server line on haproxy configuration deliberately not specify send-proxy. This causes the server to close abruptly the connection. Without this patch, nothing was done on the QUIC stream which was kept open until the whole connection is closed. Now, a proper RESET_STREAM is emitted to report the error. This should be backported up to 2.7.	2022-12-22 16:22:39 +01:00
William Lallemand	be6a873096	BUG/MINOR: httpclient/log: free of invalid ptr with httpclient_log_format free_proxy() must check if the ptr is not httpclient_log_format before trying to free p->conf.logformat_string. No backport needed.	2022-12-22 15:39:31 +01:00
Christopher Faulet	c960a3b60f	BUG/MINOR: pool/stats: Use ullong to report total pool usage in bytes in stats The same change was already performed for the cli. The stats applet and the prometheus exporter are also concerned. Both use the stats API and rely on pool functions to get total pool usage in bytes. pool_total_allocated() and pool_total_used() must return 64 bits unsigned integer to avoid any wrapping around 4G. This may be backported to all versions.	2022-12-22 13:46:21 +01:00
Remi Tricot-Le Breton	c8d814ed63	MINOR: ssl: Move OCSP code to a dedicated source file This is a simple cleanup that moves OCSP related code to a dedicated file instead of interlacing it in some pure ssl connection code.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	6477bbd78d	MEDIUM: ssl: Add ocsp update task main function This patch contains the main function of the ocsp auto update mechanism as well as an init and destroy function of the task used for this. The task is not created in this patch but in a later one. The function has two distinct parts and the branching to one or the other is completely based on the fact that the cur_ocsp pointer of the ssl_ocsp_task_ctx member is set. If the pointer is not set, we need to look at the first item of the update tree and see if it needs to be updated. If it does not we simply wait until the time is right and let the task asleep. If it does need to be updated, we simply build and send the corresponding ocsp request thanks to the http_client. The task is then sent to sleep with an expire time set to infinity. The http_client will wake it back up once the response is received (or a timeout occurs). Just note that during this whole process the cetificate_ocsp object corresponding to the entry being updated is taken out of the update tree and only stored in the ssl_ocsp_task_ctx context. Once the task is waken up by the http_client, it branches on the response processing part of the function which basically checks that the response is valid and inserts it into the ocsp_response tree. The task then goes back to sleep until another entry needs to be updated.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	fb2b9988e8	MINOR: ssl: Store 'ocsp-update' mode in the ckch_data and check for inconsistencies The 'ocsp-update' option is parsed at the same time as all the other bind line options but it does not actually have anything to do with the bind line since it concerns the frontend certificate instead. For that reason, we should have a mean to identify inconsistencies in the configuration and raise an error when a given certificate has two different ocsp-update modes specified in one or more crt-lists. The simplest way to do it is to store the ocsp update mode directly in the ckch and not only in the ssl_bind_conf.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	03c5ffff8e	MINOR: ssl: Add crt-list ocsp-update option This option will define how the ocsp update mechanism behaves. The option can either be set to 'on' or 'off' and can only be specified in a crt-list entry so that we ensure that it concerns a single certificate. The 'off' mode is the default one and corresponds to the old behavior (no automatic update). When the option is set to 'on', we will try to get an ocsp response whenever an ocsp uri can be found in the frontend's certificate. The only limitation of this mode is that the certificate's issuer will have to be known in order for the OCSP certid to be built. This patch only adds the parsing of the option. The full functionality will come in a later commit.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	cc346678dc	MEDIUM: ssl: Add ocsp_certid in ckch structure and discard ocsp buffer early The ocsp_response member of the cert_key_and_chain structure is only used temporarily. During a standard init process where an ocsp response is provided, this ocsp file is first copied into the ocsp_response buffer without any ocsp-related parsing (see ssl_sock_load_ocsp_response_from_file), and then the contents are actually interpreted and inserted into the actual ocsp tree (cert_ocsp_tree) later in the process (see ssl_sock_load_ocsp). If the response was deemed valid, it is then copied into the actual ocsp_response structure's 'response' field (see ssl_sock_load_ocsp_response). From this point, the ocsp_response field of the cert_key_and_chain object could be discarded since actual ocsp operations will be based of the certificate_ocsp object. The only remaining runtime use of the ckch's ocsp_response field was in the CLI, and more precisely in the 'show ssl cert' mechanism. This constraint could be removed by adding an OCSP_CERTID directly in the ckch because the buffer was only used to get this id. This patch then adds the OCSP_CERTID pointer in the ckch, it clears the ocsp_response buffer early and simplifies the ckch_store_build_certid function.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	c0b4058e7e	MINOR: ssl: Add helper function that checks the validity of an OCSP response This helper function will check that an OCSP response is valid, meaning that the proper "Content-Type: application/ocsp-response" header is present and the data itself is a proper OCSP_RESPONSE that can be checked thanks to the issuer certificate.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	e09d2ae598	MINOR: ssl: Add OCSP request helper function This function creates the url and body that will be used to build a proper OCSP request for a given certid (following section A.1 of RFC6960).	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	47a4f1239d	MINOR: ssl: Add helper function that extracts an OCSP URI from a certificate This function extracts the first OCSP URI (if any) contained in a certificate. It only takes the first of potentially multiple URIs.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	95e7cf1ddf	MINOR: httpclient: Make the CLI flags public for future use Those flags used by the http_client in its CLI function might come to use for OCSP updates that will strongly rely on the http client.	2022-12-21 11:21:07 +01:00
Remi Tricot-Le Breton	2b96364b35	MINOR: ssl: Add a lock to the OCSP response tree The tree that contains OCSP responses is never locked despite being used at runtime for OCSP stapling as well as the CLI through "set ssl cert" and "set ssl ocsp-response" commands. Everything works though because the certificate_ocsp structure is refcounted and the tree's entries are cleaned up when SSL_CTXs are destroyed (thanks to an ex_data entry in which the certificate_ocsp pointer is stored). This new lock will come to use when the OCSP auto update mechanism is fully implemented because this new feature will be based on another tree that stores the same certificate_ocsp members and updates their contents periodically.	2022-12-21 11:21:07 +01:00
Willy Tarreau	eed7826529	BUG/MEDIUM: quic: properly take shards into account on bind lines Shards were completely forgotten in commit f5a0c8abf ("MEDIUM: quic: respect the threads assigned to a bind line"). The thread mask is taken from the bind_conf, but since shards were introduced in 2.5, the per-listener mask is held by the receiver and can be smaller than the bind_conf's mask. The effect here is that the traffic is not distributed to the appropriate thread. At first glance it's not dramatic since it remains one of the threads eligible by the bind_conf, but it still means that in some contexts such as "shards by-thread", some concurrency may persist on listeners while they're expected to be alone. One identified impact is that it requires more rxbufs than necessary, but there may possibly be other not yet identified side effects. This must be backported to 2.7 and everywhere the commit above is backported.	2022-12-21 09:27:26 +01:00
Amaury Denoyelle	15337fd808	BUG/MEDIUM: mux-quic: fix double delete from qcc.opening_list qcs instances for bidirectional streams are inserted in <qcc.opening_list>. It is removed from the list once a full HTTP request has been parsed. This is required to implement http-request timeout. In case a stream is deleted before receiving full HTTP request, it also must be removed from <qcc.opening_list>. This was not the case on first implementation but has been fixed by the following patch : 641a65ff3cccd394eed49378c6ccdb8ba0a101a7 BUG/MINOR: mux-quic: remove qcs from opening-list on free This means that now a stream can be deleted from the list in two different functions. Sadly, as LIST_DELETE was used in both cases, nothing prevented a double-deletion from the list, even though LIST_INLIST was used. Both calls are replaced with LIST_DEL_INIT which is idempotent. This bug causes memory corruption which results in most cases in a segfault, most of times outside of mux-quic code itself. It has been found first by gabrieltz who reported it on the github issue #1903. Big thanks to him for his testing. This bug also causes failures on several 'M' transfer testcase of QUIC interop-runner. The s2n-quic client is particularly useful in this case as segfaults triggers were most of the times on the LIST_DELETE operation itself. This is probably due to its encapsulating of HEADERS frame with fin bit delayed in a following empty STREAM frame. This must be backported wherever the above patch is, up to 2.6.	2022-12-21 08:58:04 +01:00
Willy Tarreau	e327b4a73e	MINOR: freq_ctr: add opportunistic versions of swrate_add() Some uses of swrate_add() only consist in getting a rough estimate of a frequency. There are cases where speed matters more than accuracy (e.g. pools). For such use cases, let's just stop looping on the CAS, if the update fails, another thread is already providing input, and it's not dramatic to lose the race. All these functions are now suffixed with "_opportunistic".	2022-12-20 14:51:12 +01:00
Willy Tarreau	284cfc67b8	MINOR: pool: make the thread-local hot cache size configurable Till now it was only possible to change the thread local hot cache size at build time using CONFIG_HAP_POOL_CACHE_SIZE. But along benchmarks it was sometimes noticed a huge contention in the lower level memory allocators indicating that larger caches could be beneficial, especially on machines with large L2 CPUs. Given that the checks against this value was no longer on a hot path anymore, there was no reason for continuing to force it to be tuned at build time. So this patch allows to set it by tune.memory-hot-size. It's worth noting that during the boot phase the value remains zero so that it's possible to know if the value was set or not, which opens the possibility that we try to automatically adjust it based on the per-cpu L2 cache size or the use of certain protocols (none of this is done yet).	2022-12-20 14:51:12 +01:00
Willy Tarreau	4dd33d9c32	OPTIM: pool: split the read_mostly from read_write parts in pool_head Performance profiling on a 48-thread machine showed a lot of time spent in pool_free(), precisely at the point where pool->limit was retrieved. And the reason is simple. Some parts of the pool_head are heavily updated only when facing a cache miss ("allocated", "used", "needed_avg"), while others are always accessed (limit, flags, size). The fact that both entries were stored into the same cache line makes it very difficult for each thread to access these precious info even when working with its own cache. By just splitting the fields apart, a test on QUIC (which stresses pools a lot) more than doubled performance from 42 Gbps to 96 Gbps! Given that the patch only reorders fields and addresses such a significant contention, it should be backported to 2.7 and 2.6.	2022-12-20 14:51:12 +01:00
William Lallemand	46bea1c616	BUILD: peers: peers-t.h depends on stick-table-t.h peers-t.h uses "struct stktable" as well as STKTABLE_DATA_TYPES which are defined in stick-table-t.h. It works by accident because stick-table-t.h was always included before. But could provoke build issue with EXTRA code. To be backported as far as 2.2.	2022-12-16 15:51:44 +01:00

1 2 3 4 5 ...

6642 Commits