haproxy

Author	SHA1	Message	Date
Christopher Faulet	d02e1170f8	BUG/MAJOR: buf: Fix copy of wrapping output data when a buffer is realigned There is a bug in b_slow_realign() function when wrapping output data are copied in the swap buffer. block1 and block2 sizes are inverted. Thus blocks with a wrong size are copied. It leads to data mixin if the first block is in reality larger than the second one or to a copy of data outside the buffer is the first block is smaller than the second one. The bug was introduced when the buffer API was refactored in 1.9. It was found by a code review and seems never to have been triggered in almost 5 years. However, we cannot exclude it is responsible of some unresolved bugs. This patch should fix issue #1978. It must be backported as far as 2.0. (cherry picked from commit 61aded057dafc419f62b9534d03e6c99a3405f7a) Signed-off-by: Willy Tarreau <w@1wt.eu> (cherry picked from commit 4a048c13f5ec3bcd060c8af955fe51694400b69d) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:33:34 +01:00
Christopher Faulet	a6ff9f5361	BUG/MINOR: pool/stats: Use ullong to report total pool usage in bytes in stats The same change was already performed for the cli. The stats applet and the prometheus exporter are also concerned. Both use the stats API and rely on pool functions to get total pool usage in bytes. pool_total_allocated() and pool_total_used() must return 64 bits unsigned integer to avoid any wrapping around 4G. This may be backported to all versions. (cherry picked from commit c960a3b60f5d05b82cdac2a33ab22ca465787e60) Signed-off-by: Willy Tarreau <w@1wt.eu> (cherry picked from commit b174d82dff11d7fb67e9a7f53c20a658f23dd9e7) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:29:07 +01:00
Amaury Denoyelle	b118355b00	BUG/MEDIUM: mux-quic: fix double delete from qcc.opening_list qcs instances for bidirectional streams are inserted in <qcc.opening_list>. It is removed from the list once a full HTTP request has been parsed. This is required to implement http-request timeout. In case a stream is deleted before receiving full HTTP request, it also must be removed from <qcc.opening_list>. This was not the case on first implementation but has been fixed by the following patch : 641a65ff3cccd394eed49378c6ccdb8ba0a101a7 BUG/MINOR: mux-quic: remove qcs from opening-list on free This means that now a stream can be deleted from the list in two different functions. Sadly, as LIST_DELETE was used in both cases, nothing prevented a double-deletion from the list, even though LIST_INLIST was used. Both calls are replaced with LIST_DEL_INIT which is idempotent. This bug causes memory corruption which results in most cases in a segfault, most of times outside of mux-quic code itself. It has been found first by gabrieltz who reported it on the github issue #1903. Big thanks to him for his testing. This bug also causes failures on several 'M' transfer testcase of QUIC interop-runner. The s2n-quic client is particularly useful in this case as segfaults triggers were most of the times on the LIST_DELETE operation itself. This is probably due to its encapsulating of HEADERS frame with fin bit delayed in a following empty STREAM frame. This must be backported wherever the above patch is, up to 2.6. (cherry picked from commit 15337fd8085288ac10de66bb048c8d655fbb0f25) Signed-off-by: Willy Tarreau <w@1wt.eu> (cherry picked from commit 151737fa818ffb37c8eb1706ef16722b6dd68f8b) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:28:16 +01:00
Willy Tarreau	8b227febbf	OPTIM: pool: split the read_mostly from read_write parts in pool_head Performance profiling on a 48-thread machine showed a lot of time spent in pool_free(), precisely at the point where pool->limit was retrieved. And the reason is simple. Some parts of the pool_head are heavily updated only when facing a cache miss ("allocated", "used", "needed_avg"), while others are always accessed (limit, flags, size). The fact that both entries were stored into the same cache line makes it very difficult for each thread to access these precious info even when working with its own cache. By just splitting the fields apart, a test on QUIC (which stresses pools a lot) more than doubled performance from 42 Gbps to 96 Gbps! Given that the patch only reorders fields and addresses such a significant contention, it should be backported to 2.7 and 2.6. (cherry picked from commit 4dd33d9c322d3be167c3d672aebf6108f4f7889b) Signed-off-by: Willy Tarreau <w@1wt.eu> (cherry picked from commit 7d1b6977199fb663f39c928f3f159fd078d1b30d) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:27:52 +01:00
Aurelien DARRAGON	dabfdf987f	MINOR: stats: introduce stats field ctx Add a new value in stats ctx: field. Implement field support in line dumping parent functions stats_print_proxy_field_json() and stats_dump_proxy_to_buffer(). This will allow child dumping functions to support partial line dumping when needed. ie: when dumping buffer is exhausted: do a partial send and wait for a new buffer to finish the dump. Thanks to field ctx, the function can start dumping where it left off on previous (unterminated) invokation. (cherry picked from commit 559418419048426faaa216f8bb9ad254f0052d4f) Signed-off-by: William Lallemand <wlallemand@haproxy.org> (cherry picked from commit 84f6ea521b4f92779b15d5cd4de6539462dba54a) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:26:52 +01:00
Amaury Denoyelle	e78bb0f6b1	BUG/MINOR: quic: properly handle alloc failure in qc_new_conn() qc_new_conn() is used to allocate a quic_conn instance and its various internal members. If one allocation fails, quic_conn_release() is used to cleanup things. For the moment, pool_zalloc() is used which ensures that all content is null. However, some members must be initialized to a special values to be able to use quic_conn_release() safely. This is the case for quic_conn lists and its tasklet. Also, some quic_conn internal allocation functions were doing their own cleanup on failure without reset to NULL. This caused an issue with quic_conn_release() which also frees this members. To fix this, these functions now only return an error without cleanup. It is the caller responsibility to free the allocated content, which is done via quic_conn_release(). Without this patch, allocation failure in qc_new_conn() would often result in segfault. This was reproduced easily using fail-alloc at 10%. This should be backported up to 2.6. (cherry picked from commit dbf6ad470b3206f64254141e7cf80a980261be29) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com> (cherry picked from commit d35d46916d8ff53b13c08862297f49b5d881d738) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:25:07 +01:00
Amaury Denoyelle	d1975e2fc7	MINOR: http: extract content-length parsing from H2 Extract function h2_parse_cont_len_header() in the generic HTTP module. This allows to reuse it for all HTTP/x parsers. The function is now available as http_parse_cont_len_header(). Most notably, this will be reused in the next bugfix for the H3 parser. This is necessary to check that content-length header match the length of DATA frames. Thus, it must be backported to 2.6. (cherry picked from commit 15f3cc4b389d1e92f7d537a2321ad027cf3b5a15) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com> (cherry picked from commit 76d3becee5c10aacabb5cb26b6776c00ca5b9ae6) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:24:03 +01:00
Cedric Paillet	d8567a9fea	MINOR: promex: introduce haproxy_backend_agg_check_status This patch introduces haproxy_backend_agg_check_status metric as we wanted in 42d7c402d but with the right data source. This patch could be backported as far as 2.4. (cherry picked from commit e06e31ea3b62ef8ccb911ac3969ae70f7bbb7574) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com> (cherry picked from commit f0319e0f56581873f906f79dc218bf6f10b8f6c2) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:19:28 +01:00
Cedric Paillet	7962dcc093	BUG/MINOR: promex: create haproxy_backend_agg_server_status haproxy_backend_agg_server_check_status currently aggregates haproxy_server_status instead of haproxy_server_check_status. We deprecate this and create a new one, haproxy_backend_agg_server_status to clarify what it really does. This patch could be backported as far as 2.4. (cherry picked from commit 7d6644e689f15b329789a355ea2812ea0223fe4f) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com> (cherry picked from commit 2c0d7982e7612b2e7157170aa7109f20b780bb64) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2023-01-20 09:19:24 +01:00
Remi Tricot-Le Breton	0ce2ac2a72	BUG/MINOR: ssl: Fix potential overflow Coverity raised a potential overflow issue in these new functions that work on unsigned long long objects. They were added in commit 9b25982 "BUG/MEDIUM: ssl: Verify error codes can exceed 63". This patch needs to be backported alongside 9b25982. (cherry picked from commit e239e4938d89956e7820be4a0f26e782a86bcf6d) Signed-off-by: William Lallemand <wlallemand@haproxy.org>	2023-01-17 15:11:27 +01:00
Remi Tricot-Le Breton	64fa46abcc	BUG/MEDIUM: ssl: Verify error codes can exceed 63 The CRT and CA verify error codes were stored in 6 bits each in the xprt_st field of the ssl_sock_ctx meaning that only error code up to 63 could be stored. Likewise, the ca-ignore-err and crt-ignore-err options relied on two unsigned long longs that were used as bitfields for all the ignored error codes. On the latest OpenSSL1.1.1 and with OpenSSLv3 and newer, verify errors have exceeded this value so these two storages must be increased. The error codes will now be stored on 7 bits each and the ignore-err bitfields are replaced by a big enough array and dedicated bit get and set functions. It can be backported on all stable branches. [wla: let it be tested a little while before backport] Signed-off-by: William Lallemand <wlallemand@haproxy.org> (cherry picked from commit 9b25982716f0416c28f8fc894c58eb40885cf9e5) Signed-off-by: William Lallemand <wlallemand@haproxy.org>	2023-01-17 15:11:22 +01:00
William Lallemand	f0994946ae	BUILD: peers: peers-t.h depends on stick-table-t.h peers-t.h uses "struct stktable" as well as STKTABLE_DATA_TYPES which are defined in stick-table-t.h. It works by accident because stick-table-t.h was always included before. But could provoke build issue with EXTRA code. To be backported as far as 2.2. (cherry picked from commit 46bea1c6163731a45749e4429fbd1294441a7c68) Signed-off-by: William Lallemand <wlallemand@haproxy.org> (cherry picked from commit 5c89a0c0484b706cfa10398be8539f39c7b311e9) Signed-off-by: William Lallemand <wlallemand@haproxy.org>	2022-12-16 16:03:01 +01:00
Amaury Denoyelle	8083a52d73	CLEANUP: ncbuf: inline small functions ncbuf API relies on lot of small functions. Mark these functions as inline to reduce call invocations and facilitate compiler optimizations to reduce code size. This should be backported up to 2.6. (cherry picked from commit d64a26f0238f386065e26654e6a8a925f96c8baa) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-12-02 15:31:28 +01:00
William Lallemand	b1351c1a05	BUG/MINOR: ssl: shut the ca-file errors emitted during httpclient init With an OpenSSL library which use the wrong OPENSSLDIR, HAProxy tries to load the OPENSSLDIR/certs/ into @system-ca, but emits a warning when it can't. This patch fixes the issue by allowing to shut the error when the SSL configuration for the httpclient is not explicit. Must be backported in 2.6. (cherry picked from commit 0a2d63236c4ada9a33f7e9495aa332fdcd9f5f82) [wla: context changed in httpclient_precheck()] Signed-off-by: William Lallemand <wlallemand@haproxy.org>	2022-11-25 09:58:29 +01:00
Willy Tarreau	51743eea8b	BUG/MINOR: server/idle: at least use atomic stores when updating max_used_conns In 2.2, some idle conns usage metrics were added by commit cf612a045 ("MINOR: servers: Add a counter for the number of currently used connections."), which mentioned that the operation doesn't need to be atomic since we're not seeking exact values. This is true but at least we should use atomic stores to make sure not to cause invalid values to appear on archs that wouldn't guarantee atomicity when writing an int, such as writing two 16-bit words. This is pretty unlikely on our targets but better keep the code safe against this. This may be backported as far as 2.2. (cherry picked from commit 9dc231a6b23fc7d5cf3c233b46e00b9e251325b4) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-11-25 09:25:57 +01:00
Frédéric Lécaille	38c47fb838	BUG/MAJOR: quic: Crash after discarding packet number spaces This previous patch was not sufficient to prevent haproxy from crashing when some Handshake packets had to be inspected before being possibly retransmitted: "BUG/MAJOR: quic: Crash upon retransmission of dgrams with several packets" This patch introduced another issue: access to packets which have been released because still attached to others (in the same datagram). This was the case for instance when discarding the Initial packet number space before inspecting an Handshake packet in the same datagram through its ->prev or member in our case. This patch implements quic_tx_packet_dgram_detach() which detaches a packet from the adjacent ones in the same datagram to be called when ackwowledging a packet (as done in the previous commit) and when releasing its memory. This was, we are sure the released packets will not be accessed during retransmissions. Thank you to @gabrieltz for having reported this issue in GH #1903. Must be backported to 2.6. (cherry picked from commit 74b5f7b31b6c68e220e68cb4a0b302137a9a7362) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-11-25 09:25:23 +01:00
Frédéric Lécaille	05e719b7bd	BUG/MAJOR: quic: Crash upon retransmission of dgrams with several packets As revealed by some traces provided by @gabrieltz in GH #1903 issue, there are clients (chrome I guess) which acknowledge only one packet among others in the same datagram. This is the case for the first datagram sent by a QUIC haproxy listener made an Initial packet followed by an Handshake one. In this identified case, this is the Handshake packet only which is acknowledged. But if the client is able to respond with an Handshake packet (ACK frame) this is because it has successfully parsed the Initial packet. So, why not also acknowledging it? AFAIK, this is mandatory. On our side, when restransmitting this datagram, the Handshake packet was accessed from the Initial packet after having being released. Anyway. There is an issue on our side. Obviously, we must not expect an implementation to respect the RFC especially when it want to build an attack ;) With this simple patch for each TX packet we send, we also set the previous one in addition to the next one. When a packet is acknowledged, we detach the next one and the next one in the same datagram from this packet, so that it cannot be resent when resending these packets (the previous one, in our case). Thank you to @gabrieltz for having reported this issue. Must be backported to 2.6. (cherry picked from commit 814645f42fae3e6dea994f88aa3b67cf43958dcf) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-11-25 09:25:23 +01:00
Amaury Denoyelle	d4c880d649	BUG/MINOR: quic: fix subscribe operation Subscribing was not properly designed between quic-conn and quic MUX layers. Align this as with in other haproxy components : <subs> field is moved from the MUX to the quic-conn structure. All mention of qcc MUX is cleaned up in quic_conn_subscribe()/quic_conn_unsubscribe(). Thanks to this change, ACK reception notification has been simplified. It's now unnecessary to check for the MUX existence before waking it. Instead, if <subs> quic-conn field is set, just wake-up the upper layer tasklet without mentionning MUX. This should probably be extended to other part in quic-conn code. This should be backported up to 2.6. (cherry picked from commit bbb1c68508ceebb98ac4234c906a65a42596e6ea) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-11-17 16:34:22 +01:00
Amaury Denoyelle	9c15bd5d37	MINOR: quic: display unknown error sendto counter on stat page This patch complete the previous incomplete commit. The new counter sendto_err_unknown is now displayed on stats page/CLI show stats. This is related to github issue #1903. This should be backported up to 2.6. (cherry picked from commit 7941ead3aa00c9e83fadf70a1d6d515d20421ad0) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:52:33 +02:00
Amaury Denoyelle	934659e0ce	MINOR: quic: do not crash on unhandled sendto error Remove ABORT_NOW() statement on unhandled sendto error. Instead use a dedicated counter sendto_err_unknown to report these cases. If we detect increment of this counter, strace can be used to detect errno value : $ strace -p $(pidof haproxy) -f -e trace=sendto -Z This should be backported up to 2.6. This should help to debug github issue #1903. (cherry picked from commit 1d9f170eddd8703ba550e91322298e88e8280075) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:52:22 +02:00
Amaury Denoyelle	9287b4047b	BUG/MINOR: mux-quic: complete flow-control for uni streams Max stream data was not enforced and respect for local/remote uni streams. Previously, qcs instances incorrectly reused the limit defined from bidirectional ones. This is now fixed. Two fields are added in qcc structure connection : * value for local flow control to enforce on remote uni streams * value for remote flow control to respect on local uni streams These two values can be reused to properly initialized msd field of a qcs instance in qcs_new(). The rest of the code is similar. This must be backported up to 2.6. (cherry picked from commit 176174f7e4734ca8d7a27a622be44ec386d36f4c) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:50:54 +02:00
William Lallemand	3fd456abc7	BUG/MEDIUM: httpclient/lua: crash when the lua task timeout before the httpclient When the lua task finished before the httpclient that are associated to it, there is a risk that the httpclient try to task_wakeup() the lua task which does not exist anymore. To fix this issue the httpclient used in a lua task are stored in a list, and the httpclient are destroyed at the end of the lua task. Must be backported in 2.5 and 2.6. (cherry picked from commit bb581423b3ba48dfafb53b70205483f766242a6b) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:49:59 +02:00
Amaury Denoyelle	2b697ca18f	MINOR: quic: define first packet flag Received packets treatment has some difference regarding if this is the first one or not of the encapsulating datagram. Previously, this was set via a function argument. Simplify this by defining a new Rx packet flag named QUIC_FL_RX_PACKET_DGRAM_FIRST. This change does not have functional impact. It will simplify API when qc_lstnr_pkt_rcv() is broken into several functions : their number of arguments will be reduced thanks to this patch. This should be backported up to 2.6. (cherry picked from commit deb7c87f5525d5645dd7c94fb187603edbb8d27a) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:49:29 +02:00
Amaury Denoyelle	2f0d2c3197	MINOR: quic: extend pn_offset field from quic_rx_packet pn_offset field was only set if header protection cannot be removed. Extend the usage of this field : it is now set everytime on packet parsing in qc_lstnr_pkt_rcv(). This change helps to clean up API of Rx functions by removing unnecessary variables and function argument. This change has no functional impact. It is a part of a refactoring series on qc_lstnr_pkt_rcv(). The objective is facilitate integration of FD-owned socket patches. This should be backported up to 2.6. (cherry picked from commit 845169da584655dedce3286e7e0011fab3f10507) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:49:25 +02:00
Amaury Denoyelle	60abfa542e	MINOR: quic: add version field on quic_rx_packet Add a new field version on quic_rx_packet structure. This is set on header parsing in qc_lstnr_pkt_rcv() function. This change has no functional impact. It is a part of a refactoring series on qc_lstnr_pkt_rcv(). The objective is facilitate integration of FD-owned socket patches. This should be backported up to 2.6. (cherry picked from commit 0eae57273b3a2b585ddc19d6f97cdc05f2203f0b) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:49:22 +02:00
Amaury Denoyelle	6f2acccd90	CLEANUP: quic: improve naming for rxbuf/datagrams handling QUIC datagrams are read from a random thread. They are then redispatch to the connection thread according to the first packet DCID. These operations are implemented through a special buffer designed to avoid locking. Refactor this code with the following changes : * <rxbuf> type is renamed <quic_receiver_buf>. Its list element is also renamed to highligh its attach point to a receiver. * <quic_dgram> and <quic_receiver_buf> definition are moved to quic_sock-t.h. This helps to reduce the size of quic_conn-t.h. * <quic_dgram> list elements are renamed to highlight their attach point into a <quic_receiver_buf> and a <quic_dghdlr>. This should be backported up to 2.6. (cherry picked from commit 1cba8d60f3f9ef6abf632386f490fd0b0dc2e6b5) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:46:11 +02:00
Amaury Denoyelle	94cf0e576d	CLEANUP: quic: remove unused rxbufs member in receiver rxbuf is the structure used to store QUIC datagrams and redispatch them to the connection thread. Each receiver manages a list of rxbuf. This was stored both as an array and a mt_list. Currently, only mt_list is needed so removed <rxbufs> member from receiver structure. This should be backported up to 2.6. (cherry picked from commit 8c4d062d25f57ec983ea8e71a3ccc2f1f9ba7b00) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:46:08 +02:00
Frédéric Lécaille	118e94d0c0	MINOR: quic: Split the secrets key allocation in two parts Implement quic_tls_secrets_keys_alloc()/quic_tls_secrets_keys_free() to allocate the memory for only one direction (RX or TX). Modify ha_quic_set_encryption_secrets() to call these functions for one of this direction (or both). So, for now on we can rely on the value of the secret keys to know if it was derived. Remove QUIC_FL_TLS_SECRETS_SET flag which is no more useful. Consequently, the secrets are dumped by the traces only if derived. Must be backported to 2.6. (cherry picked from commit e1a49cfd4dc58b1923e99931dd507d2945e5ec8e) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:46:04 +02:00
Frédéric Lécaille	e0212bcb47	BUG/MINOR: quic: Stalled 0RTT connections with big ClientHello TLS message This issue was reproduced with -Q picoquic client option to split a big ClientHello message into two Initial packets and haproxy as server without any knowledged of any previous ORTT session (restarted after a firt 0RTT session). The ORTT received packets were removed from their queue when the second Initial packet was parsed, and the QUIC handshake state never progressed and remained at Initial state. To avoid such situations, after having treated some Initial packets we always check if there are ORTT packets to parse and we never remove them from their queue. This will be done after the hanshake is completed or upon idle timeout expiration. Also add more traces to be able to analize the handshake progression. Tested with ngtcp2 and picoquic Must be backported to 2.6. (cherry picked from commit 4aa7d8197ac565976c2b2eec93a9b98e905829df) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:46:01 +02:00
Frédéric Lécaille	c094639d65	MINOR: quic: Use a non-contiguous buffer for RX CRYPTO data Implement quic_get_ncbuf() to dynamically allocate a new ncbuf to be attached to any quic_cstream struct which needs such a buffer. Note that there is no quic_cstream for 0RTT encryption level. quic_free_ncbuf() is added to release the memory allocated for a non-contiguous buffer. Modify qc_handle_crypto_frm() to call this function and allocate an ncbuf for crypto data which are not received in order. The crypto data which are received in order are not buffered but provide to the TLS stack (calling qc_provide_cdata()). Modify qc_treat_rx_crypto_frms() which is called after having provided the in order received crypto data to the TLS stack to provide again the remaining crypto data which has been buffered, if possible (if they are in order). Each time buffered CRYPTO data were consumed, we try to release the memory allocated for the non-contiguous buffer (ncbuf). Also move rx.crypto.offset quic_enc_level struct member to rx.offset quic_cstream struct member. Must be backported to 2.6. (cherry picked from commit 9f9263ed13ab28a492323484c4c4be89f218848f) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:45:58 +02:00
Frédéric Lécaille	6b84c17d99	MINOR: quic: New quic_cstream object implementation Add new quic_cstream struct definition to implement the CRYPTO data stream. This is a simplication of the qcs object (QUIC streams) for the CRYPTO data without any information about the flow control. They are not attached to any tree, but to a QUIC encryption level, one by encryption level except for the early data encryption level (for 0RTT). A stream descriptor is also allocated for each CRYPTO data stream. Must be backported to 2.6 (cherry picked from commit 7e3f7c47e9acdf074c678cdee4202192fffb7de4) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:45:49 +02:00
Willy Tarreau	b088056f3d	CLEANUP: quic/receiver: remove the now unused tx_qring list The tx_qrings[] and tx_qring_list in the receiver are not used anymore since commit f2476053f ("MINOR: quic: replace custom buf on Tx by default struct buffer"), the only place where they're referenced was in quic_alloc_tx_rings_listener(), which by the way implies that these were not even freed on exit. Let's just remove them. This should be backported to 2.6 since the commit above also was. (cherry picked from commit cab054bbf9908f3732648e5236d889650a6e33f7) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:42:38 +02:00
Amaury Denoyelle	3701ab7823	MEDIUM: quic: retrieve frontend destination address Retrieve the frontend destination address for a QUIC connection. This address is retrieve from the first received datagram and then stored in the associated quic-conn. This feature relies on IP_PKTINFO or affiliated flags support on the socket. This flag is set for each QUIC listeners in sock_inet_bind_receiver(). To retrieve the destination address, recvfrom() has been replaced by recvmsg() syscall. This operation and parsing of msghdr structure has been extracted in a wrapper quic_recv(). This change is useful to finalize the implementation of 'dst' sample fetch. As such, quic_sock_get_dst() has been edited to return local address from the quic-conn. As a best effort, if local address is not available due to kernel non-support of IP_PKTINFO, address of the listener is returned instead. This should be backported up to 2.6. (cherry picked from commit 97ecc7a8ea5339a753507c3d4e4cd83028c6d038) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-25 11:42:27 +02:00
Amaury Denoyelle	68f165013c	MINOR: quic: limit usage of ssl_sock_ctx in favor of quic_conn Continue on the cleanup of QUIC stack and components. quic_conn uses internally a ssl_sock_ctx to handle mandatory TLS QUIC integration. However, this is merely as a convenience, and it is not equivalent to stackable ssl xprt layer in the context of HTTP1 or 2. To better emphasize this, ssl_sock_ctx usage in quic_conn has been removed wherever it is not necessary : namely in functions not related to TLS. quic_conn struct now contains its own wait_event for tasklet quic_conn_io_cb(). This should be backported up to 2.6. (cherry picked from commit 2ed840015f2d7ca37af77c0d9808cac3b0441d40) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 08:43:48 +02:00
Willy Tarreau	d22ff05c3e	MINOR: fd: add a new function to only raise RLIMIT_NOFILE In issue #1866 an issue was reported under docker, by which a user cannot lower the number of FD needed. It looks like a restriction imposed in this environment, but it results in an error while it ought not have to in the case of shrinking. This patch adds a new function raise_rlim_nofile() that takes the desired new setting, compares it to the current one, and only calls setrlimit() if one of the values in the new setting is larger than the older one. As such it will continue to emit warnings and errors in case of failure to raise the limit but will never shrink it. This patch is only preliminary to another one, but will have to be backported where relevant (likely only 2.6). (cherry picked from commit 922a907926d1cb02a8a10c7cdb9917755c934c84) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 08:43:48 +02:00
Amaury Denoyelle	4a6be622b2	CLEANUP: quic: create a dedicated quic_conn module xprt_quic module was too large and did not reflect the true architecture by contrast to the other protocols in haproxy. Extract code related to XPRT layer and keep it under xprt_quic module. This code should only contains a simple API to communicate between QUIC lower layer and connection/MUX. The vast majority of the code has been moved into a new module named quic_conn. This module is responsible to the implementation of QUIC lower layer. Conceptually, it overlaps with TCP kernel implementation when comparing QUIC and HTTP1/2 stacks of haproxy. This should be backported up to 2.6. (cherry picked from commit 92fa63f73596bf7e567b7bbd600dd8621a1b49ad) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 08:43:48 +02:00
Amaury Denoyelle	228883ca32	CLEANUP: quic: remove duplicated varint code from xprt_quic.h There was some identical code between xprt_quic and quic_enc modules. This concerns helper on QUIC varint type. Keep only the version in quic_enc file : this should help to reduce dependency on xprt_quic module. Note that quic_max_int_by_size() has been removed and is replaced by the identical quic_max_int(). This should be backported up to 2.6. (cherry picked from commit a2639383ece04c2fee3bbdda54dab66a640f6aa1) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 08:43:48 +02:00
Amaury Denoyelle	69baa24752	CLEANUP: quic: fix headers Clean up quic sources by adjusting headers list included depending on the actual dependency of each source file. On some occasion, xprt_quic.h was removed from included list. This is useful to help reducing the dependency on this single file and cleaning up QUIC haproxy architecture. This should be backported up to 2.6. (cherry picked from commit 5c25dc5bfd5d253925f954aab072a2bf1fd1d6e2) [cf: Include <haproxy/global.h> from cfgparse-quic.c instead of only <haproxy/global-t.h">. On 2.7, it is shipped with "tools.h" (tools.h > cli.h > global.h). But not on the 2.6] Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 08:41:20 +02:00
Amaury Denoyelle	90a008239e	BUG/MINOR: quic: adjust quic_tls prototypes Two prototypes in quic_tls module were not identical to the actual function definition. * quic_tls_decrypt2() : the second argument const attribute is not present, to be able to use it with EVP_CIPHER_CTX_ctlr(). As a consequence of this change, token field of quic_rx_packet is now declared as non-const. * quic_tls_generate_retry_integrity_tag() : the second argument type differ between the two. Adjust this by fixing it to as unsigned char to match EVP_EncryptUpdate() SSL function. This situation did not seem to have any visible effect. However, this is clearly an undefined behavior and should be treated as a bug. This should be backported up to 2.6. (cherry picked from commit f3c40f83fbfc6fb60ba5608ccfbd00fb51e6f9b3) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 07:43:42 +02:00
Amaury Denoyelle	adf910e519	CLEANUP: quic: remove global var definition in quic_tls header Some variables related to QUIC TLS were defined in a header file : their definitions are now moved properly in the implementation file, with only declarations in the header. This should be backported up to 2.6. (cherry picked from commit a19bb6f0b2af1971775e4a88edfaed85d42162c6) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 07:43:39 +02:00
Willy Tarreau	22beb2ad21	BUG/MINOR: backend: only enforce turn-around state when not redispatching In github issue #1878, Bart Butler reported observing turn-around states (1 second pause) after connection retries going to different servers, while this ought not happen. In fact it does happen because back_handle_st_cer() enforces the TAR state for any algo that's not round-robin. This means that even leastconn has it, as well as hashes after the number of servers changed. Prior to doing that, the call to stream_choose_redispatch() has already had a chance to perform the correct choice and to check the algo and the number of retries left. So instead we should just let that function deal with the algo when needed (and focus on deterministic ones), and let the former just obey. Bart confirmed that the fixed version works as expected (no more delays during retries). This may be backported to older releases, though it doesn't seem very important. At least Bart would like to have it in 2.4 so let's go there for now after it has cooked a few weeks in 2.6. (cherry picked from commit 406efb96d135efe1d5a85bf58c589f7b6dbd8c70) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 07:43:13 +02:00
Willy Tarreau	c564abd107	BUG/MAJOR: conn-idle: fix hash indexing issues on idle conns Idle connections do not work on 32-bit machines due to an alignment issue causing the connection nodes to be indexed with their lower 32-bits set to zero and the higher 32 ones containing the 32 lower bitss of the hash. The cause is the use of ebmb_node with an aligned data, as on this platform ebmb_node is only 32-bit aligned, leaving a hole before the following hash which is a uint64_t: $ pahole -C conn_hash_node ./haproxy struct conn_hash_node { struct ebmb_node node; /* 0 20 / / XXX 4 bytes hole, try to pack / int64_t hash; / 24 8 / struct connection conn; /* 32 4 / / size: 40, cachelines: 1, members: 3 / / sum members: 32, holes: 1, sum holes: 4 / / padding: 4 / / last cacheline: 40 bytes */ }; Instead, eb64 nodes should be used when it comes to simply storing a 64-bit key, and that is what this patch does. For backports, a variant consisting in simply marking the "hash" member with a "packed" attribute on the struct also does the job (tested), and might be preferable if the fix is difficult to adapt. Only 2.6 and 2.5 are affected by this. (cherry picked from commit 852234848241f61a976f8856123a34a3c19275ba) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-10-10 07:40:32 +02:00
Aurelien DARRAGON	9779c1b5d7	BUG/MINOR: log: improper behavior when escaping log data Patrick Hemmer reported an improper log behavior when using log-format to escape log data (+E option): Some bytes were truncated from the output: - escape_string() function now takes an extra parameter that allow the caller to specify input string stop pointer in case the input string is not guaranteed to be zero-terminated. - Minors checks were added into lf_text_len() to make sure dst string will not overflow. - lf_text_len() now makes proper use of escape_string() function. This should be backported as far as 1.8. (cherry picked from commit c5bff8e550cf49b0cb3a7abb998b2c915323eca9) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-20 16:31:39 +02:00
Amaury Denoyelle	28e18246be	BUG/MEDIUM: mux-quic: properly trim HTX buffer on snd_buf reset MUX QUIC snd_buf operation whill return early if a qcs instance is resetted. In this case, HTX is left untouched and the callback returns the whole bufer size. This lead to an undefined behavior as the stream layer is notified about a transfer but does not see its HTX buffer emptied. In the end, the transfer may stall which will lead to a leak on session. To fix this, HTX buffer is now resetted when snd_buf is short-circuited. This should fix the issue as now the stream layer can continue the transfer until its completion. This patch has already been tested by Tristan and is reported to solve the github issue #1801. This should be backported up to 2.6. (cherry picked from commit 0ed617ac2ff377ce60bd9c8fd97fd9da32d43971) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-20 15:58:22 +02:00
Amaury Denoyelle	53e4116b6b	MINOR: mux-quic: refactor snd_buf Factorize common code between h3 and hq-interop snd_buf operation. This is inserted in MUX QUIC snd_buf own callback. The h3/hq-interop API has been adjusted to directly receive a HTX message instead of a plain buf. This led to extracting part of MUX QUIC snd_buf in qmux_http module. This should be backported up to 2.6. (cherry picked from commit 9534e59bb9057cfa5762f9c119579a67f705de37) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-20 15:58:18 +02:00
Amaury Denoyelle	0859fbf203	REORG: mux-quic: export HTTP related function in a dedicated file Extract function dealing with HTX outside of MUX QUIC. For the moment, only rcv_buf stream operation is concerned. The main objective is to be able to support both TCP and HTTP proxy mode with a common base and add specialized modules on top of it. This should be backported up to 2.6. (cherry picked from commit d80fbcaca266696a7d6de7342876d104c42e91e9) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-20 15:58:14 +02:00
Amaury Denoyelle	f3cab28f96	REORG: mux-quic: extract traces in a dedicated source file QUIC MUX implements several APIs to interface with stream, quic-conn and app-ops layers. It is planified to better separate this roles, possibly by using several files. The first step is to extract QUIC MUX traces in a dedicated source files. This will allow to reuse traces in multiple files. The main objective is to be able to support both TCP and HTTP proxy mode with a common base and add specialized modules on top of it. This should be backported up to 2.6. (cherry picked from commit 36d50bff22563ba650918ccedaa695fcb6b8fa3e) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-20 15:58:06 +02:00
Amaury Denoyelle	4075ed06e6	BUG/MEDIUM: mux-quic: fix nb_hreq decrement nb_hreq is a counter on qcc for active HTTP requests. It is incremented for each qcs where a full HTTP request was received. It is decremented when the stream is closed locally : - on HTTP response fully transmitted - on stream reset A bug will occur if a stream is resetted without having processed a full HTTP request. nb_hreq will be decremented whereas it was not incremented. This will lead to a crash when building with DEBUG_STRICT=2. If BUG_ON_HOT are not active, nb_hreq counter will wrap which may break the timeout logic for the connection. This bug was triggered on haproxy.org. It can be reproduced by simulating the reception of a STOP_SENDING frame instead of a STREAM one by patching qc_handle_strm_frm() : + if (quic_stream_is_bidi(strm_frm->id)) + qcc_recv_stop_sending(qc->qcc, strm_frm->id, 0); + //ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len, + // strm_frm->offset.key, strm_frm->fin, + // (char *)strm_frm->data); To fix this bug, a qcs is now flagged with a new QC_SF_HREQ_RECV. This is set when the full HTTP request is received. When the stream is closed locally, nb_hreq will be decremented only if this flag was set. This must be backported up to 2.6. (cherry picked from commit afb7b9d8e5a70a741bbb890945fa9ff51dad027d) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-20 15:52:18 +02:00
Amaury Denoyelle	01a5be8c38	CLEANUP: mux-quic: remove stconn usage in h3/hq Small cleanup on snd_buf for application protocol layer. * do not export h3_snd_buf * replace stconn by a qcs argument. This is better as h3/hq-interop only uses the qcs instance. This should be backported up to 2.6. (cherry picked from commit 8d4ac48d3def189190c29b6f1f5d697b180f7e30) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-19 11:41:38 +02:00
Amaury Denoyelle	57b3c47e70	BUG/MEDIUM: mux-quic: fix crash on early app-ops release H3 SETTINGS emission has recently been delayed. The idea is to send it with the first STREAM to reduce sendto syscall invocation. This was implemented in the following patch : 3dd79d378c86b3ebf60e029f518add5f1ed54815 MINOR: h3: Send the h3 settings with others streams (requests) This patch works fine under nominal conditions. However, it will cause a crash if a HTTP/3 connection is released before having sent any data, for example when receiving an invalid first request. In this case, qc_release will first free qcc.app_ops HTTP/3 application protocol layer via release callback. Then qc_send is called to emit any closing frames built by app_ops release invocation. However, in qc_send, as no data has been sent, it will try to complete application layer protocol intialization, with a SETTINGS emission for HTTP/3. Thus, qcc.app_ops is reused, which is invalid as it has been just freed. This will cause a crash with h3_finalize in the call stack. This bug can be reproduced artificially by generating incomplete HTTP/3 requests. This will in time trigger http-request timeout without any data send. This is done by editing qc_handle_strm_frm function. - ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len, + ret = qcc_recv(qc->qcc, strm_frm->id, strm_frm->len - 1, strm_frm->offset.key, strm_frm->fin, (char *)strm_frm->data); To fix this, application layer closing API has been adjusted to be done in two-steps. A new shutdown callback is implemented : it is used by the HTTP/3 layer to generate GOAWAY frame in qc_release prologue. Application layer context qcc.app_ops is then freed later in qc_release via the release operation which is now only used to liberate app layer ressources. This fixes the problem as the intermediary qc_send invocation will be able to reuse app_ops before it is freed. This patch fixes the crash, but it would be better to adjust H3 SETTINGS emission in case of early connection closing : in this case, there is no need to send it. This should be implemented in a future patch. This should fix the crash recently experienced by Tristan in github issue #1801. This must be backported up to 2.6. (cherry picked from commit f8aaf8bdfa40e21b1a2f600c3ed6455bf9b6a763) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2022-09-19 11:41:25 +02:00

1 2 3 4 5 ...

6381 Commits