A stream may be shut without any HTX EOM reported, while a proper closure
must still be signaled. This is the case for QCS instances flagged with
QC_SF_UNKNOWN_PL_LENGTH: the shut is performed with an empty FIN emission
instead of a RESET_STREAM. This behavior was introduced by the following
patch :
24962dd178
BUG/MEDIUM: mux-quic: do not emit RESET_STREAM for unknown length
However, in the case of HTTP/3, an empty FIN must only be sent after a
full message has been emitted, which requires at least a HEADERS frame. If
an empty FIN is emitted without it, the client may interpret this as
invalid and close the connection. To prevent this, fall back to a
RESET_STREAM emission if no data was emitted on the stream.
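As a rough illustration of the intended decision, here is a standalone
sketch with made-up stand-in types (not the actual mux-quic code):

  #include <stdint.h>

  /* Simplified stand-in for the real QCS state; illustrative only. */
  struct qcs_sketch {
      uint64_t tx_offset;   /* bytes already emitted on the stream */
  };

  enum shut_action { SHUT_EMPTY_FIN, SHUT_RESET_STREAM };

  /* On shut with unknown payload length, an empty FIN is only valid if
   * something (at least a HEADERS frame) was already emitted; otherwise
   * fall back to RESET_STREAM so the client never sees a bare FIN. */
  static enum shut_action qcs_shut_action(const struct qcs_sketch *qcs)
  {
      return qcs->tx_offset ? SHUT_EMPTY_FIN : SHUT_RESET_STREAM;
  }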
This was reproduced using ngtcp2-client with 10% loss (-r 0.1) on a
remote host, with httpterm request "/?s=100k&C=1&b=0&P=400". An error
ERR_H3_FRAME_UNEXPECTED is returned by ngtcp2-client when the bug
occurs.
Note that this change is incomplete. Message validity depends solely
on the application protocol in use. As such, a new app_ops callback
should be implemented to ensure the stream is closed accordingly.
However, this first patch ensures that at least the HTTP/3 case is valid,
while keeping the backport process minimal.
This should be backported up to 2.8.
(cherry picked from commit 68c8c910238f0b759d75b4da2128370abf184cd1)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Before this patch, when a wrong argument was provided in the configuration
for the mworker-max-reloads keyword, the parser showed the errors below on
stderr:
[WARNING] (1820317) : config : parsing [haproxy.cfg:154] : (null)parsing [haproxy.cfg:154] : 'mworker-max-reloads' expects an integer argument.
When, by mistake, two arguments were provided instead of one, a buggy
error message was also triggered:
[ALERT] (1820668) : config : parsing [haproxy.cfg:154] : 'mworker-max-reloads' cannot handle unexpected argument '45'.
[WARNING] (1820668) : config : parsing [haproxy.cfg:154] : (null)
So, as 'mworker-max-reloads' is parsed in discovery mode by the master
process, let's align its parser with all the others that may be called in
this mode. This way, when there are too many arguments or the argument
isn't a valid integer, proper error codes are returned to the global
section parser and the messages are formatted properly.
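The expected behavior can be sketched as a standalone parser (names, error
codes and messages are simplified stand-ins for haproxy's cfgparse API):

  #include <stdlib.h>
  #include <string.h>

  #define ERR_NONE  0
  #define ERR_ALERT 1

  /* args[0] is the keyword, args[1] the expected integer argument. */
  static int parse_max_reloads_sketch(char **args, char **errmsg)
  {
      char *endp;

      if (args[2] && *args[2]) {            /* too many arguments */
          *errmsg = strdup("'mworker-max-reloads' expects a single integer argument.");
          return ERR_ALERT;
      }
      if (!args[1] || !*args[1] || strtol(args[1], &endp, 10) < 0 || *endp) {
          *errmsg = strdup("'mworker-max-reloads' expects an integer argument.");
          return ERR_ALERT;                 /* proper code and message */
      }
      return ERR_NONE;
  }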
This fix should be backported in all stable versions.
(cherry picked from commit af1d170122369094a1f3869791fb34fb7286e31e)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Replace "specicy" with "specify" in the rfc7239 forwarded option
description. Multiple occurrences were found.
May be backported in 2.8.
(cherry picked from commit 45cbbdc84551e51cdaf0046e1371e8495d053fb5)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
This issue came with this commit:
f627b92 BUG/MEDIUM: quic: always validate sender address on 0-RTT
and could easily be reproduced with the picoquic QUIC client with the -Q
option, which splits a big ClientHello TLS message into two Initial
datagrams. A second condition must be fulfilled to reproduce this issue:
picoquic must not send the token provided by haproxy (NEW_TOKEN). To do
that, haproxy must be patched to prevent it from sending such tokens.
Under these conditions, if haproxy has enough time to reply to the first
Initial datagram, when it receives the second Initial datagram it sends a
Retry packet. Then the client ignores the Retry packet, as mentioned by
RFC 9000:
17.2.5.2. Handling a Retry Packet
A client MUST accept and process at most one Retry packet for each connection
attempt. After the client has received and processed an Initial or Retry packet
from the server, it MUST discard any subsequent Retry packets that it receives.
On its side, haproxy has closed the connection. When it receives the second
Initial datagram, it opens a new connection but with Initial packets it
cannot decrypt (wrong ODCID), leaving the client without a response.
To fix this, as the aim of the token (NEW_TOKEN) sent by haproxy is to
validate the peer address, instead of closing the connection when no token
was received for a 0-RTT connection, this validation is left to the
handshake process. Indeed, the peer address is validated during the
handshake when a valid handshake packet is received by the listener. But as
we do not want haproxy to process 0-RTT data when no token was received,
the connection is not accepted before the successful handshake completion.
In addition, the 0-RTT packets are not released after successful handshake
completion when no token was received, to give haproxy a chance to process
these 0-RTT data in such a case (see quic_conn_io_cb()).
Must be backported as far as 2.9.
(cherry picked from commit b1af5dabf0c4af1eda3a520a90332df1f4c12dcf)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
This bug came with this commit:
f627b92 BUG/MEDIUM: quic: always validate sender address on 0-RTT
If an error happens in quic_build_post_handshake_frames() in the code
executed for the NEW_TOKEN frame allocation, some frames could leak because
of the wrong label used to interrupt this function asap.
Replace the "goto leave" by "goto err" to deallocate such frames and fix
this issue.
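The pattern of the fix looks like this (standalone sketch, not the actual
quic code):

  #include <stdlib.h>

  struct frm { struct frm *next; };

  static int build_frames_sketch(struct frm **head)
  {
      struct frm *f = calloc(1, sizeof(*f));

      if (!f)
          goto err;        /* was "goto leave": already-built frames leaked */
      f->next = *head;
      *head = f;
      return 1;

   err:
      while (*head) {      /* deallocate the frames built so far */
          struct frm *n = (*head)->next;
          free(*head);
          *head = n;
      }
      return 0;
  }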
Must be backported as far as 2.9.
(cherry picked from commit 19aa320f640f701544c3441787da1577a2479590)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
A "Connection: close" header is added to responses to avoid any connection
reuse. This should avoid errors on the client side.
(cherry picked from commit e7be13da87f8ec00470ef60bb43b85f0480fd85d)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
When a filter is registered on the data, it means it may change the payload
length by rewriting data. Consumers of the message therefore cannot trust
the expected payload length announced by the producer. Commit 8bd835b2d2
("MEDIUM: filters/htx: Don't rely on HTX extra field if payload is filtered")
was pushed to solve this issue. When the HTTP payload of a message is
filtered, the extra field is set to 0 to be sure it will never be used by
mistake by any consumer. However, it is not enough.
Indeed, the filters must be called before forwarding some data; they cannot
be bypassed. But if a consumer is unable to flush the HTX message, some
outgoing data can remain blocked in the channel's buffer. If some new data
are then pushed because there is some room in the channel's buffer, the
producer will set the HTX extra field. At this stage, if the consumer is
unblocked and can send data again, it may be called to forward the outgoing
data blocked in the channel's buffer before waking the stream up to filter
the new input data. This is the purpose of the data fast-forwarding. In
this case, the HTX extra field will be seen by the consumer. This is
unexpected and leads to undefined behavior.
One consequence of this bug is to perform a wrong chunking on compressed
messages, leading to processing errors at the end of the message, reported as
"ID--" in logs.
To fix the bug, an HTX flag is added to state that the payload of the
current HTX message has been altered. When this flag is set
(HTX_FL_ALTERED_PAYLOAD), the HTX extra field must not be trusted. And to
keep things simple, when this flag is set, the HTX extra field is
automatically set to 0 when the HTX message is loaded, in the htxbuf()
function.
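The idea can be sketched as follows (simplified stand-in types and flag
value; the real htxbuf() works on a struct buffer):

  #include <stdint.h>

  #define HTX_FL_ALTERED_PAYLOAD 0x01u   /* value is illustrative */

  struct htx_sketch {
      uint32_t flags;
      uint64_t extra;    /* announced remaining payload length */
  };

  static struct htx_sketch *htxbuf_sketch(struct htx_sketch *htx)
  {
      /* payload was altered by a filter: the announced length can no
       * longer be trusted, so zero it before any consumer sees it */
      if (htx->flags & HTX_FL_ALTERED_PAYLOAD)
          htx->extra = 0;
      return htx;
  }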
It is probably the least intrusive way to fix the bug for now. But this
part must be reviewed to store the meta-info of the HTX message outside of
the message itself.
This commit should solve the issue #2741. It must be backported as far as 2.9.
(cherry picked from commit 52a3d807fc332b57b62f5e30aa6f697636a22695)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
In the sc_notify() function, the consumer side of the SC is tested to
verify if we must perform a shutdown on the endpoint. To do so, no output
data must be present in the buffer nor in the iobuf. However, there is a
bug here: the iobuf of the opposite SC is tested instead of the one of the
current SC. So a shutdown can be performed on the endpoint while there are
still output data in the iobuf that must be sent. Concretely, it can only
be data blocked in a pipe.
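In other words, the test must look at the current side's iobuf (standalone
sketch with simplified stand-in types):

  #include <stdbool.h>
  #include <stddef.h>

  struct iobuf_sketch { size_t pipe_data; };  /* e.g. bytes stuck in a pipe */
  struct sc_sketch {
      size_t out_data;               /* output data left in the buffer */
      struct iobuf_sketch iobuf;
      struct sc_sketch *opposite;
  };

  static bool may_shut_endpoint(const struct sc_sketch *sc)
  {
      /* the buggy version tested sc->opposite->iobuf.pipe_data */
      return !sc->out_data && !sc->iobuf.pipe_data;
  }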
Because of this bug, data blocked in the pipe will never be sent. I've not
tested it, but I guess this may block the stream until the client or server
timeout.
This patch must be backported as far as 2.9.
(cherry picked from commit 0fcfed9e231f2bc3963fe6085598970db2174af1)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
If a parsing error is reported by the mux on the response payload, a proxy
error (PRXCOND) must be reported instead of a server abort (SRVCL). Because
of this bug, invalid responses may be reported as "SD--" or "SL--" in logs
instead of "PD--" or "PL--".
This patch must be backported to all stable versions.
(cherry picked from commit 6790067e79566b2ca5943e72200361c40001bde2)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
When a send attempt is performed on the opposite side from sc_notify() and
all outgoing data are sent while a shut was scheduled, the SE is shut down
because we consider all data were sent and no more are expected. However,
here we must also be careful to have sent all pending data in the
iobuf. Indeed, some spliced data may be blocked. In this case, if the SE is
shut down, these data may be lost.
This patch should fix the original bug reported in #2749. It must be
backported as far as 2.9.
(cherry picked from commit 48f1e2b6fe8457bb5b9d8db9447157c244d871b7)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Latest patches on the mworker rework skipped the httpclient_proxy
creation by accident. This is not supposed to happen because haproxy is
supposed to stop when the proxy creation failed, but it shows a flaw in
the API.
When the httpclient_proxy or the proxy used in parameter of
httpclient_new_from_proxy() is NULL, it will be dereferenced and cause a
crash.
The patch simply makes httpclient_new() return NULL when the proxy is not
available.
Must be backported as far as 2.7.
(cherry picked from commit e7b7072943d658702eba3651d66c6093f1a79fa8)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Since commit 53f52e67a0 ("BUG/MEDIUM: queue: always dequeue the backend when
redistributing the last server"), we've got two reports again still showing
the theoretically impossible condition in pendconn_add(), including a single
threaded one.
Thanks to the traces, the issue could be tracked down to the redispatch part.
In fact, in non-deterministic LB algorithms (RR, LC, FAS), we don't perform the
LB if there are pending connections in the backend, since it indicates that
previous attempts already failed, so we directly return SRV_STATUS_FULL. And
contrary to a previous belief, it is possible to meet this condition with
be->served==0 when redispatching (and likely with maxconn not greater than
the number of threads).
The problem is that in this case, the entry is queued and then the
pendconn_must_try_again() function checks if any connections are currently
being served to detect if we missed a race, and tries again, but that
situation is not caused by a concurrent thread and will never fix itself,
resulting in the loop.
All that part around pendconn_must_try_again() is still quite brittle, and
a safer approach would involve a sequence counter to detect new arrivals
and dequeues during the pendconn_add() call. But it's more sensitive work,
probably for a later fix.
This fix must be backported wherever the fix above was backported. Thanks
to Patrick Hemmer, as well as Damien Claisse and Basha Mougamadou from
Criteo for their help on tracking this one!
(cherry picked from commit ca275d99ce02e72d707fc87da133d739cdda5146)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
If a small request is received on a QUIC MUX frontend, it can be
transmitted directly with the FIN on the attach operation; rcv_buf is then
skipped by the stream layer. Thus, it is necessary to ensure that the
behavior is similar whether FIN is reported on attach or on rcv_buf.
One difference was that se_expect_data() was called only for rcv_buf but
not on attach. The most obvious effect is that the stream timeout was
deactivated for this request: the client timeout was disabled on EOI but
the server one was not armed, due to the previous se_expect_no_data(). This
prevented the early closure of too long requests.
To fix this, add an invocation of se_expect_data() on the attach operation.
This bug can simply be detected using httpterm with a delayed request (for
example /?t=10000) and smaller client/server timeouts. The bug is present
if the request is not aborted on timeout but instead continues until its
proper HTTP 200 termination.
This has been introduced by the following commit :
85eabfbf67
MEDIUM: mux-quic: Don't expect data from server as long as request is unfinished
This must be backported up to 2.8.
(cherry picked from commit 232083c3e5ca3f23a44fa64def6a88dd257c3b23)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
To execute sample fetches and converters from Lua, the hlua API leverages
the sample API. Prior to executing the sample function, the arg checker is
called from hlua_run_sample_{fetch,conv}() to detect potential errors.
However, hlua_run_sample_{fetch,conv}() both pass NULL as the <err>
argument, which is wrong for two reasons. First, we miss an opportunity to
report precise error messages to help the user know what went wrong during
the check. More importantly, some val check functions consider that the
<err> pointer is never NULL. This is the case for example with
check_crypto_hmac(). Because of this, when such val check functions
encounter an error, they crash the process because they try to dereference
NULL.
This bug was discovered and reported by GH user @JB0925 on #2745.
Perhaps val check functions should make sure that the provided <err>
pointer is != NULL prior to dereferencing it. But since there are
multiple occurrences found in the code and the API isn't clear about that,
it is easier to fix the hlua part (the caller) for now.
To fix the issue, let's always provide a valid <err> pointer when
leveraging the val_arg() check function pointer, and make use of it in case
of error to report a relevant message to the user before freeing it.
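The shape of the fix (standalone sketch; the checker signature is a
simplified stand-in for the real val check functions):

  #include <stdio.h>
  #include <stdlib.h>
  #include <string.h>

  /* stand-in for a val check function which assumes err != NULL */
  static int val_check_stub(int arg_ok, char **err)
  {
      if (!arg_ok) {
          *err = strdup("invalid argument");   /* dereferences err blindly */
          return 0;
      }
      return 1;
  }

  static int run_check_sketch(int arg_ok)
  {
      char *err = NULL;   /* always pass a valid <err> pointer */

      if (!val_check_stub(arg_ok, &err)) {
          fprintf(stderr, "check failed: %s\n", err ? err : "(unknown)");
          free(err);      /* report, then free */
          return 0;
      }
      free(err);
      return 1;
  }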
It should be backported to all stable versions.
(cherry picked from commit f88f162868df9053ca71e3be0628221c36153d9a)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
hlua_ctx_renew() is called from unsafe places where the caller doesn't
expect it to LJMP. However, hlua_ctx_renew() makes use of Lua library
functions that could potentially raise errors, such as lua_newthread(),
and it does nothing to catch them. Because of this, haproxy could
unexpectedly crash. This was discovered and reported by GH user
@JB0925 on #2745.
To fix the issue, let's simply make hlua_ctx_renew() safe by applying
the same logic implemented for hlua_ctx_init() or hlua_ctx_destroy(),
which is catching Lua errors by leveraging the SET_SAFE_LJMP_PARENT() helper.
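The protective pattern boils down to running the raising call under a
protected Lua call; SET_SAFE_LJMP_PARENT() implements a comparable guard
around haproxy's Lua state. A minimal sketch using the plain Lua C API
(Lua 5.2+):

  #include <lua.h>

  static int do_newthread(lua_State *L)
  {
      lua_newthread(L);    /* may raise on OOM; safe inside a pcall */
      return 1;
  }

  static int renew_coroutine_sketch(lua_State *L)
  {
      lua_pushcfunction(L, do_newthread);
      if (lua_pcall(L, 0, 1, 0) != LUA_OK) {  /* catch instead of longjmp'ing out */
          lua_pop(L, 1);                      /* drop the error message */
          return 0;
      }
      return 1;
  }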
It should be backported to all stable versions.
(cherry picked from commit d0e01051813bde5cb06bebe88102f2b2885b3dea)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Pierre Bonnat reported that SRV-based server-template recently stopped
working properly.
After reviewing the changes, it was found that the regression was caused
by a4d04c6 ("BUG/MINOR: server: make sure the HMAINT state is part of MAINT").
Indeed, HMAINT is not a regular maintenance flag. It was implemented in
b418c122 ("MINOR: server: cli: Add server FQDNs to server-state file and
stats socket."). This flag is only set (and never removed) when the server
FQDN is changed from its initial config-time value. This can happen with
the "set server fqdn" command as well as SRV record updates from the DNS.
This flag should ideally belong to the server flags, but it was stored
under the srv_admin enum because cur_admin is properly exported/imported
via the server state-file while regular server flags are not.
Due to a4d04c6, when a server FQDN changes, the server is considered in
maintenance, and since the HMAINT flag is never removed, the server is
stuck in maintenance.
To fix the issue, we partially revert a4d04c6. But this latter commit is
right on one point: the HMAINT flag was way too confusing and easy to mix
up with regular MAINT flags, thus there's nothing to blame about a4d04c6 as
it was error-prone anyway. To prevent this kind of bug from happening
again, let's rename HMAINT to something more explicit
(SRV_ADMF_FQDN_CHANGED) and make it stand out under the srv_admin enum so
we're not tempted to mix it with regular maintenance flags anymore.
Since a4d04c6 was set to be backported in all versions, this patch must
be backported there as well.
(cherry picked from commit 85298189bf4c268b15c33aea95e0cc35113e25f0)
Signed-off-by: Aurelien DARRAGON <adarragon@haproxy.com>
One main problem with panic dumps is that they're filling the dumping
thread's trash, and that the global thread_dump_buffer is too small to
catch enough of them.
Here we're proceeding differently. When dumping threads for a panic, we're
passing the magic value 0x2 as the buffer, and it will instruct the target
thread to allocate its own buffer using get_trash_chunk() (which is signal
safe), so that each thread dumps into its own buffer. Then the thread will
wait for the buffer to be consumed, and will assign its own thread_dump_buffer
to it. This way we can simply dump all threads' buffers from gdb like this:
(gdb) set $t=0
while ($t < global.nbthread)
printf "%s\n", ha_thread_ctx[$t].thread_dump_buffer.area
set $t=$t+1
end
For now we make it wait forever since it's only called on panic and we
want to make sure the thread doesn't leave and continue to use that trash
buffer or do other nasty stuff. That way the dumping thread will make all
of them die.
This would be useful to backport to the most recent branches to help
troubleshooting. It backports well to 2.9, except for some trivial
context in tinfo-t.h for an updated comment. 2.8 and older would also
require TAINTED_PANIC. The following previous patches are required:
MINOR: debug: make mark_tainted() return the previous value
MINOR: chunk: drop the global thread_dump_buffer
MINOR: debug: split ha_thread_dump() in two parts
MINOR: debug: slightly change the thread_dump_pointer signification
MINOR: debug: make ha_thread_dump_done() take the pointer to be used
MINOR: debug: replace ha_thread_dump() with its two components
(cherry picked from commit 278b9613a32ddb0b5ffa1d66e6c25f434758c65a)
[wt: ctx updt in tinfo-t for comment]
Signed-off-by: Willy Tarreau <w@1wt.eu>
At the few places we were calling ha_thread_dump(), now we're
calling separately ha_thread_dump_fill() and ha_thread_dump_done()
once the data are consumed.
(cherry picked from commit afeac4bc026e64e27d99e50480bff5bc1ee60cb9)
Signed-off-by: Willy Tarreau <w@1wt.eu>
This will allow the caller to decide whether to definitely clear the
pointer and release the thread, or to leave it unlocked so that it's
easy to analyse from the struct (the goal will be to use that in panic()
so that cores are easy to analyse).
(cherry picked from commit d7c34ba479f79ed42c709c64679b4e3d1310cddd)
Signed-off-by: Willy Tarreau <w@1wt.eu>
Now the thread_dump_pointer is returned ORed with 1 once done, or NULL
when cancelled (for now no one cancels). The goal will be to permit
the callee to provide its own pointer.
The ha_thread_dump_fill() function now returns the buffer pointer that
was used (without OR 1) or NULL, for ease of use from the caller.
(cherry picked from commit 091de0f9b2553463660a11c56598c4970d6b1066)
Signed-off-by: Willy Tarreau <w@1wt.eu>
We want to have a function to trigger the dump and another one to wait
for it to be completed. This will be important to permit panic dumps to
be done on local threads. For now this does not change anything, as the
function still calls the two new functions one after the other.
(cherry picked from commit 2036f5bba1637c8dad3eca577aa3bed4cc53caaf)
Signed-off-by: Willy Tarreau <w@1wt.eu>
This variable is not very useful and is confusing anyway. It was mostly
used to detect that a panic dump was still in progress, but we can now
check mark_tainted() for this. The pointer was set to one of the dumping
thread's trash chunks. Let's temporarily continue to copy the dumps to
that trash, we'll remove it later.
(cherry picked from commit a6698304e03847f3e114b22b4229b382f3ddffd1)
Signed-off-by: Willy Tarreau <w@1wt.eu>
Since mark_tainted() uses atomic ops to update the tainted status, let's
make it return the prior value, which will allow the caller to detect
if it's the first one to set it or not.
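Conceptually (standalone sketch using C11 atomics; haproxy's real
implementation uses its own atomic macros):

  #include <stdatomic.h>

  static _Atomic unsigned int tainted_state;

  /* returns the value prior to setting <flag>, so the caller can tell
   * whether it was the first one to set it */
  static unsigned int mark_tainted_sketch(unsigned int flag)
  {
      return atomic_fetch_or(&tainted_state, flag);
  }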
(cherry picked from commit 8e048603d1bba0721433a5ae0480fbf1ab6f4897)
Signed-off-by: Willy Tarreau <w@1wt.eu>
If a request is waiting for a protocol upgrade but it is not finished, the
data fast-forwarding is disabled. Otherwise, the request analyzers will miss
the end of the message.
This case is possible since the commit 01fb1a54 ("BUG/MEDIUM: mux-h1/mux-h2:
Reject upgrades with payload on H2 side only"). Indeed, before, a protocol
upgrade was not allowed for request with payload. But it is now possible and
this comes with a side-effect. It is not really satisfying but for now there
is no other way to sync the muxes and the applicative stream. It seems to be
a reasonable fix for now, waiting for a deeper refactoring.
This patch must be backported with the commit above.
(cherry picked from commit cea1379cf1fcd5fd9c3b3b104f4c5374bc44bce2)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
During zero-copy data forwarding, the producer must set the EOI flag on the
SE when the end of the message is reached. It is already done, but there is
a case where this flag is set while it should not be. When a request wants
to perform a protocol upgrade and is waiting for the server response, the
flag must not be set because the HTTP message is finished but some data are
possibly still expected, depending on the server response. On a
101-switching-protocols response, more data will be sent because the
producer is switched to TUNNEL state.
So, now, the right condition is used. In DONE state, the SE_FL_EOI flag is
set on the sedesc iff:
- it is the response
- it is the request and the response is also in DONE state
- it is a request but neither a protocol upgrade nor a CONNECT
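Translated to code, the condition looks like this (standalone sketch with
simplified stand-in fields):

  #include <stdbool.h>

  struct msg_sketch {
      bool is_response;
      bool other_side_done;        /* the response is also in DONE state */
      bool is_upgrade_or_connect;  /* protocol upgrade or CONNECT request */
  };

  /* EOI may be set in DONE state iff one of the three cases holds */
  static bool may_set_eoi(const struct msg_sketch *m)
  {
      return m->is_response
          || m->other_side_done
          || !m->is_upgrade_or_connect;
  }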
This patch must be backported as far as 2.9.
(cherry picked from commit 6b39e245e1d58698310fbb06b8122423232be9a4)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
An interesting bug was revealed by commit 5541d4995d ("BUG/MEDIUM: queue:
deal with a rare TOCTOU in assign_server_and_queue()"). When shutting
down a server to redistribute its connections, no check is made on the
backend's queue. If we're turning off the last server and the backend
has pending connections, these ones will wait there till the queue
timeout. But worse, since the commit above, we can enter an endless loop
in the following situation:
- streams are present in the backend's queue
- streams are purged on the last server via srv_shutdown_streams()
- that one calls pendconn_redistribute(srv) which does not purge
the backend's pendconns
- a stream performs some load balancing and enters assign_server_and_queue()
- assign_server() is called in turn
- the LB algo is non-deterministic and there are entries in the
backend's queue. The function notices it and returns SRV_STATUS_FULL
- assign_server_and_queue() calls pendconn_add() to add the connection
to the backend's queue
- on return, pendconn_must_try_again() is called, it figures there's
no stream served anymore on the server nor the proxy, so it removes
the pendconn from the queue and returns 1
- assign_server_and_queue() loops back to the beginning to try again,
while the conditions have not changed, resulting in an endless loop.
Ideally a change count should be used in the queues so that it's possible
to detect that some dequeuing happened and/or that a last stream has left.
But that wouldn't completely solve the problem that is that we must never
ever add to a queue when there's no server streams to dequeue the new
entries.
The current solution consists in making pendconn_redistribute() take care
of the proxy after the server in case there's no more server available on
the proxy. It at least ensures that no pending streams are left in the
backend's queue when shutting streams down or when the last server goes
down. The try_again loop remains necessary to deal with inevitable races
during pendconn additions. It could be limited to a few rounds, though,
but it should never trigger if the conditions are sufficient to permit
it to converge.
One way to reproduce the issue is to run a config with a single server
with maxconn 1 and plenty of threads, then run in loops series of:
"disable server px/s;shutdown sessions server px/s;
wait 100ms server-removable px/s; show servers conn px;
enable server px/s"
on the CLI at ~10/s while injecting with around 40 concurrent conns at
40-100k RPS. In this case in 10s - 1mn the crash can appear with a
backtrace like this one for at least 1 thread:
#0 pendconn_add (strm=strm@entry=0x17f2ce0) at src/queue.c:487
#1 0x000000000064797d in assign_server_and_queue (s=s@entry=0x17f2ce0) at src/backend.c:1064
#2 0x000000000064a928 in srv_redispatch_connect (s=s@entry=0x17f2ce0) at src/backend.c:1962
#3 0x000000000064ac54 in back_handle_st_req (s=s@entry=0x17f2ce0) at src/backend.c:2287
#4 0x00000000005ae1d5 in process_stream (t=t@entry=0x17f4ab0, context=0x17f2ce0, state=<optimized out>) at src/stream.c:2336
It's worth noting that other threads may often appear waiting after the
poller and one in server_atomic_sync() waiting for isolation, because
the event that is processed when shutting the server down is consumed
under isolation, and having less threads available to dequeue remaining
requests increases the probability to trigger the problem, though it is
not at all necessary (some less common traces never show them).
This should carefully be backported wherever the commit above was
backported.
(cherry picked from commit 53f52e67a08fc2cd5ae64caf4148a5de884f9959)
Signed-off-by: Willy Tarreau <w@1wt.eu>
When shutting down server sessions, the queue was not considered, which
is a problem if some element reached the queue at the moment the server
was going down, because there will be no more requests to kick them out
of it. Let's always make sure we scan the queue to kick these streams
out of it and that they can possibly find a more suitable server. This
may make a difference in the time it takes to shut down a server on the
CLI when lots of servers are in the queue.
It might be interesting to backport this to 3.0 but probably not much
further.
(cherry picked from commit 1d403caf8aa59c9070f30ea16017261cab679fe2)
Signed-off-by: Willy Tarreau <w@1wt.eu>
Turning a server to maintenance currently doesn't redispatch the server
queue unless there's an explicit "option redispatch" and no "option
persist", while the former has never really been the purpose of this
test. Better refine this so that forced maintenance also causes the
queue to be flushed, and possibly redispatched unless the proxy has
option persist. This way now when turning a server to maintenance,
the queue is immediately flushed and streams can decide what to do.
This can be backported, though there's no need to go far since it was
never directly reported and only noticed as part of debugging some
rare "shutdown sessions" strangeness, which it might participate to.
(cherry picked from commit 1385e33eb089093dbc970dbc2759d2969ae533c5)
Signed-off-by: Willy Tarreau <w@1wt.eu>
The solution found in commit b500e84e24 ("BUG/MINOR: server: shut down
streams under thread isolation") to deal with inter-thread stream
shutdown doesn't work fine because there exists code paths involving
a server lock which can then deadlock on thread_isolate(). A better
solution then consists in deferring the shutdown to the stream itself
and just wake it up for that.
The only thing is that TASK_WOKEN_OTHER is a bit too generic and we
need to pass at least 2 types of events (SF_ERR_DOWN and SF_ERR_KILLED),
so we're now leveraging the new TASK_F_UEVT1 and _UEVT2 flags on the
task's state to convey these info. The caller only needs to wake the
task up with these flags set, and the stream handler will then finish
the job locally using stream_shutdown_self().
This needs to be carefully backported to all branches affected by the
dequeuing issue and containing any of the 5541d4995d ("BUG/MEDIUM:
queue: deal with a rare TOCTOU in assign_server_and_queue()"), and/or
b11495652e ("BUG/MEDIUM: queue: implement a flag to check for the
dequeuing").
(cherry picked from commit b8e3b0a18d59b4f52b4ecb7ae61cef0b8b2402a0)
Signed-off-by: Willy Tarreau <w@1wt.eu>
TASK_WOKEN_MSG only says "someone sent you a message" but doesn't convey
any info about the message. TASK_WOKEN_OTHER says "you're woken for another
reason" but doesn't tell which one. Most often they're used as-is by the
task handlers to report very specific situations.
For some important control notifications, having the ability to modulate
the message a little bit is useful, so let's define two user event types
UEVT1 and UEVT2 to be used in conjunction with TASK_WOKEN_MSG or _OTHER
so that the application can know that a specific condition was explicitly
requested. It will be used this way:
task_wakeup(s->task, TASK_WOKEN_MSG | TASK_F_UEVT1);
or:
task_wakeup(s->task, TASK_WOKEN_OTHER | TASK_F_UEVT2);
Since events are cumulative, keep in mind not to consider a 3rd value
as the combination of EVT1+EVT2; these really mean that the two events
appeared (though in unspecified order).
(cherry picked from commit b5281283bb00248152ba361f8d7306b37febbee0)
Signed-off-by: Willy Tarreau <w@1wt.eu>
The function is provided by glibc. Nothing prevents us from using our
own outside of glibc there (tested on aarch64 with musl). We still do
not enable it by default as we don't yet know if all archs work well,
but it's sufficient to pass USE_BACKTRACE=1 when building with musl to
verify it's OK.
(cherry picked from commit 7caf073faa6962de15a18dadcaf200df95ce7889)
Signed-off-by: Willy Tarreau <w@1wt.eu>
No need to include this possibly non-existing file when using our own
backtrace() implementation, it's only needed for the libc-provided one.
Because of this, it's currently not possible to build with musl when
backtrace support is enabled.
(cherry picked from commit 1c4776dbc3e7d0a73bd62bae78d509c3c54045d6)
Signed-off-by: Willy Tarreau <w@1wt.eu>
Keywords setenv and presetenv take 2 arguments: variable name and value.
So the total number that should be passed to alertif_too_many_args is 2
("setenv <name> <value>") instead of 3. For alertif_too_many_args, the
first argument index is 0.
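As an illustration, a simplified stand-in for the real
alertif_too_many_args() check, with args[0] being the keyword itself:

  #include <stdio.h>

  static int too_many_args_sketch(int maxarg, char **args)
  {
      if (args[maxarg + 1] && *args[maxarg + 1]) {
          fprintf(stderr, "'%s' cannot handle unexpected argument '%s'.\n",
                  args[0], args[maxarg + 1]);
          return 1;
      }
      return 0;
  }

  /* for "setenv <name> <value>": too_many_args_sketch(2, args), not 3 */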
This should be backported in all stable versions.
(cherry picked from commit df68f7ec96c0c9d0e8f50643b96ec9062b7aa658)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
In 1.8 when adding "set server fqdn" with commit b418c1228c ("MINOR:
server: cli: Add server FQDNs to server-state file and stats socket."),
the HMAINT flag was not made part of the MAINT ones, so technically
speaking when changing the FQDN, the server is not completely considered
as in maintenance mode.
In its defense, the code location around that was completely messy, with
the aggregator flag being hidden between other values and purposely but
discretely ignoring one of the flags, so the comments were updated to
make the intent clearer (particularly regarding CMAINT which looked like
it was also forgotten while it was on purpose).
This can be backported anywhere.
(cherry picked from commit a4d04c649af448cf2a728ebb91837ed69eea6217)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
The proxy lock state isn't passed down to relax_listener
through dequeue_proxy_listeners, which causes a deadlock
in relax_listener when it tries to get that lock.
Backporting: Older versions didn't have relax_listener and directly called
resume_listener in dequeue_proxy_listeners. lpx should just be passed directly
to resume_listener then.
The bug was introduced in commit 001328873c
[cf: This patch should fix the issue #2726. It must be backported as far as
2.4]
(cherry picked from commit a889413f5ec5e2c0c6ed30debfd16f666dbc0f99)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
A client abort while nothing was sent is properly handled, except when it
happens immediately after the connection was accepted. In that case, the
read0 event is caught before the CLI applet is created, the shutdown is not
handled, and the applet is never woken up. The stream then remains blocked
and no timeout is armed.
The bug was due to the fact that when the applet I/O handler was called for
the first time, the applet context was initialized and nothing more was
performed. A shutdown, if any, would be handled on the next call. In that
case, it was too late.
Now, after the init step, we loop to evaluate the first command. There is
no command here, but the shutdown will be tested.
This patch should fix the issue #2727. It must be backported to 3.0.
(cherry picked from commit 14a413033c15aa4622368348ac6b142e7ec2c989)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Glitch counter was implemented for QUIC/HTTP3. The counter is stored in
the QCC MUX connection instance. However, this is never reported at the
session level, which is necessary if the glitch counter is tracked via a
stick-table.
To fix this, use session_add_glitch_ctr() in various QUIC MUX functions
which may increment glitch counter.
This should be backported up to 3.0.
(cherry picked from commit fcd6d29acf108c55e1e1c17c04aeae621327adff)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Commit d6c4ed9a96 ("REGTESTS: h1/h2: Update script testing H1/H2
protocol upgrades") introduced a 0.5 second delay which is higher
than those of most other tests (usually 0.05 or 0.2) and triggers
timeouts on my side. Let's just shorten it to 0.2 since its goal
is only to send data separately.
Note: maybe a barrier approach would be possible, though not
studied.
(cherry picked from commit 33deb4babe83b361f9f7423030cd8cc8318ca2bf)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
"http-messaging/protocol_upgrade.vtc" script was updated to test upgrades
for requests with a payload. It should fail when the request is sent to a H2
server. When sent to a H1 server, it should succeed, except if the server
replies before the end of the request.
(cherry picked from commit d6c4ed9a960d5125b3ca0b879eb127353a0b45c0)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
Since 1d2d77b27 ("MEDIUM: mux-h1: Return a 501-not-implemented for upgrade
requests with a body"), it is no longer possible to perform a protocol
upgrade for requests with a payload. The main reason was to be able to
support protocol upgrades for an H1 client requesting an H2 server. In that case,
the upgrade request is converted to a CONNECT request. So, it is not
possible to convey a payload in that case.
But, it is a problem for anyone wanting to perform upgrades on H1 server
using requests with a payload. It is uncommon but valid. So, now, it is the
H2 multiplexer responsibility to reject upgrade requests, on server side, if
there is a payload. An INTERNAL_ERROR is returned for the H2S in that
case. On H1 side, the upgrade is now allowed, but only if the server waits
for the end of the request to return the 101-Switching-protocol
response. Indeed, it is quite hard to synchronise the frontend side and the
backend side in that case. Asking servers to fully consume the request
payload before returning the response seems reasonable.
This patch should fix the issue #2684. It could be backported after a period
of observation, as far as 2.4 if possible. But only if it is not too
hard. It depends on "MINOR: mux-h1: Set EOI on SE during demux when both
side are in DONE state".
(cherry picked from commit 001fb1a5488b5059ca4c26a82dd4f00e9850f1b4)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
For now, this case is already handled for all requests except for those
waiting for a tunnel establishment (CONNECT and protocol upgrades). It is
not an issue because only bodyless requests are supported in these cases. So
the request is always finished at the end of headers and therefore before
the response.
However, to relax conditions for full H1 protocol upgrades (H1 client and
server), this case will be necessary. Indeed, the idea is to be able to
perform protocol upgrades for requests with a payload. Today, the "Upgrade:"
header is removed before sending the request to the server. But to support
this case, this patch is required to properly finish transaction when the
server does not perform the upgrade.
(cherry picked from commit ad1ef94612fb6cba5b68f7d19c4646491b6da4af)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
This commit prevents forwarding of an HTTP/2 Extended CONNECT when "h2c"
or "h2" token is set as targetted protocol. Contrary to the previous
commit which deals with HTTP/1 mux, this time the request is rejected
and a RESET_STREAM is reported to the client.
This must be backported up to 2.4 after a period of observation.
(cherry picked from commit 7a5a30d28a3ee100c62603f7212cc1b313c53311)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
haproxy supports tunnel establishment through HTTP Upgrade mechanism.
Since the following commit, extended CONNECT is also supported for
HTTP/2 both on frontend and backend side.
commit 9bf957335e
MEDIUM: mux_h2: generate Extended CONNECT from htx upgrade
As specified by HTTP/2 rfc, "h2c" can be used by an HTTP/1.1 client to
request an upgrade to HTTP/2. In haproxy, this is not supported, so it is
silently ignored. However, Connection and Upgrade headers are
forwarded as-is on the backend side.
If using HTTP/1 on the backend side and the server supports this upgrade
mechanism, haproxy won't be able to parse the HTTP response. If using
HTTP/2, the backend mux incorrectly tries to convert the request to an
Extended CONNECT with the h2c protocol, which may also prevent the response
from being transmitted.
To fix this, flag HTTP/1 requests with an "h2c" or "h2" token in an upgrade
header. When converting the header list to HTX, the upgrade header is
skipped if any of these tokens is present, and the H1_MF_CONN_UPG flag is
removed.
This issue can easily be reproduced using curl --http2 argument to
connect to an HTTP/1 frontend.
This must be backported up to 2.4 after a period of observation.
(cherry picked from commit 7b89aa5b1943c1537a844cc375f7e8a86bac7942)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
It is a small change, but it is cleaner to not include the stconn-t.h
header in connection-t.h, mainly to avoid circular definitions.
The related issue is #2502.
(cherry picked from commit 4b8098bf4831c0dfca4a058bd3170a5ed7ae8bbf)
Signed-off-by: William Lallemand <wlallemand@haproxy.com>
Released version 3.0.5 with the following main changes :
- BUG/MEDIUM: server/addr: fix tune.events.max-events-at-once event miss and leak
- BUG/MEDIUM: stconn: Report error on SC on send if a previous SE error was set
- BUG/MEDIUM: mux-pt/mux-h1: Release the pipe on connection error on sending path
- BUILD: mux-pt: Use the right name for the sedesc variable
- BUG/MINOR: stconn: bs.id and fs.id had their dependencies incorrect
- BUG/MEDIUM: ssl: reactivate 0-RTT for AWS-LC
- BUG/MEDIUM: ssl: 0-RTT initialized at the wrong place for AWS-LC
- BUG/MEDIUM: quic: prevent conn freeze on 0RTT undeciphered content
- BUG/MEDIUM: http-ana: Report error on write error waiting for the response
- BUG/MEDIUM: h2: Only report early HTX EOM for tunneled streams
- BUG/MEDIUM: mux-h2: Propagate term flags to SE on error in h2s_wake_one_stream
- BUG/MEDIUM: peer: Notify the applet won't consume data when it waits for sync
- BUG/MINOR: fcgi-app: handle a possible strdup() failure
- DOC: configuration: fix alphabetical ordering of {bs,fs}.aborted
- BUG/MINOR: trace/quic: enable conn/session pointer recovery from quic_conn
- BUG/MINOR: trace/quic: permit to lock on frontend/connect/session etc
- BUG/MEDIUM: trace: fix null deref in lockon mechanism since TRACE_ENABLED()
- BUG/MINOR: trace: automatically start in waiting mode with "start <evt>"
- BUG/MINOR: trace/quic: make "qconn" selectable as a lockon criterion
- BUG/MINOR: quic/trace: make quic_conn_enc_level_init() emit NEW not CLOSE
- BUG/MINOR: proto_tcp: delete fd from fdtab if listen() fails
- BUG/MINOR: proto_tcp: keep error msg if listen() fails
- MINOR: channel: implement ci_insert() function
- BUG/MEDIUM: mworker/cli: fix pipelined modes on master CLI
- REGTESTS: mcli: test the pipelined commands on master CLI
- BUG/MINOR: mux-quic: do not send too big MAX_STREAMS ID
- BUG/MINOR: proto_uxst: delete fd from fdtab if listen() fails
- BUG/MINOR: h3: properly reject too long header responses
- BUG/MINOR: pattern: pat_ref_set: fix UAF reported by coverity
- BUG/MINOR: pattern: pat_ref_set: return 0 if err was found
- DOC: config: correct the table for option tcplog
- BUG/MINOR: cfgparse-global: remove tune.fast-forward from common_kw_list
- BUILD: quic: 32bits build broken by wrong integer conversions for printf()
- BUG/MEDIUM: clock: also update the date offset on time jumps
- MINOR: tools: Implement ipaddrcpy().
- MINOR: quic: Implement quic_tls_derive_token_secret().
- MEDIUM: ssl/quic: implement quic crypto with EVP_AEAD
- MINOR: quic: Token for future connections implementation.
- BUG/MINOR: quic: Missing incrementation in NEW_TOKEN frame builder
- MINOR: quic: Modify NEW_TOKEN frame structure (qf_new_token struct)
- MINOR: quic: Implement qc_ssl_eary_data_accepted().
- MINOR: quic: Add trace for QUIC_EV_CONN_IO_CB event.
- BUG/MEDIUM: quic: always validate sender address on 0-RTT
- BUG/MINOR: quic: Crash from trace dumping SSL eary data status (AWS-LC)
- BUG/MINOR: quic: Too short datagram during packet building failures (aws-lc only)
- DOC: configuration: place the HAPROXY_HTTP_LOG_FMT example on the correct line
- REGTESTS: fix random failures with wrong_ip_port_logging.vtc under load
- BUG/MEDIUM: clock: detect and cover jumps during execution
- BUG/MINOR: pattern: prevent const sample from being tampered in pat_match_beg()
- BUG/MEDIUM: pattern: prevent UAF on reused pattern expr
- BUG/MAJOR: mux-h1: Wake SC to perform 0-copy forwarding in CLOSING state
- BUG/MINOR: h1-htx: Don't flag response as bodyless when a tunnel is established
- BUG/MINOR: pattern: do not leave a leading comma on "set" error messages
- MEDIUM: h1: Accept invalid T-E values with accept-invalid-http-response option
- BUG/MINOR: polling: fix time reporting when using busy polling
- BUG/MINOR: clock: make time jump corrections a bit more accurate
- BUG/MINOR: clock: validate that now_offset still applies to the current date
- BUG/MEDIUM: queue: implement a flag to check for the dequeuing
- BUG/MINOR: peers: local entries updates may not be advertised after resync
- DOC: config: Explicitly list relaxing rules for accept-invalid-http-* options
- BUG/MEDIUM: sc_strm/applet: Wake applet after a successfull synchronous send
- BUG/MEDIUM: cache/stats: Wait to have the request before sending the response
- BUG/MEDIUM: promex: Wait to have the request before sending the response
- BUG/MINOR: cfgparse-listen: fix option httpslog override warning message
- MINOR: quic: convert qc_stream_desc release field to flags
- MINOR: quic: implement function to check if STREAM is fully acked
- BUG/MEDIUM: quic: handle retransmit for standalone FIN STREAM
- BUG/MINOR: quic: prevent freeze after early QCS closure
A connection freeze may occur if a QCS is released before transmitting
any data. This can happen when an error is detected early by the stream,
for example during HTTP response headers encoding, forcing the whole
connection closure.
In this case, a connection error is registered by the QUIC MUX to the
lower layer. The MUX is then released and the xprt layer is notified to
prepare CONNECTION_CLOSE emission. However, this is prevented because the
quic_conn streams tree is not empty, as it still contains the
qc_stream_desc previously attached to the failed QCS instance. The
connection will freeze until the QUIC idle timeout.
This situation is caused by an omission during the qc_stream_desc release
operation. In the described situation, the qc_stream_desc current buffer is
empty and can thus be removed, which is the purpose of this patch. This
unblocks the previously failed situation, with the qc_stream_desc removed
from the quic_conn tree.
This issue can be reproduced by modifying H3/QPACK code to return an
early error during HEADERS response processing.
This must be backported up to 2.6, after a period of observation.
(cherry picked from commit 3ef1ee477d3fef54633465b603206dd00bfefeba)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
STREAM frames have dedicated handling on retransmission. A special check
is done to remove data already acked in case of duplicated frames, thus
only unacked data are retransmitted.
This handling is faulty in case of an empty STREAM frame with FIN set.
On retransmission, this frame does not cover any unacked range as it is
empty and is thus discarded. This may cause the transfer to freeze with
the client waiting indefinitely for the FIN notification.
To handle retransmission of an empty FIN STREAM frame, the qc_stream_desc
layer has been extended. A new flag QC_SD_FL_WAIT_FOR_FIN is set by the
QUIC MUX when the FIN has been transmitted. If set, it prevents the
qc_stream_desc from being freed until the FIN is acknowledged. On the
retransmission side, qc_stream_frm_is_acked() has been updated. It now
reports false if the FIN bit is set on the frame and the qc_stream_desc has
QC_SD_FL_WAIT_FOR_FIN set.
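The extended check can be sketched like this (simplified stand-in types,
illustrative flag value):

  #include <stdbool.h>
  #include <stdint.h>

  #define QC_SD_FL_WAIT_FOR_FIN 0x02u   /* value is illustrative */

  struct sd_sketch  { uint32_t flags; uint64_t acked_ofs; };
  struct frm_sketch { uint64_t ofs, len; bool fin; };

  static bool stream_frm_is_acked_sketch(const struct sd_sketch *sd,
                                         const struct frm_sketch *f)
  {
      /* an empty FIN frame must be retransmitted until the FIN is acked */
      if (f->fin && (sd->flags & QC_SD_FL_WAIT_FOR_FIN))
          return false;
      return f->ofs + f->len <= sd->acked_ofs;   /* data range fully acked */
  }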
This must be backported up to 2.6. However, this modifies heavily
critical section for ACK handling and retransmission. As such, it must
be backported only after a period of observation.
This issue can be reproduced by using the following socat command as
server to add delay between the response and connection closure :
$ socat TCP-LISTEN:<port>,fork,reuseaddr,crlf SYSTEM:'echo "HTTP/1.1 200 OK"; echo ""; sleep 1;'
On the client side, ngtcp2 can be used to simulate packet drop. Without
this patch, connection will be interrupted on QUIC idle timeout or
haproxy client timeout with ERR_DRAINING on ngtcp2 :
$ ngtcp2-client --exit-on-all-streams-close -r 0.3 <host> <port> "http://<host>:<port>/?s=32o"
Alternatively to ngtcp2 random loss, an extra haproxy patch can also be
used to force skipping the emission of the empty STREAM frame :
diff --git a/include/haproxy/quic_tx-t.h b/include/haproxy/quic_tx-t.h
index efbdfe687..1ff899acd 100644
--- a/include/haproxy/quic_tx-t.h
+++ b/include/haproxy/quic_tx-t.h
@@ -26,6 +26,8 @@ extern struct pool_head *pool_head_quic_cc_buf;
/* Flag a sent packet as being probing with old data */
#define QUIC_FL_TX_PACKET_PROBE_WITH_OLD_DATA (1UL << 5)
+#define QUIC_FL_TX_PACKET_SKIP_SENDTO (1UL << 6)
+
/* Structure to store enough information about TX QUIC packets. */
struct quic_tx_packet {
/* List entry point. */
diff --git a/src/quic_tx.c b/src/quic_tx.c
index 2f199ac3c..2702fc9b9 100644
--- a/src/quic_tx.c
+++ b/src/quic_tx.c
@@ -318,7 +318,7 @@ static int qc_send_ppkts(struct buffer *buf, struct ssl_sock_ctx *ctx)
tmpbuf.size = tmpbuf.data = dglen;
TRACE_PROTO("TX dgram", QUIC_EV_CONN_SPPKTS, qc);
- if (!skip_sendto) {
+ if (!skip_sendto && !(first_pkt->flags & QUIC_FL_TX_PACKET_SKIP_SENDTO)) {
int ret = qc_snd_buf(qc, &tmpbuf, tmpbuf.data, 0, gso);
if (ret < 0) {
if (gso && ret == -EIO) {
@@ -354,6 +354,7 @@ static int qc_send_ppkts(struct buffer *buf, struct ssl_sock_ctx *ctx)
qc->cntrs.sent_bytes_gso += ret;
}
}
+ first_pkt->flags &= ~QUIC_FL_TX_PACKET_SKIP_SENDTO;
b_del(buf, dglen + QUIC_DGRAM_HEADLEN);
qc->bytes.tx += tmpbuf.data;
@@ -2066,6 +2067,17 @@ static int qc_do_build_pkt(unsigned char *pos, const unsigned char *end,
continue;
}
+ switch (cf->type) {
+ case QUIC_FT_STREAM_8 ... QUIC_FT_STREAM_F:
+ if (!cf->stream.len && (qc->flags & QUIC_FL_CONN_TX_MUX_CONTEXT)) {
+ TRACE_USER("artificially drop packet with empty STREAM frame", QUIC_EV_CONN_TXPKT, qc);
+ pkt->flags |= QUIC_FL_TX_PACKET_SKIP_SENDTO;
+ }
+ break;
+ default:
+ break;
+ }
+
quic_tx_packet_refinc(pkt);
cf->pkt = pkt;
}
(cherry picked from commit e177cf341cf3665e340855312714fe002688b2db)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
When a STREAM frame is retransmitted, a check is performed to remove the
range of data already acked from it. This is useful when STREAM frames are
duplicated and split to cover different data ranges. The newly
retransmitted frame contains only unacked data.
This process is performed similarly in qc_dup_pkt_frms() and
qc_build_frms(). Refactor the code into a new function named
qc_stream_frm_is_acked(). It returns true if the frame data are already
fully acked and retransmission can be avoided. If only a partial range of
data is acknowledged, the frame content is updated to cover only the
unacked data.
This patch does not have any functional change. However, it simplifies
retransmission for STREAM frames. Also, it will be reused to fix
retransmission for empty STREAM frames with FIN set from the following
patch :
BUG/MEDIUM: quic: handle retransmit for standalone FIN STREAM
As such, it must be backported prior to it.
(cherry picked from commit 714009b7bcf921836c2df7fc0d26b2ad257c8307)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
qc_stream_desc had a field <release> used as a boolean. Convert it to a new
<flags> field, with QC_SD_FL_RELEASE as the equivalent value.
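In short (standalone sketch; the field layout is simplified):

  #include <stdint.h>

  #define QC_SD_FL_RELEASE 0x01u   /* replaces the old boolean <release> */

  struct qc_stream_desc_sketch {
      uint32_t flags;              /* QC_SD_FL_* values, extensible */
  };

  static inline int sd_is_released(const struct qc_stream_desc_sketch *sd)
  {
      return sd->flags & QC_SD_FL_RELEASE;
  }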
The purpose of this patch is to be able to extend qc_stream_desc by adding
new flag values. This patch is required for the following patch
BUG/MEDIUM: quic: handle retransmit for standalone FIN STREAM
As such, it must be backported prior to it.
(cherry picked from commit bb9ac256a1e5468535b8242dc762e7bb0d9a8bf3)
Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>
"option httpslog" override warning messaged used to be reported as
"option httplog", probably as a result of copy paste without adjusting
the context. Let's fix that to prevent emitting confusing warning messages
The issue exists since 98b930d ("MINOR: ssl: Define a default https log
format"), thus it should be backported up to 2.6
(cherry picked from commit 17e52c922b577e1b677098b34e47cd0a85f31e8b)
Signed-off-by: Willy Tarreau <w@1wt.eu>