haproxy

Author	SHA1	Message	Date
Aurelien DARRAGON	c693da8896	DOC: lua: fix yield-dependent methods expected contexts Contrary to what the doc states, it is not expected (nor relevant) to use yield-dependent methods such as core.yield() or core.(m)sleep() from contexts that don't support yielding. Such contexts include body, init, fetches and converters. Thus the doc got it wrong since the beginning, because such methods were never supported from the above contexts, yet it was listed in the list of compatible contexts (probably the result of a copy-paste), which is error-prone because it could either cause a Lua runtime error to be thrown, or be ignored in some other cases. It should be backported to all stable versions. (cherry picked from commit 501827ebe0ad8f4121c4397267afbc7968e3d9af) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:44:48 +01:00
Christopher Faulet	b85468d1f3	DOC: config: Move fs.* and bs.* in section about L5 samples These sample fetch functions were added in the wrong section. Move them in the section about sample fetch functions at L5 layer. (cherry picked from commit e68c6852adb7051a30e209c5a0604f192182b42d) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:44:06 +01:00
Christopher Faulet	c6a4e359e3	DOC: config: Move wait_end in section about internal samples wait_end is an internal sample fetch function and not an L6 one. So move it in the corresponding section. (cherry picked from commit 4ccc3f40488bfeed93f0df7d339444fe6503ee4e) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:41:20 +01:00
Christopher Faulet	c9d735fd3f	DOC: config: Slightly improve the %Tr documentation Specify -1 can also be reported for %Tr delay when the response is invalid. (cherry picked from commit e9021a4ca1d6a70cb647441aae78ec4d35bb7c1a) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:40:14 +01:00
Christopher Faulet	9464b240ed	BUG/MINOR: http_ana: Report -1 for %Tr for invalid response only The server response time is erroneously reported as -1 when it is intercepted by HAProxy. As stated in the documentation, the server response time is reported as -1 when the last response header was never seen. It happens when a server timeout is triggered before the server managed to process the request. It also happens if the response is invalid. This may be reported by the mux during the response parsing, but also by the HTTP analyzers. However, in this last case, the response time must only be reported as -1 on 502. This patch must be backported to all stable versions. It should fix the issue #2384. (cherry picked from commit 5863d33fce702c46b77c07d4ea82e036b11417a6) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:40:07 +01:00
Christopher Faulet	598c140650	DOC: config: Fix a typo in "1.3.1. The Request line" At the beginning of the last paragraph of this section, HTTP/3 was used instead of HTTP/2. It is now fixed. (cherry picked from commit 18de419f9647ad5fe0006900e2c1587bffd49c24) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:39:52 +01:00
Christopher Faulet	453076bbcc	DOC: config: Add a space before ':' for {bs,fs}.aborted and {bs,fs}.rst_code A space was missing before the ':' for the sample fetch functions above. It was an issue for the text to HTML conversion script. So, let's fix it. (cherry picked from commit 3af2d91b3b6ebe1587bcb17f5fb223436df67253) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:39:43 +01:00
Willy Tarreau	9d74f3692b	BUG/MINOR: peers: make sure to always apply offsets to now_ms in expiration Now_ms can be zero nowadays, so it's not suitable for direct assignment to t->expire, as there's a risk that the timer never wakes up once assigned (TICK_ETERNITY). Let's use tick_add(now_ms, 0) for an immediate wakeup instead. The impact here might be a reconnect programmed upon signal receipt at the wrapping date not having a working timeout. This should be backported where it applies. (cherry picked from commit ed55ff878d5af35dae70f78023ab2141d36e5866) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:35:13 +01:00
Willy Tarreau	b8c4edbc49	BUG/MINOR: mux_quic: make sure to always apply offsets to now_ms in expiration Now_ms can be zero nowadays, so it's not suitable for direct assignment to t->expire, as there's a risk that the timer never wakes up once assigned (TICK_ETERNITY). Let's use tick_add(now_ms, 0) for an immediate wakeup instead. The impact looks null since the task is also woken up, but better not leave such tasks in the timer tree anyway. This should be backported where it applies. (cherry picked from commit f66bfcff96082ce5c98c635c5da7a9ba157a20af) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:35:10 +01:00
Willy Tarreau	4ebe6dcb31	BUG/MEDIUM: mailers: make sure to always apply offsets to now_ms in expiration Now_ms can be zero nowadays, so it's not suitable for direct assignment to t->expire, as there's a risk that the timer never wakes up once assigned (TICK_ETERNITY). Let's use tick_add(now_ms, 0) for an immediate wakeup instead. The impact here might be mailers suddenly stopping. This should be backported where it applies. (cherry picked from commit 841be4cdd15b3d0834a478cc95ebda0f47171b4d) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:35:02 +01:00
Willy Tarreau	1ad88e6a79	BUG/MEDIUM: checks: make sure to always apply offsets to now_ms in expiration Now_ms can be zero nowadays, so it's not suitable for direct assignment to t->expire, as there's a risk that the timer never wakes up once assigned (TICK_ETERNITY). Let's use tick_add(now_ms, 0) for an immediate wakeup instead. The impact here might be health checks suddenly stopping. This should be backported where it applies. (cherry picked from commit 2f287f14f355e734e512732e35aebf993d000792) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:34:36 +01:00
Christopher Faulet	7cf18ab7c8	BUG/MINOR: Don't report early srv aborts on request forwarding in DONE state L7-retries may be ignored if server aborts are detected during the request forwarding, when the request is already in DONE state. When a request was fully processed (so in HTTP_MSG_DONE state) and is waiting to be forwarded to the server, there is a test to detect server aborts, to be able to report the error. However, this test must be skipped if the response was not received yet, to let the response analyzers handle the abort. It is important to properly handle the retries. This test must only be performed if the response analysis was finished. It means the response must be at least in HTTP_MSG_BODY state. This patch should be backported as far as 2.8. (cherry picked from commit a930e99f4699676ea72f72ba1fb99c953da0d74e) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:34:08 +01:00
Christopher Faulet	1b18a4cad1	BUG/MEDIUM: mux-h2: Don't send RST_STREAM frame for streams with no ID On server side, the H2 stream is first created with an unassigned ID (ID == 0). Its ID is assigned when the request is emitted, before formatting the HEADERS frame. However, the session may be aborted during that stage. We must take care not to emit a RST_STREAM frame for this stream, because it does not exist yet for the server. It is especially important to do so because, depending on the timing, it may also happen before the H2 PREFACE was sent. This patch must be backported to all stable versions. It is related to issue (cherry picked from commit f065d0009888c394e5f93dfdaa2ae79958b2c2e2) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-22 15:33:00 +01:00
Christopher Faulet	33b0ca4440	BUG/MEDIUM: resolvers: Insert a non-executed resolution in front of the wait list When a resolver is woken up to process DNS resolutions, it is possible to trigger an infinite loop on the resolver's wait list because delayed resolutions are always reinserted at the end of this list. This leads the watchdog to kill the process. Re-inserting them in front of the list fixes the bug. When a resolver tries to send the queries for the resolutions in its wait list, it may be unable to proceed for a resolution. This may happen because the resolution must be skipped (no hostname to resolve, a resolution already in progress) or when an error occurred. In that case, the resolution is re-inserted in the resolver's wait list to be retried later, on a next wakeup. However, the resolution is inserted at the end of the wait list. So it is immediately reevaluated, in the same execution loop, instead of being delayed. Most of the time, it is not an issue because the resolution is considered as not expired on the second run. But it is a problem when the internal time wraps and is equal to 0. In that case, the resolution expiration date is badly computed and it is always considered as expired. If two or more resolutions are in that state, the resolver loops forever on its wait list, until the process is killed by the watchdog. So we can argue that the way the resolution expiration date is computed must be fixed. And it would be true in a perfect world. However, the resolvers code is so crappy that it is hard to be sure not to introduce regressions. It is far easier to re-insert delayed resolutions in front of the wait list. This fixes the issue and at worst, these resolutions will be evaluated one time too many on the next wakeup and only if now_ms was equal to 0 on the prior wakeup. This patch should be backported to all stable versions. 
On 2.2, LIST_ADD() must be used instead of LIST_INSERT() (cherry picked from commit 8f28dbeea94e11e2327362755f16d18b301fd153) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-13 10:59:44 +01:00
Valentine Krasnobaeva	e771877f82	BUG/MINOR: cli: don't show sockpairs in HAPROXY_CLI and HAPROXY_MASTER_CLI Before this fix, HAPROXY_CLI and HAPROXY_MASTER_CLI contained, along with the CLI socket addresses, internal sockpairs which are used only for the master CLI (the reload sockpair and the sockpair shared with a worker process). These internal sockpairs always need to be hidden. At the moment no client uses sockpair addresses for the stats listener or to connect to the master CLI. So, let's simply not copy these internal sockpair addresses of MASTER and GLOBAL proxy listeners. As listeners with sockpairs are skipped and they can appear in the listeners list in any order, let's add the semicolon separator between addresses only when some string is already saved in the trash and we are sure that we are adding a new address to it. Otherwise, we could have such weird output: HAPROXY_MASTER_CLI=unix@/tmp/mcli.sock;; This fix needs to be backported to all stable versions. (cherry picked from commit 113745e6f0c0ef8fe89e89fdfdcc6ed994889d4a) [cf: ctx adjt] Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-13 10:59:22 +01:00
Amaury Denoyelle	51a13e6905	BUG/MEDIUM: quic: prevent crash due to CRYPTO parsing error A packet which contains several split and out-of-order CRYPTO frames may be parsed multiple times to ensure it can be handled via ncbuf. Only 3 iterations can be performed to prevent excessive CPU usage. There is a risk of crash if packet parsing is interrupted after the maximum number of iterations is reached, or no progress can be made on the ncbuf. This is because <frm> may be dangling after list_for_each_entry_safe(). The crash occurs on qc_frm_free() invocation, on the error path of qc_parse_pkt_frms(). To fix it, always reset frm to NULL after list_for_each_entry_safe() to ensure it is not dangling. This should fix the new report on github issue #2776. This regression has been triggered by the following patch : 1767196d5b2d8d1e557f7b3911a940000166ecda BUG/MINOR: quic: repeat packet parsing to deal with fragmented CRYPTO As such, it must be backported up to 2.6, after the above patch. (cherry picked from commit 2975e8805d9e84010bf5199a2365d650923dbb2c) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-13 10:57:20 +01:00
Amaury Denoyelle	85fa6d5b77	BUG/MINOR: guid/server: ensure thread-safety on GUID insert/delete Since 3.0, it is possible to assign a GUID to proxies, listeners and servers. These objects are stored in a global tree guid_tree. Proxies and listeners are static. However, servers may be added or deleted at runtime, which implies that guid_tree must be protected. Fix this by declaring a read-write lock to protect tree access. For now, only guid_insert() and guid_remove() are protected using a write lock. Outside of these, the GUID tree is not accessed at runtime. If server CLI commands are extended to support GUID as server identifier, the lookup operation should be extended with a read lock protection. Note that during stat-file preloading, the GUID tree is accessed for lookup. However, as it is performed on startup which is single threaded, there is no need for a lock here. A BUG_ON() has been added to ensure this precondition remains true. This bug could cause a segfault when using dynamic servers with GUID. However, it has never been reproduced so far. This must be backported up to 3.0. To avoid a conflict issue, the previous cleanup patch can be merged before it. (cherry picked from commit 8e0e7d9d1af5b2dfec2e625d2c19dd034c36eb04) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-13 10:57:15 +01:00
Amaury Denoyelle	d68329f014	CLEANUP: guid: remove global tree export guid_tree is not directly used outside of functions provided by the guid module. Remove its export from the include file. (cherry picked from commit b70880cdc9c01602197fd124c84ab264f6b4ddfb) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-13 10:57:10 +01:00
Amaury Denoyelle	f3bddfa8eb	BUG/MINOR: quic: repeat packet parsing to deal with fragmented CRYPTO A ClientHello may be split across several different CRYPTO frames, then mixed in a single QUIC packet. This is used notably by clients such as chrome to render the first Initial packet opaque to middleboxes. Each packet frame is handled sequentially. Out-of-order CRYPTO frames are buffered in a ncbuf, until gaps are filled and data is transferred to the SSL stack. If CRYPTO frames are heavily split into small fragments, buffering may fail as ncbuf does not support small gaps. This causes the whole packet to be rejected and unacknowledged. It could be solved if the client reemits its ClientHello after remixing its CRYPTO frames. This patch is written to improve CRYPTO frame parsing. Each CRYPTO frame which cannot be buffered due to ncbuf limitation is now stored in a temporary list. Packet parsing is completed until all frames have been handled. If the temporary list is not empty, reparsing is done on the stored frames. With the newly buffered CRYPTO frames, the ncbuf insert operation may this time succeed if the frame now covers a whole gap. Reparsing will loop until either no progress can be made or it has been done at least 3 times, to prevent excessive CPU utilization. This patch should fix github issue #2776. This should be backported up to 2.6, after a period of observation. Note that it relies on the following refactor patches : MINOR: quic: extend return value of CRYPTO parsing MINOR: quic: use dynamically allocated frame on parsing MINOR: quic: simplify qc_parse_pkt_frms() return path (cherry picked from commit 1767196d5b2d8d1e557f7b3911a940000166ecda) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Amaury Denoyelle	19c4b37c9f	MINOR: quic: extend return value of CRYPTO parsing qc_handle_crypto_frm() is the function used to handle a newly received CRYPTO frame. Change its API to use a newly dedicated return type. This allows reporting whether the frame was properly handled, ignored because it was already parsed previously, or rejected after a fatal error. This commit does not have any functional changes. However, it allows simplifying the qc_handle_crypto_frm() API by removing <fast_retrans> as output parameter. Also, this patch will be necessary to support multiple iterations of packet parsing for CRYPTO frames. (cherry picked from commit d65e782c8cd2f8554404dd1424e2d64f3786edb1) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Amaury Denoyelle	67aa5ae0e5	MINOR: quic: use dynamically allocated frame on parsing qc_parse_pkt_frms() is the function responsible for parsing a received QUIC packet. The payload is decoded and split into individual frames which are then handled individually. Previously, the frame was allocated locally on the stack. Change this to work on a dynamically allocated frame. This commit does not bring any functional changes. However, it will be useful to extend packet parsing. In particular, it will be necessary to save some frames during parsing to reparse them after the others. (cherry picked from commit 190fc97606560568bf4a611d92c1e70aed057843) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Amaury Denoyelle	9c41bc6d2a	MINOR: quic: simplify qc_parse_pkt_frms() return path Change qc_parse_pkt_frms() return path for normal and error cases. Most notably, it allows to remove local variable ret as now return value is hardcoded on normal and err label. This also allows to define a different trace for error leaving code. (cherry picked from commit 498a99a84956535a9ce2a61cb908d0fc81165606) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Amaury Denoyelle	05658956ae	BUG/MEDIUM: quic: support wait-for-handshake wait-for-handshake http-request action was completely ineffective with QUIC protocol. This commit implements its support for QUIC. QUIC MUX layer is extended to support wait-for-handshake. A new function qcc_handle_wait_for_hs() is executed during qcc_io_process(). It detects if MUX processing occurs after underlying QUIC handshake completion. If this is the case, it indicates that early data may be received. As such, connection is flagged with CO_FL_EARLY_SSL_HS, which is necessary to block stream processing on wait-for-handshake action. After this, qcc subscribes on quic_conn layer for RECV notification. This is used to detect QUIC handshake completion. Thus, qcc_handle_wait_for_hs() can be reexecuted one last time, to remove CO_FL_EARLY_SSL_HS and notify every stream flagged as SE_FL_WAIT_FOR_HS. This patch must be backported up to 2.6, after a mandatory period of observation. Note that it relies on the backport of the two previous patches : - MINOR: quic: notify connection layer on handshake completion - BUG/MINOR: stream: unblock stream on wait-for-handshake completion (cherry picked from commit 0918c41ef63964a986c627d20b8a1324de639cc2) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Amaury Denoyelle	35dbd3ea0f	BUG/MINOR: stream: unblock stream on wait-for-handshake completion wait-for-handshake is an http-request action which makes it possible to delay the processing of content received as TLS early data. The action yields as long as connection handshake is in progress. In the meantime, stconn is flagged with SE_FL_WAIT_FOR_HS. When the handshake is finished, MUX layer is responsible for waking up SE_FL_WAIT_FOR_HS flagged stconn instances to restart the stream processing. On sc_conn_process(), SE_FL_WAIT_FOR_HS flag is removed and stream layer is woken up. However, the stream may remain blocked after MUX notification. sc_conn_recv() may return 0 due to no new data reception, which prevents sc_conn_process() execution. The stream is thus blocked until its timeout. To fix this, check in sc_conn_recv() for the handshake termination condition. If true, explicitly return 1 to ensure sc_conn_process() will be executed. Note that this bug is not reproducible due to various conditions related to early data implementation in haproxy. Indeed, connection layer instantiation is always delayed until SSL handshake completion, which prevents the handling of early data as expected. This fix will be necessary to implement wait-for-handshake support for QUIC. As such, it must be backported with the next commit up to 2.6, after a mandatory period of observation. (cherry picked from commit 73031e81cdd5cf5ba889ed4c676a4ae6284f5cf6) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Amaury Denoyelle	f45ea9b8d9	MINOR: quic: notify connection layer on handshake completion Wake up connection layer on QUIC handshake completion via quic_conn_io_cb. Select SUB_RETRY_RECV as this was previously unused by QUIC MUX layer. For the moment, QUIC MUX never subscribes for handshake completion. However, this will be necessary for features such as the delaying of early data forwarding via wait-for-handshake. This patch will be necessary to implement wait-for-handshake support for QUIC. As such, it must be backported with next commits up to 2.6, after a mandatory period of observation. (cherry picked from commit 5a5950e42d7060ee311e51438f4f16ad0effefd9) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:11 +01:00
Aurelien DARRAGON	690ee88577	BUG/MEDIUM: pattern: prevent uninitialized reads in pat_match_{str,beg} Using valgrind when running map_beg or map_str, the following error is reported: ==242644== Conditional jump or move depends on uninitialised value(s) ==242644== at 0x2E4AB1: pat_match_str (pattern.c:457) ==242644== by 0x2E81ED: pattern_exec_match (pattern.c:2560) ==242644== by 0x343176: sample_conv_map (map.c:211) ==242644== by 0x27522F: sample_process_cnv (sample.c:1330) ==242644== by 0x2752DB: sample_process (sample.c:1373) ==242644== by 0x319917: action_store (vars.c:814) ==242644== by 0x24D451: http_req_get_intercept_rule (http_ana.c:2697) In fact, the error is legit, because in pat_match_{beg,str}, we dereference the buffer on len+1 to check if a value was previously set, and then decide to force NULL-byte if it wasn't set. But the approach is no longer compatible with current architecture: data past str.data is not guaranteed to be initialized in the buffer. Thus we cannot dereference the value, else we expose ourselves to uninitialized read errors. Moreover, the check is useless, because we systematically set the ending byte to 0 when the conditions are met. Finally, restoring the older value after the lookup is not relevant: indeed, either the sample is marked as const and in such case it is already duplicated, or the sample is not const and we forcefully add a terminating NULL byte outside of the actual string bytes (since we're past str.data). As we didn't alter the effective string data, and data past str.data cannot be dereferenced anyway as it isn't guaranteed to be initialized, there's no point in restoring previous uninitialized data. It could be backported in all stable versions. But since this was only detected by valgrind and isn't known to cause issues in existing deployments, it's probably better to wait a bit before backporting it to avoid any breakage, although the fix should be theoretically harmless. 
(cherry picked from commit 8157c1caf26618d77b32be7906e4b608a8c0729b) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-08 15:54:10 +01:00
Christopher Faulet	c2c009086d	[RELEASE] Released version 3.0.6 Released version 3.0.6 with the following main changes : - MINOR: connection: No longer include stconn type header in connection-t.h - BUG/MINOR: h1: do not forward h2c upgrade header token - BUG/MINOR: h2: reject extended connect for h2c protocol - MINOR: mux-h1: Set EOI on SE during demux when both side are in DONE state - BUG/MEDIUM: mux-h1/mux-h2: Reject upgrades with payload on H2 side only - REGTESTS: h1/h2: Update script testing H1/H2 protocol upgrades - REGTESTS: shorten a bit the delay for the h1/h2 upgrade test - BUG/MINOR: mux-quic: report glitches to session - BUG/MEDIUM: cli: Be sure to catch immediate client abort - BUG/MEDIUM: cli: Deadlock when setting frontend maxconn - BUG/MINOR: server: make sure the HMAINT state is part of MAINT - BUG/MINOR: cfgparse-global: fix allowed args number for setenv - BUILD: tools: only include execinfo.h for the real backtrace() function - MINOR: tools: do not attempt to use backtrace() on linux without glibc - MINOR: task: define two new one-shot events for use with WOKEN_OTHER or MSG - BUG/MEDIUM: stream: make stream_shutdown() async-safe - BUG/MINOR: queue: make sure that maintenance redispatches server queue - MINOR: server: make srv_shutdown_sessions() call pendconn_redistribute() - BUG/MEDIUM: queue: always dequeue the backend when redistributing the last server - BUG/MINOR: mux-h1: Fix condition to set EOI on SE during zero-copy forwarding - BUG/MINOR: http-ana: Disable fast-fwd for unfinished req waiting for upgrade - MINOR: debug: make mark_tainted() return the previous value - MINOR: chunk: drop the global thread_dump_buffer - MINOR: debug: split ha_thread_dump() in two parts - MINOR: debug: slightly change the thread_dump_pointer signification - MINOR: debug: make ha_thread_dump_done() take the pointer to be used - MINOR: debug: replace ha_thread_dump() with its two components - MEDIUM: debug: on panic, make the target thread automatically 
allocate its buf - BUG/MEDIUM: server: server stuck in maintenance after FQDN change - BUG/MEDIUM: hlua: make hlua_ctx_renew() safe - BUG/MEDIUM: hlua: properly handle sample func errors in hlua_run_sample_{fetch,conv}() - BUG/MEDIUM: mux-quic: ensure timeout server is active for short requests - BUG/MEDIUM: queue: make sure never to queue when there's no more served conns - BUG/MINOR: httpclient: return NULL when no proxy available during httpclient_new() - BUG/MEDIUM: stconn: Wait iobuf is empty to shut SE down during a check send - BUG/MINOR: http-ana: Don't report a server abort if response payload is invalid - BUG/MEDIUM: stconn: Check FF data of SC to perform a shutdown in sc_notify() - BUG/MAJOR: filters/htx: Add a flag to state the payload is altered by a filter - REGTESTS: Never reuse server connection in http-messaging/truncated.vtc - BUG/MINOR: quic: avoid leaking post handshake frames - BUG/MEDIUM: quic: avoid freezing 0RTT connections - DOC: config: fix rfc7239 forwarded typo in desc - BUG/MINOR: mworker: fix mworker-max-reloads parser - BUG/MINOR: mux-quic: do not close STREAM with empty FIN if no data sent - BUG/MEDIUM: stats-html: Never dump more data than expected during 0-copy FF - BUG/MEDIUM: mux-h2: Remove H2S from send list if data are sent via 0-copy FF - BUG/MEDIUM: connection/http-reuse: fix address collision on unhandled address families - MINOR: activity/memprofile: always return "other" bin on NULL return address - MINOR: activity/memprofile: show per-DSO stats - BUG/MINOR: server: fix dynamic server leak with check on failed init - BUG/MEDIUM: stconn: Report blocked send if sends are blocked by an error - BUG/MINOR: http-ana: Fix wrong client abort reports during responses forwarding - BUG/MINOR: stconn: Don't disable 0-copy FF if EOS was reported on consumer side - BUG/MEDIUM: server: fix race on servers_list during server deletion - BUILD: debug: silence a build warning with threads disabled - MINOR: pools: export the pools variable - 
MINOR: debug: place a magic pattern at the beginning of post_mortem - MINOR: debug: place the post_mortem struct in its own section. - MINOR: debug: store important pointers in post_mortem - MINOR: cli: remove non-printable characters from 'debug dev fd' - BUG/MINOR: trace: stop rewriting argv with -dt - BUG/MINOR: ssl/cli: 'set ssl cert' does not check the transaction name correctly - DOC: config: add missing glitch_{cnt,rate} data types - DOC: config: add missing glitch_{cnt,rate} sample definitions - BUG/MEDIUM: mux-h1: Fix how timeouts are applied on H1 connections - BUG/MINOR: http-ana: Report internal error if an action yields on a final eval - MINOR: stream: Save last evaluated rule on invalid yield - BUG/MEDIUM: promex: Fix dump of extra counters - DOC: config: document connection error 44 (reverse connect failure) - CLEANUP: connection: properly name the CO_ER_SSL_FATAL enum entry - BUG/MINOR: quic: fix malformed probing packet building - MINOR: cli/debug: show dev: add cmdline and version - MINOR: stream/stats: Expose the current number of streams in stats - MINOR: stream/stats: Expose the total number of streams ever created in stats - BUG/MINOR: stats: Fix the name for the total number of streams created - MINOR: connection: add more connection error codes to cover common errno - MINOR: rawsock: set connection error codes when returning from recv/send/splice - MINOR: connection: add new sample fetch functions fc_err_name and bc_err_name - MINOR: debug: print gdb hints when crashing - MINOR: debug: do not limit backtraces to stuck threads - MINOR: debug: also add a pointer to struct global to post_mortem - MINOR: debug: also add fdtab and acitvity to struct post_mortem - MINOR: debug: remove the redundant process.thread_info array from post_mortem - MINOR: wdt: move the local timers to a struct - MINOR: debug: add a function to dump a stuck thread - DEBUG: wdt: better detect apparently locked up threads and warn about them - DEBUG: cli: make it possible 
for "debug dev loop" to trigger warnings - DEBUG: wdt: make the blocked traffic warning delay configurable - DEBUG: wdt: add a stats counter "BlockedTrafficWarnings" in show info - BUILD: debug: also declare strlen() in __ABORT_NOW() - BUILD: Missing inclusion header for ssize_t type - MINOR: debug: move the "recover now" warn message after the optional notes	2024-11-07 17:32:22 +01:00
Willy Tarreau	31d93dad1e	MINOR: debug: move the "recover now" warn message after the optional notes At the end of the too long processing warning added by commit 0950778b3a ("MINOR: debug: add a function to dump a stuck thread"), there can be some optional notes about lua and memory trimming. However it's a bit awkward that they appear after the "trying to recover now" message. Let's just move that message after the notes. (cherry picked from commit 5dcf2012fc035e790c118590a12240e0769fbcaa) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-07 07:58:26 +01:00
Frederic Lecaille	5c0e150b00	BUILD: Missing inclusion header for ssize_t type Compilation issue detected as follows by gcc: In file included from src/ncbuf.c:19: src/ncbuf.c: In function 'ncb_write_off': include/haproxy/bug.h:144:10: error: unknown type name 'ssize_t' 144 \| extern ssize_t write(int, const void *, size_t); \ (cherry picked from commit bc9821fd26b3a118415f579cdfa6e430b03f96da) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:12:33 +01:00
Willy Tarreau	efef36866e	BUILD: debug: also declare strlen() in __ABORT_NOW() Previous commit 8f204fa8ae ("MINOR: debug: print gdb hints when crashing") broke on the CI where strlen() isn't known. Let's forward-declare it in the __ABORT_NOW() functions, just like write(). No backport is needed. (cherry picked from commit 2d27c80288c0acee85326c0574ed70d0b2e486ef) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:12:27 +01:00
Willy Tarreau	ad732f17fc	DEBUG: wdt: add a stats counter "BlockedTrafficWarnings" in show info Every time a warning is issued about traffic being blocked, let's increment a global counter so that we can check for this situation in "show info". (cherry picked from commit 84dd05e7d83eeee4e7b8c64dc656cdd608c78806) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:55 +01:00
Willy Tarreau	5904fe57bc	DEBUG: wdt: make the blocked traffic warning delay configurable The new global "warn-blocked-traffic-after" allows one to configure after how much time a warning should be emitted when traffic is blocked. (cherry picked from commit 6127e5a4e9722c1b47f5a9810fd41892b675557b) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:55 +01:00
Willy Tarreau	650f633d44	DEBUG: cli: make it possible for "debug dev loop" to trigger warnings A new argument "warn" allows to force the emission of a warning while stuck in the loop by making the internal state inconsistent. (cherry picked from commit 7337c422247b7af342048cfd48ac0aa2a4b7335e) [wt: backported only to help testing the watchdog backports] Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	a44922fb10	DEBUG: wdt: better detect apparently locked up threads and warn about them In order to help users detect when threads are behaving abnormally, let's try to emit a warning when one is no longer making any progress. This will allow to catch faulty situations more accurately, instead of occasionally triggering just after the long task. It will also let users know that there is something wrong with their configuration, and inspect the call trace to figure whether they're using excessively long rules or Lua for example (the usual warnings about lua-load vs lua-load-per-thread are still reported). The warning will only be emitted for threads not yet marked as stuck so as not to interfere with panic dumps and avoid sending a warning just before a panic. A tainted flag is set when this happens however (0x2000). (cherry picked from commit 148eb5875fb7e6c46c0a9eac486dcb7b3bca931d) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	80ea59459c	MINOR: debug: add a function to dump a stuck thread There's currently no way to just emit a warning informing that a thread is stuck without crashing. This is a problem because sometimes users would benefit from this info to clean up their configuration (e.g. abuse of map_regm, lua-load etc). This commit adds a new function ha_stuck_warning() that will emit a warning indicating that the designated thread has been stuck for XX milliseconds, with a number of streams blocked, and will make that thread dump its own state. The warning will then be sent to stderr, along with some reminders about the impacts of such situations to encourage users to fix their configuration. In order not to disrupt operations, a local 4kB buffer is allocated in the stack. This should be quite sufficient. For now the function is not used. (cherry picked from commit 0950778b3a13fe31ff83223827d6692076cba5e5) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	e50dc3bd87	MINOR: wdt: move the local timers to a struct Better have a local struct for per-thread timers, as this will allow us to store extra info that are useful to improve accurate reporting. (cherry picked from commit 3f4d646849a253f3dc15972e40023495725efe98) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	a2da8ef7ff	MINOR: debug: remove the redundant process.thread_info array from post_mortem That one is huge and unneeded since we now have the pointer to the whole thread_info[] array, which does contain the freshest version of these info and many more. Let's just get rid of it entirely. (cherry picked from commit 52240680f1d98cc7eb1e762a04becaf54660e96b) [wt: adjusted ctx in feed_post_mortem_late()] Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	068b4a20c0	MINOR: debug: also add fdtab and acitvity to struct post_mortem These ones are often used as well when trying to analyse sequences of events, let's add them. (cherry picked from commit da5cf52173853bcacb12c6ebb045fe395d4b3ba6) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	7f09a7a935	MINOR: debug: also add a pointer to struct global to post_mortem The pointer to struct global is also an important element to have in post_mortem given that it's used a lot to take decisions in the code. Let's just add it. It's worth noting that we could get rid of argc/argv at this point since they're also present in the global struct, but they don't cost much there anyway. (cherry picked from commit 2f04ebe14aca91f4a0fafcd03a0f310d98d97aaf) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	c984817bb8	MINOR: debug: do not limit backtraces to stuck threads Historically for size limitation reasons, we would only dump the backtrace of stuck threads. The problem is that when triggering a panic or other reasons, we have no backtrace, which effectively limits it to the watchdog timer. It's also visible in "show threads" which used to report backtraces for all threads in 2.4 and displays none nowadays, making its use much more limited. A first approach could be to just dump the thread that triggers the panic (in addition to stuck threads). But that remains quite limited since "show threads" would still display nothing. This patch takes a better approach consisting in dumping all non-idle threads. This way the output is less polluted than with the older approach (no need to dump all those waiting in the poller), and all active threads are visible, in panics as well as in "show threads". As such, the CLI command "debug dev panic" now dumps backtraces again. This is already a benefit which will ease testing of various locations against the ability to resolve useful symbols. (cherry picked from commit 4adb2d864d7e3ca9df1e39beabf7b2ffa5aee35c) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	96847724af	MINOR: debug: print gdb hints when crashing To make bug reporting easier for users, when crashing, let's suggest what to do. Typically when a BUG_ON() matches, only the current thread is useful the vast majority of the time, while when the watchdog triggers, all threads are interesting. The messages are printed at the end after the dump. We may adjust these with wiki links in the future if more detailed instructions are relevant. (cherry picked from commit 8f204fa8aeadef3faea4471ba9cfd93d9d168960) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	2913ab11dc	MINOR: connection: add new sample fetch functions fc_err_name and bc_err_name These functions return a symbolic error code such as ECONNRESET to keep logs compact while making them human-readable. It's a good alternative to the numeric code in that it's more expressive, and a good one to the full message since it's shorter and more precise (some codes even match errno names). The doc was updated so that the symbolic names appear in the table. It could be useful to backport this feature to help with troubleshooting some issues, though backporting the doc might possibly be more annoying in case users have local patches already, so maybe the table update does not need to be backported in this case. (cherry picked from commit 601b34fe7bd50c733a437f26817580bbd56c8d56) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	3b36ac5726	MINOR: rawsock: set connection error codes when returning from recv/send/splice For a long time the errno values returned by recv/send/splice() were not translated to connection error codes. There are not that many eligible and having them would help a lot when debugging some complex issues where logs disagree with network traces. Let's add them now. (cherry picked from commit 822d82caf4165f0f6da681737c7e3db17d01f599) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Willy Tarreau	6200536920	MINOR: connection: add more connection error codes to cover common errno While we get reports of connection setup errors in fc_err/bc_err, we don't have the equivalent for the recv/send/splice syscalls. Let's add provisions for new codes that cover the common errno values that recv/send/splice can return, i.e. ECONNREFUSED, ENOMEM, EBADF, EFAULT, EINVAL, ENOTCONN, ENOTSOCK, ENOBUFS, EPIPE. We also add a special case for when the poller reported the error itself. It's worth noting that EBADF/EFAULT/EINVAL will generally indicate serious bugs in the code and should not be reported. The only thing is that it's quite hard to forcefully (and reliably) trigger these errors in automated tests as the timing is critical. Using iptables to manually reset established connections in the middle of large transfers at least permits to see some ECONNRESET and/or EPIPE, but the other ones are harder to trigger. (cherry picked from commit 00c383ff65c6378327382d2c055f66efb098498d) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 19:04:38 +01:00
Christopher Faulet	aa35557e76	BUG/MINOR: stats: Fix the name for the total number of streams created Because of a copy/paste error, CurrStreams was reused by mistake. It should be "CumStreams" No backports needed. (cherry picked from commit 131b877565db423930909f0c26f25e000cbd6e3b) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 18:59:58 +01:00
Christopher Faulet	acc009f882	MINOR: stream/stats: Expose the total number of streams ever created in stats A shared counter is added in the thread context to track the total number of streams created on the thread. This number is then reported in stats. It will be a useful information to diagnose some bugs. (cherry picked from commit 273d322b6fa8117423bbdc9b818002563d4fd3a3) [wt: ctx adj in tinfo-t] Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 18:59:58 +01:00
Christopher Faulet	fb9c53581b	MINOR: stream/stats: Expose the current number of streams in stats A shared counter is added in the thread context to track the current number of streams. This number is then reported in stats. It will be a useful information to diagnose some bugs. (cherry picked from commit 18ee22ff766bd7399947af3be2b512ac5827b3c8) [wt: adj ctx in tinfo-t] Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 18:57:42 +01:00
Valentine Krasnobaeva	5ca7eb5e84	MINOR: cli/debug: show dev: add cmdline and version 'show dev' command is very convenient to obtain haproxy debugging information, while process is run in container. Let's extend its output with version and cmdline. cmdline is useful in a way, as it shows absolute binary path and its arguments, because sometimes the person, who is debugging failing container is not the same, who has created and deployed it. argc and argv are stored in the exported global structure, because feed_post_mortem() is added as a post check function callback in the post_check_list. So we can't simply change the signature of feed_post_mortem(), without breaking other post check callbacks APIs. Parsers are not supposed to modify argv, so we can safely bypass its pointer to debug_parse_cli_show_dev(), without copying all argument stings somewhere in the heap or on stack. (cherry picked from commit 0d79c9bedfa564e3c032c1e910c29949f5133d91) Signed-off-by: Willy Tarreau <w@1wt.eu>	2024-11-06 18:57:42 +01:00
Frederic Lecaille	4655bd1e64	BUG/MINOR: quic: fix malformed probing packet building This bug arrived with this commit: cdfceb10a MINOR: quic: refactor qc_prep_pkts() loop which prevents haproxy from sending PING only packets/datagrams (some packets/datagrams with only PING frame as ack-eliciting frames inside). Such packets/datagrams are useful in rare cases during retransmissions when one wants to probe the peer without exceeding the anti-amplification limit. Modify the condition passed to qc_build_pkt() to add padding to the current datagram. One does not want to do that when probing the peer without ack-eliciting frames passed as <frms> parameter. Indeed qc_build_pkt() calls qc_do_build_pkt() which supports this case: if <probe> is true (probing required), qc_do_build_pkt() handles the case where some padding must be added to a PING only packet/datagram. This is the case when probing with an empty <frms> frame list of ack-eliciting frames without exceeding the anti-amplification limit from qc_dgrams_retransmit(). Add some comments to qc_build_pkt() and qc_do_build_pkt() to clarify this as this code is easy to break! Thank you for @Tristan971 for having reported this issue in GH #2709. Must be backported to 3.0. (cherry picked from commit 217e467e89d15f3c22e11fe144458afbf718c8a8) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-06 15:55:11 +01:00
Willy Tarreau	c91c678b12	CLEANUP: connection: properly name the CO_ER_SSL_FATAL enum entry It was the only one prefixed with "CO_ERR_", making it harder to batch process and to look up. It was added in 2.5 by commit `61944f7a73` ("MINOR: ssl: Set connection error code in case of SSL read or write fatal failure") so it can be backported as far as 2.6 if needed to help integrate other patches. (cherry picked from commit 393957908bf492ff6660fba239106f0da7988fe8) Signed-off-by: Christopher Faulet <cfaulet@haproxy.com>	2024-11-06 15:53:56 +01:00

22744 Commits