iv/linux

Florian Westphal 62e7151ae3 netfilter: bridge: confirm multicast packets before passing them up the stack	2024-02-29 00:22:44 +01:00

conntrack nf_confirm logic cannot handle cloned skbs referencing the same nf_conn entry, which will happen for multicast (broadcast) frames on bridges.

Example:

    macvlan0
       |
      br0
     /  \
  ethX    ethY

ethX (or Y) receives an L2 multicast or broadcast packet containing an IP packet, flow is not yet in conntrack table.

1. skb passes through bridge and fake-ip (br_netfilter) Prerouting.
   -> skb->_nfct now references an unconfirmed entry
2. skb is broad/mcast packet. bridge now passes clones out on each bridge interface.
3. skb gets passed up the stack.
4. In macvlan case, macvlan driver retains clone(s) of the mcast skb and schedules a work queue to send them out on the lower devices.
   The clone skb->_nfct is not a copy, it is the same entry as the original skb. The macvlan rx handler then returns RX_HANDLER_PASS.
5. Normal conntrack hooks (in NF_INET_LOCAL_IN) confirm the orig skb.

The Macvlan broadcast worker and normal confirm path will race.

This race will not happen if step 2 already confirmed a clone. In that case later steps perform skb_clone() with skb->_nfct already confirmed (in hash table). This works fine.

But such confirmation won't happen when eb/ip/nftables rules dropped the packets before they reached the nf_confirm step in postrouting.

Pablo points out that nf_conntrack_bridge doesn't allow use of stateful nat, so we can safely discard the nf_conn entry and let inet call conntrack again.

This doesn't work for bridge netfilter: skb could have a nat transformation. Also bridge nf prevents re-invocation of inet prerouting via 'sabotage_in' hook.

Work around this problem by explicit confirmation of the entry at LOCAL_IN time, before upper layer has a chance to clone the unconfirmed entry.

The downside is that this disables NAT and conntrack helpers.

Alternative fix would be to add locking to all code parts that deal with unconfirmed packets, but even if that could be done in a sane way this opens up other problems, for example:

-m physdev --physdev-out eth0 -j SNAT --snat-to 1.2.3.4
-m physdev --physdev-out eth1 -j SNAT --snat-to 1.2.3.5

For multicast case, only one of such conflicting mappings will be created, conntrack only handles 1:1 NAT mappings.

Users should create a setup that explicitly marks such traffic NOTRACK (conntrack bypass) to avoid this, but we cannot auto-bypass them, ruleset might have accept rules for untracked traffic already, so user-visible behaviour would change.

Suggested-by: Pablo Neira Ayuso <pablo@netfilter.org>
Fixes: `1da177e4c3` ("Linux-2.6.12-rc2")
Closes: https://bugzilla.kernel.org/show_bug.cgi?id=217777
Signed-off-by: Florian Westphal <fw@strlen.de>
Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
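
The ordering problem in steps 4 and 5 of the commit message, and the effect of confirming at LOCAL_IN before the upper layer can clone, can be illustrated with a small userspace toy model. This is only a sketch and not the kernel code: `toy_conn`, `toy_skb`, `toy_skb_clone`, `toy_confirm` and `toy_clone_sees_unconfirmed` are hypothetical names invented here, and the model is single-threaded, so it only shows whether a retained clone still points at an unconfirmed entry, which is the precondition for the race.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for struct nf_conn: tracks only hash-table membership. */
struct toy_conn {
    bool confirmed;
};

/* Hypothetical stand-in for an skb: holds a shared pointer, like skb->_nfct. */
struct toy_skb {
    struct toy_conn *ct;
};

/* A clone shares the conntrack entry with the original; it is not a copy. */
static struct toy_skb toy_skb_clone(const struct toy_skb *skb)
{
    assert(skb != NULL);
    struct toy_skb clone = { .ct = skb->ct };
    return clone;
}

/* Confirm the entry; returns true only if this call performed the insertion. */
static bool toy_confirm(struct toy_skb *skb)
{
    if (skb->ct == NULL || skb->ct->confirmed)
        return false;
    skb->ct->confirmed = true;
    return true;
}

/*
 * Model delivery up the stack. With confirm_first == false the upper layer
 * (macvlan in the commit message) retains a clone that references an
 * unconfirmed entry, so two paths may later try to confirm it. With
 * confirm_first == true the entry is confirmed before any clone exists.
 */
static bool toy_clone_sees_unconfirmed(bool confirm_first)
{
    struct toy_conn ct = { .confirmed = false };
    struct toy_skb skb = { .ct = &ct };

    if (confirm_first)
        toy_confirm(&skb);

    struct toy_skb clone = toy_skb_clone(&skb);
    return !clone.ct->confirmed;
}
```

In this model, `toy_clone_sees_unconfirmed(false)` reports the problematic state, while `toy_clone_sees_unconfirmed(true)` does not, mirroring the "confirm before passing up the stack" approach named in the commit subject.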
..
netfilter	netfilter: bridge: confirm multicast packets before passing them up the stack	2024-02-29 00:22:44 +01:00
br_arp_nd_proxy.c	bridge: Add per-{Port, VLAN} neighbor suppression data path support	2023-04-21 08:25:50 +01:00
br_cfm_netlink.c	bridge: cfm: fix enum typo in br_cc_ccm_tx_parse	2023-12-26 22:38:13 +00:00
br_cfm.c	bridge: cfm: remove redundant return	2021-06-22 10:35:15 -07:00
br_device.c	bridge: mdb: Add MDB bulk deletion support	2023-12-20 11:27:20 +00:00
br_fdb.c	net: bridge: Track and limit dynamically learned FDB entries	2023-10-17 17:39:01 -07:00
br_forward.c	net: bridge: use DEV_STATS_INC()	2023-09-19 13:35:15 +02:00
br_if.c	net: bridge: keep ports without IFF_UNICAST_FLT in BR_PROMISC mode	2023-07-03 09:11:34 +01:00
br_input.c	bridge: mcast: Rename MDB entry get function	2023-10-27 10:51:41 +01:00
br_ioctl.c	Merge git://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf-next	2021-12-31 14:35:40 +00:00
br_mdb.c	bridge: mdb: Add MDB bulk deletion support	2023-12-20 11:27:20 +00:00
br_mrp_netlink.c	bridge: mrp: Use hlist_head instead of list_head for mrp	2020-11-09 16:42:12 -08:00
br_mrp_switchdev.c	bridge: mrp: Extend br_mrp_switchdev to detect better the errors	2021-02-16 14:47:46 -08:00
br_mrp.c	net: bridge: mrp: Update the Test frames for MRA	2021-06-28 15:46:10 -07:00
br_mst.c	net: bridge: mst: Add helper to query a port's MST state	2022-03-17 16:49:58 -07:00
br_multicast_eht.c	treewide: Convert del_timer() to timer_shutdown()	2022-12-25 13:38:09 -08:00
br_multicast.c	bridge: mcast: fix disabled snooping after long uptime	2024-01-30 18:06:56 -08:00
br_netfilter_hooks.c	netfilter: bridge: confirm multicast packets before passing them up the stack	2024-02-29 00:22:44 +01:00
br_netfilter_ipv6.c	netfilter: bridge: replace physindev with physinif in nf_bridge_info	2024-01-17 12:02:49 +01:00
br_netlink_tunnel.c	net: bridge: Set strict_start_type at two policies	2023-02-06 08:48:25 +00:00
br_netlink.c	net: bridge: Set strict_start_type for br_policy	2023-10-17 17:39:02 -07:00
br_nf_core.c	net: dst: Switch to rcuref_t reference counting	2023-03-28 18:52:28 -07:00
br_private_cfm.h	bridge: cfm: Kernel space implementation of CFM. CCM frame RX added.	2020-10-29 18:39:43 -07:00
br_private_mcast_eht.h	net: bridge: multicast: use multicast contexts instead of bridge or port	2021-07-20 05:41:19 -07:00
br_private_mrp.h	net: bridge: mrp: Update the Test frames for MRA	2021-06-28 15:46:10 -07:00
br_private_stp.h
br_private_tunnel.h	bridge: always declare tunnel functions	2023-05-17 21:28:58 -07:00
br_private.h	bridge: mcast: fix disabled snooping after long uptime	2024-01-30 18:06:56 -08:00
br_stp_bpdu.c
br_stp_if.c	net: use eth_hw_addr_set()	2021-10-02 14:18:25 +01:00
br_stp_timer.c
br_stp.c	net: bridge: mst: Multiple Spanning Tree (MST) mode	2022-03-17 16:49:57 -07:00
br_switchdev.c	net: bridge: switchdev: Ensure deferred event delivery on unoffload	2024-02-16 09:36:37 +00:00
br_sysfs_br.c	bridge: Fix flushing of dynamic FDB entries	2022-11-02 20:47:09 -07:00
br_sysfs_if.c	bridge: move from strlcpy with unused retval to strscpy	2022-08-22 17:57:30 -07:00
br_vlan_options.c	bridge: vlan: Allow setting VLAN neighbor suppression state	2023-04-21 08:25:50 +01:00
br_vlan_tunnel.c	bridge: Add backup nexthop ID support	2023-07-19 10:53:49 +01:00
br_vlan.c	bridge: vlan: Allow setting VLAN neighbor suppression state	2023-04-21 08:25:50 +01:00
br.c	net: bridge: fill in MODULE_DESCRIPTION()	2023-10-27 11:16:44 +01:00
Kconfig	bridge: cfm: Add BRIDGE_CFM to Kconfig.	2020-10-29 18:39:43 -07:00
Makefile	net: bridge: mst: Multiple Spanning Tree (MST) mode	2022-03-17 16:49:57 -07:00