iv/linux

Alexei Starovoitov 519e1de94b Merge branch 'add-internal-only-bpf-per-cpu-instruction'

Andrii Nakryiko says:

====================
Add internal-only BPF per-CPU instruction

Add a new BPF instruction for resolving per-CPU memory addresses.

The new instruction is a special form of BPF_ALU64 | BPF_MOV | BPF_X, with
insn->off set to BPF_ADDR_PERCPU (== -1). It resolves the provided per-CPU
offset to an absolute address where per-CPU data resides for "this" CPU.
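
The encoding described above can be illustrated with a small userspace model.
This is only a sketch: the opcode bit values and BPF_ADDR_PERCPU follow the
description above, while `struct model_insn`, `model_is_mov_percpu_addr()`,
`model_resolve_percpu()` and the `cpu_base` table are hypothetical names
invented for illustration and are not the kernel's implementation.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Opcode bits from the BPF instruction set encoding. */
#define BPF_ALU64 0x07
#define BPF_MOV   0xb0
#define BPF_X     0x08

/* Special offset value named in the cover letter. */
#define BPF_ADDR_PERCPU (-1)

/* Simplified stand-in for a BPF instruction (hypothetical, illustration only). */
struct model_insn {
    uint8_t code;    /* opcode */
    uint8_t dst_reg; /* destination register */
    uint8_t src_reg; /* source register */
    int16_t off;     /* signed offset; reused here as a marker */
    int32_t imm;     /* immediate, unused by this form */
};

/* True if the instruction is the per-CPU address-resolving MOV form. */
static bool model_is_mov_percpu_addr(const struct model_insn *insn)
{
    return insn->code == (BPF_ALU64 | BPF_MOV | BPF_X) &&
           insn->off == BPF_ADDR_PERCPU;
}

/*
 * Model of the resolution step: absolute address = base of this CPU's
 * per-CPU area + per-CPU offset. cpu_base[] is an assumed lookup table;
 * a real JIT would obtain the base in an architecture-specific way.
 */
static uint64_t model_resolve_percpu(uint64_t percpu_off,
                                     const uint64_t *cpu_base,
                                     unsigned int nr_cpus,
                                     unsigned int cpu)
{
    assert(cpu < nr_cpus);
    return cpu_base[cpu] + percpu_off;
}
```

In this model, an ordinary register-to-register MOV (off == 0) is not
recognized as the special form; only off == BPF_ADDR_PERCPU is.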

This patch set implements support for it in x86-64 BPF JIT only.

Using the new instruction, we also implement inlining for three cases:
  - bpf_get_smp_processor_id(), which avoids an unnecessary trivial
    function call, saving a bit of performance and also not polluting LBR
    records with unnecessary function call/return records;
  - PERCPU_ARRAY's bpf_map_lookup_elem() is completely inlined, bringing its
    performance on par with per-CPU data structures implemented using global
    variables in BPF (which is an awesome improvement, see benchmarks below);
  - PERCPU_HASH's bpf_map_lookup_elem() is partially inlined, just as is
    done for the non-PERCPU HASH map; this still saves a bit of overhead.
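
As a rough illustration of what a fully inlined PERCPU_ARRAY lookup has to
compute, here is a userspace model: bounds-check the index, fetch the
element's per-CPU offset, then add the current CPU's base. Everything in it
(`struct model_percpu_array`, `model_lookup()`, `model_arr_inc()`, the sizes
and the memory layout) is a hypothetical simplification, not the code the
verifier actually emits.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define MODEL_NR_CPUS     4 /* hypothetical CPU count */
#define MODEL_MAX_ENTRIES 8 /* hypothetical map size */

/* Each CPU gets its own area; pptrs[] holds each element's offset in it. */
struct model_percpu_array {
    uint64_t area[MODEL_NR_CPUS][MODEL_MAX_ENTRIES];
    size_t   pptrs[MODEL_MAX_ENTRIES];
};

static void model_init(struct model_percpu_array *m)
{
    memset(m->area, 0, sizeof(m->area));
    for (size_t i = 0; i < MODEL_MAX_ENTRIES; i++)
        m->pptrs[i] = i * sizeof(uint64_t);
}

/* Returns this CPU's copy of element `index`, or NULL if out of bounds. */
static uint64_t *model_lookup(struct model_percpu_array *m,
                              uint32_t index, unsigned int cpu)
{
    assert(cpu < MODEL_NR_CPUS);
    if (index >= MODEL_MAX_ENTRIES)
        return NULL;
    /* per-CPU base + per-CPU offset, mirroring the address resolution */
    unsigned char *base = (unsigned char *)m->area[cpu];
    return (uint64_t *)(base + m->pptrs[index]);
}

/* Lookup followed by an increment; returns the new value, 0 on a miss. */
static uint64_t model_arr_inc(struct model_percpu_array *m,
                              uint32_t index, unsigned int cpu)
{
    uint64_t *val = model_lookup(m, index, cpu);

    if (!val)
        return 0;
    return ++*val;
}
```

Incrementing element 3 on CPU 1 in this model leaves CPU 0's copy of the same
element untouched, which is the property per-CPU maps rely on to avoid
synchronization.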

To validate the performance benefits, I hacked together a tiny benchmark that
does only bpf_map_lookup_elem() and increments the value by 1 for PERCPU_ARRAY
(arr-inc benchmark below) and PERCPU_HASH (hash-inc benchmark below) maps. To
establish a baseline, I also implemented logic similar to PERCPU_ARRAY based
on a global variable array, using bpf_get_smp_processor_id() to index the
array for the current CPU (glob-arr-inc benchmark below).

BEFORE
======
glob-arr-inc   :  163.685 ± 0.092M/s
arr-inc        :  138.096 ± 0.160M/s
hash-inc       :   66.855 ± 0.123M/s

AFTER
=====
glob-arr-inc   :  173.921 ± 0.039M/s (+6%)
arr-inc        :  170.729 ± 0.210M/s (+23.7%)
hash-inc       :   68.673 ± 0.070M/s (+2.7%)

As can be seen, PERCPU_HASH gets a modest +2.7% improvement, while the global
array-based benchmark gets a nice +6% due to inlining of
bpf_get_smp_processor_id().

But what's really important is that the arr-inc benchmark basically catches up
with glob-arr-inc, resulting in a +23.7% improvement. This means that in
practice it won't be necessary to avoid PERCPU_ARRAY anymore if performance is
critical (e.g., high-frequency stats collection, which is often a practical use
for PERCPU_ARRAY today).
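
The relative gains quoted above can be recomputed from the BEFORE and AFTER
throughput figures. The helper below is a minimal sketch (`percent_gain()` is
a name made up here); because it works from the rounded numbers in the tables,
its result may differ from the quoted percentages in the last digit.

```c
#include <assert.h>

/* Relative change from `before` to `after`, expressed in percent. */
static double percent_gain(double before, double after)
{
    assert(before > 0.0);
    return (after - before) / before * 100.0;
}
```

For example, percent_gain(138.096, 170.729) for arr-inc comes out between 23
and 24, in line with the figure quoted above.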

v1->v2:
  - use BPF_ALU64 | BPF_MOV instruction instead of LDX (Alexei);
  - dropped the direct per-CPU memory read instruction, it can always be added
    back, if necessary;
  - guarded bpf_get_smp_processor_id() behind x86-64 check (Alexei);
  - switched all per-cpu addr casts to (unsigned long) to avoid sparse
    warnings.
====================

Link: https://lore.kernel.org/r/20240402021307.1012571-1-andrii@kernel.org
Signed-off-by: Alexei Starovoitov <ast@kernel.org>

2024-04-03 10:30:13 -07:00

arch

bpf: add special internal-only MOV instruction to resolve per-CPU addrs

2024-04-03 10:29:55 -07:00

block

vfs-6.9-rc1.fixes

2024-03-18 09:15:50 -07:00

certs

This update includes the following changes:

2023-11-02 16:15:30 -10:00

crypto

This push fixes a regression that broke iwd as well as a divide by

2024-03-25 10:48:23 -07:00

Documentation

dt-bindings: net: renesas,etheravb: Add optional MDIO bus node

2024-03-28 18:17:52 -07:00

drivers

ravb: Add support for an optional MDIO mode

2024-03-28 18:17:52 -07:00

Changes since last update:

2024-03-27 20:24:09 -07:00

include

bpf: add special internal-only MOV instruction to resolve per-CPU addrs

2024-04-03 10:29:55 -07:00

init

init: open /initrd.image with O_LARGEFILE

2024-03-26 11:07:19 -07:00

io_uring

io_uring/sqpoll: early exit thread if task_context wasn't allocated

2024-03-18 20:22:42 -06:00

ipc

sysctl changes for v6.9-rc1

2024-03-18 14:59:13 -07:00

kernel

bpf: inline bpf_map_lookup_elem() helper for PERCPU_HASH map

2024-04-03 10:29:56 -07:00

lib

hardening fixes for v6.9-rc1

2024-03-23 08:43:21 -07:00

LICENSES

LICENSES: Add the copyleft-next-0.3.1 license

2022-11-08 15:44:01 +01:00

mm: zswap: fix data loss on SWP_SYNCHRONOUS_IO devices

2024-03-26 11:14:12 -07:00

net

bpf: Remove CONFIG_X86 and CONFIG_DYNAMIC_FTRACE guard from the tcp-cc kfuncs

2024-03-28 18:31:40 -07:00

rust

Kbuild updates for v6.9

2024-03-21 14:41:00 -07:00

samples

Tracing updates for 6.9:

2024-03-18 15:11:44 -07:00

scripts

Including fixes from bpf, WiFi and netfilter.

2024-03-28 13:09:37 -07:00

security

- Kuan-Wei Chiu has developed the well-named series "lib min_heap: Min

2024-03-14 18:03:09 -07:00

sound

sound fixes #2 for 6.9-rc2

2024-03-22 09:44:19 -07:00

tools

selftests/xsk: Add new test case for AF_XDP under max ring sizes

2024-04-03 16:04:14 +02:00

usr

Kbuild updates for v6.8

2024-01-18 17:57:07 -08:00

virt

KVM Xen and pfncache changes for 6.9:

2024-03-11 10:42:55 -04:00

.clang-format

clang-format: Update with v6.7-rc4's for_each macro list

2023-12-08 23:54:38 +01:00

.cocciconfig

…

.editorconfig

Add .editorconfig file for basic formatting

2023-12-28 16:22:47 +09:00

.get_maintainer.ignore

Add Jeff Kirsher to .get_maintainer.ignore

2024-03-08 11:36:54 +00:00

.gitattributes

.gitattributes: set diff driver for Rust source code files

2023-05-31 17:48:25 +02:00

.gitignore

kbuild: create a list of all built DTB files

2024-02-19 18:20:39 +09:00

.mailmap

Including fixes from bpf, WiFi and netfilter.

2024-03-28 13:09:37 -07:00

.rustfmt.toml

rust: add .rustfmt.toml

2022-09-28 09:02:20 +02:00

COPYING

COPYING: state that all contributions really are covered by this file

2020-02-10 13:32:20 -08:00

CREDITS

Not a ton of stuff happening in the clk framework in this pull request. We got

2024-03-15 11:48:01 -07:00

Kbuild

Kbuild updates for v6.1

2022-10-10 12:00:45 -07:00

Kconfig

kbuild: ensure full rebuild when the compiler is updated

2020-05-12 13:28:33 +09:00

MAINTAINERS

Including fixes from bpf, WiFi and netfilter.

2024-03-28 13:09:37 -07:00

Makefile

Linux 6.9-rc1

2024-03-24 14:10:05 -07:00

README

README: Fix spelling

2024-03-18 03:36:32 -06:00

README

Linux kernel
============

There are several guides for kernel developers and users. These guides can
be rendered in a number of formats, like HTML and PDF. Please read
Documentation/admin-guide/README.rst first.

In order to build the documentation, use ``make htmldocs`` or
``make pdfdocs``.  The formatted documentation can also be read online at:

    https://www.kernel.org/doc/html/latest/

There are various text files in the Documentation/ subdirectory,
several of them using the reStructuredText markup notation.

Please read the Documentation/process/changes.rst file, as it contains the
requirements for building and running the kernel, and information about
the problems which may result from upgrading your kernel.

Languages

C 97.6%

Assembly 1%

Shell 0.5%

Python 0.3%

Makefile 0.3%