linux

iv/linux

History

Maciej Fijalkowski 10397994d3 libbpf: xsk: Use bpf_link

Currently, if there are multiple xdpsock instances running on a single
interface and in case one of the instances is terminated, the rest of
them are left in an inoperable state due to the fact of unloaded XDP
prog from interface.

Consider the scenario below:

// load xdp prog and xskmap and add entry to xskmap at idx 10
$ sudo ./xdpsock -i ens801f0 -t -q 10

// add entry to xskmap at idx 11
$ sudo ./xdpsock -i ens801f0 -t -q 11

terminate one of the processes and another one is unable to work due to
the fact that the XDP prog was unloaded from interface.

To address that, step away from setting bpf prog in favour of bpf_link.
This means that refcounting of BPF resources will be done automatically
by bpf_link itself.

Provide backward compatibility by checking if underlying system is
bpf_link capable. Do this by looking up/creating bpf_link on loopback
device. If it failed in any way, stick with netlink-based XDP prog.
therwise, use bpf_link-based logic.

When setting up BPF resources during xsk socket creation, check whether
bpf_link for a given ifindex already exists via set of calls to
bpf_link_get_next_id -> bpf_link_get_fd_by_id -> bpf_obj_get_info_by_fd
and comparing the ifindexes from bpf_link and xsk socket.

For case where resources exist but they are not AF_XDP related, bail out
and ask user to remove existing prog and then retry.

Lastly, do a bit of refactoring within __xsk_setup_xdp_prog and pull out
existing code branches based on prog_id value onto separate functions
that are responsible for resource initialization if prog_id was 0 and
for lookup existing resources for non-zero prog_id as that implies that
XDP program is present on the underlying net device. This in turn makes
it easier to follow, especially the teardown part of both branches.

Signed-off-by: Maciej Fijalkowski <maciej.fijalkowski@intel.com>
Signed-off-by: Alexei Starovoitov <ast@kernel.org>
Link: https://lore.kernel.org/bpf/20210329224316.17793-7-maciej.fijalkowski@intel.com

2021-03-30 09:24:38 -07:00

.gitignore

libbpf: Stop using feature-detection Makefiles

2021-02-04 01:22:00 +01:00

bpf_core_read.h

libbpf: Clarify kernel type use with USER variants of CORE reading macros

2021-01-08 14:06:03 -08:00

bpf_endian.h

libbpf: Make bpf_endian co-exist with vmlinux.h

2020-07-01 09:06:12 +02:00

bpf_helpers.h

libbpf: provide NULL and KERNEL_VERSION macros in bpf_helpers.h

2021-03-17 18:48:05 -07:00

bpf_prog_linfo.c

libbpf: Centralize poisoning and poison reallocarray()

2020-08-18 18:38:25 -07:00

bpf_tracing.h

libbpf: Switch tracing and CO-RE helper macros to bpf_probe_read_kernel()

2020-08-18 17:16:15 -07:00

bpf.c

libbpf: Support attachment of BPF tracing programs to kernel modules

2020-12-03 17:38:21 -08:00

bpf.h

libbpf: Add support for freplace attachment in bpf_link_create

2020-09-29 13:09:24 -07:00

btf_dump.c

Merge https://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf-next

2021-03-25 16:30:46 -07:00

btf.c

libbpf: Add generic BTF type shallow copy API

2021-03-18 16:14:22 -07:00

btf.h

libbpf: Add explicit padding to btf_dump_emit_type_decl_opts

2021-03-19 14:03:39 -07:00

Build

libbpf: Add BPF static linker APIs

2021-03-18 16:14:22 -07:00

hashmap.c

libbpf: Centralize poisoning and poison reallocarray()

2020-08-18 18:38:25 -07:00

hashmap.h

libbpf, hashmap: Fix undefined behavior in hash_bits

2020-11-02 23:33:51 +01:00

libbpf_common.h

libbpf: Implement generalized .BTF.ext func/line info adjustment

2020-09-03 17:14:40 -07:00

libbpf_errno.c

libbpf: Poison kernel-only integer types

2020-01-10 10:38:00 -08:00

libbpf_internal.h

libbpf: Add BPF static linker APIs

2021-03-18 16:14:22 -07:00

libbpf_probes.c

libbpf: Add support for task local storage

2020-11-06 08:08:37 -08:00

libbpf.c

libbpf: Support extern kernel function

2021-03-26 20:41:51 -07:00

libbpf.h

libbpf: Add bpf object kern_version attribute setter

2021-03-25 19:23:27 -07:00

libbpf.map

libbpf: Add bpf object kern_version attribute setter

2021-03-25 19:23:27 -07:00

libbpf.pc.template

libbpf: Add zlib as a dependency in pkg-config template

2019-12-16 14:55:29 -08:00

linker.c

libbpf: Fix memory leak when emitting final btf_ext

2021-03-30 07:38:36 -07:00

Makefile

Merge https://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf-next

2021-03-25 16:30:46 -07:00

netlink.c

libbpf: Use SOCK_CLOEXEC when opening the netlink socket

2021-03-18 00:50:21 +01:00

nlattr.c

libbpf: Centralize poisoning and poison reallocarray()

2020-08-18 18:38:25 -07:00

nlattr.h

libbpf: relicense libbpf as LGPL-2.1 OR BSD-2-Clause

2018-10-08 10:09:48 +02:00

README.rst

libbpf: add perf_buffer_ prefix to README

2019-07-08 15:35:43 +02:00

ringbuf.c

Merge https://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf-next

2020-12-14 15:34:36 -08:00

str_error.c

libbpf: Poison kernel-only integer types

2020-01-10 10:38:00 -08:00

str_error.h

libbpf: relicense libbpf as LGPL-2.1 OR BSD-2-Clause

2018-10-08 10:09:48 +02:00

strset.c

libbpf: Extract internal set-of-strings datastructure APIs

2021-03-18 16:14:22 -07:00

strset.h

libbpf: Extract internal set-of-strings datastructure APIs

2021-03-18 16:14:22 -07:00

xsk.c

libbpf: xsk: Use bpf_link

2021-03-30 09:24:38 -07:00

xsk.h

libbpf: xsk: Move barriers from libbpf_util.h to xsk.h

2021-03-10 13:45:16 -08:00

README.rst

.. SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause)

libbpf API naming convention
============================

libbpf API provides access to a few logically separated groups of
functions and types. Every group has its own naming convention
described here. It's recommended to follow these conventions whenever a
new function or type is added to keep libbpf API clean and consistent.

All types and functions provided by libbpf API should have one of the
following prefixes: ``bpf_``, ``btf_``, ``libbpf_``, ``xsk_``,
``perf_buffer_``.

System call wrappers
--------------------

System call wrappers are simple wrappers for commands supported by
sys_bpf system call. These wrappers should go to ``bpf.h`` header file
and map one-on-one to corresponding commands.

For example ``bpf_map_lookup_elem`` wraps ``BPF_MAP_LOOKUP_ELEM``
command of sys_bpf, ``bpf_prog_attach`` wraps ``BPF_PROG_ATTACH``, etc.

Objects
-------

Another class of types and functions provided by libbpf API is "objects"
and functions to work with them. Objects are high-level abstractions
such as BPF program or BPF map. They're represented by corresponding
structures such as ``struct bpf_object``, ``struct bpf_program``,
``struct bpf_map``, etc.

Structures are forward declared and access to their fields should be
provided via corresponding getters and setters rather than directly.

These objects are associated with corresponding parts of ELF object that
contains compiled BPF programs.

For example ``struct bpf_object`` represents ELF object itself created
from an ELF file or from a buffer, ``struct bpf_program`` represents a
program in ELF object and ``struct bpf_map`` is a map.

Functions that work with an object have names built from object name,
double underscore and part that describes function purpose.

For example ``bpf_object__open`` consists of the name of corresponding
object, ``bpf_object``, double underscore and ``open`` that defines the
purpose of the function to open ELF file and create ``bpf_object`` from
it.

Another example: ``bpf_program__load`` is named for corresponding
object, ``bpf_program``, that is separated from other part of the name
by double underscore.

All objects and corresponding functions other than BTF related should go
to ``libbpf.h``. BTF types and functions should go to ``btf.h``.

Auxiliary functions
-------------------

Auxiliary functions and types that don't fit well in any of categories
described above should have ``libbpf_`` prefix, e.g.
``libbpf_get_error`` or ``libbpf_prog_type_by_name``.

AF_XDP functions
-------------------

AF_XDP functions should have an ``xsk_`` prefix, e.g.
``xsk_umem__get_data`` or ``xsk_umem__create``. The interface consists
of both low-level ring access functions and high-level configuration
functions. These can be mixed and matched. Note that these functions
are not reentrant for performance reasons.

Please take a look at Documentation/networking/af_xdp.rst in the Linux
kernel source tree on how to use XDP sockets and for some common
mistakes in case you do not get any traffic up to user space.

libbpf ABI
==========

libbpf can be both linked statically or used as DSO. To avoid possible
conflicts with other libraries an application is linked with, all
non-static libbpf symbols should have one of the prefixes mentioned in
API documentation above. See API naming convention to choose the right
name for a new symbol.

Symbol visibility
-----------------

libbpf follow the model when all global symbols have visibility "hidden"
by default and to make a symbol visible it has to be explicitly
attributed with ``LIBBPF_API`` macro. For example:

.. code-block:: c

        LIBBPF_API int bpf_prog_get_fd_by_id(__u32 id);

This prevents from accidentally exporting a symbol, that is not supposed
to be a part of ABI what, in turn, improves both libbpf developer- and
user-experiences.

ABI versionning
---------------

To make future ABI extensions possible libbpf ABI is versioned.
Versioning is implemented by ``libbpf.map`` version script that is
passed to linker.

Version name is ``LIBBPF_`` prefix + three-component numeric version,
starting from ``0.0.1``.

Every time ABI is being changed, e.g. because a new symbol is added or
semantic of existing symbol is changed, ABI version should be bumped.
This bump in ABI version is at most once per kernel development cycle.

For example, if current state of ``libbpf.map`` is:

.. code-block::
        LIBBPF_0.0.1 {
        	global:
                        bpf_func_a;
                        bpf_func_b;
        	local:
        		\*;
        };

, and a new symbol ``bpf_func_c`` is being introduced, then
``libbpf.map`` should be changed like this:

.. code-block::
        LIBBPF_0.0.1 {
        	global:
                        bpf_func_a;
                        bpf_func_b;
        	local:
        		\*;
        };
        LIBBPF_0.0.2 {
                global:
                        bpf_func_c;
        } LIBBPF_0.0.1;

, where new version ``LIBBPF_0.0.2`` depends on the previous
``LIBBPF_0.0.1``.

Format of version script and ways to handle ABI changes, including
incompatible ones, described in details in [1].

Stand-alone build
=================

Under https://github.com/libbpf/libbpf there is a (semi-)automated
mirror of the mainline's version of libbpf for a stand-alone build.

However, all changes to libbpf's code base must be upstreamed through
the mainline kernel tree.

License
=======

libbpf is dual-licensed under LGPL 2.1 and BSD 2-Clause.

Links
=====

[1] https://www.akkadia.org/drepper/dsohowto.pdf
    (Chapter 3. Maintaining APIs and ABIs).