linux

iv/linux

History

David S. Miller 20eb08b2b0 mlx5-updates-2019-04-22

This series includes updates to mlx5e driver RX data path and some
 significant XDP RX/TX improvements to overcome/mitigate HW and PCIE
 bottlenecks.
 
 From Tariq:
 1) Some Enhancements in rq->flags
 2) Stabilize RX packet rate (on Striding RQ) with
 multiple outstanding UMR posts
 In this patch, we add support for multiple outstanding UMR posts,
  to allow faster gap closure between consuming MPWQEs and reposting
 them back into the WQ.
 
 Performance test:
 As expected, huge improvement in large-scale (48 cores).
 
 xdp_redirect_map, 64B UDP multi-stream.
 Redirect from ConnectX-5 100Gbps to ConnectX-6 100Gbps.
 CPU: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz.
 
 Before: Unstable, 7 to 30 Mpps
 After:  Stable,   at 70.5 Mpps
 
 From Shay:
 3) XDP, Inline small packets into the TX MPWQE in XDP xmit flow
 
 Upon high packet rate with multiple CPUs TX workloads, much of the HCA's
 resources are spent on prefetching TX descriptors, thus affecting
 transmission rates.
 This patch comes to mitigate this problem by moving some workload to the
 CPU and reducing the HW data prefetch overhead for small packets (<= 256B).
 
 When forwarding packets with XDP, a packet that is smaller
 than a certain size (set to ~256 bytes) would be sent inline within
 its WQE TX descrptor (mem-copied), when the hardware tx queue is congested
 beyond a pre-defined water-mark.
 
 Performance:
     Tested packet rate for UDP 64Byte multi-stream
     over two dual port ConnectX-5 100Gbps NICs.
     CPU: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz
 
     * Tested with hyper-threading disabled
 
     XDP_TX:
 
     |          | before | after   |       |
     | 24 rings | 51Mpps | 116Mpps | +126% |
     | 1 ring   | 12Mpps | 12Mpps  | same  |
 
     XDP_REDIRECT:
 
     ** Below is the transmit rate, not the redirection rate
     which might be larger, and is not affected by this patch.
 
     |          | before  | after   |      |
     | 32 rings | 64Mpps  | 92Mpps  | +43% |
     | 1 ring   | 6.4Mpps | 6.4Mpps | same |
 
 As we can see, feature significantly improves scaling, without
 hurting single ring performance.
 
 From Maxim:
 4) Some trivial refactoring and code improvements prior to a larger series
 to support AF_XDP.
 
 -Saeed.
 -----BEGIN PGP SIGNATURE-----
 
 iQEcBAABAgAGBQJcv2LjAAoJEEg/ir3gV/o+90gIAI8+4lwkXZAVk4mxf9PMjxuB
 bQiKd80e++26sgrNHCyuWZnIzTQqYAnUJ3WRC+Kk1pFTo1O23A+fvweT8m1dqAvP
 Z/5ktfbAeF3fwOVu7aGu9vh4zJEWJj8oO+I+G+OaOe2iV7FVTTFnWHxiiCfungAW
 oUnXozq4vERSQLechqqgz6nACxOPgEOCJrp4T9lDYSbqZizHgFttmInMQguq/7KS
 LvITcNu3EF5l4y2LxwCFiKRgGc2y/belU63AK+2pQUXhH46kQPEHdncdLg5d9QYA
 xJwthn697qxS0PIP5oHPHNVN+qJXfuUHVonXqVOAJebGQnV82of6+sPweRxwh1s=
 =MfAR
 -----END PGP SIGNATURE-----

Merge tag 'mlx5-updates-2019-04-22' of git://git.kernel.org/pub/scm/linux/kernel/git/saeed/linux

Saeed Mahameed says:

====================
mlx5-updates-2019-04-22

This series includes updates to mlx5e driver RX data path and some
significant XDP RX/TX improvements to overcome/mitigate HW and PCIE
bottlenecks.

From Tariq:
1) Some Enhancements in rq->flags
2) Stabilize RX packet rate (on Striding RQ) with
multiple outstanding UMR posts
In this patch, we add support for multiple outstanding UMR posts,
 to allow faster gap closure between consuming MPWQEs and reposting
them back into the WQ.

Performance test:
As expected, huge improvement in large-scale (48 cores).

xdp_redirect_map, 64B UDP multi-stream.
Redirect from ConnectX-5 100Gbps to ConnectX-6 100Gbps.
CPU: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz.

Before: Unstable, 7 to 30 Mpps
After:  Stable,   at 70.5 Mpps

From Shay:
3) XDP, Inline small packets into the TX MPWQE in XDP xmit flow

Upon high packet rate with multiple CPUs TX workloads, much of the HCA's
resources are spent on prefetching TX descriptors, thus affecting
transmission rates.
This patch comes to mitigate this problem by moving some workload to the
CPU and reducing the HW data prefetch overhead for small packets (<= 256B).

When forwarding packets with XDP, a packet that is smaller
than a certain size (set to ~256 bytes) would be sent inline within
its WQE TX descrptor (mem-copied), when the hardware tx queue is congested
beyond a pre-defined water-mark.

Performance:
    Tested packet rate for UDP 64Byte multi-stream
    over two dual port ConnectX-5 100Gbps NICs.
    CPU: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz

    * Tested with hyper-threading disabled

    XDP_TX:

    |          | before | after   |       |
    | 24 rings | 51Mpps | 116Mpps | +126% |
    | 1 ring   | 12Mpps | 12Mpps  | same  |

    XDP_REDIRECT:

    ** Below is the transmit rate, not the redirection rate
    which might be larger, and is not affected by this patch.

    |          | before  | after   |      |
    | 32 rings | 64Mpps  | 92Mpps  | +43% |
    | 1 ring   | 6.4Mpps | 6.4Mpps | same |

As we can see, feature significantly improves scaling, without
hurting single ring performance.

From Maxim:
4) Some trivial refactoring and code improvements prior to a larger series
to support AF_XDP.
====================

Acked-by: Jakub Kicinski <jakub.kicinski@netronome.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

2019-04-23 17:03:40 -07:00

accessibility

…

acpi

libnvdimm fixes v5.1-rc6

2019-04-15 16:48:51 -07:00

amba

ARM: 8836/1: drivers: amba: Update component matching to use the CoreSight UCI values.

2019-02-26 11:23:49 +00:00

android

binder: fix race between munmap() and direct reclaim

2019-03-21 06:51:32 +01:00

ata

libata: fix using DMA buffers on stack

2019-03-28 08:16:04 -06:00

atm

atm: iphase: fix misuse of %x

2019-04-21 10:37:26 -07:00

auxdisplay

auxdisplay: charlcd: make backlight initial state configurable

2019-03-17 08:48:45 +01:00

base

Device properties framework fix for 5.1-rc2

2019-03-22 12:08:52 -07:00

bcma

…

block

Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net

2019-04-17 11:26:25 -07:00

bluetooth

Bluetooth: btusb: request wake pin with NOAUTOEN

2019-04-09 17:38:24 -10:00

bus

ARM: SoC driver updates for 5.1

2019-03-06 09:41:12 -08:00

cdrom

cdrom: Fix race condition in cdrom_sysctl_register

2019-02-08 06:46:59 -07:00

char

ipmi: fix sleep-in-atomic in free_user at cleanup SRCU user->release_barrier

2019-04-17 10:29:27 -05:00

clk

clk: imx: Fix PLL_1416X not rounding rates

2019-04-12 14:21:43 -07:00

clocksource

clocksource/drivers/clps711x: Remove board support

2019-03-24 11:30:11 +01:00

connector

connector: fix unsafe usage of ->real_parent

2019-03-08 15:06:38 -08:00

cpufreq

cpufreq/intel_pstate: Load only on Intel hardware

2019-04-01 23:39:23 +02:00

cpuidle

cpuidle: governor: Add new governors to cpuidle_governors again

2019-03-12 23:46:55 +01:00

crypto

crypto: caam - fix copy of next buffer for xcbc and cmac

2019-03-28 13:54:32 +08:00

dax

device-dax for 5.1

2019-03-16 13:05:32 -07:00

dca

…

devfreq

…

dio

…

dma

dmaengine: stm32-mdma: Revert "dmaengine: stm32-mdma: Add a check on read_u32_array"

2019-03-25 21:56:54 +05:30

dma-buf

…

edac

Merge branch 'ras-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2019-03-08 09:11:39 -08:00

eisa

…

extcon

extcon: ptn5150: Fix return value check in ptn5150_i2c_probe()

2019-02-11 17:21:38 +09:00

firewire

…

firmware

memblock: drop memblock_alloc_*_nopanic() variants

2019-03-12 10:04:02 -07:00

fmc

…

fpga

Merge 5.0-rc6 into char-misc-next

2019-02-11 09:05:58 +01:00

fsi

…

gnss

gnss: add driver for mediatek receivers

2019-02-15 16:54:38 +01:00

gpio

gpio fixes for v5.1-rc3

2019-03-29 03:04:47 +01:00

gpu

- Revert back to max link rate and lane count on eDP.

2019-04-12 13:39:32 +10:00

hid

HID: input: add mapping for Assistant key

2019-04-03 13:33:25 +02:00

hsi

HSI: omap_ssi_port: fix debugfs_simple_attr.cocci warnings

2019-02-14 12:36:21 +01:00

Char/Misc driver patches for 5.1-rc1

2019-03-06 14:18:59 -08:00

hwmon

hwmon: (ntc_thermistor) Fix temperature type reporting

2019-03-29 09:51:44 -07:00

hwspinlock

…

hwtracing

ARM updates for 5.1-rc1

2019-03-15 14:37:46 -07:00

i2c

i2c: imx: don't leak the i2c adapter on error

2019-04-06 17:54:28 +02:00

i3c

- Add a /* fall-through */ comment in the dw-i3c-master driver

2019-03-04 19:05:02 -08:00

ide

Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/ide

2019-03-11 09:34:00 -07:00

idle

intel_idle: add support for Jacobsville

2019-02-15 10:49:14 +01:00

iio

- New Drivers

2019-03-08 10:02:58 -08:00

infiniband

Linux 5.1-rc1

2019-04-22 15:25:39 -07:00

input

Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input

2019-03-11 10:57:11 -07:00

interconnect

…

iommu

iommu/amd: Set exclusion range correctly

2019-04-12 12:59:45 +02:00

ipack

…

irqchip

irqchip/irq-ls1x: Missing error code in ls1x_intc_of_init()

2019-04-05 14:37:56 +02:00

isdn

Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net

2019-04-17 11:26:25 -07:00

leds

leds: trigger: netdev: use memcpy in device_name_store

2019-03-30 19:09:32 +01:00

lightnvm

lightnvm: pblk: fix crash in pblk_end_partial_read due to multipage bvecs

2019-04-10 12:17:01 -06:00

macintosh

treewide: add checks for the return value of memblock_alloc*()

2019-03-12 10:04:02 -07:00

mailbox

mailbox: imx: keep MU irq working during suspend/resume

2019-03-11 02:51:43 -05:00

mcb

…

dm integrity: fix deadlock with overlapping I/O

2019-04-05 18:49:08 -04:00

media

bpf: add map helper functions push, pop, peek in more BPF programs

2019-04-16 10:24:02 +02:00

memory

…

memstick

…

message

…

mfd

mfd: sun6i-prcm: Allow to compile with COMPILE_TEST

2019-04-03 08:38:07 +01:00

misc

5.1 Merge Window Pull Request

2019-03-09 15:53:03 -08:00

mmc

mmc: sdhci-omap: Don't finish_mrq() on a command error during tuning

2019-04-11 12:40:32 +02:00

mtd

mtd: cfi: fix deadloop in cfi_cmdset_0002.c do_write_buffer

2019-04-05 00:39:19 +02:00

mux

…

net

mlx5-updates-2019-04-22

2019-04-23 17:03:40 -07:00

nfc

…

ntb

Fixes for switchtec debugability and mapping table entries, NTB

2019-03-15 14:32:59 -07:00

nubus

…

nvdimm

libnvdimm/pmem: fix a possible OOB access when read and write pmem

2019-04-07 14:36:04 -07:00

nvme

nvmet: fix discover log page when offsets are used

2019-04-11 17:28:30 +02:00

nvmem

Char/Misc driver patches for 5.1-rc1

2019-03-06 14:18:59 -08:00

of: fix kmemleak crash caused by imbalance in early memory reservation

2019-03-12 10:04:02 -07:00

opp

PM / OPP: Update performance state when freq == old_freq

2019-03-12 09:45:56 +01:00

oprofile

…

parisc

Revert: parisc: Use F_EXTEND() macro in iosapic code

2019-04-06 19:07:55 +02:00

parport

Revert "parport: daisy: use new parport device model"

2019-03-25 14:49:00 -07:00

pci

PCI: pciehp: Ignore Link State Changes after powering off a slot

2019-04-10 16:06:43 -05:00

pcmcia

…

perf

arm64 updates for 5.1:

2019-03-10 10:17:23 -07:00

phy

phy: sun4i-usb: Support set_mode to USB_HOST for non-OTG PHYs

2019-03-26 16:48:55 +09:00

pinctrl

This is the bulk of pin control changes for the v5.1 kernel cycle.

2019-03-11 11:12:50 -07:00

platform

Here's more than a handful of clk driver fixes for changes that came in

2019-04-13 14:33:56 -07:00

pnp

ACPI/ACPICA: Trivial: fix spelling mistakes and fix whitespace formatting

2019-02-24 21:12:01 +01:00

power

power: reset: at91-reset: add support for sam9x60 SoC

2019-02-20 00:41:01 +01:00

powercap

powercap/intel_rapl: add Ice Lake mobile

2019-02-18 11:31:39 +01:00

pps

…

ps3

…

ptp

Merge branch 'timers-2038-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2019-03-05 14:08:26 -08:00

pwm

pwm: atmel: Remove useless symbolic definitions

2019-03-04 12:52:49 +01:00

rapidio

rapidio/mport_cdev: mark expected switch fall-through

2019-03-07 18:32:02 -08:00

ras

…

regulator

regulator: mc13xxx: Constify regulator_ops variables

2019-03-04 00:01:08 +00:00

remoteproc

remoteproc updates for v5.1

2019-03-14 09:00:06 -07:00

reset

reset: meson-audio-arb: Fix missing .owner setting of reset_controller_dev

2019-03-25 16:22:10 +01:00

rpmsg

rpmsg: virtio: change header file sort style

2019-02-20 21:15:54 -08:00

rtc

rtc: da9063: set uie_unsupported when relevant

2019-04-02 23:33:09 +02:00

s390

s390/qeth: stop/wake TX queues based on their fill level

2019-04-17 10:33:59 -07:00

sbus

…

scsi

for-linus-20190412

2019-04-13 16:23:16 -07:00

sfi

…

siox

…

slimbus

…

soc

This pull request brings in a build fix for arm64 with bcm2835

2019-03-18 10:31:24 -07:00

soundwire

…

spi

pci-v5.1-changes

2019-03-09 14:57:08 -08:00

spmi

spmi: pmic-arb: select IRQ_DOMAIN_HIERARCHY in Kconfig

2019-02-14 09:14:50 +01:00

ssb

…

staging

Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net

2019-04-05 14:14:19 -07:00

target

Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net

2019-03-27 17:37:58 -07:00

…

tee

ARM: SoC driver updates for 5.1

2019-03-06 09:41:12 -08:00

thermal

Merge branches 'fixes' and 'thermal-intel' into next

2019-03-18 22:37:44 +08:00

thunderbolt

…

tty

tty: fix NULL pointer issue when tty_port ops is not set

2019-03-28 01:21:21 +09:00

uio

…

usb

USB-serial fixes for 5.1-rc3

2019-03-29 15:31:16 +01:00

uwb

…

vfio

vfio/type1: Limit DMA mappings per container

2019-04-03 12:43:05 -06:00

vhost

vhost: reject zero size iova range

2019-04-10 22:45:38 -07:00

video

fbdev changes for v5.1:

2019-03-15 14:22:59 -07:00

virt

virt: vbox: Implement passing requestor info to the host for VirtualBox 6.0.x

2019-03-28 01:55:18 +09:00

virtio

virtio: Honour 'may_reduce_num' in vring_create_virtqueue

2019-04-08 17:05:52 -04:00

visorbus

…

vlynq

…

vme

…

watchdog

linux-watchdog 5.1-rc1 tag

2019-03-11 11:22:15 -07:00

xen

xen: fixes for 5.1-rc4

2019-04-07 06:12:10 -10:00

zorro

…

Kconfig

…

Makefile

IOMMU Updates for Linux v5.1

2019-03-10 12:29:52 -07:00