cc9f8349cb
This is a NEON acceleration method that can improve performance by approximately 20%. I got the following data from the centos 7.5 on Huawei's HISI1616 chip: [ 93.837726] xor: measuring software checksum speed [ 93.874039] 8regs : 7123.200 MB/sec [ 93.914038] 32regs : 7180.300 MB/sec [ 93.954043] arm64_neon: 9856.000 MB/sec [ 93.954047] xor: using function: arm64_neon (9856.000 MB/sec) I believe this code can bring some optimization for all arm64 platform. thanks for Ard Biesheuvel's suggestions. Signed-off-by: Jackie Liu <liuyun01@kylinos.cn> Reviewed-by: Ard Biesheuvel <ard.biesheuvel@linaro.org> Signed-off-by: Will Deacon <will.deacon@arm.com>
30 lines
687 B
Makefile
30 lines
687 B
Makefile
generic-y += bugs.h
|
|
generic-y += delay.h
|
|
generic-y += div64.h
|
|
generic-y += dma.h
|
|
generic-y += dma-contiguous.h
|
|
generic-y += early_ioremap.h
|
|
generic-y += emergency-restart.h
|
|
generic-y += hw_irq.h
|
|
generic-y += irq_regs.h
|
|
generic-y += kdebug.h
|
|
generic-y += kmap_types.h
|
|
generic-y += local.h
|
|
generic-y += local64.h
|
|
generic-y += mcs_spinlock.h
|
|
generic-y += mm-arch-hooks.h
|
|
generic-y += msi.h
|
|
generic-y += preempt.h
|
|
generic-y += qrwlock.h
|
|
generic-y += qspinlock.h
|
|
generic-y += rwsem.h
|
|
generic-y += segment.h
|
|
generic-y += serial.h
|
|
generic-y += set_memory.h
|
|
generic-y += sizes.h
|
|
generic-y += switch_to.h
|
|
generic-y += trace_clock.h
|
|
generic-y += unaligned.h
|
|
generic-y += user.h
|
|
generic-y += vga.h
|