linux

iv/linux

History

Eric Biggers e787060bdf crypto: x86/aes-xts - wire up VAES + AVX2 implementation

Add an AES-XTS implementation "xts-aes-vaes-avx2" for x86_64 CPUs with
the VAES, VPCLMULQDQ, and AVX2 extensions, but not AVX512 or AVX10.
This implementation uses ymm registers to operate on two AES blocks at a
time.  The assembly code is instantiated using a macro so that most of
the source code is shared with other implementations.

This is the optimal implementation on AMD Zen 3.  It should also be the
optimal implementation on Intel Alder Lake, which similarly supports
VAES but not AVX512.  Comparing to xts-aes-aesni-avx on Zen 3,
xts-aes-vaes-avx2 provides 70% higher AES-256-XTS decryption throughput
with 4096-byte messages, or 23% higher with 512-byte messages.

A large improvement is also seen with CPUs that do support AVX512 (e.g.,
98% higher AES-256-XTS decryption throughput on Ice Lake with 4096-byte
messages), though the following patches add AVX512 optimized
implementations to get a bit more performance on those CPUs.

Signed-off-by: Eric Biggers <ebiggers@google.com>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>

2024-04-05 15:46:33 +08:00

.gitignore

…

aegis128-aesni-asm.S

crypto: x86/aegis128 - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

aegis128-aesni-glue.c

…

aes_ctrby8_avx-x86_64.S

crypto: x86/aesni-xctr - Add accelerated implementation of XCTR

2022-06-10 16:40:17 +08:00

aes-xts-avx-x86_64.S

crypto: x86/aes-xts - wire up VAES + AVX2 implementation

2024-04-05 15:46:33 +08:00

aesni-intel_asm.S

crypto: x86/aesni - Update aesni_set_key() to return void

2024-04-02 10:49:39 +08:00

aesni-intel_avx-x86_64.S

arch/x86: Fix typos

2024-01-03 11:46:22 +01:00

aesni-intel_glue.c

crypto: x86/aes-xts - wire up VAES + AVX2 implementation

2024-04-05 15:46:33 +08:00

aria_aesni_avx2_glue.c

crypto: x86/aria-avx2 - fix build failure with old binutils

2023-01-20 18:29:32 +08:00

aria_aesni_avx_glue.c

crypto: x86/aria-avx - fix build failure with old binutils

2023-01-20 18:29:31 +08:00

aria_gfni_avx512_glue.c

crypto: x86/aria - implement aria-avx512

2023-01-06 17:15:47 +08:00

aria-aesni-avx2-asm_64.S

crypto: x86/aria - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

aria-aesni-avx-asm_64.S

crypto: x86/aria - Use 16 byte alignment for GFNI constant vectors

2023-05-24 18:10:27 +08:00

aria-avx.h

crypto: x86/aria - implement aria-avx512

2023-01-06 17:15:47 +08:00

aria-gfni-avx512-asm_64.S

crypto: x86/aria - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

blake2s-core.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

blake2s-glue.c

crypto: blake2s: remove module_init and module.h inclusion

2023-04-13 13:13:51 -07:00

blowfish_glue.c

crypto: x86/blowfish - Convert to use ECB/CBC helpers

2023-02-10 17:20:19 +08:00

blowfish-x86_64-asm_64.S

crypto: x86/blowfish - Eliminate use of SYM_TYPED_FUNC_START in asm

2023-02-10 17:20:19 +08:00

camellia_aesni_avx2_glue.c

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

camellia_aesni_avx_glue.c

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

camellia_glue.c

crypto: x86 - eliminate anonymous module_init & module_exit

2022-04-08 16:13:31 +08:00

camellia-aesni-avx2-asm_64.S

crypto: x86/camellia - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

camellia-aesni-avx-asm_64.S

crypto: x86/camellia - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

camellia-x86_64-asm_64.S

crypto: x86/camellia - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

camellia.h

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

cast5_avx_glue.c

…

cast5-avx-x86_64-asm_64.S

crypto: x86/cast5 - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

cast6_avx_glue.c

…

cast6-avx-x86_64-asm_64.S

crypto: x86/cast6 - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

chacha_glue.c

…

chacha-avx2-x86_64.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

chacha-avx512vl-x86_64.S

crypto: x86/chacha20 - Avoid spurious jumps to other functions

2022-03-25 16:21:05 +12:00

chacha-ssse3-x86_64.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

crc32-pclmul_asm.S

crypto: x86/crc32 - Use local .L symbols for code

2023-04-20 18:20:04 +08:00

crc32-pclmul_glue.c

x86: Fix various typos in comments, take #2

2021-03-21 23:50:28 +01:00

crc32c-intel_glue.c

…

crc32c-pcl-intel-asm_64.S

arch/x86: Fix typos

2024-01-03 11:46:22 +01:00

crct10dif-pcl-asm_64.S

crypto: x86/crct10dif-pcl: Remove redundant alignments

2022-10-17 16:41:01 +02:00

crct10dif-pclmul_glue.c

…

curve25519-x86_64.c

crypto: x86/curve25519 - use in/out register constraints more precisely

2021-12-24 14:18:22 +11:00

des3_ede_glue.c

crypto: x86/des3 - Remove unused inline function des3_ede_enc_blk_3way()

2022-02-23 15:28:32 +12:00

des3_ede-asm_64.S

crypto: x86/des3 - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

ecb_cbc_helpers.h

crypto: x86 - exit fpu context earlier in ECB/CBC macros

2023-02-03 12:54:54 +08:00

ghash-clmulni-intel_asm.S

crypto: x86/ghash - Use RIP-relative addressing

2023-04-20 18:20:04 +08:00

ghash-clmulni-intel_glue.c

crypto: x86/ghash - add comment and fix broken link

2022-12-30 17:57:42 +08:00

glue_helper-asm-avx2.S

…

glue_helper-asm-avx.S

…

Kconfig

crypto: x86/sm4 - Remove cfb(sm4)

2023-12-08 11:59:45 +08:00

Makefile

crypto: x86/aes-xts - add AES-XTS assembly macro for modern CPUs

2024-04-05 15:46:33 +08:00

nh-avx2-x86_64.S

crypto: x86/nhpoly1305 - eliminate unnecessary CFI wrappers

2022-11-25 17:39:19 +08:00

nh-sse2-x86_64.S

crypto: x86/nhpoly1305 - eliminate unnecessary CFI wrappers

2022-11-25 17:39:19 +08:00

nhpoly1305-avx2-glue.c

crypto: x86/nhpoly1305 - implement ->digest

2023-10-20 13:39:25 +08:00

nhpoly1305-sse2-glue.c

crypto: x86/nhpoly1305 - implement ->digest

2023-10-20 13:39:25 +08:00

poly1305_glue.c

crypto: poly1305 - fix poly1305_core_setkey() declaration

2021-04-02 18:28:12 +11:00

poly1305-x86_64-cryptogams.pl

crypto: x86/poly1305: Remove custom function alignment

2022-10-17 16:41:03 +02:00

polyval-clmulni_asm.S

crypto: x86/polyval - Add PCLMULQDQ accelerated implementation of POLYVAL

2022-06-10 16:40:17 +08:00

polyval-clmulni_glue.c

crypto: x86/polyval - Fix crashes when keys are not 16-byte aligned

2022-10-21 19:05:05 +08:00

serpent_avx2_glue.c

crypto: x86 - eliminate anonymous module_init & module_exit

2022-04-08 16:13:31 +08:00

serpent_avx_glue.c

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

serpent_sse2_glue.c

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

serpent-avx2-asm_64.S

crypto: x86/serpent: Remove redundant alignments

2022-10-17 16:41:01 +02:00

serpent-avx-x86_64-asm_64.S

crypto: x86/serpent: Remove redundant alignments

2022-10-17 16:41:01 +02:00

serpent-avx.h

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

serpent-sse2-i586-asm_32.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

serpent-sse2-x86_64-asm_64.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

serpent-sse2.h

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

sha1_avx2_x86_64_asm.S

crypto: x86/sha - Use local .L symbols for code

2023-04-20 18:20:04 +08:00

sha1_ni_asm.S

- Add the call depth tracking mitigation for Retbleed which has

2022-12-14 15:03:00 -08:00

sha1_ssse3_asm.S

crypto: x86/sha1 - fix possible crash with CFI enabled

2022-11-25 17:39:19 +08:00

sha1_ssse3_glue.c

crypto: x86/sha1 - autoload if SHA-NI detected

2023-11-17 19:16:29 +08:00

sha256_ni_asm.S

- Add the call depth tracking mitigation for Retbleed which has

2022-12-14 15:03:00 -08:00

sha256_ssse3_glue.c

crypto: x86/sha256 - autoload if SHA-NI detected

2023-11-17 19:16:29 +08:00

sha256-avx2-asm.S

crypto: x86/sha - Use local .L symbols for code

2023-04-20 18:20:04 +08:00

sha256-avx-asm.S

crypto: x86/sha - Use local .L symbols for code

2023-04-20 18:20:04 +08:00

sha256-ssse3-asm.S

crypto: x86/sha - Use local .L symbols for code

2023-04-20 18:20:04 +08:00

sha512_ssse3_glue.c

crypto: x86/sha512 - load based on CPU features

2022-08-19 18:39:39 +08:00

sha512-avx2-asm.S

crypto: x86/sha - Use local .L symbols for code

2023-04-20 18:20:04 +08:00

sha512-avx-asm.S

arch/x86: Fix typos

2024-01-03 11:46:22 +01:00

sha512-ssse3-asm.S

arch/x86: Fix typos

2024-01-03 11:46:22 +01:00

sm3_avx_glue.c

crypto: x86/sm3 - add AVX assembly implementation

2022-01-28 16:51:11 +11:00

sm3-avx-asm_64.S

- Add the call depth tracking mitigation for Retbleed which has

2022-12-14 15:03:00 -08:00

sm4_aesni_avx2_glue.c

crypto: x86/sm4 - Remove cfb(sm4)

2023-12-08 11:59:45 +08:00

sm4_aesni_avx_glue.c

crypto: x86/sm4 - Remove cfb(sm4)

2023-12-08 11:59:45 +08:00

sm4-aesni-avx2-asm_64.S

crypto: x86/sm4 - Remove cfb(sm4)

2023-12-08 11:59:45 +08:00

sm4-aesni-avx-asm_64.S

crypto: x86/sm4 - Remove cfb(sm4)

2023-12-08 11:59:45 +08:00

sm4-avx.h

crypto: x86/sm4 - Remove cfb(sm4)

2023-12-08 11:59:45 +08:00

twofish_avx_glue.c

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00

twofish_glue_3way.c

crypto: x86 - eliminate anonymous module_init & module_exit

2022-04-08 16:13:31 +08:00

twofish_glue.c

crypto: Prepare to move crypto_tfm_ctx

2022-12-02 18:12:40 +08:00

twofish-avx-x86_64-asm_64.S

crypto: twofish: Remove redundant alignments

2022-10-17 16:41:03 +02:00

twofish-i586-asm_32.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

twofish-x86_64-asm_64-3way.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

twofish-x86_64-asm_64.S

x86: Prepare asm files for straight-line-speculation

2021-12-08 12:25:37 +01:00

twofish.h

crypto: x86 - use local headers for x86 specific shared declarations

2021-01-14 17:10:30 +11:00