Martin Willi 29a47b54e0 crypto: x86/chacha20 - Add a 2-block AVX-512VL variant
This version uses the same principle as the AVX2 version. It benefits
from the AVX-512VL rotate instructions and the more efficient partial
block handling using "vmovdqu8", resulting in a speedup of ~20%.

Unlike the AVX2 version, it is faster than the single block SSSE3 version
to process a single block. Hence we engage that function for (partial)
single block lengths as well.

Signed-off-by: Martin Willi <martin@strongswan.org>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
2018-11-29 16:27:04 +08:00
..
2018-10-31 08:54:16 -07:00
2018-10-31 15:39:25 -07:00
2018-10-31 08:54:16 -07:00
2018-11-03 10:47:33 -07:00
2018-10-31 08:54:16 -07:00
2018-10-31 08:54:16 -07:00
2018-10-31 08:54:16 -07:00
2018-11-03 10:47:33 -07:00
2018-11-01 17:04:07 -07:00
2018-10-31 08:54:16 -07:00
2018-11-01 11:46:27 -07:00