plan9front/sys/src/libsec/amd64
cinap_lenrek 077e719dfb libsec: write optimized _chachablock() function for amd64 / sse2
doing 4 quarterround's in parallel using 128-bit
vector registers. for second round shuffle the columns and
then shuffle back.

code is rather obvious. only trick here is for the first
quaterround PSHUFLW/PSHUFHW is used to swap the halfwords
for the <<<16 rotation.
2017-11-20 00:10:35 +01:00
..
aesni.s libsec: AES-NI support for amd64 2017-11-12 23:15:15 +01:00
chachablock.s libsec: write optimized _chachablock() function for amd64 / sse2 2017-11-20 00:10:35 +01:00
md5block.s libc and ape support for amd64 2014-02-01 10:31:41 +01:00
mkfile libsec: write optimized _chachablock() function for amd64 / sse2 2017-11-20 00:10:35 +01:00
sha1block.s libc and ape support for amd64 2014-02-01 10:31:41 +01:00