summaryrefslogtreecommitdiff
path: root/stdio-common
diff options
context:
space:
mode:
authorH.J. Lu <hjl.tools@gmail.com>2017-05-23 11:25:19 -0700
committerH.J. Lu <hjl.tools@gmail.com>2017-06-08 05:07:18 -0700
commit6f6e1e2e9ab2edaaf2fa20913b195733b08cb0a9 (patch)
tree55371a838ac2858d4e68c33a1587200fbbe59f9e /stdio-common
parente0f20b5a54a803ab753f2e2cc1fce7729fa23f81 (diff)
downloadglibc-6f6e1e2e9ab2edaaf2fa20913b195733b08cb0a9.tar.gz
x86-64: Optimize memrchr with AVX2
Optimize memrchr with AVX2 to search 32 bytes with a single vector compare instruction. It is as fast as SSE2 memrchr for small data sizes and up to 1X faster for large data sizes on Haswell. Select AVX2 memrchr on AVX2 machines where vzeroupper is preferred and AVX unaligned load is fast. * sysdeps/x86_64/multiarch/Makefile (sysdep_routines): Add memrchr-sse2 and memrchr-avx2. * sysdeps/x86_64/multiarch/ifunc-impl-list.c (__libc_ifunc_impl_list): Add tests for __memrchr_avx2 and __memrchr_sse2. * sysdeps/x86_64/multiarch/memrchr-avx2.S: New file. * sysdeps/x86_64/multiarch/memrchr-sse2.S: Likewise. * sysdeps/x86_64/multiarch/memrchr.c: Likewise.
Diffstat (limited to 'stdio-common')
0 files changed, 0 insertions, 0 deletions