path: root/libavutil/x86
Commit message | Author | Age | Files | Lines
* x86: replace explicit REP_RETs with RETs | Lynne | 2023-02-01 | 2 | -11/+11
* x86/tx_float: fix stray change in 15xM FFT and replace imul->lea | Lynne | 2022-11-28 | 1 | -2/+2
* lavu/tx: refactor to explicitly track and convert lookup table order | Lynne | 2022-11-24 | 1 | -21/+25
* x86/tx_float: implement striding in fft_15xM | Lynne | 2022-11-24 | 1 | -16/+29
* x86/tx_float_init: properly specify the supported factors of 15xM FFTs | Lynne | 2022-11-24 | 1 | -3/+3
* x86/tx_float: add a standalone 15-point AVX2 transform | Lynne | 2022-11-24 | 2 | -0/+117
* x86/tx_float: optimize and macro out FFT15 | Lynne | 2022-11-24 | 1 | -134/+143
* lavu/fixed_dsp: add missing av_restrict qualifiers | Johannes Kauffmann | 2022-10-04 | 1 | -1/+1
* x86/tx_float: enable AVX-only split-radix FFT codelets | Lynne | 2022-09-24 | 2 | -0/+10
* x86/tx_float: fix some symbol names | James Almer | 2022-09-23 | 1 | -3/+3
* x86/tx_float: change a condition in a preprocessor check | James Almer | 2022-09-23 | 1 | -1/+1
* x86/tx_float: add missing preprocessor wrapper for AVX2 functions | James Almer | 2022-09-23 | 1 | -1/+1
* x86/tx_float: generalize iMDCT | Lynne | 2022-09-23 | 2 | -29/+40
* x86/tx_float: add 15xN PFA FFT AVX SIMD | Lynne | 2022-09-23 | 2 | -0/+348
* x86/tx_float: adjust internal ASM call ABI again | Lynne | 2022-09-23 | 1 | -20/+8
* x86/tx_float: add asm call versions of the 2pt and 4pt transforms | Lynne | 2022-09-19 | 2 | -3/+32
* x86/tx_float: fully support 128bit regs in LOAD64_LUT | Lynne | 2022-09-19 | 1 | -5/+5
* x86/tx_float: simplify and describe the intra-asm call convention | Lynne | 2022-09-19 | 1 | -13/+30
* x86/float_dsp: use three operand form for some instructions | James Almer | 2022-09-13 | 1 | -8/+8
* avutil/x86/float_dsp: add fma3 for scalarproduct | Paul B Mahol | 2022-09-13 | 2 | -0/+129
* avutil/x86/intreadwrite: Add ability to detect whether MMX code is used | Andreas Rheinhardt | 2022-09-11 | 1 | -0/+2
* x86/tx_float: add missing check for AVX2 | James Almer | 2022-09-06 | 1 | -1/+1
* x86/tx_float: set all operands for shufps | James Almer | 2022-09-06 | 1 | -2/+2
* x86/tx_float: Fix building for platforms with a symbol prefix | Martin Storsjö | 2022-09-06 | 1 | -5/+5
* x86/tx_float: implement inverse MDCT AVX2 assembly | Lynne | 2022-09-06 | 2 | -1/+216
* x86/tx_float: add support for calling assembly functions from assembly | Lynne | 2022-09-06 | 2 | -47/+138
* x86/tx_float: save a branch during coefficient deinterleaving | Lynne | 2022-08-09 | 1 | -4/+1
* avutil/x86/float_dsp: Remove obsolete 3dnowext function | Andreas Rheinhardt | 2022-06-22 | 2 | -29/+1
* avutil/x86/pixelutils: Remove obsolete MMX(EXT) functions | Andreas Rheinhardt | 2022-06-22 | 2 | -67/+0
* x86/tx_float: replace fft_sr_avx with fft_sr_fma3 | Lynne | 2022-05-21 | 2 | -9/+9
* x86/tx_float: improve temporary register allocation for loads | Lynne | 2022-05-21 | 1 | -24/+24
* x86/tx_float: add AV_CPU_FLAG_AVXSLOW/SLOW_GATHER flags where appropriate | Lynne | 2022-05-21 | 1 | -14/+21
* Revert "x86/tx_float: remove vgatherdpd usage" | Lynne | 2022-05-21 | 2 | -31/+43
* x86/tx_float: remove vgatherdpd usage | Lynne | 2022-05-20 | 2 | -43/+31
* avutil/cpu: add AVX512 Icelake flag | Wu Jianhua | 2022-03-10 | 3 | -28/+34
* Remove unnecessary libavutil/(avutil|common|internal).h inclusions | Andreas Rheinhardt | 2022-02-24 | 1 | -2/+1
* avutil/x86/emms: Don't unnecessarily include lavu/cpu.h | Andreas Rheinhardt | 2022-02-21 | 1 | -1/+4
* libavutil: include assembly with full path from source root | Alexander Kanavin | 2022-02-08 | 7 | -7/+7
* lavu/tx: refactor assembly codelet definition | Lynne | 2022-02-07 | 1 | -93/+47
* x86/tx_float: avoid redefining macros | Lynne | 2022-02-02 | 1 | -6/+6
* x86/tx_float: mark AVX2 functions as AVXSLOW | Lynne | 2022-01-29 | 1 | -2/+2
* x86/tx_float: add missing FF_TX_OUT_OF_PLACE flag to functions | Lynne | 2022-01-27 | 1 | -2/+2
* x86/tx_float: do not build tx_float_init.c if x86 assembly is disabled | Lynne | 2022-01-27 | 1 | -1/+2
* x86/tx_float: add permute-free FFT versions | Lynne | 2022-01-26 | 2 | -59/+158
* lavu/tx: rewrite internal code as a tree-based codelet constructor | Lynne | 2022-01-26 | 2 | -114/+131
* avutil/cpu: move slow gather checks below in the function | James Almer | 2021-12-21 | 1 | -14/+9
* libavutil/cpu: Add AV_CPU_FLAG_SLOW_GATHER. | Alan Kelly | 2021-12-21 | 1 | -1/+14
* x86/intmath: add VEX encoded versions of av_clipf() and av_clipd() | James Almer | 2021-11-19 | 1 | -0/+30
* libavutil/common: clip nan value to amin | Mark Reid | 2021-11-15 | 1 | -4/+4
* x86/tx_float: correctly load the transform length | Lynne | 2021-07-18 | 1 | -1/+1