summaryrefslogtreecommitdiff
path: root/libavcodec/x86/vp8dsp.asm
Commit message (Expand)AuthorAgeFilesLines
* x86: vp8dsp: Split loopfilter code into a separate fileDiego Biurrun2013-11-011-1555/+0
* dsputil: Make dsputil selectableRonald S. Bultje2013-04-101-7/+6
* Drop DCTELEM typedefDiego Biurrun2013-01-221-5/+5
* x86inc: support stack mem allocation and re-alignment in PROLOGUERonald S. Bultje2012-12-121-33/+35
* x86: mmx2 ---> mmxext in asm constructsDiego Biurrun2012-11-141-15/+15
* x86: yasm: Use complete source path for macro helper %includesDiego Biurrun2012-10-311-1/+1
* x86: include x86inc.asm in x86util.asmDiego Biurrun2012-10-311-1/+0
* x86: add colons after labelsMans Rullgard2012-08-071-15/+15
* x86inc: add SPLATB_LOAD, SPLATB_REG, PSHUFLW macrosLoren Merritt2012-07-051-21/+0
* vp8dsp x86: perform rounding shift with a single instructionChristophe GISQUET2012-04-041-10/+8
* vp8: convert mbedge loopfilter x86 assembly to use named arguments.Ronald S. Bultje2012-03-101-235/+212
* vp8: convert inner loopfilter x86 assembly to use named arguments.Ronald S. Bultje2012-03-101-187/+164
* vp8: convert simple loopfilter x86 assembly to use named arguments.Ronald S. Bultje2012-03-031-25/+30
* vp8: convert idct x86 assembly to use named arguments.Ronald S. Bultje2012-03-031-97/+103
* vp8: convert mc x86 assembly to use named arguments.Ronald S. Bultje2012-03-031-272/+272
* vp8: convert loopfilter x86 assembly to use cpuflags().Ronald S. Bultje2012-03-031-201/+158
* vp8: convert idct/mc x86 assembly to use cpuflags().Ronald S. Bultje2012-03-031-60/+68
* vp8: disable mmx functions with sse/sse2 counterparts on x86-64.Ronald S. Bultje2012-03-021-0/+15
* Fix NASM include directiveDave Yeo2011-08-151-2/+2
* Move x86util.asm from libavcodec/ to libavutil/.Ronald S. Bultje2011-08-121-1/+1
* Move x86inc.asm to libavutil/.Ronald S. Bultje2011-08-121-1/+1
* Modify x86util.asm to ease transitioning to 10-bit H.264 assembly.Daniel Kang2011-05-171-5/+5
* Fix FSF address copy paste error in some license headers.Diego Biurrun2011-05-141-1/+1
* Replace FFmpeg with Libav in licence headersMans Rullgard2011-03-191-4/+4
* Use "d" suffix for general-purpose registers used with movd.Reimar Döffinger2010-09-051-13/+13
* Mark xmm registers as clobbered in simple loopfilter. Should fix the lastRonald S. Bultje2010-08-241-11/+11
* Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures).Ronald S. Bultje2010-08-231-14/+14
* VP8: move zeroing of luma DC block into the WHTJason Garrett-Glaser2010-08-021-2/+18
* Use word-writing instead of dword-writing (with two cached but otherwiseRonald S. Bultje2010-07-311-104/+95
* Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster.Ronald S. Bultje2010-07-261-2/+78
* VP8: Much faster SSE2 MCJason Garrett-Glaser2010-07-261-88/+78
* Enable no-loop memory/register saving for ssse3/sse4 also.Ronald S. Bultje2010-07-261-2/+2
* Save a register (or regsize of stackspace for x86-32) for the no-loopRonald S. Bultje2010-07-261-16/+24
* Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. thisRonald S. Bultje2010-07-261-3/+9
* Split pextrw macro-spaghetti into several opt-specific macros, this will makeRonald S. Bultje2010-07-261-30/+49
* Fix obvious bug in assignment. Somehow, the test vectors don't test this...Ronald S. Bultje2010-07-251-1/+1
* Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so thisRonald S. Bultje2010-07-241-33/+52
* VP8: optimize DC-only chroma case in the same way as luma.Jason Garrett-Glaser2010-07-231-3/+44
* VP8 asm: cosmetics (spacing)Jason Garrett-Glaser2010-07-231-2/+2
* VP8: 30% faster idct_mbJason Garrett-Glaser2010-07-231-54/+127
* VP8: clear DCT blocks in iDCT instead of using clear_blocks.Jason Garrett-Glaser2010-07-231-4/+22
* Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles onRonald S. Bultje2010-07-221-5/+30
* Fix and enable horizontal >=SSE2 mbedge loopfilter.Ronald S. Bultje2010-07-221-2/+2
* Eliminate one instruction in VP8 dc_add_sse4Jason Garrett-Glaser2010-07-211-2/+1
* Various VP8 x86 deblocking speedupsJason Garrett-Glaser2010-07-211-32/+67
* Make mmx VP8 WHT fasterJason Garrett-Glaser2010-07-211-17/+22
* VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16)Ronald S. Bultje2010-07-201-0/+641
* Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder.Ronald S. Bultje2010-07-201-77/+131
* Revert r24339 (it causes fate failures on x86-64) - I'll figure out what'sRonald S. Bultje2010-07-191-108/+32
* Implement chroma (width=8) inner loopfilter MMX/MMX2/SSE2 functions.Ronald S. Bultje2010-07-191-32/+108