#ffmpeg-devel on 2025-03-13 — irc logs at libera.catirclogs.org

2025-03-03 01:04 michaelni changed the topic of #ffmpeg-devel to: Welcome to the FFmpeg development channel | Questions about using FFmpeg or developing with libav* libs should be asked in #ffmpeg | This channel is publicly logged | FFmpeg 7.1.1 has been released! | Please read ffmpeg.org/developer.html#Code-of-conduct

00:04 ^Neo_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

00:07 ^Neo has joined #ffmpeg-devel

00:07 ^Neo has quit [Changing host]

00:13 <fflogger> [editedticket] Aleksoid1978: Ticket #11505 ([avcodec] Cuvid decoders do not work with CUDA hwaccel anymore) updated https://trac.ffmpeg.org/ticket/11505#comment:7

00:13 minimal has quit [Quit: Leaving]

00:16 <fflogger> [editedticket] jamrial: Ticket #11490 ([avformat] [Regression] Audio silent for long MOV file) updated https://trac.ffmpeg.org/ticket/11490#comment:7

00:49 <cone-049> ffmpeg Timo Rothenpieler master:4c7d0f88f507: avcodec/Makefile: remove redundant object

00:49 <cone-049> ffmpeg Timo Rothenpieler master:fed6612415c9: avcodec/cuviddec: use pre-existing chroma format information

00:50 <fflogger> [editedticket] Timo Rothenpieler <timo@rothenpieler.org>: Ticket #11505 ([avcodec] Cuvid decoders do not work with CUDA hwaccel anymore) updated https://trac.ffmpeg.org/ticket/11505#comment:8

01:02 <fflogger> [editedticket] bermond: Ticket #11505 ([avcodec] Cuvid decoders do not work with CUDA hwaccel anymore) updated https://trac.ffmpeg.org/ticket/11505#comment:9

01:05 <fflogger> [editedticket] Wallboy: Ticket #11503 ([avcodec] AC-3 downmix levels defaulting to 1.414 with recent decoder changes) updated https://trac.ffmpeg.org/ticket/11503#comment:4

01:21 <haasn> ramiro: wget https://0x0.st/8SdV.c -O swsbench.c && gcc swsbench.c -O3 -mavx2 `pkg-config --cflags --libs libavutil` -o swsbench && ./swsbench

01:21 <haasn> Gave all the crazy ideas a try

01:22 <haasn> I think based on this I want to use hybrid array/vector CPS approach; since the same code works for both; that way we will get the (close to optimal) vector performance on GCC while falling back to still-decent array code on other compilers

01:22 <haasn> (the two relevant lines in the benchmark are "vector" and "pointer")

01:23 <haasn> sadly clang shits the bed for all of these approaches

01:23 <haasn> #tomorrow I will have to take a look at what happens when we start going up to float sized elements though

01:23 <haasn> before committing to any of this

01:25 <haasn> but I'm hesitantly optimistic that we can do something like #if VECTOR_SIZE >= sizeof(float[SWS_CHUNK_SIZE]) use f32vec_t #else use float[SWS_CHUNK_SIZE];

01:25 <haasn> to not sacrifice too much performance when using floats

01:26 <haasn> (as opposed to spilling the f32vec_t all over the stack)

01:27 <haasn> what I really like about the CPS approach is it is very flexible; because we can also call the tail multiple times it can even deal with changing element sizes

01:27 <haasn> changing chunk sizes rather; e.g. if you want to embed a 2x upscale _inside_ a pipeline

01:27 <haasn> so we're not forced to go via memory when scaling

01:28 <cone-049> ffmpeg Andreas Rheinhardt master:3e19e5062c42: avcodec/decode: Move is_open check to avcodec_receive_frame()

01:28 <cone-049> ffmpeg Andreas Rheinhardt master:47d7c6cd1571: avcodec/codec_internal: Add dedicated is_decoder flag to FFCodec

01:28 <cone-049> ffmpeg Andreas Rheinhardt master:c8be309719df: avcodec/codec_internal: Add inlined version of av_codec_is_(de|en)coder

01:28 <cone-049> ffmpeg Andreas Rheinhardt master:bfbceb7d554f: avcodec/tests/avcodec: Silence deprecation warnings

01:28 <cone-049> ffmpeg Andreas Rheinhardt master:ed1b76cdb79c: avcodec/allcodecs: Don't wrap supported_framerates

01:28 <cone-049> ffmpeg Andreas Rheinhardt master:958c46800e68: avcodec/mjpegenc: Reconstify mjpeg encoder

01:30 <haasn> what I don't like about the CPS approach is that it slightly hinders our ability to swap out unaligned versions of the read/write callbacks for the edge case, but that's a minor thing to work around by just patching the cps ops list before calling into it

01:32 <haasn> what I also don't like is the two levels of indirection for *priv but given that we need to pass the global ctx (for image pointers) and the per-op ctx (for cps) we're a bit short on registers; it's a double indirection one way or the other

01:32 <haasn> and *priv is considerably less useful; in theory we could stick some extra data inside the per-op context to allow implementations to store up to e.g. 64 bits without needing to load a pointer

01:51 thilo has quit [Ping timeout: 244 seconds]

01:52 ^Neo has quit [Ping timeout: 252 seconds]

01:53 thilo has joined #ffmpeg-devel

01:53 thilo has quit [Changing host]

01:53 thilo has joined #ffmpeg-devel

01:54 ^Neo has joined #ffmpeg-devel

01:54 Mirarora has quit [Quit: Mirarora encountered a fatal error and needs to close]

02:13 pross has joined #ffmpeg-devel

02:35 Mirarora has joined #ffmpeg-devel

02:43 ^Neo has quit [Ping timeout: 246 seconds]

02:54 <Lynne> do we have anything in the code that produces a software YUV frame but where a single buffer in AVFrame holds all planes?

02:54 <Lynne> by default we allocate one buffer per plane

02:55 <Lynne> just want to know if its worth having a fast path for the case where someone packs all planes in a single buffer

03:02 <jamrial> Lynne: av_frame_get_buffer()

03:02 <Lynne> huh, I was sure we allocated one buffer per plane

03:19 <toots5446> Lynne: do you like the new ogg patch series? I think it's got your last recommendations in it!

03:20 <Lynne> the files still need to be added to fate before it can be pushed

03:20 <toots5446> okay! Any way I can help with that?

03:22 <jamrial> Lynne: lavc's get_buffer2() callback does

03:22 <Lynne> cool

03:22 <Lynne> thanks

03:22 <jamrial> the default one, at least

03:23 <jamrial> as in, it allocates one buffer per plane, unlike av_frame_get_buffer()

03:24 <Lynne> ah

03:27 jamrial has quit []

03:27 ukn_unknown has joined #ffmpeg-devel

04:28 cone-049 has quit [Quit: transmission timeout]

04:34 Kei_N has quit [Read error: Connection reset by peer]

04:39 Kei_N has joined #ffmpeg-devel

04:53 Martchus has joined #ffmpeg-devel

04:54 Martchus_ has quit [Ping timeout: 252 seconds]

05:00 System_Error has quit [Remote host closed the connection]

05:06 System_Error has joined #ffmpeg-devel

05:28 ccawley2011 has joined #ffmpeg-devel

05:28 Kwiboo has quit [Quit: .]

05:29 Kwiboo has joined #ffmpeg-devel

05:34 ccawley2011 has quit [Ping timeout: 260 seconds]

06:18 ukn_unknown has quit [Ping timeout: 240 seconds]

06:41 Martchus_ has joined #ffmpeg-devel

06:42 Martchus has quit [Ping timeout: 272 seconds]

07:05 ngaullier has joined #ffmpeg-devel

07:49 mlauss2 has joined #ffmpeg-devel

07:51 ^Neo has joined #ffmpeg-devel

07:51 ^Neo has quit [Changing host]

07:53 ahmedhamed has quit [Quit: Connection closed for inactivity]

08:15 ^Neo has quit [Ping timeout: 276 seconds]

08:21 ngaullier has quit [Remote host closed the connection]

08:25 ngaullier has joined #ffmpeg-devel

08:38 <fflogger> [newticket] redstone: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) created https://trac.ffmpeg.org/ticket/11506

09:07 <JEEB> lol, that is more that ffplay is built without libplacebo in that binary, and he isn't testing just decoding

09:10 <ePirat> michaelni, maybe you can help get the files added to fate for toots5446 chained ogg metadata patchset?

09:32 <fflogger> [editedticket] quinkblack: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) updated https://trac.ffmpeg.org/ticket/11506#comment:1

09:44 <fflogger> [editedticket] tomwillow: Ticket #7558 ([undetermined] Ignore coded resolutions in using -c:v copy?) updated https://trac.ffmpeg.org/ticket/7558#comment:1

09:52 j45_ has joined #ffmpeg-devel

09:53 j45 has quit [Ping timeout: 260 seconds]

09:53 j45_ is now known as j45

09:53 j45 has quit [Changing host]

09:53 j45 has joined #ffmpeg-devel

10:04 mlauss2 has quit [Quit: Client closed]

10:09 MyNetAz has quit [Remote host closed the connection]

10:12 <fflogger> [editedticket] Gyan: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) updated https://trac.ffmpeg.org/ticket/11506#comment:2

10:13 MyNetAz has joined #ffmpeg-devel

10:40 cone-926 has joined #ffmpeg-devel

10:40 <cone-926> ffmpeg wang-bin master:154c00514d88: lavc/videotoolboxenc: add hevc main42210 and p210

10:55 Anthony_ZO has joined #ffmpeg-devel

11:02 System_Error has quit [Ping timeout: 264 seconds]

11:05 ^Neo has joined #ffmpeg-devel

11:05 ^Neo has quit [Changing host]

11:05 ^Neo has joined #ffmpeg-devel

11:09 System_Error has joined #ffmpeg-devel

11:33 <wbs> lol, that issue about av1 sw decoding being too slow, and hardware decoding being broken on an M4 ... is using an x86_64 build of ffmpeg

11:42 <kierank> loool

11:47 <wbs> (and sw decoding of av1 should be plenty fast; the only thing that hwdec gains you is a bit lower power consumption and cpu usage)

11:48 <kierank> wbs: does the rosetta simd compiler convert x86 simd to arm simd?

11:48 <kierank> or just implements it in scalar?

11:51 <wbs> kierank: no idea, but I think it does map to NEON in some form (and it doesn't do AVX, only SSE variants, iirc)

12:16 _whitelogger has quit [Remote host closed the connection]

12:18 _whitelogger_ has joined #ffmpeg-devel

12:19 j45 has joined #ffmpeg-devel

12:24 ngaullier has quit [Ping timeout: 246 seconds]

12:26 <JEEB> wbs: lol, didn't even notice they were using the wrong arch since their testing methodology had issues on a whole different level

12:38 Thul has quit [Ping timeout: 245 seconds]

12:41 minimal has joined #ffmpeg-devel

12:41 <fflogger> [editedticket] cgbug: Ticket #11490 ([avformat] [Regression] Audio silent for long MOV file) updated https://trac.ffmpeg.org/ticket/11490#comment:8

12:42 jamrial has joined #ffmpeg-devel

13:00 paulk has quit [Ping timeout: 252 seconds]

13:00 paulk has joined #ffmpeg-devel

13:01 ngaullier has joined #ffmpeg-devel

13:04 av500 has quit [Ping timeout: 246 seconds]

13:29 HarshK23 has joined #ffmpeg-devel

13:37 <ramiro> haasn: have you tested with neon as well? it seems gcc (and clang even worse) don't like passing arguments in vector registers

13:38 <ramiro> apparently we'd need to use neon's intrinsics vectors types for that

13:40 cone-926 has quit [Quit: transmission timeout]

13:54 <haasn> ramiro: https://godbolt.org/z/cGj5f86W3

13:54 <haasn> don't see any issue here

13:55 <haasn> it only breaks for chunk sizes exceeding the vector length

13:55 <haasn> but that's a given

13:55 <haasn> and by breaks I mean spills to stack

13:57 <haasn> RVV codegen completely breaks but that's a given, on RVV we need to determine the vector size dynamically

13:57 <haasn> and probably use hand written asm for it

14:18 Anthony_ZO has quit [Ping timeout: 252 seconds]

14:35 <ramiro> hmm, that's odd. can you do the whole chain on that godbolt? (read, swizzle, from8, lshift, write). it looks like lshift and write are reading from memory again (?)

14:37 ccawley2011 has joined #ffmpeg-devel

14:38 <ramiro> haasn: oh, got it. it was a chunk size issue. if I set chunk size to 8 on neon then it also works with the int16 functions.

14:39 <haasn> right

14:39 <haasn> so

14:39 <APic> ☺

14:39 <haasn> the approach I'm eyeballing now is to use arrays when the vector size would exceed the native vector size

14:39 <haasn> this is just for the C fallback code

14:39 <haasn> obviously a hand written asm path can do whatever it wants

14:40 <haasn> e.g. passing the high and low halves separately

14:40 <haasn> though handling 32 bit float vectors is always gonna be a bit challinging

14:42 <haasn> since I'm guessing we will want to go no lower than 16 on the chunk size, that will require storing 512 bits per component, e.g. 8 vectors of size 256 or 16 (!) of size 128

14:42 <haasn> at least even on RVV 128 we have 32 vector registers so that's fine I suppose

14:42 <haasn> what about NEON?

14:43 <haasn> actually I have big plans for an RVV backend

14:44 <haasn> since we control the entire call chain we can do neat tricks like only setting $vtype on SWS_OP_READ and SWS_OP_CONVERT

14:44 <haasn> all other operations can just assume the vector type is already implicitly set

14:45 ccawley2011_ has joined #ffmpeg-devel

14:45 <haasn> and we can just use a static pattern m1 for u8, m2 for u16, m4 for f32 + determining the effective chunk size automatically

14:47 ccawley2011 has quit [Ping timeout: 252 seconds]

14:55 rvalue has quit [Read error: Connection reset by peer]

14:56 rvalue has joined #ffmpeg-devel

15:00 ukn_unknown has joined #ffmpeg-devel

15:01 ukn_unknown43 has joined #ffmpeg-devel

15:04 ukn_unknown has quit [Ping timeout: 240 seconds]

15:27 ukn_unknown43 has quit [Ping timeout: 240 seconds]

16:14 microchip_ has quit [Quit: There is no spoon!]

16:14 microchip_ has joined #ffmpeg-devel

16:37 ccawley2011_ has quit [Ping timeout: 244 seconds]

16:43 rvalue has quit [Read error: Connection reset by peer]

16:44 rvalue has joined #ffmpeg-devel

17:04 ccawley2011 has joined #ffmpeg-devel

17:51 <haasn> ramiro: new code WIP: https://github.com/haasn/FFmpeg/blob/swscale4/libswscale/ops_tmpl_int.c

17:51 <haasn> I like this framework a lot better overall

17:51 <haasn> and it's way faster :)

17:51 <haasn> and we can do things like dynamically choosing the correct chunk size, even based on how many remaining operations there are

17:52 <haasn> how large, rather

17:59 ngaullier has quit [Remote host closed the connection]

18:01 cone-821 has joined #ffmpeg-devel

18:01 <cone-821> ffmpeg James Almer master:c3b60e0df73b: tests/fate/pixfmt: add conversion tests with semi planar YUV formats

18:01 <cone-821> ffmpeg James Almer master:228713ef5dc7: swscale/input: add support for UYYVYY411

18:01 <cone-821> ffmpeg James Almer master:52eb0e18db27: avfilter/vsrc_testsrc: use aligned macros for writing

18:23 <welder> How to run the new swscale benchmark locally?

18:27 another| is now known as another

18:28 ccawley2011 has quit [Ping timeout: 252 seconds]

18:38 ccawley2011 has joined #ffmpeg-devel

18:49 ccawley2011 has quit [Ping timeout: 252 seconds]

18:56 <fflogger> [newticket] nathanf: Ticket #11507 ([avfilter] vpp_qsv tonemapping and color space conversion does not change metadata) created https://trac.ffmpeg.org/ticket/11507

19:04 minimal has quit [Quit: Leaving]

19:49 <fflogger> [editedticket] nyanmisaka: Ticket #11507 ([avfilter] vpp_qsv tonemapping and color space conversion does not change metadata) updated https://trac.ffmpeg.org/ticket/11507#comment:1

20:30 <Lynne> what was the way to disable probing in ffmpeg.c?

20:31 Guest47 has joined #ffmpeg-devel

20:31 Guest47 has quit [Write error: Broken pipe]

20:32 <Lynne> speaking of; jamrial, I thought the ffv1 parser avoided the need to decode upfront to detect the format

20:32 Guest95 has joined #ffmpeg-devel

20:33 Guest95 has quit [Write error: Broken pipe]

20:33 Guest47 has joined #ffmpeg-devel

20:37 Guest47 has quit [Write error: Connection reset by peer]

21:01 cone-821 has quit [Quit: transmission timeout]

21:08 Flat_ has joined #ffmpeg-devel

21:09 Flat has quit [Ping timeout: 265 seconds]

21:37 <jamrial> Lynne: it should, not sure what else could be missing for the demux code to still attempt to decode a frame

21:49 psykose has quit [Remote host closed the connection]

21:50 psykose has joined #ffmpeg-devel

21:53 <fflogger> [editedticket] redstone: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) updated https://trac.ffmpeg.org/ticket/11506#comment:3

21:58 <fflogger> [editedticket] redstone: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) updated https://trac.ffmpeg.org/ticket/11506#comment:4

22:03 <fflogger> [editedticket] redstone: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) updated https://trac.ffmpeg.org/ticket/11506#comment:5

22:06 <fflogger> [editedticket] nathanf: Ticket #11507 ([avfilter] vpp_qsv tonemapping and color space conversion does not change metadata) updated https://trac.ffmpeg.org/ticket/11507#comment:2

22:09 <mkver> jamrial, Lynne: Have a look at FF_CODEC_CAP_SKIP_FRAME_FILL_PARAM

22:09 <fflogger> [editedticket] redstone: Ticket #11506 ([undetermined] FFMPEG's AV1 hardware decoding is completely broken on Mac) updated https://trac.ffmpeg.org/ticket/11506#comment:6

22:12 <Lynne> mkver: adding that to .caps_internal doesn't seem to do it either

22:14 <jamrial> Lynne: maybe http://pastie.org/p/6Hz6ab5MnJT6LfvFmrkWm1

22:21 <mkver> Yup, it needs to be combined with a skip_frame check.

22:21 <Lynne> yes, that works

22:26 <mkver> Why are we actually not generically checking for AVDISCARD_ALL in ff_get_buffer()?

22:29 another is now known as another|

22:34 ^Neo has quit [Ping timeout: 272 seconds]

22:38 <ramiro> haasn: nice. this way you can define different backends (DECL_IMPL, DECL_IMPL_VEC, CONTINUE, ...)

22:40 <ramiro> haasn: btw, "swscale: fix gray -> grayf32 SIGFPE" looks good to me.

22:49 <ramiro> (I guess you should still submit it to the ML though)

23:22 Mirarora has quit [Quit: Mirarora encountered a fatal error and needs to close]