#ffmpeg-devel on 2023-12-26 — irc logs at libera.irclog.whitequark.org

2023-11-11 01:05 michaelni changed the topic of #ffmpeg-devel to: Welcome to the FFmpeg development channel | Questions about using FFmpeg or developing with libav* libs should be asked in #ffmpeg | This channel is publicly logged | FFmpeg 6.1 has been released! | Please read ffmpeg.org/developer.html#Code-of-conduct

00:01 mkver has quit [Ping timeout: 240 seconds]

00:03 iive has quit [Quit: They came for me...]

00:23 kurosu has quit [Quit: Connection closed for inactivity]

00:28 jamrial has joined #ffmpeg-devel

01:16 navi has quit [Quit: WeeChat 4.0.4]

01:26 feiw1 has quit [Ping timeout: 240 seconds]

01:26 feiw1 has joined #ffmpeg-devel

01:31 <cone-420> ffmpeg James Almer master:4fee63b241e0: x86/takdsp: add missing wrappers to AVX2 functions

01:40 thilo has quit [Ping timeout: 246 seconds]

01:42 thilo has joined #ffmpeg-devel

01:42 thilo has quit [Changing host]

01:42 thilo has joined #ffmpeg-devel

02:12 derpydoo has quit [Quit: derpydoo]

02:14 derpydoo has joined #ffmpeg-devel

03:00 lemourin has quit [Read error: Connection reset by peer]

03:01 lemourin has joined #ffmpeg-devel

03:18 jamrial has quit []

03:37 jarthur has joined #ffmpeg-devel

04:31 cone-420 has quit [Quit: transmission timeout]

04:54 AbleBacon has quit [Read error: Connection reset by peer]

05:07 \\Mr_C\\ has joined #ffmpeg-devel

06:03 epony has quit [Remote host closed the connection]

06:47 rvalue has quit [Read error: Connection reset by peer]

06:58 rvalue has joined #ffmpeg-devel

07:04 jarthur has quit [Quit: jarthur]

07:14 feiw1 has quit [Ping timeout: 246 seconds]

07:14 feiw1 has joined #ffmpeg-devel

08:07 dellas has joined #ffmpeg-devel

08:07 dellas has quit [Remote host closed the connection]

08:17 <Lynne> ^^we should just drop old yasm by now

08:17 <Lynne> it was 2.5 years ago since the last time the topic was brought up

08:18 Krowl has joined #ffmpeg-devel

08:23 tmm1 has quit [Ping timeout: 264 seconds]

08:25 tmm1 has joined #ffmpeg-devel

08:32 derpydoo has quit [Ping timeout: 256 seconds]

08:54 feiw1 has quit [Ping timeout: 260 seconds]

08:54 feiw1 has joined #ffmpeg-devel

09:04 philipl has quit [Ping timeout: 260 seconds]

09:04 philipl has joined #ffmpeg-devel

09:07 rvalue has quit [Quit: ZNC - https://znc.in]

09:07 rvalue has joined #ffmpeg-devel

09:20 feiw1 has quit [Ping timeout: 255 seconds]

09:23 feiw2 has joined #ffmpeg-devel

09:25 Krowl has quit [Read error: Connection reset by peer]

09:46 kurosu has joined #ffmpeg-devel

10:00 <motherboard> A bit off-topic, but how is 3D visual data encoded, like what would be needed for VR headsets

10:04 Krowl has joined #ffmpeg-devel

10:27 Workl has joined #ffmpeg-devel

10:30 Krowl has quit [Ping timeout: 260 seconds]

10:35 averne has quit [Quit: quit]

10:37 averne has joined #ffmpeg-devel

11:10 <thardin> you mean point clouds or?

11:10 averne has quit [Quit: quit]

11:11 <thardin> also I just found I had an IMA APC encoder laying around. rebasing

11:12 averne has joined #ffmpeg-devel

11:17 mkver has joined #ffmpeg-devel

11:21 averne has quit [Quit: quit]

11:21 averne has joined #ffmpeg-devel

11:23 <thardin> what might cause configure not to find a muxer?

11:24 <thardin> ah I was using AVOutputFormat not FFOutputFormat

11:32 <thardin> is fate supposed to work with --disable-everything?

11:34 <thardin> "make: *** No rule to make target 'libavcodec/tests/mjpegenc_huffman', needed by 'fate-libavcodec-huffman'. Stop." I'm guessing no

11:43 <thardin> there we go. I even had tests

12:00 jamrial has joined #ffmpeg-devel

12:03 <Lynne> full fate requires even ffprobe

12:03 <Lynne> which I never enable, no way I'm waiting for another linking step

12:33 Kei_N_ has quit [Read error: Connection reset by peer]

12:33 Kei_N has joined #ffmpeg-devel

12:34 ccawley2011 has joined #ffmpeg-devel

12:58 Workl has quit [Read error: Connection reset by peer]

13:13 paulk has quit [Ping timeout: 260 seconds]

13:14 paulk has joined #ffmpeg-devel

13:16 navi has joined #ffmpeg-devel

13:17 Krowl has joined #ffmpeg-devel

13:26 <motherboard> thardin: yes point cloud

13:29 <thardin> depends on what type of point cloud I think

13:30 <thardin> some use ellipsoids rather than points

13:33 <Lynne> thardin: now I think about it, couldn't you just skip some coeffs during decoding?

13:34 <Lynne> unless the quarter res option does that already, IIRC it only affected transforms

13:40 dellas has joined #ffmpeg-devel

13:45 <thardin> that would be ideal

13:45 <thardin> j2k unfortunately is *flexible*, so I'm not sure whether you can always do that

13:45 <thardin> pal: ?

13:46 <thardin> each pass can, if I'm not mistaken, encode either one or more level(s) of coeffs, or bit slices thereof, or both. all using CABAC

13:47 <thardin> htj2k changes this so you can only use CABAC for the MSBs

13:50 <Lynne> wavelet levels directly correspond to resolution

13:50 <Lynne> if you need quarter res, you can skip the last 2 levels
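
A minimal sketch of the point about wavelet levels and resolution, under simplifying assumptions: a 1-D averaging Haar transform stands in for the 2-D 5/3 or 9/7 wavelets that JPEG 2000 actually uses, and the function names are made up for illustration. Leaving out the two finest detail levels during reconstruction yields a signal at one quarter of the original length.

```python
def haar_forward(signal, levels):
    """Decompose `signal` into a coarse approximation plus detail levels.

    details[0] is the coarsest level and details[-1] the finest.
    The signal length must be divisible by 2**levels.
    """
    approx = list(signal)
    details = []
    for _ in range(levels):
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.insert(0, [(a - b) / 2 for a, b in pairs])
        approx = [(a + b) / 2 for a, b in pairs]
    return approx, details


def haar_inverse(approx, details, skip_finest=0):
    """Rebuild the signal, ignoring the `skip_finest` finest detail levels."""
    out = list(approx)
    used = details[:len(details) - skip_finest]
    for level in used:
        rebuilt = []
        for s, d in zip(out, level):
            rebuilt.extend((s + d, s - d))
        out = rebuilt
    return out


signal = [8, 6, 7, 5, 3, 1, 4, 2]
approx, details = haar_forward(signal, levels=3)
full = haar_inverse(approx, details)                    # all levels used
quarter = haar_inverse(approx, details, skip_finest=2)  # last 2 levels skipped
```

With this particular Haar normalization, `quarter` holds the mean of each group of four input samples, so the finest coefficients never need to be decoded at all. Whether a real J2K codestream lets a decoder skip them cheaply depends on how the passes are organized, which is the open question in the discussion above.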

13:54 dellas has quit [Remote host closed the connection]

13:57 lemourin has quit [Quit: The Lounge - https://thelounge.chat]

13:59 lemourin has joined #ffmpeg-devel

14:00 <BBB> I believe most of us would at this point be supportive of dropping yasm support (giggle)

14:01 <BBB> I don't think dav1d builds with yasm anymore

14:04 derpydoo has joined #ffmpeg-devel

14:05 kurosu has quit [Quit: Connection closed for inactivity]

14:07 <Lynne> we wouldn't even be dropping yasm, just 2009-circa versions of it

14:07 <Lynne> which some distributions shipped

14:08 <Lynne> nevcairiel: I think it was you last time who pointed out where yasm support was needed, do you still remember?

14:09 <jamrial> unless some change in x86inc/util requires a modern yasm/nasm, i don't think dropping support for old versions is justified

14:09 <jamrial> if all it takes is a %if HAVE_WHATEVER_EXTERNAL check

14:09 <nevcairiel> I had some issues with linking nasm objects with msvc ages ago, but that was fixed either on our end or on theirs at some point

14:11 <thardin> Lynne: not so easy with CABAC I think

14:12 <Lynne> jamrial: it's not that big of a deal, but keeping compatibility with yasm requires using 3-arg instructions everywhere

14:12 <Lynne> and some other stuff

14:12 <thardin> you're supposed to be able to abort the coeff bitstream at any point though. I'd need to dig into the spec some more to find out the proper way to do it

14:12 <jamrial> on avx functions for some instructions, yeah

14:12 <thardin> openjpeg does that I think, but it has other problems

14:13 Kei_N_ has joined #ffmpeg-devel

14:14 <Lynne> jamrial: pretty much everywhere for avx, in fact

14:15 <Lynne> it's not a big deal, but still, it's a 15-ish year old yasm, we have to cut the line at some point and stop worrying about it

14:15 <Lynne> thardin: slices are independent of each other, right?

14:16 <Lynne> including AC context

14:16 Kei_N has quit [Ping timeout: 256 seconds]

14:21 <thardin> tiles are independent

14:21 <thardin> codeblocks are too to an extent

14:21 <thardin> tiles are typically quite large, on the order of 1024x1024. codeblocks are always 4096 pixels, usually 64x64 but not necessarily

14:22 <thardin> the present decoder can only ||ize the IDWT, not the CB decoding. I forget whether my coarse tile decoding is in master atm or not

14:22 <Lynne> yeah, with 1024x1024 tiles, I can see how you'd have issues with slice threading

14:23 <thardin> 1024x1024 is not granular enough to be useful

14:23 <thardin> I whipped up something that ||izes tiles x components so for a 4k image you can do 24 component files in ||

14:24 <Lynne> err, 24 components?

14:24 <thardin> 3 components * 8 tiles

14:24 <Lynne> ah, ok

14:25 <thardin> but that's not enough to fully utilize say a 96 core machine. and often the files aren't equally sized. you typically have a large one in the center and three smaller ones. or just one big tile

14:25 <thardin> the tiles*

14:25 <thardin> only by doing it at the CB level can you guarantee good speedup
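
A rough sketch of why uneven tiles limit the speedup: for independent jobs, no schedule can finish before the longest single job, nor before the total work divided evenly across all workers. The relative costs below are hypothetical, and equal-cost unit jobs stand in for codeblocks; real codeblock decode times vary with content.

```python
def speedup_bound(job_costs, workers):
    """Upper bound on parallel speedup for a set of independent jobs."""
    total = sum(job_costs)
    # The best possible wall-clock time is limited by the longest job
    # and by a perfectly even split of the total work.
    best_makespan = max(total / workers, max(job_costs))
    return total / best_makespan


COMPONENTS = 3
WORKERS = 96

# Hypothetical relative costs: one large tile and three smaller ones.
uneven_tiles = [900, 100, 100, 100]
tile_jobs = [cost for cost in uneven_tiles for _ in range(COMPONENTS)]

# The same total work cut into equal unit-cost pieces (codeblock stand-ins).
codeblock_jobs = [1] * sum(tile_jobs)

tile_bound = speedup_bound(tile_jobs, WORKERS)
codeblock_bound = speedup_bound(codeblock_jobs, WORKERS)
```

Under these made-up numbers the tile-level split cannot exceed a 4x speedup however many cores are available, while the fine-grained split can in principle keep all 96 workers busy.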

14:26 <thardin> I forget whether each CB is strictly limited to one reslevel or not

14:26 derpydoo has quit [Ping timeout: 256 seconds]

14:29 <Lynne> this could've been fixed by defining j2k levels, but a bit too late for that

14:33 <thardin> htj2k fixes it sort of, by being less stupid

14:34 <thardin> there is something in j2k similar to scan orders in jpeg. I forget the name

14:34 <thardin> for htj2k this is more constrained. and of course you can mix htj2k and regular j2k in the same file

14:35 <thardin> for example you can have a low quality htj2k with fewer reslevels and then a lossless j2k version of the same image in the same file

14:36 <Lynne> how is htj2k adoption coming along?

14:39 <thardin> there is interest from the usual suspects

14:39 <thardin> pal should know better than I

14:44 kurosu has joined #ffmpeg-devel

14:47 derpydoo has joined #ffmpeg-devel

14:56 <Lynne> non-existent, I guess :/

14:58 <thardin> disney is interested I think

14:59 <thardin> we got some lossless RGB48 samples that were non-trivial to decode quickly

15:00 <thardin> are audio packets always keyframes? I'm trying to clear AV_PKT_FLAG_KEY for every packet except the first one in the APC demuxer

15:04 lemourin8 has joined #ffmpeg-devel

15:04 lemourin is now known as Guest1785

15:06 <thardin> codec_props

15:06 <jamrial> thardin: no, mlp/truehd is an example of audio with non keyframes

15:07 <jamrial> but i don't know if the generic logic takes that into account properly

15:09 <thardin> yeah I figured it out

15:13 Krowl has quit [Read error: Connection reset by peer]

15:21 epony has joined #ffmpeg-devel

15:44 Krowl has joined #ffmpeg-devel

16:10 dellas has joined #ffmpeg-devel

16:20 dellas has quit [Remote host closed the connection]

16:31 tmm1_ has joined #ffmpeg-devel

16:33 tmm1 has quit [Ping timeout: 264 seconds]

16:43 noonien852 has joined #ffmpeg-devel

16:45 noonien85 has quit [Ping timeout: 256 seconds]

16:45 noonien852 is now known as noonien85

16:50 Krowl has quit [Read error: Connection reset by peer]

17:11 jamrial has quit [Read error: Connection reset by peer]

17:11 jamrial_ has joined #ffmpeg-devel

17:15 kurosu has quit [Quit: Connection closed for inactivity]

17:23 rvalue has quit [Ping timeout: 264 seconds]

17:28 <pal> > j2k unfortunately is *flexible*, so I'm not sure whether you can always do that < --> both a gift and a curse

17:29 <Lynne> from the PoV of someone familiar with vc-2, it's just far too flexible

17:29 <pal> for new file-based media applications the goal is to encourage folks to use one of the two constraint sets at Annex I of https://pub.smpte.org/doc/st2067-21/20221124-pub/

17:31 <pal> these constraint sets should work for any frame-based RGB(A)/YCbCr/XYZ media application (lossy or lossless)

17:31 <pal> for sub-frame latency, e.g. as used in streaming over RTP, the community is still honing the set of constraints

17:31 dellas has joined #ffmpeg-devel

17:32 <pal> thardin: let me know if you come across users that are looking to new applications of J2K and/or revamping old applications

17:33 rvalue has joined #ffmpeg-devel

17:33 <pal> ... it would be good to understand their requirements and either modify the constraint sets above or encourage them to adopt them as-is

17:33 <pal> ... the goal is to create a single reasonable set of constraints for media applications

17:34 <pal> (sorry for the delay over IRC... my workstation blew up before the holiday break... no amount of backups can recreate a honed machine :(

17:34 <JEEB> &34

17:34 <JEEB> whoops

17:38 <thardin> no worries

17:39 <thardin> and yeah sounds a bit like mxf. the format is so general

17:41 <pal> more like ffmpeg :)

17:47 <Lynne> it is a general format, it was designed to cover anything from photography to fingerprint images to satellite photos in different wavelengths

17:47 <Lynne> just not quite real-time video

17:49 <Lynne> nor running on any realistic hardware, current or future

17:49 <pal> there is activity around J2K in HEIF/ISOBMFF (https://github.com/strukturag/libheif/discussions/1054), as friendlier alternative to JPX and JP2

17:49 <pal> Lynne: HT is running on today's hardware

17:50 <Lynne> yeah, but it was standardized with a delay of 20 years

17:50 <pal> ... maybe it should have been called JPEG 2020

17:50 <Lynne> the jp2 header isn't that bad, though

17:50 <Lynne> much better than isobmff

17:51 <pal> s/jp2/mj2/

17:51 <pal> (mistype)

17:53 <thardin> too bad heif is a mess

17:54 <thardin> and yeah j2k is likely the only format that standardizes hyperspectral images. sort of.

17:55 <thardin> makes me wonder if NASA or any other space agency uses it for probes

17:55 <Lynne> nope

17:55 <pal> very much yes

17:55 <Lynne> they use tiff mostly afaik

17:55 <thardin> lo

17:55 <thardin> l

17:55 <Lynne> or at least that's what they publish

17:55 <pal> widely used in geospatial applications

17:55 <pal> LOL

17:55 <thardin> Lynne: published and what's sent in space are two different things

17:56 <Lynne> I kinda doubt that they do jpeg2000 onboard satellites, maybe as mezzanine on the ground

17:56 <thardin> but, the CPUs used in probes are typically 20 years old or so. can't use too complicated formats

17:57 <thardin> for a space project I was involved with we purposefully picked an AVR running at 7 MHz because it has an affordable space qualified variant

17:58 <thardin> affordable here meaning 3,000€ or so

18:01 <pal> https://www.intelligence-airbusds.com/en/8724-jpeg-2000-format

18:15 <thardin> I wonder how something like intra prediction would play with j2k

18:16 <pal> https://pubmed.ncbi.nlm.nih.gov/21411403/

18:16 <pal> ... is one approach

18:18 <Lynne> it's more complex than this

18:19 <Lynne> wavelets don't play well with intraprediction

18:19 <Lynne> as unlike a dct, they have to be contiguous, even when done on a per-block basis

18:19 <thardin> could you have a "tilted" wavelet mimicking intra prediction?

18:20 <thardin> so that if a block has a general "angle" one picks a wavelet following the angle

18:20 <pal> what do you mean by "they have to be contiguous"

18:20 <thardin> thereby compressing variance into one dimension

18:20 <Lynne> what dirac did was use OBMC, transformed it, and signalled a residual to be added onto the prediction

18:20 <Lynne> similar to what daala ended up doing

18:20 <thardin> wavelets don't have to be contiguous. you can have lapped wavelets if you want

18:20 <Lynne> that's my point

18:21 <Lynne> you need data from the previous block to correctly apply an idwt

18:21 <thardin> sure

18:21 <Lynne> larger and more complex wavelets than haar need more taps

18:22 <thardin> j2k uses tiles to solve that issue. I'm sure it's possible to come up with a way to mix wavelets

18:23 <thardin> each coefficient has a corresponding "pattern" after all

18:25 <thardin> after all, the purpose of intra prediction is to exploit directional redundancy

18:26 <Lynne> "solve that issue"?

18:26 <pal> (ok I am slow this morning... I had read "inter"... ignore the link above, which is about inter)

18:28 <thardin> Lynne: tile borders are non-lapped. within each file you get the inherent "lapping" the dwt provides. for better or worse

18:28 <thardin> each tile*

18:30 <thardin> if you have say 8 "angles", each with their own corresponding set of wavelets, you should be able to at least group "macroblocks" according to angle then perform a sparse (I)DWT for each set of blocks

18:30 <thardin> sadly that wouldn't likely not thread well

18:31 <thardin> would likely not

18:31 <Lynne> thardin: so when you do an idwt, you take each tile's data, and for any area outside of it, you assume the value is 0?

18:34 <thardin> nah that wouldn't work

18:35 <thardin> j2k extends values outside the boundaries of each tile. I forget the name of that

18:35 <Lynne> mirror?

18:35 <thardin> hm.. yeah it mirrors it
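
A small sketch of the mirroring idea, assuming the whole-sample symmetric variant in which the boundary sample itself is not repeated. That variant is the usual match for odd-length filters, but the exact rule should be checked against the spec. The helper names are made up for illustration.

```python
def mirror_index(i, n):
    """Map any integer index onto 0..n-1 by reflecting at both ends."""
    if n == 1:
        return 0
    period = 2 * (n - 1)
    i %= period  # Python's % yields a non-negative result here
    return i if i < n else period - i


def extend(samples, margin):
    """Return `samples` padded by `margin` mirrored values on each side."""
    n = len(samples)
    return [samples[mirror_index(i, n)] for i in range(-margin, n + margin)]


padded = extend([1, 2, 3, 4], margin=2)
```

Here `padded` is `[3, 2, 1, 2, 3, 4, 3, 2]`: the filter taps that reach past the tile edge see a reflected copy of the tile's own data rather than zeros or the neighbouring tile.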

18:35 <Lynne> oof

18:35 <Lynne> no wonder tiles are huge

18:36 <thardin> dct by contrast repeats the data

18:36 <motherboard> all of you seem to be quite knowledgeable, what are your backgrounds if you don't mind me asking

18:37 <thardin> compsci and broadcast

18:37 <Lynne> dirac/vc2 just avoids the problem

18:37 <Lynne> physics

18:37 <thardin> haven't looked at vc-2

18:37 <motherboard> nicee, i am a cs undergrad

18:38 <thardin> I feel like picking basis per block is a better idea than intra prediction, at least from a threading perspective

18:38 <Lynne> picking basis?

18:39 <thardin> dct and dwt are just two of many linear bases you could use

18:39 <thardin> non-linear stuff is also coming into vogue, under the umbrella of "AI"

18:39 <Lynne> it's been tried

18:39 <thardin> it's just regression

18:40 <Lynne> at the end of the day, you just want a predictable (literally) frequency output

18:40 <thardin> for people who don't want to pretend they're doing statistics

18:40 <Lynne> DCTs do this optimally

18:40 <Lynne> by giving you a laplacian distribution that you can take advantage of while quantizing & coding

18:42 <thardin> welll. dct is generic

18:43 <thardin> it's good at compacting energy under a certain model of how the input data behaves IIRC

18:44 <thardin> it can't change depending on the input data. neither can the color transform vary across the image

18:46 <thardin> we can't go full KLT at present for computational reasons. but on the other hand tensor units are becoming more and more common

18:46 epony has quit [Remote host closed the connection]

18:51 <Lynne> you won't gain that much out of it

18:51 <Lynne> you'll get more with better quantization

18:52 <Lynne> and more importantly, DC prediction

18:52 <Lynne> with daala's lossless mode, predicting DC was a free 5% gain iirc

18:53 <Lynne> and this wasn't even a DCT, but a 5-level haar on 64x64 blocks (1x1 DC)

18:56 <thardin> DC prediction just sounds like DWT or the hierarchical DCT in JPEG that no one implements

19:01 epony has joined #ffmpeg-devel

19:04 <Lynne> daala went a step further with it, by also predicting the DC of each subblock within a superblock by using a haar

19:04 <Lynne> that's mostly incompatible with the current trend of using rectangular blocks, but it made sense back then

19:05 <Lynne> oh, speaking of, rectangular blocks sort of alleviate the gains you'd otherwise get from tunable transforms

19:05 <Lynne> they're cheaper to analyze for too

19:06 <Lynne> though you need a way to expressively signal partitions that isn't going to be too expensive

19:06 <thardin> nothing stops you from using a rectangular haar, no?

19:06 <Lynne> I was stuck on that while working on my own codec

19:06 <thardin> but yeah, subdividing blocks into rectangular subblocks is an old ida

19:06 <thardin> idea

19:07 <Lynne> it took years and was first implemented in av1, I think, unless VP9 or HEVC had it (don't remember)

19:08 <Lynne> but yeah, it was an old idea, vc-2 had it in a limited form by recommending 32x8 slices, due to mostly being used with interlacing

19:09 <thardin> I mean old as in 90's or even 80's

19:10 <thardin> interplay MVE has rectangular subdivisions I think

19:10 <thardin> it's not very fancy of course, just a way to do palette stuff a bit better. but still

19:14 <thardin> picking bases strikes me as being very VQ-ish come to think of it

19:21 derpydoo has quit [Ping timeout: 256 seconds]

19:26 <Lynne> it's just far too expensive and boring though

19:27 <Lynne> the ideal codec will have fully lapped transforms with frequency-domain intra and VQ

19:29 <thardin> there's a sliding scale between VQ, custom bases and using some fixed base

19:32 Krowl has joined #ffmpeg-devel

19:33 <Lynne> sort of, if you pick a good base, that takes care of needing clever coefficient coding

19:34 <Lynne> but it takes too long to search for a good base, tunable transforms are hard to implement in fixed hardware, and writing a clever coefficient coder is sort of its own reward

19:41 <thardin> yeah

19:45 dellas has quit [Read error: Connection reset by peer]

19:46 dellas has joined #ffmpeg-devel

19:46 kasper93 has quit [Ping timeout: 256 seconds]

19:48 <Lynne> there's also the issue of overflows in transforms

19:48 <Lynne> not a mistake we'd like to repeat again after av1

19:49 kasper93 has joined #ffmpeg-devel

19:49 <Lynne> though now that I think back, the hardware folks told us that tuning the coefficients was not a problem for them

19:50 <Lynne> so that's room to play with, but not as much as fully tweakable transforms

19:53 dellas has quit [Ping timeout: 246 seconds]

19:53 dellas82 has joined #ffmpeg-devel

19:56 AbleBacon has joined #ffmpeg-devel

20:02 mkver has quit [Ping timeout: 240 seconds]

20:10 jamrial_ has quit []

20:12 jamrial has joined #ffmpeg-devel

20:17 dellas82 has quit [Ping timeout: 246 seconds]

20:18 dellas82 has joined #ffmpeg-devel

20:38 mkver has joined #ffmpeg-devel

20:45 dellas82 has quit [Ping timeout: 268 seconds]

20:49 dellas82 has joined #ffmpeg-devel

20:53 Krowl has quit [Read error: Connection reset by peer]

20:54 TheSashmo has quit [Quit: Leaving...]

20:57 TheSashmo has joined #ffmpeg-devel

20:58 TheSashmo has quit [Client Quit]

21:10 TheSashmo has joined #ffmpeg-devel

22:07 <ubitux> ok i found the fix for prores

22:10 <JEEB> 'grats

22:31 <ubitux> damn the hell is that drama thread

22:32 <ubitux> i'm going to have 2012 ptsd reading that thread

22:33 <JEEB> I saw it grew longer and decided that while I was trying to lower my stress levels I would not read into it unless absolutely necessary

22:35 qeed_ has quit [Remote host closed the connection]

22:35 qeed_ has joined #ffmpeg-devel

22:36 ccawley2011 has quit [Read error: Connection reset by peer]

22:58 dellas82 has quit [Remote host closed the connection]

23:57 <BBB> nice debugging effort

23:58 <BBB> please do continue merging the two encoders ;)

23:58 <BBB> having 1 is better than having 2, sometimes