rvalue has quit [Ping timeout: 252 seconds]
rvalue has joined #ffmpeg
<FlorianBad> Did you ever come across some kind of stats that would describe the bitrates internet users can usually tolerate in most developed countries (or per country...) ?
<FlorianBad> and I'm not talking about their internet connection because their WiFi might slow down, e.g. I'm really talking about something measured at the browser level
lexano has quit [Ping timeout: 246 seconds]
lavaball has quit [Quit: lavaball]
navi has quit [Quit: WeeChat 4.1.2]
<FlorianBad> kepstin, I absolutely love the crf unit for picking the various bitrates available, it just makes so much sense. Thanks for suggesting that a few days ago :)
<FlorianBad> It's really perfect for my use-case to offer evenly spaced bitrates to the client
KombuchaKip has quit [Remote host closed the connection]
Suchiman has quit [Quit: Connection closed for inactivity]
KombuchaKip has joined #ffmpeg
thilo has quit [Ping timeout: 240 seconds]
thilo has joined #ffmpeg
five618480 has quit [Remote host closed the connection]
five618480 has joined #ffmpeg
<aaabbb> the problem with crf is that you're going to have to adjust the range, otherwise you may have videos that aren't streamable at *any* crf you use
MrZeus has quit [Ping timeout: 255 seconds]
<FlorianBad> aaabbb, because even 27 e.g. might be super heavy?
<aaabbb> yes
<FlorianBad> And you're talking about high-action motion typically?
<furq> it's unlikely if these are just music videos
<FlorianBad> like Tom Cruise running type of thing?
<furq> but you should use vbv constraints
<aaabbb> high-action motion, complex scenes, full color space, 10 bit, yuv444 or 422, large dimensions, high fps, etc
<aaabbb> lots of factors
<FlorianBad> well remember these will always come from my own final project exports, which will be the fps I chose, along with everything else
<aaabbb> yeah vbv constraints will let you prevent it from being overwhelming but then you might also be streaming a video that could be better because you're underutilizing the bandwidth
<aaabbb> eg i could make a video where even crf 18 barely uses any bitrate
<FlorianBad> and I already have -pix_fmt yuv420p
<aaabbb> then it would just be color range, dimensions, and complexity (both spatial and temporal)
<FlorianBad> aaabbb, sure but the quality that -crf 18 would be so amazing that 15 would make no visual difference, right?
<furq> as usual it depends
<furq> but i don't think i've ever encoded anything below 18
<aaabbb> yeah but my point is just that if you are targeting a specific bitrate because you are streaming and want to make sure you are tailoring the video for each person's bandwidth, then you usually want abr with constraints, and 2pass abr is the best way to do it
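A minimal sketch of the 2-pass ABR approach described here (filenames and the 4 Mb/s target are illustrative):

```shell
# Pass 1: analysis only; discard the output, audio not needed
ffmpeg -y -i input.mp4 -c:v libx264 -b:v 4M -pass 1 -an -f null /dev/null
# Pass 2: encode using the pass-1 stats, hitting the same average bitrate
ffmpeg -i input.mp4 -c:v libx264 -b:v 4M -pass 2 -c:a aac -b:a 128k output.mp4
```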
<FlorianBad> Ok, I think it will be fine, because I will have crf 17 19 21 23 25 27 for each of ~10 resolutions lol So my back-end will pick what fits best. I just need to write that part well and should be fine
<furq> that seems like a uselessly high number of variants
<aaabbb> that's a lot
<aaabbb> it's much better for streaming to use abr, the use of crf is really for storage
<furq> i don't know if i agree with that but it's at least no worse than using crf
<furq> unless you're paying a lot for bandwidth
<furq> for vod i would use crf+vbv
<aaabbb> furq: it's most commonly used by companies that do streaming (not big ones like netflix, they have their own special tricks)
<furq> yeah for live
<furq> this is vod
<aaabbb> crf+vbv is also good
<furq> if you're encoding 60 variants per video then i guess at least cpu and disk space are cheap
<furq> i would also be shocked if more than five of those ever get used
<aaabbb> yeah that's a whole lot of resolutions, and crf 17 is super super overkill
<aaabbb> for x264 at least
<FlorianBad> furq, they will, because it will always pick the resolution that is >= to the resolution of the canvas element in the browser page. So then it will look at which crf based on current bandwidth stats
<FlorianBad> since the first step is to pick the resolution, then you realize that's not that many options after all
<aaabbb> how big is the buffer on the browser?
<FlorianBad> but just enough ;)
<FlorianBad> aaabbb, I will handle that manually in my code, haven't decided yet, but it will probably be dynamic based on what happened in the past
<aaabbb> FlorianBad: the smaller the buffer the more strict you want to be with vbv
<FlorianBad> I see, yeah. Well it will probably be small if no bw irregularities are detected
<FlorianBad> I don't see why it would be greater than 5s if you have an amazing connection with little latency
<FlorianBad> (at least as long as I'm sure of that)
<aaabbb> greater means better quality because you can allow the bitrate to shoot up more for high complex scenes
<furq> youtube only has ~30 variants for 4k and that's with two codecs
<furq> and more than half of those are for 360p/240p/144p
<aaabbb> youtube also has huge asics that makes ~30 variants practical
<FlorianBad> But youtube cannot afford to make insane decisions, I can ;)
<furq> and that also includes the premium locked streams
<aaabbb> FlorianBad: insane decisions like -preset placebo? ;)
<FlorianBad> lol
<aaabbb> which actually isn't always insane for x264
lolok has joined #ffmpeg
<FlorianBad> no, like very small increments in resolutions. e.g. right now I'm running my script with a video and this is all the increments it will produce:
<furq> that's 30 variants for 4k30 but i assume you don't have any 60p stuff anyway
<FlorianBad> Now encoding each of 17 resolutions (448x252 512x288 576x324 640x360 768x432 896x504 1024x576 1152x648 1280x720 1536x864 1728x972 1920x1080 2304x1296 2560x1440 2944x1656 3328x1872 3840x2160) to each bitrate
<FlorianBad> (that's the print of my script)
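A hypothetical sketch of a ladder-generation loop like the script being described (resolutions, CRF rungs, and filenames are illustrative, not FlorianBad's actual script):

```shell
# One output per resolution x CRF rung
for res in 640x360 1280x720 1920x1080 3840x2160; do
  for crf in 19 23 27; do
    ffmpeg -i master.mp4 -vf "scale=$res" -c:v libx264 -crf "$crf" \
        -pix_fmt yuv420p "out_${res}_crf${crf}.mp4"
  done
done
```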
<aaabbb> that's a lot
<furq> yeah that seems entirely unnecessary
<aaabbb> you can use fewer and then let the browser do the downscaling
<furq> especially with six variants for each
<furq> in general i would rather have higher crf and lower resolution than the opposite
<aaabbb> i prefer the opposite personally just because downscaling is less prone to artifacts than upscaling
<FlorianBad> Yeah it's a lot but if your canvas in the page is 1477x831 then it will pick 1536x864 which will rescale beautifully in the page
<furq> yeah but upscaling is less prone to artifacts than a low quality encode
<aaabbb> true
<furq> also i meant lower crf but i guess you figured that out
<FlorianBad> Ok so now I need to research this VBV thing that I know nothing about :)
<aaabbb> FlorianBad: what i usually do is i keep things in resolutions divisible by 8
<aaabbb> vbv lets you do constraints
<aaabbb> so you can say "crf 18 but never exceed 10mbps over this amount of time even if you otherwise would"
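As a concrete sketch of crf+vbv (the numbers are illustrative; `-bufsize` sets the decoder buffer over which `-maxrate` is enforced):

```shell
# CRF 18, but spikes are capped: bitrate may not exceed 10 Mb/s
# as measured against a 20 Mb decoder buffer
ffmpeg -i input.mp4 -c:v libx264 -crf 18 \
    -maxrate 10M -bufsize 20M output.mp4
```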
<FlorianBad> aaabbb, my program picks the best nice modulos (as you can tell from above, it decided all these resolutions just from the original 3840x2160)
<FlorianBad> aaabbb, ok, but wouldn't that really just apply to Tom Cruise stuff?
<aaabbb> not necessarily
<aaabbb> things you might never expect, like gentle snowfall, can totally bitrate starve it
<furq> static shot of a lake
<aaabbb> exactly and then when you use crf, you're telling it "throw as many bits as you possibly can until you hit the desired quality"
<furq> fast camera pans and stuff will generally not be that bad because it's blurry anyway
<aaabbb> also fast camera pans are simple motion
<furq> but god forbid there's some confetti
<aaabbb> confetti, snow, anything with lots of particles moving in unpredictable directions
<aaabbb> but fast tom cruise running won't actually use all that much because the majority of the shot is smooth motion in one predictable direction
<furq> and it'll be shot on film so there'll naturally be a lot of motion blur
<FlorianBad> snowfall :D ok I see
<aaabbb> what makes sense intuitively as far as "complex" isn't always the same as what the encoder considers complex
<FlorianBad> confetti hahaha
<furq> have you never seen a sports team win the championship
<furq> guaranteed to ruin an encoder's day
<aaabbb> snowfall or confetti our brain just turns into a simple "glitter falling" perception which is even simpler for our minds to process than tom cruise running or a car chase scene, but the former is a nightmare for the encoder and the latter is relatively straightforward
<FlorianBad> but wait a second, why would that be important since I will calculate the bitrate of each piece of stream from mpd anyway?
<furq> because you're using crf
<aaabbb> that's why i usually recommend 2pass abr (or crf+vbv as furq says)
<FlorianBad> my program will know that this stream chunk corresponds to 2 seconds, e.g. So if that won't work for the connection this person has it will simply pick the same chunk from a lower crf
<furq> "simply" doing a lot of work there
<aaabbb> you don't really want to change it too fast
<furq> you want to avoid switching as much as you can
<FlorianBad> furq, well it will do it anyway, so it's not like I will code anything extra
<aaabbb> even if it's struggling to keep up with it, let it run out of buffer before you switch
<aaabbb> FlorianBad: but then it will look perceptibly bad
<FlorianBad> why would switching cause problems?
<furq> viewers can often perceive the switch
<aaabbb> because going between crf 24 and crf 26 every few seconds is more jarring than just staying with crf 26
<furq> and also it takes time to detect whether you need to switch
<FlorianBad> oh I see, because if there's no constraint I would have to switch so far from the previous crf that there would just be a huge difference in quality?
<aaabbb> FlorianBad: not even so far, but just a little
<aaabbb> because when the crf changes, the encoder has to use different encoding techniques
<FlorianBad> hmm, really?
<aaabbb> yeah
<FlorianBad> oh I see, ok. So my program will have to be very hesitant in switching then. ok. Well then vbv makes sense
<aaabbb> you only want to switch if you are stuck between the decision "give this person crap quality even tho his bandwidth is plenty" and "let the video get stuck buffering over and over"
<aaabbb> vbv isn't to make switching less often, it's to stop massive spikes in bitrate
<FlorianBad> so is VBV just -maxrate and -bufsize ?
<furq> yeah
<aaabbb> vbv basically encodes, then decodes in real-time to verify how much buffer is needed
<furq> i don't know of any other streaming service that tries to do such granular switching
<furq> so this is stuff you'd want to experiment with
<aaabbb> and if you put minrate and maxrate at almost the same place, then that's what cbr is
<aaabbb> (you don't want cbr ofc)
<furq> e.g. youtube will just bump you down from 1080p60 to 480p30 and then never switch you back again
<furq> which isn't great either
<aaabbb> switching resolution is more jarring than switching crf, but you really don't want to switch crf often
<furq> also minrate doesn't do anything
<FlorianBad> ok, I will code with these in mind then
<FlorianBad> I was wondering also, switching audio probably produces clicks, right?
<furq> audio is such low bitrate that you should never need to switch it
<aaabbb> depends on where it is switching
<FlorianBad> or does the dash algorithm figure out a way to slice when the waveform is at zero?
<furq> youtube uses 128k aac or 128k opus for almost all video streams
<aaabbb> many codecs have like 200ms independent packets for example
<furq> you would usually have video and audio separate
<furq> so the audio just carries on using the same playlist/manifest when the video switches
<aaabbb> FlorianBad: it's not real-time, it will buffer and then "splice it together" no matter where the waveform is
<aaabbb> but usually you don't want to switch audio
<FlorianBad> ok
<aaabbb> especially if it's music video, also 128k aac will sound much worse than 128k opus
<furq> it really won't
<FlorianBad> I'll test just out of curiosity
<aaabbb> furq: sure it will, with classical music
<aaabbb> if aac-lc
<furq> which encoder
<FlorianBad> (test the change in kbps during play)
<aaabbb> fdk_aac (and especially twoloop native aac)
<furq> fdk compares very well to opus in the tests i've seen
<furq> at 96k and up
<furq> below that is where opus really shines
<aaabbb> i've found that it's only at about 128k and up
<aaabbb> from what i've read in abx testing
<furq> on account of he-aac sounds absolutely awful
<aaabbb> well, with certain music
<aaabbb> like classical
<aaabbb> classical can even cause 96k opus to fall into intensity stereo
<FlorianBad> for audio I'm pretty confident I will hear a ton of difference, I mixed music my whole life and I use the Focal Twin6be right now, which are absurdly bright due to the technology of their tweeter
<aaabbb> you gotta abx test before determining that you can hear a difference
<furq> if you can abx 128k opus from 128k fdk then fair enough
<furq> but they should both be transparent on most samples
<aaabbb> yeah most samples they are
<aaabbb> it's really classical music that stresses audio codecs
<aaabbb> and killer samples ofc but no one listens to that
<aaabbb> FlorianBad: these are music videos right?
<aaabbb> will people be allowed to download and archive or is it just streaming?
<aaabbb> if people will download it then you might want to have a higher bitrate option, like maybe even 160k
<furq> or just make the source video available
Muimi has quit [Quit: Going offline, see ya! (www.adiirc.com)]
<aaabbb> yeah that's a better idea lol, i'm dum
<aaabbb> i have a dumb question but for x264/x265, is there ever a reason *not* to disable the deblock filter if the source is lossless or perceptually lossless (ie no macroblocking whatsoever)?
lemourin3 has joined #ffmpeg
lemourin has quit [Killed (zirconium.libera.chat (Nickname regained by services))]
lemourin3 is now known as lemourin
<furq> deblocking is applied when decoding
<aaabbb> wow, my question really was dumb. thanks
<furq> also can you even turn it off
<furq> --deblock 0:0 is just the default value
<aaabbb> yeah you can, on both x264 and x265
<furq> oh never mind there's --no-deblock
<aaabbb> -x264opts no-deblock or -x264-params no-deblock=1
<FlorianBad> aaabbb, in the vast majority of the cases it will be music videos yes, piano performances
<aaabbb> oh x265, it's deblock=0:0:0 with the first being whether it's enabled or not
<aaabbb> FlorianBad: oh ok, both aac and opus are very good at piano iirc
<aaabbb> (deblock=1:0:0 being default on x265 iirc)
<aaabbb> furq: does that mean that a video's deblocking value can be changed without reencoding, if it's just metadata specifying what to do at decoding time?
<FlorianBad> aaabbb, this is 5-years old and the next step for me will involve getting my hand on amazing equipment so it will be much better filmed, but just for the idea of the style/mood: https://vimeo.com/574631289 (so not Tom Cruise lol)
<FlorianBad> although that's kind of a Tom Cruise thing :) https://vimeo.com/259280325
<aaabbb> just make sure you use either 2pass abr or crf+vbv
<aaabbb> and bigger buffer size = better quality (because vbv will be less aggressive)
<FlorianBad> I need to get some food, will read everything you guys wrote in 15-20min, thanks again for the help
<furq> i have no idea if you can change it but it is also applied during encoding before prediction
<furq> so if you can change it then it'll probably lead to bad results
<aaabbb> furq: oh ok that makes sense
orthoplex64 has joined #ffmpeg
minimal has quit [Quit: Leaving]
<aaabbb> one more dumb question i've been wondering for a while, but why is a flac file with mono turned into stereo smaller than when downmixing it to mono?
<furq> no idea why it would be smaller
<furq> i would expect it to be exactly the same size
<aaabbb> i have an input that is 2ch stereo where each channel is 100% identical, when i do -ac 1 it's more than 20% larger
<furq> maybe swresample is doing something funky
<aaabbb> sample rate doesn't change, sample type doesn't change, and i can losslessly convert it back to the original 2ch just by duplicating the mono
<furq> or maybe not
<aaabbb> hmm
<aaabbb> could it be applying dither?
<furq> might be a question for #xiph if people still talk in there
<aaabbb> maybe i was wrong about it being losslessly convertible back, i'll test again
<aaabbb> furq: if it's swresample then it would be an ffmpeg thing and not a libflac thing
<furq> sure
<aaabbb> i'll try selecting one stream instead of mixing them, my guess is that it's adding dither when mixing
<furq> if you can recreate it losslessly then it's definitely not swr
<furq> if you can't then it probably is
<furq> you can use -af channelmap if you want to eliminate that possibility
<furq> -af channelmap=FL:channel_layout=mono
Epakai_ is now known as Epakai
<furq> also assuming you're using libflac and not lavc flac
<aaabbb> just -c:a flac
<aaabbb> which i think uses libflac doesn't it?
<aaabbb> i used -af "pan=mono|c0=c1"
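One way to check whether the mono conversion really is losslessly reversible, using ffmpeg's hash muxer to compare decoded PCM (a sketch; filenames are illustrative):

```shell
# Take one channel, duplicate it back to stereo,
# then compare decoded-audio hashes against the original
ffmpeg -i stereo.flac -af "pan=mono|c0=c0" -c:a flac mono.flac
ffmpeg -i mono.flac -af "pan=stereo|c0=c0|c1=c0" -c:a flac restored.flac
ffmpeg -i stereo.flac -map 0:a -f hash -
ffmpeg -i restored.flac -map 0:a -f hash -
```

If the two hashes match, nothing (dither included) was altered on the round trip.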
blaze has quit [Ping timeout: 246 seconds]
blaze has joined #ffmpeg
<furq> ffmpeg can't use libflac
<furq> it's only got the internal encoder
<aaabbb> i thought that was based on libflac
<furq> it's loosely based on flake
<furq> but they diverged years ago
<aaabbb> is libflac superior?
<furq> they're about the same i think
<furq> last i checked flac performed very slightly better
<furq> but not enough to bother switching
<furq> also that was just comparing flac -8 and lavc -compression_level 10
<furq> there's more extreme settings for both
<aaabbb> i use -compression_level 12
<aaabbb> i know it's non-subset but i only decode it with ffmpeg
<furq> yeah idk how well flac does with -ep or --freeformat or whatever
<aaabbb> and sometimes cholesky which occasionally has better compression than levinson
<FlorianBad> furq, I don't know about opus vs libfdk_aac, but earlier today I listened to libfdk_aac 320 vs 128 and the difference was huge. But again, I have the best monitors and trained my ears for years to hear these things
<aaabbb> FlorianBad: did you do abx testing?
<furq> like i said if you can abx it then more power to you
Muimi has joined #ffmpeg
<aaabbb> because except for killer samples or certain types of samples, you won't notice a difference between 320 or 128 (and even with certain types of samples, not between 160 and 320)
<aaabbb> if you can tell the difference between say 160 and 320 for a typical (non pathological) sample, then you are the only human being on earth to do so
<FlorianBad> aaabbb, no download, in fact my player modifies the bytes of video data to make it very complicated to download the content. The player controls are also part of the canvas itself so it's not like you can grab the video element in the page and do things with it either. Still technically possible but extremely difficult, especially with the amount of code obfuscation I use
<aaabbb> i meant downloading the audio since it's a music video
Marth64 has joined #ffmpeg
<FlorianBad> aaabbb, in a way it was abx because I didn't realize which one I was playing until I noticed the degradation in sound quality, and then I looked and realized it was 128kbps versus 320 for the previous video I was looking at (I was really just looking at pictures, but then I noticed the sound difference)
<aaabbb> that's a/b and not abx
<aaabbb> but it has to be done multiple times to be valid
<FlorianBad> ok, fair enough, but trust me it was too obvious
<aaabbb> and it's possible that the 128k sample had something that played bad with libfdk_aac
<aaabbb> it might be but if that's the case then something is wrong with libfdk_aac
<FlorianBad> well I was using the main theme of Gladiator as audio input, so that would definitely fit in your "classical music" thing
<aaabbb> ah
<aaabbb> yep that would make sense. did it have symbols too?
<FlorianBad> ah no, no audio nor video download, if I want download I'll put that separate in the page, not part of the player files at all, but I probably won't
<aaabbb> symbals*
<furq> c
<aaabbb> cymbals
MightyBOB has quit [Ping timeout: 260 seconds]
<FlorianBad> yeah but it's mostly string ensembles that I developed a really good ear for over the years
MightyBOB has joined #ffmpeg
<aaabbb> yeah that makes sense, 128k aac is probably not going to be transparent for that kind of thing
<FlorianBad> aaabbb, listen to that second part after 2:05 : https://vimeo.com/168439494
<aaabbb> can't actually do sound, no speakers on this comp
<FlorianBad> I literally spent my whole childhood making strings sound like that, editing one note at a time lol
<FlorianBad> And now when I listen to it on vimeo obviously this sounds absolutely AWFUL, they probably use 128kbps or something really bad
<aaabbb> and it might have gone through generation loss too
<FlorianBad> (This one was not in my childhood it was just 10 years ago, but it's the same kind of sounds)
<FlorianBad> So the -bufsize will determine how dash can slice these segments afterwards?
<FlorianBad> meaning it won't be able to slice smaller, but it could slice larger?
<aaabbb> it's not about the segments, it's about the decoder's buffer
<FlorianBad> I know but I will then use that mp4 file to put in dash
<aaabbb> the only thing that matters about the segments is that they're independently decodable
<FlorianBad> well, their size too, I don't want a 10MB segment
<FlorianBad> I actually want them as small as possible because my program actually puts them in an object when sending to client, so it can have many in a single request
<FlorianBad> so the smaller the better as long as it doesn't have any other downsides
<aaabbb> it does have more downsides
<aaabbb> worse compression efficiency
<aaabbb> you want the segment to fit in whatever unit of response you give
<aaabbb> so if you send 1mb chunk at a time, you want a segment ~1mb
waleee has quit [Ping timeout: 240 seconds]
<FlorianBad> Ideally I'd like segments to correspond to an approximate length, probably about 1s
<aaabbb> yeah that's fine but if you're sending it in chunks that have 5s worth of content, then you want the segments to be 5s
<aaabbb> the longer it is, the more efficient the compression is and the less bandwidth you use for the same quality
<kepstin> for a vod application, i'd probably look at something in the 5-10s length as an encoding efficiency compromise.
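A sketch of what the segmenting side could look like with ffmpeg's dash muxer, assuming a 24 fps source and GOPs aligned to the segment length (values illustrative):

```shell
# 5 s segments at 24 fps => keyframe every 120 frames;
# scenecut disabled so keyframes land only on segment boundaries
ffmpeg -i input.mp4 -c:v libx264 -crf 22 -maxrate 6M -bufsize 12M \
    -g 120 -keyint_min 120 -x264-params scenecut=0 \
    -c:a aac -b:a 128k \
    -f dash -seg_duration 5 out.mpd
```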
<FlorianBad> how bad of a difference in compression % are we talking about (just guessing) between a 5s buffer and a 1s?
<aaabbb> FlorianBad: i don't know about percentage and it depends on the video contents, but it's not insignificant
<aaabbb> more s means you can have more consecutive bframes and a much lower i to b/p ratio
<kepstin> for reference, note that the default gop size with libx264 is 240 frames, so 10s at 24fps
<FlorianBad> kepstin, no because in the first few seconds my program will gather as much stats as possible about the bandwidth and fps of the client, so I don't want to be stuck for a whole 5s
<aaabbb> FlorianBad: bandwidth especially with qos is an interesting thing, gathering the first few seconds might not give you anything accurate because of tcp sliding window
<furq> shorter segments also makes the buffer more likely to run out
<FlorianBad> aaabbb, not just that... the user presses F and goes full screen, now I have to change the resolution, but I'm stuck with the small one for another 5s
<kepstin> and for buffering vbr content, you want the client to buffer several chunks ahead if possible to workaround issues with tcp scaling
<furq> especially if you're trying this hard to match the bandwidth the client has
<aaabbb> FlorianBad: you shouldn't be changing resolution that fast anyway, even 10s isn't excessive
<FlorianBad> aaabbb, I can, because it's all in canvas so no one will really know, it's all handled in the way I draw in that same canvas
<kepstin> people will know, because quality switch is a visual difference
<aaabbb> FlorianBad: but will the decrease in quality from a small gop size be worth it?
<FlorianBad> sure, but that's a good thing, so I can switch very quickly after they went fullscreen
<kepstin> you want to avoid switching quality as much as possible, since it gives a worse experience
<FlorianBad> (for example, among many other things)
<FlorianBad> So -gop is what determines how small dash will slice?
<aaabbb> the gop size will be a limiting factor
<kepstin> you can create chunks with multiple gops, you can't create chunks smaller than a gop
<aaabbb> because you can't predict across slices
<aaabbb> yeah and generally you don't want a chunk to have multiple gops unless the chunk is really really giant
<FlorianBad> kepstin, right, ok
<FlorianBad> I see
<aaabbb> bigger gop = more efficiency but slower seeking (and for realistic chunk size, the seeking won't be a problem)
<FlorianBad> so then with my -crf options I should definitely use all 3 options: -maxrate -bufsize and -g ?
<aaabbb> you might want to turn off scenecut detection too
lolok has quit [Remote host closed the connection]
<aaabbb> but definitely use maxrate and bufsize with crf
<FlorianBad> Also I assume -g should be a multiple of keyint ?
<aaabbb> -g is literally keyint
<FlorianBad> I did that, I use -x264-params "scenecut=0:keyint=xxx"
<aaabbb> -g 300 is the same as -x264-params 'keyint=300'
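Illustrating that equivalence (filenames illustrative; note scenecut can still insert extra keyframes unless disabled):

```shell
# These two produce the same maximum keyframe interval:
ffmpeg -i in.mp4 -c:v libx264 -g 300 out.mp4
ffmpeg -i in.mp4 -c:v libx264 -x264-params keyint=300 out2.mp4
```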
<FlorianBad> ah! well, then I already have -g :) lol
<FlorianBad> (That term "group of pictures" seems misleading, but anyway)
<kepstin> for reference, i _think_ google is using 10s gop for vod youtube content
<aaabbb> it's even more misleading when you realize the difference between open gop and closed jop ;)
<aaabbb> s/jop/gop
<aaabbb> x264 defaults to closed tho. it's slightly less compression efficiency but you need it for dash
<kepstin> (youtube's bandwidth estimation routinely underestimates my connection speed, but it still guesses a value high enough to play videos at max available quality)
<FlorianBad> kepstin, when quality gets bad (like crf 25) I can clearly notice the keyint like a clock on the picture, when I set keyint to fps I can clearly see everything change every second... So for that reason I think I'll just use a fps/2 keyint (500ms) which looks a lot better
<aaabbb> fps/2 is a very bad iea
<aaabbb> idea
<aaabbb> that's extremely short gop, plus switching often is not good anyway
<aaabbb> FlorianBad: the sudden quality change at each keyint is caused by the qp ratio for i and p/b frames
<FlorianBad> that doesn't mean I will switch between bitrates, it just means these I-frames won't be so obvious on the screen
<kepstin> if you see keyint causing issues in crf mode, that might mean your vbv limits are too low causing it to run in bandwidth limited mode instead of crf mode :(
<aaabbb> just set the qp to be higher for i frames and lower for p/b frames and that sudden "jump" as the i frame hits will go away
<kepstin> the defaults combined with crf mode _should_ cause reasonably consistent quality over a video
<FlorianBad> kepstin, I had no vbv option set thus far, and -crf 25 with keyint=fps looked really bad
<aaabbb> FlorianBad: set it larger than fps then
<aaabbb> like fps*5 or fps*10, and if you keep getting that "jump", change the qp for i frames
<FlorianBad> But it probably depends on the type of video. I was testing with that ARRI Alexa footage of the woman that I posted a few days ago
<FlorianBad> (so little subtle movements of the hair... these keyframes made that terrible)
<aaabbb> reducing keyint is the wrong solution
<FlorianBad> aaabbb, hmm, interested in knowing what that means: "qp to be higher for i frames and lower for p/b frames" :)
<aaabbb> FlorianBad: so the reason it "jumps" is because an i frame is like a still picture, like a jpeg. p frames and b frames only hold differences. a lower i frame qp (ie higher quality i frame) means that you need less bits in p/b frames to keep quality, a lower quality (higher qp) i frame means that you need more bits in the p/b frames to "make up for that"
<aaabbb> the reason you see a jump is because the quality of the p/b frames is kinda low, so you get errors that accumulate, until it hits an i frame and it suddenly jumps up in quality
<kepstin> well, it's actually more common to get the other way around, where the i frame is kinda low quality, and the p/b frames 'repair' it over time - that's caused by bitrate limits.
<kepstin> usually caused by*
<aaabbb> kepstin: yeah but if he's getting those sudden jumps in quality, it's not that
<kepstin> since i frames are _huge_. in most cases, the i frame is the majority of the gop, and the predicted frames are tiny in comparison.
<kepstin> so if the i frame doesn't fit in the vbv buffer bandwidth budget, then it has to be encoded smaller (lower quality), but the predicted frames can still be encoded full quality.
<aaabbb> that's also why having a very small gop like just 12 frames is a bad idea
<FlorianBad> aaabbb, hmm so you're talking about adjusting the BALANCE in quality of these keyframes (I-frames) vs the ones in between? what parameters do that?
<kepstin> yeah, the lower quality keyframes is more commonly an issue in _live_ video.
<furq> ipratio
<aaabbb> FlorianBad: yeah the balance, if you're having problems with the keyframe causing an unpleasant jump in quality
<aaabbb> default ipratio is 1.4 for x264
<kepstin> i.e. if a keyframe is qp=23, then the pframe will be qp=32.2
<kepstin> (note that while qp and crf use the same number scale in libx264, they don't mean the same thing)
<aaabbb> FlorianBad: intra refresh is also a possibility
<aaabbb> that will do gradual i frame like updates instead of one sudden change, but that also slightly decreases compression efficiency
<kepstin> hmm, intra refresh is more problematic than it's worth imo
<kepstin> you don't get "keyframes" then, so you can't quality switch
<FlorianBad> I don't see an ipratio option in `ffmpeg -h encoder=libx264` it's not from libx264?
<aaabbb> kepstin: ahh good point
<furq> it's not exposed in ffmpeg
<aaabbb> FlorianBad: it is but you gotta use x264-params
<furq> ^
<FlorianBad> ah I see, the other extra params, thanks
<aaabbb> the ffmpeg options just make it easier to do it with flags to ffmpeg, like "-direct_pred 1" is just an alternate way of doing "-x264-params direct=spatial"
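A sketch of adjusting the I/P and P/B quantizer ratios through -x264-params (the 1.30/1.25 values are just an illustrative starting point to tune by eye):

```shell
# Lower ipratio => I-frames get relatively more bits,
# reducing the visible quality "pop" at each keyframe
ffmpeg -i input.mp4 -c:v libx264 -crf 22 \
    -x264-params "ipratio=1.30:pbratio=1.25" output.mp4
```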
<kepstin> i wanted to use intra refresh in some live stuff, but i had issues with some decoders/players not being able to start the stream when joining late because they never saw a frame marked as a keyframe, so they never started decoding.
<kepstin> that was back in the days of flash tho, i dunno if it's gotten better
<furq> seems like it would be totally useless if that was still the case
<aaabbb> FlorianBad: just adjust the ratio until the subjective quality improves. for testing, i recommend 2pass abr so that you can do actual comparisons of quality at the same bitrate
<aaabbb> even if you'll use crf+vbv in production
<furq> you should still use vbv with 2pass
<aaabbb> furq: even for just testing to compare ipratio?
<furq> i meant for streaming but it's probably better to use your actual settings for testing
<FlorianBad> furq, but 2pass means no crf ?
<aaabbb> 2pass is abr not crf
<aaabbb> but i just meant for testing
<kepstin> the quality and bitrate you get from crf depends on other options - it's not "fixed". so in order to do a fair comparison, you need to fix one of those things, and fixing bitrate is easier to do, and more relevant for streaming applications
<aaabbb> ^
<FlorianBad> aaabbb, yeah but he doesn't mean for testing
<aaabbb> i think he was replying to me saying 2pass abr
<FlorianBad> ok
<aaabbb> like crf 20 with ultrafast will look way way worse than crf 21 with veryslow
<aaabbb> and changing ipratio counts as changing settings
<FlorianBad> but in the same time, for some videos it's absurd to have a high bitrate that isn't necessary because nothing moves
<FlorianBad> kepstin, so I should not set qp value, just crf and ipratio?
<kepstin> using too small of a keyint means that you need high bitrate even if nothing moves, since it can't make use of the fact that nothing moves to improve compression.
<aaabbb> FlorianBad: never ever set qp directly
<aaabbb> that's cqp which is inferior to crf
<kepstin> unless you're trying to use lossless mode :)
<aaabbb> well -crf 0 also works for lossless (except 10 bit)
<FlorianBad> Interesting that it says "Usually, you'll want to lower these from the defaults of 1.40 for ipratio and 1.30 for pbratio" : https://silentaperture.gitlab.io/mdbook-guide/encoding/x264.html#motion-estimation-range
<aaabbb> FlorianBad: that's not setting qp directly, that's just setting some biases
<FlorianBad> kepstin, ok, I probably won't need a long keyint once I've figured out the ratio thing
<FlorianBad> aaabbb, ok I wasn't planning to do it, just wanted to confirm
<kepstin> FlorianBad: a keyframe (idr frame) is a point in the video at which it's not allowed to reference (reuse data from) earlier frames. so the more often you have an idr frame, the less the codec is able to take advantage of re-using data from parts of the frame that don't change.
<aaabbb> ^ and at the extreme end where you are i frame only, you're no better than mjpeg
<kepstin> so having larger gop size is important for reducing the bitrate needed for a given quality of video, especially if there's limited motion.
<FlorianBad> keyint=1 :)
<furq> you are still quite a bit better than mjpeg
<kepstin> i mean, x264 has better intra-only compression than jpeg, so it's not _as_ bad as mjpeg ;)
<furq> see
<kepstin> you'd have to go back to mpeg-1/2 for intra-only to be the same as mjpeg.
<FlorianBad> kepstin, what I'm wondering is at what point the difference between a given keyint (gop) and another 50% greater one becomes something like 3% in savings...
<furq> you'd have to test that
<furq> there's not much benefit to using short gops for vod
<furq> but i guess the thing you want to do is one of the times it would make some kind of sense
<aaabbb> it totally depends on the amount of motion
LionEagle has quit [Read error: Connection reset by peer]
<kepstin> using longer gops shouldn't ever make quality per bit _worse_, since modernish codecs like h264 have the ability to encode individual parts of frames as non-predicted blocks even in the middle of a gop on high motion stuff.
<kepstin> (the purpose of scenecut detection is simply to make sure that the codec doesn't encode a bunch of non-predicted blocks near the end of a gop right before an idr frame, resulting in wasting bandwidth by immediately re-sending those blocks; instead it moves up the idr frame)
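The gop/scenecut knobs above map onto x264 options roughly like this (the values are illustrative, not recommendations):

```shell
# gop of up to 10s at 24fps, but never shorter than 1s; scenecut
# detection moves the idr frame up to a scene change instead of wasting
# intra blocks right before a scheduled keyframe
ffmpeg -i input.mp4 -c:v libx264 -crf 23 \
  -x264-params keyint=240:min-keyint=24:scenecut=40 \
  -c:a copy output.mp4
```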
<FlorianBad> well another reason I don't want too long keyint is because when the user moves the playback bar it will show some gray crap for a long while
<furq> it should never do that
<FlorianBad> so you might say, just force them to start playing at the nearest keyframe, but then it can be a little annoying if that's 10s accurate...
<furq> you don't need to do that either
<FlorianBad> hmm?
<FlorianBad> ok, so a proper player will go back and grab the previous I-frame?
<furq> the decoder needs to do that but you don't need to present the frames in between
<aaabbb> if it's doing gray crap then there's a bug
<furq> it would probably be green but yeah
<FlorianBad> well, so far I just noticed that on ... VLC ;)
<kepstin> seeking should normally only cause a pause for buffering, which means that the last shown video frame will be held until enough data is available to decode the frame being seeked to, at which point that frame will be shown and playback will resume.
LionEagle has joined #ffmpeg
<aaabbb> if the seek isn't in the buffer ofc
<kepstin> (youtube buffers up to 70s ahead in my experience)
<kepstin> buffering ahead gives you better ability to dynamically quality switch, not worse, since you can try downloading higher quality replacements for upcoming already downloaded stuff and improve your bandwidth estimation - and then give up and keep playing the existing quality without a "bump up then down" if there's not enough bandwidth available :)
<aaabbb> but yes it has to grab the previous i frame (specifically idr frame, aka an i frame in a closed gop), it's impossible to decode a p or b frame without referencing the i frame
<aaabbb> kepstin: doesn't av1 have some kind of magic frame type that lets it send low and high quality at once? or am i confused with a dream i had?
<furq> maybe you're thinking of lcevc
<aaabbb> oh i was thinking of S frames in av1
deetwelve has quit [Quit: null]
deetwelve has joined #ffmpeg
<kepstin> the weird alternate frame stuff in av1/vp9 are a workaround to avoid patents on re-ordered frames
fling has quit [Ping timeout: 255 seconds]
lockywolf is now known as bigot_age_dude_l
bigot_age_dude_l is now known as lockywolf
lolok has joined #ffmpeg
<aaabbb> and yet still some assholes created a patent pool to extort people
ivanich has quit [Ping timeout: 264 seconds]
Tano has quit [Quit: WeeChat 4.1.2]
fling has joined #ffmpeg
stolen has joined #ffmpeg
Ogobaga has quit [Quit: Konversation terminated!]
Ogobaga has joined #ffmpeg
<FlorianBad> Ok so I think I have the mp4 encoding pretty much figured out (to tweak later if my tests in the player show issues). So my next step is slicing that with -c copy -f dash. Will start on that tomorrow (10pm here)
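The -c copy -f dash step could look something like this (segment length and filenames are assumptions):

```shell
# package an already-encoded mp4 into dash segments without re-encoding;
# -seg_duration should line up with the gop so segments start on keyframes
ffmpeg -i encoded.mp4 -c copy -f dash \
  -seg_duration 4 -use_template 1 -use_timeline 1 manifest.mpd
```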
<FlorianBad> Thanks again aaabbb, furq, and kepstin for your very valuable help! :)
AbleBacon has quit [Read error: Connection reset by peer]
Marth64 has quit [Ping timeout: 268 seconds]
epony has joined #ffmpeg
<aaabbb> FlorianBad: out of curiosity, what ipratio did you find helpful in your case?
<FlorianBad> I haven't tested yet, but will soon. For now I just put ipratio=1.0:pbratio=1.0 in my script with a TODO to test that later in details
<furq> pbratio doesn't do anything unless you disable mbtree
<furq> please don't disable mbtree
<FlorianBad> ah, ok thanks
epony has quit [Read error: Connection reset by peer]
epony has joined #ffmpeg
vincejv has quit [Quit: Bye bye! Leaving for now...]
<aaabbb> FlorianBad: pbratio just picks between p and b frame qp, but both p and b frames are inter frames so they aren't a problem. if you want to have more b frames, change b_bias
<aaabbb> as far as your issue is concerned all that matters is the ratio between i frames and non i frames
<FlorianBad> quite frankly I don't even understand the difference between b and p frames ;)
<FlorianBad> right, I will test tomorrow to see
<furq> p predicts from past frames, b predicts from both past and future frames
<furq> b = bidirectional
<aaabbb> and having lots of consecutive b frames usually improves compression efficiency (but not always). badapt will adaptively place b frames in the optimal places. just let x264 do its job that way
<furq> yes
<furq> you can get very deep into the weeds with x264 settings but it's mostly not worth it
<FlorianBad> furq, AH! thanks, well then why P frames at all then instead of only B ?
<FlorianBad> ah
<aaabbb> because eventually it's less efficient
<FlorianBad> I see, because B already gives info about next frames which means more data?
<furq> it requires info from subsequent frames which means they already have to be in the decoded picture buffer
<furq> which is potentially only a few frames in size
<FlorianBad> and not the last ones, right?
<aaabbb> the last of a gop is always a p frame (for closed gop)
<FlorianBad> which also means I don't get their benefit if keyint is too small, right?
<aaabbb> exactly
<FlorianBad> ok
<furq> well you get a bit less benefit
<FlorianBad> keyint and bufsize I guess
<furq> bufsize makes no difference
<furq> that's not the same thing as the DPB
<FlorianBad> ah ok
<aaabbb> you can have up to 16 b frames consecutively, so it's kinda silly to have a gop of only 12 frames
<furq> the DPB size is the level
<aaabbb> ofc badapt=2 (the default on higher presets) will rarely achieve 16 consecutive because it's not usually ideal, but still
<furq> very high bframe values will really slow down the encode
<furq> often for little benefit
<furq> although if you have a lot of static shots then it works out
<furq> the x264 post-encode log will show how often it used n consecutive bframes so you can see if it was worth it
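A sketch of raising the B-frame ceiling as discussed; whether it pays off is content-dependent, so check the consecutive-B-frames line x264 prints after the encode:

```shell
# bframes=16 is the maximum; b-adapt=2 (the default on slower presets)
# decides where consecutive b frames actually help, so the effective
# run length usually stays well below 16
ffmpeg -i input.mp4 -c:v libx264 -preset slower -crf 23 \
  -x264-params bframes=16:b-adapt=2 -an output.mp4
```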
<JEEB> x264 was basically the point where encoding got simplified. in most cases you'd tweak two values: preset for how much time you want to spend on compression/analysis, and CRF for the quantizer range (compression vs quality)
<JEEB> man I still recall before x264 got presets *shrug*
<furq> yeah 9 times out of 10 i just use preset, tune and crf
<aaabbb> speaking of post-encode log, i noticed that it was using 100% spatial 0% temporal direct mvs, but when i do -direct_pred spatial, the bitrate actually goes *up* when i expect there would be no change (or even lower bitrate with less metadata), why is that?
<aaabbb> but when there's 0% weightp, when i turn off weightp, there's naturally less overhead and bitrate goes down
<JEEB> pick like 2100 frames, minute or two of content that more or less represents what you're encoding (-ss SECONDS and -t SECONDS can be utilized to limit the encode to not the full input with ffmpeg cli). figure out the slowest preset you're willing to take, then adjust CRF starting with 23 (default) and going down if it looks bad, and up if it looks good.
<furq> or just encode the whole thing with preset veryslow tune film crf 19
<furq> and then if it's too big then decide you've already spent too much time and just leave it
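JEEB's test procedure, written out as a command (the seek offset, duration and starting CRF follow his description; filenames are placeholders):

```shell
# encode a minute or two from a representative part of the source at the
# slowest preset you'll tolerate; lower -crf if it looks bad, raise it
# if it looks fine
ffmpeg -ss 300 -t 90 -i input.mp4 \
  -c:v libx264 -preset veryslow -tune film -crf 23 \
  -an sample_crf23.mp4
```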
<JEEB> I don't believe in magical CRF values :D
<JEEB> I might have used 19.5 for 720p24 animation at 10bit, but that's due to actually testing first with similar content.
<furq> you should try 19
<furq> it's pretty good
<JEEB> I've used that for other stuff too :P bottom line, don't like magical numbers. at least it's not 18 that some people just parrot :D
<CounterPillow> nah man 19.14 is the absolute peak
<furq> well 18 is just silly
<JEEB> also I've dealt with some dark SD content where I've had to go bonkers with CRF 14 because x264 would otherwise decide it can compress the dark spots a bit too much
<JEEB> although I think that was with 8bit, should check that stuff with 10bit at some point
<furq> you'll be using zones next
<furq> slippery slope
<aaabbb> furq: in testing out the absolute best i could get with lossless x264, i started using zones extensively
<aaabbb> just for fun
<JEEB> furq: I've done debanding with manual area definitions :P
<JEEB> "oh this shotgun barrel has banding in it from the master"
<furq> well that's just necessary
<JEEB> not sure if area is the correct word, basically part of the frame from that range of frames :D
<furq> spend days on your vapoursynth script if you have to
<furq> and then guess what: it's crf 19
<JEEB> depends on the content vOv
<JEEB> also it's funny how my threshold for "how slow can encoding go before I can't take it" has been around 0.5-6fps since 2006 (I used to have a dual core AMD Turion laptop). and x264 by now can easily do a 1080p50 live stream at preset veryslow :D
<JEEB> newer formats having more options/variables at least let me go slow enough that I dislike it :D
rv1sr has joined #ffmpeg
<aaabbb> JEEB: you can always set merange to a super high value if x264 is not slow enough ;)
Muimi has quit [Remote host closed the connection]
Suchiman has joined #ffmpeg
YuGiOhJCJ has joined #ffmpeg
whatsupdoc has quit [Quit: Connection closed for inactivity]
cc0 has quit [Ping timeout: 260 seconds]
lullerhaus has quit [Ping timeout: 256 seconds]
lullerhaus has joined #ffmpeg
cc0_ has joined #ffmpeg
Ogobaga has quit [Quit: Konversation terminated!]
Ogobaga has joined #ffmpeg
stolen has quit [Quit: Connection closed for inactivity]
bitblit has quit [Ping timeout: 268 seconds]
j45 has quit [Ping timeout: 240 seconds]
j45 has joined #ffmpeg
ivanich has joined #ffmpeg
ivanich has quit [Ping timeout: 252 seconds]
Vonter has quit [Ping timeout: 268 seconds]
Vonter has joined #ffmpeg
j45 has quit [Quit: ZNC 1.8.2 - https://znc.in]
MootPoot has quit [Quit: Connection closed for inactivity]
j45 has joined #ffmpeg
cc0_ is now known as cc0
epony has quit [Remote host closed the connection]
vincejv has joined #ffmpeg
fossdd has quit [Ping timeout: 255 seconds]
fossdd has joined #ffmpeg
jb3 has quit [Quit: ZNC 1.8.2 - https://znc.in]
epony has joined #ffmpeg
bitblit has joined #ffmpeg
waleee has joined #ffmpeg
Tano has joined #ffmpeg
LionEagle has quit [Read error: Connection reset by peer]
stolen has joined #ffmpeg
sm1999 has quit [Quit: WeeChat 4.3.0-dev]
rish has joined #ffmpeg
<rish> Does anybody know if I can install ffmpeg 6.1.1 on Debian 12 from some repo?
Blacker47 has joined #ffmpeg
sm1999 has joined #ffmpeg
<JEEB> possibly but I'd probably just grab an automated linux 64bit build from BtbN's github setup
<JEEB> since BtbN is part of the community
TheElixZammuto has quit [Remote host closed the connection]
Ogobaga has quit [Quit: Konversation terminated!]
Ogobaga has joined #ffmpeg
lexano has joined #ffmpeg
lavaball has joined #ffmpeg
vampirefrog has quit [Ping timeout: 268 seconds]
<CounterPillow> Yeah grab a static build. replacing your distro ffmpeg would involve replacing everything that links against its libraries, and a deb-packaged build that ships its own libraries without replacing the distro ffmpeg may conflict with it depending on sonames and such
navi has joined #ffmpeg
MootPoot has joined #ffmpeg
navi has quit [Ping timeout: 264 seconds]
Ogobaga has quit [Quit: Konversation terminated!]
navi has joined #ffmpeg
ivanich has joined #ffmpeg
Ogobaga has joined #ffmpeg
YuGiOhJCJ has quit [Quit: YuGiOhJCJ]
ivanich has quit [Remote host closed the connection]
Nixkernal has joined #ffmpeg
Nixkernal has quit [Client Quit]
<LimeOn> then you can make an alias so you can use that ffmpeg version easily from terminal
minimal has joined #ffmpeg
rvalue has quit [Quit: ZNC - https://znc.in]
rvalue has joined #ffmpeg
epony has quit [Remote host closed the connection]
bibble has quit [Quit: bibberly bobberly]
alexherbo2 has joined #ffmpeg
billchenchina has joined #ffmpeg
psykose has quit [Remote host closed the connection]
bibble has joined #ffmpeg
psykose has joined #ffmpeg
billchenchina has quit [Remote host closed the connection]
waleee has quit [Quit: updating stuff]
pikapika is now known as militantorc
stolen has quit [Quit: Connection closed for inactivity]
psykose has quit [Remote host closed the connection]
minimal has quit [Quit: Leaving]
Starz0r_ has quit [Ping timeout: 268 seconds]
Starz0r has joined #ffmpeg
TuxJobs has joined #ffmpeg
<TuxJobs> Problem: In mpv, I navigate to a black frame and press the "c" button. This calls a script I've made which uses FFMPEG to take the video file, cut out from the given timestamp to the end, and save this as a new video. Frustratingly, it sometimes doesn't work right. Even though the video frame in mpv was 100% black, the first frame(s) in the output video has the nagscreen which flicker by. Very annoying. It seems to work on MOST videos, but behaves like
<TuxJobs> this for some. I've also noticed that there can be horrible audio/video desync issues caused by doing this. What could be the reason for this? Both the videos that work and those that don't are typically .mp4. Example command: `ffmpeg -loglevel quiet -y -ss '00:00:03.133' -i '/home/me/Desktop/test_in.mp4' -c copy '/home/me/Desktop/test_out.mp4'`
<CounterPillow> <CounterPillow> you are doing a stream copy
<CounterPillow> <CounterPillow> you cannot cut frame perfectly
<CounterPillow> <CounterPillow> modern codecs do not work like this
AbleBacon has joined #ffmpeg
APic has quit [Ping timeout: 268 seconds]
APic has joined #ffmpeg
intrac has quit [Quit: Konversation terminated!]
intrac has joined #ffmpeg
epony has joined #ffmpeg
ivanich has joined #ffmpeg
rv1sr has quit []
Vonter has quit [Quit: WeeChat 4.2.1]
Vonter has joined #ffmpeg
fling has quit [Remote host closed the connection]
lucasta has joined #ffmpeg
fling has joined #ffmpeg
treefrob has quit [Ping timeout: 240 seconds]
treefrob has joined #ffmpeg
minimal has joined #ffmpeg
Dotz0cat has quit [Ping timeout: 256 seconds]
jarthur has joined #ffmpeg
Dotz0cat has joined #ffmpeg
treefrob has quit [Ping timeout: 264 seconds]
lns has joined #ffmpeg
gvg__ has quit [Ping timeout: 264 seconds]
gvg has joined #ffmpeg
gvg_ has joined #ffmpeg
gvg___ has quit [Ping timeout: 240 seconds]
rvalue has quit [Ping timeout: 252 seconds]
treefrob has joined #ffmpeg
rvalue has joined #ffmpeg
mrelcee has quit [Quit: I want Waffles!]
mrelcee has joined #ffmpeg
lucasta has quit [Quit: Leaving]
irrgit has joined #ffmpeg
MrZeus has joined #ffmpeg
lucasta has joined #ffmpeg
rv1sr has joined #ffmpeg
ivanich_ has joined #ffmpeg
ivanich has quit [Read error: Connection reset by peer]
treefrob has quit [Ping timeout: 256 seconds]
Ogobaga has quit [Ping timeout: 264 seconds]
Ogobaga has joined #ffmpeg
waleee has joined #ffmpeg
cosimone has joined #ffmpeg
Ogobaga has quit [Ping timeout: 246 seconds]
treefrob has joined #ffmpeg
lavaball has quit [Quit: lavaball]
lavaball has joined #ffmpeg
<TuxJobs> Why not?
<TuxJobs> It would just "play forward" from the last keyframe or whatever, internally?
<TuxJobs> What is the problem exactly?
<kepstin> you can't "play forward" from the last keyframe unless you copy starting at the keyframe before the cut point into the video
<kepstin> and then in most formats there's no way to say "actually don't show the start of the video, but hide it until time T"
<TuxJobs> That's what I mean: FFMPEG would do this at the time when it's converting the video to the new, shorter video.
<furq> that would work if you were reencoding
<TuxJobs> Hmm.
<kepstin> but you can't if you're copying, that needs to re-encode
<kepstin> ffmpeg does do that by default when you re-encode, in fact.
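So a frame-accurate cut means re-encoding at least the video stream; a sketch based on TuxJobs's command (the CRF value is an assumption):

```shell
# -ss before -i seeks to the nearest earlier keyframe, then ffmpeg
# decodes forward and discards frames up to the exact timestamp;
# re-encoding the video makes the cut frame-accurate, audio can stay copied
ffmpeg -y -ss '00:00:03.133' -i test_in.mp4 \
  -c:v libx264 -crf 18 -preset medium -c:a copy test_out.mp4
```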
<TuxJobs> Well, is there some way to know where a keyframe begins, then, in mpv? Would be nice to be able to instead of moving frame by frame (as I do right now), move "keyframe to keyframe" instead.
<furq> you can disable hr-seek
<TuxJobs> Well, re-encoding inevitably means quality loss, no?
WereSquirrel has quit [Ping timeout: 260 seconds]
<furq> with hr-seek=absolute relative seeks will always be to a keyframe
<furq> unless the key is bound to "seek 5 exact" etc
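As an mpv.conf fragment (a config sketch of what furq describes):

```
# mpv.conf: absolute seeks (e.g. clicking the seek bar) stay exact,
# while relative seeks snap to keyframes unless a binding uses "exact"
hr-seek=absolute
```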
Tano has quit [Ping timeout: 256 seconds]
ivanich_ has quit [Remote host closed the connection]
realies has quit [Read error: Connection reset by peer]
Ogobaga has joined #ffmpeg
realies has joined #ffmpeg
NaviTheFairy has joined #ffmpeg
Gaboradon has quit [Quit: Quitting]
NaviTheFairy has quit [Ping timeout: 252 seconds]
user03 has joined #ffmpeg
user03 is now known as gchound
LionEagle has joined #ffmpeg
LionEagle has quit [Remote host closed the connection]
iive has joined #ffmpeg
MG2021 has joined #ffmpeg
NaviTheFairy has joined #ffmpeg
<MG2021> How can I cut a .TS (AAC) file without losing quality?
bitoff_ has joined #ffmpeg
alexherbo2 has quit [Remote host closed the connection]
NaviTheFairy has quit [Ping timeout: 255 seconds]
hussein1 has joined #ffmpeg
bitoff has quit [Ping timeout: 264 seconds]
MG2021 has quit [Quit: Client closed]
* FlorianBad testing various ipratio values to see if it fixes the visible keyint changes
<FlorianBad> I couldn't see much difference between 1.4 and 1.0 so I went to 0.3 to see what happens and it's 10x worse, which tells me that maybe the problem was the reverse! (now testing higher than 1.4 to see)
<FlorianBad> in other words maybe the I frames had a bad quality which was immediately fixed by the following P-frames, until the next crappy I frame
Blacker47 has quit [Quit: Life is short. Get a V.90 modem fast!]
<FlorianBad> My guess is that it depends on the type of motion, if everything is very static then it might make sense to have nice P-frames to redraw these subtle changes in pixels, but if that's some Tom Cruise stuff then these I-frames might be more important because they change so much... ?
NaviTheFairy has joined #ffmpeg
<FlorianBad> But in my videos things will almost always move very slowly, so I might need a high ipratio (testing 1.7 and 2.5 now on that same ARRI Alexa footage I got from here in 3840x2160 : https://www.youtube.com/watch?v=ccc9-zhGPbo )
deus0ww has quit [Ping timeout: 268 seconds]
<FlorianBad> (also I meant "B-frames" for the Tom Cruise stuff above)
<FlorianBad> P
deus0ww has joined #ffmpeg
<FlorianBad> Yea, so 1.7 is still noticeable (not far from 1.4 after all) but 2.5 becomes excellent, especially considering that I'm testing this with -crf 25
<FlorianBad> (aaabbb, furq)
<FlorianBad> So the conclusion (I think) is that if there's a lot of motion/action, ipratio should drop to give more quality to the P-frames between keyframes (makes sense), but if it's very smooth camera movement, then the I-frames should get better quality, otherwise they look crappy while the P-frames become nicer than they need to be since very little changed
<FlorianBad> Makes sense?
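FlorianBad's comparison as commands — two encodes that differ only in ipratio (crf 25 and the ipratio values are the ones from the test above; the filename is a placeholder):

```shell
# same source, same crf, only ipratio differs; note that with crf a
# higher ipratio also raises the overall bitrate, so a fair quality
# comparison needs 2-pass encodes at a matched bitrate instead
ffmpeg -i arri_test.mp4 -c:v libx264 -crf 25 \
  -x264-params ipratio=1.4 -an test_ip14.mp4
ffmpeg -i arri_test.mp4 -c:v libx264 -crf 25 \
  -x264-params ipratio=2.5 -an test_ip25.mp4
```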
JanC_ has joined #ffmpeg
JanC is now known as Guest3217
Guest3217 has quit [Killed (lead.libera.chat (Nickname regained by services))]
JanC_ is now known as JanC
TuxJobs has quit [Quit: Leaving]
<FlorianBad> Well, one thing I didn't realize is that increasing ipratio w/ crf increases bitrate significantly, so it's not just a "ratio"
realies has quit [Read error: Connection reset by peer]
realies has joined #ffmpeg
five618480 has quit [Remote host closed the connection]
five618480 has joined #ffmpeg
rv1sr has quit []
vampirefrog has joined #ffmpeg
realies has quit [Read error: Connection reset by peer]
realies has joined #ffmpeg
realies has quit [Read error: Connection reset by peer]
realies has joined #ffmpeg
AbleBacon has quit [Read error: Connection reset by peer]
AbleBacon has joined #ffmpeg
emmanuelux has joined #ffmpeg
<FlorianBad> Ok so I guess I should have listened to aaabbb, kepstin, and furq ;) Indeed... increasing the keyint (gop) solves the problem and results in much better quality without raising the ipratio. After all, ipratio is directly dependent on keyint since there will be an "I" every keyint-1 "P" frames
<kepstin> it's really only indirectly dependent, not directly dependent.
<kepstin> ipratio allows you to increase (or decrease) the relative size of predicted frames compared to i frames, so it should actually make a bigger difference in terms of file size with longer gop.
<kepstin> if you're using 2-pass mode with target bitrate, x264 will automatically adjust the overall quality of the video to compensate for predicted frames taking more space; with crf mode you just get a different video size.
cosimone has quit [Remote host closed the connection]
cosimone has joined #ffmpeg
lavaball has quit [Remote host closed the connection]
cosimone has quit [Remote host closed the connection]
<FlorianBad> yeah, ok
gchound has quit [Quit: WeeChat 3.8]
HarshK23 has quit [Quit: Connection closed for inactivity]
<FlorianBad> kepstin, so if my movie has a lot of confetti it could be good to set a *lower* ipratio so that these P-frames in between I-frames can get some extra quality?
lavaball has joined #ffmpeg
<kepstin> hard to say; the default is "good on most content on most settings", and you need to test on specific content with specific settings when changing it.
<FlorianBad> ok, let me find a confetti/slow video :D haha
five618480 has quit [Remote host closed the connection]
five618480 has joined #ffmpeg
<FlorianBad> lol there's some 10-hour snow falling footage on Youtube... How to piss-off Google :)
<FlorianBad> Not bad at the end: https://www.youtube.com/watch?v=THXqDUOpuJg 3840x2160 60fps webm
cosimone has joined #ffmpeg
<furq> that's a pretty good example if it's not geoblocked
cosimone has quit [Remote host closed the connection]
<FlorianBad> hahaha
<FlorianBad> Funny thing is that all these then get to ISPs at the exact same time :)
<furq> probably hard to find a clip like that to test with that hasn't already been ruined by ota/youtube compression
<FlorianBad> yeah, you'd need to just generate it digitally
<furq> well it's easy to generate a synthetic clip that will upset an encoder
<furq> not very useful for testing though
<FlorianBad> or just take any video and then find an average confetti thing, and put it on top but scale it down so there's 100 of them in the screen
<FlorianBad> with a "difference" or multiply composition or something
cosimone has joined #ffmpeg
cosimone has quit [Remote host closed the connection]
cosimone has joined #ffmpeg
cosimone has quit [Remote host closed the connection]
lusciouslover has quit [Read error: Connection reset by peer]
cosimone has joined #ffmpeg
lusciouslover has joined #ffmpeg
cosimone has quit [Remote host closed the connection]
SuicideShow has quit [Ping timeout: 240 seconds]
SuicideShow has joined #ffmpeg
<FlorianBad> I'm assuming that -tune grain lowers the default ipratio?
<furq> grain sets --ipratio 1.1
<furq> x264 --fullhelp
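Assuming the standalone x264 binary is installed, the per-tune defaults can be checked directly:

```shell
# full option reference, including what each --tune overrides
x264 --fullhelp | less
# or just search for the ipratio default
x264 --fullhelp | grep -i ipratio
```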
<FlorianBad> ah :) thanks
five618480 has quit [Remote host closed the connection]
five618480 has joined #ffmpeg