#osdev on 2022-02-26 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:03 mctpyt has quit [Ping timeout: 272 seconds]

00:25 dmh has joined #osdev

00:29 [itchyjunk] has joined #osdev

00:56 pretty_dumm_guy has quit [Quit: WeeChat 3.4]

01:00 isaacwoods has quit [Quit: WeeChat 3.4]

01:05 sdfgsdfg has joined #osdev

01:40 masoudd has quit [Ping timeout: 240 seconds]

01:52 Vercas has quit [Remote host closed the connection]

01:53 Vercas has joined #osdev

01:58 Mutabah has quit [Ping timeout: 256 seconds]

01:58 Mutabah has joined #osdev

02:18 theruran has joined #osdev

02:20 sdfgsdfg has quit [Quit: ayo yoyo ayo yoyo hololo, hololo.]

02:20 relipse has joined #osdev

02:20 <relipse> I am coding a 2D rpg puzzle game anyone want to see it?

02:32 gwizon has quit [Quit: Lost terminal]

02:34 gog has quit [Quit: byee]

02:46 dude12312414 has quit [Remote host closed the connection]

03:15 nyah has quit [Ping timeout: 256 seconds]

03:31 sdfgsdfg has joined #osdev

03:47 elastic_dog has quit [Ping timeout: 240 seconds]

03:53 elastic_dog has joined #osdev

03:55 ElectronApps has joined #osdev

04:22 Jari-- has joined #osdev

04:23 <Jari--> morning all

04:30 sdfgsdfg has quit [Quit: ayo yoyo ayo yoyo hololo, hololo.]

04:51 rustyy has quit [Quit: leaving]

04:52 rustyy has joined #osdev

04:53 rustyy has quit [Client Quit]

04:53 MiningMarsh has quit [Ping timeout: 240 seconds]

04:54 MiningMarsh has joined #osdev

04:54 rustyy has joined #osdev

05:04 rustyy has quit [Quit: leaving]

05:04 rustyy has joined #osdev

05:10 rustyy has quit [Quit: leaving]

05:11 rustyy has joined #osdev

05:32 srjek has quit [Ping timeout: 252 seconds]

05:39 bradd has quit [Remote host closed the connection]

06:07 <moon-child> relipse: no, not really

06:18 <klange> not unless it's getting a PonyOS release

06:28 [itchyjunk] has quit [Read error: Connection reset by peer]

06:44 bradd has joined #osdev

06:45 xenos1984 has quit [Remote host closed the connection]

06:46 xenos1984 has joined #osdev

07:00 k8yun has joined #osdev

07:27 k8yun has quit [Quit: Leaving]

07:36 ThinkT510 has quit [Quit: WeeChat 3.4]

07:39 ThinkT510 has joined #osdev

07:41 vdamewood has quit [Read error: Connection reset by peer]

07:42 vdamewood has joined #osdev

07:47 wolfshappen has quit [Ping timeout: 256 seconds]

07:48 wolfshappen has joined #osdev

07:52 the_lanetly_052_ has joined #osdev

07:52 the_lanetly_052_ has quit [Remote host closed the connection]

08:34 mlombard has quit [Quit: Leaving]

09:25 nyah has joined #osdev

09:52 Patater has quit [Quit: Explodes into a thousand pieces]

10:08 theruran has quit [Quit: Connection closed for inactivity]

10:15 iceneko has joined #osdev

10:17 GeDaMo has joined #osdev

10:18 Jari-- has quit [Ping timeout: 256 seconds]

10:29 iceneko has quit [Ping timeout: 256 seconds]

10:40 mepy has joined #osdev

10:43 _xor has quit [Quit: brb]

10:43 mepy has left #osdev [Leaving]

10:54 j00ru has quit [Ping timeout: 260 seconds]

10:56 j00ru has joined #osdev

11:02 j00ru has quit [Ping timeout: 256 seconds]

11:19 ElectronApps has quit [Remote host closed the connection]

11:44 zaquest has quit [Remote host closed the connection]

11:46 zaquest has joined #osdev

11:59 j00ru has joined #osdev

12:06 the_lanetly_052 has joined #osdev

12:19 heat has joined #osdev

13:03 the_lanetly_052 has quit [Remote host closed the connection]

13:07 dennis95 has joined #osdev

13:09 Jari-- has joined #osdev

13:14 <heat> what's a good heuristic on consolidating per-thread free lists into the shared free list, for a memory allocator?

13:17 <mrvn> heat: keep a counter per thread how many pages are free. Then calculate the average and if you are way above return some pages to a common pool.

13:18 <Jari--> You can also make non-virtually mapped memory allocation.

13:18 <heat> that doesn't make sense if the memory allocator is only being hit by a single CPU

13:18 <Jari--> Just allocate it at the physical memory, in case some process will have access to this.

13:18 <heat> i.e doing networking RX on a single CPU, for performance

13:18 <mrvn> or use a work stealing list (lock-free). if one thread runs out steal some pages from one above average.

13:18 <mrvn> Jari--: that makes no sense.

13:20 <mrvn> heat: you can balance memory on each context switch or something that happens regulary

13:20 <heat> stealing pages defeats part of the purpose of having a per-cpu/per-thread list

13:21 <heat> and balancing memory on context switches seems like a great way to add a shit ton of latency

13:21 <mrvn> heat: a work stealing list is tune to that stealing not happening often.

13:22 <mrvn> Do you have shared memory and filesystem caches?

13:22 <heat> and most of the question is: what's a good way to know when to steal memory from the per-cpu free lists to the shared list?

13:22 <heat> yes

13:23 <heat> balancing lists based on the average of pages/memory chunks each list has seems really suboptimal

13:23 [itchyjunk] has joined #osdev

13:23 <mrvn> In what way?

13:23 <heat> you want to let memory stay inside the per-cpu lists that are getting hit the most often for as long as possible

13:24 <heat> only if you have a bunch of pages inside a per-cpu list that has been mostly stale for a while does that make sense

13:24 <mrvn> then have a decaying usage/s variable.

13:25 <mrvn> Do you need something for 4 cores or for 1024?

13:26 <heat> yes

13:26 <heat> :)

13:26 <GeDaMo> 64K cores should be enough for anyone :P

13:27 <mrvn> For 4 cores you can just average some metric every time you need to. For N cores you want some tree or graph form to average only a bunch of cores and let that propagate over time.

13:28 <mrvn> Won't all your memory be in the filesystem cache except for the local per-thread lists?

13:28 <clever> i might just use an uint32_t[4] (or native wordsize) to hold the free-count for each core

13:28 <clever> and just read them lock-less

13:29 <mrvn> clever: that was my suggestion above.

13:29 <clever> the only problem i can see there, is cache line theft

13:29 <clever> every time you free, you check those counts, and steal that cache line from the other cores

13:29 <heat> mrvn, no? why would it

13:30 <mrvn> as long a there is FS activity not using a free page for cache is wastefull

13:30 <clever> and if the other cores are all updating the counts in the same line, they are also stealing it from eachother

13:30 <clever> so maybe have a per-core avg, and update it less often

13:30 <clever> and spread those counters out into seperate cachelines?

13:31 <heat> all memory is page cache memory except the anonymous memory and the shared memory and the kernel internal allocations

13:31 <heat> and those kernel internal allocations are done by a kernel allocator which probably wants a percpu cache as well

13:33 <mrvn> So the normal state will be that all memory except a little reserve is in use by processes, kernel or cache.

13:34 <clever> thats something that a lot of windows users have trouble with when moving to linux

13:34 <heat> that's not really the case

13:34 <heat> you need a good amount of memory in free lists

13:34 <clever> they see something like 64mb free and 7gig used, and freak out about it being a problem

13:35 <mrvn> that's just a matter of adjusting what "little" means.

13:35 <clever> and 99% of the time, 6gig of that is just the fs cache

13:35 <clever> and they still think flushing the fs cache improves performance

13:36 <heat> clever, windows does the same, but it's designed for human beings

13:36 <clever> maybe task manager is just lying a bit, and reporting fs cache as free?

13:37 <clever> ive not taken such a close look at it

13:37 <heat> the task manager now shows a detailed bar

13:37 <clever> maybe the same is for those new users? and its just their first time on a slower arm sbc?

13:37 <clever> and the first time it slows down, they check ram, and see its low

13:37 <heat> compressed memory, page cache(I think?), actually free memory, anonymous memory, etc

13:37 <mrvn> My point is that the free lists, per-core and global, should be small. So I would create some metric how much memory a core should hold. If the free list is twice that then return half to the global list. If it's half that check if you can get some from the global list. If it's empty run an IPI or steal some.

13:38 ElectronApps has joined #osdev

13:38 <mrvn> clever: Linux dumbed it down for users now too. Top shows "available Memory"

13:39 <heat> i understand what you mean but that's not really the case I'm afraid

13:39 <heat> kernel memory allocation is way more complex than "page cache uses everything, so just use a tiny list"

13:39 <clever> mrvn: `free -m` also shows available, but users still complain when free is low

13:40 <heat> turns out most users aren't kernel devs or system devs so they just want to know if there's enough memory for their programs to run

13:40 <clever> yeah

13:41 <mrvn> clever: but now we can point those to "available". Didn't use to show that.

13:41 <clever> another issue, is bloat in chrome, it can no longer run on just 512mb of ram

13:42 <clever> and the recent pi02 model only has 512mb

13:42 <heat> forcing users to learn kernel concepts just to find out if they can have 30 tabs open on chrome is exactly why linux is not going anywhere on desktop

13:42 <heat> the pi zero 2 w isn't meant for desktop usage anyway

13:43 <heat> source: i have one

13:43 <mrvn> heat: I still don't see a problem. Over time cache will eat all memory till you hit some lower limit of what should be free. At that point you have to balance free lists.

13:43 <clever> heat: i cant even have 1 tab open on about:blank, lol

13:43 <clever> the system nearly deadlocks

13:43 <heat> yeah

13:43 <clever> and some users arent using it for desktop, but just kiosk type applications

13:43 <heat> compilation is also decently slow

13:43 <clever> where it just boots to a static url and auto-refreshes

13:44 <clever> i am wondering where all of that ram is even going

13:44 <heat> i think chrome has a special build for android (for lower memory usage)

13:44 <clever> yeah

13:44 <heat> maybe that's proprietary? dunno

13:45 <mrvn> clever: it caches rendered frame buffers so switching tabs is fast.

13:46 <clever> mrvn: but with a single open tab, thats not much

13:46 <heat> chrome does plenty of stuff to make sure things are safe and fast

13:46 <clever> heat: https://chromium.googlesource.com/chromium/src/+/main/docs/android_build_instructions.md

13:46 <bslsk05> chromium.googlesource.com: Checking out and building Chromium for Android

13:47 <heat> running everything on a single process is more efficient than spreading it out over multiple, for instance

13:47 <dmh> i do plenty to remain safe and fast

13:47 <mrvn> heat: and then throws it out the window because with more than 6 tabs that would eat too much memory.

13:47 <clever> oh, that reminds me, spectre changes make things worse

13:47 <clever> in the old days, a given worker process was mixing different domains together

13:48 <clever> but to limit the scope of spectre exploits, each proc is now restricted to servicing a single domain

13:48 <clever> so you can only ever steal data from yourself

13:48 <mrvn> Like each tab gets their own javascript VM for security. Except tab 7 then shares because can't have O(tabs) memory usage

13:49 <clever> mrvn: if i pop open shift+escape on my desktop, i can see 8 youtube tabs are sharing a single proc, which is using 850mb of ram

13:49 <clever> 251mb just for js

13:49 <mrvn> So your youtube tabs can all steal from each other.

13:49 joe9 has quit [Quit: leaving]

13:50 <clever> kinda, there are at least 2 procs for youtube

13:50 <mrvn> what about other pages that just have a youtube video in them?

13:50 <clever> it randomly decides to spawn a new one

13:51 <mrvn> why does it have so much state that each tab can't he their own process?

13:51 <clever> let me check subframes...

13:51 <clever> yeah, if i play a youtube video inside discord, a process containing "subframe: https://youtube.com" spikes in cpu

13:51 <bslsk05> youtube.com <no title>

13:52 <clever> so its RPC'ing things between the discord process and the youtube process

13:52 <clever> which makes sense, JS doesnt give you very much control over the iframe

13:52 <clever> when cross-domain

13:53 <clever> mrvn: checking a heap snapshot....

13:54 <mrvn> gotten better then.

13:55 <clever> 35% snapshotted

13:56 <mrvn> Personally I think tabs that haven't been visible for say an hour can be reduced to minimum state. No need to keep a snapshot of the framebuffer, run any java scripts or keep any processes around.

13:56 <clever> that framebuffer is actually key to how android performance works

13:57 <clever> when your scrolling thru the open tabs, the only state you have is the url and the framebuffer

13:57 <clever> the entire dom and js heap just doesnt exist

13:57 <clever> and those framebuffers get saved to disk

13:57 <clever> so it can give you the illusion of the tabs still being live

13:57 <mrvn> then how does mouse-over or any other javascript trigger work?

13:58 <clever> android, no hover events

13:58 <clever> and you cant click anything until you set focus to that tab, which then reloads the page

13:58 <mrvn> oh, I see what you mean, that tab scrolling. I though scrolling inside the active tab.

13:58 <clever> yeah, that one

13:59 <clever> the snapshot for a random youtube tab is done

13:59 <clever> 73mb shallow size for (system)

13:59 <mrvn> that picture is just the visible portion though. I believe tabs cache a lot larger section so scrolling inside the tab is fast

13:59 <clever> yeah, when live

13:59 <clever> under heavy strain, i can scroll faster then it can render, and that gets exposed

14:00 <heat> i bet most of the memory usage isn't O(ntabs)

14:00 <heat> a regular web browser uses a gig of memory with what, 4-6 tabs?

14:00 <mrvn> 15485 mrvn 20 0 531208 259532 19028 R 32.1 0.4 8782:11 MainThread

14:01 <mrvn> I haven't used my browser today at all. Why is it using 32% cpu time?

14:01 <clever> mrvn: hit shift+escape, and sort by CPU

14:01 <heat> render?

14:01 <mrvn> Another 30% for Web Content

14:01 <GeDaMo> GIFs?

14:01 <clever> GeDaMo: yeah, i have caught imgur eating cpu before

14:01 <bauen1> video encoding / decoding of some sort in the background ?

14:01 <mrvn> clever: firefox

14:01 <GeDaMo> GIFs in particular are bad

14:02 <mrvn> The visible tab has a paused video on it.

14:02 <heat> also av1 has no gpu acceleration

14:03 <heat> and you need to explicitly enable GPU acceleration on linux

14:03 <GeDaMo> What annoys me about all the JS on Youtube is that none of it has anything to do with decoding video :|

14:03 <heat> because linux

14:03 <bauen1> GeDaMo: just be happy that you can block a crap ton of it in a proper browser :(

14:03 <heat> if you want to do something without getting a BSc in computer science you're absolutely wrong

14:04 <clever> GeDaMo: oh, have you seen blob url's?

14:04 <GeDaMo> You mean like data: ?

14:05 <clever> nope, blob:

14:05 <clever> its a better api

14:05 <GeDaMo> Better for whom? :|

14:05 <clever> the browser performance

14:05 <clever> basically, you can take a JS byte-array, and pass it to the browser

14:05 <clever> the browser then returns an opaque token like 'blob:https://www.youtube.com/c6bdc7d8-45f9-47c2-bf7e-fc0e77eb810d' back to you

14:05 <bslsk05> redirect -> consent.youtube.com: Before you continue to YouTube

14:06 <clever> you can then use that in any network based operation

14:06 <clever> and the browser will just refer to the byte-array

14:06 <clever> no need to convert the bytes to hex, and then back to bytes

14:06 <clever> no waste generating a string 2-3x the size of the blob

14:08 <clever> GeDaMo: this also works with <input type=file> boxes

14:08 gog has joined #osdev

14:09 <GeDaMo> How does this cut down on all the JS on YT?

14:09 <clever> it doesnt, but it makes some of their crazy methods use less ram/cpu

14:10 <clever> i dont know why, but the data for the video, isnt just a plain http(s) request

14:10 <clever> its ajax and js based

14:10 <GeDaMo> Yeah, I've watched the network console :P

14:10 <heat> >i don't know why

14:10 <clever> and this lets the blobs be passed into a <video> tag without much more overhead

14:10 <heat> copyright?

14:10 <heat> obfuscation?

14:10 * clever points to youtube-dl

14:11 <clever> hows that working? :P

14:11 <heat> RE

14:11 <mrvn> And who thought up that any javascript you load form some obscure ads URL can throw a transparent box over the whole browser and capture any mouse clicks?

14:11 <heat> not saying it works, just saying it's probably needed

14:11 <clever> if you grab an element for a <input type=file>, you can then do URL.createObjectURL(element.files[0]); and youll get a blob: token like above

14:12 <clever> you can then use that token in anything that expects a url (ajax, img tag, others), and get the contents of a local file the user selected

14:13 <mrvn> .oO(or not selected)

14:14 <mrvn> Having the browser load file:///etc/passwd still scares me

14:14 <clever> https://developer.mozilla.org/en-US/docs/Web/API/File the type behind .files[0]

14:14 <bslsk05> developer.mozilla.org: File - Web APIs | MDN

14:14 <clever> mrvn: but the cross-domain policies wont allow JS to actually read the contents

14:14 <clever> only if the user intentionally chooses that file in a <input type=file> will that be possible

14:15 <heat> fuchsia is meant to solve that issue

14:15 <mrvn> how does that work with uploading the blob to a server and getting it send back?

14:15 <heat> can't look at etc/passwd unless you give chrome a handle

14:16 <clever> mrvn: how is the file uploaded?

14:16 <mrvn> as "blob:file:///etc/passwd" if I followed the discussion right

14:16 <clever> that wont work

14:16 <clever> the browser generates an opaque token, like blob:https://www.youtube.com/c6bdc7d8-45f9-47c2-bf7e-fc0e77eb810d

14:17 <clever> which has your domain, and a randomly generated string

14:17 <clever> it then looks that up in a table, to find the original byte-array you used to create it

14:17 <clever> so you can only access blobs you already had access to

14:17 <mrvn> and keeps that token in memory forever in case it someday comes back to the tab?

14:17 <clever> probably tied to the lifetime of that tab

14:18 <clever> https://developer.mozilla.org/en-US/docs/Web/API/URL/createObjectURL

14:18 <bslsk05> developer.mozilla.org: URL.createObjectURL() - Web APIs | MDN

14:18 <clever> > The URL lifetime is tied to the document in the window on which it was created.

14:18 <clever> yep

14:18 <mrvn> I gues that breaks the back button then

14:19 <clever> > Each time you call createObjectURL(), a new object URL is created, even if you've already created one for the same object. Each of these must be released by calling URL.revokeObjectURL() when you no longer need them.

14:26 <clever> https://developer.mozilla.org/en-US/docs/Web/API/MediaSource

14:26 <bslsk05> developer.mozilla.org: MediaSource - Web APIs | MDN

14:26 <clever> mrvn: and i think youtube is using the MediaSource api, rather then the Blob api

14:26 <clever> it looks like some kind of js managed ringbuffer, to allow access to a seekable media file, in chunks

14:29 <clever> https://w3c.github.io/media-source/#introduction

14:29 <bslsk05> w3c.github.io: Media Source Extensions™

14:30 <dmh> browsers might be both the easiest and worst os to develop for

14:30 <clever> heat: oh, another feature that blob: and MediaSource offer, is changing the bitrate

14:31 <clever> if the remote server slices your mp4 file up cleanly at keyframes and whole container level packets

14:31 <clever> then the JS can dynamically switch between different bitrates, and append chunks to the stream as it gets them

14:31 <clever> and the <video> tag will just deal with it

14:32 <clever> > Define a splicing and buffering model that facilitates use cases like adaptive streaming, ad-insertion, time-shifting, and video editing.

14:32 <clever> oh, and ad's, ive got an adblocker, so i rarely think of that case

14:33 <GeDaMo> https://en.wikipedia.org/wiki/Dynamic_Adaptive_Streaming_over_HTTP

14:33 <bslsk05> en.wikipedia.org: Dynamic Adaptive Streaming over HTTP - Wikipedia

14:33 <heat> i should port chromium

14:34 <clever> GeDaMo: ah yeah, ive used dash and hls before, with the nginx-rtmp plugin

14:35 <clever> https://github.com/arut/nginx-rtmp-module

14:35 <bslsk05> arut/nginx-rtmp-module - NGINX-based Media Streaming Server (3314 forks/11669 stargazers/BSD-2-Clause)

14:35 <clever> it can accept an rtmp input (from obs or ffmpeg for example), and then provide rtmp/hls/dash streams to viewers

14:36 <heat> >43 deps

14:36 <heat> i'm fucked

14:37 <heat> why does it need pciutils

14:37 <clever> and based on the MediaSource stuff above, and some MDN example code, i'm assuming xhr.responseType = 'arraybuffer'; could be used to fetch chunks of the dash/hls video

14:37 <GeDaMo> Pfft! Just write your own browser :P

14:37 <clever> which nginx-rtmp is pre-slicing

14:37 <clever> https://developer.mozilla.org/en-US/docs/Web/API/SourceBuffer has example code on how to shove that arraybuffer into a mediasource

14:37 <bslsk05> developer.mozilla.org: SourceBuffer - Web APIs | MDN

14:38 <clever> heat: webusb is why chrome has libusb as a dep

14:38 <heat> i would rather rewrite my kernel in java than write my own browser

14:38 <GeDaMo> https://platform.html5.org/

14:38 <bslsk05> platform.html5.org: The Web Platform: Browser technologies

14:38 <clever> ive used that to prod the rpi bootloader from js

14:38 <heat> something that I could actually use though

14:38 <heat> mesa

14:40 pretty_dumm_guy has joined #osdev

14:41 <heat> if I got mesa I could get opengl, vulkan, then a compositor on top of that

14:41 <heat> chromium shouldn't be too hard after that

14:41 <clever> and webgl needs mesa anyways :P

14:41 <heat> but chromium doesn't need webgl

14:42 <heat> fuchsia has a chromium build but has no opengl

14:42 <clever> ah, some parts are probably optional then

14:43 <heat> actually if fuchsia can run chromium maybe it's not that hard to port

14:43 <heat> it's just "hard" instead of "extremely hard"

14:44 * clever heads off to bed

14:51 Patater has joined #osdev

14:55 Patater has quit [Client Quit]

15:03 Patater has joined #osdev

15:04 masoudd has joined #osdev

15:04 Vercas has quit [Remote host closed the connection]

15:04 Vercas has joined #osdev

15:07 Patater has quit [Client Quit]

15:12 Patater has joined #osdev

15:29 X-Scale has joined #osdev

15:32 m3a has joined #osdev

15:34 ElectronApps has quit [Remote host closed the connection]

15:35 mahmutov has joined #osdev

15:37 theruran has joined #osdev

15:38 terminalpusher has joined #osdev

15:44 srjek has joined #osdev

16:16 terminalpusher has quit [Remote host closed the connection]

16:18 dude12312414 has joined #osdev

16:35 the_lanetly_052 has joined #osdev

17:41 elastic_dog has quit [Ping timeout: 240 seconds]

17:47 elastic_dog has joined #osdev

18:00 * geist yawns

18:00 <geist> good morning folks

18:00 dennis95 has quit [Quit: Leaving]

18:01 <geist> actually got up a while ago, just forgot to yawn in this direction

18:01 * mjg yawns back

18:02 <mjg> do we have any locals in .ua?

18:02 <mrvn> geist: got any plans for a web browser that doesn't need gigabytes of ram?

18:02 <geist> get more gigabytes!

18:03 <GeDaMo> I suspect geist has shares in RAM manufacturers :|

18:03 <mjg> the address sapce is there to use it

18:03 <geist> but yeah web browsers soaking up ram is pretty annoying

18:04 <geist> to be fair i think i blame a lot of web apps for it. i think the browsers are largely growing in size due to the size of the individual pages and how much state they keep going

18:04 <mjg> also have fun keeping long uptime without restarting them

18:04 <geist> if you poke around in chrome's task manager you can really see a wide disparity based on individual pages

18:04 <geist> can easily have a few hundred MB of just javascript heap/etc

18:04 <mrvn> geist: That's a problem of the underlyning design of the web

18:04 <geist> assuming it's GCing reasonably efficiently, i dont think the browser can do much about it

18:05 <mrvn> used to be the web was for hypertext. Now it's a VM running server driven games.

18:05 <mrvn> geist: it could not run the crap

18:05 <geist> sure, but then that's not a modern web browser

18:06 <geist> you are free to disable that crap, but then that's a different thing

18:06 <mrvn> geist: run it in tabs the user looks at.

18:06 <GeDaMo> Web browser's just another operating system :|

18:06 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

18:06 <geist> might be worse, because then it has to keep reloading it, etc

18:06 <geist> but yes theres' a fair amount of that sort of thing going on in the background

18:07 <graphitemaster> dammit, who yawned - that shit be contagious

18:07 <mrvn> geist: used to be you could hit ESC on a page and it would stop. I really miss that.

18:07 <mrvn> The annoying bit is that 99.9% of the crap going on in the background is for adds. It's not for the benefit of the user.

18:08 <geist> indeed

18:08 <graphitemaster> An empty `Map()` object in V8 JS engine is already 4 MiB of physical RAM - The web is not very 'ram' optimized.

18:08 <GeDaMo> graphitemaster: where do you see that? :|

18:09 <graphitemaster> I did some memory profiling awhile back on a node app

18:09 <GeDaMo> Ouch

18:09 <graphitemaster> Was surprised how much memory Map and Set used compared to regular JS object dictionaries

18:09 <mrvn> graphitemaster: does Map() take an argument for the expected map size?

18:10 <geist> guess 4MB is the default hash table size?

18:10 <geist> i guess it could at least demand fault it in

18:10 <geist> though i suppose the runtime probably memsets it

18:11 <geist> that's definitely something we've discovered in fuchsia. there's a tremendous amout of code out there in various languagse that allocate something out of their heap and then memset it to zero and never touch it again

18:11 <graphitemaster> There's no way to set the capacity of a Map in JS. This has more to do with how V8 has different object heaps to keep things from trampling over another, and the largeish alignment they need so they can hack stuff into the pointer bits

18:11 <geist> so much of a problem we actually added to the VM a pass that does a quick zero check of pages that were recently faulted in

18:12 <geist> a lot of that extraneous zeroing is basically security minded code that trusts nothing

18:12 <mrvn> geist: Does memory allocated from fuchsia garantee it's zerroed?

18:12 <geist> yep, no other option

18:13 <geist> but the code at that level doesn't know, since it's probably pulled a block out of its heap

18:13 <geist> which may or may not be fresh from the OS

18:13 <graphitemaster> geist, Good memset implementations used by memory allocation should just sit in a loop comparing say 64-bit words at a time to check if they're all zero first, before actually zeroing memory, just to avoid those page-faults - since largish allocations already pull in zero page - like musl for instance does this in calloc, as does glibc iirc.

18:13 <geist> and since C++ (or rust or whatot) has no equivalent of calloc

18:13 <mrvn> geist: that code should calloc instead of malloc

18:13 <graphitemaster> Yeah, bingo.

18:13 <geist> C++

18:13 <graphitemaster> Fuckin' Rust *shrug*

18:13 <geist> or rust

18:13 <graphitemaster> Systems language, yet has no "zero alloc" optimization XD

18:14 <mrvn> geist: new() should have a calloc flavour

18:14 <geist> but whats really going on is it's a modern policy to have your class or object zero out pretty much everything in its constructor

18:14 <geist> which is highly encouraged at work, etc for security reasons

18:14 <geist> so doesn't matter anyway, by the time the object is made, boom aother zero

18:14 <mrvn> The compiler should know about allocated memory being zero and about initializations filling in zero and skip them.

18:15 <geist> not sure how that's feasible since it doesn't kow where the pointer came from

18:15 <geist> you'd have to tag the pointer somehow with a source

18:16 <mrvn> geist: or a different new() that ensures zeroed out memory.

18:16 <graphitemaster> Having worked in the games industry long enough, zeroing memory has a pretty insane cost associated with it. Take for instance a simple vec3 class of x,y,z floats, zeroing them out doesn't seem too egregious, until you end up with vector<vec3> which is the most commonly used object in an engine (or some flavor of it) and well zeroing the vec has a large enough cost that in a physics library like Bullet, turning it off in the class (which

18:16 <graphitemaster> you can do with a define) makes the library almost 30% faster at physics calculations.

18:16 <geist> graphitemaster: yah totally

18:16 <geist> mrvn: sure there are plenty of solutions if you're willing to break the language

18:16 <mrvn> geist: wouldn't break anything. The object is still properly initialized.

18:16 <geist> and as graphitemaster is pointing out, this sort of stuff is ptobably not tolerated in the games indust5ry

18:17 <mrvn> graphitemaster: vector<vec3> should not zero anything. Only used parts of the vector should get initialized.

18:17 <geist> graphitemaster: yah we *also* have an extra safety thing in clang in the kernel that fills all new locals with a nonzero pattern

18:17 <graphitemaster> it's worse in c++ because constructors are zeroing objects, object at a time, so something like vector<vec3>::resize(1 million) is making 1 million function calls assigning this->*... to zero

18:17 <geist> also has some performance hits

18:18 <graphitemaster> Rather than just using memset(0)

18:18 <mrvn> that's more like it.

18:18 <geist> worse when the constructor is't inlined (which is probalby filling it with zeros), since the compiler will full the space with garbage, and then run the constructor which overwrites that with zeros

18:20 <geist> but yeah i definitely remember some regressions in speed when we moved some code in the zircon kernel from C to C++

18:20 <geist> the memset -> O(N) constructor for exampel as we switched the vm_page struct from C to C++

18:20 <geist> in C can just memset over the whole array and make sure 0 i the default state

18:21 <mrvn> graphitemaster: A vec3 should have the default constructor and if the std::allocator had a flag saying memory is zeroed then the default constructor would just not do anything.

18:21 <geist> but then when it grew a constructor, boom you have a loop and contructors. inlined of course but still less efficient than a memset across the whole thing

18:21 <graphitemaster> I don't think security is at odds with high performance code though. I think these are exclusively problems with modern language design and the purist of value-semantics and the lack of good allocation information within the design of the language. Like if you didn't just treat the allocator as a global concept that allocated an object (like both Rust and C++ do), but had language-concept of an allocator and designed different types of

18:21 <graphitemaster> allocations (zeroed alloc, non-zeroed alloc, zeroed-resize, non-zeroed resize, the aligned versions of all of those, sized-free, etc) you can do a billion times better job while also being secure.

18:22 <geist> yah totes

18:22 <j`ey> rust has those concepts now

18:22 <mrvn> C++ has gone a lot more towards speed with the move semantic.

18:22 <j`ey> https://doc.rust-lang.org/std/alloc/trait.Allocator.html

18:23 <bslsk05> doc.rust-lang.org: Allocator in std::alloc - Rust

18:24 <graphitemaster> Yeah Rust is getting a bit better here, there's still a lot missing from the design I'd say. One major gripe I have is no language really provides allocation interfaces over virtual memory specifically. Like if you want to create a ring-buffer for instance where you just tie the ends together with virtual memory rather than wrapping the buffer with modular arithmetic.

18:24 <mrvn> graphitemaster: My idea would have the allocator (or the underlying memory resource) zero out the memory. So vector<vec3> would do a big memset(), or even skip that when the OS gives zeroed memory.

18:25 <geist> yah the ring buffer would probably go against the whole model of ownership, i guess

18:25 <geist> since you'd now officially have two views of the same thing

18:25 <mrvn> You mean have a uint32_t buf[2048]; and use the same physical page for both halfs to make a 1024 element ring?

18:26 <geist> i have heard folks talk about things involving multiple mappings of the same object and how to deal with it in rut

18:26 <graphitemaster> A lot of optimizations can be done at the virtual memory layer, for instance one of the engines I worked on owns the entire heap - and it has a memcpy implementation which is capable of determining if the addresses are in the managed engine heap, and it'll actually only copy up to a page on each side of the memcpy for page-alignment, then hac at it's heap data structures to perform the inner-copy with virtual memory shenanigans, so it can

18:26 <graphitemaster> copy gigabytes of data at the cost of 2 pages basically (max), and let page-faults deal with the rest. If you covnert it back to regular C memcpy ... the performance is lost and stuff like level loads take almost 20x longer.

18:28 <mrvn> but then you assume the copied memory isn't all used. Otherwise all the page faults cost you more.

18:29 <mrvn> mmap/mremap should have a flag to COW a block of memory.

18:29 <graphitemaster> Right, I think it's just more about the amortization of the page-faults. A memcpy is an expensive inline operation and on large data copies it can block for a long time, while something like a ton of page-faults is spread more evenly in the frame where it's needed, it hides better

18:30 <graphitemaster> Most of what you care about in realtime is hiding spikes, you're more latency focused

18:30 <mrvn> totally. In a realtime system you can't memcpy() 1GB.

18:31 <graphitemaster> Unless you're an M1 mac :P

18:31 <graphitemaster> Then you apparently can and that's not fair XD

18:31 <mrvn> "If the value of old_size is zero, and old_address refers to a shareable mapping (see mmap(2) MAP_SHARED), then mremap() will create a new map‐ ping of the same pages.

18:32 <mrvn> " so it seems you can copy pages. Any way to turn those pages into COW semantic?

18:35 <graphitemaster> Who knows, a lot of platforms don't even have good ways to do copy-on-write.

18:35 <graphitemaster> Like Windows for instance, getting the POSIX behavior where you just get the zero page over and over for your allocation and then it page-faults on writes and actually commits physical memory is just not a thing really.

18:36 <graphitemaster> On Windows you can allocate virtual memory and you can physically commit it when needed, but you can't get the implicit behavior of COW as far as I know

18:38 terminalpusher has joined #osdev

18:39 <graphitemaster> The commit charge is always billed at the VirtualProtect call and the call can fail if you run out of commit

18:40 <graphitemaster> Which sucks because the only sensible way to do something like this is to allocate a virtual region large enough that you never need to worry about running out in the case of say a ring buffer

18:40 <graphitemaster> But you obviously cannot commit a terabyte :P

18:41 <mrvn> graphitemaster: and instead of doing % you want to free pages at the front while fauling in pages at the end?

18:49 <graphitemaster> I mean ideally I just want to virtually allocate large everything so I never need to worry. Like if every std::vector<T> just used likke 1 TiB of virtual memory :P

18:49 <graphitemaster> Then every resize is in place

18:50 <graphitemaster> Basically I envision an OS where there really isn't any actual memory allocation interfaces in user-space. You just get the entire virtual address range and the C allocator just bumps along that space never needing to make a system call to allocate, only ever to free pages which can be done in bulk

18:51 <graphitemaster> Then let page-faults and cow deal with the rest

18:51 <mrvn> did that, execpt no syscall to free pages.

18:52 <mrvn> basically what you have then is a system with (s)brk and #define brk(size) NOP

18:53 <mrvn> #define brk(size) 0 I mean

18:53 <geist> you could functionally do that in fuchsia by just creating a huge VMO and mapping it over most of your address space

18:54 <geist> and then letting it fault in whever you touch it

18:54 <geist> other OSes too, with a gigantic anonymous mapping

18:55 <mrvn> If you do demand page faulting then the malloc() call becomes totally obsolete other than protecting against run away pointers.

18:56 <mrvn> It's too bad there is no MAP_GROWSUP analog to MAP_GROWSDOWN

18:58 <mrvn> But you can map something to 0x80....000 - 0x1000 with MAP_GROWSDOWN and have malloc grow the heap downwards.

19:22 terminalpusher has quit [Remote host closed the connection]

19:28 <graphitemaster> We should also bump the stack size considerably

19:29 <mrvn> 4k isn't enough for you?

19:29 <gog> 4k should be enough for anybody

19:30 <mrvn> My mikrotasks only have 4k memory in total. That includes the page table, struct Task, Heap and Stack.

19:31 <mrvn> Running 1 million tasks on a RPi was fun.

19:39 the_lanetly_052 has quit [Ping timeout: 240 seconds]

19:51 <jimbzy> https://os.phil-opp.com/

19:51 <bslsk05> os.phil-opp.com: Writing an OS in Rust

19:51 <jimbzy> I found that on OSNews and it looked pretty interesting.

19:51 <geist> noice. i've seen a few so far but havne't seen a really good next level one

19:52 <geist> lots of simple taskers, but not yet one that's got all the major pieces in place

19:52 <geist> not saying this one doesn't (aven't looked at it yet)

19:52 <jimbzy> I think I saw one on osdev that's re-working the ARM tutorials.

19:54 <geist> yah i'm in no way sayig rust is insufficient for kernel work, but there are some data structure challenges i'd love to see solved efficiently

19:55 <jimbzy> I'll save it for a later look. I've been working on a little game pack using the Godot game engine.

19:56 <j`ey> the phil-opp one doesn't go super deep

19:57 <jimbzy> j`ey, Just another "bare-bones" style kernel?

19:57 <j`ey> a little bit more than that

20:00 <jimbzy> Yeah, I'll give it a look soon.

20:03 <j`ey> geist: you can always fall back to unsafe+pointers :P

20:03 <geist> of course. that's the question to me: how unsafe do you realistically need to be able to build an actually performant rust kernel

20:04 <geist> stuff ike data structures accessed in interrupt mode, etc

20:04 <geist> global thread ists, etc. there are slow but correct solutions for that that i fear would be not good enough

20:04 <jimbzy> Pssh. Safety? That's why we have the "hold harmless clause" ;)

20:05 <j`ey> yah stuff like that will likely be unsafe, borrowing rules cant really work acrosss interrupts and stuff

20:06 <geist> right. of course if you just turn off all the safeties and have to stick to a smaller subset of the language you start to get into the 'why bother with rust' question

20:07 dude12312414 has joined #osdev

20:07 <geist> vs say having a core system in C/C++ linked with rust drivers or subsystems, which seems like would be a pretty good solution

20:07 <j`ey> if you're starting from scratch anyway *shrug*

20:07 <mrvn> does that include having threads sleeping till an IRQ happens?

20:09 <geist> that might be the solution, running all irqs in thread context

20:10 <mrvn> turning IRQs into messages send to threads?

20:11 <geist> or having dedicated threads that wake up on irq. not an uncommon solution of course, even in monokernels

20:11 <geist> but that might avoid the problem of having driver code running in interrupt context and thus have more of the code in the system exposed to that sort of locking issue

20:12 dude12312414 has quit [Remote host closed the connection]

20:17 <jimbzy> https://i.imgur.com/wFdpqxD.png

20:18 <mrvn> jimbzy: The helm is so nothing flies into your eyes while you are blind?

20:18 <jimbzy> Exactly!

20:19 <GeDaMo> https://v.redd.it/r6mx0vrqm5j81/DASH_480.mp4

20:19 <bslsk05> 'r6mx0vrqm5j81' by [idk] (--:--:--)

20:20 <jimbzy> I'm pretty sure that's a Speedglas auto-darkening hood, too.

20:21 <mrvn> I want a hood with a screen on the inside and camera on the outside that maintains a constant brightness on the screen.

20:22 <jimbzy> That's easy. The hard part is making one that doesn't get nuked due to the EMF.

20:22 <mrvn> EMF?

20:22 <jimbzy> Electromagnetic field.

20:22 <mrvn> do you mean EMP?

20:22 <jimbzy> You wish it was just a pulse :p

20:27 <mrvn> should be lots of pulses

20:28 <jimbzy> https://youtu.be/qJPEF4z5sHU?t=58

20:28 <bslsk05> '1000 AMPS Stick Welding' by WeldTube (00:06:15)

20:30 <mrvn> see how the light fluctuates? Lots and lots of sparks.

20:30 <jimbzy> Yeah

20:31 <jimbzy> You can manipulate the properties of the weld by manipulating the lead angle and arc length.

20:32 <mrvn> Is the power supply even a steady current or pulses?

20:32 <jimbzy> Stick welding is funky like that, though because the shielding material covering the metal. As the electrode is consumed it creates a little gas pocket around the molten pool.

20:33 <GeDaMo> Maybe charging/discharging capacitors?

20:33 <jimbzy> It depends on the machine and process.

20:34 <jimbzy> There are also constant amperage or constant voltage machines. I believe stick welding like that is constant current.

20:34 <mrvn> With 1000 A you certainly don't want to pay the electricity bill if that's a steady current.

20:34 <geist> i forget, is the current exceptionally high or very low? one of the two

20:34 <jimbzy> High

20:34 <geist> but fairly low voltage

20:34 <jimbzy> Yeah

20:34 <GeDaMo> Seems like "capacitive discharge welder" is a thing

20:35 <jimbzy> At 200amps I was usually pushing around 14-15v iirc.

20:35 <mrvn> geist: you are creating a short with 2 mettals. Lot and lots of current.

20:35 <geist> i remember being surprised that the total W is not as high as you'd think

20:35 <jimbzy> Yeah, basically just an out of control heating element.

20:35 <geist> though i guess still a few kW

20:36 gxt has quit [Ping timeout: 240 seconds]

20:36 <mrvn> Your power in is limited to 16A at 240V so do the math.

20:37 <jimbzy> Usually like 6.5kw in and 4 out.

20:37 <mrvn> unless you have one that needs 3 phase current.

20:38 <jimbzy> And it's not constant. They have a duty cycle.

20:38 gxt has joined #osdev

20:39 <geist> well, okay thats fairly high

20:39 goncalo has joined #osdev

20:40 <geist> i do remember being happy when i bought my house that it has a pretty solid set of 220 circuits in the garage

20:40 <geist> and 400A of power into the house, so could do that sort of thing if wanted

20:40 <jimbzy> Yeah that's pretty tight.

20:41 <jimbzy> No 220 in the garage here, but the laundry room is there, so I could make it happen if needed.

20:41 <geist> and there used to be a hot tub on the deck so there's this whole 50A 220 circuit dedicated to it

20:41 <jimbzy> That's really awesome.

20:41 <geist> no idea what to do with it, but it's there

20:41 <mrvn> I have 240V everywhere.

20:42 <jimbzy> geist, Tesla coil?

20:42 <geist> was waiting for someone in europe to point out they have 240V everywhere

20:42 <geist> jimbzy: hmmm!

20:42 <mrvn> We had 220V like 50 years ago.

20:42 <GeDaMo> Crazy Americans with their 120V :P

20:42 <mrvn> too inefficient so they started to raise it a bit every year.

20:42 <geist> but when we do have 220 (or 240 i forget) we usually have fairly high amperage circuits for it

20:43 <gog> yeah like 50amp

20:43 <mrvn> In germany we have 3 phase power for that.

20:43 <jimbzy> We have 3phase here, too.

20:43 <jimbzy> 440v

20:43 <gog> the one thing i don't like is that the receptacles here don't have a standard neutral

20:43 <GeDaMo> In the UK, I think we have 30 amp for things like cookers

20:43 <mrvn> And if 3 phases of 240V isn't enough there is also 400V iirc.

20:43 <gog> it's whichever

20:45 <mrvn> In the UK they also have this strange wireing where the wire forms a loop and you can draw more A from the sockets than the wire should be able to do if you spread it around the room because it then flows from both sides.

20:45 <jimbzy> You guys are 50hz too aren't you?

20:45 <geist> yah i read about that one time and didn't completely grok it

20:45 <gog> oh yeah loop circuits

20:46 <mrvn> geist: R = I * V, which basically gives you heat. Every power outlet has two wires providing power so I gets split into left and right.

20:48 joe9 has joined #osdev

20:49 <jimbzy> Doesn't that require load balancing?

20:49 <mrvn> I think they stopped using that because it has some odd properties (because you don't load balance) and it's better to just have more non-looped wires.

20:50 <jimbzy> Interesting.

20:50 <mrvn> I think UK + Ireland is also the only place where they have fuses in every outlet.

20:50 <eryjus> mrvn: V = I * R

20:51 <geist> yah i understood the basics i guess i just didn't grok how it'd go back to the fuse panel and whatnot

20:51 <geist> but also didn't try too hard

20:51 <mrvn> eryjus: hey, at lest I got all the letters :)

20:51 <eryjus> lol

20:52 <jimbzy> We're sorry, but your answer has to be in the form of a vector.

21:03 GeDaMo has quit [Remote host closed the connection]

21:03 <graphitemaster> We call them operating systems but they don't operate anymore.

21:09 <gog> makes ya think

21:27 <heat> bootloaders don't load boots either

21:28 <gog> :'(

21:39 <mrvn> heat: except on bootos

21:40 rustyy has quit [Quit: leaving]

21:41 <graphitemaster> The entirety of computers is a lie

21:41 <graphitemaster> They don't even compute

21:41 rustyy has joined #osdev

21:45 <gog> it's ok

21:50 <mrvn> what are they? Too hot for a fridge, too cold for a stove, not enough airflow for a cooling fan, ....

22:08 <heat> reject compoter become m

22:16 Clockface has joined #osdev

22:17 <Clockface> i have noticed for the mod/rm byte that some of the offsets are stuff like disp16^2 or disp8^3

22:18 <Clockface> does this mean that the value stored should be exponented before being used?

22:18 <mrvn> hardly

22:18 <mrvn> is that foodnote 2 and 3?

22:19 <Clockface> oh...

22:19 <Clockface> nevermind then

22:19 <Clockface> thank you

22:54 troseman has joined #osdev

23:02 mahmutov has quit [Ping timeout: 240 seconds]

23:06 pretty_dumm_guy has quit [Ping timeout: 240 seconds]

23:31 pretty_dumm_guy has joined #osdev

23:32 [itchyjunk] has quit [Remote host closed the connection]

23:32 [itchyjunk] has joined #osdev

23:39 orthoplex64 has quit [Ping timeout: 240 seconds]

23:39 crm has joined #osdev

23:48 orthoplex64 has joined #osdev

23:49 crm has quit [Ping timeout: 272 seconds]

23:57 emartinez has joined #osdev

23:59 heat has quit [Remote host closed the connection]