#osdev on 2022-08-25 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 gildasio has quit [Remote host closed the connection]

00:02 gildasio has joined #osdev

00:02 gildasio has quit [Remote host closed the connection]

00:03 gildasio has joined #osdev

00:07 gog has quit [Ping timeout: 252 seconds]

00:13 SpikeHeron has quit [Quit: WeeChat 3.6]

00:26 SpikeHeron has joined #osdev

00:31 gog has joined #osdev

00:32 frkazoid333 has joined #osdev

02:02 kof123 has quit [Ping timeout: 268 seconds]

02:25 [itchyjunk] has quit [Remote host closed the connection]

02:42 terrorjack has quit [Quit: The Lounge - https://thelounge.chat]

02:43 terrorjack has joined #osdev

02:54 gog has quit [Ping timeout: 260 seconds]

04:14 sympt has joined #osdev

04:40 kof123 has joined #osdev

05:59 bauen1 has quit [Ping timeout: 268 seconds]

06:55 bauen1 has joined #osdev

07:07 knusbaum has quit [Ping timeout: 252 seconds]

07:18 <mrvn> Griwes: can you guess a handle? They aren't just indicies into an array of handles the kernel maintains for the process?

07:18 <mrvn> (like files in unix)

07:32 the_lanetly_052 has quit [Ping timeout: 260 seconds]

07:35 nyah has joined #osdev

08:03 <mrvn> Got to love people cheating leetcode: https://stackoverflow.com/questions/73453575/palindrome-linkedlist-4ms-solution-explanation-needed

08:03 <bslsk05> stackoverflow.com: c++ - "Palindrome LinkedList" 4ms solution explanation needed - Stack Overflow

08:09 <kazinsal> fuck, I hate it. but I also hate the laziness of modern interviewers in copy and pasting whiteboard questions from leetcode

08:09 <kazinsal> so, lmao.

08:12 xenos1984 has joined #osdev

08:15 <j`ey> what does cin.tie(nullptr) do?

08:18 <Mutabah> https://en.cppreference.com/w/cpp/io/basic_ios/tie

08:18 <bslsk05> en.cppreference.com: std::basic_ios<CharT,Traits>::tie - cppreference.com

08:19 xenos1984 has quit [Quit: Leaving.]

08:19 <Mutabah> Seems to detach buffering of stdout from any actions on stdin

08:21 xenos1984 has joined #osdev

08:24 <j`ey> ah

08:39 frkazoid333 has quit [Ping timeout: 255 seconds]

08:41 zaquest has quit [Remote host closed the connection]

08:59 GeDaMo has joined #osdev

09:50 knusbaum has joined #osdev

10:10 zaquest has joined #osdev

10:17 carbonfiber has joined #osdev

10:33 liz has joined #osdev

10:34 gxt_ has joined #osdev

10:35 gxt has quit [Ping timeout: 268 seconds]

10:47 gxt_ has quit [Remote host closed the connection]

10:48 gxt_ has joined #osdev

11:09 pretty_dumm_guy has joined #osdev

11:09 gog has joined #osdev

11:14 smach has joined #osdev

11:59 Matt|home has quit [Quit: Leaving]

12:23 <mjg> i'm super tempted to repeat the bench in https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.94.8528&rep=rep1&type=pdf

12:32 heat has joined #osdev

13:54 vai has joined #osdev

13:54 vai is now known as JerOfPanic

14:29 wgrant has joined #osdev

14:47 carbonfiber has quit [Quit: Connection closed for inactivity]

14:47 wgrant has quit [Ping timeout: 252 seconds]

15:00 wgrant has joined #osdev

15:01 frkzoid has joined #osdev

15:07 gxt_ has quit [Remote host closed the connection]

15:07 gxt_ has joined #osdev

15:16 <heat> should I discard SUID and SGID?

15:16 <heat> security me would think so, but I think that a good chunk of UNIX revolves around it

15:16 <clever> how else would you do things like sudo?

15:16 <clever> or do you just never, ssh in again as root?

15:18 <heat> you could have a root daemon and run stuff from there

15:18 <heat> I think that magisksu does that

15:23 <mrvn> sshd doesn't need suid

15:26 <mrvn> heat: do you mean the uid/euid/suid fields in a process or the suid bit in the filesystem?

15:27 <heat> suid bit

15:28 <mrvn> sometimes you want to give programs more capabilities than the caller has. Unless you you want that dbus insanity from desktops

15:29 <heat> suid is also insane

15:29 <heat> in fact, even more

15:30 <mrvn> at least it's simple an effective

15:31 <mrvn> extended attributes to give capabilities would be more flexible

15:36 <heat> its not simple in practice

15:37 gog has quit [Ping timeout: 260 seconds]

15:38 <mrvn> usage or implementation?

15:38 <heat> usage

15:39 <mrvn> I don't know. I want something to be able to access the audio devices owned by the audio group so I set sgid audio. seems simple enough.

15:40 <heat> i don't care about sgid audio

15:40 <heat> I care about suid root

15:40 <mrvn> still simple. if it should run as root set suid root

15:41 <heat> thank you mrvn

15:41 <heat> you truly understand the intricacies of building a secure system around setuid

15:41 <mrvn> Handling the uid/euid/suid correctly in code and implementing it in the kernel I find harder.

15:44 <mrvn> heat: suid/sgid is pretty much an all-or-nothing system. Not sure you can talk about secure with that.

15:45 <mrvn> But it's simple: You either trust the binary with your live or not. Black & white.

15:47 <mrvn> it's easy to debug too. Ever tried to debug the dbus madness to find out why some 3rd party code gets an error trying to do something?

15:48 <heat> it's horrible to debug

15:49 <heat> have you seen the 30000000000000000000000 setuid privilege escalation bugs?

15:50 <zid> I'm not sure if 'setuid' is the thing causing the bug though, or just "bugs in methods to run privledged operations" in general is hard to do right.

15:50 <mrvn> What has that got to do with debugging the setuid feature? That's about making it secure, which you can't. Any bug in a suid binary is probaby an escalation.

15:51 <zid> That is to say, I've also seen a lot of bugs in syscalls, security tokens, etc

15:51 <heat> I don't give a shit about debugging the setuid features

15:51 <zid> setuid is just one method, of which will also be as buggy as all other things

15:51 <mrvn> The same bugs in a food server running as root and accessed via dbus has the same escalation problem.

15:51 <heat> it's way easier

15:51 <heat> your attack surface is so reduced

15:52 <mrvn> you think the dbus interface has a smaller surface than the command line?

15:52 <heat> yes

15:52 <clever> ive seen a car before, that had a dbus server running as root, listening on tcp, and it had a bloody command to just run anything as a shell command

15:53 <clever> anybody that can connect to the wifi hotspot can abuse that

15:53 <clever> the default WPA password, is based on the unix uptime of the first boot, before ntp has had a chance to fix the clock

15:53 <clever> so its just a hash of how many seconds it takes a fresh install to boot

15:53 <clever> how much entropy is that? :P

15:53 <mrvn> 2 or 3 keys

15:54 <clever> to protect the ECU from a malicious entertainment system, there is a dedicated MCU acting as a firewall between the 2 CAN busses

15:54 <clever> and only when the entertainment system is put into flashing mode, can that MCU also be reflashed

15:54 <heat> https://blog.qualys.com/vulnerabilities-threat-research/2022/01/25/pwnkit-local-privilege-escalation-vulnerability-discovered-in-polkits-pkexec-cve-2021-4034

15:54 <bslsk05> blog.qualys.com: PwnKit: Local Privilege Escalation Vulnerability Discovered in polkit’s pkexec (CVE-2021-4034) | Qualys Security Blog

15:54 <heat> et al...

15:54 <clever> and it only reads code from the (normally) read-only flash

15:55 <clever> except, for one spot, where it can run a shell script from the rw partition

15:55 <clever> so a malicious payload can create that script, reboot into flashing mode, re-flash the MCU, and then attack the ECU!

15:55 <clever> and it gets worse!

15:55 <clever> the dbus port, is also accessible over the cellular modem....

15:56 <mrvn> heat: aeh, polkit is that thing that you use with dbus on desktops. You just shown that that has all the bugs of dbus plus suid bugs.

15:56 <clever> mrvn: i recently took a peek, and i can disable polkit on nixos, xfce will sanely detect the lack of polkit, and just disable the shutdown/reboot buttons

15:57 <clever> and polkit has bit me a few times, i typed "reboot" into the wrong, NON ROOT shell, and it rebooted without any confirmation

15:57 <clever> i expect non-root shells to lack root :P

15:57 <heat> polkit is using setuid for privilege escalation

15:57 <clever> yep

15:57 <heat> this is not the first problem with setuid, nor will it be the last

15:58 <clever> there was also a fuse bug

15:58 <heat> setuid is not fucking easy, because it's impossibly hard

15:58 <mrvn> heat: not realy. polkit needs escalated priviledges by some form or another. That's not the bug.

15:58 <clever> a lot of fuse programs are setuid, because you need root to mount an fs

15:58 <clever> and libfuse will helpfully do `modprobe fuse`, if it fails to open `/dev/fuse`

15:58 <mrvn> polkit is buggy leeting other abuse the escalated priviledges it runs under.

15:58 <mrvn> s/leeting/letting/

15:59 <heat> i swear to fucking god

15:59 <heat> are you trolling?

15:59 <clever> modprobe was never designed to be setuid friendly, so modprobe respects an env var to change its config path

15:59 <clever> the modprobe config, can remap `modprobe fuse` into a shell command

15:59 <heat> "setuid is bad because its insecure and hard to use" "no setuid is great, programs that use it are buggy and bad"

15:59 <clever> and if you exaust the open fd counts, opening /dev/fuse will fail

15:59 <zid> I think security is just hard to use and buggy and bad :P

15:59 <zid> disregard security

15:59 <clever> heat: exactly, the fuse authors messed up, in creasing the above bug

16:00 <clever> setuid isnt to blame, its fuse&modprobe for making some bad assumptions

16:00 <heat> setuid is to blame because it's an inherently flawed idea

16:00 <mrvn> heat: no

16:01 <heat> "lets keep everything as is, except the euid, what can go wrong?"

16:01 <clever> heat: my understanding, is that euid is there, to make it easy to open things as the original user, so you dont wind up with bugs where you can `foo --config /etc/shadow` and it spits out your syntax errors

16:01 Ermine has quit [Ping timeout: 264 seconds]

16:02 <clever> so you can temporarily drop root, but get it back, when running code you trust with root access

16:02 <mrvn> clever: carefull, you have to do that differently when using suid bits.

16:02 <mrvn> (and that part I think is a bad design)

16:03 <clever> without euid, you would either need to re-implement all permission checks in userland (moar bugs), or spawn a dedicated child proc to drop root, read the file, then pass you the contents out its stdout

16:04 <mrvn> clever: you are talking about a different (although related) feature

16:04 <clever> fuse also violates a lot of the normal rules, where root can get permission denied, and you have no way to really re-implement that

16:04 <clever> yeah, i might be getting some names mixed up

16:05 <zid> security is hard, disregard security

16:05 Ermine has joined #osdev

16:06 <zid> stop letting people run commands on your machine and they won't be running them as root due to setuid bugs

16:06 <zid> run all your software in the butt

16:06 <clever> zid: if you trust every piece of software on the machine, and its air-gapped, who needs security!

16:06 <mrvn> heat: the fact remains that you will need some way to escalate priviledges. Figure out something better that can't be abused through buggy code and everyone will thank you.

16:06 <mjg> there was a scritp recently which you were supposed to curl | bash

16:06 <mjg> it had sudo invocation inside :D

16:07 <clever> mrvn: the installer for the nix package manager does exactly that

16:07 <clever> oops, mjg ^

16:07 <zid> clever: I trust that the software is software, and therefore the worst it can possibly do is software things. The things I want to keep 'secure' on my machine are the least protected, my email account.

16:07 <mjg> my assessment: LOL

16:07 <clever> but if you have +w to /nix, it wont ask to use sudo

16:07 <zid> software protects silly things, like the machine not being turned off by the wrong users etc

16:08 <clever> polkit likes to violate that :P

16:08 <zid> but polkit doesn't even ATTEMPT to secure the things I don't want others to have/see

16:08 <clever> i have systems where the user with physical access, shouldnt have such permissions

16:08 <zid> so security is useless

16:09 <clever> for example, i'm running a media center out of my NAS, and if you hit escape, youll get a menu with the shutdown option

16:09 <mjg> setenforce 0

16:09 <mjg> first thing i do!

16:09 <zid> If they can run commands on my machine, they can already completely ruin everything the moment they run rm -rf ~, or cat my firefox profile etc

16:09 xenos1984 has quit [Quit: Leaving.]

16:09 <zid> if they got root and I have to then reinstall the machine afterwards is sort of irrelevent

16:09 <mrvn> clever: I go through aall that trouble to disconnect the physical power button so users stop turning off the system and then they go and click "shutdown" in the GUI.

16:09 <clever> mrvn: heh, ive not gone that far

16:10 gog has joined #osdev

16:10 <heat> clever, if you dislike a specific polkit policy, you can change that

16:10 <heat> it's all written in javascript, so not hard to pick up

16:10 <clever> heat: yeah, the policy files are written in bloody javascript :P

16:11 <clever> i could just `return false` everything

16:11 <heat> it was either that, python or a domain specific language

16:11 <clever> /etc/polkit-1/rules.d/10-nixos.rules just has a single rule for me, polkit.addAdminRule(function(action, subject) { return ["unix-group:wheel"]; });

16:12 <mrvn> clever: groupd wheel? why should wheel have admin rights and adm not?

16:12 <clever> and there is a security.polkit.adminIdentities config flag, with a default value of [ "unix-group:wheel" ]

16:13 <clever> that polkit.addAdminRule part is always there, no way to remove it, but you can easily make it return [];

16:13 <clever> extra rules can be added as well

16:14 <mrvn> You could throw out the whole polkit mess and just use suid and make the binaries executable by group wheel.

16:14 <clever> https://github.com/NixOS/nixpkgs/blob/master/nixos/modules/security/polkit.nix#L67-L75

16:14 <bslsk05> github.com: nixpkgs/polkit.nix at master · NixOS/nixpkgs · GitHub

16:14 <clever> mrvn: or just use `sudo reboot` and yeah disable polkit entirely

16:14 <heat> i trust polkit a lot more than sudo

16:14 <clever> so i get a password prompt and time to re-think my actions

16:14 <heat> and certainly setuid

16:15 <mrvn> How many thousands of lines of code does polkit add just to reimplement the user/group permissions in your case?

16:15 <clever> heat: my problem, is with `reboot` just working, even if it lacked root, and having zero confirmations

16:15 <clever> mrvn: all of spidermonkey, the JS engine that powers firefox, last i looked

16:15 <heat> it doesn't reimplement anything

16:15 <clever> i trust sudo over polkit

16:16 <heat> it allows way way more finegrained security

16:16 <gog> i trust nothing because security is a myth

16:16 <heat> clever, sudo???

16:16 <mrvn> clever: do you know molly-guard?

16:16 <heat> have you looked at it? don't forget it's running in a suid binary

16:16 <mrvn> molly-guard - protects machines from accidental shutdowns/reboots

16:16 <clever> mrvn: oooo

16:17 <clever> heat: yes, i know sudo is setuid root, thats how it gets the ability to setuid to the chosen user (sudo -u foo)

16:17 <mrvn> Makes you confirm by typing in the hostname of the system you want to shutdown if you aren't on the physical console.

16:17 <clever> mrvn: what would xterm be counted as?

16:17 <heat> clever, sudo is huge

16:17 <heat> doas is way better in that regard

16:19 puck has quit [Excess Flood]

16:19 <clever> bbl

16:19 puck has joined #osdev

16:19 <mrvn> clever: I think that works but you can install a script that checks if you are root or not and ask for confirmation easily

16:20 <mrvn> clever: I use it for not shutting down systems via ssh

16:35 gelatram has joined #osdev

16:41 bauen1 has quit [Ping timeout: 252 seconds]

16:43 gog has quit [Ping timeout: 264 seconds]

16:44 FreeFull has joined #osdev

16:48 gelatram has quit [Quit: Ping timeout (120 seconds)]

16:49 Killy has joined #osdev

16:53 bauen1 has joined #osdev

17:04 gog has joined #osdev

17:12 gelatram has joined #osdev

17:15 bauen1 has quit [Ping timeout: 260 seconds]

17:29 gelatram has quit [Ping timeout: 252 seconds]

18:04 nyah has quit [Remote host closed the connection]

18:22 bauen1 has joined #osdev

18:28 alexander has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

18:30 alexander has joined #osdev

18:36 smach has quit [Ping timeout: 260 seconds]

18:53 wxwisiasdf has joined #osdev

18:53 <wxwisiasdf> hello operating system development irc channel

18:54 <mjg> hello random stranger

18:55 <wxwisiasdf> I am trying to make an OS in a... well, rather unusual hardware board

18:55 <wxwisiasdf> I've seen uClinux but it's apparently dead, any recommendations?

18:56 <wxwisiasdf> i could just rollout my own OS through but i don't feel like reinventing the wheel needlessly :P

18:56 <wxwisiasdf> i just want to knowif there is a rather "port-friendly" OS out there for embedded?

18:57 <GeDaMo> CP/M? :P

18:58 <wxwisiasdf> haha, no :)

19:00 <heat> what's the arch?

19:00 <wxwisiasdf> custom

19:00 <heat> usually as far as I know, netbsd is the goto for super portable shit

19:01 <wxwisiasdf> alright

19:03 gxt_ has quit [Remote host closed the connection]

19:04 gxt_ has joined #osdev

19:05 netbsduser` has joined #osdev

19:05 darkstardev13 has joined #osdev

19:05 archenoth has joined #osdev

19:05 darkstarx has quit [Remote host closed the connection]

19:05 netbsduser has quit [Read error: Connection reset by peer]

19:07 * geist yawns

19:07 <geist> good afternoon everyone

19:07 * mjg yawns back

19:07 Oshawott has quit [Ping timeout: 252 seconds]

19:07 <heat> sup geist

19:07 * mjg looks at adding 'echo' and 'touch' keywords to bmake

19:08 <mjg> if equivalent functionality is not alredy present

19:13 wxwisiasdf has quit [Quit: leaving]

19:23 <geist> noice. not sure gmake has a touch (though it does have direct file access nowadays) but it does have echo in the form of $(info ...)

19:25 <mjg> you can 'echo' in similar manner as well in bmake, but it's not equivalent to regular echo

19:25 <mjg> it is prefixed with some stuff

19:25 <mjg> afair bmake and submake level

19:25 <mjg> anyhow the intent is to whack some of the forks + execs which are clearly avoidable

19:26 <heat> b-b-but that's not the unix philosophy

19:26 <mjg> ... and which happen a lot

19:27 <geist> yah what i never completely understood until someone htat new gnu make ws that gmake itself doesn't actually always start a shell for every line of a rule

19:28 <geist> its only 'complicated' lines that it does, otherwise it runs a lot of them inline. probably basically what you're doing here

19:28 <geist> basically trivial stuff like echos without environment variable substitution it does there

19:28 <dzwdz> i wonder if creating a frankenstein make with embedded gcc could speed up builds

19:28 <mjg> man my poor eyes bleed when i strace bmake

19:28 <mjg> :(

19:28 <heat> geist, it aliases SHELL invocations when SHELL is a well known value

19:28 <mjg> /bin/sh -e -c echo "===> License GPLv2 accepted by the user"

19:29 <mjg> and so on

19:29 <geist> heat: right. and then for trivial lines. if the lines are more complicated then it punts it to the shell

19:29 <mjg> that i don't mind

19:29 <mjg> but if the content is known at compilatin time, PLZ DON'T

19:29 <geist> so i used to try to merge lots of adjacent lines in gmake with the idea to reduce the number of forks, but actually can backfire

19:29 <heat> yeah

19:30 <geist> because combining trivial lines it already wasn't going to fork actually causes it *to* fork

19:32 <mjg> you may appreciate this bit

19:32 <mjg> building freebsd base system -j 104: 30689.14s user 2509.58s system 6453% cpu 8:34.45 total

19:32 <geist> noice!

19:32 <mjg> llvm linker spawns 104 threads every single time

19:32 <mjg> when i limit that to 1....

19:32 <mjg> 30632.01s user 2392.17s system 6384% cpu 8:37.24 total

19:33 <mjg> iow 3 seconds of a difference for spawning 1 thread instead of 104

19:33 <mjg> great win llvm, worth it

19:33 <heat> lld isn't very multithreaded

19:33 <geist> well, is it 104 llvm instances each running 104 threads?

19:33 <mjg> at times yes

19:33 <geist> on what appears to be a 64 core machine

19:33 <mjg> it is 104 thread machine

19:33 <mjg> the build process is not *that* parallel all of the time

19:34 <mjg> plus there is some contention putting stuff off cpu

19:34 <geist> ah yeah, okay that makes more sense

19:34 <geist> yah 100% util on a largeass mulitstage build like that is a holy grail

19:34 <mjg> general point being, they put no upper limit on how many threads they decide to spawn, apart from your thread count

19:34 <mjg> and it gets ridiculous quite fast

19:34 ghee has joined #osdev

19:34 <mjg> --threads=1 links the kernel in ~1.5 seconds

19:35 <geist> yeah i can see that. i wonder if it just proactively spawns the threads but then only uses them in some sort of worker fashion

19:35 <mjg> --threads=8 in 0.6

19:35 <geist> such that in reality only 1 or 2 isused

19:35 <mjg> that's literally what it is doing

19:35 <mjg> makes me a sad panda

19:35 <geist> yah i bet it only gets some capability to parallel with sufficiently large programs that it can carve up

19:35 <mjg> i created a ticket about it severl months ago, no response

19:35 nyah has joined #osdev

19:35 <geist> my guess is you just really wont see much of a win against more plain C codebases

19:35 <mjg> right, i wanted them to estimate on input files instead of going in blindly

19:36 <mjg> they literally spawn this for a hello world man

19:36 <geist> you really need a huge expansion of code that then gets merged for lld to be doing a ton of work

19:36 <geist> or LTO

19:36 <geist> yah

19:36 <mjg> i don't mind for chromium et al

19:36 <mjg> although i don't know how many threads they can really use

19:37 <geist> yah and the 1 vs 8 number does also bring up the point that i've seen with rustc in fuchsia: adding more threads to the LTO linker doesn't spread the same amount of cpu across multiple threads

19:37 <heat> sounds they should use make's job server?

19:37 <geist> each new thread tends to do say 80% of the total work, so you end up burning more and more total cpu time across all of them

19:37 <geist> reason being that each of the workers duplicates a lot of the work, since each of them carves out a chunk of the binary and statrs from scratch as far as codegen and whatnot

19:38 <geist> the wall time is probably faster as you add more threads, but the total cpu time goes up a lot

19:39 <mjg> i don't know what they do in termso f the actual work they need to accomplish

19:39 <geist> so in isolation (one lld on a dedicated machine) you may has well -j as much as you can, but as part of a larger build it's not a win, you almost want -j1 for large builds

19:39 <mjg> i can tell you the llvm linker itself just does not scale re its work list management

19:39 <geist> except then you can end up with one of the linkers being the long pole in the tent

19:39 <mjg> for one all the spawned threads start with taking a global lock to unlink something for them to do

19:40 <geist> like i said i suspect it's more than just that. it's also very possible the linkers are duplicating work

19:40 <mjg> so there is tons of bouncing on and off cpu from the get go

19:40 <mjg> i have no doubt there is waste there as well

19:40 <mjg> just saying how it looks like from OS pov

19:40 <geist> areyou seeing a lot of demand faults? GN, our build system on fuchsia, for example *nails* the heap in really degenerate ways

19:41 <geist> on a linux machine when it's doing a gn gen it's easily >1mil soft faults/sec

19:41 <mjg> ye i'm doing more

19:41 <mjg> i don't have exact numbers stored

19:41 <geist> and on windows WSL it takes like 20 minutes because those soft faults are really badly emulated

19:41 <mjg> part of the poblem is that bmake itself forks into oblivion

19:41 <geist> a different problem, but one of those cases where if you develop a tool on something like linux you might not notice that it'll lean on stuff that linux is well optimized for

19:42 GeDaMo has quit [Quit: Physics -> Chemistry -> Biology -> Intelligence -> ???]

19:42 <heat> >and on windows WSL it takes like 20 minutes because those soft faults are really badly emulated huh?

19:42 <geist> heat: yeah! it was interesting sleuthing

19:42 <mjg> funny bit, but so happens parallel thread creation and destruction is way faster on freebsd than on linux :)

19:42 <geist> note this is WSL1 vs WSL2. WSL2 being just liux in a can runs GN fine

19:42 <mjg> (one of the few things were freebsd is legitimatley faster)

19:42 <heat> why does it have issues soft faulting?

19:42 <heat> windows should still have it no?

19:43 <heat> and since it's anon memory, yadda yadda

19:43 <mjg> what fucks linux up on this is their lack of process abstraction and resulting numerous tasklist_lock acquires

19:43 <geist> heat: it's because it's hitting the heap in a way that causes it to need to expand and shrink the heap aggressively, much past the point where the heap is asking for and freeing large chunks of address space

19:43 <clever> that reminds me, somebody in the osdev discord was mentioning that they like the internal design of linux, but they dislike the userland and syscall api

19:43 <geist> so basically it's wailing on the aspace, adding and removing mappings, and faulting them in

19:43 <clever> and then i was wondering, how hard is it to have a different syscall table for each process?

19:43 <mjg> clever: it is not. bsd can do it.

19:44 <clever> a bit of research later, i found the answer, very hard :P (at least on linux)

19:44 <heat> clever, bsds can do it, linux can (or could?) do it, windows can do it

19:44 <clever> every arch in linux, implements the syscall lookup table differently!

19:44 <mjg> freebsd is doing it a lot with linux emul

19:44 <clever> and for the 2 arches i checked, its a global array of handlers

19:44 <clever> x86-64 implements the lookup in plain c

19:44 <mjg> heat: even illumos can do it! :)

19:44 <clever> arm32 does the lookup in raw asm!

19:45 <geist> heat: what we also found was a) adding removing anon regions in WSL1 is comparitively slow and b) there is some sort of internal lock contention in the WSL1 'vm'. GN is heavily multithreaded and the more threads you add to the mix while fiddling with the aspace == exponential slowdown

19:45 <clever> so if i wanted to add this feature to linux, i would have to modify how every arch handles syscalls

19:45 <mjg> geist: so fuchsia builds on linux?

19:45 <mjg> geist: maybe i'll bench it on freebsd under linux emul

19:45 <geist> and all the demand faults means the whole thing ends up collapsing to single threaded speed and most of the time is spent in the kernel blocked on

19:45 <mjg> geist: it should work(tm)

19:45 <heat> fuchsia only builds on mac and linux

19:46 <geist> it *used* to build under freebsd and netbsd but we rely on a crapton of prebuilts

19:46 <geist> so yeah you'd have to use linux emul

19:46 <geist> might just work

19:46 <mjg> fwiw linux emul is good enough to build the linux kernel

19:46 <geist> i'm sure there will be silly edge cases, there are lots of build scripts that use uname to determine what host it's on, etc

19:46 <geist> so you might have to fake it out at that level

19:46 <mjg> uname lies and claims linux 3. something

19:46 <heat> downvote

19:46 <heat> where's 2.6???

19:47 <mjg> in an optional setting

19:47 <geist> but the build *should* be pretty hermetic. FWIW the amount of toolchains and prebuilt tools fuchsia downloads is largely to be highly hermetic

19:47 <heat> damn right

19:47 <heat> and 2.4?

19:47 <mjg> in linus's butt

19:47 <geist> even 'host' tools that fuchsia builds as part of its run are built with downloaded toolchains

19:47 <mjg> geist: the only worry here is that your stuff is using a syscall which is not implemented

19:48 <mjg> which is not very likely

19:48 <geist> yah it's just tools so probably not

19:48 <geist> gn + ninja + a bunch of prebuilt toolchains

19:48 <geist> and some python3

19:48 <mjg> that should all be fine

19:49 <mjg> well modulo bugs maybe ;)

19:49 <geist> if something ives you trouble it'll probably be a linux Go or Dart binary

19:49 <geist> Go in particular is pretty wonky

19:49 <mjg> fwiw, believe it or not, freebsd can build linux only slightly slower than linux

19:49 <mjg> something like +5% more total real time

19:49 <mjg> i'm working on it

19:50 <geist> noice

19:50 <mjg> it is mostly off cpu time stemming from shit i have not even looked at

19:50 <geist> yah at some point we had a grand plan to buld fuchsia on fuchsia, but that got sufficiently difficult around 2018 when we started leaning more and more on prebuilt toolchain binaries

19:50 <mjg> https://people.freebsd.org/~mjg/linux-build/full-offcpu2.svg

19:51 <geist> there was a brief period where we self hosted for like a month though. back when it wasn't much more than LK + a simple but functional user space

19:51 <mjg> there is a stupid problem where faulting on the same page gets an exclusive lock on it

19:51 <mjg> and then you immediately go off cpu if you wait

19:51 <geist> that was when we could still build on freebsd and netbsd and whatnot too, since it ws just using the LK build system, written in gmake

19:51 <mjg> this can be patched to use shared locking, but there are stupid tech debt thingies which need fixing first

19:51 <mjg> thanks mach vm

19:52 <heat> geist, self hosted?

19:52 <geist> heat: sentence fragment?

19:52 <heat> i would think building fuchsia early on is sufficiently hard

19:53 <geist> actually no, was easier early on

19:53 <geist> since we were mostly just plain C and C++ then, and the LK build system already understood this stuff

19:54 <geist> we had very quickly built up a posix-lite environment with musl + some file systems, and we were using gcc and binutils then, so it was quite easy to get that working

19:54 <heat> interesting

19:54 <mjg> geist: do you happen to have numbers from building fuchsia on something > 16 threads?

19:54 <geist> was a case of being a mid-hobby os class design in terms of being advanced, just could do it much faster because we already had 10-20 people weorking on it

19:55 <geist> heat: since then the component based design and whatnot has pivoted the design somewhat away from being able to do command liney stuff like that

19:55 <geist> which is still somewhat of an open question: how does someone really *use* fuchsia interactively

19:56 <geist> there's always a push internally to get rid of the shell, and a fair amount of folks push back, so it's a bit at a detente

19:56 <geist> but conflict is fine. that's essence of engineering

19:56 <geist> mjg: hmm, well, it's all over the place. depends on which level of fuchsia you build

19:56 <geist> and how much server assist you get. a 16 core machine with no assist, building a 'core' build is i think in the 30-40 minute range?

19:57 <heat> geist, yeah. building fuchsia on fuchsia sounds great though

19:57 <heat> I guess the best chance you have is starnix? but that's probably super slow

19:57 <geist> heat: yeah agreed. we're just so far away from that right now i dunno if we'll get back

19:58 <geist> though it's really a topi for the discord channel

19:58 <mjg> get rid of the shell?

19:58 <geist> mjg: it's hard to verbalize how much fuchsia is *not* a posix system

19:58 <mjg> you mean they don't like the posix-like shell (or whatever you got) or the concept in general?

19:58 <mjg> even windows eventually got powershell

19:59 <geist> yah and i'd say windows is much more aligned with posix than fuchsia is

19:59 <mjg> i'm saying some form of usable command line is probably mandatory, does not ave to pretend to be unix

19:59 <geist> i 100% agreed

19:59 <geist> i am pro-command line team

20:00 <mjg> team command line unite

20:00 <geist> i think there's a contingent of folks that have the notion that you can build a more powerful notion of a sea of components that can be automatically started and solved for some sort of dynamic task

20:00 <geist> which is probably pretty neat, i just dont know how that works with a user sitting in front of it

20:01 <geist> but anyway, i shouldn't complain about it, at least here

20:01 <heat> yeah. i don't know if a fully modular, capability-based system is super usable like that

20:01 <heat> at some point using bash and some pipey bois to make stuff happen surpasses the need for super locked down stuff

20:02 <mjg> worse is better

20:03 <geist> or at least, command line is tuned for what folks like us think. i think there's some slightly different paradigms than what posix shells do that are interesting, but posix shells are kinda the lowest common denominator

20:03 <geist> (vs say, some sort of command line job based thing you got on some other OSes at the time)

20:04 <mjg> unix is shit, let's be real

20:04 <mjg> but it also beats the shit out of typical GUI

20:04 <geist> yah but it's *just* powerful enough to let you buid something powerful on top of it. that's the genius of it

20:04 <geist> the line is drawn just at the right spot

20:04 <mjg> agreed

20:06 <heat> how well does the shell work if you can't even .. though

20:14 dude12312414 has joined #osdev

20:14 dude12312414 has quit [Remote host closed the connection]

20:15 dude12312414 has joined #osdev

20:20 sonny has joined #osdev

20:25 <sonny> I just thought what if ever user gets a copy of the OS and realized that's virtualization

20:26 <dzwdz> what, like the matrix?

20:28 <sonny> no, like vmWare

20:28 <j`ey> what about it?

20:28 <sonny> the only thing that's left is the language based OS I guess

20:28 <sonny> j`ey: nothing, I am just thinking about what is possible

20:30 <geist> hmm, not sure i follow there

20:31 <sonny> language based OS there's no more need for additonal runtime, your server app can just be a function

20:32 <vin> What is the policy that determines when page caches are flushed? Also how many flusher threads are spawned and when? For example if there is a cpu bound workload after a bunch of writes to the page cache, it wouldn't make sense to flush them because it would disrupt the cpu bound workload.

20:33 <vin> Also do these flushes (from kernel pages) to disk happen through DMA?

20:33 <dzwdz> page caches are overrated

20:34 <vin> I am just trying to understand how linux does it dzwdz

20:34 <dzwdz> sorry

20:35 <geist> to answer the latter question (DMA), almost certainly

20:35 <geist> but really it's just a matter of whatever the device driver does

20:35 <geist> for pages that one way or another need to get flushed out to disk/storage they'll go through whatever the driver already does

20:36 <geist> anything halfway modern uses at least some sort of DMA

20:38 <vin> I see, so you are saying the flushing of dirty pages won't have a big impact on CPU load.

20:38 <heat> vin, this is not a linux internals channel

20:38 <heat> but anyway, block devices each have a flusher thread

20:38 <heat> you dirty a page on an inode, that page gets set as dirty, that page gets added to the dirty list in the inode, the inode gets added to the bdev's dirty inode list

20:39 <vin> heat: my bad, the purpose of using something specific (like linux) is to get the conversation started and later generalize to the best possible way to do it.

20:39 <heat> well, these details change from kernel to kernel

20:40 <heat> but anyway, it's done on a time limit

20:40 <klange> We rather specifically push against using Linux as a discussion point here, as there's plenty of other places on Libera to talk Linux.

20:40 <heat> I can't remember the sysfs knob that does it, but there's one

20:40 <klange> Not a rule, but a community preference.

20:41 <heat> and there's an easy way to test this

20:41 <geist> all this aside, as a general rule most systems try to flush dirty pages as a combination of time and memory pressure

20:41 <heat> getrusage(), write to shared file page (mmap), loop while getrusage().faults == oldrusage

20:41 <geist> ie, a) make sure any dirty page doesn't stick around longer than N units of time (say 30 seconds)

20:42 <heat> yeah, you can force this of course

20:42 <heat> evicting an inode from the icache will sync it, fsync(file) will sync it, sync() will sync it

20:42 <geist> and b) try to keep the number of total pages in the system that are dirty below some threshold

20:42 <vin> klange: got it! My experience has only been linux and xv6, hence the question.

20:43 <geist> there are tons of algorithms for this, but generally they are trying to do something along these lines

20:43 <geist> a degenerate case you dont really want to get into is some super high percentage of pages in the system are dirty and waiting for writeback

20:43 <vin> Yes geist that is mostly what I read as well. http://sylab-srv.cs.fiu.edu/lib/exe/fetch.php?media=paperclub:lkd3ch16.pdf

20:43 <geist> generally better in that case to apply backpressure against the process(es) that are generating pages

20:43 <geist> while the writeback happens

20:44 <geist> vs just purely reacting to dirty pages

20:44 <heat> an interesting property of implementing mmap MAP_SHARED and dirtying is that flushing a page needs to mprotect every mapping of that page back to write protected

20:44 <vin> I am curious about how the flush (by multiple threads) can potentially disturb other workloads in a multi-tenant setup

20:45 <vin> disturb in performanc I mean

20:45 <heat> TLB shootdowns, lots of IO queueing (and maybe lock contention)

20:45 <heat> but apart from that, not really

20:46 <heat> and IRQs of course

20:46 <vin> Even if we assume the new workload is not doing any IO, the act of flusher threads being scheduled over the workload can be bad.

20:46 <geist> sure

20:46 <geist> but that's just fact of life. the system has to do work so stuff can continue to do work

20:47 frkzoid has quit [Ping timeout: 244 seconds]

20:47 <geist> in general the cpu % of threads doing these background flushes are pretty low compared to user space work, probably. definitely more so now than say 20-30 years ago

20:47 <heat> yeah, services can expect to get interrupted in high performance systems

20:48 <heat> at cloudflare our edge network machines run all sorts of performance-critical services at the same time

20:48 <vin> Right, I would like to quantify this potential performance degradation. Given there will be larger page caches in the future (with CXL), background flushes might need rethinking?

20:49 <heat> why will there be larger page caches?

20:50 <vin> larger number of pages cached. Since per machine DRAM capacity is poised to grow drastically with the number of cores reamining pretty much the same.

20:50 <heat> and why would that matter, relly

20:50 <vin> With this CXL era

20:50 <heat> well, that's wrong then

20:50 <vin> how?

20:51 <heat> increasing memory without increasing cpus will lead to some interesting inbalances

20:51 <vin> yup

20:51 <mjg> please reduce memory access time kthx

20:52 <heat> probably CXL will only impact people on the compute end

20:52 <heat> and writeback isn't something you do frequently on compute, probably

20:52 <heat> at least minimizing that is the goal

20:53 <heat> at the end of the day, no sane person is running 256GB of ram on a PATA hard drive and a core-duo

20:53 <heat> when RAM increases, you get more CPUs, faster CPUs, faster storage with more IO queues, etc

20:54 <heat> and really, what alternative do you have to the current writeback system(s)? they're all going to be similar to what we have now

20:55 <heat> if you have assign a thread to multiple bdevs, writeback will be slower

20:55 <heat> if you assign multiple threads to a single bdev, then that's just weird

20:56 <heat> I guess you could theoretically have one thread per io queue?

20:57 <vin> So far it is believed that memory capacity is the bottleneck for most apps, so you were forced to run the app on two servers communicating over the network. This is what CXL is trying to solve, avoiding the need to scale out just because you don't have enough per node memory capacity.

21:14 heat has quit [Ping timeout: 260 seconds]

21:14 heat_ has joined #osdev

21:29 sonny has left #osdev [#osdev]

22:08 Oshawott has joined #osdev

22:11 archenoth has quit [Ping timeout: 268 seconds]

22:14 <zid> Good news, I managed to make the cd-rom drive on the playstation send me a spurious INT 0, and break every single emulator

22:25 ptrc_ has joined #osdev

22:25 eschaton_ has joined #osdev

22:25 fkrauthan_ has joined #osdev

22:25 <geist> hah nice. they dont bother emulating cdrom int support?

22:25 jstoker has quit [Ping timeout: 268 seconds]

22:25 antranig| has joined #osdev

22:26 <geist> is the interface to the cdrom on those things even remotely standard?

22:26 tomaw_ has joined #osdev

22:26 dayimproper has quit [Ping timeout: 244 seconds]

22:26 PotatoGim_ has joined #osdev

22:26 Patater has quit [Ping timeout: 240 seconds]

22:26 travisg_ has joined #osdev

22:26 froggey has quit [Ping timeout: 268 seconds]

22:27 ornitorrincos_ has joined #osdev

22:27 bleb_ has joined #osdev

22:27 HeTo_ has joined #osdev

22:28 nyah_ has joined #osdev

22:28 sbalmos1 has joined #osdev

22:28 pretty_d1 has joined #osdev

22:28 Emil_ has joined #osdev

22:28 sbalmos has quit [Killed (NickServ (GHOST command used by sbalmos1))]

22:28 sbalmos1 is now known as sbalmos

22:28 joe9_ has joined #osdev

22:28 shikhin_ has joined #osdev

22:29 shikhin has quit [Killed (NickServ (GHOST command used by shikhin_))]

22:29 shikhin_ is now known as shikhin

22:33 nyah has quit [*.net *.split]

22:33 pretty_dumm_guy has quit [*.net *.split]

22:33 puck has quit [*.net *.split]

22:33 joe9 has quit [*.net *.split]

22:33 elastic_dog has quit [*.net *.split]

22:33 ptrc has quit [*.net *.split]

22:33 tomaw has quit [*.net *.split]

22:33 HeTo has quit [*.net *.split]

22:33 Clockface has quit [*.net *.split]

22:33 fkrauthan has quit [*.net *.split]

22:33 ornitorrincos has quit [*.net *.split]

22:33 eschaton has quit [*.net *.split]

22:33 aejsmith has quit [*.net *.split]

22:33 stux has quit [*.net *.split]

22:33 weinholt has quit [*.net *.split]

22:33 bleb has quit [*.net *.split]

22:33 Emil has quit [*.net *.split]

22:33 travisg has quit [*.net *.split]

22:33 Raito_Bezarius has quit [*.net *.split]

22:33 PotatoGim has quit [*.net *.split]

22:33 antranigv has quit [*.net *.split]

22:33 tomaw_ is now known as tomaw

22:33 ptrc_ is now known as ptrc

22:33 travisg_ is now known as travisg

22:33 bleb_ is now known as bleb

22:33 fkrauthan_ is now known as fkrauthan

22:33 PotatoGim_ is now known as PotatoGim

22:33 elastic_dog has joined #osdev

22:39 Raito_Bezarius has joined #osdev

22:39 puck has joined #osdev

22:55 pretty_d1 has quit [Quit: WeeChat 3.5]

22:58 nyah_ is now known as nyah

23:01 dayimproper has joined #osdev

23:09 jstoker has joined #osdev

23:11 aejsmith has joined #osdev

23:13 Patater has joined #osdev

23:15 nyah has quit [Ping timeout: 268 seconds]

23:21 antranig| is now known as antranigv

23:31 <heat_> nothing I enjoy more than debugging refcount bugs

23:33 hbag has joined #osdev

23:34 matt__ has joined #osdev

23:34 matt__ is now known as freakazoid333

23:36 heat_ is now known as heat

23:36 <ebb> I would simply count the number of references

23:37 <heat> great tip

23:37 <heat> would've never come up with that on my own

23:37 <ebb> It hasn't let me down so far

23:42 freakazoid333 has quit [Ping timeout: 255 seconds]

23:44 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

23:45 <zid> heat: have you considered refcounting your refcounts

23:45 <zid> then if they don't match, you have a refcount bug

23:45 * zid preens

23:47 <heat> refcountsan?

23:48 <zid> refcountcountsan

23:49 <zid> for when you want to check that your refcount count doesn't have UB

23:59 <moon-child> this is why tracing gc is better

23:59 <moon-child> definitely not frustrating to debug