#osdev on 2022-09-22 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 LittleFox has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

00:00 <jimbzy> Have you ever read The Blue Nowhere?

00:01 <jimbzy> By Jeffery Weaver, or maybe it was Deaver?

00:01 LittleFox has joined #osdev

00:01 LittleFox has quit [Remote host closed the connection]

00:01 <geist> no! should i?

00:01 LittleFox has joined #osdev

00:01 <geist> looks interesting!

00:02 thinkpol has quit [Remote host closed the connection]

00:02 <jimbzy> I thought it was pretty good.

00:03 thinkpol has joined #osdev

00:04 <heat> https://devblogs.microsoft.com/oldnewthing/20220919-00/?p=107195

00:04 <bslsk05> devblogs.microsoft.com: Why load fs:[0x18] into a register and then dereference that, instead of just going for fs:[n] directly? - The Old New Thing

00:05 <heat> thoughts?

00:09 outfox has quit [Ping timeout: 255 seconds]

00:17 <klys> lea bx,[fs:0x18] ; cli ; mov eax,[bx] ; mov [bx],edx ; sti ; ret

00:17 <geist> watching a few recent bjork videos (new upcoming album)

00:17 <geist> unable to hold a coherent thought

00:18 <klys> lea bx,[fs:0x18] ; cli ; mov eax,[fs:bx] ; mov [fs:bx],edx ; sti ; ret

00:21 <klys> int 6: cpu-generated (80186+): invalid opcode

00:22 gog has quit [Ping timeout: 252 seconds]

00:23 [itchyjunk] has quit [Ping timeout: 264 seconds]

00:25 outfox has joined #osdev

00:26 [itchyjunk] has joined #osdev

00:32 <zid> oh that post made *no* sense until the comments clarified that fs+0x18 holds the fsbase

00:33 elastic_dog has quit [Ping timeout: 248 seconds]

00:34 <geist> i started to read it but it was prettyu confusing and using the syntax i dont read much, and just below the threshold of interesting

00:37 <heat> yeah I took some time to figure out what 0x18 was

00:38 elastic_dog has joined #osdev

00:39 <geist> i guessed what it was (usually it's canonical to put it at offset 0) but just didn't feel like diving into that set of x86 minutae today.

00:40 <heat> is x18 on arm64 tp?

00:40 <geist> the abi says that x18 can be used by the OS to do what it wants, otherwise its another temporary reg

00:41 <geist> but yeah, usually x18

00:41 <heat> ahh

00:41 <heat> what's ffixed-x18 for then?

00:41 <geist> no idea what windows does, but iirc their abi is fairly close to the standard ELF one

00:41 <heat> do gcc/clang default to x18 = temp reg?

00:41 <geist> -ffxied-x18 is precisely for that: tell the compiler not to use the register for anything

00:41 <geist> probably. or it's defined in the triple

00:41 <geist> but for -elf it may define it as a temp

00:42 <heat> windows seems to use it for tls

00:42 <heat> which is interesting

00:42 <geist> yah, i think linux does too

00:42 <zid> cute that fs:18 and x18 go together

00:42 <zid> wonder how that happened

00:42 <heat> is there a big disadvantage in reading tpidr instead of x18?

00:42 <geist> iluminati confirmed

00:42 <geist> oh oh yeah, duh, tpidr. sorry, my head is a little frazzled

00:43 <geist> yeah linux uses tpidr_el0, not sure it uses x18 in user space.

00:43 <geist> kernel has of course its own use for those things, but that's a different can of worms

00:43 <geist> windows, i have no idea

00:43 <heat> I think linux uses ffixed-x18 in the kernel

00:43 <geist> yah fuchsia absolutely dees to

00:44 <geist> for user space fuchsia uses x18 for the safe call stack too

00:44 <geist> as well as the kernel

00:44 <heat> yeah I think that's mandated by the sanitizer?

00:44 <geist> at least is mandated by safe stack, since you need a ready to use register holding the pointer or it's not usable

00:45 <heat> yeah

00:46 <geist> i guess you could move it from tpidr, load the SS pointer, use it, store it back

00:46 <geist> but it makes a generally free security thing somewhat more expensive

00:46 <heat> what I'm wondering is: why x18 for tls? is mrs TPIDR_ELx, reg slower?

00:46 <geist> no idea

00:46 <geist> that's what windows does?

00:47 <heat> well, you do that

00:47 <heat> so does the linux kernel

00:47 <geist> uhuh?

00:47 <geist> wait wait. now i'm paying attention. what do you think is going on?

00:47 <geist> clean slate here.

00:47 <heat> you're using x18 to store the per-thread/per-cpu stuff right?

00:47 <geist> no. not at all. that's tpidr

00:47 <geist> x18 holds the safe stack

00:47 <heat> ah!

00:48 <zid> ah, sounded like you were, think you meant "we are reservng x18" earlier, not "we are using x18 for tls"

00:48 <geist> yes and then i corrected myself but i wasn't clear enough

00:48 <zid> It's a magic trick, indirection

00:48 <geist> we are reserving x18 but it's for another use

00:49 <geist> tpidr is used for the TLS bits in both linux and fuchsia. i have no idea if linux does something with x18 in user space. (i think it doen't)

00:49 <geist> for kernel, well it's all different, but that's how kernel is.

00:49 <zid> Talk about tls then confirm you are reserving x18 also, but not why. Throw them off the scent but don't give the lawyers any ammo to go after you for perjury. Smart.

00:49 <geist> fine. anyway yes just to be clear that's what's going on

00:49 <zid> yea we know

00:49 <heat> the linux kernel seems to use x18 to get the current task

00:49 <zid> we moved on to jokes

00:50 <geist> the arm ABI says that x18 can be a temp or used by the OS. ie it has no other purpose

00:50 <heat> but the switch was fairly recent

00:50 <geist> and it being the last temp means it is the last to be allocated

00:50 <heat> I assume there's no penalty in mrs tpidr, reg

00:50 <geist> x19 is the first of the saved regs

00:50 <geist> also to be more confusing there's two of them in user space: tpidr_el0 and tpidrro_el0

00:51 <geist> we (fuchsia) dont use the latter in user or kernel, and i think linux uses the latter for kernel purposes, so same thing in user space

00:52 <heat> what's tpidrro used for?

00:52 <heat> i assume it's read-only from the title

00:52 <geist> that's right

00:53 <geist> it's the same as tpidr except read only to EL0

00:53 <geist> but RW to EL1+

00:53 <heat> so how is that useful?

00:53 <geist> beats me

00:54 <geist> you could put something in it that you dont want used to fuck up, like, say the current cpu #

00:54 <geist> and then not context switch it

00:54 <heat> ah shit, I think you meant darwin?

00:54 <geist> or, i dunno, maybe a pointer to the current vdso or something

00:54 <geist> i dunno is that what darwin does with it?

00:55 <heat> "In Apple open source, TPIDRRO_EL0/TPIDRURO is used to save the CPU number,"

00:55 <geist> ah okay there you go

00:55 <heat> https://opensource.apple.com/source/xnu/xnu-4570.1.46/osfmk/arm/cswitch.s.auto.html

00:55 <bslsk05> opensource.apple.com: cswitch.s

00:55 Ram-Z has quit [Ping timeout: 252 seconds]

00:55 <geist> makes sense. you set it up once and then just leave it that way

00:55 <geist> i dunno, i dont look at darwin source

00:56 <geist> so yeah i guess that's a reason for it

00:57 <geist> i dont think we use it at all in fuchsia, though maybe the need would arise at some point

00:57 <geist> i think linux hard fixes it at 0 when in user space, and uses it somewhat like the SSCRATCH register in riscv, where it's used for temporary bits when the kernel is in kernel space

00:57 <geist> someone had mentioned it can hold an anchor when in EL1 so the cpu can detect a recursive stack overflow, but then set it to 0 when in user space

00:57 <geist> or somethig like that

00:58 <geist> but i honestly dunno off the top of my head

00:59 <geist> or use it as a temporary scratch space to move a local reg into when taking an EL1 -> El1 exception so you get a reg to do some stack work with

00:59 <geist> much like how you use the sscratch reg in riscv in general

01:00 <geist> only requirement there with tpidrro is you'd have to make sure you zero it out before switching to EL0 or you'd leak kernel info

01:00 <heat> yeah

01:01 <geist> i think holding the current cpu number for user space doesn't really get you anything IMO. in the case of rdtscp where it simultaneously returns the time stamp and the cpu number, that makes total sense,

01:02 <geist> you are guaranteed that both things happened on the same cpu

01:02 <geist> but in the case of having a separate, preemptable instruction to read the cpu # i dont think that'd be particularly helpful

01:02 <heat> why? you skip a syscall or something

01:03 <geist> sure but the current cpu # is only really meaningful if it corresponds to something like a time stamp

01:03 <geist> which you can't gather atomically

01:03 <geist> otherwise it's just a piece of info that probably isn't worth burning a whole register on

01:03 <mrvn> and the timestamp is meaningless if the cpu number change since the last time you checked

01:04 <heat> struct timespec ts; clock_gettime(CLOCK_MONOTONIC, &ts); unsigned long cpunr; __asm__ __volatile__("mrs tpidrro_el0, %0" :: "=r"(cpunr)); printf("Event X on time %blah, cpu %lu", &ts, cpu_nr);

01:04 <heat> you don't need the timestamp at the same time because time isn't supposed to shift

01:04 <geist> yah but the os can preempt between the clock and the tpidr

01:05 <mrvn> heat: but that's something different than the rdtscp

01:05 <heat> i know

01:05 <heat> but you get the same thing, the time + cpunr

01:05 <mrvn> if the time you read is global then the core you get it on is irrelevant

01:05 <heat> yes

01:05 Vercas6 has quit [Write error: Connection reset by peer]

01:05 gxt has quit [Write error: Connection reset by peer]

01:05 opal has quit [Write error: Connection reset by peer]

01:06 <geist> like, it's nice to have the current cpu number, but it's not really that helpful if you didn't get it atomically with the time stamp

01:06 gxt has joined #osdev

01:06 <geist> and even the latter isn't that useful in a constant TSC world

01:06 opal has joined #osdev

01:06 <mrvn> geist: is the tsc constant even if a core was powered down a bit?

01:06 <heat> what's the cpu number useful for?

01:06 <geist> but since constant/invariant TSC wasn't always the case

01:07 Vercas6 has joined #osdev

01:07 <geist> which is bak when rdtscp was added

01:07 <mrvn> heat: to see if the core changed since the last measurement

01:07 <geist> a) assume the TSC doesn't tick the same on every core

01:07 <geist> b) then read the TSC + cpu number atomically, now you can use this to measure time between two points for benchmarking purposes

01:07 <geist> and if the two values dont have the same cpu # you can toss it

01:08 Ram-Z has joined #osdev

01:08 <mrvn> how does constant TSC work in bigLITTTLE world?

01:08 <geist> mrvn: there are cpuid bits that say yes or no on that

01:08 <geist> if it is constant + invariant TSC then they tick at the same rate, or those cpuid bits are lying

01:09 <heat> mrvn, the TSC ticks based on the FSB

01:09 <heat> FSB frequency, I mean

01:09 <geist> well, not precisely. it's complicated

01:09 Iris_Persephone has joined #osdev

01:09 <Iris_Persephone> hia, don't mind me

01:09 <geist> but basically a modern x86 says 'i have a constant & invariant TSC which ticks at <rate to be determined via a number of ways>'

01:09 <Iris_Persephone> just lurking a little

01:10 <mrvn> anyway the TSC used to be different per core so any change in core # makes measuring differences meaningless

01:10 <geist> but if that's the case, then it *usually* ticks are some fairly high rate similar to base cpu frequency

01:10 <geist> but correct, yes. if the TSC doesn't tick at the same rate then the core # is important

01:10 <geist> mrvn: re: big.LITTLE on x86, specifically alder lake i have personally confirmed that TSC ticks a tthe same rate on all cores

01:10 <geist> which makes sense, or it'd totally fuck everything up

01:11 <geist> for ARM the big.LITTLe stuff and the tick rate of the time stamp counter is *always* global and constant

01:11 <geist> it's simply defined as such

01:12 <geist> re TSC and apic tick rate, i was just a few days ago fixing a couple bugs i fuchsia here, so it's fresh on my mind

01:12 <heat> I was $today years old when I found out MSR_PLATFORM_INFO is a thing and has the frequency

01:12 <geist> exactly

01:13 <geist> i think section 19.7.3 or something. have a CL up on fuchsia for review to tighten that up

01:14 <geist> here's the problem (and the bug i'm trying to fix): all of that is fine and dandy until you're in a hypervisor, in which case all of that fixed frequency nonsense is completely gone. so the fix is to if in the presence of hypervisor, fall back to calibration for the apic and tsc *unless* cpuid 15h/16h is present

01:14 <geist> which tells you exactly what the freq is

01:14 <geist> anything post about ice lake just fills that in on intel, and AMD still doesn't

01:15 <JerOfPanic> hi

01:15 <geist> the problem with MSR_PLATFORM_INFO is it's very specific to what gen intel cor eyou have, so you already need a bunch of code to detect precisely which core you're on, etc

01:15 <geist> it's all an annoying mess

01:16 <geist> ARM just straight up has a register that firmware is supposed to fill in that tells you the tick rate. thank you arm

01:18 <heat> geist, what's the problem with hypervisors?

01:18 <heat> also https://fuchsia-review.googlesource.com/c/fuchsia/+/727856 for context

01:18 <bslsk05> fuchsia-review.googlesource.com <no title>

01:18 <geist> their APIC tick rate may be something else

01:19 <geist> (which is precisely the case here)

01:19 <geist> so the falling back to assuming it's 24 or 25Mhz or so, according to the intel manual, is invalid

01:19 <heat> so you can't derive the APIC tick rate from the FSB frequency you measure?

01:19 <geist> you have to measure it

01:19 <heat> yeah

01:19 <geist> vs assuming it's 24/25/100/etc mhz, like the manual says

01:20 <heat> AHA yes

01:20 <heat> this is the fuckery I was thinking of

01:20 <geist> so it's about short circuiting the code we have there that says 'ah i know this is a sandy bridge/skylake/etc, and cpuid 15h doesn't say what the FSB is but i know it's X'

01:20 <heat> KVM hardcodes the FSB frequency

01:20 <heat> as in, the APIC frequency

01:21 <geist> the TL;DR is the code we have here was assuming the APIC was ticking at 25mhz, when it was actually ticking at 1000mhz because KVM

01:21 <heat> which is why

01:21 <geist> so we were firing timers at 40x rate

01:21 <heat> gEfiMdePkgTokenSpaceGuid.PcdFSBClock|1000000000 per OVMF

01:21 <heat> hardcoded

01:21 <heat> OVMF doesn't even attempt to calibrate anything

01:21 <geist> so i short circuited it so that if we weren't explicitly told in cpuid 15h, return 0 here, which causes it to calibrate elsewhere

01:21 <geist> (and we're in a hypervisor)

01:22 * JerOfPanic is 63. day 0 smokes on smoking cessation program quiting - on British American Tobacco's Zonnic nicotine replacement product

01:22 <JerOfPanic> ;-P

01:22 <heat> i'm fine with measuring the APIC timer tbh

01:22 <JerOfPanic> two months, never did this before since I began smoking in China on 2009

01:22 <heat> I'd just really like a way to get the TSC frequency directly, everywhere

01:23 <geist> yah there was a bit of discussin on chat as to thwether or not it's worth even bothering returning a hard coded apic value

01:23 <geist> you can argue that unless cpuid 15h tells you precsely what it is, just calibrate it

01:23 <geist> i may just do that in a later CL

01:23 <mrvn> with kvm who knows how much cpu time the guest runs anyway

01:23 <JerOfPanic> I can code again :-O

01:23 <geist> cpuid 15h of course tells you what the TSC is ticking at, if present

01:23 <heat> with a precise enough TSC you will get precise enough measurements of the APIC frequency

01:24 <geist> *or* the KVM pv_clock stuff

01:24 <heat> yea

01:24 <geist> really the interesting thing is i fyou're using the apic TSC deadline mode, which more modern stuff uses, you dont even need to know the apic tick rate

01:24 <heat> IIRC there's a good bit of errata on TSC deadline

01:24 <vin> How important is it for system developers to learn rust? I see a lot of new projects being started in it.

01:24 <geist> for fuchsia we dont even bother reading/calibrating the rate if using apic non deadline

01:25 <vin> My only concern is rust won't be able to take years of optimization C/C++ went through, so for performace code will it really be faster

01:26 Matt|home has quit [Quit: Leaving]

01:26 <geist> my experience is tha tmost folks i know that use rust or write large systems code, performance isn't the primary concern

01:26 <geist> it's a concern, but you choose rust because you want safe/etc code first and foremost

01:27 <geist> and the fact that it's also pretty quick is a plus

01:27 <heat> as late as kabylake: https://www.intel.com/content/dam/www/public/us/en/documents/specification-updates/7th-gen-core-family-spec-update.pdf

01:27 <heat> "When the local APIC (Advanced Programmable Interrupt Controller) timer is

01:27 <heat> configured for TSC-Deadline mode, a timer interrupt may be generated much earlier

01:27 <heat> than expected or much later than expected"

01:28 <heat> when messing around with IA32_TSC_ADJUST

01:28 <geist> yah much earlier on occasion is fine, much later is a problem

01:28 <vin> geist: I am trying to figure it if it is for me. I mostly do independent research and write protoypes rather than production ready code.

01:28 <geist> vin: yeah honestly i can't tell ya. i'm gonna have to learn it more soon too, i'm sure. i have dabbled in rust enough that i can kinda read it

01:28 <geist> but dont really know how to speak it, per se

01:29 <vin> I guess I will take the plunge and try it for a project.

01:30 <Iris_Persephone> I read the wiki, saw all the warnings about how complex this was, but... it is just now starting to hit me

01:30 <heat> erm

01:30 <heat> we're talking about complex, late game shit

01:31 <heat> as a beginner nothing of what we're talking about is relevant

01:31 <heat> there's layers to osdev

01:35 <jimbzy> True story, heat.

01:38 <heat> geist, oh yes something I forgot to ask: why was the isb needed after the write to tpidr_el1?

01:38 <geist> possible it isn't

01:38 <heat> https://stackoverflow.com/questions/64856566/what-is-the-purpose-of-thread-id-registers-like-tpidr-el0-tpidr-el1-in-arm <-- according to this, it used to be there in fuchsia

01:38 <bslsk05> stackoverflow.com: kernel - What is the purpose of Thread ID registers like TPIDR_EL0/TPIDR_EL1 in ARM? - Stack Overflow

01:38 <heat> but now it's not

01:39 <heat> i'm struggling to understand where I need barriers

01:39 <geist> i think it was just an abundance of caution. ie, isb after writing to MSRs unless it can be proven you dont need to

01:39 <geist> oh there's a bunch of verbiage in the ARM manual about precisely this topic

01:39 <geist> i think there's even a whole section about which ones you do and dont need to

01:39 <Iris_Persephone> At this moment I am just trying to learn everything I can - I didn't expect to be at the point where I could actually code anything for _years_

01:39 <heat> I've noticed you need to isb and dsb immediately after writing to page tables because the ARM cpu can speculate like crazy

01:39 <geist> correct

01:40 <Iris_Persephone> So you guys hopefully won't mind if I stay for a little, just try to pick things up?

01:40 <heat> Iris_Persephone, just start

01:40 <heat> https://wiki.osdev.org/Bare_bones

01:40 <bslsk05> wiki.osdev.org: Bare Bones - OSDev Wiki

01:40 <geist> yah you'll learn a lot more by starting and stumbling a bit

01:41 <geist> otherwise you'll just get overwhelmed

01:43 <jimbzy> I love those bare bones projects.

01:44 <jimbzy> I learned a lot by working through them and breaking them.

01:57 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

02:00 smeso has quit [Quit: smeso]

02:02 gxt has quit [Ping timeout: 258 seconds]

02:04 gxt has joined #osdev

02:12 smeso has joined #osdev

02:13 <geist> also that's what we're here for: to provide help for those that are breaking things

02:14 <Iris_Persephone> I was half-expecting you guys to tear me to pieces for being uninformed

02:14 <jimbzy> Hah.

02:14 <jimbzy> How long have you been trying to help me fix the things I break, geist? 20 years? XD

02:15 <geist> Iris_Persephone: oh gosh no, everyone starts somewhere

02:15 <geist> jimbzy: heh, feels like it

02:15 <geist> though unless you've been on as another nick you've probably only been here maybe 5 or 6 years? i forget

02:16 <jimbzy> I honestly don't remember my old nickname.

02:16 <geist> or (probably) much longer and my concept of time is off

02:16 <geist> especially the last few years

02:16 <jimbzy> You were working at Danger if that narrows it down.

02:17 <geist> okay, yeah that was 20 years

02:17 <geist> you musta been on another nick back then

02:17 <jimbzy> Yeah.

02:18 <Iris_Persephone> The wiki says that beginner-friendly Linux distros like Mint are not recommended; any specific reason why this is?

02:18 <jimbzy> You or air told me to get the dinosaur book at order hard-copies of the Intel manuals.

02:18 <geist> also fun fact: the first Danger Hiptop was released 20 years ago on october 1st, so 20th anniversay will be next weekened

02:18 <jimbzy> That's crazy.

02:19 <heat> Iris_Persephone, that's bs

02:19 <geist> Iris_Persephone: dunno. I use mint linux myself

02:19 <geist> and yeah i'd generally regard that as bs. indeed

02:19 <heat> where does that say?

02:20 <Iris_Persephone> Getting Started, subheading "Choosing your development environment"

02:20 <jimbzy> Who has the time to roll a custom gentoo distro or build slackware?

02:20 <geist> a statement like that with no real backed up reason shouldn't be on there, and i cant think of a good rationale for it

02:20 <geist> but all that aside, to be clear this channel and the wiki aren't strictly speaking connected. just defacto

02:21 <Iris_Persephone> Ah, the page I got this link from just said "partnered with" so I wasn't sure of the actual connection

02:21 <heat> Best distros for kernel development are (but keep in mind this is also a matter of personal taste, so these distros are not required rather suggested, and they usually require some experience): Arch, Gentoo, Solus, Slackware, void etc. even Puppy.

02:21 <heat> lmao

02:22 <geist> heh yeah. and i take a bit of umbrage to the notion that mint linux is not a general purpose distro

02:22 <geist> shows that whoever wrote it really hadn't used it directly

02:22 <geist> i get the idea that dont use a linux distro that's trying to hide the linux side of things (ie no development tools, no command line), but i think those are more of an exception than the norm

02:22 <jimbzy> heat, It makes OS dev a lot easier because you're spending all your time manually configuring your OS and build environment. :P

02:23 <jimbzy> You can't freak out over your build script if you can't get the buildtools to build. Think about it.

02:23 <geist> anyway this whole getting started page reads a lot like a bunch of blabbing about whatever someone's experience was

02:24 <geist> instead of a bulleted list of things to set up or whatnot

02:24 <heat> fixed that linux distros thing

02:25 <geist> boom.

02:25 <heat> that whole paragraph was bullshit inc.

02:25 <jimbzy> The system works.

02:25 <heat> you're not a real kernel hacker unless you use gentoo btw

02:25 <heat> or linux from scratch

02:26 <Iris_Persephone> I was actually just in the middle of doing LFS

02:26 <jimbzy> You call that "operating"? I use a bank of 128 toggle switches.

02:26 <jimbzy> Unlabeled.

02:26 <Iris_Persephone> I think I borked my Mint install, though, after my power got cut

02:26 <kazinsal> is it really a computer if you don't have to toggle in your bootloader at poweron?

02:27 <jimbzy> I know, right?

02:29 <geist> it's just an appliance if it boots itself

02:30 <heat> just an appliance? I'll let you know I flashed coreboot on my fridge

02:30 divine has quit [Ping timeout: 265 seconds]

02:35 <Iris_Persephone> I think I'll start Bare Bones once I get my LFS in a semblance of normality, thanks guys!

02:35 <heat> use linux mint

02:36 <heat> "If you are unsure, try Ubuntu, Fedora or Linux Mint. " <-- just to spite whoever wrote that before

02:36 <Iris_Persephone> I want to finish one project before I start another :p

02:37 <jimbzy> Ubuntu works well enough for me.

02:38 <jimbzy> Heck, just using Debian would be an improvement over the ones listed before.

02:38 <heat> oh wait

02:38 <heat> it was bzt

02:38 <heat> lmao

02:38 <heat> dude will track me down and beat me up

02:40 <heat> jimbzy, using debian is never an improvement

02:42 <Iris_Persephone> Oh!

02:42 <Iris_Persephone> My Mint _isn't_ borked

02:47 [itchyjunk] has quit [Read error: Connection reset by peer]

03:24 divine has joined #osdev

03:24 divine has quit [Read error: Connection reset by peer]

03:29 divine has joined #osdev

03:33 divine has quit [Client Quit]

03:34 divine has joined #osdev

03:48 heat has quit [Ping timeout: 250 seconds]

04:17 Iris_Persephone has quit [Ping timeout: 264 seconds]

04:24 Iris_Persephone has joined #osdev

05:49 vdamewood has joined #osdev

05:55 vinleod has joined #osdev

05:55 vdamewood is now known as Guest8017

05:55 Guest8017 has quit [Killed (calcium.libera.chat (Nickname regained by services))]

05:55 vinleod is now known as vdamewood

06:08 SGautam has joined #osdev

06:35 Iris_Persephone has quit [Read error: Connection reset by peer]

06:35 Iris_Persephone has joined #osdev

07:06 GeDaMo has joined #osdev

07:28 Vercas6 has quit [Quit: Ping timeout (120 seconds)]

07:29 Iris_Persephone has quit [Ping timeout: 252 seconds]

07:32 Iris_Persephone has joined #osdev

07:39 Persephone has joined #osdev

07:43 Iris_Persephone has quit [Ping timeout: 264 seconds]

07:45 Vercas6 has joined #osdev

07:46 Persephone has quit [Remote host closed the connection]

07:46 Persephone has joined #osdev

07:49 scoobydoo has quit [Ping timeout: 265 seconds]

07:57 scoobydoo has joined #osdev

08:00 Persephone has quit [Ping timeout: 244 seconds]

08:01 Persephone has joined #osdev

08:14 Persephone has quit [Remote host closed the connection]

08:14 Persephone has joined #osdev

08:24 opal has quit [Ping timeout: 258 seconds]

08:25 opal has joined #osdev

08:27 Persephone has quit [Remote host closed the connection]

08:27 Persephone has joined #osdev

08:40 Persephone has quit [Remote host closed the connection]

08:40 Persephone has joined #osdev

08:54 vdamewood has quit [Quit: Life beckons]

08:54 Persephone has quit [Remote host closed the connection]

08:55 Persephone has joined #osdev

09:00 opal has quit [Remote host closed the connection]

09:01 opal has joined #osdev

09:08 Persephone has quit [Remote host closed the connection]

09:08 Persephone has joined #osdev

09:14 bgs has joined #osdev

09:20 <zid> https://cdn.discordapp.com/attachments/642855427222143001/1022372365902032946/VS5TBVV.jpg

09:22 Persephone has quit [Ping timeout: 248 seconds]

09:25 Persephone has joined #osdev

09:40 Persephone has quit [Ping timeout: 264 seconds]

09:42 Persephone has joined #osdev

09:49 bgs has quit [Remote host closed the connection]

09:59 Persephone has quit [Ping timeout: 268 seconds]

10:01 Persephone has joined #osdev

10:15 Persephone has quit [Ping timeout: 264 seconds]

10:18 Persephone has joined #osdev

10:19 <mjg> real life unix question

10:19 <mrvn> real life unix answer

10:20 <mjg> is there a fcntl F_GETSIZE or whatever other name in any unix system?

10:20 <mrvn> see "CONFORMING TO"

10:20 <mjg> i know postgres is using funny games with lseek to find the size and hopefully avoid full blown stat in the process

10:20 <mjg> what?

10:20 <mrvn> SVr4, 4.3BSD, POSIX.1-2001. Only the operations F_DUPFD, F_GETFD,

10:20 <mrvn> F_SETFD, F_GETFL, F_SETFL, F_GETLK, F_SETLK, and F_SETLKW are specified

10:20 <mrvn> in POSIX.1-2001.

10:21 <mjg> i just came up with the flag

10:21 <mjg> and the name

10:21 <mjg> the q is if there is a functionality like that somewhere already

10:21 <mjg> if so i would reuse their name

10:23 <mrvn> I don't see such a flag mentioned at all. seek + tell is the way to get the size without a stat I think.

10:27 <mjg> wow darwin really went at it with adding F_ flags

10:27 <mjg> interestingly they have F_SETSIZE (!)

10:27 <mjg> bsd/sys/fcntl.h:#define F_SETSIZE 43 /* Truncate a file. Equivalent to calling truncate(2) */

10:27 <mjg> but no F_GETSIZE

10:27 <mjg> pretty weird on that front

10:28 <LittleFox> wat

10:28 <mrvn> They probably just implemented a bunch of syscalls that affects files in a single handler.

10:28 xenos1984 has quit [Read error: Connection reset by peer]

10:33 Persephone has quit [Ping timeout: 260 seconds]

10:37 Persephone has joined #osdev

10:47 xenos1984 has joined #osdev

10:47 isaacwoods has joined #osdev

10:51 Persephone has quit [Ping timeout: 244 seconds]

10:54 Persephone has joined #osdev

11:08 SGautam has quit [Quit: Connection closed for inactivity]

11:08 Persephone has quit [Ping timeout: 264 seconds]

11:09 Persephone has joined #osdev

11:23 Persephone has quit [Ping timeout: 244 seconds]

11:28 Persephone has joined #osdev

12:39 lkurusa has joined #osdev

12:43 CYKS has quit [Quit: Ping timeout (120 seconds)]

12:43 CYKS has joined #osdev

13:16 thatcher has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

13:27 <geist> huh odd. yeah

13:28 <geist> i always thought fctnl was only for really different things but not something that would cross functionality

13:28 <geist> maybe truncate() came later?

13:29 <mjg> fcntl duplicates a lot of syscalls, i don't know what came first

13:30 <mjg> i mean respective F_ vs a syscall

13:30 <mjg> there is dup, advisory locking

13:35 Persephone has quit [Ping timeout: 264 seconds]

13:38 netbsduser has joined #osdev

13:44 <lkurusa> https://lpc.events/event/16/contributions/1213/attachments/1012/1945/io-uring-spawn.pdf

13:44 <bslsk05> lpc.events <no title>

13:45 Persephone has joined #osdev

13:59 Persephone has quit [Ping timeout: 244 seconds]

14:00 ElementW has joined #osdev

14:02 Persephone has joined #osdev

14:11 <geist> hmm, i dont see the F_SETSIZE on darwin here

14:11 <geist> or at least macosx

14:13 <geist> and there it is

14:14 <geist> ah the OSX man page gives some context:

14:14 <geist> "F_SETSIZE Deprecated. In previous releases, this would allow a process with root privileges to truncate a file without zeroing space. For security reasons, this operation is no longer supported and will instead truncate the file in the same manner as truncate(2)."

14:15 <geist> so it was for file extend without zeroing. presumably at the time someone thought that was an optimization if the the process was just about to overwrite it completely

14:16 <geist> so actually it's silly but reasonable. there's usually reasons like this exist

14:17 Persephone has quit [Ping timeout: 268 seconds]

14:19 <mjg> that's part of the problem though

14:19 <mjg> it sure as hell can't be inferred from the name, can it

14:20 <mjg> they should have added flags to truncate, which woudl also make it extendable

14:24 Persephone has joined #osdev

14:27 Matt|home has joined #osdev

14:34 heat has joined #osdev

14:35 <heat> re: avoiding a full stat

14:35 <heat> really? how much slower is it?

14:35 <heat> at least on my system all the stat stats come from the in-memory inode itself, it's just as fast as a random F_GETSIZE

14:36 <heat> (you can also just use the traditional two-lseeks method to get it, but that's slow)

14:36 <zid> yea I'd figure stat was 'free'

14:36 <zid> i.e it ammortized into everything else as the inode came off the disk for you to read the file in the first place to seek it

14:36 wgrant has quit [Ping timeout: 244 seconds]

14:37 <geist> well that is why F_GETSIZE doesn't exist

14:37 <geist> that was part of the discussion: if F_SETSIZE exists why doesn't GETSIZE? well, the answer is F_SETSIZE was a weird special case that used to not do what truncate does

14:37 <geist> and thusit's only there for backwards compatibility

14:37 <geist> there was no reason for it to generically exist

14:38 <heat> most of fcntl is old garbo anyway

14:38 Persephone has quit [Ping timeout: 265 seconds]

14:38 <zid> aka osdev

14:38 <geist> i dont know. most of them seem to be for a specific purpose

14:38 <heat> F_DUPFD has dup, dup2, dup3

14:38 <geist> sure, but the darwin one at least has specical dup that *also* copies cloexec flags

14:39 <geist> not saying dup and fcntl shuoldn't overlap but most of the other ones seem to not have an overlap

14:39 <heat> linux also has that

14:39 <geist> most of this is a reason why if you have a generic syscall like that, add a options or flags field to it

14:39 <geist> we generally do that in fuchsia and it has helped immensenly

14:40 <heat> yea

14:40 <geist> ie, if starting over write it as dup(int infd, uint flags, out *outfd)

14:40 <heat> it's part of sane linux syscall API design right now

14:40 <geist> now you can implement all the dups and some ones you haven't thought about

14:40 <heat> int syscall(args..., uint32_t flags)

14:40 <geist> yah

14:41 <geist> there are a fair number of zircon syscalls now where 0 is the nly valid flags field, but that's for future expansion

14:41 <heat> they're also exclusively doing explicitly sized integers for every in-memory struct that goes to the kernel to avoid 32-bit compatibility crap

14:41 <heat> oh yeah that's a funny one

14:41 <heat> technically you can't return EINVAL for bad flags in open(2)

14:41 <geist> hmm, is that for a reason or just bad design?

14:42 <heat> bad design baby!

14:42 <geist> ooooh becayuse varargs

14:42 <geist> also protip: please dont varargs your syscalls

14:42 <heat> oh no varargs has nothing to do with that

14:42 <geist> ah

14:42 <heat> open("/bin/bash", O_RDONLY | O_RANDOM_FLAG_THAT_DOESNT_EXIST) doesn't fail

14:43 Persephone has joined #osdev

14:43 <geist> probably some historical reason, like source level compatibility between unices where some flags aren't there

14:43 <heat> yea

14:43 <geist> but that's a bad idea, probably born out of necessity than anything else

14:43 dude12312414 has joined #osdev

14:44 <heat> also a funny one: there's a random 10 year old linux kernel release where O_CLOEXEC wasn't respected

14:44 <heat> so musl, out of compatibility's sake, always does open() + fcntl(F_SETFL, O_CLOEXEC) for O_CLOEXEC opens

14:44 <geist> yay multithreaded

14:45 <heat> not only can you accidentally leak fds but you need to confirm your O_CLOEXEC choice with an extra syscall, yay!

14:45 <heat> and this is born out of stubbornness because the musl people could just, erm, look at the kernel version to figure this stuff out

14:46 <mjg> stat is far from free

14:46 <geist> so here's a question for ya: on arm64 and x86-64 when you're running a 32bit user space on top of a 48 bit address space (ie, a 64bit kernel) what happens when the process writes to address 0xffff.ffff+?

14:46 <mjg> that's copying over 128 bytes of data to grab 8

14:46 <geist> ie, wraps around

14:47 <heat> I would say "sanely, wraps around"

14:47 <mjg> in fact several years back linux was making some moves to make reading inode size dirt cheap specifically because it was needed a lot

14:47 <mjg> and a full blown stat was fucking terrible

14:47 <heat> but insanely page faults is also something I could assume

14:47 <zid> isn't that different between amd and intel

14:47 <geist> never thought about it much, but on x86 machine you do end up blowing at least 1 extra page table to run a 32bit process on a 64bit kernel

14:48 <geist> since you still have to set up a 64bit aspace but then only 0-4GB is ever accessed

14:48 <zid> or was that the 1M limit that's different

14:48 <mjg> also did i mnetion it is easy to implement size thing without taking the inode lock?

14:48 <heat> yeah sure

14:48 <heat> just read it

14:48 <heat> ez

14:49 <geist> well, depends on the memory barriers, but as long as the upader properly barriers, then it should be generally safe indeed

14:49 wgrant has joined #osdev

14:50 <geist> there's a few places in zircon where we read outside of a lock a thing that's updated in a lock, and there's usually some hand wrangling and extra commments to make sure thats cool in the gang

14:50 <mjg> to get some perspective, a fstat syscall (as in no path lookup, just straight to stat) is about 9 mln/s. getpid is over 20 mln

14:50 <heat> that's pretty fast

14:50 <geist> hmm, what is mln?

14:50 <heat> million

14:50 <mjg> milion

14:50 <geist> million what?

14:50 <mjg> ops/s?

14:50 <geist> i dunno.

14:51 <mjg> so a fcntl(fd, F_GETSIZE) would be at half the price of fstat(fd, &statbuf);

14:51 <geist> oh oh i see, you mean 'mln' == million

14:51 <mjg> and i'm not even talking about all the cases where the kernel grabs the size internally

14:51 <heat> so

14:51 <geist> got it, just never seen that particular shortening. i was thinking 'million lines' or something

14:51 <mjg> so ye, i do think F_GETSIZE is definitely worth it

14:51 <heat> why are you assuming fcntl(fd, F_GETSIZE) is as expensive as getpid()?

14:52 <mjg> it should be in the same ballpar

14:52 <mjg> k

14:52 <mjg> provided there is no inode locking

14:52 <mjg> what kind of perf do you expect

14:52 <geist> yah i'd assumed there'd be no real difference. if it can return the value on the left hand side it'd avoid a user copy in the syscall itself

14:52 <heat> sys_getpid() is just a task access

14:52 <geist> and that might be more substantial

14:52 <mjg> heat: well ye it is some more memory references

14:52 <mjg> so it will be slower than getpid, but definitely still way faster than fstat

14:53 <mjg> fstat has strictly more work to do

14:53 <geist> ie if you had some sort of `off_t get_file_size(int fd);` idealized syscall it would avoid any memory references, vs fstat

14:53 <mjg> and i mean a lot

14:53 <heat> it does have more work to do but it's trivial and cpus are fast

14:53 <mjg> which is exdtra crappy on contemporary boxes with SMAP

14:53 <mjg> ... since turning it off to do copyout is turbo expensive

14:53 <mjg> which happens to be avoided in this case

14:54 <heat> is it?

14:54 <geist> but not in a hypothetical fcntl call right?

14:54 <mjg> it is

14:54 <geist> because that can't return on the left side

14:54 <mjg> heat: for one it is serializing

14:54 <heat> oh yeah also, a problem: fcntl returns int

14:54 <mjg> got ya covered

14:54 <mjg> freebsd can return *two* ints in a syscall

14:54 xenos1984 has quit [Read error: Connection reset by peer]

14:55 <mjg> :)

14:55 <heat> bonkers

14:55 <geist> yes but then it's not fcntl

14:55 <geist> honestly a more interesting syscall that is missing is 'where is my file position'

14:55 <mjg> general point being, gathering full info for a stat call is way more work and often requires locking

14:55 <geist> since the canonical solution is to seek to the end and then ftell

14:55 <heat> geist, that's covered by lseek

14:55 <mjg> ye

14:55 <geist> does it?

14:56 <heat> lseek(fd, 0, SEEK_CUR) returns the current seek and adjusts it by 0

14:56 <geist> oh huh. yeah okay

14:56 <geist> i think i jnew that but forgot. got it

14:57 Persephone has quit [Ping timeout: 244 seconds]

14:57 <mjg> if anything i'm surprised by opposition to F_GETSIZE as a concept

14:57 <mjg> wanting *just* size is pretty standard before you mmap

14:57 <mjg> and paying for the entire stat buf to get there is definitely wasteful

14:58 <geist> i mean sure, but i dont think adding a new call just to optimize is necessarily worth it

14:58 <mjg> but now that i wrote it, a "just map the whole ffucking file" flag to mmap would be great

14:58 <geist> based on that same premise all of the rest of all of the fields in fstat should get a syscall, etc

14:58 <geist> and then isn't it intrinsically implementation defined as to whether or not it helps?

14:58 <mjg> if a field is very frequently asked for, while the rest is ignored, then yes

14:58 <mjg> so far that's only size

14:58 <geist> what about time stamp? that might be extremely helpful (say for git, etc) to read in a single syscall

14:59 <geist> without a full stat

14:59 <mjg> i don't know if that is of practical use

14:59 <mjg> if one was to survey real world uses

14:59 Persephone has joined #osdev

14:59 <geist> i dunno, millions of time stamps for build systems/s is pretty important

14:59 <mjg> it may turn out a quarter of actual stat buf is used inp ractice for 99% of consumers

14:59 <mjg> and it can be populated without locks

14:59 <mjg> that i would call a win worth pursuing for sure

15:00 <mjg> in the meantime, i know for a fact doing fstat just to get the szie is super common

15:00 <geist> yah. mostly just playing devils advocate

15:00 rpnx has joined #osdev

15:00 <geist> you're right, but i'd say a holistic view of the whole world is speed isn't always the most important thing

15:00 <geist> duplicated apis have cost, especially when they're around forever, etc etc

15:01 <mjg> see, so happens this functionality *cleans it up* in the kernel

15:01 <mjg> namely there are several caess in there which do vop_getattr just to get the size

15:01 <geist> not really, because there's now two completely different paths

15:01 <mjg> i'm deduping this shit in preparation to make it sensible

15:01 <geist> since you can't get rid of stat

15:02 <mjg> i'm saying the current code will end up calling vn_getsize and be shorter for it

15:02 * geist nods

15:03 <mjg> and there are several places

15:03 <mjg> but ye, at the bottom there will be a new routine

15:03 <mjg> which i don't consider ot be a problem

15:03 <geist> (which OS are you talking about here anyway?)

15:03 <heat> freebsd

15:03 <geist> i forget you're primarily freebsd right?

15:03 <mjg> freebsd, as usual :p

15:04 <geist> right, so there's also that. if one of the big kernels addds it you're effectively forcing the other ones to as well

15:04 <mjg> i have not checked on linux on that front in quite a while

15:04 <geist> so maybe it's a win for freebsd but not linux or the other way around

15:04 <mjg> ye part of why i asked previously if *any os* already has it, i know linux does not

15:04 <mjg> instead shitty games get played with lseek

15:04 <mjg> by people who try to avoid fstat

15:05 <heat> oh yeah i_size is funny

15:05 <heat> in linux, it's just a 64-bit read

15:05 <heat> no barriers no nothing

15:05 <heat> no locks either

15:05 <mjg> just atomic load

15:05 <heat> no

15:05 <mjg> quite a win if you ask me

15:05 <heat> just a load

15:05 <heat> https://elixir.bootlin.com/linux/latest/source/include/linux/fs.h#L849

15:05 <mjg> you mean READ_ONCE?

15:05 <bslsk05> elixir.bootlin.com: fs.h - include/linux/fs.h - Linux source code (v5.19.10) - Bootlin

15:05 <geist> depends on how it's written

15:06 <heat> return inode->i_size;

15:06 <heat> 32-bit does crazy shit with seqcounts to make sure things are updated atomically

15:06 <mjg> wut

15:06 <mjg> that's stupid

15:06 <mjg> i suspect they just did not clean it up yet

15:06 <geist> hang on, it also says below (in i_size_write()) that it needs locking

15:07 <geist> so that lines up with the general rules i'd expect (on 64bit at least)

15:07 <heat> it requires locking because of 32-bit

15:07 <geist> ie, the write has to either be atomic (witha builtin barrier) or under a lock, which will barrier when the lock is released

15:07 <mjg> they say the 32 bit needs locking to not fuck up seqlock

15:07 <geist> so that means the read on a weakly ordered cpu siumuiltaneously can read something stale up to the point at which the lock is released

15:07 <geist> which works

15:07 <mjg> i mean seqcount

15:08 <geist> as lomg as there's some sort of guaranteed barrier on the writing side

15:08 <mjg> for 64 bit barriers are of no significance, you only need to make sure to store the value in one write

15:08 <mjg> which they are not doing

15:08 <geist> oh they are hella significance on weakly ordered machines

15:09 <geist> very very much a thing, but any reasonable lock has an implied barrier on acquire and release

15:09 <mjg> man

15:09 Iris_Persephone has joined #osdev

15:09 <mjg> 32-bit -- agreed

15:09 <mjg> 64-bit -- no

15:09 <geist> yes

15:09 <geist> 100% yes on 64bit

15:09 <mjg> what exactly woud that barrier synchro against?

15:09 <mjg> it's literally a load

15:09 <geist> a barrier on the *write*

15:10 <mjg> but if the reader does not has any locks whatsoever

15:10 <geist> ie, you write it witout a barrier, it doesn't 'appear' to the reader until you do

15:10 <mjg> and only reads the size

15:10 <mjg> it is already 100% unsychnronized against writers

15:10 <mjg> no matter how many locks you plop in there

15:10 <mjg> weak cpu will *eventually* flush it, worst case when it unlocks the inode

15:10 <geist> yes, but i'm talking about whether or not yuo get a stale value *after* the write has occurred on another cpu

15:10 <geist> *thats what i'm saying*

15:10 <geist> we're violently agreeing

15:10 <mjg> i agree you can get a stale value

15:11 <mjg> but i'm saying that's not a problem here

15:11 * geist head desks

15:11 <mjg> you may have as well did the read just prior the write

15:11 <geist> i'm saying the exactly same thing

15:11 <mjg> so there is no use for barriers

15:11 <mjg> for this purpose

15:11 <geist> *because the lock has a barrier*

15:11 Persephone has quit [Ping timeout: 252 seconds]

15:11 <mjg> well let me restate. i_size_write routine in 64-bit variant does not need to post any fences

15:12 <geist> okay. i'm tired of violently agreeing

15:12 <mjg> which i understood as the point of cnention

15:12 <geist> yes.

15:12 <geist> no there's no contention, there never was

15:12 <mjg> ok, scratch it

15:12 <mjg> they defo need atomic store/loads there though

15:12 <geist> i'm saying 'yeah that's why this works' and then you're like 'no, it works because <same thing stated differently>'

15:12 <geist> anyway. fun times.

15:13 <mjg> so what about that mmap

15:13 <mjg> a flag to just map the whole file without providing explicit size

15:13 <mjg> i guess it runs into a problem if you want to munmap by hand later

15:13 xenos1984 has joined #osdev

15:13 <geist> right, i think that's te main issue

15:13 <mjg> would defo eliminate my usecase for F_GETSIZE

15:13 <geist> you dont get a notification of how big the mapping is

15:14 Iris_Persephone has quit [Ping timeout: 264 seconds]

15:14 <mjg> if mmap was returning a token of sort instead just an address this would be a non-issue

15:14 <mjg> damn you unix

15:14 <mjg> well token, handle, whatever

15:16 <geist> yah agreed, this is also an inconsistent api we have in zircon because of the need to try to be posixy

15:16 <geist> would have liked to have returned a handle to a new mapping, but since that's so incompatible with posixy apis it's too difficult to do

15:16 <mjg> perhaps a "whack the mapping starting at X" would be ok enough

15:16 <mjg> if usersapce was fucking around with remapping that's their problem

15:16 <mjg> it only is guaranteed to work if they did not

15:17 <geist> the real barrier to handle based mappings is the posixy ability to protect() or unmap() in the middle of an existing mapping

15:17 <mjg> hence the above comment

15:17 <geist> that at worst case takes one mapping and turns it into 3

15:17 <mjg> 17:16 < mjg> it only is guaranteed to work if they did not

15:17 <geist> yeah i know.

15:17 <mjg> if they fuck with it, the syscall has undefined behavior

15:17 <mjg> personally i would unlink the binary

15:17 <mjg> :S

15:18 <geist> anyway we had to in zircon follow the posix model of making mappings be mostly a hidden object in the kernel, and let operations be range based

15:18 <geist> for better or worse

15:18 <geist> at best it's inconsistent with the rest of the handle based model that zircon has

15:20 <mjg> ye the more i look at optimizing the more i just don't like unix

15:21 Iris_Persephone has joined #osdev

15:21 <mjg> you would think all the slow hw they had would make for slim to the point interfaces with great room for optimization

15:21 <mjg> but it's the opposite

15:21 <mjg> they slam syscalls like they are free

15:22 <geist> indeed. a holistic view of these sort of things is to try to get userspace to operate smartly rather than just optimize what they do

15:22 <geist> though both are the case, sometimes stepping back and trying to point them in a better direction is the right solution

15:22 <geist> OTOH, in the posixy world i think that particular ship sailed, for at least most of the meaningful direction changes yuo could make

15:23 <mjg> well see the mmap flag idea

15:23 <geist> or at least things like ioring or whatnot are probably the only real meaningful style major changes you could do

15:23 <mjg> instant bummer

15:23 <geist> yah, well it's all bummers if you look at it that way :)

15:23 <mjg> i'm polish, so there is not much choice

15:23 <geist> for every design decisions there are always downsides. that's a downside of the 'mmap doesn't return info about the mapping' choise that someone made like 30 years ago

15:24 <mjg> there is serious insanity in practice concerning credential management in the kernel

15:24 <geist> or if there was a sane api for 'give me info about the thing mapped <here>'

15:24 <mjg> there are workloads which keep spawning processes and setgid, setuid and so on

15:24 <mjg> all systems i know of handles this by allocating new creds from scratch

15:24 <geist> vs walking through /proc/self/smaps or whatnot like linux fols robably would say to do

15:25 <mjg> you end up with a fuckton of allocs for common idioms

15:26 <mjg> hmmmm

15:26 <mjg> how about an extra flag: unmap when the is closed

15:26 <mjg> the fd

15:26 <clever> ooop, smaps, thats a new one!

15:26 <clever> ooo*

15:26 <mjg> oh heh i did not know about /proc/self/smaps_rollup

15:26 <clever> yet its been there since 3.8 at least, how have i not seen it before?

15:27 <heat> aw fuck

15:27 <clever> ah, that one is absent on 3.8

15:27 <geist> also /proc/self/maps, which is a more terse version

15:27 <heat> i was late with /proc/self/maps

15:27 <heat> it was going to be so funny

15:27 <clever> ive known of maps for ages

15:27 <mjg> oh environ

15:27 <mjg> this reminds me of this linux "philosophy"

15:28 <mjg> you know the trick to "setproctitle" where you move your args and env out of the way

15:28 <heat> unix philosophy everything is a file baby

15:28 <geist> yah slamming everything in /proc is at once really frustrating, and actually very nice for general user space hackery

15:28 <mjg> and then plop your custom stuff in there

15:28 <mjg> one of my fumdanetal problems with linux

15:28 <geist> i do love how you can load up linux on a machine and then with the shell basically get all of the info you'd ever want out of it

15:28 <mjg> someone comes up with a stupid hack, people copy paste it and that's way to go now

15:28 <clever> heat: bash takes that one step further, you can redirect from /dev/tcp/ip/port

15:28 <geist> i dont know if it's a *good* idea, but it's pretty neat nonetheless

15:28 <clever> i was confused when i first saw that, because ls claims it doesnt exist

15:29 <heat> i know

15:29 <heat> it's bash bs

15:29 <geist> what sucks is accessing stuff programatically. i like the BSD sysctl style api for certain things

15:29 <geist> where you just want a number

15:29 <mjg> i find the free form text files coming out of the kernel to be a problem

15:30 <mjg> ultimatey you want this in json or some other serialisation non sensitive to whitespace

15:30 <geist> excactly. it's nice for shell users, bad for programs

15:30 <heat> and thats why bsd isn't true unix

15:30 <mjg> and preferably not want the kernel to do it

15:30 <clever> there is also a trick i found for environ, xargs -0 -n1 echo < /proc/15605/environ

15:30 <clever> xargs takes \0 seperated strings, and passes them to echo, 1 at a time

15:30 <geist> yah xargs -0 is super powerful

15:30 <mjg> xargs -0 rox

15:31 <clever> best paired with `find -print0`

15:31 <mjg> but also note there is no guarantee this is representative of envirnoment used by the proc at hand

15:31 <heat> yea

15:31 <mjg> if glibc moved it around at best you see what it started with

15:31 <heat> hrm

15:31 <heat> does it look at the user stack?

15:31 <heat> I assumed it just saved it at exec time

15:31 <mjg> it is saved at exec time

15:31 <mjg> and then it just blindly derefs the area

15:32 <geist> i assume it's saved at exec time. thsi is what was initially passed it via whatever mechanism linux does

15:32 <mjg> if env moved, tough

15:32 <geist> possibly the user process literalyl reads its environment via this file?

15:32 <mjg> which is part of my issue with all this shit

15:32 <geist> (probably not for speed purposes)

15:32 <clever> another weird bit, is how a process can modify its own argv, to censor out passwords in `ps aux`

15:32 <mjg> i'm unaware of any real world code doing something liike

15:32 <clever> that implies /proc is peeking into the stack of another proc

15:32 <mjg> clever: except that's too late

15:33 <clever> yeah

15:33 <mjg> old unix security ideas

15:33 <clever> ive also seen that used less for security

15:33 <clever> worker threads reporting their status and client

15:33 <mjg> postgres is doing it

15:33 <mjg> this is a legit usecase

15:33 <clever> yep

15:33 <mjg> it was made sensible on freebsd: there was a func named setproctitle, inititally i did not perform for shit

15:33 <mjg> always making a syscall

15:33 <clever> postgres 1346525 0.0 0.1 294808 18192 ? Ss Sep21 0:02 postgres: hydra hydra [local] idle

15:34 <mjg> then it was patched to "i'm going to store my title at this location"

15:34 <geist> huh /proc/self/stack doesn't have what i thought it would

15:34 <geist> unclear precisely what it holds

15:34 <mjg> it's the kernel stack

15:34 <mjg> check something blocked there

15:34 <geist> hmm, except i just did a hexdump -C on it and it was just a bunch of strings

15:34 <mjg> oh that's what you mean

15:34 <mjg> it's the backtrace

15:34 <mjg> from the kernel

15:34 <clever> yeah, the kernel walked its own stack, then did symbol lookups on it

15:34 <geist> oh oh

15:35 <geist> oh heh yeah. ksys_read... etc

15:35 <clever> [<0>] proc_pid_stack+0xa7/0x120

15:35 Iris_Persephone has quit [Ping timeout: 265 seconds]

15:35 <clever> but the offset within the function is present

15:35 <clever> it seems to also be censoring the actual addresses out, for kaslr reasons?

15:35 <geist> i figured it was like /proc/self/mem and held some sort of map of the user stack for the thread

15:35 <clever> so you can then convert that to a line#, if you had debug info

15:36 <clever> another one ive used is /proc/self/pagemap

15:36 <mjg> geist: right, they really should prefix everything u or k

15:36 <clever> on a 64bit system, its a uint64_t[] array

15:36 <clever> one slot for every page of virtual memory

15:36 <mjg> but another ship which has sailed

15:36 <clever> essentially giving you read-only access to the leaf nodes in the paging table

15:37 <clever> but much to my anoyance, if you mmap MMIO, it shows up as 0 in that table

15:37 <clever> i spent months trying to figure out why mmio didnt work in one situation, only to discover it was the hardware

15:37 <heat> that's probably because mmio mappings don't have vm state in linux

15:38 <heat> they're just straight up mapped in the page tables

15:38 <mjg> clever: :)

15:39 <clever> in my case, the AXI bus has a "user" vs "kernel" bit on every memory transfer

15:39 <clever> and the hardware was configured to just deny MMIO from anything userland

15:39 <clever> so no amount of software debugging could fix it, lol

15:40 <clever> geist is the one that ultimately revealed that axi is even capable of doing that

15:40 <mjg> you could have ran into a bug which flipped it tho

15:40 <clever> mjg: it was a bloody typo, in the official headers, for the enum that configures the hw

15:40 <mjg> i knew who a guy writing bios for some board

15:40 <mjg> they kept running into weird crashes and they knew it's their fault

15:40 <mjg> but they blamed it on ram manufacturer

15:40 <mjg> and got away with it

15:40 <heat> damn right

15:41 <clever> mjg: https://github.com/librerpi/rpi-open-firmware/blob/master/common/broadcom/bcm2708_chip/arm_control.h#L44-L46

15:41 <bslsk05> github.com: rpi-open-firmware/arm_control.h at master · librerpi/rpi-open-firmware · GitHub

15:41 <clever> mjg: code A here, blocks access from userland

15:41 Iris_Persephone has joined #osdev

15:41 <mjg> wait, so that *remains* broken?

15:42 <clever> i didnt fix the .h file at the time

15:42 <mjg> you are linking top of the tree

15:42 <heat> mjg, it doesn't seem to blindly deref the stack

15:42 <clever> https://github.com/librerpi/rpi-open-firmware/blob/master/firmware/drivers/BCM2708ArmControl.cc#L236

15:42 <bslsk05> github.com: rpi-open-firmware/BCM2708ArmControl.cc at master · librerpi/rpi-open-firmware · GitHub

15:42 <clever> mjg: i instead fixed it by |'ing in the mislabeled flag

15:43 <clever> and then never got around to confirming what every bit does

15:43 <mjg> heat: environ support? last time i was patching that code i'm confident it was just accessing the pre-saved area

15:43 <clever> the code now works, but the header is still wrong

15:43 <mjg> heat: i mean it makes sure to not crash and wahtnot

15:43 <heat> yeah

15:43 <mjg> heat: but it does not account the env moving around, should it happen

15:44 <heat> right

15:44 <heat> but it Just Works(tm)

15:44 <heat> sometimes

15:44 <mjg> works for the default state of not changing env vars

15:44 <mjg> but good luck debugging when someone did

15:44 <mjg> and it does not show up there

15:44 <mjg> ... and you are confident it would

15:45 <heat> also someone on the interwebs suggested this: strings /proc/self/environ

15:45 <mjg> [no i did not happen to me :>]

15:45 <heat> instead of your shitty xargs -0 ... workarounds

15:45 <mjg> it's not mine bro

15:45 <heat> well, someone's

15:46 <heat> oh shit

15:46 <clever> heat: but strings has some size limit, and may not detect A=1

15:46 <mjg> you just don't like xargs -0

15:47 <heat> I just re-found out about /proc/self/wchan

15:47 <heat> clever, strings -n 2

15:47 <mjg> you reminded me of a long standing problem, wonder if they fixed it

15:48 <mjg> you mmap something backed by nfs

15:48 <mjg> nfs server dies

15:48 <mjg> you take a fault on that mapping

15:48 <mjg> tools like ps and top will start hanging in an uninterruptible state trying to grab your mmap semaphore

15:48 <heat> lol

15:48 <mjg> as in they don't show squat and you can't kill them

15:49 <clever> mjg: ive killed my server before, just by coming home, lol

15:49 <clever> cacti was running `df -h` in a cronjob, to graph my free space

15:49 <clever> the laptop was an nfs server, mounted into that box

15:49 <mjg> so a typical sysadmin is 100% fucked here

15:49 <clever> when i take the laptop on a trip, `df -h` just hangs, and 1000's of them pile up, and slip into a swap coma

15:49 <clever> when the laptop returns home, every bloody `df -h` wakes up at once

15:50 <clever> and they all fight over the ram

15:50 <clever> system grinds to a total halt :P

15:50 <mjg> 17:49 < clever> the laptop was an nfs server, mounted into that box

15:50 <mjg> what in thea ctual fuck man

15:50 <mjg> are you a webdeloper in your dayjob

15:50 <clever> mjg: the solution to all of those problems, use the soft flag when you mount nfs

15:51 <clever> if you use the hard mount flag, then nfs errors block forever, until the server returns

15:51 <clever> but if you use soft, network errors result in the syscall giving an error

15:51 <clever> and things become killable

15:51 <heat> that's not true

15:52 <clever> has the man page lied again?

15:52 <heat> nfs waits are always killable afaik

15:52 <mjg> which waits

15:52 <heat> locks, and waits in wait_queues

15:52 <mjg> it was not all of them when i had to deal with this shit 5 years ago

15:52 <mjg> in fact it was quite routine to get a crashdump with a dead nfs server in dmesg

15:52 <heat> well, your example involves a lock acquired in a non-killable way

15:52 <mjg> and a bunch of unkillable processes fucking aroudn

15:53 <mjg> ye i'm saying there were cases where you would take a lock, go off cpu waiting for the nfs server

15:53 <mjg> and possibly other code in nfs would trip on it

15:53 <mjg> and then you are fucked

15:53 <mjg> it is plausbiel this is fixed now

15:53 <heat> well sure, that's possible

15:53 <heat> and a harder problem

15:54 <heat> mmap_sem aren't mutex_lock_killable'd

15:54 <mjg> it was not just mmap sem

15:54 <heat> (well, the rwsem)

15:54 <mjg> albeit that was the main culprit -- monitoring would stop working :->

15:56 Iris_Persephone has quit [Ping timeout: 264 seconds]

16:00 Iris_Persephone has joined #osdev

16:01 <clever> getting a bit more back on topic, ive been working on an ext4 driver lately, and i think the only major feature i'm currently missing, is the ability to recursively follow the extent tree

16:02 <clever> but part of the problem in testing, is that i need to create a complex extent tree

16:02 frkzoid has joined #osdev

16:02 <clever> and due to it being extent based, its not based on filesize, but fragmentation

16:03 <clever> my first thought, is to just set a torrent client loose on it

16:04 <clever> reducing the number of blocks per group may also help, since the block group metadata is sitting between each group

16:04 <clever> so having really tiny groups will force a max size onto fragments

16:14 Iris_Persephone has quit [Ping timeout: 265 seconds]

16:17 Iris_Persephone has joined #osdev

16:27 xenos1984 has quit [Ping timeout: 260 seconds]

16:28 xenos1984 has joined #osdev

16:32 Iris_Persephone has quit [Ping timeout: 250 seconds]

16:34 kof123 has quit [Ping timeout: 268 seconds]

16:34 Iris_Persephone has joined #osdev

16:49 Iris_Persephone has quit [Ping timeout: 264 seconds]

16:51 Iris_Persephone has joined #osdev

16:53 <geist> somewhere i wrote an app years ago that generates fragmented files

16:53 <geist> basically creates a crap ton of files and then keeps resizing them such that their blocks end up probably overlapping a lot

16:53 <geist> thoughyou need to get pretty close to full disk utilitzation for it to work

16:56 <clever> i can see how an FS would space the new fragments out nicely, so they dont collide while growing

16:57 <geist> https://pastebin.com/9ZzbaD3V there

16:57 <bslsk05> pastebin.com: #include <errno.h>#include <fcntl.h>#include <limits.h>#include <stdio.h> - Pastebin.com

16:57 <geist> i have no warranty on it, and i wrote it like 20 years ago, on BeOS

16:57 <geist> so might need some fiddling

16:58 pretty_dumm_guy has joined #osdev

16:58 <geist> and probably assumes files aren't sparse

16:58 <clever> yeah, sparse would only be fought off with writing actual data

16:59 <geist> or using whatever fallocate() style call linux has. i dunno precisely what it bottoms out in

16:59 <geist> could probably impleent this as a shell script too

16:59 <clever> it does at least compile on linux

16:59 <geist> anyway i remember it working pretty well

17:00 <geist> though like i said i think it only really works well if you size it such that the number of working files is pretty close to the full size of the disk

17:00 <geist> otherwise if all of them are just playing with 5% of the space, the fs impl may nicely space them out

17:01 <clever> i can just make a 512mb or 128mb disk image

17:01 <geist> anyway, see how that works. `filefrag -v` tells you what it ended up with

17:01 <geist> can fiddle with it and see what tunables work for you

17:03 <clever> yep

17:05 Iris_Persephone has quit [Ping timeout: 265 seconds]

17:07 bauen1 has quit [Ping timeout: 252 seconds]

17:07 bauen1 has joined #osdev

17:07 Iris_Persephone has joined #osdev

17:21 Iris_Persephone has quit [Ping timeout: 265 seconds]

17:23 Iris_Persephone has joined #osdev

17:25 Vercas6 has quit [Quit: Ping timeout (120 seconds)]

17:26 Vercas6 has joined #osdev

17:41 cross has joined #osdev

17:41 <mjg> you know, for the biggest system in the world and so much money behind it

17:41 <mjg> linux still manages to surprise me by how bad things can get

17:41 <mjg> in this episode i tried: perf record --all-kernel --call-graph dwarf

17:42 <mjg> perf report is losing its shit though

17:42 <mjg> ubuntu 20

17:46 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

17:54 <heat> geist, the "full disk utilization" thing is sadge :/

17:55 <heat> I would really love a tool to generate some stupidly fragmented files that doesn't take that

17:55 <heat> also other stupidly stupid conditions like a directory with 100k files

17:56 <heat> also downright broken filesystems but that's more on the realm of fuzzers so :|

17:56 <geist> well you could fallocate a big file that chews up say 90% of the disk, then run this to generate fragmented files in the space outside of it

17:57 <geist> it's totally intended to be for small test disk images though

17:57 <geist> and/or in an era when a 2GB disk was big

17:57 <heat> I'm fairly sure I've seen syzkaller craft a broken filesystem and mount it as a loop device

17:57 <heat> also in my filesystem wishlist: xfstests

18:01 <heat> i remember i found a .c that had some sort of simple filesystem unit tests for unixish systems

18:01 <heat> wonder what that was

18:01 frkzoid has quit [Ping timeout: 244 seconds]

18:02 <geist> yah

18:03 <heat> ah yes, fsx.c

18:03 freakazoid332 has joined #osdev

18:04 <heat> https://github.com/apple/fstools/blob/master/src/fsx/fsx.c

18:04 <bslsk05> github.com: fstools/fsx.c at master · apple/fstools · GitHub

18:05 <heat> they also an fstorture tool in there

18:05 <heat> I need those badly

18:08 vdamewood has joined #osdev

18:36 dude12312414 has quit [Remote host closed the connection]

18:38 dude12312414 has joined #osdev

18:49 gog has joined #osdev

18:50 <dh`> heat, you know about Impressions?

18:51 <dh`> it is a widget for generating sort-of-realistic fs images

18:51 <dh`> and while it has a bunch of issues, it is at least sort of usable

18:51 <heat> nope, never heard of it

18:51 <heat> what is "sort-of-realistic" here?

18:51 <dh`> it is a research artefact from some years back

18:52 <mjg> you reminded me of an AI which generates human faces

18:52 <mjg> https://thispersondoesnotexist.com/

18:52 <bslsk05> thispersondoesnotexist.com: This Person Does Not Exist

18:52 <dh`> https://www.usenix.org/events/fast09/tech/full_papers/agrawal/agrawal.pdf

18:52 <dh`> I'm not sure what the state of the code is at this point; I had some patches and I'm not sure if they ever got rolled in

18:53 <gog> hi

18:53 <heat> henlo

18:53 <dh`> sort-of-realistic means distributions of sizes and filename extensions and whatnot that are supposed to correspond to the real world

18:54 <zid> I just invented a filesystem

18:54 <zid> It's a B-tree. The end.

18:54 <zid> If your filename doesn't fit the invariants of a B-tree when you try to creat() it, it fails.

18:55 <heat> dh`, im not particularly interested in realistic workloads

18:55 <heat> I can just mkfs a filesystem from a sysroot and I can something "realistic" I think

18:55 <heat> s/workloads/images/

18:56 <dh`> well, whether or not you care about that, it is a tool for randomly populating images

18:57 <gog> stop inventing filesystems

18:57 <zid> It's the ultimate filesystem though

18:59 <gog> the null filesystem

18:59 <gog> aways fails successfully

19:00 <zid> All filesystems are a superset of that one right

19:00 <gog> basically

19:00 <zid> {}, {{},{}}, etc

19:11 <heat> do page caches usually optimize for file holes?

19:11 <geist> directories are a mistake

19:11 <heat> as in mapping a zero page instead of an actual page

19:11 <geist> let the mount points be your directories

19:11 <heat> or not mapping anything at all

19:11 <geist> yes

19:12 <heat> i don't have that

19:12 <heat> :|

19:12 <geist> the zero page optimization? it's eventually worth it, though i guess it's not a deal breaker up front

19:13 <geist> you can alloc a page and fill it with zeros on demand

19:13 <heat> i have the zero page optimization, I just don't use it for the page cache (so, for inodes)

19:13 <geist> ah. probably only matters at map time. the page cache itself is probably unaware of it

19:13 <geist> it'd be simply a hole in the file though you might want some sort of sentinel, depends on how your page cache works

19:14 <geist> in the case of zircon we simply leave that as a hole in the vmo, whic is the default state anyway (all holes, no pages)

19:15 <geist> and at map time if you read fault on it it just maps in the zero page instead of the nonexistant page (unless the pager source has one, etc etc)

19:15 <geist> if you write fault on it its either a fresh new zero page or some pager behavior or a failyre (RO mapping, etc)

19:15 <heat> hummm

19:16 <heat> right now my vmos don't have that behavior

19:16 <heat> all read pages are sourced from whatever is backing the vmo

19:17 <heat> write is even more confusing

19:17 <heat> also, do your vmos' sizes need to be page aligned?

19:18 <mjg> zero page?

19:18 scaleww has joined #osdev

19:18 <mjg> i highly doubt it is really worth it

19:18 <heat> I'm currently requiring that because if I think of a vmo as a bag of pages, it doesn't make sense to have one with size 104, but rather 4096

19:18 <mjg> afair the linux folk were not sure either, but did not have enough info way or theo ther to whack it

19:18 <heat> so linux does that?

19:19 <mjg> yea

19:19 <mjg> it leads to retarded discussions sometimes

19:19 <mjg> i think i ranted about it on this very channel few weeks back

19:20 <mjg> people who don't deal with the problem domain get very bad ideas concerning memory management, with ideas like "calloc is free bro cause zero_page"

19:20 <mjg> glibc has an optimization where first calloc returning given address is not touching it

19:21 <mjg> then they read from it and get the zero page

19:22 <heat> well, I'm talking specifically about non-anon file mappings here

19:23 <heat> I'm already using the zero page on anon mappings, for better or worse

19:23 <heat> i imagine you could save some decent memory by representing file holes with zero_pages and only giving them actual backing when written to

19:24 <mjg> fair, i don't know if that's good or bad

19:25 <mjg> does this look off?

19:25 <heat> yes

19:25 <heat> or no

19:25 <heat> one of those

19:25 <mjg> now that i said it, it does, but im gonna paste

19:29 vdamewood has quit [Read error: Connection reset by peer]

19:31 <dh`> I would think if you have zerofill pages at all that using them for nonexistent file pages wouldn't be hard

19:31 <dh`> and therefore probably worthwhile

19:32 vdamewood has joined #osdev

19:40 scaleww has quit [Quit: Leaving]

19:48 freakazoid332 has quit [Read error: Connection reset by peer]

19:49 <Iris_Persephone> So, another newbie question: Is it worth trying to implement POSIX as closely as possible, or is it something that depends on your goal for the system?

19:50 <heat> latter

19:51 GeDaMo has quit [Quit: Physics -> Chemistry -> Biology -> Intelligence -> ???]

19:51 <heat> posix will let you have a useful system much quicker

19:51 <heat> going non-posix gives you the freedom to do whatever you want

19:51 <heat> you can theoretically always implement posix as a compatibility layer but that's always iffy

19:52 SpikeHeron has quit [Quit: WeeChat 3.6]

19:53 radens has joined #osdev

19:53 <Iris_Persephone> Yeah, makes sense

19:53 <radens> does lk build with clang? If so, how do I tell it to use a clang toolchain and not gcc?

19:53 <heat> no

19:53 <Iris_Persephone> I was a little worried about standing out from all the other POSIX systems, but in hindsight that is a little silly

19:53 <heat> that's a hard question but afaik right now the answer is no

19:54 <heat> radens, see https://github.com/littlekernel/lk/pull/322

19:54 <bslsk05> github.com: [build] make LK buildable with LLVM/Clang by pcc · Pull Request #322 · littlekernel/lk · GitHub

19:56 <geist> yeah it's a tough problem to generically solve. that PR for example totally relies on a very specific toolchain for a specific arch

19:56 <radens> thanks heat

19:56 <geist> so i dont see a good way to take it upstream

19:56 <geist> i talked to pcc about it a bit but haven't heard much of an answer

19:56 <geist> it's a good example of 'works for that person for their use case' sort of PR i tend to get into LK

19:57 <geist> but i think there's probably a more low level way to do it that involves starting from a generic solution fundamentally in the build system, instead of just hacking it into the ARM side of the build system

19:57 <geist> anyway, bbiab

20:02 SpikeHeron has joined #osdev

20:02 <radens> It would be nice if it built with llvm. It's annoying to need another gcc toolchain for each arch, when I have a perfectly good llvm toolchain which should do it all.

20:06 <mrvn> isn't there some wraper to make clang accept most gcc options?

20:07 <heat> yes

20:07 <heat> it's called clang

20:08 <mrvn> heat: that's not enough for lk

20:08 frkzoid has joined #osdev

20:10 <heat> it absolutely is

20:11 <heat> the lk PR I linked just makes a -Wno- option conditional on gcc (because LLVM doesn't have it I assume)

20:12 <mrvn> see

20:13 <heat> if you test if options exist a-la traditional kconfig or autoconf, you'll have no issues using clang or gcc

20:13 <mrvn> There are probably a buch more of those for other archs.

20:13 <heat> doing CC=clang ./configure && make CC=clang Just Works(tm)

20:14 <mrvn> I rather trust geist there that it's a "works for me" solution.

20:17 <j`ey> which is fair enough for PRs I guess

20:18 <heat> it's like you choose to ignore what I say

20:18 <heat> what geist said and what I'm saying aren't mutually exclusive

20:38 frkzoid has quit [Ping timeout: 244 seconds]

20:43 xenos1984 has quit [Ping timeout: 250 seconds]

20:58 xenos1984 has joined #osdev

21:13 <geist> well what i really mean is that PR doesn't work on anything but a specific LLVM build he has

21:13 <geist> i looked at it, and it wont build with any of my llvms

21:13 <geist> talked to him in chat and he has some custom llvm for android i think

21:13 <geist> the difference being the presence of or the lack of particular built in headers

21:14 <geist> it gets into the 'is this a generic clang build or is this one thats inended to be for linux' sort of problem all over again

21:14 <geist> yes, clang can target any triple, but the existing headrs may be specific to a particular one, etc

21:14 <heat> yup

21:15 <geist> so it's not just a drop in the bucket. the change i was thinking was the split 'generic code changes made in the codebase that get it to build with clang' and 'the build system stuff to switch it' which i think requires a bit deeper cut

21:15 <geist> and then what i had seen was this PR was totally incompatible with your 'link with the compiler, not the linker' PR for Reasons i forget

21:18 <heat> did you ever look at it again?

21:23 <mrvn> geist: and there I was expecting that freestanding is a standard that you could rely on, silly me.

21:23 <geist> heat: i have not

21:24 <geist> i took it as 'huh yeah i should also look at this too' but have not done so

21:24 <heat> mrvn, how do you generate and install headers for thousands of different target triplet combos?

21:25 <mrvn> heat: that's what /usr/include/triplet/ is for.

21:25 <heat> you do realize that clang would have to generate those right? for every combination it supports

21:25 <heat> which in the default case, is a boatload of them

21:25 <mrvn> it does generate them for every combination it supports. Just nor all at the same time.

21:26 <heat> it does not

21:26 <mrvn> If you want to build a compiler for everything then yes, you have to generate everything.

21:26 <mrvn> heat: whatever triplet you build clang for it will generate at least those headers.

21:27 <heat> yes

21:27 <mrvn> so it generates all of them, just not at the same time.

21:27 <heat> no, it generates a handful of them

21:27 <geist> anyway i think it's not too bad, but part of what i need is a local llvm build so i can CI this

21:28 <geist> i dont want it going in the tree if i cant build it locally

21:28 <mrvn> you do see the "not at the same time" part, right?

21:28 <geist> and i was stuck at 'can't build it locally' because of reasons

21:28 <geist> and that's where i dropped it and hadn't picked it up again

21:28 <heat> add llvm CI?

21:28 <geist> hmm?

21:28 <heat> add llvm to your ci

21:28 <geist> possibly. note this all takes time to implement

21:28 <geist> of which i did not spend.

21:29 <geist> ie, the PR is not just a freebie, which is why i didn't accept it

21:29 <geist> because it's a new can of worms i'd like to solve more generally for LK

21:29 <heat> oh wow I was wrong

21:29 <heat> clang installs a single set of headers

21:29 <heat> and they're supposedly portable

21:29 <geist> yes and thats possibly the answer, but i have to sort it out because it didn't just work with the llvm i threw at it

21:30 <geist> and i know very little about it, so it's a learning curve for me. ie, i want to know what i'm checking in

21:30 <geist> and until i can at least locally test it i dont want to take it into the tree

21:30 <heat> how do I check out a PR?

21:30 <geist> it's a branch

21:30 <geist> usually i just fetch it and then either merge it local or rebase it ontop of yours

21:31 <geist> i thik what i had decided is the build system actally needs 4 separate modes: gcc + ld as linker, gcc + gcc as linker, llvm + ld.bfd as linker, llvm + ld.llvm as linker'

21:32 <geist> and each are subtly different in at least a way the budli system needs to understand

21:32 <geist> not an insurmountable problem but one that takes a solid day or two of work to map out

21:32 <mrvn> what about gold?

21:32 * geist shrugs. maybe?

21:34 <geist> question is if it's useful enough to warrant support for (or if it needs any particular support)

21:34 <geist> also note that linkers other than binutils tend to trip over things in my linker scripts

21:35 <geist> partially my fault, partially theirs, so i'd be much more inclined to do a simple 'clang the compiler, nothing else' support for phase 1

21:35 <geist> ie, clang + binutils

21:35 <geist> which is why i was asking pcc to split it into separate CLs

22:01 Iris_Persephone has quit [Ping timeout: 264 seconds]

22:04 vdamewood has quit [Read error: Connection reset by peer]

22:05 vdamewood has joined #osdev

22:10 Iris_Persephone has joined #osdev

22:25 Iris_Persephone has quit [Ping timeout: 264 seconds]

22:28 <heat> i need a unix guru rn

22:29 Iris_Persephone has joined #osdev

22:29 <heat> let's imagine a dangling symlink named 'a'

22:29 <heat> why should faccess('a', ...) be valid if its dangling and I didn't tell it to not follow symlinks?

22:30 <heat> and opens seem to work

22:31 <heat> does path resolution just open the symlink if its dangling?

22:36 <heat> actually, hrm

22:36 <heat> I think I was misinterpreting the strace

22:52 <mjg> that should fail with ENOENT

22:55 <heat> yes

22:55 <heat> i was looking at it wrong

23:05 Iris_Persephone has quit [Ping timeout: 244 seconds]

23:09 Iris_Persephone has joined #osdev

23:10 [itchyjunk] has joined #osdev

23:20 Vercas6 has quit [Quit: Ping timeout (120 seconds)]

23:20 Iris_Persephone has quit [Remote host closed the connection]

23:21 Iris_Persephone has joined #osdev

23:21 Vercas6 has joined #osdev

23:31 isaacwoods has quit [Quit: WeeChat 3.6]

23:32 netbsduser has quit [Remote host closed the connection]

23:57 Matt|home has quit [Ping timeout: 248 seconds]