#osdev on 2021-06-28 — irc logs at libera.catirclogs.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:03 trufas has quit [Ping timeout: 258 seconds]

00:04 trufas has joined #osdev

00:18 <vai> doug16k: whats your project home website URL? if any? or github would be good, as well.. interested

00:20 <doug16k> https://github.com/doug65536/dgos

00:20 <bslsk05> doug65536/dgos - Operating System (13 forks/99 stargazers/GPL-3.0)

00:22 <doug16k> see also qemu-rom - it's a toy BIOS project that supports multiple cpus

00:22 <doug16k> it's kind of osdev

00:24 <clever> doug16k: the original goal of me joining #osdev, was to write a quick&dirty "bios" rom, that would go from realmode to 64bit paging, in as few steps as possible, for use exclusively in qemu

00:24 <clever> but i never got that working right

00:24 <doug16k> page table init fast enough for bochs :P https://github.com/doug65536/qemu-rom/blob/master/machine/x86/entry_arch.S#L257

00:24 <bslsk05> github.com: qemu-rom/entry_arch.S at master · doug65536/qemu-rom · GitHub

00:24 <doug16k> clever, my qemu-rom project does exactly that

00:24 <clever> i was also cheating, and had qemu generate the full paging table from the host side, so its already in the addr space, and you just set whatever cr it was

00:25 <clever> the original need for all of that is long gone though, so not much point in finishing it now

00:25 <doug16k> yeah, I considered putting the page tables in ROM. I want to map 16GB and only need 2MB pages, so it is not even close to fitting

00:25 <clever> my goal was to run a xen PV guest under plain qemu

00:26 <clever> i got surprisingly far, with an entirely userland based solution

00:26 tacco has quit []

00:26 <vai> doug16k: my bootloader is huge.. takes 4 kilos of static assembly compiled binary

00:27 <doug16k> vai, my bios bootloader is about 100KB

00:27 <vai> doug16k: so on all systems it loads up a single track full of boot sector content

00:27 <clever> https://github.com/cleverca22/xen-mock/blob/master/main.cpp#L132

00:27 <bslsk05> github.com: xen-mock/main.cpp at master · cleverca22/xen-mock · GitHub

00:27 <clever> doug16k: basically, i just loaded the guest (an elf file) kernel into ram, and blindly ran the entry point, with a hypercall page setup

00:27 <clever> and it worked surprisingly well, able to do hypercalls and print debug logging

00:28 <clever> the problem, is that the guest assumed a certain order for stack/heap/.text, and crashed horribly because of pthread putting the stack in the wrong place

00:28 <doug16k> vai, my bootloader can boot all 6 combinations of {bios|efi}-{disk|cd|lan}

00:29 <klange> > 7.7K Jun 17 23:34 cdrom/boot.sys

00:29 <doug16k> the elf code can't even tell it is on tftp

00:29 <klysm> doug16k, have you been interested in doing hypercalls?

00:30 <doug16k> I have paravirtualized EOI

00:30 <doug16k> can eoi by writing an int to regular ram, instead of vmexit

00:30 <klysm> end of interrupt

00:30 <doug16k> yeah

00:31 <clever> https://github.com/cleverca22/xen-mock/blob/master/main.cpp#L91-L116

00:31 <bslsk05> github.com: xen-mock/main.cpp at master · cleverca22/xen-mock · GitHub

00:31 <doug16k> telling LAPIC "ok, done that one, gimme next lower priority one"

00:31 gog has joined #osdev

00:31 <clever> klysm: for XEN, there is a "hypercall page" containing an array of 32 byte functions, each one for doing a different job, and the xen hypervisor pre-fills it with whatever opcode is best suited for the cpu (such as vmexit)

00:31 <clever> klysm: the guest then just blindly treats that as a set of functions, and calls it with the normal calling convention

00:32 <doug16k> EOI happens thousands of times per second

00:32 <doug16k> under heavy I/O

00:32 <clever> actually, it might be 128 byte functions

00:32 <doug16k> if you paravirtualize exactly one thing, make it the EOI

00:32 <clever> no, 32, misread this old code

00:33 <clever> https://github.com/torvalds/linux/blob/master/include/xen/interface/xen.h#L44-L84

00:33 <klysm> paravirtual means a contrast to hardware virtual, what else does paravirtual mean?

00:33 <bslsk05> github.com: linux/xen.h at master · torvalds/linux · GitHub

00:34 <doug16k> paravirtualizing means, making the guest aware of the host and do a special thing that the host provides to avoid an expensive vmexit trap

00:34 <clever> klysm: in the case of xen, a paravirtual VM, will lack a lot of legacy/real hw, no bios, no emulating an ata controller, everything must be done with proper virtualized drivers

00:35 <klysm> clever, then there must be a mixed environment, also

00:35 lg has quit [Quit: leaving]

00:35 <clever> klysm: xen still allows the PV ops on a non-pv based VM

00:35 lg has joined #osdev

00:35 <doug16k> normally, eoi would trap and it would exit, and kvm would do some work just figuring out what happened, then it would do some insignificant thing and resume the guest

00:35 <clever> so a guest can boot via the legacy stuff (such as windows), but then switch to using PV based drivers (once you install them, and the boot has progressed that far)

00:36 <doug16k> the paravirtualized eoi has you doing a single cmpxchg in the guest and kvm peeks at it some other time

00:36 <doug16k> no vmexit

00:36 <klysm> is vmexit an instruction?

00:36 <clever> doug16k: ahh, so the guest can finish servicing an irq, and return to running guest kernel/userland code, and not bother context switching back to the host?

00:36 <doug16k> no, vmexit is the cpu freaking out because the hypervisor configured the cpu to not allow that instruction in the guest

00:36 lg has quit [Client Quit]

00:37 <doug16k> it stops running the guest and the hypervisor gets control

00:37 lg has joined #osdev

00:37 <klysm> yeah I was thinking it was a reference to an instruction that was supposed to trap, from context

00:37 <doug16k> that instruction or operation or access or whatever

00:38 <klysm> next question is how do you choose the opcode that's supposed to trap?

00:38 <klysm> this would be for x86

00:38 <doug16k> there are bits in the vm state / registers where you set what traps and what is just allowed

00:39 <klysm> so does the vm state have a list of opcodes?

00:39 <doug16k> you can make it so they see the real MSRs and they are allowed to really change them, or you can set it to trap it

00:39 <doug16k> there is a list of things you can allow/trap

00:39 <klysm> the list of things you can allow/trap, is it in the manual?

00:39 <doug16k> yeah let me see what term they use

00:40 <doug16k> vol 3 entire chapter 27

00:41 <doug16k> 24.7 VM-EXIT CONTROL FIELDS

00:42 <doug16k> if you mean intel

00:43 <doug16k> AMD system programming manual appendix B lists a pile of bitflags that decide whether to trap things

00:43 <doug16k> vol 2

00:44 nyah has quit [Ping timeout: 268 seconds]

00:44 <klysm> looking at 24.7 now

00:46 <klysm> and 27.2.1

00:47 <doug16k> you could make a hypervisor that lets the real machine "leak through" to the guest, and the guest is affecting the real machine, all the way down to the hypervisor not seeing anything real at all, nothing but emulated devices

00:48 <doug16k> sorry, the hypervisor not letting the guest see anything real at all

00:48 <klysm> that makes a bit more sense

00:49 <doug16k> letting everything leak through is super fast. trapping everything is slowest

00:51 <klysm> "VM exits due to the following causes: debug exceptions; page-fault exceptions; start-up IPIs (SIPIs); system-management interrupts (SMIs) that arrive immediately after the retirement of I/O instructions; task switches; ...

00:51 <klysm> ...INVEPT; INVLPG; INVPCID; INVVPID; LGDT; LIDT; LLDT; LTR; SGDT; SIDT; SLDT; STR; VMCLEAR; VMPTRLD; VMPTRST; VMREAD; VMWRITE; VMXON; XRSTORS; XSAVES; control-register accesses; MOV DR; I/O instructions; MWAIT; accesses to the APIC-access page (see Section 29.4); EPT violations; EOI virtualization"

00:51 <doug16k> yeah you can't let everything through. I mean leatting as much as you can through

00:52 CryptoDavid has quit [Quit: Connection closed for inactivity]

00:52 <klysm> so those are instructions you can cause to trap

00:52 <doug16k> there are things that trap no matter what, and there are things that trap if you like

00:53 <klysm> can you make a trap for "IRET," then?

00:53 <doug16k> and most normal instructions can't be trapped

00:53 <klysm> okok

00:53 <geist> hmm, interesting. was piddling around on my AMD bios and it has two options for TPM: discrete (in case you have something plugged in which i dont) and 'firmware tpm'

00:53 <klange> There's something satisfyingly nostalgic about spinning up VGA text mode...

00:53 <geist> the firmare TPM seems to be some sort of emulation style thing that the AEGSA implements. curious how that works

00:53 <doug16k> geist, security processor provides it

00:53 <geist> oooh makes sense

00:54 <geist> ah yes, rebooted into linux and now i see a 'TPM2' ACPI table

00:54 <geist> presumably describes it

00:58 <doug16k> yeah, my understanding is, at cold start, that same arm security processor brings up ram and pci, then it fires up the zen2 cores

00:58 <doug16k> or whatever zen

01:01 <geist> yah makes sense. okay that expains how it's 'built in' to the cpu

01:01 <geist> i wonder if linux has any drivers or whatnot for it? this particular build doesn't seem to say anything in dmesg spew about it

01:02 <doug16k> it happens before the rom. they actually do a thing a lot like uboot

01:02 <doug16k> the rom doesn't even have to start at fffffff0

01:02 <doug16k> you can tell it to put you in ram and jump into you wherever

01:02 <geist> yah obviously linux has drivers and whatnot for it, but doesn't seem like this distro is trying to use it

01:03 <doug16k> it's a completely generic tpm

01:03 <doug16k> it should just work like a keyboard

01:03 <doug16k> right?

01:03 <doug16k> no such thing as installing tpm drivers afaik

01:03 <doug16k> that would be like installing PIC drivers no?

01:04 <geist> there is a /dev/tpm0

01:04 <geist> i just saw no boot spew and no module for it. could be built into the kernel

01:05 flx has quit [Quit: Leaving]

01:06 <doug16k> I am very glad that you don't use the drivers that came with a tpm. as if I trust some random driver

01:06 <doug16k> I think the completely generic nature of them is part of the security

01:07 <doug16k> you can use a thoroughly proven driver

01:07 <klysm> what is the "tpm driver" and what does it allow you to do to your tpm?

01:08 <doug16k> it is for storing secrets, like your full-disk-encryption key

01:08 <geist> well FWIW if i boot into windows it sees the tpm (tpm.msc has a little status window)

01:09 <klysm> so if you know your keys, you have access to all the secrets?

01:09 <geist> presumably there's a public/private thing there that lets you authenticate the storer/asked

01:09 <geist> asker

01:10 <clever> i believe TPM's also have a "measured boot" mode for unlocking

01:10 <geist> dont think it'd be very useful if you could just store a bag of bits with the name "foo" and then ask for "foo" later

01:10 <clever> every executable chunk of code in the boot chain, will report the hash of the next chunk to the TPM, before passing on control

01:10 <geist> i guess i really should read the spec. it comes up at work enough that eventually i should really know what it is

01:10 <clever> and if you play the right sequence of hashes, the TPM will unlock itself

01:11 <clever> so if say grub.efi was modified by an evil maid, the hash will be wrong, and the TPM wont unlock

01:11 <clever> and there is no way to reset the TPM, without also resetting the cpu and giving the bios/efi control

01:11 <geist> yah

01:12 <clever> so the firmware on the board will start playing the hashes back again, and only the hashed code has control over what to do with the keys

01:12 <geist> or at least if you do reset it you lose everything that was stored in it

01:12 <clever> yeah, i would expect a factory-reset to still be available

01:13 <geist> there's even a button for it here in the TPM control panel in windows

01:13 <klysm> and the arm security processor is the first thing that executes on a cold boot? that's the tpm right?

01:13 <doug16k> yeah

01:13 <doug16k> zen bioses don't even need to do cache as ram or anything. ram and pci works at bios entry

01:14 <clever> ive mostly heard of the TPM as being a self-contained module plugged into the motherboard

01:14 <klysm> except the arm security processor locks out all modifiable code?

01:14 <geist> it says it stores the secrets in some special storage. dunno if that's flash on chip or just something in EFI storage that's encrypted out the wazoo with a private key

01:14 <clever> but i can see how modern x86 "backdoor" stuff can emulate a TPM securely

01:15 <geist> clever: yah the BIOS setting here on this machine is literally 'use the emulated TPM or use the plugged in one'

01:15 gog has quit [Ping timeout: 265 seconds]

01:15 <geist> curiously theer's no 'no TPM'

01:15 <geist> though it says basically if you tell it to use the plugged in one and there isn't one present, its the same thing

01:15 <doug16k> mine has no tpm

01:15 <geist> and that was the default

01:15 <doug16k> ...setting

01:15 <geist> interesting

01:15 <geist> is it hard enabled all the time?

01:16 <geist> maybe if you didn't have a socket on your mobo they dont bother giving you the option

01:16 <doug16k> how can I check?

01:16 <doug16k> to make sure I don't have a tpm

01:16 <geist> do you have windows?

01:16 <doug16k> linux

01:16 <geist> i guess the presence of a /dev/tpm0 and/or some ACPI table TPM2?

01:17 flx has joined #osdev

01:17 <doug16k> ls: cannot access '/dev/tpm*': No such file or directory

01:17 <doug16k> it might be in "amd cbs" setting

01:17 <klysm> geist, zgrep TPM /proc/config.gz

01:17 <geist> yah mine is fairly high level in the list, but it's possibly under that

01:18 <geist> also dunno if you've updated your bios, etc

01:18 <geist> mine is very new, updated it last week, because of the x570 stability fixe that's supposedly in

01:19 <doug16k> microcode: CPU31: patch_level=0x08701021

01:19 <doug16k> zen2 btw

01:19 <geist> yah we have the same 3950x

01:20 <geist> oh side note, FWIW 5950xes appear to finally be available for more than 30 minutes at a time

01:22 <doug16k> I have my 3950x in my B350-plus and my 2700x in my x470 pro lol

01:22 <doug16k> apparently it's lackwards band

01:23 <kazinsal> I think once 5950Xes show up in reality up here in the no longer frozen and actually rapidly melting wastes north of the 49th I'll probably pick up a gently used 3900X or similar to replace my 2xE5-2620v0 machine

01:23 <doug16k> oh and my really good cooler is on my cool running 2700x and my tired cooler is on my 3950x

01:23 <kazinsal> my desktop is an 8700K and frankly anything better just isn't going to exist in this country for another half a decade

01:25 <doug16k> is there any DDR4-3200 unbuffered ECC memory available?

01:26 <doug16k> looking forward to ddr5

01:26 <doug16k> if they don't call it qdr5 I am going to have a fit

01:27 <doug16k> it's not double data rate!

01:27 <geist> kazinsal: well they're starting to show up on newegg and B&H at least

01:28 <geist> at MSRP basically

01:28 <geist> dunno if that means it's avail in .ca

01:29 <geist> interesitng the wikipedia article on TPM says that hypervisor based TPMs are a legit thing

01:29 <kazinsal> yeah, we generally have a 10-15% markup on stuff before conversion

01:29 <kazinsal> it's pretty bad

01:29 <geist> so my guess is thats how win 11 will work in a VM. the hypervisor will just provide it

01:29 <geist> it's just not incredibly secure

01:30 <kazinsal> it looks like other 5x00 series are in stock though

01:30 <kazinsal> 5900X at $719 CAD

01:31 <kazinsal> ~585 USD

01:31 <kazinsal> 5950X is in stock, wow

01:31 <moon-child> meh, zen 3 is cool but my 3960x serves me very well

01:31 <kazinsal> showing as a "regular price" of 1149 CAD but a "sale" price of 1019 CAD

01:32 <doug16k> ya, the top zen2 stuff is close enough to zen3

01:32 <geist> yah i had some plan to upgrade my 3950x to a 5950x maybe, but now that it's here i just cant say ive been sad about the 3950x

01:32 <kazinsal> so yeah, wow, I'm wrong, that's basically MSRP

01:32 <geist> like i'm sure it's 20% faste ror so, but not worth the $800

01:32 <geist> the main advantage would be i'd roll it down to my server and then get more cores there

01:32 <kazinsal> GPUs are unobtanium still of course

01:32 <doug16k> I hardly care about the singlethread increase. my workload is mostly embarrassingly parallel

01:32 <geist> but... also haven't been really stressing out over that either

01:33 <doug16k> if I cared more about the singlethread, I'd be all over zen3

01:34 <kazinsal> there's a single RTX 3070 in stock at memoryexpress but you have to buy it in a whole system

01:35 <kazinsal> annoyingly it's listed in the video cards section and not the prebuilts section because you're actually just buying all the components for a 5800X + RTX 3070 build

01:35 <geist> yah though be careful there. i've heard that some of the OEM geforces are seriously nerfed

01:36 <geist> like yeah it's technically a RTX XXXX but it's underclocked, etc

01:38 <doug16k> maybe worst bin?

01:39 <doug16k> defaults don't even work

01:40 Yukara is now known as meisaka

01:45 <kazinsal> yeah, I'm just going to wait for the 4000s to come out and the shitcoin market to collapse further so 3080s become available on the used market

01:45 <kazinsal> my 1080 Ti does 99% of what I want it to and that last 1% is just silly RTX AI demos

01:47 martm193 has joined #osdev

01:49 <geist> oh totally. 1080 tis are like gold now

01:49 <geist> it's a super trooper

01:49 <kazinsal> yeah, the most stressful thing I'll be using it for is the next battlefield game

01:49 <kazinsal> and DICE has been historically great at making those run really well on slightly older machines

01:52 <graphitemaster> If you don't mind, you can pay a scalper :P

01:52 <graphitemaster> I mean yeah it'll be over-priced, but since it's impossible to get one otherwise.

01:53 <moon-child> https://xkcd.com/606/ an alternate strategy

01:53 <bslsk05> xkcd - Cutting Edge

01:54 <graphitemaster> If no one bought high end gaming gpus or cpus when they came out we'd be worse off though

01:54 <graphitemaster> Like whales are important for you to even participate in a five year lag

01:54 <graphitemaster> They're subsidizing the cost for you in a way

02:00 <doug16k> they are extremely high volume though, they are totally gouging for video card prices right now, even before the shortage

02:00 <doug16k> my 2060 super was a ripoff, before the shortage

02:01 <martm193> So this bus vulnerability was very long talked about by me, i was very honest about it, it's an inherent arch of pipelining thing either partial or full, or even no pipelining, on immediates used to load instruction cache no units can not be behind mtrr immediates i.e i-cache protection latches, and not behind mpu protection either from dma firmware, cause this is hw issue without needing any os sides, slow pipeline mode and not the best performing and

02:01 <martm193> lowest power and electron accurate in their phases with no powerline given due to no load on the wires -- this voids any license and patents for companies, cause all the software is ust like that reverse engineered, with the help of hw bugs. Ther will be never a way to block that locally, not even the on the secure pipeline mode, it's common sense guys, tht is why google gives their code away by default also, this zero day tmining vuln. research from

02:01 <martm193> zero-labs on google is just misleading trolling and they know it as well as every real computer engineer like doug16k .

02:02 gog has joined #osdev

02:07 <martm193> the company itself is a big success though, and the best they can do is might be in place, never tried to get into google servers with exploits, there is away only to not to let this happen same thing with satellites, if code is electron perfect and out of phase in that hw, tcp will either always time out, or the phase is out and you are cut out of service with your exlpoit, i.e it never executes.

02:10 martm193 was kicked from #osdev by geist [martm193]

02:10 <Mutabah> I'm too nice :(

02:11 <Mutabah> Every time he returns I think "maybe this is interesting to something, and he's not being insulting"

02:14 <geist> yeah he's sending pages of random at me now in privmsg

02:16 <kazinsal> unsurprising

02:16 <Mutabah> Props to him for finding the new channel

02:16 gog has quit [Quit: byee]

02:18 <clever> could have probably hid better if we went to #os-dev, lol

02:18 <clever> make it a little less obvious

02:19 <Mutabah> Nah... we (well.. I) try to be open to anyone looking for help

02:19 <Mutabah> the above is why we have mods

02:24 <klange> clever: Don't want to hide over one consistent troll :)

02:25 <clever> klange: could still update the wiki, and hope the troll doesnt look?

02:25 <clever> but yeah

02:25 <klange> I was going to wait for an actually-offensive comment, new network and all...

02:25 <Mutabah> They found us on a new network, despite the "old" channel still existing - they looked at the wiki

02:25 <kazinsal> it would be easier and more effective to stage a coup in their home country with the intent of introducing a universal mental healthcare system than it would be to attempt to hide an IRC channel from one obsessive poster

02:25 <clever> kazinsal: lol

02:39 aquijoule_ has joined #osdev

02:41 richbridger has quit [Ping timeout: 268 seconds]

02:51 <doug16k> I think people could find it if it were named #0a18a189-760d-4e5a-a297-6c9970419cc7

02:57 <klysm> which reminds me I've been writing an irc bot "obot" for my irc server at call.cbu.net which invites users to uuid-based channels and keeps local comments in a database for static content.

02:59 sts-q has quit [Ping timeout: 265 seconds]

03:04 <geist> yah i thought about not kicking him, for about 3 seconds

03:04 <geist> but figured, look we know where this is going

03:04 <kazinsal> this story only ends one way

03:06 ElectronApps has joined #osdev

03:06 <geist> spent a part of the day going through the rust book and getting it set up with visual studio code

03:06 <geist> actually a pretty nice experience

03:09 sts-q has joined #osdev

03:12 <Mutabah> Yeah, I was originally vim only for my rust programming (doing legacy osdev)

03:13 <Mutabah> but started using vscode+RA for some work-adjacent stuff, and wow

03:14 <geist> yah i'm not fully sold if using a gigantic external project that it may or may not be able to index

03:14 <geist> but... it's not a terrible experience. and with the vim style key bindings it's pretty easy to switch between

03:57 <doug16k> https://gist.github.com/doug65536/90368ce92d710fa55b117488d687d391

03:57 <bslsk05> gist.github.com: rubberduck.c · GitHub

03:57 <doug16k> info tlb elision (just 4KB pages so far)

03:58 <doug16k> contiguous pages with the same flags just say ...

03:58 <doug16k> once

04:03 <doug16k> anyone care to codereview that psycho bit of state machine I made to decide whether to print each line? :P

04:06 <doug16k> I suppose last_pte shouldn't have present bit set at start

04:06 <doug16k> so nonsensical -1 pte won't mislead it

04:07 <doug16k> gotta add check for reserved bit set in here too

04:07 <doug16k> it doesn't really care right now

04:14 <doug16k> it can only omit 510/512 though, due to it showing transistion to next pd entry

04:15 <doug16k> 99.6% removal

04:16 <doug16k> 99.8% on 32 bit paging

04:18 <doug16k> what's going on here! https://gist.github.com/doug65536/90368ce92d710fa55b117488d687d391#file-gistfile1-txt-L219

04:18 <bslsk05> gist.github.com: rubberduck.c · GitHub

04:21 <doug16k> haha, look at this: https://gist.github.com/doug65536/90368ce92d710fa55b117488d687d391#file-gistfile1-txt-L19

04:21 <bslsk05> gist.github.com: rubberduck.c · GitHub

04:21 <doug16k> screwy UB MTRR overlapping large page = nonsense

04:22 <doug16k> makes that whole GB UC

04:23 <doug16k> if TLB miss that filled it was inside MTRR. otherwise, it falsely makes whole GB WB, in conflict with MTRR

04:24 <doug16k> intel doesn't guarantee that to work

04:24 <doug16k> AMD doesn't even say it works below 2MB paddr

04:25 <doug16k> I thought it would print a UB message for that

04:25 <doug16k> must have missed that case

04:26 <doug16k> would be awesome if it said which MTRR eh? :D

04:27 <doug16k> and show its base and size

04:27 <doug16k> maybe for UB scolding output I'll show the exact MTRR base/range vs page base/range

04:32 <doug16k> line 104 is an example of one that isn't elided because flags changed

04:40 <doug16k> do you think I should keep count and print how many pages were elided, instead of just ...?

04:42 srjek_ has quit [Ping timeout: 250 seconds]

04:50 drewlander has quit [Quit: ZNC 1.7.2+deb3 - https://znc.in]

04:51 drewlander has joined #osdev

04:59 <doug16k> how's this: https://gist.github.com/doug65536/3078354a8b0ceb1df0dc4c4ca7820747

04:59 <bslsk05> gist.github.com: gist:3078354a8b0ceb1df0dc4c4ca7820747 · GitHub

05:02 <klange> nice

05:16 <doug16k> found a memory corruption in my bootloader I think

05:16 <doug16k> whenever I have problems with my kernel, I should work on qemu until qemu tells me what's wrong with my kernel, instead of debugging my kernel :P

05:17 <doug16k> basically Q's algorithm. if you have to lift something heavy, just change the gravitational constant of the universe!

05:21 <doug16k> you can modify the reality your kernel runs within to make it find your bugs

05:22 Izem has joined #osdev

05:22 <doug16k> fixed weird redundant entry in eliding-at-end-of-page boundary condition

05:39 Izem has left #osdev [#osdev]

05:58 fconti has joined #osdev

06:24 tenshi has joined #osdev

06:25 Mooncairn has quit [Ping timeout: 252 seconds]

08:04 Matt|home has joined #osdev

08:12 sortie has joined #osdev

08:35 flx has quit [Ping timeout: 268 seconds]

08:38 sortie has quit [Ping timeout: 268 seconds]

09:19 dennis95 has joined #osdev

09:26 <doug16k> how can it be truncated? https://www.godbolt.org/z/65qPMGeWc

09:27 <doug16k> fixed some nonsense, didn't help https://www.godbolt.org/z/3Ed85G9M8

09:28 <doug16k> oops https://www.godbolt.org/z/6YhjbTv3f

09:28 <doug16k> it's not possible to be truncated with that code, right?

09:29 <moon-child> doug16k: it does the same thing if you say restore_path_len+imgname_len+100, I think it's just ignoring that part

09:29 <doug16k> why would it ignore it?

09:30 <doug16k> the new warnings are garbage

09:30 <doug16k> it warns about complete impossibilities

09:30 <j`ey> doug16k: fixed https://www.godbolt.org/z/sejr86nrb :P

09:30 <doug16k> warning, this never ever happens

09:31 <moon-child> I mean, I kinda feel like if you're using snprintf you're signing yourself up for truncated messages when the inputs are excessively long

09:31 <doug16k> more like if (ret < 0) __builtin_unreachable();

09:31 <moon-child> so even if it _did_ truncate they shouldn't warn you about it

09:31 <doug16k> no I didn't

09:31 <doug16k> look at the code

09:32 <doug16k> strlen strlen compute size, only if it fits, then snprintf

09:32 <doug16k> if the compiler can't see that, then it can't do that warning

09:32 <moon-child> doug16k: I'm saying that completely aside from the fact that the warning is wrong in this case, the whole idea that you should warn about this kind of situation is wrong

09:33 <j`ey> moon-child: it's a warning that you dont check the return value, if you check the return value the warning goes away

09:33 <doug16k> stupid warning

09:33 <moon-child> j`ey: I understand. I think that's a dumb warning is all

09:33 SGautam has joined #osdev

09:33 <doug16k> it is pointless to check that return value

09:33 <moon-child> 99% of the time I don't care if snprintf truncated

09:33 <doug16k> it is 100% guaranteed to succeed

09:34 <doug16k> I am annoyed about these broken warnings, while I am also glad to know a workaround

09:35 <doug16k> buffer overrun static analysis must not diagnose code that can't overrun

09:35 <j`ey> if (ret < 0) __builtin_unreachable();, is kinda nice to keep anyway, in case you messed up and put an off by 1 error in the if condition :P

09:35 <doug16k> if it does, then it needs to hear the boy who cried wolf

09:36 <doug16k> j`ey, yeah, ubsan would make it an assert

09:36 <doug16k> ubsan traps reached unreachable

09:37 <doug16k> https://github.com/doug65536/dgos/blob/master/kernel/lib/ubsan.cc#L153

09:37 <bslsk05> github.com: dgos/ubsan.cc at master · doug65536/dgos · GitHub

09:38 <doug16k> just look at backtrace

09:39 <doug16k> ubsan calls that if you execute builtin_unreachable() line

09:39 <j`ey> huh

09:40 <j`ey> and if ubsan isnt enabled?

09:40 <j`ey> 'ud'?

09:40 <doug16k> then it is UB

09:40 <j`ey> ud2

09:40 <doug16k> na, it hardly ever emits that

09:40 <doug16k> it just lets it fall through

09:40 <j`ey> o

09:40 <doug16k> whatever is next, who cares

09:41 <doug16k> seriously, I checked

09:41 <moon-child> yeah, it assumes that any path that reaches there won't be taken and just strips it out of the graph

09:41 <doug16k> it implicitly believes you

09:41 <j`ey> oh right, i was thinking of rust's unreachable!(), which inserts a panic!()

09:41 <moon-child> I think sometimes it does ud2 instead of ret if you reach the end of a function

09:42 <doug16k> it could emit ud2, but I hardly ever see it

09:42 <j`ey> moon-child: someone here had no ret and no ud2

09:42 <doug16k> I think clang does it more

09:42 <j`ey> so it just executed the next function in the binary lol

09:42 <j`ey> doug16k: ah, maybe that's why ive seen it, since i used clang a decent amount

09:42 <doug16k> yeah or worse

09:42 <doug16k> it might fall into some else or something

09:43 <doug16k> from the if body

09:43 <doug16k> true block

09:43 <doug16k> it didn't need to jump over else block right? it's unreachable

09:44 <doug16k> and the register allocation might not be the same there

09:44 <doug16k> it reordered the blocks all funny because of unlikely or something

09:44 <doug16k> so it could go completely insane

09:44 <doug16k> but run a while

09:45 <j`ey> do whatever it wants!

09:46 <doug16k> it's pretty close to worst case scenario. you want to fail fast, not run all screwy for a while with mostly valid variables

09:47 <doug16k> it's bad when you luckily have a bunch of valid pointer values in the registers and execute some code that thinks they are something else

09:47 <j`ey> (which is why I like that rust's unreachable actually panics!)

09:47 <doug16k> it'll keep going and not stop until it is useless to look at

09:47 <doug16k> it does if you have ubsan on

09:48 <doug16k> rust probably has a form of ubsan on forever

09:48 <moon-child> if you want fail fast, then you don't wanna __builtin_unreachable in the first place, you want to assert or w/e

09:48 <j`ey> heh, kinda

09:49 <doug16k> I have the option to turn all my assert(e) into assume(e) which makes then if (!(e)) __builtin_unreachable();

09:50 <doug16k> you can make assert go to one extreme or the other if you want

09:50 <moon-child> that's basically NDEBUG

09:50 <doug16k> no

09:50 <doug16k> it makes the optimizer assume every assert can't possibly fail

09:50 z_is_stimky has quit [Read error: Connection reset by peer]

09:50 <doug16k> and gives it really aggressive clues

09:50 z_is_stimky has joined #osdev

09:51 dormito has quit [Ping timeout: 268 seconds]

09:51 <moon-child> yeah. But from semantics perspective

09:51 <doug16k> yeah you could make it NDEBUG driven

09:52 <doug16k> usually NDEBUG just means leave it not said whether the assert is true or not. if you assume the asserts, you tell the optimizer the asserts can't possibly fail, so use that to make sweeping assumptions in value analysis

09:55 <doug16k> https://www.godbolt.org/z/nbMW4Pz95

09:56 <moon-child> hahha, cute

09:56 <j`ey> lol

09:59 GeDaMo has joined #osdev

10:00 SGautam has quit [Ping timeout: 265 seconds]

10:00 isaacwoods has joined #osdev

10:01 sortie has joined #osdev

10:06 <doug16k> ubsan verifies them: https://www.godbolt.org/z/nbMW4Pz95

10:06 <j`ey> same link

10:07 <doug16k> thanks https://www.godbolt.org/z/jq3hxc6ra

10:07 <j`ey> that's better

10:08 <kingoffrance> thats pretty sweet, now just to embed a compiler and jit on the fly lol

10:10 <moon-child> that was synthesis os

10:10 <kingoffrance> yep, thats my understanding

10:11 <doug16k> what are you going to jit that you couldn't aot

10:12 <doug16k> why not just aot compile the bytecode for this cpu

10:13 <doug16k> aot = ahead of time

10:13 <doug16k> gcc is aot

10:14 <moon-child> https://wiki.c2.com/?SynthesisOs has some examples

10:14 <bslsk05> wiki.c2.com <no title>

10:15 <kingoffrance> i didnt say its a good idea :) any function where you know arg values and local variable values (or can eliminate some things, even if you dont exactly know) and can make an "optimized" variant; for many things and modern cpus i think it will be a loss, although i think more complicated code even tiny gains can add up, e.g. a( b( c() ); d(); ) what moon-child said too (havent seen it, presumably shows some real example)

10:16 <kingoffrance> i see it as a middle thing

10:16 <kingoffrance> some code that is somewhat complicated, calls other functions, but you need to be able to compile quick to make it worthwhile

10:16 <kingoffrance> i mean, i guess multicores, you could dedicate some to background "optimize"

10:16 <GeDaMo> https://en.wikipedia.org/wiki/Partial_evaluation

10:16 <bslsk05> en.wikipedia.org: Partial evaluation - Wikipedia

10:17 <doug16k> I found an old jit thing I made for qsort callback long ago, lol

10:17 <moon-child> yea cache effects turn the scales slightly further against jit these days, but only slightly

10:17 <doug16k> it generated a thunk that gave you the extra parameter

10:17 <moon-child> doug16k: have definitely done that

10:20 SGautam has joined #osdev

10:22 dormito has joined #osdev

10:23 <kingoffrance> im not sure the specific differences jit versus aot; wikipedia says one context aot is opposite of jit, but jit page JIT compilation is a combination of the two traditional approaches to translation to machine code (ahead-of-time compilation (AOT), and interpretation) and combines some advantages and drawbacks of both

10:24 <moon-child> it's a somewhat poorly-defined term

10:26 Arthuria has joined #osdev

10:27 <kingoffrance> i almost see it as time/space tradeoff. in sense of, all these variant functions floating around, might be shorter, but more of them versus one "generic" one

10:27 <kingoffrance> *each individual might be shorter

10:28 <kingoffrance> if you say run a program 20 times

10:28 <kingoffrance> 20 of the same proc running at same time, each with different variants for whatever functions

10:29 <kingoffrance> i mean, nothing is free

10:31 <kingoffrance> *same program, different processes

10:31 Arthuria has quit [Read error: Connection reset by peer]

10:32 Arthuria has joined #osdev

10:35 X-Scale has quit [Ping timeout: 268 seconds]

10:40 <kingoffrance> maybe what you want is something like: -Oquick do any optimization that doesnt take too long to compile

10:41 <doug16k> -O is "quick"

10:41 <doug16k> without a number makes a big difference

10:42 <doug16k> -O basically does exactly what you said with not bad codegen

10:42 <kingoffrance> yeah, just embed it now :)

10:42 <kingoffrance> i do agree that does seem kind of how gcc -O is

10:42 <doug16k> if you said for (int i = 0; i < 20; ++i) it will precisely set the thing to 0, and check each time if it is less than 20, and increment the thing, but it will try to do that well

10:43 <doug16k> it won't transform anything

10:43 <doug16k> it varies across targets though

10:43 <doug16k> on x86 -O is extremely obedient

10:43 <doug16k> on riscv it says ya right and forces a call to something

10:44 <kingoffrance> the obvious of course, more "space" because you have to keep "source" around too

10:44 <doug16k> (where x86 would have just copied each int like you said)

10:45 <doug16k> there is a gcc jitter you know

10:45 <doug16k> I'm sure you heard of it

10:46 <doug16k> https://gcc.gnu.org/wiki/JIT

10:46 <bslsk05> gcc.gnu.org: JIT - GCC Wiki

10:47 <doug16k> it's more like a "don't care what time compiler"

10:48 <doug16k> best of both worlds. can precompile for instant cold start, or jit for smallness

10:49 Arthuria has quit [Read error: Connection reset by peer]

10:50 Arthuria has joined #osdev

10:53 ElectronApps has quit [Ping timeout: 265 seconds]

10:55 ElectronApps has joined #osdev

10:56 <doug16k> https://gcc.gnu.org/onlinedocs/jit/intro/tutorial01.html

10:56 <bslsk05> gcc.gnu.org: Tutorial part 1: “Hello world” — libgccjit 12.0.0 (experimental ) documentation

10:58 Arthuria has quit [Read error: Connection reset by peer]

10:58 Arthuria has joined #osdev

10:58 <doug16k> because very little of the code actually gets called, just jitting each thing as it is first used is way smaller

10:59 <doug16k> lots of the code runs rarely and briefly

10:59 <doug16k> usually I mean

11:00 <doug16k> that jit api looks great though

11:00 <doug16k> just have to transpiler to that and it compiles

11:00 Arthuria has quit [Read error: Connection reset by peer]

11:00 Arthuria has joined #osdev

11:05 SGautam has quit [Ping timeout: 252 seconds]

11:08 SGautam has joined #osdev

11:31 <kingoffrance> no i hadnt heard of it, i dont follow things, but yes that does look like youd just need to transpile basically

11:32 gog has joined #osdev

11:33 <kingoffrance> i wonder what magic goes on so that you can greet() isnt modern os not gonna let you just put some machine code to some address and execute it?

11:34 <kingoffrance> anyhow, at least from c pov, looks transparent. greet is just a normal function pointer

11:34 <GeDaMo> mprotect

11:35 <kingoffrance> yeah i figured, i just dont know if distros lock that down for a normal user and only allow it for certain programs etc.

11:40 <sham1> mprotect? It does work for normal users

11:40 <sham1> A lot of JITs use that to change between writing to the "machine code buffer" and making it executable again because W^X

11:42 SGautam has quit [Ping timeout: 268 seconds]

12:00 Arthuria has quit [Read error: Connection reset by peer]

12:00 Arthuria has joined #osdev

12:03 <clever> doug16k: https://www.youtube.com/watch?v=R7CO9v9rpOk

12:03 <bslsk05> 'The Dirty Way Manufacturers are Downgrading Your PC' by Linus Tech Tips (00:16:53)

12:03 <clever> doug16k: basically, its the ram density, number of chips changing, but total storage capacity the same

12:12 ElectronApps has quit [Read error: Connection reset by peer]

12:12 ElectronApps has joined #osdev

12:41 ahalaney has joined #osdev

12:57 ElectronApps has quit [Read error: Connection reset by peer]

12:57 ElectronApps has joined #osdev

13:15 nyah has joined #osdev

13:22 Arthuria has quit [Read error: Connection reset by peer]

13:22 Arthuria has joined #osdev

13:36 CryptoDavid has joined #osdev

13:39 tenshi has quit [Ping timeout: 268 seconds]

13:40 tenshi has joined #osdev

13:54 janemba has quit [Ping timeout: 258 seconds]

13:57 Mooncairn has joined #osdev

13:57 ElectronApps has quit [Remote host closed the connection]

13:58 ElectronApps has joined #osdev

14:00 xenos1984 has quit [Ping timeout: 250 seconds]

14:02 vai has quit [Ping timeout: 258 seconds]

14:03 xenos1984 has joined #osdev

14:03 ElectronApps has quit [Ping timeout: 258 seconds]

14:04 ElectronApps has joined #osdev

14:06 janemba has joined #osdev

14:21 ElectronApps has quit [Read error: Connection reset by peer]

14:25 mniip has quit [Quit: This page is intentionally left blank.]

14:38 Arthuria has quit [Read error: Connection reset by peer]

14:40 Arthuria has joined #osdev

14:42 Arthuria has quit [Read error: Connection reset by peer]

14:43 Arthuria has joined #osdev

14:43 Arthuria has quit [Read error: Connection reset by peer]

14:43 mniip has joined #osdev

14:44 BadQuanta has joined #osdev

14:44 Arthuria has joined #osdev

14:51 vdamewood has joined #osdev

14:54 Arthuria has quit [Read error: Connection reset by peer]

15:05 mahmutov has joined #osdev

15:08 Arthuria has joined #osdev

15:13 Arthuria has quit [Read error: Connection reset by peer]

15:13 Arthuria has joined #osdev

15:17 mahmutov has quit [Ping timeout: 258 seconds]

15:26 flx has joined #osdev

15:41 srjek_ has joined #osdev

15:53 Arthuria has quit [Read error: Connection reset by peer]

15:55 Arthuria has joined #osdev

15:57 Arthuria has quit [Read error: Connection reset by peer]

16:03 Arthuria has joined #osdev

16:03 Arthuria has quit [Read error: Connection reset by peer]

16:06 Arthuria has joined #osdev

16:06 Arthuria has quit [Read error: Connection reset by peer]

16:06 Arthuria has joined #osdev

16:07 dennis95 has quit [Remote host closed the connection]

16:07 Arthuria has quit [Read error: Connection reset by peer]

16:07 dennis95 has joined #osdev

16:08 dennis95 has quit [Remote host closed the connection]

16:08 dennis95 has joined #osdev

16:08 dennis95 has quit [Remote host closed the connection]

16:10 dennis95 has joined #osdev

16:10 Arthuria has joined #osdev

16:11 srjek_ has quit [Ping timeout: 250 seconds]

16:15 Arthuria has quit [Read error: Connection reset by peer]

16:17 archenoth has quit [Remote host closed the connection]

16:17 Arthuria has joined #osdev

16:17 archenoth has joined #osdev

16:21 asymptotically has joined #osdev

16:29 Arthuria has quit [Read error: Connection reset by peer]

16:30 Arthuria has joined #osdev

16:31 xenos1984 has quit [Remote host closed the connection]

16:33 xenos1984 has joined #osdev

16:34 Arthuria has quit [Read error: Connection reset by peer]

16:35 Arthuria has joined #osdev

16:35 dennis95 has quit [Read error: Connection reset by peer]

16:35 Arthuria has quit [Read error: Connection reset by peer]

16:36 Arthuria has joined #osdev

16:53 tacco has joined #osdev

17:04 mahmutov has joined #osdev

17:15 dh` has quit [Remote host closed the connection]

17:20 dh` has joined #osdev

17:50 mctpyt has quit [Ping timeout: 250 seconds]

17:52 mctpyt has joined #osdev

18:16 Arthuria has quit [Read error: Connection reset by peer]

18:16 Arthuria has joined #osdev

18:24 MiningMarsh has quit [Quit: ZNC 1.8.2 - https://znc.in]

18:25 MiningMarsh has joined #osdev

18:42 zoey has joined #osdev

18:47 Arthuria has quit [Read error: Connection reset by peer]

18:47 Arthuria has joined #osdev

18:49 <doug16k> clever, linus never heard of dram banks before

18:49 <doug16k> he should go look up what an xor gate does

18:51 <doug16k> he's too clueless to see that AMD gets more performance out of additional banks than intel does

18:51 Arthuria has quit [Read error: Connection reset by peer]

18:51 <doug16k> that was noticed the day zen3 was released

18:51 Arthuria has joined #osdev

18:54 <clever> doug16k: in the video, he swapped the ram between 2 laptops, and the benchmark results almost entirely swapped

18:54 <doug16k> yes, last year this was news

18:54 <doug16k> recycled

18:56 <doug16k> he has taken zen3 taking better advantage of bank concurrency, and turned it into a pile of bullshit that implies that the intel is better somehow

18:56 <doug16k> he pisses me off. he doesn't even know how computers work

18:57 <doug16k> he thinks laptops are not the same as desktops

18:57 <doug16k> somehow dram performance is magically not the same because it is a laptop

18:58 <clever> i also couldnt make full sense out of which ram was better for which cpu, from that vid

18:59 <clever> i think the main point he was making though, was that OEM's where selling hw with the "worse" ram, and not listing the ram specs that matter

18:59 <doug16k> the reason that switching the ram helped amd is, he put the ram with more banks in the amd, and ram with fewer banks in the intel, and amd suddenly sped up because AMD is capable of more concorrency

18:59 <doug16k> bank concurrency

19:00 <doug16k> the intel doesn't benefit from the extra banks

19:00 <clever> yeah, i can see how that would help, so the AMD chip is better designed, but the OEM paired it up with the "wrong" ram, and didnt give that in the spec sheets

19:01 <doug16k> what about the zen3 supporting 3200 memory. what's the max non-OC clock on that intel?

19:02 <doug16k> not 3200 right?

19:02 <doug16k> the amd memory controller runs circles around that intel one

19:02 <doug16k> not even fair. intel using older process

19:03 <doug16k> just compare intel igpu against amd igpu. guess which one hammers the memory controller drastically harder and succeeds?

19:06 <clever> changing topics slightly...

19:06 <clever> i can see how a concurrent capable dram controller, might help in the rpi, given that it almost has 3 brains fighting over the ram

19:06 mctpyt has quit [Ping timeout: 252 seconds]

19:07 <clever> but from what ive seen of the design, it only has a single bus, so it can only do one transfer at a time

19:08 <doug16k> bank concurrency is expected from all memory controllers

19:09 <doug16k> even EE student homework memory controllers do it :D

19:09 <clever> i think i'm mixing up 2 different concepts

19:09 <doug16k> zen3 just took it to an extreme

19:09 <clever> 1: accessing one bank while another is doing open/refresh/commit

19:09 <clever> 2: accessing 2 entirely seperate chips, on seperate busses, in parallel

19:10 <clever> i dont think the pi is capable of 2

19:10 <doug16k> it can simultaneously access I/O and memory right?

19:11 <doug16k> like a pc

19:11 <clever> internal IO, yep

19:11 <doug16k> axi is separate concurrent interface isn't it?

19:12 <clever> i believe axi is a shared bus (with fifo's at entry/exit), to connect every master to every slave

19:12 <clever> so the axi bus routes a request to either the ram controller, or the mmio, based on the addr

19:12 <clever> and its packet based, so while a read is in progress, axi can send another read-request to a diff slave

19:13 <clever> the fifo's on both ends, let you shove a read request into the master port, without having to wait for your turn

19:13 <clever> and lets axi transfer it over to the slave port, without having to wait for the slave to be ready

19:13 <clever> whenever the slave is ready, it will pop a request off its fifo, act on it, and push a reply on a fifo facing the other way

19:16 <clever> doug16k: in theory, the ram controler can have its own internal fifo, where it will move requests, and then process them out of order, and concurrently

19:16 Arthuria has quit [Read error: Connection reset by peer]

19:16 Arthuria has joined #osdev

19:19 kwilczynski has joined #osdev

19:19 <clever> https://github.com/librerpi/rpi-open-firmware/blob/master/firmware/sdram.c#L85-L98

19:19 <bslsk05> github.com: rpi-open-firmware/sdram.c at master · librerpi/rpi-open-firmware · GitHub

19:19 tenshi has quit [Quit: WeeChat 3.2]

19:19 <clever> doug16k: when i was analyzing the existing source, i had found that the 1gig model of pi, has the same density of ram s the 512mb model, but half the bus width

19:20 <clever> what i think is happening there, is that they just shoved a pair of 512mb chips into a single package, and wired them up in parallel, to act as a single bigger chip, each taking up half the data bus

19:23 fkrauthan has quit [Quit: ZNC - https://znc.in]

19:24 fkrauthan has joined #osdev

19:33 mahmutov has quit [Ping timeout: 268 seconds]

19:34 mahmutov has joined #osdev

19:36 mctpyt has joined #osdev

19:41 fconti has quit [Quit: Leaving]

19:50 immibis has joined #osdev

20:16 <moon-child> doug16k: gccjit is garbage

20:16 <moon-child> ditto llvmjit

20:16 <moon-child> you don't want to actually use something like that for runtime compilation

20:16 <moon-child> webkit started out using llvm, and then they realised that was a horrible idea

20:17 <moon-child> llvm/gcc are set up for batch compilation. They're never going to be fast enough, and they're not set up to take advantage of things you can only do when jitting

20:19 GeDaMo has quit [Quit: Leaving.]

20:21 asymptotically has quit [Quit: Leaving]

20:22 Arthuria has quit [Read error: Connection reset by peer]

20:22 Arthuria has joined #osdev

20:30 aerona has joined #osdev

20:38 dormito has quit [Ping timeout: 268 seconds]

20:43 Arthuria has quit [Read error: Connection reset by peer]

20:43 Arthuria has joined #osdev

21:07 <Bitweasil> ARMv7 starts in supervisor mode, secure, right?

21:07 <Bitweasil> Ok, yeah, found the reference.

21:16 <clever> Bitweasil: i believe arm always starts in the most powerful mode the cpu supports

21:16 <clever> so if the core has supervisor, it will start in supervisor

21:16 <clever> once your in kernel mode, there is basically no way to know that the core ever has supervisor support, enless an active supervisor reveals itself

21:16 <clever> same for 64bit support

21:31 srjek_ has joined #osdev

21:36 dormito has joined #osdev

21:46 Arthuria has quit [Read error: Connection reset by peer]

21:46 BadQuanta has quit [Read error: Connection reset by peer]

21:46 Arthuria has joined #osdev

21:48 zoey has quit [Quit: Leaving]

22:08 aerona has quit [Quit: Leaving]

22:09 <geist> clever: armv7 can't not support supervisor mode. it's a different model than the EL stuff in armv8

22:09 <geist> yes, it does

22:19 sortie has quit [Quit: Leaving]

22:31 Mooncairn has quit [Quit: Quitting]

22:31 xenos1984 has quit [Remote host closed the connection]

22:33 xenos1984 has joined #osdev

22:33 <Bitweasil> er.

22:33 <Bitweasil> Supervisor is EL1.

22:34 <Bitweasil> Are you thinking HYP/Monitor?

22:34 <Bitweasil> It appears to start in secure EL1, not monitor mode.

22:34 <Bitweasil> Though it's trivial to drop down into monitor mode, looking at the Pi armstubs.

22:38 <clever> Bitweasil: my memory says EL3 is supervisor? EL2 is hypervisor, EL1 is kernel, and EL0 is userland

22:39 <Bitweasil> EL3 is Monitor.

22:39 <clever> ah, getting those 2 mixed up

22:39 <Bitweasil> Supervisor and System are both EL1 modes.

22:39 <clever> i also had to implement dropping donw a level in LK

22:39 <j`ey> clever: supervisor ~= kernel

22:39 <clever> j`ey: ahh

22:40 <j`ey> supervises the users, hypervisor supervises the supervisors :P

22:40 <clever> Bitweasil: https://github.com/littlekernel/lk/commit/71687b4cbfd4be7b11363622578e2cc97031a21a#diff-fa868b857c8fc967227bef0126425b7e41883b735b5bbe75ec44f2f0d6b1faa7

22:40 <bslsk05> github.com: [arch][arm] fix booting when in HYP mode · littlekernel/lk@71687b4 · GitHub

22:40 <clever> Bitweasil: when LK was first ported to the rpi, the firmware left HYP mode for you, and everything just worked

22:41 <clever> Bitweasil: but then somebody wanted HYP on linux, and the firmware was modified to launch linux in HYP mode, which entirely broke LK

22:41 <clever> LK would set the MMU enable flag for supervisor/kernel mode, and being in HYP mode, that had zero effect

22:41 <Bitweasil> *nods*

22:41 <clever> then it would jump to the virt addr, and *fault*

22:41 <Bitweasil> yeah, this all sounds familiar.

22:41 <Bitweasil> I'm on the other end of it, but... yes.

22:42 ahalaney has quit [Remote host closed the connection]

22:44 <clever> Bitweasil: and i'm also running into similar problems on the open-firmware side, because the arm stubs are entirely omitted

22:49 Arthuria has quit [Read error: Connection reset by peer]

22:50 Arthuria has joined #osdev

22:53 <doug16k> moon-child, yeah, I wouldn't expect anything to do magical JVM-style recompile-reoptimize stuff

22:59 Arthuria has quit [Read error: Connection reset by peer]

23:00 Arthuria has joined #osdev

23:00 <moon-child> that stuff aside, you want fast and probably tracing, neither of which gccjit can do

23:01 <doug16k> you could probably generate code faster, but unlikely you could generate faster code

23:01 Arthuria has quit [Read error: Connection reset by peer]

23:01 <doug16k> I don't even know how it is possible for gcc to be so fast

23:01 Arthuria has joined #osdev

23:02 <doug16k> tens of milliseconds to compile most of my files

23:04 <doug16k> it's over 100 files per second when I compile qemu

23:04 <doug16k> way over

23:05 <doug16k> in my rom project, I am often checking the build window to see if it really built something. seemed like it did nothing. it did it

23:06 <doug16k> it's weird when clean build is 40ms

23:07 <clever> heh

23:15 <doug16k> ah, has grown since then. up to 100ms on -j32, but the compiles finish so instantly that it could only keep 4 cpus going

23:15 <doug16k> real: 100ms, user, 452ms

23:17 <doug16k> imagine how fast gcc would be if it didn't need a new process for each file?

23:18 <doug16k> funniest thing, that 100ms, half that is page faults, half of that half is malloc calls, and a tiny blip of it is compiling

23:19 <doug16k> probably 10ms of actual compiling in there

23:19 <moon-child> ehh I don't buy that

23:20 <doug16k> don't have to. see for yourself in perf

23:20 <moon-child> tcc is 10-20x faster than gcc. It can't be 90% overhead

23:20 <doug16k> gcc spends more time in page fault than anything else

23:20 <doug16k> 2nd place is malloc

23:20 <doug16k> in a make -j32

23:20 <moon-child> well--alright

23:21 <moon-child> but you don't have to do page faults nor malloc

23:21 <moon-child> if you don't build graphs, everything is flat, access patterns are predictable

23:23 <clever> doug16k: sqlite does a thing where it concats every .c into one big fat .c file

23:23 <clever> doug16k: you can also `gcc foo.c bar.c -o baz`

23:24 <doug16k> that defeats parallelism though

23:24 <moon-child> ^

23:24 <clever> yeah

23:24 <doug16k> LTO could extract some back out of it

23:24 <clever> ghc can get parallelism, while also recursively finding all modules on its own

23:24 <moon-child> I actually wish compilers could do that kinda parallelism, though

23:24 <clever> but it works in a very different way, with no .h files

23:24 <moon-child> you could cut down on i/o that way too, only read headers once

23:24 <clever> moon-child: there is the pre-compiled headers thing

23:25 <clever> where you turn the .h files into a binary form, that will parse more quickly

23:25 <clever> i stumbled upon it in my early days, when i blindly ran gcc on every source file

23:25 <moon-child> not the parse, you have to do separate parse anyway (unless you do clever caching), the i/o

23:26 <clever> comments would also be gone from the compiled headers, so it may need less IO

23:26 <moon-child> and actually, you do want to do clever caching, but you want that in the compiler, not the build system, because the compiler will be less likely to get it wrong. So another argument for doing it all in one process

23:26 <clever> yep, thats what ghc is doing

23:26 <clever> it has a hashmap for other modules

23:28 <moon-child> it's harder to do correctly for c, though. You have to check that all the same macros are pre-defined before loading your cached copy

23:28 <clever> the weird thing with haskell, is that there are no header files

23:28 <moon-child> except obviously some macros are going to be different, so you instead have to check that all the macros that were actually _used_ are the same

23:28 <moon-child> clever: right, exactly. That kinda caching is way easier with proper modules

23:29 <clever> it will basically parse very .c file your linking against, and extract the type information from it

23:29 <clever> but it also auto-generates .hi files, which contain that type info

23:29 <clever> in a binary form

23:30 <clever> moon-child: this is also where performance problems come in, the hashmap contains lazy objects, so the compiler only has to compute the value if it actually wants to read it

23:31 <clever> moon-child: the problem is 2 fold, first its loosing that value and turning it back into lazy every time a module completes building

23:31 <clever> moon-child: second, a performance tunable called lazy blackholing, causes it to run that computation multiple times in parallel, wasting cpu time

23:31 <clever> so building with -j4, causes it to use 4*4 times as much cpu

23:32 Arthuria has quit [Read error: Connection reset by peer]

23:32 <clever> (rough estimate)

23:32 Arthuria has joined #osdev

23:32 <clever> it can be solved, now that its known, but its a tricky edge case

23:34 netbsduser``` has joined #osdev

23:35 netbsduser has quit [Remote host closed the connection]

23:37 Arthuria has quit [Ping timeout: 272 seconds]

23:40 <ZetItUp> https://wiki.osdev.org/PCI_IDE_Controller the ide_read_buffer bug, is there a fix for it?

23:40 <bslsk05> wiki.osdev.org: PCI IDE Controller - OSDev Wiki

23:56 buffet0 has joined #osdev

23:57 opios2 has quit [Ping timeout: 244 seconds]

23:57 buffet has quit [Read error: Connection reset by peer]

23:57 buffet0 is now known as buffet

23:57 Griwes has quit [Ping timeout: 244 seconds]

23:58 Griwes has joined #osdev

23:59 CryptoDavid has quit [Quit: Connection closed for inactivity]

23:59 mahmutov has quit [Ping timeout: 272 seconds]