#osdev on 2022-08-05 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 <dh`> and during traps it remember the CPU number in the MMU register that's supposed to be there for pointing at the page table base, since the hardware doesn't actually interpret that

00:00 <clever> software mmu?

00:00 <dh`> because students write the VM system, there can't be magic special mappings

00:01 <heat> that's dope

00:01 <dh`> iirc this scheme was based on what linux mips was doing at the time; I have no idea why they weren't using per-cpu mappings

00:02 <dh`> the provisional riscv port so far uses the tp register as the kernel thread pointer

00:04 <clever> dh`: would that thread exist in both user and kernel? 2 stacks to the thread?

00:04 <dh`> yeah, like most conventional designs each userlevel thread manifests in the kernel as a kernel-level thread with a kernel-level thread stack

00:05 <dh`> and there are some other kernel-only threads that do things like sync the disks

00:05 frkzoid has joined #osdev

00:06 <clever> i was thinking earlier about how to do a EL3 or EL2 in LK, where you have a function call that drops to a lower level and resumes a guest

00:06 <clever> and when the guest traps or syscall's, you return from that function

00:07 <heat> that seems possible with setjmp longjmp magic

00:08 <clever> but ive also heard of other kernels, where a syscall/irq starts fresh at the top of a new stack

00:08 <clever> and you may then have to kinda context-switch into the right kernel thread?

00:09 MiningMarsh has quit [Read error: Connection reset by peer]

00:10 MiningMarsh has joined #osdev

00:11 <clever> heat: the EL2->EL1 change seems easy enough, just set the saved pc/status, and eret, the tricky part is getting it to return from that upon any exception...

00:12 <clever> each EL has its own stack pointer right?, so the stack will be at that function call upon the exception?, but the exception prologue would want to return from the exception

00:12 <clever> so you kinda have to skip a stack frame?

00:13 <dh`> all that kind of stuff seems to me like borrowing trouble

00:14 <clever> dh`: why?

00:14 <clever> its the most logical solution i can see, having read the low-level /dev/kvm api

00:14 <dh`> because the code to do stuff like that at trap time is extremely delicate and really easy to get wrong, and then undebuggable

00:14 <clever> yeah, thats the tricky part

00:16 <dh`> also in general if you leave anything pending in the kernel when you go back to userland (or anything less trusted) you're begging for it to never return or crash and then you have to do some kind of recovery on that context somehow

00:16 <clever> dh`: but you could also do what heat said, setjump before you eret, and then longjmp from the exception handler, does that sound safe?

00:16 <dh`> and that's another pile of delicate and difficult to debug code that also rarely gets exercised

00:16 <dh`> in general my recommendation is to always unwind your kernel state and do any wild gyrations at the trap handler level

00:16 <dh`> like traditional signal delivery works

00:17 <dh`> because while it's not free, it's relatively straightforward to get right

00:17 <dh`> but there's a lot of stuff related to supporting kvm-style guests that I've never really thought much about

00:18 <clever> that reminds me, the general design i was thinking of, was to run LK in EL2, and the guests in EL1

00:19 <clever> but normal linux kvm, runs linux in EL1, and hypercalls into an EL2 stub, to do any hypervisor actions

00:19 <clever> which then returns into a guest EL1

00:19 <clever> but geist also mentioned a new armv8 extension, where you can technically be in EL2, but all of the EL1 registers alias into their EL2 counterparts

00:19 <clever> so linux can run fully in real EL2, but still be coded as-if it was in EL1

00:19 gog has quit [Read error: Connection reset by peer]

00:20 <clever> and then it doesnt have to EL1->EL2->EL1

00:20 <dh`> I also have 0 knowledge of this arm stuff :-|

00:20 <clever> arm and x86 name the permission levels in the oposite direction

00:20 <clever> EL3 == secure monitor, the most powerful, EL2=hypervisor, EL1=kernel, EL0=userland

00:21 <clever> when you transition to a lower level, you choose what pc/status reg pair the cpu atomicly swaps into

00:21 <clever> and the current EL is in the status reg

00:21 <dh`> and I've also disliked the idea of having fixed levels, it seems like there should be machine-monitor level and everything else should be recursive instances of the same stuff

00:21 <moon-child> capability hardware!

00:21 * moon-child hides

00:22 <dh`> but everybody has instead rushed to add special hypervisor-level crap to the machine architectures, with the result that either you can't have guests of guests or it takes a lot more work

00:22 <clever> dh`: arm also lets levels be optional, if your kernel started in EL1, and there is no EL3/EL2 firmware, it is impossible to tell if EL2/EL3 even exist

00:22 <clever> its functionally identical to if EL2/EL3 dont even exist

00:24 <clever> dh`: yeah, nested virtualization isnt really supported at the hw level on arm, EL2 can setup a second level of paging tables, so EL1 tables automatically get translated

00:24 <heat> oh yeah something I forgot to complain about but I really want to: a good portion of facebook recruiting emails get tagged to Social by gmail

00:24 <clever> but if your doing nested virt, i would think you would have to shadow the inner hypervisor guest->guest tables

00:25 <clever> you would also need to track if its a guest of a guest currently running, and forward traps to the right guest level

00:28 <dh`> you also need to virtualize everything that makes it look like you know which level you're on

00:28 <dh`> iow, a lot more work

00:29 <clever> yeah

00:30 <clever> and EL3 cant do nested paging tables

00:30 <clever> so you cant use that to run multiple EL2's

00:31 <clever> EL3 is generally reserved for TPM type tasks

00:37 <clever> when a system is setup for security, then EL3 controls what code can access key storage, and can either block booting non-signed kernels, or disable key-storage access when booting unsigned kernels

00:37 <clever> or destroy all key material before you boot an unsigned kernel

00:41 MiningMarsh has quit [Read error: Connection reset by peer]

00:42 MiningMarsh has joined #osdev

00:55 gildasio has quit [Remote host closed the connection]

00:56 gildasio has joined #osdev

00:57 MA-SA-YU-KI has quit [Quit: Connection closed for inactivity]

01:02 MiningMarsh has quit [Ping timeout: 245 seconds]

01:03 MiningMarsh has joined #osdev

01:14 mykernel has quit [Ping timeout: 240 seconds]

01:14 [itchyjunk] has quit [Ping timeout: 240 seconds]

01:16 mykernel has joined #osdev

01:19 [itchyjunk] has joined #osdev

01:20 mykernel has quit [Client Quit]

01:32 air has quit [Ping timeout: 245 seconds]

01:45 gildasio has quit [Remote host closed the connection]

01:45 air has joined #osdev

01:46 gildasio has joined #osdev

01:46 zaquest has quit [Remote host closed the connection]

01:49 zaquest has joined #osdev

01:56 heat has quit [Ping timeout: 240 seconds]

02:03 MiningMarsh has quit [Ping timeout: 268 seconds]

02:04 MiningMarsh has joined #osdev

02:08 gog` has quit [Ping timeout: 245 seconds]

02:49 terrorjack has quit [Quit: The Lounge - https://thelounge.chat]

02:51 terrorjack has joined #osdev

03:07 gildasio has quit [Remote host closed the connection]

03:08 gildasio has joined #osdev

03:12 gildasio has quit [Remote host closed the connection]

03:12 gildasio has joined #osdev

03:21 [itchyjunk] has quit [Read error: Connection reset by peer]

03:57 andydude has joined #osdev

03:58 andydude_ has joined #osdev

03:59 andydude_ has quit [Client Quit]

04:05 frkzoid has quit [Ping timeout: 244 seconds]

04:06 ggherdov has quit [Ping timeout: 264 seconds]

04:11 gaze___ has quit [Ping timeout: 260 seconds]

04:13 ggherdov has joined #osdev

04:13 gaze___ has joined #osdev

04:13 matt__ has joined #osdev

05:04 srjek has quit [Ping timeout: 255 seconds]

06:00 MiningMarsh has quit [Quit: ZNC 1.8.2 - https://znc.in]

06:04 MiningMarsh has joined #osdev

06:18 andydude has quit [Quit: CoreIRC for Android - www.coreirc.com]

06:34 the_lanetly_052 has joined #osdev

06:51 vdamewood has joined #osdev

06:58 vinleod has joined #osdev

06:59 vdamewood has quit [Killed (zinc.libera.chat (Nickname regained by services))]

06:59 vinleod is now known as vdamewood

07:10 the_lanetly_052 has quit [Ping timeout: 268 seconds]

07:13 the_lanetly_052 has joined #osdev

07:27 Brnocrist has quit [Ping timeout: 268 seconds]

07:45 bauen1 has quit [Ping timeout: 268 seconds]

07:46 poyking16 has joined #osdev

07:51 scaleww has joined #osdev

08:02 gildasio has quit [Remote host closed the connection]

08:02 gildasio has joined #osdev

08:27 bauen1 has joined #osdev

08:46 GeDaMo has joined #osdev

08:48 scaleww has quit [Quit: Leaving]

08:54 socksonme_ has joined #osdev

09:01 gdd has quit [Ping timeout: 268 seconds]

09:03 gdd has joined #osdev

10:05 gildasio has quit [Remote host closed the connection]

10:06 gildasio has joined #osdev

10:16 _xor has joined #osdev

10:34 heat has joined #osdev

10:39 wootehfoot has joined #osdev

10:53 vai has joined #osdev

10:53 <vai> Any neat debugging flags for GCC? I have to stop using -W

10:54 <vai> I want to do a warning level that warns of pragmatic errors, really true ones only.

10:54 <vai> aka warnings

10:55 <GeDaMo> Why do you have to stop using -W? Did the parameter police pay you a visit? :|

10:55 <psykose> i think -pedantic has what you're looking for

10:55 gildasio has quit [Remote host closed the connection]

10:56 gildasio has joined #osdev

10:56 <vai> yeah, and great to find an error without the warnings settings GeDaMo + psykose ... meaning I have 100 lines of C code

10:56 <vai> *100 thousand

10:57 <vai> FS has errors. And HD driver I am looking now, it still has a remote device server, etc. code I have to remove -- which is completely wrong approach in monolithic kernel.

10:58 <vai> FS is not that bad, I get a ls on my OS.

10:59 <vai> chs.c - nice to have you friend still along

11:02 <vai> #coders is empty eh ?

11:03 <GeDaMo> There's ##programming for general programming chat

11:03 wootehfoot has quit [Read error: Connection reset by peer]

11:06 <vai> for(i=0,i2=p1*chd->units,p=sttmp; i<chd->units; i++,p+=512,i2++) { hdBlockRW(chd, p,chd->l_buf, i2,0,WRITE); }

11:06 <vai> 512 bytes pieces or larger writes, what is your recommendation if any? :-)

11:07 <vai> DWORD *size; int flags,i,i2,sz; char *p;

11:08 <vai> p1 delivered from my device call arguments/parameters, every driver gets equal number of parameters

11:11 <vai> same for READ

11:12 <vai> // Copy chunk to client's memory space

11:12 <vai> memcpy(po1, sttmp, chd->units*12);

11:12 <vai> not sure if this exactly is good code

11:12 <vai> *12?? who joed this code

11:12 <vai> I dont remember writing this

11:13 <vai> 512

11:13 <vai> no shit HD driver did not work because of this

11:21 gildasio has quit [Ping timeout: 268 seconds]

11:35 <vai> okay tested, it works okay.. actually better than the ramdisk driver which has problems with really old cache - if I am not misguided

11:49 poyking16 has quit [Ping timeout: 240 seconds]

11:52 poyking16 has joined #osdev

11:55 <heat> vai, what do you want with "neat debugging flags"?

11:55 <heat> -W isn't really a debugging flag

11:56 <heat> it's also a horrible flag

11:56 <heat> use -Wall -Wextra

12:00 <zid> -W -Wall -Wextra

12:00 <zid> -O3 -fwhole-program

12:02 <heat> -flto

12:02 <kazinsal> -fso-i-think-the-compiler-is-wrong-but-i-refuse-to-admit-my-own-possible-faults

12:02 <zid> fwhole-program is just -flto but better

12:02 <heat> but in all honesty, like good base "debugging" flags are -O0 and -g

12:02 <zid> -Og

12:02 <heat> -O0 is especially important when using clang

12:02 <zid> -O0 is useless when using gcc

12:03 <zid> it generates utterly trash code that's unreadable

12:03 <heat> -Og is a nop in clang :) and -O1 onwards gives you some fucked up debugging info sometimes

12:03 <zid> -fwhole-program gives the best debug info

12:03 <zid> main.LTO_FRAG_473

12:03 <zid> is your entire program

12:03 <heat> (I can't remember if -Og aliases to -O1 or -O0... maybe -O1?)

12:03 <heat> oh I ranted about that yesterday

12:04 <heat> EDK2's debug GCC builds use LTO for some reason

12:04 <zid> It makes sense though, given it basically destroys the program structure in the name of optimization

12:04 <heat> was trying to trace a crash and found a really flattened entry point with all sorts of crap there and almost no debug info

12:04 <heat> yay?

12:04 <zid> I'm not sure it's a solvable problem

12:04 <heat> it's not

12:04 <heat> just turn it off

12:05 <psykose> it's like cutting people up and putting them back together but expecting them to remain alive

12:05 <heat> anyhow, really good, useful debugging flags that require runtime support are -fsanitize=address (or kernel-address for kernels), -fsanitize=undefined, -fsanitize=memory, -fsanitize=thread(?), -fsanitize=scudo

12:05 <heat> all them sanitizers

12:06 <zid> scudo? isn't that a board game

12:06 <heat> it's also an allocator

12:06 <heat> also, no, that's skudo

12:06 <zid> Oh I was thinking of subuteo

12:06 <zid> or cluedo

12:07 <psykose> scudo doesn't even work on musl they should write better code

12:07 <heat> psykose, none of the sanitizers work on musl, or at least "officially"

12:07 nyah has joined #osdev

12:07 <j`ey> yes musl should write better code

12:07 <psykose> address/undefined do

12:07 <zid> nothing works in musl

12:07 <psykose> my favorite musl meme is the chromium partition alloc shim

12:07 <mjg> dafqu's scudo

12:07 <mjg> (i read csudo)

12:08 <j`ey> psykose: theres a musl meme there too? I thought that was just an m1 meme :P

12:08 <psykose> yea

12:08 <heat> mjg, scudo is an allocator that tries to provide more safety while being fast

12:08 <heat> for instance, headers are CRC32'd afaik

12:08 <psykose> the partition alloc init code calls pthread_atfork for some lock-related things

12:08 <mjg> wut

12:09 <psykose> except pthread_atfork calls malloc() on musl, so you get an infinite loop

12:09 <psykose> if you comment it out it works fine though

12:09 <heat> mjg, it's android and fuchsia's default malloc

12:09 <j`ey> nice

12:09 <psykose> yolo

12:09 <mjg> heat: well i would not call android fast :-P

12:09 <mjg> on a more seirous note, i'll check it out

12:10 <heat> you can run it as a sanitizer and as standalone

12:10 xenos1984 has quit [Read error: Connection reset by peer]

12:10 <heat> just LD_PRELOAD it afaik

12:10 <heat> i think google ditched some "high perf" allocator for it

12:10 <vai> take a cup of coffee, sit down, relax, and read.. this is going to take a while

12:11 <mjg> don't tell me what to do

12:11 <vai> do we do anything else than read?

12:11 <heat> oh yeah also forgot gwp-asan, which randomly samples your allocations and quarantines them

12:11 <heat> yes i write the codes

12:12 <vai> lots of thinking to do

12:13 <kof123> i deal with the customers so the engineers dont have to. i have people skills

12:13 <heat> ASAN helps you debug UAFs, out of bounds, basically all sorts of bad memory acceses

12:13 <heat> accesses*

12:13 <mjg> except when it is buggy

12:13 <heat> MSAN helps you debug uninitialized memory issue AFAIK (I've never used so...)

12:14 <heat> TSAN is for thread races

12:14 <mjg> freebsd has kmsan ported, but it osmetimes goes crazy and reports utter bs

12:14 <heat> GWP-ASAN and scudo are like very lightweight asans-replacements which are not nearly as effective as asan

12:14 <mjg> not only there was no undef access, the supposed callsite does not make sense (i.e. does not deref any memory)

12:14 <heat> UBSAN is for undefined behavior

12:15 <heat> there's also hwasan which is asan for platforms that support memory tagging

12:15 <heat> s/memory/address/

12:16 <heat> safestack and shadowstack help you not fuck over your stack in the case of stack overflows

12:16 <heat> and that's mainly it?

12:17 <heat> there's also gcov and sancov but those are for tracing

12:17 <heat> oh yeah linux also has kmemleak to detect memory leaks

12:18 <heat> it's crazy, it's a fucking garbage collector-like thing

12:19 [itchyjunk] has joined #osdev

12:22 <heat> general newbie kernel tips: map your things with the proper permissions, don't forget to set CR0.WP or your architectural equivalent

12:22 <heat> also build interrupt handlers that don't result in a crash-loop, or at least add exception handlers that just cli;hlt

12:23 <heat> you can use CFI directives to get a pretty stack trace from the exception handling code down to where it crashed, yadda yadda in gdb

12:23 <heat> if you're lacking debug tooling, build it

12:23 <mjg> solaris has ::findleaks in the debugger

12:24 <zid> and use -d nochain in qemu if you don't want the crash to not make sense in my case :p

12:24 <mjg> i don't know how it works though

12:24 <heat> i think any sort of findleaks will need to walk the stacks and the heap for references to objects

12:25 <heat> and mark-and-sweep it

12:25 <heat> i don't see how else you could do it

12:25 <zid> time for breakfast I think heat

12:25 <zid> what are you making

12:25 <heat> it's 1pm zid

12:26 <heat> i've already had breakfast

12:26 <zid> 1pm is best time for breakfast if you wake up at noon

12:26 <mjg> i'm way ahead of you

12:26 <mjg> 2 pm here

12:27 <heat> chad GMT/BST vs virgin CET/CEST

12:28 xenos1984 has joined #osdev

12:28 <mjg> real programmers are asleep at this time anyway

12:28 <heat> the PNW people?

12:28 <mjg> in their respective timezones

12:28 <mjg> you are not a programmer unless you sleep 9-5

12:28 * mjg used to do it

12:29 <heat> 9-5?

12:29 <mjg> from 9 am to 5 pm

12:29 <heat> 9am to 5pm or 9pm to 5am?

12:29 <heat> cringe

12:29 <mjg> i'm getting too old for that shit though

12:29 <heat> you have to wake up at 5am for that david goggins type of shit

12:30 <mjg> STAY HARD

12:30 <mjg> NOTHING GETS DONE BY BEING A BITCH

12:31 <heat> 05:10 run 20KM; 05:40: Write javascript; 06:00: Run 50KM; 06:30: write a new kernel; 07:00: Take some viagra so you can stay hard during the rest of the day

12:31 <heat> true chad pnw programmer

12:31 <mjg> gigachad programmer

12:32 <zid> fine I made my own breakfast

12:33 <mjg> are you breakfastshaming me for not having one?

12:33 <zid> No I am shaming heat for not making me any

12:33 <mjg> OK

12:34 <mjg> please continue

12:37 gildasio has joined #osdev

12:40 <zid> I have too much food but I didn't wanna leave like 5% of it in the fridge I'd do nothing of

12:40 <zid> with

12:44 <heat> vai, you can also set up clang-tidy to help you catch issues

12:55 gog has joined #osdev

13:10 <heat> oh yeah also fstack-protector

13:10 <heat> or fstack-protector-strong, or fstack-protector-all

13:10 <psykose> one day they'll just pick one

13:12 <mjg> fstack-protector-chad

13:20 gildasio has quit [Remote host closed the connection]

13:37 gildasio has joined #osdev

13:45 <mrvn> mjg: most important meal of the night

13:48 <mjg> :)

13:50 <mjg> hm i just wrote a 200 line diff and it compiled the first time

13:50 <mjg> not looking good for it working first time, is it

13:51 <GeDaMo> Even worse, it might /appear/ to work :P

13:51 <mjg> i take it!

13:53 srjek has joined #osdev

13:54 <mrvn> That's usually the difference between C and ocaml. C code often compiles the first time but doesn't work. ocaml code doesn't compile the first time. but when it does it usually works.

13:55 <mjg> well it booted

13:55 <mjg> i think that's sufficient testing

14:12 SGautam has joined #osdev

14:16 wootehfoot has joined #osdev

14:30 srjek has quit [Ping timeout: 244 seconds]

14:34 poyking16 has quit [Ping timeout: 252 seconds]

14:39 poyking16 has joined #osdev

14:41 Brnocrist has joined #osdev

14:48 poyking16 has quit [Ping timeout: 240 seconds]

14:51 poyking16 has joined #osdev

15:10 poyking16 has quit [Ping timeout: 245 seconds]

15:12 poyking16 has joined #osdev

15:19 zhiayang has quit [Quit: oof.]

15:21 zhiayang has joined #osdev

15:30 poyking16 has quit [Ping timeout: 268 seconds]

15:37 poyking16 has joined #osdev

15:39 gildasio has quit [Remote host closed the connection]

16:02 gog has quit [Ping timeout: 252 seconds]

16:04 vai has quit [Remote host closed the connection]

16:04 gog has joined #osdev

16:21 srjek has joined #osdev

16:23 SpikeHeron has quit [Quit: WeeChat 3.0]

16:27 SpikeHeron has joined #osdev

16:29 <zid> https://twitter.com/Azure_Husky/status/1555335719312392193?s=20&t=dUaRVr-hn7_fUCTGntue7g

16:29 <bslsk05> twitter: <Azure_Husky> Here's a common scam that is going around that you should know about: ␤ ␤ Sometimes cats will meow at you like they haven't been fed, but in fact someone DID feed them and they're just trying to get fed again

16:40 <gog> zid: meoww

16:55 <zid> Don't listen to her lies

16:56 <sbalmos> woof

16:58 matt__ has quit [Read error: Connection reset by peer]

17:00 <gog> meowwww

17:00 <gog> feed me

17:00 <gog> i did eat but that was 4 hours ago

17:01 <gog> and it was just a sandwich and some fruit

17:03 <zid> https://pbs.twimg.com/media/Dsfr_z-UcAAwfDs?format=jpg&name=900x900

17:15 <gog> I'm getting pizza later i think

17:29 <gog> maybe idk

17:29 <gog> the grocery store is gonna be closed by the time I leave

17:33 <mjg> there is a twiight zone episode where the earth starts moving towards the sun

17:33 <mjg> i think we are living it

17:33 bauen1 has quit [Ping timeout: 268 seconds]

17:35 <GeDaMo> https://en.wikipedia.org/wiki/The_Midnight_Sun_(The_Twilight_Zone)

17:35 <bslsk05> en.wikipedia.org: The Midnight Sun (The Twilight Zone) - Wikipedia

17:46 poyking16 has quit [Quit: WeeChat 3.6]

18:05 the_lanetly_052 has quit [Ping timeout: 240 seconds]

18:16 gildasio has joined #osdev

18:31 sprock has quit [Ping timeout: 268 seconds]

18:49 bauen1 has joined #osdev

18:58 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

19:07 matt__ has joined #osdev

19:22 gildasio has quit [Remote host closed the connection]

19:22 dh` has quit [Quit: brb]

19:23 gildasio has joined #osdev

19:35 dh` has joined #osdev

19:39 sprock has joined #osdev

19:40 dh` has quit [Ping timeout: 245 seconds]

19:47 Vercas6 has quit [Remote host closed the connection]

19:47 gxt___ has quit [Remote host closed the connection]

19:48 Vercas6 has joined #osdev

19:48 gxt___ has joined #osdev

19:51 wootehfoot has quit [Quit: Leaving]

19:56 matt__ is now known as freakazoid333

19:56 GeDaMo has quit [Quit: A program is just a bunch of functions in a trenchcoat.]

19:57 wootehfoot has joined #osdev

20:01 SGautam has quit [Quit: Connection closed for inactivity]

20:04 gxt___ has quit [Ping timeout: 268 seconds]

20:05 gxt___ has joined #osdev

20:07 gxt___ has quit [Remote host closed the connection]

20:07 gxt___ has joined #osdev

20:12 bauen1 has quit [Ping timeout: 252 seconds]

20:12 gog has quit [Read error: Connection reset by peer]

20:14 bauen1 has joined #osdev

20:16 poyking16 has joined #osdev

20:16 poyking16 has quit [Client Quit]

20:20 gog` has joined #osdev

20:29 gog` is now known as gog

20:32 <heat> geist, why is fuchsia's arm64 trap code so confusing?

20:33 <zid> working as intended, it caught heat

20:33 <heat> it seems to have a whole lot of paths

20:34 <heat> I would expect the basic path to be: switch stacks (?), save registers, branch to C++, load registers, switch stacks, eret

20:34 <heat> (right now it's unclear to me where that switch stacks step is done)

20:44 <Griwes> Hmm, I think I need to actually write process and thread teardown code to proceed with what I've been doing. I've been putting this off for a while now :'D

21:23 socksonme_ has quit [Ping timeout: 245 seconds]

21:37 <mjg> hrm is there an amd64 cpu which does not support sse3?

21:38 <heat> yes

21:38 <heat> it's not a base ISA feature

21:39 <mjg> :sadface:

21:40 <heat> sadge

21:41 <heat> you know what also isn't but is way cooler?

21:41 <heat> tlbsync and invlpgb

21:41 <heat> who tf needs sse3 after that

21:42 <zid> I'm not sure if you can *find* one of those cpus

21:42 <zid> but in theory they are allowed

21:42 <zid> like, `core` only existed in a few thousand laptops total

21:42 <zid> before core2 replaced it and sold hundreds of millions of chips

21:44 <zid> but we have to make allowances for it

21:45 <heat> and the k8 too

21:46 <zid> I thought k8 was at least popular

21:46 <zid> the x2 and fx and sempron and opteron and turion were all k8 weren't they

21:46 <heat> possibly

21:47 <heat> just adding it as one of the sse3-less cpus

21:47 <heat> at least according to gcc

21:48 <zid> yea amd is built different

21:48 <zid> aka since 3dnow intel has added all the simd extensions first and amd has gotten them later

21:51 <heat> 3dnow is amd's mmx but worse

21:51 <heat> since at least mmx is used once during boot

21:57 <zid> I've never used or seen 3dnow :(

21:57 nyah has quit [Quit: leaving]

21:59 <mjg> 3dbie

21:59 <mjg> 3dbye even

22:01 dh` has joined #osdev

22:02 <heat> 3dlater

22:13 carbonfiber has joined #osdev

22:15 <geist> heat: i think it needs a bit of a cleanup (re: the arm64 dispatch path) but the primary reason is the N entry points, and trying to optimize them individually

22:16 <geist> hence lots of macros that stamp out things

22:16 leah_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

22:16 <geist> and then there's a fastpath for teh syscall mechanism, which is also mostly implemented in the exceptions.S code

22:16 <geist> i have a TODO to go in there and say fuck it with the whole 'try to fit the first part of the exception handler in the 0x80 bytes you get per exception

22:16 <geist> and just build a table that branches to the real implementations

22:16 leah_ has joined #osdev

22:17 <geist> that will reduce some of the confusion, at the expense of not fully utilizing the space in the exception table

22:17 <geist> that seems to be what most Big OSes do

22:17 <carbonfiber> Is the default PIO mode for devices specified anywhere? I can't find it in the ATA standard. How is a host able to use the Identify Device command if there isn't a default PIO mode specified in the standard?

22:17 <geist> heat: but yeah you have identified that the complexity has reached a point as to make it hard to read

22:18 <geist> also FWIW a lot of the complexium has to deal with only reloading some regs some times (x18, x20, etc) and dealing with debug instrumentation

22:22 <geist> heat: re: switching stacks zircon currently uses the SP_ELx mode, so there's no real stack switching necessarily. before you exit you simply restore SP_EL0 before eretting

22:22 <geist> that's done.... https://fuchsia.googlesource.com/fuchsia/+/refs/heads/main/zircon/kernel/arch/arm64/exceptions.S#78

22:22 <bslsk05> fuchsia.googlesource.com: zircon/kernel/arch/arm64/exceptions.S - fuchsia - Git at Google

22:23 <geist> i think the regsave/regrestore macros are fairly easy to follow, its precisely where they get stamped out and how many instances of it is the complexity

22:36 froggey has joined #osdev

22:36 dude12312414 has joined #osdev

22:37 dequbed has quit [Quit: bye!]

22:40 dequbed has joined #osdev

22:52 dude12312414 has quit [Remote host closed the connection]

22:53 dude12312414 has joined #osdev

22:54 opal has quit [Write error: Connection reset by peer]

22:54 Vercas6 has quit [Read error: Connection reset by peer]

22:54 dude12312414 has quit [Write error: Connection reset by peer]

22:54 gildasio has quit [Write error: Connection reset by peer]

22:55 opal has joined #osdev

22:55 dude12312414 has joined #osdev

22:55 gildasio has joined #osdev

22:55 Vercas6 has joined #osdev

23:11 <heat> geist, yeah

23:12 <heat> probably a good chunk of me not being able to read it is that I'm still learning how arm64 works

23:12 <heat> but it definitely seems confusing anyway

23:12 <heat> what's x20 for? safestack or so?

23:15 <heat> ah SP_EL0 is cool

23:16 <heat> what exactly do I need to save? x0-31, sp, pc, pstate

23:16 <heat> am I missing something?

23:16 <heat> this looks like a reasonable stack frame but linux has something way more complex, as usual

23:17 <j`ey> elr

23:17 <heat> i think that fits into "pc"?

23:17 <j`ey> oh sorry, yeah

23:18 <heat> https://fuchsia.googlesource.com/fuchsia/+/refs/heads/main/zircon/kernel/arch/arm64/include/arch/regs.h#24

23:18 <bslsk05> fuchsia.googlesource.com: zircon/kernel/arch/arm64/include/arch/regs.h - fuchsia - Git at Google

23:18 <heat> https://elixir.bootlin.com/linux/latest/source/arch/arm64/include/asm/ptrace.h#L178

23:18 <bslsk05> elixir.bootlin.com: ptrace.h - arch/arm64/include/asm/ptrace.h - Linux source code (v5.19) - Bootlin

23:19 <heat> linux doesn't seem to agree but who the fuck am I to be able to understand this

23:20 <heat> as an aside, something I've never really understood in linux is the orig_eax/orig_x0/... whatever ordeal

23:25 <geist> that's *probably* so that they can restart a syscall

23:25 <geist> even if they've already modified the return rax or whatnot

23:26 <geist> what part of it doesn't seem to agree?

23:28 <heat> they have different members in the struct

23:28 <geist> they have some additional things, presumably for their purposes, but the base stuff is the same

23:28 <geist> just different names

23:29 <heat> ah I think I see it

23:29 <heat> lr and usp are just x[] registers

23:29 <heat> elr is pc

23:29 <geist> lr is x30 yes, usp is a special case

23:29 <heat> spsr is pstatus

23:30 <geist> elr is pc, spsr.. yep

23:30 <geist> er yeah lr is elr

23:30 <geist> right

23:30 <heat> isn't usp that register that is both xzr and sp?

23:31 <geist> kinda but that's just an encoding

23:31 <geist> SP is a special register

23:31 <heat> why?

23:31 <geist> because it is

23:31 <heat> because of the alignment requirements and whatnot?

23:31 <geist> architectural decisions with arm64

23:31 <geist> they decided to make it a special case

23:32 <heat> special case of what?

23:32 <geist> presumably much easier to build as a high performance design if its not a general purpose register

23:32 <geist> special case as in it's not part of the normal register set

23:32 wootehfoot has quit [Quit: Leaving]

23:32 <geist> and you can't do all normal operations against it

23:33 <heat> ah yeah sure

23:33 <geist> in certain cases it's *encoded* as the same as xzr

23:33 <geist> but that's an implementatino detail of the ISA encoding

23:37 <geist> both PC and SP being non regular integer registers is one of the key differences between arm32 and arm64

23:37 <geist> primarly to make the design simpler. PC being a normal register in arm32 is really nifty but its hard to build high speed superscalar designs out of that, so i have read

23:42 <heat> why?

23:42 <geist> less special cases

23:42 <heat> why isn't mov $func, %pc just a jmp?

23:42 <geist> it is. it was, in arm32

23:43 <geist> but now that means the decoder has to look for that, and you can do lots of more exotic ways to get a branch out of the ISA

23:43 <geist> so it's harder to find and decode all those special cases

23:43 <geist> much simpler if there's effectively one or two ways to branch, makes the branch predictors life far simpler