#osdev on 2023-03-23 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:26 <geist> reminds me of the hackaday thing recently with the 486 on a breadboard

00:26 <geist> https://hackaday.com/2023/03/18/its-a-486-computer-on-a-breadboard/ makes me wanna build a similar thing

00:26 <bslsk05> hackaday.com: It’s A 486 Computer, On A Breadboard | Hackaday

00:26 gog has quit [Ping timeout: 268 seconds]

00:27 <geist> i started to design something kinda similar to a 68030 i have floating around, but should consider adding more support chips onboard like that one does

00:31 [itchyjunk] has quit [Ping timeout: 240 seconds]

00:33 <zid> yea it'd be rad to get an old cpu, but one that isn't typically made into a home computer like a z80 or 6502

00:33 <zid> and hook it up

00:33 <zid> 486 is a good pick

00:36 <geist> yah by 486 or 030 era it starts to get fairly complicated in that there are a fair amount of control bits and whatnot, but its still mostly just an A + D bus

00:37 <geist> i think after that it starts getting pretty difficult to deal with the state without an ASIC level circuit to decode things

00:37 <linearcannon_> maybe not ASIC, but at least FPGA

00:37 <linearcannon_> i'm pretty sure you could manage with an FPGA all the way up to P3 era, at least

00:38 <linearcannon_> with modern FPGAs

00:39 <jimbzy> http://www.chrisfenton.com/the-zedripper-part-1/

00:39 <bslsk05> www.chrisfenton.com: The ZedRipper: Part 1 – chrisfenton.com

00:43 <bnchs> 68030!

00:43 <bnchs> my fav, m68k with 32-bit addressing!

00:46 [itchyjunk] has joined #osdev

00:46 gbowne1 has quit [Ping timeout: 255 seconds]

00:53 gbowne1 has joined #osdev

00:55 nyah has quit [Quit: leaving]

00:55 <geist> sure, that's what i mean by ASIC level. ie probably somewhat more compliated than a big pile of TTL logic, or even a simple CPLD

00:55 <zid> I imagine the main issue is just.. they don't work on a breadboard

00:55 <linearcannon_> yeah

00:55 <zid> plus it becomes near impossible to debug outside of a sim

00:55 <zid> who wants to debug hypertransport using blinking LEDs

00:55 <zid> plus more packet based stuff not paralrlalel

00:55 <linearcannon_> i've actually thought about trying to design a PPro/P2/P3 chipset on an fpga

00:56 <linearcannon_> but that requires manufacturing

00:56 linearcannon_ is now known as linearcannon

01:04 [itchyjunk] has quit [Read error: Connection reset by peer]

01:04 <heat> ok, stupid c question

01:04 <zid> I'm good at those

01:04 <heat> lets assume we have a struct something_ops { void (*stuff)(void); };

01:05 <zid> pick me

01:05 <heat> if struct something_ops *s = ...;, is s->stuff() the same as (*s->stuff())?

01:06 <zid> no

01:06 <zid> you can tell, because one has a dereference and the other has two

01:06 <moon-child> do you mean (*s->stuff)()?

01:06 <heat> ah yes

01:06 <heat> my bad

01:06 <moon-child> yeah, former is syntax sugar for the latter

01:06 <heat> oh cool

01:06 <heat> how recent is the former?

01:07 <moon-child> I think you can do it pretty much everywhere

01:07 <heat> this svr4 book seems to do the latter

01:07 <moon-child> pre-ansi c, idk, but pretty sure it's in standard c

01:09 <heat> hmm i guess good chunks of the svr4 codebase predate c89

01:10 <zid> https://godbolt.org/z/ddE9jK4ah

01:10 <bslsk05> godbolt.org: Compiler Explorer

01:10 <zid> * just does nothing to functions

01:11 <heat> huh

01:11 <heat> bizarre

01:11 <zid> dereferencing a function has no meaning though

01:11 <heat> not that I would expect some other behavior except maybe erroring out

01:12 <zid> they just decided it gives you back a function pointer, rather than an error, yea

01:12 <heat> i guess the semantics of function vs function pointer are all pretty loose

01:13 <heat> like funcptr = g; or funcptr = &g; doing the same thing

01:13 <moon-child> in particular a function isn't an object

01:13 <moon-child> so the implementation can kinda make it be whatever it wants

01:13 <zid> It stops you having to put *f everywhere if 'f' is also 'dereference yourself and give your own pointer'

01:13 <moon-child> in wasm apparently a function pointer is an index into a big global array of functions

01:13 <zid> which also causes the ******f thing to work

01:15 <zid> they have a special class of grammar called 'function designators' if you wanna look it up

01:16 <zid> The unary * operator denotes indirection. If the operand points to a function, the result is a function designator

01:16 <geist> i wonder if *funcpointer() is really trying to dereferece whatever the function returns

01:16 <zid> being the *f part

01:16 <zid> geist: yea that's why you need the ()

01:16 <zid> f() is shorthand for (*f)();

01:16 <zid> not *f()

01:17 <zid> a function designator with type ‘‘function returning type’’ is converted to an expression that has type ‘‘pointer to function returning type’’

01:17 <zid> which is sort of recursive and weird, if you * a fp you get a function designator, and a function designator is 'a pointer to function'

01:18 <zid> so it just decays ten times if you do ten *

01:18 <zid> then has a dangling 11th conversion that () eats

01:18 <geist> OCMPUERS!

01:19 <heat> take this rust fanboys

01:19 <heat> C!!!!!!!!!!!!!

01:19 <zid> They could have just said that function objects can't be evaluated, and it would have all errored instead

01:20 genpaku has quit [Remote host closed the connection]

01:38 <moon-child> they could also have had bounds checking

01:38 <moon-child> and garbage collection

01:39 <moon-child> but instead they had buffer overflows and use-after-frees

01:39 <moon-child> I don't think these c-specifying-thingy-people are very smart

01:39 <FireFly> function pointers and object pointers also potentially might not be comparable or castable to one another, which gets fun with void pointers, IIRC

01:40 <FireFly> unless that was changed in a more recent C spec..

01:41 <zid> yea dlsym is illegal

01:42 <zid> I think function pointers are allowed to go back and forth again same as void * pointers

01:42 <zid> as long as you don't try to do anything with the fake

01:42 <moon-child> so

01:42 zxrom has left #osdev [Leaving]

01:42 <moon-child> the c standard doesn't guarantee that you're allowed to go back and forth between function pointers and void pointers

01:42 <moon-child> but it doesn't say you _can't_ either

01:42 <moon-child> so it's ok for posix to mandate that you can

01:42 <moon-child> so dlsym is legal

01:43 <mjg> the only fhing guarnateed by the c standard is that you are going to get shafted by it

01:43 <mjg> thing

01:44 <moon-child> indeed

01:45 <mjg> little known fun fact: the official standard pdf has a hidden message embedded in it

01:45 <mjg> which says: LOL

01:54 catern- is now known as catern

02:02 <geist> c++ of course has the whole virtual method pointer thing which is defintely not compatible with void *

02:10 <mjg> is there a public regression testing for the scheduler?

02:10 <mjg> not asking about some adhoc benchez

02:11 <moon-child> 'the' scheduler?

02:12 <mjg> THE motherfucker

02:12 <heat> mjg, what would be a regression for the scheduler

02:13 <zid> it stops scheduling

02:13 <zid> duh

02:13 <mjg> lol u srs

02:13 <mjg> here is an example

02:14 <mjg> https://marc.info/?l=freebsd-hackers&m=167951299918135&w=2

02:14 <bslsk05> marc.info: 'Re: Periodic rant about SCHED_ULE' - MARC

02:14 <mjg> the diea would be to fikkz in $magic manner, but then what about other cases

02:15 <heat> okay so

02:15 <heat> i'm not sure if you've heard of "real workloads" before

02:15 <heat> it's a new bench

02:15 <heat> essentially you run that and see if you regress

02:16 <mjg> cool story

02:16 <mjg> so

02:16 <heat> in all honesty it's probably the best you'll get for a scheduler m8

02:17 <mjg> bitch plz

02:17 <heat> you can't write some weirdly synthetic shit and say it's better or worse

02:17 <Mutabah> mjg: Mind cooling the language?

02:17 <mjg> there is tons of funny corner cases you can't hope to cover by running REAL WORKLOAD by yourself

02:17 <mjg> Mutabah: sure

02:17 <mjg> i know linux has a bunch of tests, but i don't know of anything comprehensive

02:17 <Mutabah> Also - I'm guessing you're talking about "the _linux_ scheduler" here

02:17 <heat> no

02:18 <Mutabah> Oh, hey, it's FreeBSD

02:18 <zid> You can tell that guy's insane because he's writing example code in fortran

02:18 <mjg> no, but i do suspect if there is a good test suite, it is for the linux one :)

02:18 <zid> and writing csh scripts

02:18 <Mutabah> Always worth clarifying

02:18 <mjg> zid: mind cooling the language

02:18 <mjg> Mutabah: now this much is implied mate :>

02:18 <zid> what language

02:18 <zid> is fortran a banned word in poland

02:18 <zid> sorry I said it again

02:19 <mjg> general question stands though

02:19 <heat> f*rtr*n

02:19 <heat> sorry, fartrun

02:19 <heat> my IRC client auto-censors

02:20 <zid> Imagine posting a bug report to lkml with an example program, but you did it in algol and /bin/ksh

02:20 <mjg> do you remember the 'decade of wasted cores' paper?

02:21 <mjg> i had a look at it due to the above problem and left rarther disappointed

02:21 <mjg> interestingly next to not traffic on lkml either

02:29 <heat> anyway what kind of corner cases do you think you can't find by running a real workload

02:29 <heat> and why do those matter?

02:30 <mjg> for example the above is a case where there is sightly more workers than cores, *all* cpu bound

02:30 <mjg> my usual workload is *not* like that whatsoever

02:30 <mjg> here is another one

02:30 <mjg> dude has n threads all with nice 20 and did make -j

02:30 <heat> there are plenty of workloads with that shit

02:30 <mjg> without noitice

02:31 <mjg> nice

02:31 <zid> almost all interesting workloads are that

02:31 <mjg> ffs people

02:31 <zid> if they weren't cpu bound you wouldn't have a scheduling problem to begin with

02:31 <heat> i've seen servers that had 2000 threads and like 70 cores

02:31 <zid> "Scheduler works perfect if it's at 1% cpu load!", no shit?

02:31 <heat> (servers as in web server, not hardware server)

02:31 <mjg> there is tons of congiguratins one needs to check and doing it by hand by running some workloads is not going to uct it

02:31 <zid> mjg: There are like, three configurations anyone *cares* about, however.

02:32 <mjg> hence i'm looking for something comperhensive which i can just run and which will cover typical stuff

02:32 <zid> So you can focus your tests on those and hit almost all actual users

02:32 <mjg> what those might be

02:32 <heat> welp i'm not telling you to run every workload by hand but the main thing would be running a couple of synthetic workloads of that shit and then if you have bad regressions someone complains and you take a look

02:32 <zid> big iron, interactive desktop running prime95, no load

02:32 <mjg> heat: i'm trying to do comperhensive job here dog

02:32 <mjg> not just a bunch of stuff by hand

02:33 <heat> but there's no compreehensive job to be done here

02:33 <mjg> that's liek the last resort

02:33 <mjg> sure there is

02:33 <heat> you can test throughput of something, you can test the responsiveness

02:33 <heat> but those will still be ultra specific cases where the scheduler responded based on *your system*

02:33 <mjg> i can run on more systems

02:34 <zid> scheduling is also chaotic, which helps a bunch..

02:34 <mjg> https://lore.kernel.org/lkml/20191021075038.GA27361@gmail.com/ rather primitive testing i have to say

02:34 <bslsk05> lore.kernel.org: Re: [PATCH v4 00/10] sched/fair: rework the CFS load balance - Ingo Molnar

02:35 <heat> the stupid throughput test is a kernel build, the responsiveness test would be like testing both uncontended and contended CPU load and seeing how long until the thread gets rescheduled back or something

02:35 <mjg> does not bode well for existence of a suite

02:35 <heat> but all of these are stupid because the scheduler is supposed to adapt based on your system or system load or whatever its doing or how things interact with each other

02:35 <zid> You can write a cool synthetic benchmark, but different threads will get different distances at different times (irqs, cache, etc) and then everything ends up on a different core between runs etc and comparisons get unruly.

02:35 <mjg> huh

02:35 <mjg> > A full run on Mel Gorman's magic scalability test-suite would be super

02:35 <mjg> useful ...

02:35 <mjg> :]

02:36 <heat> https://github.com/linux-test-project/ltp/tree/master/testcases/kernel/sched

02:36 <bslsk05> github.com: ltp/testcases/kernel/sched at master · linux-test-project/ltp · GitHub

02:36 <heat> see if any of these work for ya

02:36 <mjg> zid: see my aforementioned remark about having something worked which takes care of most real settings, which i would expect to already exist

02:36 <zid> Plus the final answer is *always* going to be "it depends"

02:36 <zid> How much slower do I want prime95 to be if it stops my mouse lagging as much?

02:37 <mjg> https://github.com/gormanm/mmtests

02:37 <bslsk05> gormanm/mmtests - MMTests: Benchmarking framework primarily aimed at Linux kernel testing (84 forks/189 stargazers/GPL-2.0)

02:37 <zid> Then you're just tuning bias values until someone shows up in a few years, tells everybody the entire thing is trash and does't work, and replaces it with something else that needs tuning :D

02:38 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

02:40 <heat> i agree

02:40 <heat> expecting objective performance testing esp "realistic" one is super unrealistic

02:42 d34d1457 has quit [Read error: Connection reset by peer]

02:42 <zid> Best you can do is a benevolent dictator scheduler that 'knows best' and doesn't support weird loads

02:42 <zid> but handles big iron and interactive desktop gracefully

03:57 <kof123> eh, mechanism not policy. benevolent dictator that gracefully steps down when proverbial gun held to his head, and gracefully retakes power when their services are required again

04:02 slidercrank has quit [Ping timeout: 240 seconds]

04:06 bradd has joined #osdev

05:12 <mjg> quite frankly responses here are most bizzare

05:14 <mjg> for starters the test suite at hand may have an array of actual workloads to test, each of which stresses different parts of scheduling

05:14 <mjg> and where it is known that optimization for one case easily causes degradation for another

05:14 <kof123> well im not qualified. my answer is just "give me lots of switches to toggle if need be"

05:14 <mjg> having a collection of the sort lets one make at least somewhat informed choice regarding patchen'

05:15 <kof123> i dont disagree with your assessment on test suite

05:16 <moon-child> mjg: it almost feels like you'd want to somehow mock the scheduler's view of the system state

05:16 <moon-child> which obviously doesn't account for imprecision in modeling the system state

05:16 <moon-child> but does allow you to very precisely characterise the scheduler's behaviour

05:16 <mjg> this is assuming i want to run a bunch of few line c progs

05:16 <mjg> which i don't

05:16 <mjg> again what's going on here

05:16 <mjg> twilight zone

05:30 slidercrank has joined #osdev

05:45 catern has quit [Ping timeout: 250 seconds]

05:52 frkzoid has quit [Ping timeout: 252 seconds]

05:55 <geist> hmm, frown. looks like most of the rv64 implementations i've seen *dont* do misaligned accesses

05:55 <geist> but appear to be handled in firmware. transparently

05:55 <geist> unclear if it's going to linux, or to opensbi

05:55 <geist> a simple test is write to a unaligned address 10 million times: 4.5s, aligned .029s

05:56 <geist> so clearly something is trapping an emulating it (this is on linux)

05:57 <moon-child> opensbi?

05:57 <geist> yah that's what i'm wondering

05:57 <moon-child> no I mean, what's opensbi?

05:57 <geist> oh it's the machine mode firmware that handles some low level details, even to linux kernel

05:57 <Mutabah> A common firmware

05:58 <Mutabah> think system management mode

05:58 * moon-child nods

05:58 <moon-child> what's the riscv architectural stance on unaligned accesses?

05:58 <geist> the stance seems to be that user code can assume it works, and compiler can generate code accordingly

05:58 <geist> but, it's allowed for the hardware to not support it and have it trapped and emulated in firmware

05:58 <geist> which IMO is worse, because then you can just transparently have code that runs orders of magnitude slower

05:59 <moon-child> hrm

05:59 <moon-child> that's unfortunate

06:00 <geist> it avoids having two different sets of incompatible binaries i guess

06:00 <moon-child> oh yes, risc 'extension hell' v wants to avoid having incompatible binaries

06:00 <geist> and means you should generally avoid unaligned accesses like the plague i guess

06:00 <moon-child> very consistent

06:00 <moon-child> :)

06:00 <geist> word

06:03 <geist> since the mdeleg register exists, it's entirely possible for machine mode code (where SBI runs) to trap all unaligned accesses first, and if they deal with it they can transparently do it behind the kernel's back

06:03 <geist> though there is some comp[lexity as to how the machine mode code can access memory through supervisor paging. but i think there's a mechanism for that

06:18 <geist> ah i see: mstatus.MPRV instruction lets machine mode temporarily operate as if it were at a lower priviledge mode

06:18 <geist> while the bit is set load/stores act as if they were in mstatus.mpp priviledge level

06:18 <geist> where mstatus.mpp is the saved priviledge level the cpu came from in the last exception

06:19 <geist> so yeah sure enough there's some fairly convuluted code in opensbi to trap exceptions from user and supervisor mode and it'll transparently attempt to emulate the unaligned access

06:19 <geist> https://github.com/riscv-software-src/opensbi/blob/master/lib/sbi/sbi_misaligned_ldst.c

06:19 <bslsk05> github.com: opensbi/sbi_misaligned_ldst.c at master · riscv-software-src/opensbi · GitHub

06:20 <geist> with the generated routines to emulate load/stores from lower levels punched out in https://github.com/riscv-software-src/opensbi/blob/master/lib/sbi/sbi_unpriv.c

06:20 <bslsk05> github.com: opensbi/sbi_unpriv.c at master · riscv-software-src/opensbi · GitHub

06:20 <geist> gosh it'd be helpful if any of this code included like one frickin comment

06:20 <geist> like just some sort of hint as to what the fuck it's doing

06:21 <geist> took me staring at it for 30 minutes to figure out what's goin on

06:22 <moon-child> comments are for quiche-eaters

06:22 <kof123> freebsd would spit something on the console for alpha "unaligned accesss @ 0xdeadbeef" IIRC

06:23 <geist> yah in this case the higher level firmware in machine mode is just trapping it without any ability for supervisor mode code (freebsd, linux, etc) to intercept

06:23 <geist> it would be kinda nice to just let you take into your own hands this stuff, so you can decide to not let code do it, or at least get some sort of count of the number of times it's been fixed up

06:24 bgs has joined #osdev

06:25 <moon-child> do riscv cpus have performance counters?

06:28 <geist> yah that's my guess how you're supposed to figure it out. there's a SBI extension to get access to perf counters, so probably you enumerate and watch the umber of unaligned traps

06:30 gbowne1 has quit [Quit: Leaving]

06:39 asarandi has joined #osdev

06:48 <geist> anyway, interesting if nothing else

06:49 <geist> always fun to figure out how the sausage is made

06:49 <geist> the whole machine mode trapping and reflecting exception model is pretty simple and powerful, and SBI uses it to great effect

07:17 MarchHare has quit [Ping timeout: 256 seconds]

07:19 MarchHare has joined #osdev

07:39 aejsmith has quit [Remote host closed the connection]

07:43 aejsmith has joined #osdev

07:57 catern has joined #osdev

08:00 gxt__ has quit [Remote host closed the connection]

08:00 gxt__ has joined #osdev

08:48 bnchs has quit [Remote host closed the connection]

09:04 smpl has joined #osdev

09:14 gog has joined #osdev

09:16 gog has quit [Client Quit]

09:35 gog has joined #osdev

10:12 GeDaMo has joined #osdev

10:17 morgan has quit [Read error: Connection reset by peer]

10:22 morgan has joined #osdev

10:22 \Test_User has quit [Ping timeout: 246 seconds]

10:24 \Test_User has joined #osdev

12:39 danilogondolfo has joined #osdev

13:33 dude12312414 has joined #osdev

13:37 Left_Turn has joined #osdev

13:55 Left_Turn has quit [Remote host closed the connection]

13:56 gabi-250_ has quit [Remote host closed the connection]

14:00 gabi-250_ has joined #osdev

14:05 danilogondolfo has quit [Ping timeout: 265 seconds]

14:06 danilogondolfo has joined #osdev

14:14 smpl has quit [Ping timeout: 248 seconds]

14:22 danilogondolfo has quit [Ping timeout: 265 seconds]

14:36 sinvet has joined #osdev

14:46 d34d1457 has joined #osdev

14:52 Left_Turn has joined #osdev

15:01 danilogondolfo has joined #osdev

15:02 freakazoid332 has joined #osdev

15:03 danilogondolfo has quit [Max SendQ exceeded]

15:04 danilogondolfo has joined #osdev

15:06 danilogondolfo has quit [Max SendQ exceeded]

15:06 danilogondolfo has joined #osdev

15:07 slidercrank has quit [Ping timeout: 250 seconds]

15:17 d34d1457 has quit [Read error: Connection reset by peer]

15:47 danilogondolfo has quit [Quit: Leaving]

15:49 rnicholl1 has joined #osdev

16:01 rnicholl1 has quit [Quit: My laptop has gone to sleep.]

16:03 danilogondolfo has joined #osdev

16:12 <heat> geist, fyi that's disgusting

16:13 <heat> the trapping into fw for unaligned stuff bit

16:29 * gog traps into firmware

16:40 Ali_A has joined #osdev

16:41 Ali_A has quit [Client Quit]

16:42 Ali_A has joined #osdev

17:40 <heat> noooooooooo gog dont trap into firmware

17:41 <heat> that's a bad idea you'll hit firmware bugs!

17:47 gog has quit [Quit: Konversation terminated!]

17:55 <Ermine> wdym

17:56 <heat> wdym wdym

17:57 <Ermine> trapping into fi

17:57 <Ermine> firmware

17:58 Ali_A has quit [Quit: Client closed]

17:59 <heat> welp in this case geist was saying that riscv either handles unaligned loads and stores directly or traps into firmware

18:00 <heat> and he was saying that most/all riscv CPUs currently do not handle it directly but rather the firmware does it for them

18:00 <heat> which is severely wrong IMO

18:01 <geist> yah they were clearly trying to make it so that the unaligned problem is not for regular code to worry about, except it kinda is

18:01 <geist> because it runs so slowly obviously you have to deal with it

18:01 * roan traps into firmware

18:02 <roan> we should really do something about this

18:02 <nortti> is there any benefit to trapping to fw over to kernel?

18:02 <geist> stay aligned!

18:02 <geist> not particularly. the kernel could do it more efficiently, but in this case openSBI hard sets the mdeleg bit so that the kernel has no opportunity to override it

18:02 <heat> nortti, works transparently for everyone I guess

18:03 qubasa has quit [Remote host closed the connection]

18:03 <heat> geist, has no ISA extension mandated hardware unaligned accesses or something?

18:04 <heat> cuz, you know, this seems like a big problem

18:04 <zid> can it.. load bytes?

18:04 <geist> possible. like the spec says it's valid for hardware to deal with unaligned natively

18:04 <zid> or is the problem that it's 32bit load/store only

18:04 <heat> zid, yeah it can

18:04 <geist> but if it isn't then firmware must transparently do it, so that you dont have to worry about it at user space level

18:04 <zid> so you can at least write code that doesn't need it then, same as the C model

18:04 <zid> I'd prefer it crashed, tbh

18:04 <heat> it's like x86 with #AC but the firmware handles your exception and patches it up

18:05 <heat> yes, same

18:05 <geist> but then clearly you want to avoid unaligned accesses because it's slower. similar to x86 and arm64 where it's generally not a good idea for performance reasons, all else held equal

18:05 <heat> hm?

18:05 <geist> right, my main complaint is exactly that, i can't make it crash if i wanted to

18:05 <heat> x86 unaligned accesses are very close to ideal

18:05 <geist> sure

18:06 <geist> but it's an implementation detail. hardware deals with it, but there have been periods of times with various x86 microarchitectures where it was more of a hit

18:06 <heat> in fact that whole overlapping copy stuff is all based on this

18:06 <geist> much like someone could make a riscv core that deals with it with very little to no cost

18:06 <heat> or all the memcpy implementations really

18:06 <geist> that's precisely where i got started on it, wanted to sit down to build a new asm memcpy

18:06 <geist> and was going to rely on the 'align to dest, dont worry about src' memcpy strategy

18:07 <geist> but oh no, you better deal with both here

18:07 <geist> glad i tested this first

18:07 <heat> it's super possible that the best way to do this atm is like glibc

18:07 <heat> in C too

18:07 <geist> i was basically just going to implement that logic in asm

18:08 <geist> main reason being that i want to force it to use a fixed set of registers, so i can safely use it in multiple asm contexts

18:08 <geist> ie, user copy, memcpy, etc

18:08 <heat> hmm yeah

18:09 <heat> wait does your user copy need to be in asm too?

18:09 <geist> right now there's a silly problem in zircon where i dont know what registers it uses, so it's a full function call inside the guts of the user copy routine, so i have to basically hand roll a full setjmp/longjmp for the error case

18:09 <geist> it's basically 'free' to do that if its known that memcpy only uses callee trashed regs

18:10 <heat> https://github.com/heatd/Onyx/blob/master/kernel/arch/riscv64/usercopy.cpp#L89

18:10 <bslsk05> github.com: Onyx/usercopy.cpp at master · heatd/Onyx · GitHub

18:10 <geist> that's the strategy we do on arm64. the user copy 'set the recovery pointer' mechanism is implemented as an asm wrapper that sets things up, calls into memcpy (which is implemented in asm), and thus can undo things

18:10 <heat> asm goto baybeh!

18:10 <geist> but that works because the memcy routine on arm is explicitly written to never touch the stack or any saved regs, so it can be safely 'branched out of' the middle of it

18:10 <geist> i basically wanted the same thing on RV

18:11 <geist> and there's enough registers that it's basically possible, sinc eyou h ave a0-a6 + t0-t6 to work with

18:11 <geist> a7 even

18:12 <heat> yeah

18:12 <heat> tbf you could make it work using the frame register right?

18:13 <heat> ah wait no, saved regs

18:13 <heat> yuck

18:13 <geist> yah, if it pushes any saved regs you can't recover them

18:13 <geist> so at the moment i have a total hack that just pushes s0-s3 locally, saves sp locally, then sts it up

18:13 <geist> and i know that the memcy implementation actually only fiddles with s0-s2, so s3 is enough of an anchor to get it back

18:14 <geist> but clearly that's Bad becaus eyoure relying on the compiler to generate code in a particular way

18:14 <geist> nice thing is it'll instantly fail a unit test if the compiler tries anything else. so it's good for like, this week

18:15 <geist> so what'd be ideal is to have a macro with a memcpy implementation to stamp out, that i just put inline

18:15 <geist> or... a memcpy function that is known to just use a and t registers so it's safe to call it, either way

18:18 gog has joined #osdev

18:19 <gog> mew

18:29 Ali_A has joined #osdev

18:41 * geist pets gog

18:42 * gog prr

18:42 danilogondolfo has quit [Ping timeout: 255 seconds]

18:44 <Ermine> gog: may I pet you too?

18:44 <gog> yes

18:44 * Ermine pets gpg

18:44 <Ermine> LOOOOL

18:44 * Ermine pets gog

18:44 <geist> it's nice innit?

18:44 <heat> gog geist needs your help writing a riscv memcpy

18:45 <geist> MEMCPY

18:45 * gog prr

18:45 <geist> if only RUUUUUUUUST

18:45 <Ermine> Let's get gog out of fw first

18:45 <heat> RUUUUUUUUUUUUUUUUUUUUUUUUUST

18:45 <geist> what, we need to stop trapping gog in firmware?

18:45 <heat> yes

18:45 <heat> firmware bad

18:45 <geist> gog in the machine

18:46 <gog> don't copy memory

18:46 <gog> copying bad

18:46 <gog> never copy

18:46 <Ermine> map map map map

18:47 <geist> dont copy that memory!

18:47 <geist> that's not aligned with the ideals of the firmware

18:47 <Ermine> One mremap to rule them all

18:48 danilogondolfo has joined #osdev

18:48 <heat> silly gag mapping memory is slow!

18:48 <Ermine> -sponsored by memory copying gang

18:51 <geist> welcome to the machine (mode)

18:52 <heat> TAKE THE RED PILL, EXIT THE MACHINE

18:52 bnchs has joined #osdev

18:52 <heat> why is riscv such a trashy arch

18:53 <geist> oh i dunno, i think it's kinda elegant

18:53 <geist> just limited

18:53 <heat> i dont like that half the stuff is handled in M mode and passed down to the kernel like some sort of weird hypervisor

18:53 <sham1> What did you expect from a RISC

18:53 <geist> all of my complaints tend to be some variant of 'this area is underdeveloped and needs to get more complicated'

18:53 <geist> ah yeah the SBI stuff is a mixed bag

18:54 <geist> though to be fair it's not required for say embedded or whatnot. embedded code probably runs directly in M mode and doesn't have any SBI around

18:54 <heat> and this "haha unaligned accesses Just Work" is another weird quirk

18:54 <heat> like, why?

18:54 <Ermine> Mr. Smith: *tries to copy unaligned memory* Everybody: *get into machine again*

18:54 <geist> i may see about getting this changed via work, but trying to come up with a compelling argument

18:54 <heat> most of this stuff doesn't really seem very well thought out

18:54 <geist> ie, at least have some ability to set the properties of SBI via some call

18:55 <heat> geist, performance?

18:55 <heat> or control

18:55 <geist> yes

18:55 <Ermine> Btw how does it happen on other arches?

18:55 <heat> how does what happen

18:55 <geist> varies. some architectures disallow unaligned accesses entirely

18:56 <geist> or they let you set a control bit that conditionally allows it

18:56 <geist> or they allow it entirely

18:56 <heat> yeah i mean in general on new stuff you just handle them

18:56 <geist> tend to fall within those 3 categories

18:56 <Ermine> unaligned memory access

18:56 <heat> like x86 and arm64 all do unaligned

18:56 <geist> there's a 4th category, like armv4, where it just gave you effectively garbage when you did an unaligend access

18:56 <sham1> Unaligned accesses are ugly, slow and considered harmful

18:56 <heat> you're ugly and slow and considered harmful

18:57 <sham1> Right, but so are unaligned accesses

18:57 <geist> so what riscv is trying to do is standardize the ABI to declare that unaligned is okay

18:57 <geist> so it's fairly clear they want things to eventually arrive at unaligned is fine, but current hardware doesn't have the support

18:57 <geist> so SBI transparently fixes it. so it's clearly not a performance choice, but a anti-fragmentation choice

18:57 <geist> i get the idea, i'd just like a bit more control so i can turn it off and trap

18:58 <geist> and note this is everything to do with supervisor mode level OSes. for pure embedded, machine mode, you have to deal with it yourself

18:58 <geist> but in those cases ABI compatibility is generally not a concern

18:59 <geist> note i haven't checked to see what qemu is doing. it's entirely possible qemu is simply doing full unaligned access, since there's probably little reason for it not to

19:02 <danlarkin> I'm only mildly informed but I think it changed recently to faulting to M mode on an unaligned access

19:02 <heat> cry.jpeg?

19:02 <heat> you know, because qemu tcg isn't slow enough

19:04 <geist> danlarkin: as in SBI didn't previously do this?

19:04 <geist> well opensbi that is

19:06 <danlarkin> nah qemu I mean

19:07 <geist> oh gotcha. yah was gonna write a test in a minute, easy enough to determine by just looking at the performance of it

19:07 <heat> old qemu did unaligned natively

19:08 <heat> https://github.com/riscv-software-src/riscv-isa-sim/issues/93 see sorear's comment

19:08 <bslsk05> github.com: The unaligned load/store is not supported in Spike · Issue #93 · riscv-software-src/riscv-isa-sim · GitHub

19:14 Ali_A has quit [Quit: Client closed]

19:16 sm2n has quit [Ping timeout: 240 seconds]

19:16 ddevault has quit [Ping timeout: 240 seconds]

19:16 milesrout_ has quit [Ping timeout: 240 seconds]

19:16 patwid has quit [Ping timeout: 240 seconds]

19:16 whereiseveryone has quit [Ping timeout: 240 seconds]

19:16 pitust has quit [Read error: Connection reset by peer]

19:16 exec64 has quit [Read error: Connection reset by peer]

19:16 tom5760 has quit [Read error: Connection reset by peer]

19:16 alethkit has quit [Read error: Connection reset by peer]

19:16 utzig has quit [Read error: Connection reset by peer]

19:16 vismie has quit [Read error: Connection reset by peer]

19:16 tommybomb has quit [Write error: Connection reset by peer]

19:18 staceee has quit [Ping timeout: 240 seconds]

19:18 milesrout_ has joined #osdev

19:19 vismie has joined #osdev

19:19 tommybomb has joined #osdev

19:19 utzig has joined #osdev

19:20 alethkit has joined #osdev

19:20 tom5760 has joined #osdev

19:21 whereiseveryone has joined #osdev

19:21 pitust has joined #osdev

19:21 exec64 has joined #osdev

19:21 staceee has joined #osdev

19:21 patwid has joined #osdev

19:21 ddevault has joined #osdev

19:21 sm2n has joined #osdev

19:22 Ali_A has joined #osdev

19:22 alturmann1729 has joined #osdev

19:22 Ali_A has quit [Client Quit]

19:26 nyah has joined #osdev

19:26 catern has quit [Ping timeout: 250 seconds]

19:36 crankslider has joined #osdev

20:34 <geist> yeah interesting

20:35 <gog> hi

20:38 <Ermine> ho gog

20:39 <lav> let's go

20:52 GeDaMo has quit [Quit: That's it, you people have stood in my way long enough! I'm going to clown college!]

20:54 gbowne1 has joined #osdev

20:55 wereii has quit [Quit: ZNC - https://znc.in]

20:58 <geist> heat: hmm, from what i can tell that never happened. i'm not getting unaligned traps on qemu

20:58 wereii has joined #osdev

21:34 <heat> oh cool

21:38 catern has joined #osdev

21:49 <mrvn> geist: maybe qemu faults on unaligned access if and only if the actitecture segfaults on unaligned access and qemus segfault hander then decodes the opcode and throws unaligned access

21:50 <mrvn> Isn't unaligned access even on x86 still slower if you cross a cache line? You also keep 2 cache lines busy that way.

21:51 * mrvn doesn't get how people still generate unaligned access. Stop using packed.

21:53 <moon-child> I'm p sure 2 cache lines is the same speed as 1

21:53 <moon-child> for <=8-byte accesses, that is--wider accesses do want to be aligned

21:53 <moon-child> even if not, though, it's just one extra cycle

21:54 <moon-child> I think page crossing is more expensive, but even in that case is just microcoded and 10s of cycles

21:56 crankslider has quit [Ping timeout: 265 seconds]

21:56 <heat> moon-child, isn't avx2 movaps just slightly faster than movups

21:57 <moon-child> no

21:57 <moon-child> on aligned addresses, they have the same performance. On unaligned addresses, one fault, and the other might be slow

22:01 <geist> a IMO valid use case i've seen a compiler emit is for example copying 7 bytes: the simplest code gen is to emit to 4 byte load/stores, with one of them offset by one and overlapping

22:01 <geist> stuff like that i've seen a compiler do when i knows that it can do unaligned accesses with no issue. arm64 in particular loves that sort of thing

22:02 <moon-child> I have at least one mathematical program that relies heavily on unaligned accesses

22:03 <moon-child> if it's fast, no reason not to rely on it. (If not, then, well, different algorithm is called for)

22:03 <geist> exactly

22:05 * mjg is doing locked atomic ops across different huge pages

22:05 <mjg> 2 byte ops!

22:05 * moon-child slaps mjg around a bit with a large trout

22:05 <mjg> people talk about "latency bubbles" in the frontend etc.

22:06 <mjg> what is actually happening is that the cpu sees the shit you are feeding it and facepalms, then needs some cycles to recover

22:06 <moon-child> I read somewhere there are plans to let the cpu straight up fault on split locks

22:07 <mjg> but muh code!

22:07 <moon-child> poor code

22:07 <mjg> there was a lkml thread claiming there are real *games* using this shit

22:07 <heat> yes there are

22:07 <heat> this was a whole saga

22:07 <moon-child> o.o

22:08 <heat> they had to rollback the extensive throttling they were doing to split locking threads

22:08 <heat> https://lwn.net/Articles/911219/

22:08 <bslsk05> lwn.net: The search for the correct amount of split-lock misery [LWN.net]

22:09 <heat> actually, sorry, they kept the throttling but added a command line knob to the kernel

22:10 <mjg> truegamer=1

22:12 <heat> moon-child, btw this is an actual feature for 11th gen+ intel cpus

22:12 <heat> you get an exception for a split lock

22:12 <moon-child> oh cool

22:12 <moon-child> I thought it was just planned

22:13 <mjg> curious if RUST will end up generating code which runs into it

22:15 <mrvn> moon-child: why would page crossing be more expensive?

22:15 <mrvn> geist: the important word there is "knows that it can do unaligned accesses"

22:16 <moon-child> not sure. Possibly protection stuff

22:16 <geist> right

22:16 <mrvn> moon-child: a second TLB lookup?

22:16 <moon-child> but I have heard this is the case at least for apple arm and intel

22:16 <geist> and this is what ARM64 and armv8 has mandated

22:17 <moon-child> actually now I'm somewhat curious on x86 if this applies to a crossed 4k boundary when you have big pages

22:17 <moon-child> considering x86 cpus tend to do a lot of 4k stuff regardless of the page size

22:17 <mrvn> moon-child: If it does then it's doing a TLB lookup for no reasons. But the part of the core that does that might not know the page size

22:17 <heat> what does?

22:18 <mrvn> Frankly, nobody uses huge pages so why should they optimize for that?

22:18 <moon-child> mrvn: well--either that, or it doesn't have to do with tlb

22:18 <moon-child> mrvn: wut

22:19 <mrvn> moon-child: other than the phys mapping and VMs where would you use huge pages?

22:19 <heat> mjg, RUST does not generate code, llvm does. checkmate freebsd idiot

22:19 <heat> RUST is perfect, LLVM is a donkey

22:20 <moon-child> any case when you have a lot of data and you don't want to die on tlb...?

22:20 <heat> "every kernel ever"

22:20 <mrvn> moon-child: and what app does that?

22:21 <moon-child> most client apps 1) have comparatively small working sets and 2) don't need to manage their allocations at a low level to that degree

22:21 <mjg> heat: you missed the part where llvm has no choice but emit what it was asked for

22:22 <mjg> heat: and if it was asked for a split lock op, dafaq ya gonna do

22:22 <mrvn> NUMA and huge pages might be used in some high performance cluster stuff but all the home and desktop use cases won't be using it.

22:22 <moon-child> however I would expect allocators to transparently take advantage of that where feasible. I know mimalloc at least does (leaving aside lulz regarding queueing...), and I would expect the nice java gcs to

22:22 <mrvn> moon-child: does it pass the MAP_HUGE to mmap?

22:22 <heat> dude I'm fairly sure that since a recent kernel version a lot of program stacks are getting hugepage'd

22:22 <heat> through thp

22:23 <mjg> stacks?

22:23 <heat> yes

22:23 <mjg> huh

22:23 <mrvn> stacks? that would require actually allocating a multiple of 2MB for them.

22:23 <heat> which they are going to try to revert because it's somewhat silly

22:23 <mjg> not saying they should not, but i would expect fragmention to fuck it up real quick

22:23 <mrvn> pretty wastefull for threads.

22:23 <moon-child> mrvn: https://github.com/microsoft/mimalloc/blob/master/src/os.c#L624

22:23 <bslsk05> github.com: mimalloc/os.c at master · microsoft/mimalloc · GitHub

22:23 <heat> i'm fairly sure linux thp is agressively decent at this shit

22:23 <moon-child> heat: I heard transparent hugepages suck on linux

22:24 <heat> I think they're a mixed bag

22:24 <geist> seems like on an arch like arm64 it would be much more interesting since there are lots of intermediate page sizes

22:24 <moon-child> just hearsay so might be wrong, but

22:24 * moon-child nods

22:24 <geist> and AMD transparent 32K pages

22:24 <heat> https://www.youtube.com/watch?v=QJHUbtR0yI8 <-- thp

22:24 <bslsk05> www.youtube.com <no title>

22:24 <mjg> anyone has data how much stack is normally used?

22:24 <geist> which i've heard is actually a pretty good win if you can pull it off

22:25 <mrvn> geist: I would be interested in running with 64k page size. It's kind of the best size for IO too.

22:25 <geist> indeed

22:25 rnicholl1 has joined #osdev

22:25 <mjg> uh

22:25 <moon-child> geist: 32 interesting, have a reference?

22:25 <geist> ithink the biggest downside being the minimum size for a lot of things goes up, so probably would be a nonzero amount of overhead

22:25 <geist> moon-child: yeah it's just described in the AMD manual

22:25 <mrvn> memory is cheap.

22:26 <geist> if you map N pages back to back in the same way on the same alignment etc the hardware may transparently treat it as a larger TLB

22:26 <geist> but 16K pages is i think a nice compromise

22:26 <mjg> does 16K work though right now?

22:27 <mjg> i wuld expect tons of hardcoded 4k sizes

22:27 <mjg> in userspace

22:27 <moon-child> I feel like this probably depends on priorities

22:27 <mjg> like everything was vax

22:27 <mrvn> geist: I would expect the overhead to be pretty minimal for userspace actually. Most memory goes towards malloc and that can just use the remaining 60k after 4k was used for the next allocation. Doesn't really make sense to ask the kernel for single pages anyway.

22:27 <moon-child> intel prioritises hpc and server stuff so 2mb is fine for them

22:27 <geist> mjg: i'm fairly certain linux has gotten 16K support at this point

22:27 <moon-child> apple on client cares somewhat more about fragmentation when you have a lot of apps

22:27 <moon-child> amd somewhere in the middle

22:27 <geist> note i'm talking about arm64 which up front in its ABI mandated that 64K is the largest base page size

22:27 <mjg> geist: in that spirit freebsd is ding 16k on arm64 as well

22:28 <mjg> geist: but that already required a fair amount of rototoiling the base system

22:28 <geist> i know linux had 64k support pretty early on for arm64, but i think it only got 16k relatively recently

22:28 <mjg> i have to expect 3rd party software not only hardcodes 4k, but that it is doing nasty stuff with it

22:28 <geist> oh i'm sure stuff was busted, but then that hardware would't have worked on alpha!

22:28 <geist> or vax!

22:28 <mrvn> I think linux got 64k support for ppc and 16k for arm.

22:28 <mjg> i'm saying i expect the current realities to pretty bad

22:28 <mrvn> arm64 just inherited it

22:28 <geist> yah

22:28 dutch has quit [Quit: WeeChat 3.8]

22:29 * geist nods

22:29 <mrvn> mjg: then it will fail because it didn't check the sysconfig for page size

22:29 <geist> i mean sure, yeah OTOH that same software running on mac already has to deal with it, so i suspect things sort itself out over time

22:30 <geist> old x86 software is i think where there's a lot more problem, since there's a basic assumption that old x86 stuff still works. so if there was some minimum page size bump there i bet a lot of shit gets broken

22:30 <geist> ARM64 is new enough that i would think you're not going to get too upset on linux if some old binary doesn't run right

22:31 <mrvn> I wouldn't expect things to run "not right" but outright fail.

22:31 <moon-child> honestly I would expect concurrency stuff to cause more silent breakage

22:31 <moon-child> 'not right' yeah

22:32 <mrvn> EINVAL We don't like addr, length, or offset (e.g., they are too large,

22:32 <mrvn> or not aligned on a page boundary).

22:33 <mrvn> My expecteation would be that anything that does things PAGE_SIZE related will call mmap or mprotect and get the above error

22:33 [itchyjunk] has joined #osdev

22:34 <mrvn> Urgs, just though of something. All ELF files are build so the segments are at 4k or 2M boundaries. If you change the page size to 64k then 4k alignment won't work and 2M you have to split into 64k chunks.

22:35 <mrvn> So 32bit x86 is screwed with it's 4k default. On AMD64 it's hit or miss wether something uses 4k or 2M.

22:35 <moon-child> https://www.da.vidbuchanan.co.uk/blog/netflix-on-asahi.html cf

22:35 <bslsk05> www.da.vidbuchanan.co.uk: The Quest for Netflix on Asahi Linux | Blog

22:36 <mrvn> geist: Gues you would have to at least recompile everything for 16k or 64k pages.

22:36 <geist> mrvn: for ARM64 at least that's why the ELF ABI says to do 64k alignment

22:37 <geist> so that if the system is using any of the 3 base page granules it still works

22:37 henloduud42069 has joined #osdev

22:37 <mrvn> geist: even better reason than amd64 recommending 2M.

22:39 dutch has joined #osdev

22:41 henloduud42069 has quit [Client Quit]

22:44 bnchs has quit [Remote host closed the connection]

22:50 dude12312414 has quit [Ping timeout: 255 seconds]

22:57 dude12312414 has joined #osdev

23:03 sjs has quit [Remote host closed the connection]

23:07 sjs has joined #osdev

23:13 rnicholl1 has quit [Quit: My laptop has gone to sleep.]

23:14 simpl_e has quit [Remote host closed the connection]

23:14 ptrc has quit [Ping timeout: 265 seconds]

23:17 simpl_e has joined #osdev

23:30 ptrc has joined #osdev

23:34 gog` has joined #osdev

23:34 gog has quit [Ping timeout: 268 seconds]

23:38 gog` is now known as gog

23:42 <heat> gogzilla

23:43 gog` has joined #osdev

23:43 <heat> gog, is gog` an impostor

23:44 <heat> is this a gog conspiracy

23:44 <heat> gogspiracy

23:44 <Griwes> strong "I'm gregnant" vibes

23:45 <heat> i'm pargant

23:45 gog has quit [Ping timeout: 265 seconds]

23:53 gog` is now known as pog

23:53 <pog> i'm pognant

23:54 <heat> omg congrats

23:55 <zid> pog

23:56 <zid> who's the.. victim?

23:56 <zid> I assume you already ate their head