#osdev on 2022-06-26 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:01 <heat> maybe I should finally pick up arm64

00:01 <geist> gotta say that nano pi r5s is not a half bad little machine. a lot pokey on the ram (2GB) but it *seems* hackable, and more importantly it's cheap and available

00:02 thinkpol has quit [Remote host closed the connection]

00:02 <geist> that being said i haven't really looked into precisely getting code on it yet. it has some complicated bootloader scheme that of course ultimately arrives at uboot, but need to figure out where to 'cut in' so to speak

00:03 thinkpol has joined #osdev

00:05 <geist> but it has a bunch of standard ethernet blocks (2.5gb realtek) so should be fun to write some networking stuffs on

00:18 Likorn has quit [Quit: WeeChat 3.4.1]

00:29 <heat> geist, what did you mean with "the use of PCID changes that calculus somewhat"?

00:30 <geist> well, if you're not using PCID then by definition when you reload the cr3 switching processes, it dumps all the TLB entries for that process (assumung you're using global pages in the usual way)

00:30 <heat> yes

00:30 <geist> but if you're using PCID now other cpus can have TLBs active from aspaces that are not currently active, so the calculation of what is potentially on that cpu is different

00:30 <geist> so it makes the solution more complicated

00:31 <geist> and then there's the AMD solution which basically points you in the direction of ARM, sincei t's fundamentally the same thing

00:31 <heat> how do you solve that?

00:32 <heat> you can't possibly know that you have an ASID present in the TLB that's not active

00:32 <geist> i guess you could keep a list of cpus that the aspace is *ptentially* available on

00:32 <geist> which means more IPIs for more shootdowns

00:33 <geist> or i guess you could set some sort of generation counter that when the aspace becomes active again on that cpu it simply dumps the entire TLB

00:33 <geist> that way you only get the advantage of switching back if the aspace hadn't been touched

00:33 <geist> but now yuo have to track gen counters per cpu per aspace?

00:34 <geist> (we haven't solved it in zircon yet, no PCIDs for x86)

00:36 <geist> my guess is a good solution is also integrated into PCID assignment, which we also haven't solved. currently am just assigning a 16bit id per ASID in ARM per fprocess, which is good up to 64k processes, of course

00:36 <geist> but with 8 or 12 bit you start pushing it. so seems that real oses do some sort of more dynamic assignment where they roll through IDs on the fly

00:37 <geist> so most likely if you can pick some sort of reasonable point where you swizzle ids you can also integrate the recycling of ids to dumpgint eh TLB

00:37 <geist> and thus avoid the stale cpu PCID TLB shootdown problem

00:38 <heat> linux uses gens

00:38 <heat> makes sense

00:39 <geist> yeah i dunno what they do but i'm not surprised. i'm a fan of gen-style algorithms

00:40 <heat> does TLBI's latency increase linearly with the number of CPUs an address space is on?

00:41 <geist> the arm/amd solution? presumably

00:41 <geist> but note that both of them are a two instruction sequence: one where you start the sync and the othe where you wait to complete it

00:41 <geist> so there's some ability to hide the latency by manually filling that gap with other work

00:42 <heat> hmm

00:42 <geist> but i'm guessing it internally uses whatever existing logic is there to deal with cache and atomic coherency, so it's probably not too bad

00:42 <heat> two instructions?

00:42 <geist> yah, two instructions

00:42 <heat> on arm isn't just a TLBI?

00:42 <heat> or is this just for local TLBs

00:43 <geist> it's the same instruction local or global

00:43 <geist> just different option bits

00:43 <geist> and even local, it's a two instruction sequence

00:43 psykose has quit [Remote host closed the connection]

00:43 <geist> it doesn';t block waiting for the TLB to sync, it juist starts the operation, you add a DSB instruction to synchronize it

00:44 psykose has joined #osdev

00:44 <geist> same as cache flusing instructions on arm. basically aside from DSB and DMB instructions, which are explicit barriers, no general instructions stall on ARM like that, so it's a fairly consistent model

00:45 <geist> even writing to core control registers like SCTLR and whatnot are not considered synchronizing necessarily (theres a bunch of rules about that) so in general you follow one of those up immediately with an ISB, which synchronizes the pipeline with the value you just wrotre to the control register

00:46 <heat> yeah

00:46 <heat> AMD has TLBSYNC for that

00:47 <heat> I had never seen this extension before

00:47 <heat> looks cool

00:47 <geist> yeah, it is almost perfectly a match for the ARM model, so it almost has to have come out of the K12 work

00:47 <geist> like they already had to build the machinery for it, but they finally got it ready for prime time

00:48 <geist> but if not for the lack of needing IPI, you can also locally sync with a range, which is nice

00:50 <heat> this reminds me, I remember vaguely that you mentioned that an AMD extension required you to invalidate something when you changed the page tables

00:50 <heat> any idea of what that was?

01:01 CaCode- has joined #osdev

01:04 CaCode_ has quit [Ping timeout: 255 seconds]

01:33 lainon has joined #osdev

01:35 lainon has quit [Client Quit]

01:36 lainon has joined #osdev

01:45 scaramanga has joined #osdev

01:49 simpl_e has joined #osdev

01:52 lainon has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

02:02 CaCode- has quit [Ping timeout: 246 seconds]

02:22 dude12312414 has joined #osdev

02:22 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

02:25 dude12312414 has quit [Client Quit]

02:47 heat has quit [Read error: Connection reset by peer]

02:47 heat has joined #osdev

02:50 Likorn has joined #osdev

02:56 matrice64 has joined #osdev

02:58 gog has quit [Ping timeout: 272 seconds]

03:17 heat has quit [Remote host closed the connection]

03:18 heat has joined #osdev

03:25 wand has quit [Ping timeout: 268 seconds]

03:27 CaCode- has joined #osdev

03:30 toluene has quit [Quit: Ping timeout (120 seconds)]

03:32 toluene has joined #osdev

03:32 wand has joined #osdev

03:33 CaCode- has quit [Ping timeout: 255 seconds]

03:44 rorx has joined #osdev

04:58 foudfou has quit [Remote host closed the connection]

04:59 foudfou has joined #osdev

05:19 matrice64 has quit [Quit: Textual IRC Client: www.textualapp.com]

05:22 foudfou has quit [Remote host closed the connection]

05:22 Likorn has quit [Quit: WeeChat 3.4.1]

05:23 foudfou has joined #osdev

05:45 vai has joined #osdev

05:45 vai is now known as Jari--

05:45 * Jari-- been busy designing his AIML Project

05:57 GeDaMo has joined #osdev

06:03 heat has quit [Read error: Connection reset by peer]

06:03 heat has joined #osdev

06:12 toluene has quit [Quit: Ping timeout (120 seconds)]

06:14 toluene has joined #osdev

06:15 opal has quit [Remote host closed the connection]

06:16 opal has joined #osdev

06:24 heat_ has joined #osdev

06:25 heat has quit [Read error: Connection reset by peer]

06:35 srjek has quit [Ping timeout: 272 seconds]

06:39 heat_ has quit [Ping timeout: 272 seconds]

06:57 the_lanetly_052_ has joined #osdev

07:43 jafarlihi has joined #osdev

07:43 jafarlihi is now known as l33th4x0r

07:43 l33th4x0r is now known as l337h4x0r

07:44 <l337h4x0r> Does anyone know if execveat calls execve internally in Linux? Or where it is defined at?

07:44 <l337h4x0r> I grepped, can't find it

07:45 <psykose> not this again

07:48 <geist> no idea. keep in mind this isn't a linux channel

07:49 <geist> when it comes to things like the GDT or whatnot low level stuff, thats kinda interesting to us, but we're not really here to help folks dig into guts of linux

07:49 <geist> except maybe to learn operating system concepts

07:49 <l337h4x0r> Other channels don't know shit

07:51 <geist> bummer

07:52 Jari-- has quit [Quit: leaving]

07:54 <geist> mostly just pointing out that if you just pop in every few days and pepper us with questions about linux for your hack linux project and then dissapear, you'll probably wear out your welcome

08:01 Matt|home has quit [Ping timeout: 246 seconds]

08:03 <mrvn> too late

08:04 Matt|home has joined #osdev

08:29 l337h4x0r has quit [Ping timeout: 255 seconds]

08:33 l337h4x0r has joined #osdev

08:36 l337h4x0r has quit [Client Quit]

08:54 psykose has quit [Remote host closed the connection]

08:55 psykose has joined #osdev

09:23 Burgundy has joined #osdev

09:31 <kazinsal> anyone know if there are any good books or similar about the PPC MacOS nanokernel/68k rosetta?

09:38 l337h4x0r has joined #osdev

09:45 <ddevault> error: attempt to use poisoned "calloc"

09:45 <ddevault> the fuck

09:47 <liz_> "poisoned" is a humorous term

09:47 <liz_> well, in the context of GCC; it's less funny generally

09:48 <ddevault> patched it out of gcc/system.h

09:48 <ddevault> ¯\_(ツ)_/¯

09:48 <kazinsal> the real fun part is that in the modern search engine environment, trying to look up an error like that results in just people trying to hack their way around it rather than an understanding of what the root cause is

09:50 <liz_> perhaps that's a symptom of compounding complexity in the tools with those errors

09:51 <ddevault> yeah, why am I even using gcc instead of cproc

09:58 mzxtuelkl has joined #osdev

09:58 <mrvn> so what does it mean?

09:59 <ddevault> something about wanting to really undefine an identifier

10:01 <mrvn> forgot nostdinc, nostdlib, nobuiltin?

10:03 <ddevault> I'm building a cross compiler that targets my OS

10:13 * Mutabah does a little dance - EHCI interrupt endpoints working

10:15 <ddevault> ayy https://paste.sr.ht/~sircmpwn/10a076a6854dfed14c69df774f18058dcb7dd422

10:15 <bslsk05> paste.sr.ht: paste.txt — paste.sr.ht

10:22 alexander has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

10:28 l337h4x0r has quit [Ping timeout: 255 seconds]

10:32 Vercas2 has joined #osdev

10:33 Vercas has quit [Quit: Ping timeout (120 seconds)]

10:33 Vercas2 is now known as Vercas

10:37 <gamozo> Mutabah: would you say you've made it to an EHCI... end point?

10:39 <Mutabah> gamozo: Fuck you... but yes :)

10:40 alexander has joined #osdev

10:40 <gamozo> :D

10:40 <kazinsal> lol

10:41 alexander has quit [Client Quit]

10:41 <kazinsal> one of these days when I get around to actually doing some osdev stuff my next big non-core project will be USB and I'm dreading it

10:42 <Mutabah> I think I've said it before, but USB1 is relatively easy - it is a bit of a learning curve, but it's not too bad

10:43 <Mutabah> It's like networking, with more structure and a few more concepts

10:50 alexander has joined #osdev

10:52 <kazinsal> my hypothetical osdev goal is to eventually be a sort of open source "competitor" to eg IOS or Junos

10:52 <kazinsal> deep in the hypothetical of course

10:53 <kazinsal> need to actually get my ass into the zone to write a bunch of core code first

10:54 elastic_dog has quit [Quit: elastic_dog]

10:55 elastic_dog has joined #osdev

11:22 the_lanetly_052_ has quit [Remote host closed the connection]

11:23 alexander has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

11:30 alexander has joined #osdev

11:30 gxt has quit [Remote host closed the connection]

11:30 alexander has quit [Client Quit]

11:30 gxt has joined #osdev

11:40 alexander has joined #osdev

11:46 arch-angel has quit [Ping timeout: 240 seconds]

11:50 gog has joined #osdev

11:54 Likorn has joined #osdev

12:03 arch-angel has joined #osdev

12:13 jafarlihi has joined #osdev

12:13 jafarlihi is now known as l337h4x0r

12:14 <l337h4x0r> When reading 16 byte IDT entries to a struct with two u64, do you get higher half first or the lower half?

12:17 <Mutabah> higher at higher addresses?

12:17 <l337h4x0r> What?

12:18 <psykose> how do you read them?

12:18 <Mutabah> well... I'm not sure what you mean.

12:18 <Mutabah> There isn't really a higher/lower half in the endian sense for the IDT

12:18 VicIamQuickY has joined #osdev

12:18 <Mutabah> There is a second qword to it... and it's (iirc) specified in the docs using dwords

12:19 VicIamQuickY has quit [Client Quit]

12:21 <zid`> Whichever order you wrote them to memory in in the first place :P

12:21 <zid`> we sadly were not there to watch

12:26 Burgundy has quit [Ping timeout: 268 seconds]

12:28 l337h4x0r has quit [Read error: Connection reset by peer]

12:33 jafarlihi has joined #osdev

12:34 <jafarlihi> Which one of these look like a valid IDT entry in x64?: 0xb1808e0100100bc0ffffffff vs 0xffffffffb1808e0100100bc0

12:37 <zid`> ooh is this a quiz

12:37 <jafarlihi> pls sir

12:37 <zid`> I would have assumed the ff.. was actually in the middle, I'd have to check the manual

12:38 X-Scale` has joined #osdev

12:39 <zid`> https://cdn.discordapp.com/attachments/417023075348119556/990597031930241054/unknown.png

12:39 <jafarlihi> That's 32 bits

12:39 <zid`> is that the wrong one, it's too short

12:39 <zid`> ah found it

12:39 <zid`> https://cdn.discordapp.com/attachments/417023075348119556/990597260201058355/unknown.png

12:40 <zid`> offset, offset, selector, selector, ist+0s, dpl/type, offset, offset, offset, offset, offset, offset, offset, resv, resv, resv, resv

12:41 <jafarlihi> So the former is more likely?

12:41 X-Scale has quit [Ping timeout: 246 seconds]

12:41 X-Scale` is now known as X-Scale

12:41 <zid`> I mean, how are you intending to write a 128bit constant to memory, SSE?

12:41 <jafarlihi> Here's code, I'm not writing but reading: https://github.com/jafarlihi/kernsec/blob/master/idt/hello.c

12:41 <bslsk05> github.com: kernsec/hello.c at master · jafarlihi/kernsec · GitHub

12:41 <zid`> so why not treat it as bytes and make your life super easy?

12:42 <jafarlihi> I'm treating it as unsigned __int128, what is "treat as bytes"?

12:43 <zid`> char[]

12:43 <Mutabah> jafarlihi: Going to change to your original nick?

12:43 <zid`> __int128 isn't a real type

12:43 <zid`> it's simulated for you by the compiler

12:43 <Mutabah> well... it's a gcc type

12:43 <Mutabah> likely stored in the machine's endian, so higher bits at higher bytes for x86

12:43 <zid`> and involves annoying endian issues you wouldn't have to deal with

12:43 <zid`> if you'd just deal with it by creating a struct or a char pointer

12:46 lainon has joined #osdev

12:50 dennis95 has joined #osdev

12:52 <mrvn> I want int128_t

12:53 <mrvn> and while we are at adding types lets add int256_t, int512_t and int1024_t as well. It's not like architectures have to have it.

12:53 scaramanga has quit [Quit: segfault at 6565656565656565 ip 0000000000401116 sp 00007fff8af7d930 error 6 in bitchx[401000+1000]]

12:53 <Mutabah> rust has u128/i128

12:53 <mrvn> or even allow intX_t for any X the implementation likes.

12:54 <Mutabah> (with bigger ones provided by libraries, albeit not with literals)

12:54 <j`ey> Mutabah: cant you have macro!(665456787654345678987654345678987654345678987654)?

12:54 <mrvn> j`ey: 665456787654345678987654345678987654345678987654_big

12:54 <j`ey> not in rust

12:55 <mrvn> c++ has user defined literals

12:56 <Mutabah> j`ey: the literal still has to fit in u128

12:56 blockhead has quit []

12:56 <j`ey> Mutabah: lame

13:07 jafarlihi has quit [Quit: WeeChat 3.5]

13:21 heat_ has joined #osdev

13:23 <heat_> rust is lame

13:24 <mrvn> lame is a mp3 encoder

13:24 <heat_> can we rewrite it in rust

13:24 <zid`> ooh good idea

13:24 <zid`> maybe we can get the build dir up from 20GB to 80GB with that

13:25 <mrvn> is there ssh in rust?

13:25 <Mutabah> pretty sure there's a lib or two

13:26 <heat_> https://pijul.org/thrussh

13:26 <bslsk05> pijul.org: Pijul - Thrussh

13:26 <Mutabah> although... I hear that the dropbear dev is looking into makng one

13:26 <heat_> we rewrote it in rust bois

13:28 <mrvn> in rust can I use different allocators for collections or can I only define the global_allocator?

13:28 <Mutabah> You can override the allocator in the data type

13:28 <heat_> Mutabah, that kinda beats the purpose of having a "runs on anything program" program

13:28 <heat_> "Dropbear is particularly useful for "embedded"-type Linux (or other Unix) systems, such as wireless routers." <-- how many routers have LLVM and rust targets?

13:28 heat_ is now known as heat

13:29 <Mutabah> heat: Oh, not rewriting dropbear in rust... I think

13:29 <heat> ah, making a new one?

13:29 <Mutabah> probably

13:29 <mrvn> heat: arm has rust, doesn't mips?

13:29 <Mutabah> Or a ssh library

13:29 <j`ey> mips has

13:30 <mrvn> So basically all wireless routers are covered.

13:30 <heat> that's missing the point

13:30 <heat> this runs everywhere

13:30 <heat> every little crap system

13:30 <heat> it even runs on my fucking OS

13:30 <mrvn> I doubt it runs on 6502 or even avr

13:31 <mrvn> heat: no rust for your OS?

13:31 <heat> no

13:31 <mrvn> better start then. :)

13:31 <heat> not yet at least

13:31 <heat> meanwhile, dropbear can run on Tru-fucking-64

13:32 <heat> the power of C and gcc :)

13:33 <psykose> it runs everywhere but it sucks

13:33 <psykose> it's better to run nowhere and not to suck

13:33 <heat> it sucks?

13:34 <mrvn> curl sucks faster, and saver

13:38 <heat> curl needs UNIX sockets so I don't have support for it yet

13:39 <clever> heat: i have seen in the man page, that it supports http over unix

13:40 <clever> curl --unix-socket socket-path https://example.com

13:40 <bslsk05> example.com: Example Domain

13:41 <clever> the domain is still in the url, for https cert and virtualhost reasons

13:42 <heat> lol

13:43 <heat> the issue is that it uses socketpair() to resolve names

13:43 <clever> it even supports abstract unix sockets

13:43 <clever> curl --abstract-unix-socket socketpath https://example.com

13:43 <heat> abstract unix sockets are kool

13:43 <clever> which exist outside of the filesystem, in their own realm

13:44 <clever> i forgot about the difference, and had trouble getting curl to connect to my socket when i tried this out

13:45 <heat> abstract unix sockets should've been the only unix sockets, or at least the default ones

13:46 <heat> messing with the filesystem to create a socket is stupid

13:47 <clever> i kind of like being able to use standard tooling like ls and chmod to both find and control permissions

13:47 <mrvn> heat: how does another process access the abstract socket? Do you have to pass the FD or does path work?

13:47 <clever> and symlinks to alias the sockets

13:47 <clever> mrvn: they have names like @foo, and if you know the name, you can connect

13:47 <clever> you just cant see it with ls

13:48 <mrvn> So it's an FS just not visible in the default namespace.

13:48 <clever> exactly

13:48 <heat> no, it's not

13:48 <heat> clever is wrong

13:48 <mrvn> and how do you control permissions?

13:48 <heat> you don't

13:48 <clever> i think abstract sockets also lack directories?

13:48 <mrvn> that seems like 1 step forward and 100 steps back

13:48 <heat> abstract unix sockets are just socket addresses that have a name and start with a \0

13:48 <clever> so its more of a flat name -> socket list

13:49 <heat> you don't need a filesystem to do IPC

13:49 <heat> sorry, that's just a stupid idea

13:49 <heat> sockets are ephemeral

13:49 <mrvn> heat: you don't need a unix socket to doe IPC

13:49 <heat> unix sockets are *the way* to the IPC in unix

13:49 <clever> yeah, i'm kind of bothered by how you need to delete a unix socket before you can listen on it

13:49 <clever> so you need to clean up after your previous instance

13:49 <clever> postgresql also goes the other direction

13:50 <clever> srwxrwxrwx 1 postgres postgres 0 Jun 26 09:55 /var/run/postgresql/.s.PGSQL.5432

13:50 <mrvn> It's fine to use a pipe pair or socket pair and pass the FD through fork/exec. If you wan't to have some lookup service for sockets across processes that better have some permissions.

13:50 <clever> its emulating tcp ports over unix sockets

13:50 <mrvn> -'

13:50 <heat> a socket pair is just two UNIX sockets connected to each other

13:51 <heat> *without* the filesystem

13:51 <mrvn> heat: exactly.

13:51 <clever> but how might you connect to something like the above postgresql socket?

13:51 <mrvn> and I believe you can dup it to make more sockets

13:52 <heat> if you wanted a way to get permissions on abstract unix sockets, that would be done ez

13:52 <mrvn> clever: that needs a lookup service, like an FS.

13:52 <heat> no need to use the fucking filesystem

13:52 <mrvn> ez?

13:52 <heat> yes

13:52 <mrvn> what is ez supposed to mean?

13:53 <heat> struct anon_name { uid_t owner; gid_t group; mode_t mode; }; <-- stick that in your struct unix_socket

13:53 <clever> mrvn: like rpb.bind in nfs?

13:53 <mrvn> That would work. All I'm saying is that you better have something like that.

13:54 <mrvn> Assuming nobody can guess the name of some socket is just stupid. Esspecially for offering known services where the name is public.

13:54 <heat> going through thousands and thousands more lines just to have a hierarchy and permissions is downright stupid

13:55 <heat> VFS code + filesystem + block device

13:55 <mrvn> not arguing that.

13:55 <clever> unix sockets can live on a tmpfs

13:55 <clever> the postgresql one above is on a tmpfs

13:55 <mrvn> remove the stupid bits and keep the good bits (permissions)

13:55 <clever> srwxrwxrwx 1 root root 0 Jun 26 08:36 /tmp/.X11-unix/X0

13:55 <clever> another unix socket your likely using on a daily basis

13:55 <heat> going to tmpfs is still stupid complex

13:56 <clever> that is where DISPLAY=:0 points

13:56 <mrvn> heat: it hardly matters. you open the socket once and then use it for ages.

13:56 <heat> it does to me

13:56 <heat> you want to reduce complexity

13:56 <mrvn> it uses thousands of lines of code you already need and have.

13:56 <heat> therefore, struct anon_name

13:57 <clever> what if you want to expose a unix socket to a docker container?

13:57 <heat> also, any connect/sendto will look that up

13:57 <clever> with normal unix sockets, you can bind mount them into another mount namespace

13:57 <heat> making it quite a bit slower

13:57 <clever> but how do you clone an abstract socket into another namespace, such that connect() routes to accept()?

13:58 <mrvn> heat: connect you do once. sendto takes an FD and doesn't do any FS lookups.

13:58 <heat> wrong

13:58 <heat> sendto takes a sockaddr

13:58 <clever> heat: both right, sendto takes an open socket, and a sockaddr

13:58 <mrvn> heat: is that used on a unix socket?

13:58 <heat> yes

13:59 <clever> udp is the case where having both matters the most

13:59 <heat> SOCK_DGRAM is a possibly socket type for the socket

13:59 <clever> the open socket FD, is the source udp port

13:59 <clever> while the sockaddr is the dest ip/port

13:59 <mrvn> "If sendto() is used on a connection-mode (SOCK_STREAM, SOCK_SEQPACKET) socket, the arguments dest_addr and addrlen are ignored

13:59 <mrvn> "

13:59 <clever> ah, and that, forgot about that

13:59 <heat> AF_UNIX has SOCK_DGRAM support

13:59 <heat> (and that sentence isn't correct on linux btw)

14:00 <mrvn> How does that work with unix sockets? How would that get routed to the dest_addr?

14:01 <mrvn> How do you specify your own address when listening?

14:01 <clever> one min

14:01 <clever> ive done something with wpa_supplicant before

14:01 <mrvn> And how does a second process listen on the same unix socket?

14:02 <clever> mrvn: srwxrwxrwx 1 root root 0 Jun 26 08:36 /tmp/.X11-unix/X0

14:02 <clever> https://gist.github.com/cleverca22/79bcd06b01ba825f65416f7bd85f2855#file-wpa_client-cpp-L30-L37

14:02 <bslsk05> gist.github.com: wpa_client.cpp · GitHub

14:03 <clever> the client, must first bind to its own unix socket path

14:03 <clever> and then can connect to the server

14:03 <clever> when using DGRAM mode

14:03 <mrvn> clever: yes, you bind with the path to the socket. Where do you set the address?

14:03 <clever> and one bug i discovered, is that wpa_supplicant, isnt aware of you disconnecting, because its DGRAM mode

14:03 <clever> the client picks its own client unix socket, and puts it wherever

14:03 <clever> and binds as shown in the gist

14:04 <mrvn> clever: the question is about the server

14:04 <clever> the client can then connect() to the server's unix socket

14:04 <clever> but because its datagram based, no actual connection is formed, its just the default for write()

14:04 <mrvn> clever: and how does the server set the address it listens on?

14:04 <heat> bind

14:04 <heat> sockaddr_un

14:05 <mrvn> heat: no, that's just the path of the socket.

14:05 <clever> it calls bind() on its own socket, the same way, before it calls listen() and accept()

14:05 <heat> if name[0] == '\0', it's a abstract socket

14:05 <heat> I told you this before

14:05 <mrvn> or I'm missing something.

14:05 arch-angel has quit [Ping timeout: 268 seconds]

14:07 <mrvn> 1) server creates the unix socket, e.g. /tmp/.X11-unix/X0. 2) client creates a socket. 3) client connects socket to /tmp/.X11-unix/X0. 4) client sendto(fd, buf, len, flags, addr, addrlen); where addr is something magic to pick which server gets the data

14:07 <clever> the bug i ran into, is that beause wpa_supplicant is using a DGRAM socket, it has no idea when i connect or disconnect

14:08 <mrvn> I don't see how addr is supposed to work with a unix socket even in DGRAM mode.

14:08 <clever> and only when you send the ATTACH message, does the server become aware of you, and begin sending things to /tmp/wpa_client

14:08 <clever> and critically, if you ATTACH twice, you get 2 messages for every event!

14:11 <clever> mrvn: i believe when in DGRAM mode, the addr must be a sockaddr_un, and the client must also first bind to its own sockaddr_un

14:12 <mrvn> clever: I think that if you use sendto then the address would be some (other) socket, like /tmp/.X11-unix/X1 and you basically operate in an unconnected mode.

14:13 <clever> yeah

14:13 <clever> but the X11 socket, is stream based, so that wouldnt happen there

14:13 <mrvn> So then the answer is simple: don't do that with unix sockets. That's what connect is for. :)

14:13 <clever> wpa_supplicant is designed to operate in DGRAM mode instead

14:14 <clever> instead of maintaining one open FD to every client, it just keeps an array of every sockaddr_un that has attached, and calls sendto() to each, on the same FD

14:14 <mrvn> At least I hope that if you connect if does the permissions check once and caches the socket instead of looking it up in the FS on every dgram.

14:14 <clever> yeah, i dont know how permissions work with DGRAM

14:15 <mrvn> clever: saves a lot of resources but has the drawback that you don't know when a client disconnects.

14:15 <clever> yeah

14:15 <clever> and it has a bug, where if a client restarts, and does ATTACH again (a msg you send to the server), your sockaddr_un gets into the list twice

14:15 <mrvn> Should be easy to test. Open a unix DGRAM socket, connect, change the permissions in the FS and try sending.

14:15 <clever> so now you get 2 of every event!

14:16 <mrvn> clever: how does the client get the same address as before?

14:16 <clever> on startup, it just deletes the unix socket, and listens on it again

14:16 <clever> because i was lazy and didnt use mktemp

14:16 <clever> https://gist.github.com/cleverca22/79bcd06b01ba825f65416f7bd85f2855#file-wpa_client-cpp-L31

14:16 <bslsk05> gist.github.com: wpa_client.cpp · GitHub

14:16 <clever> and the sockaddr_un is just a path, not any special kind of reference

14:17 <clever> so i assume on every sendto(), the kernel has to resolve the path again, and find whatever socket it now points to

14:17 <mrvn> clever: wouldn't you get some magic address when you use socket(); connect(); send()?

14:18 <heat> no

14:18 <heat> it stays unnamed

14:18 <mrvn> So unless you bind it to a names unix socket first you can't get replies?

14:18 <heat> yes

14:18 <heat> unless its a socketpair

14:18 <clever> the gist i linked above, is from a desktop widget i wrote back in 2011

14:18 <heat> those are implicitly connected to each other

14:18 <mrvn> s/names/named/ Well, as you see I haven't used unix sockets much :)

14:18 <clever> it showed the bssid of the current wifi, the cpu temp, cpu fan speed, and the wifi signal strength as a crazy-fast graph

14:19 <clever> it had such resolution and speed, that if i simply tilt the lcd on the laptop 10 degrees, the wifi signal changes

14:19 <clever> and i could use it to track down where a router was in seconds

14:19 <heat> the permissions check is only done when opening the fs socket on connect() or sendto

14:19 <mrvn> I would have assumed that when you connect without setting a name you get one of those @.... socket names with some random magic in it.

14:20 <heat> those @.... are abstract sockets

14:20 <heat> @ is just the null character :)

14:20 <heat> (what they use to print it out)

14:20 <mrvn> I know. So why isn't the kernel generating them implicitly when connecting?

14:20 <heat> because it's just Not How Thing Are Done (tm)

14:20 <heat> Things*

14:20 <heat> there's an autobind feature for bind(2) but it's linux specific

14:21 <mrvn> The should have changed that when they added abstract sockets

14:21 <heat> why? the semantics stay the same

14:22 <mrvn> heat: Because having the client sockets in the FS is really stupid.

14:22 <heat> they don't need to be

14:22 <heat> use abstract sockets

14:23 <mrvn> leading to such bugs as getting the wpa_client messages twice

14:23 <mrvn> heat: autobind should be the default.

14:23 <heat> but it's not

14:23 <heat> POSIX baby!

14:23 <heat> backwards compatibility til the 80s!

14:23 <mrvn> 16:21 < mrvn> The should have changed that when they added abstract sockets

14:24 <heat> they can't change it

14:24 <heat> they can't explicitly break POSIX like that

14:24 <heat> unnamed sockets exist

14:24 <clever> another fun thing, /proc/PID/fd/FD, lets you open a clone of any fd in any pid

14:25 <clever> so you can steal a reference to what should have been a private socket pair

14:25 <mrvn> clever: only if you have permissions and are in the same namespace

14:25 <clever> yeah

14:25 <clever> you can even open a file that has been deleted

14:25 <heat> procfs is majik

14:25 <mrvn> In which case you can also attach to the process and have it execute arbitrary code

14:25 <clever> lrwx------ 1 clever users 64 Jun 26 11:25 /proc/95405/fd/65 -> '/dev/shm/.org.chromium.Chromium.tqRyAT (deleted)'

14:26 <clever> as an example, this filename doesnt exist

14:26 <heat> mrvn, not quite

14:26 <clever> but if i try to open the symlink anyways, it will still work

14:26 <heat> ptrace is commonly restricted to children

14:26 <zid`> so I have to hire some child slaves to use it?

14:26 <clever> lrwx------ 1 clever users 64 Jun 26 08:37 /proc/9599/fd/3 -> 'socket:[23927]'

14:26 <clever> sockets are always "dead" symlinks

14:26 <mrvn> clever: opening deleted files in /proc/ is the only way to undelete something

14:26 <clever> mrvn: or zfs snapshots

14:26 <heat> you can just relink them

14:26 <clever> yep

14:27 <heat> wait, no

14:27 <mrvn> clever: if you have a snapshot then it wasn't deleted

14:27 <heat> you can't

14:27 andreas303 has quit [Quit: fBNC - https://bnc4free.com]

14:27 <heat> AT_EMPTY_PATH semantics on linkat are confusing

14:27 <clever> heat: hmmm, yeah, link() and linkat() both need the source path

14:27 <clever> the fd params, are for the source/dest dir, not the source filename

14:28 <heat> I find AT_EMPTY_PATH quite stupid

14:29 <heat> I think we've discussed this before

14:29 <heat> you need CAP_DAC_READ_SEARCH (effectively root) to use it

14:29 <mrvn> heat: avoids having to have an flink(int fd, const char * dest) syscall

14:29 <heat> and you can't undelete non-O_TMPFILE files for some reason

14:29 <heat> mrvn, AT_EMPTY_PATH doesn't behave like an flink(fd, dest)

14:30 <heat> 1) you need root or that cap; 2) you can't link a bunch of files

14:31 <clever> i just remembered a bug ive abused in android before, involving backup restoring

14:31 <clever> when you restore a backup, android will first set the permissions on a dir, as the backup said they should be

14:31 <clever> it will then open() every child of the dir, and write them out

14:32 <clever> the bug, is that you can race the restore, and create a symlink in its way

14:32 <clever> and it will then follow the symlink when restoring the file

14:32 <mrvn> that seems wrong. The dir should be 000 and only give permissions at the end

14:32 <clever> exactly

14:32 <heat> https://github.com/torvalds/linux/commit/f0cc6ffb8ce8961db587e5072168cac0cbc25f05

14:32 <bslsk05> github.com: Revert "fs: Allow unprivileged linkat(..., AT_EMPTY_PATH) aka flink" · torvalds/linux@f0cc6ff · GitHub

14:32 <clever> to abuse it, you restore a backup with a 777 directory, containing 1000 dummy files, and 1 known name

14:32 <clever> then you race to symlink that 1 known name, to a special file in /etc/

14:33 <clever> and boom, a special flag indicating that android is running under an emulator has been set

14:33 <mrvn> clever: it doesn't even use open exclusive?

14:33 <clever> since its under an emulator, there is no point in blocking root via adb

14:33 <clever> mrvn: this is an ancient version of android

14:33 <clever> the bug has likely been fixed years ago

14:33 <mrvn> it's always amusing how many times basic bugs are reinvented.

14:33 <clever> but, this is how i rooted my kindle fire

14:34 <clever> that bug lets you write to a file, that disables all security, at the cost of also breaking hw accel

14:34 <clever> but now you have root, and can create setuid root binaries anywhere you want, and undo that

14:35 <clever> so you can just give yourself an su binary, that doesnt need the root pw (in the case of android, it talks to an android app, and confirms via the GUI)

14:36 <heat> magisk does su with unix sockets

14:37 <mrvn> Speaking of android: I'm looking for a shopping list app where I can GPS tag items when I find them in the shop and next time it can tell me where to find stuff, show near items from the list, draw a map. Does that exist or do I have to write one?

14:37 <clever> mrvn: gps tends to not work that well indoors

14:37 xenos1984 has quit [Read error: Connection reset by peer]

14:37 <heat> I bet 50% here have a nokia 3310

14:37 <heat> and a 2005 thinkpad :)

14:38 <mrvn> clever: possible. Using the cameras to build a map and locate itself sounds complicated.

14:39 <clever> mrvn: open street map does have support for indoor mapping, and even mapping each floor of a building, so you can filter to 1 floor

14:39 <mrvn> wifi hotspots or bluetooth for locating yourself in the store sounds fishy too

14:39 <clever> but actually pointing to where you are on the map, wont work as well

14:40 <clever> so you would still have to count the ailes yourself and compare the map to the world with your eyes

14:40 <mrvn> the accelerometer might be ok to track positions inside the store.

14:40 <clever> there are google services to give a rought gps location, based on what wifi mac's are near you

14:40 <clever> so you can get "gps" without turning on the actual gps receiver

14:41 <mrvn> clever: that's like 100m near you. Not "you are standing next to the milk" near you.

14:41 <clever> yeah

14:42 <mrvn> would be nice if shops had RFID or bluetooth tags on their shelfs.

14:42 <clever> i think one of my local shops is using some form of rfid, but in a different way

14:42 <clever> the price tags on the shelf, are now an e-ink module, i think with rfid for power/data

14:42 <clever> so instead of printing a new price tag out every day, and generating waste

14:43 <clever> they hold a device over the tag, and it just magically updates

14:43 <mrvn> I've seen those online

14:43 <mrvn> I think they have a very short range. Like a few cm.

14:45 <clever> and they are unpowered

14:45 <clever> so you cant really snoop the waves and find out your location

14:45 <clever> the tag is powered by the writer

14:45 <mrvn> a passive thing is fine if you could ping it from a few meters.

14:45 <clever> yeah

14:46 <clever> that reminds me, have you heard about how apple's "find my" network works?

14:46 <mrvn> A map published by the store and QR codes on the shelfs would probably work best.

14:47 <clever> basically, there is a private key held on every apple device you own, somehow synced between them when you add a device to your acct

14:47 <clever> the OS will then use the private key, and the date, to generate a temporary key, for just today

14:47 <clever> it will then broadcast the pubkey of the day over bluetooth, constantly

14:48 <clever> if any other apple device (say an iphone) overhears that, it will encrypt its own gps location and time with the public key, and post the pubkey + ciphertext to apple

14:48 <clever> apple lacks the private, so they cant track you

14:48 <clever> the keypair changes every day, so a 3rd party cant track you by the signals your constantly emitting

14:49 <clever> but if you want to find out where your ipad is, you generate the public for the last 5 days, and ask apple for all ciphertexts belonging to those publics

14:49 <clever> you can then decrypt it, and find out where your ipad was last seen

14:50 <clever> mrvn: does that sound like a sound design?

14:51 vdamewood has joined #osdev

14:52 andreas303 has joined #osdev

14:54 xenos1984 has joined #osdev

15:17 <ddevault> hrm

15:17 <ddevault> my binutils patch builds a linker script which seemingly leaves .data empty in the ELF file

15:17 <ddevault> specifying a (sane) linker script manually "fixes" the problem

15:17 <heat> what linker script are you using?

15:17 <ddevault> I added my platform as a target to binutils

15:18 <ddevault> with this script for generating the linker script https://paste.sr.ht/~sircmpwn/cfc262929cfb998633112757e9bfb544c3c71609

15:18 <bslsk05> paste.sr.ht: paste.sh — paste.sr.ht

15:20 <ddevault> god I hate the binutils linker scripts

15:20 <heat> mine is just a ". ${srcdir}/emulparams/elf_x86_64.sh"

15:20 <heat> have you tried that?

15:20 <ddevault> I suppose I can

15:20 <heat> also a dump of the built-in linker script would be cool

15:20 <ddevault> know how to obtain one? I was scouring the man page for it

15:20 <zid`> -v just prints it doesn't it

15:21 <ddevault> same behavior without my changes

15:21 <ddevault> -v does not print it

15:21 <heat> it should?

15:22 <ddevault> do you mean --verbose?

15:22 <ddevault> that does work

15:22 <zid`> oh right ld is the weird one that overrides -v for version

15:22 <ddevault> https://paste.sr.ht/~sircmpwn/c97fac4a5d493e4926dff127d04fe014bd807c8c

15:22 <bslsk05> paste.sr.ht: paste.txt — paste.sr.ht

15:22 <ddevault> working script: https://paste.sr.ht/~sircmpwn/049f2db478aa0dbcc8bd8ac3c114058b3d0f0cc1

15:22 <bslsk05> paste.sr.ht: paste.txt — paste.sr.ht

15:23 <heat> it shouldn't discard .data

15:23 <ddevault> (not using --gc-sections)

15:23 <heat> are you using lto or gc?

15:23 <ddevault> .data is there, but it's zeroed out

15:23 <ddevault> no lto either

15:24 <ddevault> https://mirror.drewdevault.com/bunk.elf

15:24 <ddevault> bad elf file

15:24 <ddevault> https://mirror.drewdevault.com/good.elf

15:24 <ddevault> good file

15:24 <zid`> and your data actually has references?

15:25 <zid`> It'd be hard not to but if the test file is *very* simple it can end up getting optimized out I suppose

15:25 <ddevault> I mean, it looks good to me

15:25 <ddevault> none of this is built with an optimization flag

15:25 <ddevault> and the symbols are still *there* - they're just zero

15:25 <heat> .data is there?

15:25 <ddevault> yeah, see for yourself

15:25 <heat> 01 .ctors .dtors .data .bss

15:26 <heat> on bunk.elf

15:26 <zid`> okay so the boog is somewhere else just from the description

15:26 <ddevault> where do you see that, heat?

15:26 <heat> readelf -a bunk.elf

15:26 <heat> Section to segment mapping

15:26 <zid`> it's hard work to generate a section that's got the right size but contains the wrong data in a linker script

15:26 <heat> it did not discard .data

15:26 <heat> it's in the section headers

15:26 <zid`> so is that actually zero, or did you just load it wrong?

15:26 <zid`> and treat it as bss or something

15:26 <ddevault> ah, I was reading objdump

15:27 <ddevault> it is actually zero as best as I can tell

15:27 <ddevault> will explain how I checked this, hold please

15:27 <heat> 0x208 sized

15:27 <ddevault> objdump -h bunk.elf => .data @ file offset 000013e0

15:27 <ddevault> hexdump -C main, look for that offset... zero

15:28 <ddevault> actually, it's not all zero on a second look

15:28 <zid`> it contains data on my end

15:28 <heat> same

15:28 <ddevault> __stdout_FILE is the object I'm interested in, btw

15:28 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

15:28 <zid`> __stdout_used, __TMC_END__ blah blah

15:28 <zid`> oh wait I hexdumped the wrong thing

15:29 <zid`> because the input elf had funky warnings I ignored and I just chained the tools together

15:29 <heat> $1 = {flags = 5, rpos = 0x0, rend = 0x0, close = 0x0, wend = 0x0, wpos = 0x0, mustbezero_1 = 0x0, wbase = 0x0, read = 0x0, write = 0x40036a <writecons>, seek = 0x0,

15:29 <heat> buf = 0x601668 <stdoutbuf+8> "", buf_size = 4096, prev = 0x0, next = 0x0, fd = 1, pipe_pid = 0, lockcount = 0, mode = 0, lock = -1, lbf = 10, cookie = 0x0, off = 0, getln_buf = 0x0,

15:29 <heat> mustbezero_2 = 0x0, shend = 0x0, shlim = 0, shcnt = 0, prev_locked = 0x0, next_locked = 0x0}

15:29 <heat> print __stdout_FILE

15:29 <heat> (in gdb)

15:29 <heat> it's entirely correct

15:29 <ddevault> hrm, so it is

15:29 <ddevault> the hell

15:30 <zid`> my objcopy *hates* this btw

15:30 <zid`> empty section at 0x40000

15:30 <ddevault> hates which file, both?

15:30 <zid`> bunk

15:30 <zid`> didn't try good

15:30 <heat> what libc is this?

15:30 <ddevault> a godawful abomination I hacked together from bits of musl

15:30 <ddevault> https://git.sr.ht/~sircmpwn/mercury

15:30 <bslsk05> git.sr.ht: ~sircmpwn/mercury - sourcehut git

15:31 <zid`> why is this aligned to 16 :(

15:31 <heat> ddevault, why did you just not use musl?

15:31 <zid`> This loads the entire elf into memory at 0x4000000, then loads a part of the elf to 0x6013a8

15:32 <ddevault> heat: I am missing a lot of stuff upstream musl would need

15:32 <ddevault> such as a unix-like system

15:32 <heat> doesn't matter

15:32 <heat> hack it

15:32 <ddevault> I probably will, at some point

15:32 <ddevault> this also qualifies as hacking it

15:32 <ddevault> it's not libc's fault, it's something with my loader or these elf files

15:33 <heat> musl needs like arch_prctl (a way to set the TLS) and writev for a int main() { printf("Hello World\n"); while(1);}

15:33 <heat> if you want to return, add an exit_group-like syscall

15:33 <ddevault> my kernel does not even have write, let alone writev, nor does either syscall fit into its design

15:34 <ddevault> anyway, not important right now. Just hacking together enough shit to get doom running and then I'm throwing this all out and doing more important things

15:34 <heat> get a script to generate wrappers around stuff :)

15:34 <ddevault> my system is not even written in C anyway

15:34 * zid` hates magic ELF

15:34 <zid`> I'm a non-magic ELF supremacist

15:35 <heat> aout fan?

15:35 <ddevault> what is magic ELF

15:35 <zid`> it loads itself into memory rather than its contents

15:35 <zid`> the easiest way to achieve that being to pass one of the nmagic/omagic options usually

15:35 <ddevault> does this file do that? I don't think this file does that

15:35 Vercas has quit [Remote host closed the connection]

15:35 <zid`> readelf says it does

15:35 <ddevault> how do I get readelf to say so

15:36 <mrvn> I just wahsed my keyboard and now I'm blinded

15:36 Vercas has joined #osdev

15:36 <zid`> I'm looking at -a because I can never remember the options, it's under Program Headers:

15:36 <heat> this is standard

15:36 <zid`> I just hate them

15:36 <heat> most programs do this so you can look at the phdrs at runtime

15:37 <zid`> I am a racial supremacist afterall

15:37 <ddevault> I wonder if it's because .data is aligned on good.elf

15:37 <ddevault> page aligned, that is

15:38 <zid`> good.elf has an empty 00 PHDR that does.. something

15:38 <ddevault> yep

15:38 <ddevault> my loader is dumb

15:39 <ddevault> it assumes all sections are page aligned

15:39 <zid`> yea I did ask why it was aligned to 16

15:39 <zid`> I exclusively write and use dumb loaders

15:40 <zid`> so I only load the text+ and data+ load segments from the middle of the elf, page aligned

15:40 <ddevault> I could fix my loader, or I could align .data

15:40 <ddevault> hmm.....

15:40 <ddevault> one of these options takes about 30 seconds

15:45 <mrvn> If all the data only needs 16 byte alignment and the linker script does not add page alignment then you get that.

15:45 <mrvn> Usualy you page align so each section can be protectd by the MMU.

15:49 <heat> the linker script does usually add some alignment iirc

15:51 <ddevault> I must give gnu credit where it is due for teaching the world how not to write software

15:59 <heat> writing an elf loader is surprisingly hard

15:59 <heat> I needed a few years until I got mine 99% correct

15:59 <heat> it's probably still broken sometimes

16:00 <ddevault> I think it has something to do with the fact that the ELF specification (if you can even call it that) just kind of tells you what's in the file but not what to do with it

16:00 <ddevault> and while it's intuitive enough to make guesses that mostly or somewhat work, there's no good reference for all of the right things to do

16:01 <mrvn> and every arch add it's own rules

16:01 <heat> and there are a lot of corner cases

16:09 <zid`> yea that's why I have preferences

16:09 <zid`> but I also don't think it's a bad thing, the platform just defines what a valid ELF for it looks like

16:10 <zid`> so on my platform, program headers must refer to 4k aligned sections or I won't load it, simple

16:10 <zid`> That's not a bug in my loader, it's a constraint

16:19 <mrvn> but then you have to fix gcc which might be harder

16:23 blockhead has joined #osdev

16:43 wand has quit [Remote host closed the connection]

16:54 wand has joined #osdev

16:58 <heat> i forgot how verbose the arm arm is

16:58 <heat> yay

17:01 dennis95 has quit [Quit: Leaving]

17:04 <zid`> I verily failed to put my utmost unto rememberance of the manual of the ARM provided by the company ARM.

17:04 <zid`> 's most profuse verbosity*

17:04 <heat> arm

17:04 <heat> armarmarmarmarmarmarmarmarm

17:12 pretty_dumm_guy has joined #osdev

17:14 <heat> how do I get the address of a symbol relative to my PC?

17:14 <heat> ldr x0, =symbol doesn't work, that gets the absolute value

17:14 <heat> I want something relative to the PC

17:14 <j`ey> adr

17:15 <heat> adrl?

17:16 <heat> ok it doesn't exist? huh

17:16 <j`ey> adr/adrp

17:16 <j`ey> heat: "C3.3.5 PC-relative address calculation"

17:16 <heat> but that ignores the bottom 12-bits right?

17:17 <heat> in this case it works since it's a page table, but I wanna understand how I would do this if I needed the whole address

17:17 <j`ey> pair it with an add

17:17 <heat> no pseudo-instruction magic?

17:18 <mrvn> one ignores the bottom 12 bits and the other ignores the top bits

17:19 <mrvn> with clang it even works across sections, gcc can only do it within a section.

17:19 <j`ey> heat: https://github.com/torvalds/linux/blob/master/arch/arm64/include/asm/assembler.h#L185

17:19 <bslsk05> github.com: linux/assembler.h at master · torvalds/linux · GitHub

17:20 <heat> thanks, I hate it

17:20 <heat> ld.lld: error: arch/arm64/image.o:(.boot+0x64): has non-ABS relocation R_AARCH64_ADR_PREL_PG_HI21 against symbol 'boot_page_tables'

17:20 <heat> halp

17:20 <heat> on "adrp x0, boot_page_tables"

17:22 <zid`> do an ABS relocation then you big dummy

17:23 <heat> b-b-b-b-but how

17:23 <zid`> idk how do you normally do PIE on arm64

17:23 <zid`> do that

17:23 <heat> i unno

17:23 <zid`> ask gcc?

17:24 <heat> https://godbolt.org/z/8KsqhvWeh

17:24 <bslsk05> godbolt.org: Compiler Explorer

17:25 <heat> not different from what I'm doing

17:25 <zid`> what's a sxtw

17:25 <zid`> sexy taiwanese mail order bride?

17:25 <heat> sign extend w (32-bits)?

17:26 <heat> sign extend word

17:26 <heat> woohoo I can read arm-ese now

17:35 <heat> yeah huh I can't adrp anything

17:36 <heat> 😭

17:36 <zid`> does you need to put a special section into .text

17:36 <zid`> .text.common.adrp.nonsense.pls

17:36 <zid`> (literally just throwing out ideas)

17:36 <heat> not a bad idea

17:37 <j`ey> are they actually 4gb apart?

17:37 <heat> no

17:39 <j`ey> more than 4gb?

17:39 <heat> the address is high up there in the -2GB space

17:39 <heat> maybe that's the problem?

17:39 <j`ey> for both of them? the page table and the instruction?

17:40 <heat> yes

17:41 <mrvn> did you set the mcmodel=kernel?

17:42 * heat checks

17:42 <heat> there's no mcmodel=kernel for arm64

17:43 <heat> I have -mcmodel=small

17:44 <mrvn> try something else

17:46 <heat> no difference

17:46 <heat> and this is assembly too

17:48 <mrvn> does "ldr x0, =boot_page_tables" work?

17:49 <heat> yes

17:49 <heat> but I get the absolute address

17:49 <heat> I don't want that

17:49 <mrvn> and what code does it produce?

17:50 <mrvn> is the absolute address within PC range?

17:51 <heat> ldr x1, #0x40081070

17:51 <mrvn> and #0x40081070 contains?

17:53 srjek|home has joined #osdev

17:53 <heat> i dont know

17:54 <mrvn> what does objdump show?

17:56 <heat> that's the objdump

17:56 pseigo has joined #osdev

17:57 vdamewood has joined #osdev

17:58 <heat> should equal "ldr x1, 0xffffffff80001070 <stack_top>" (the last one was taken from qemu)

18:00 <heat> is it because the symbol values are absolute? Do I need to do something to the symbol beforehand?

18:03 SGautam has joined #osdev

18:03 JanC has quit [Remote host closed the connection]

18:03 JanC has joined #osdev

18:03 <mrvn> ldr should point to a word in the constants section that contains the address of the table

18:04 vdamewood has quit [Killed (erbium.libera.chat (Nickname regained by services))]

18:04 vdamewood has joined #osdev

18:04 <heat> 0x40081070 is not a valid address

18:11 <zid`> agreed

18:11 <psykose> bullies, all of you

18:11 <psykose> 0x40081070 can be valid if it wants to be

18:16 mzxtuelkl has quit [Quit: Leaving]

18:21 <mrvn> at least it isn't odd.

18:45 pseigo has quit [Ping timeout: 268 seconds]

19:00 zid` has quit [Ping timeout: 248 seconds]

19:09 foudfou has quit [Remote host closed the connection]

19:09 wand has quit [Remote host closed the connection]

19:09 foudfou has joined #osdev

19:10 lainon has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

19:12 * geist yawns at everyone

19:12 * vdamewood sticks an apple in geist's mouth

19:12 <heat> kinky

19:12 <vdamewood> Naw, kinky would be if it had a leather strap.

19:14 wand has joined #osdev

19:14 sprock has quit [Quit: Reconnecting]

19:15 sprock has joined #osdev

19:15 <heat> oh wise geist, can you help me understand why adrp doesn't work

19:16 <geist> sure

19:16 <heat> well, it don't work

19:16 <geist> i assume you've looked it up the manual and whatnot right?

19:16 <geist> are you writing it in asm?

19:16 <heat> yes

19:16 <heat> ld.lld: error: arch/arm64/image.o:(.boot+0x64): has non-ABS relocation R_AARCH64_ADR_PREL_PG_HI21 against symbol 'boot_page_tables' <-- "adrp x0, boot_page_tables"

19:16 <heat> this should work right?

19:16 <heat> the addresses are like almost next to each other

19:17 <geist> possibly. probably has to do with how the symbol is declared

19:17 <heat> what do you mean?

19:18 <geist> the instruction should work, so it's not the instructions fault, but something to do with how you're linking it, declaring it, etc

19:18 <geist> guess i need the full pastebin

19:19 <heat> https://github.com/heatd/Onyx/tree/master/kernel/arch/arm64

19:19 <bslsk05> github.com: Onyx/kernel/arch/arm64 at master · heatd/Onyx · GitHub

19:19 <heat> boot_page_tables is defined in linker.ld

19:19 <heat> (this is all being done in image.S, although it's not up to date in git)

19:20 <geist> aaah that's defeinitely something different. if you try to locally define it in a .S or .c fie does it work?

19:20 <geist> just to see at least

19:20 <geist> also you're using lld which tends to be more fiddly

19:20 <heat> no, because I tried with adrp x1, stack_top (which is defined in the same file, same section)

19:20 <geist> no it doesn't work?

19:20 <heat> yes, doesn't work

19:20 <heat> same error and everything

19:21 <heat> now, I'm thinking that it may be the address's fault? idk

19:21 <heat> because this is happening at -2GB

19:21 <geist> yeah i have honestly no idea what's up here

19:21 <geist> but you're also using lld which i never do

19:21 <geist> and i dont know how you're linking it. what mcmodel are you using?

19:21 <heat> i'll try bfd, give me a sec

19:21 <heat> -mcmodel=small

19:21 <geist> that might be your problem

19:21 <heat> why?

19:21 <j`ey> it looks similar to what linux has though..

19:22 <geist> sure, but the details matter. does linux link with lld?

19:22 <j`ey> yeah

19:22 <heat> it can

19:22 <geist> try bfd and see what happens

19:22 <geist> also mcmodel=small is definitely something i dont do, maybe try without it?

19:23 <heat> small is the default

19:23 <geist> try without it anyway, just start fiddling with variables here

19:23 <geist> alls i can do is look at what i do and observe it works, and then remove the deltas until it works, then work backwards from there

19:23 <geist> can objdump the .o files and see what the rel entries are i guess

19:24 <heat> oh wtf

19:24 <heat> it works with bfd

19:24 <geist> yah like i said lld is very fiddly

19:24 <geist> now question is what does the message *mean*

19:25 <heat> https://www.youtube.com/watch?v=_eRRab36XLI

19:25 <bslsk05> 'No one knows what it means, but it's provocative' by bentleyguten (00:00:06)

19:25 <geist> i'm looking at my start.S.o objdumped with `objdump -dr` and seeing tose kinda rel entries for sure

19:26 <geist> https://www.irccloud.com/pastebin/fHA2DjoT/

19:26 <bslsk05> IRCCloud pastebin | Raw link: https://irccloud.com/pastebin/raw/fHA2DjoT

19:26 <geist> so it's definitely pretty standard stuff at that point

19:29 <geist> i've seen lots of people ask about this relocation eerror in linker on the web but it's generally no answer

19:30 gxt has quit [Ping timeout: 268 seconds]

19:30 <geist> and yeah, you're right about mcmodel=small being default. it's not so small on arm64 to be honest: text and data can be 4GB, and located anywhere

19:30 <geist> it's a pretty relocatable and reacheable arch

19:30 gxt has joined #osdev

19:32 <heat> aww shit

19:32 <heat> I see

19:32 <heat> I needed the section marked as alloc

19:32 <heat> https://github.com/llvm/llvm-project/blob/16ca490f450ea3ceaeda92addd8546967af4b2e1/lld/ELF/InputSection.cpp#L915

19:32 <bslsk05> github.com: llvm-project/InputSection.cpp at 16ca490f450ea3ceaeda92addd8546967af4b2e1 · llvm/llvm-project · GitHub

19:33 <geist> aaaah so it was getting stripped from the link? are you using LTO?

19:33 <heat> no, it wasn't getting stripped from the link

19:33 <geist> just a non alloc section

19:34 <heat> yup

19:34 <heat> the linker thinks it's not getting loaded, but my linker script says otherwise

19:34 <geist> cool! well there you go. general rules of the road: if it's a symbol within the same file you can generally just use the 'adr' instruction if you want PC rel since adr can reach i think 22 bits or so

19:35 <geist> but if you try to use adr for an external symbol i think it'll fail

19:35 <heat> can't the linker do relaxation there?

19:35 <geist> and you know about the :lo12: thing right?

19:35 <heat> yes, add reg, reg, :lo12:sym

19:36 <geist> i dont think arm linkers relax, but the compiler generally is good about combining the 12 bit part with the instruction that actually uses it

19:36 <geist> ie, `adrp reg, symbol; ldr another reg, [reg, :lo12:symbol]`

19:36 <geist> so in that case there's nothing to really relax

19:37 <geist> but yeah you see in my above paste that it didn't relax away the add instructions, even though it was a nop

19:39 <heat> arm64 don't relax?

19:39 <heat> i had the impression they did

19:40 <geist> i dont think so. riscv does, but that's not arm64

19:41 <j`ey> in my code add+adrp is turned into adr

19:41 <geist> or at least not linux relaxations. when you get into that you have to emit a lot more relocations so the linker knows how to fix things up, and i dont think there are a suite of relaxation rels

19:41 <heat> it does, for tls at least

19:41 <geist> j`ey: in the same file?

19:41 <geist> well i'm thinking specifically about *linker* relaxations

19:41 <geist> within a single file where it knows where the target is sure

19:41 <j`ey> geist: yeah in the same file

19:42 <heat> yeah I can see the same thing

19:42 <geist> yah that i expect. linker relaxations are a different can of worms. you need to have a bunch of rel entries for effectively every branch in the file so the linker knows how to fix things up

19:42 <heat> the linker is doing this

19:43 <heat> image.o has the adrp + add, the linker transforms it into an adr

19:43 <geist> https://www.irccloud.com/pastebin/RjHIqBtK/

19:43 <bslsk05> IRCCloud pastebin | Raw link: https://irccloud.com/pastebin/raw/RjHIqBtK

19:43 <geist> etc

19:43 <geist> all those RELAX entries are there to assist the kernel

19:44 * geist nods

19:44 <geist> i dont see it here, but then maybe bfd ld can't relax and lld can?

19:44 <geist> are you passing -mrelax or whatnot?

19:45 <heat> https://github.com/llvm/llvm-project/blob/056d63938a6f2ea6af9fb0934702dc664ee784e5/lld/ELF/Arch/AArch64.cpp#L602

19:45 <bslsk05> github.com: llvm-project/AArch64.cpp at 056d63938a6f2ea6af9fb0934702dc664ee784e5 · llvm/llvm-project · GitHub

19:45 <heat> no

19:45 <geist> aaaah replaces it with a nop. that makes sense

19:45 <geist> it's the shuffling around of instructions that's the issue in general

19:45 <geist> but if it keeps the same code size then it's just a 1:1 replacement

19:46 <heat> it tries this optimisation for all R_AARCH64_ADR_PREL_PG_HI21 and R_AARCH64_ADR_PREL_PG_HI21_NC

19:46 <geist> yah dont think i've seen bfd do that, but that falls into linker optimization

19:46 <j`ey> yeah adrp+add -> nop+adr

19:46 <geist> makes sense. whether or not an adrp+add -> nop+addr is much of a win i dunno, since AFAIK most modern arm cores merge adrp+add in the instruction decoder

19:46 <geist> but might be a win for lower end cores

19:47 <geist> if you read the optimization guides for the at least 7x or higher cores they have a bunch of instruction fusing with things like that

19:47 <heat> they also do a bunch of relaxation for tls, etc

19:47 <geist> also the usual cmp + branch stuff

19:47 <heat> bfd does this as well

19:47 <heat> (not sure if it does this particular relaxation, probably not)

19:47 <geist> yah i just checked a compile of LK and didn't see the optimization

19:48 <geist> (with bfd)

19:48 * heat coughs https://github.com/littlekernel/lk/pull/322

19:48 <bslsk05> github.com: [build] make LK buildable with LLVM/Clang by pcc · Pull Request #322 · littlekernel/lk · GitHub

19:48 <heat> :)

19:48 <geist> yea i know, but it breaks all the other arches

19:49 <geist> that's a common trait of folks that submit patches to LK

19:49 <geist> 'i have this narrow thing i care about so i hacked your build system to do this'

19:49 <geist> but yes i should look at it

19:49 <geist> pcc has a few other things oo

19:49 <heat> mine works for every arch

19:49 <heat> and I fought for that

19:49 <geist> right, i have to hack his patch to do it

19:49 <heat> microblaze is insaaaaaane

19:50 <geist> wait which is 'mine'?

19:50 <heat> https://github.com/littlekernel/lk/pull/324/

19:50 <bslsk05> github.com: [build] Use CC as a linker by heatd · Pull Request #324 · littlekernel/lk · GitHub

19:50 <geist> ah yes

19:50 <geist> i should look at that one

19:51 <geist> they're basically two conflicting patch sets. but it's clear i need to provide some support for clang. i've seen like 4 variants of that patch since i know LK is used internally at $company for a bunch of projects, and about half of them have hacked the build system to use clang

19:51 <geist> but virtually 100% of those internal hacks are 'i only care about 1 or 2 arches, so i just hack it'

19:52 <geist> another common trait of internal hacks are 'jam some arch code right in the middle of kernel/thread.c' or whatnot

19:52 <heat> are they conflicting?

19:52 <geist> like works for them but unacceptable

19:52 <geist> possibly. but i should look at it closer. i'll do so in aminute

19:52 <geist> honestly i just really dont care about clang, but i gotta get my act together

19:52 <geist> i mean i do care about clang, but i dont care in the sense that it's not *interesting* to me

19:53 <geist> as far as LTO, how far did you get the link working?

19:53 <geist> i've over the years fiddled with LTO on LK and there's always some fatal flaw that kills it in the end

19:53 <heat> i can't remember

19:53 <geist> usually runs afoul of the linker and GCed sections and whatnot

19:54 <heat> I think I started sticking attribute((used)) on a bunch of places but then gave up?

19:54 <geist> or works on one arch but totally broken on the other ones, etc

19:54 <geist> yah same

19:54 <heat> i should look at that again

19:54 <geist> it ends up being very fragile such that i'm not sure i'd want to officially have it in there

19:54 <heat> also issue #318 which I promised I would look at :)

19:54 <geist> but again, should suck it up and do some work

19:54 zid has joined #osdev

19:54 <j`ey> that VBAR trampoline thing is pretty funny

19:54 <geist> PCC also has that 'fault the mmu to skip trapoline thing' yeah

19:54 <zid> nice, scared the shit out of myself

19:55 <zid> had a 15 min powercut, took the time to give my machine a dust and swap some dimms around

19:55 <zid> machine is now triple channel and the bios crashes when entered

19:55 <mrvn> geist: clang is bad with `mls` or `muls` multiply-and-subtract.

19:55 <zid> swap the dimms back, still the same issue, think I've killed the slot

19:55 <geist> j`ey: i'm torn about that. it's nice in that it removes a bunch of stuff, but then it makes things slightly less flexible (cant use the trampoline post boot to read low memory) and i'm worried there's some machine somewhere that that isn't kosher

19:55 <gamozo> zid: Don't forget to rotate your bioses too!

19:55 <zid> eventually pull them both and I'm on.. single channel!? one of the *other* dimms, that I hadn't changed, had popped out

19:56 <geist> i *think* n at least one machine i use the trampoline in early boot to read the device tree or something out of 'low' memory

19:56 <gamozo> Otherwise the bios can get worn in one hot path, which is bad for the BIOS

19:56 <mrvn> geist: i'm not sure clang on arm64 is as good as gcc

19:56 <zid> all good now, except the power cut nuking all of mirc's settings as usual

19:56 <j`ey> geist: then have code that does trampline page tables for 4k, 16k, 64k!

19:56 <mrvn> worse on arm

19:56 <geist> j`ey: oh i already fixed it in zircon. it's an easy fix

19:56 <geist> so i should do that anyway

19:56 <geist> 4k pages and remove the code for 64k

19:56 <mrvn> (or godbold simply has old compilers)

19:57 <j`ey> everything supports 4k, rite

19:57 <geist> so i'll probably do that in a minute

19:58 <heat> mrvn, clang is android's compiler

19:58 <heat> so I hope so

19:58 <geist> re: dimms. i have a dimm start generating a bad memory cell the last few days on one of my work test machines. fiddled with GRUB_BADMEM and it actually works

19:58 <geist> heat: it's pretty mixed. we use clang on fuchsia too

19:58 lainon has joined #osdev

19:58 <mrvn> heat: try compiling int foo(int a, int b) { return a * a - b * b; }

19:59 <heat> we use clang at work too

19:59 <gamozo> clang codegen is often really inconsistent and it makes me sad

19:59 <geist> it has gotten better in the last few years. i'd probably say arm64 clang support is about par but it seems a little wonky the closer you look at it

19:59 <geist> exactly, the codegen is inconsistent

19:59 <heat> turns out V8 can't really reliably compile with a compiler that isn't the one it ships with

19:59 <geist> but in general it's better over time

19:59 <geist> it's pretty close at least

19:59 <mrvn> On arm64 it does neg + multiplay-and-add instead of multiply-and-sub

19:59 <mrvn> on arm it does mul, mul, sub

20:00 <raggi_> Clang is much better at complex c++

20:00 <geist> thing is comparing micro code sequences from compilers is mostly a fools errand. one can always pick and choose something one compiler does than the other

20:00 <mrvn> raggi_: but laggs behind in c++23

20:00 <geist> the key is overall how does it do, and i think they're basically on par nowadays

20:00 <mrvn> geist: missing an opcode completely is pretty bad

20:00 <geist> vs say 3 or 4 years ago when we were first really switching to clang

20:00 <raggi_> mrvn: perhaps, but if g++ ooms I can't tell ;-p

20:01 <geist> mrvn: shrug. that's just an optimizatino path that someone hasn't typed in, if that's the case

20:01 * geist waves at raggi_

20:01 <mrvn> geist: it's odd though that someone typed in multiply-and-add. I would expect both or none.

20:01 <raggi_> Howdy :-)

20:01 <geist> and as raggi_ says i think clang does a pretty good job with large complex c++ codebases

20:02 <mrvn> large complex c++ codebases live a lot on the STL. You have to implement that to be compiler / memory friendly

20:02 <raggi_> Lldb va gdb is similar too, and the symbolizes performance

20:02 <geist> we use it at work because from a feature and maintainability point of view it's the compiler with a future, even if it's codegen started off worse

20:03 <mrvn> The inconsistency of clang worries me more than bad code in the end.

20:03 <geist> but as someone says it's micro codegen is just... wonky. its like an AI designed it's codegen and it's optimized for strange cases

20:03 <geist> but in general performs about the same because modern cpus are pretty good at eating garbage

20:04 <zid> Anything performance critical gets written in streams of avx intrinsics anyway :p

20:04 <geist> whereas i tend to be able to 'see' the hand tuned code generator from gcc. for better or worse, it tends to be more stable

20:04 <mrvn> I notice that with ocaml. It doesn't optimize at all, just generates good, simple code from the start and is usualy the same speed as c/c++

20:05 <gamozo> https://rust.godbolt.org/z/sqj7q9fPh I ran into this the other day and it's such weird codegen

20:05 <bslsk05> rust.godbolt.org: Compiler Explorer

20:05 <gamozo> Does the same compare twice, even though the flags still are active from the same compare above. The compare above doesn't at least jump past the second compare

20:05 <geist> it feels a bit like clang is the reuslt of test driven optimizations. you can end up with weird decisions, but at the end of the day if it benchmarks >= older changes and you maintain that rule forever you end up driving it's optimizations into some local minima

20:05 <gamozo> just so strange

20:05 <geist> even if it doesn't 'make sense'

20:05 <zid> uh oh machine is acting funny, gaah

20:05 <zid> something's wrong

20:06 <mrvn> I run into bubblesort the other day: gcc -O2 and clang -O2 is about the same. gcc -O3 is 5 times slower, clang -O3 is twice as fast.

20:06 <gamozo> I feel like clang gets macro really well, but micro really poorly. I often see clang loading registers and then never using them, or overwriting them the instruction after. Things that I don't think should be possible "ever"

20:06 <geist> vs stepping back and making good clear human centric decisions on how you apply optimizations

20:06 <zid> no applications will start, neat

20:07 <geist> gamozo: yah precisely. and if all the 'accept this change' decisions in the project itself are based on some sort of large test cases, it'll still pass the test because in the long run the large scale optimizations usually win over small ones

20:07 <raggi_> The thing about rust codegen for cases like that is you have to remember it's a tall lazy iter apj that's been inline folded at a high level then optimized later

20:07 <geist> so you end up with this. strange micro decisions that as long as they dont hurt the overall picture, make it in

20:07 <mrvn> or std::optional<double>. Instead of just writing the bool and double for a std::optional it messes around with xmm0 registers constructing the thing on the stack and then copying it out.

20:08 <geist> but... all this aside i've watched over the last few years clang get better and better with micro decisions, presumably as folks find misoptimizations, file bugs, and someone fixes em.

20:08 <mrvn> gamozo: what I don't get is that the codegen doesn't do a final pass to eliminate dead code

20:08 <geist> so it's one of those cases where eventually the project with the mindshare wins

20:09 <gamozo> mrvn: I would hazard a lot of the things I see are on teh backend, I'm not sure. I'd imagine these "issues" don't exist in the IL but rather the backend

20:09 <geist> but i have more trouble using it for smaller, less sophisticated cpu cores where actually the micro placement of instructions does kinda matter

20:09 <mrvn> can you explain that rust code?

20:09 <raggi_> The advantage clang has now is that it sucked a huge volume of the professionally funded toolchain teams, so it gets a ton of full time input

20:09 <geist> vs a high end superscalar machine that generally reduces wonky codegen to more or less the same thing as good codegen (in a lot of cases)

20:10 zid has quit [Read error: No route to host]

20:10 <geist> raggi_: yah basically it doesn't matter if it's better or not, it's simply where the money is

20:10 <gamozo> mrvn: It requests the u8 at index 5, which returns an Option<&u8>. It can be None if it's out of bounds. I then apply a map to deref the inside to convert it to an Option<u8> instead of returning a reference to the byte

20:10 <gamozo> It's just indexing a slice, but with out-of-bounds checking not resulting in a panic

20:10 <heat> "the money" considerably backed off of llvm when the standard refused to break the ABI

20:11 <raggi_> did it?

20:11 zid has joined #osdev

20:11 <heat> c++20 compliance is still far off

20:11 <heat> yes

20:11 <raggi_> Apple, Google, meta, etc are still invested heavily

20:11 <heat> don't get me wrong, they're still on it

20:11 <mrvn> gamozo: line 4 loads the byte. But what does the rest do in the asm?

20:11 <heat> but it seems like most of the invest is going into rust instead

20:11 <zid> That was a fun crash, filesystem basically disappeared, I could interact with a bunch of stuff but programs wouldn't start and some actions locked up various programs

20:11 <zid> then 30 seconds later, all the clicks I did finally caught up and a bunch of programs/files opened

20:11 <zid> then it immediately bluescreened

20:12 <geist> I'll count the money next time i go into the office and dive into the large pit of cold coins, Scrooge McDuck style

20:12 <raggi_> I haven't heard of any mega corp or mega product teams switching to a non clang alternative

20:12 <zid> My pc is now officially haunted

20:12 <gamozo> mrvn: THis fnuction is not marked extern, thus it uses Rust's (undefined) calling convention. It would appear it uses two return values. al being the bool indicating if the option is Some or None (present or not), and dl holds the value contained in the option

20:12 <geist> zid: if you want to keep your data you might want to be a bit more careful, ou can easily get a fata FS corruption if you have a known buggy machine

20:12 <heat> raggi_: not a non-clang alternative, but a non-C++ alternative :)

20:12 <mrvn> gamozo: I figured the al for that too

20:12 <heat> which is understandable, but sad

20:12 <zid> geist: yea has happened before, ntfs is even especially bad for it imo

20:13 <gamozo> Without marking extern, Rust uses it's own (undefined) calling convention. Which allows the compiler to change up things as it sees fit. It's kinda interesting/neat

20:13 <mrvn> gamozo: and "cmp rsi, 6"?

20:13 <j`ey> checking the length of the slice/array

20:13 <j`ey> to see if array[5] is ok

20:13 <raggi_> heat: I ahevnt really heard of that either. Ms and AMZN are investing heavily in rust, and it's slowly making some inroads at Google but the c++ wall is solid too. That's all llvm investment

20:13 <gamozo> &[u8] internally is a (void*, size_t). It's checking the "second argument" to the function, which is the size_t (length of the vector), to see if it's in bounds

20:14 GeDaMo has quit [Quit: There is as yet insufficient data for a meaningful answer.]

20:14 <gamozo> this function, from an ABI perspective, is the same as fn foo(void*, size_t), and thus rsi is the second arg

20:14 <mrvn> gamozo: so why isn't line 7 between 2 and 3?

20:15 <heat> raggi_: I dunno, maybe I heard wrong. But even here at cloudflare lots of things are getting written in rust or go and not C++. Of course, no one is really rewriting core stuff to rust

20:15 <gamozo> bad codegen :)

20:15 <raggi_> heat: I think gos been taking more away from dynlangs than from c++

20:15 <mrvn> gamozo: As to the rust source I would rather have the function prototyle say |a| >= 6

20:16 <geist> it's definitely happening in fuchsia a lot. seems most new subsystems are implicitly rust based

20:16 <geist> much to the chagrin of folks trying to build the OS

20:16 <gamozo> In reality, they probably move the return value closest to the ret, to avoid reserving such a critical register such as 'al' in the middle of the function (and having to preserve it to the end). That's probably _why_ it's at the end. And another pass just didn't move it around from there

20:16 <raggi_> geist: yeah, fuchsia is a rebel in Google in general

20:16 <j`ey> gamozo: it's the same codegen for aarch64, where regs arent so important

20:17 <raggi_> geist: by some measures Ms is moving faster here, with official windows sdks in rust already

20:17 <raggi_> And they've written two network stacks in rust

20:17 <heat> what

20:17 <heat> two network stacks?

20:17 <raggi_> Yep

20:17 <heat> in the kernel? or user space?

20:17 <mrvn> gamozo: buy more registers :)

20:18 <gamozo> mrvn: But they just killed IA64

20:18 <geist> says the person that insists on using arm32

20:18 <raggi_> Cloud things with blurred lines

20:18 <raggi_> Parts run on nics etc

20:19 <heat> how do you get an itanium machine

20:19 <geist> might be able to find one on ebay. but it'll almost certainly be a huge ass machine

20:19 <geist> since there werne't a lot of workstations made over the years, and theones that were were still huge ass workstations

20:20 <geist> https://www.ebay.com/itm/224936107527?hash=item345f3c9e07:g:Z8kAAOSwdw5iVxYF looks like would be a fairly cheap one

20:20 <bslsk05> www.ebay.com: HP Integrity BL860c Server Blade Intel Itanium 9140m 1.66GHz 16GB RAM | eBay

20:20 <heat> oooooooooh

20:20 <heat> seems nice

20:21 <heat> I kinda want one

20:21 <heat> why? the lulz

20:21 <geist> yah i had one for a while. was fun. kinda.

20:21 <gamozo> Given the height of that server, and the TDP of IA64, I'm gonna say that this bad boi is _loud_

20:21 <geist> i mostlygot rid of it because it was physically huge. so not worth the effort to keep around

20:22 <zid> do servers come in quiet?

20:22 <zid> if your server fan doesn't walk across the desk you got scammed

20:22 <gamozo> I only buy 2Us+ to get quieter servers. It's not nearly as bad as 1Us

20:23 <geist> https://ark.intel.com/content/www/us/en/ark/products/31794/intel-itanium-processor-9140m-18m-cache-1-66-ghz-667-mhz-fsb.html looks dual core

20:23 <bslsk05> ark.intel.com: Intel Itanium Processor 9140M 18M Cache 1.66 GHz 667 MHz FSB Product Specifications

20:23 <heat> Intel® 64: No

20:23 <heat> lie!

20:24 <gamozo> It has Intel 64... just no AMD 64 ;)

20:24 <geist> it's a montvale, so at least a montvale

20:24 <geist> er i mean itanium 2

20:24 <gamozo> Imagine not writing the 64-bit extension for your own architecture

20:24 <gamozo> Kinda cringe as a company

20:24 <heat> imagine making physical processors

20:24 <geist> heat: right, itanium 2 dropped x86 backwards compatibility support so no intel 64

20:24 <heat> this message is sponsored by arm gang

20:24 <geist> and i'm not sure itanium 1 ever had x86-64

20:25 <heat> intel 64 is itanium

20:25 <gamozo> Just write the spec and have everyone else figure out how to fab it with their own proprietary extensions! Arm 4 life

20:25 <geist> i assume you're joking, because it isn't at all

20:25 <heat> yes it is?

20:25 <zid> okay that took longer than I wanted to find

20:25 <zid> https://youtu.be/D5UKs9H57U8?t=3

20:25 <bslsk05> 'Delta fans jet' by Tam3n (00:00:37)

20:25 <heat> isn't x86_64 IA-32e?

20:25 <gamozo> (Actually MIPS beat ARM in this. I'm pretty sure every router chip I've looked at that uses MIPS uses absolutely proprietary opcodes and breaks the standard it's great)

20:25 <geist> no. intel 64 == x86-64

20:25 <geist> ia64 == itanium

20:25 <gamozo> ^

20:25 <geist> they are completely different things

20:25 <heat> wtf

20:26 <heat> I prefer IA-32e

20:26 <zid> they changed name like 3849 times

20:26 <heat> much clearer naming

20:26 <zid> 32e was in use for a bit I swear

20:26 <geist> x86-64 == ia32e == amd 64 == intel 64 == x64 == probably a few others

20:26 <heat> it is the official name in the SDM I think

20:26 <geist> itanium == ia64, period

20:26 <zid> ia64 is itanium though

20:26 <zid> I'd also call x64 arguably itanium but more people use it to mean amd64

20:26 <geist> that's where the ia32 name came from they had to backronym it when they invented the ia64 moniker

20:27 <heat> the proper way to refer to it is x86_64 (WITH THE UNDERSCORE)

20:27 <geist> since at the time there was not going to be a x86-64

20:27 <heat> anything else is heresy to me

20:27 <zid> x86_64 definitely has an underscore yep

20:27 <gamozo> heat over here in their own dimensions

20:27 <psykose> amd64 is one less character hence better

20:27 <mrvn> They should have kept amd64 as arch name

20:27 <zid> intel architecture 32, and amd 64, it fits

20:27 <zid> considering who designed which

20:27 <raggi_> x64 is better beause it's easier to scan read next to arm64 xD

20:28 <geist> it was going to be simply ia32 for x86 and ia64 for itanium and all future 64bit stuff would obviously be derived from itaniu,. i remember that koolaid flowed heavily all over the industry. it was a great new time to switch to a Real Architecture

20:28 <zid> what was the crazy i4389 or whatever arch

20:28 _xor has quit [Quit: WeeChat 3.4.1]

20:28 <heat> itanium was the peak of humanity

20:28 <zid> can we have that instead for ia64

20:28 <geist> raggi_: yah exactly. that's why i relented on the battle of what to call the arches in fuchsia. in exchange for arm64 i yielded x64

20:28 <heat> we've gone downhill from thereon out

20:28 <gamozo> Riscv is the future

20:28 <geist> because aarch64 drives me nuts

20:28 <raggi_> yeah

20:29 <raggi_> i hate having to type eabi too

20:29 <geist> every single time i see it i think of Samuel Jackson saying 'do i have a stutter?'

20:29 <mrvn> geist: you stutter :)

20:29 <heat> aarch64 and ia32e sounds great to me dunno what you're on about

20:29 <geist> ARM is many things but inventors of good acronyms they are not

20:30 <mrvn> geist: ARM ARM?

20:30 <heat> sxtw

20:30 <geist> the ARM ARM so you can program your ARM in your ARM computer

20:30 <geist> though officially i think they changed it to arm instead of ARM a while back

20:30 <geist> because rebranding

20:30 <heat> a r m

20:31 <gamozo> Lowercase is very trendy right now

20:31 <mrvn> my robot arm has an arm controller so you have to read the ARM ARM ARM

20:31 <zid> ARM: Advanced Recursive Manual

20:31 <raggi_> hired a new brand lead, switched case, font and logo

20:31 <geist> but i like to say ARM because it makes it clear it's te company/architecture, and not just another word

20:31 <heat> ARM: ARM reference manual

20:31 <gamozo> How can you justify the reorg unless you change random things?

20:31 <geist> and it's pretty easy to type with right shift, so no biggie

20:31 <heat> yall use the right shift?

20:32 <raggi_> yes

20:32 <heat> nah nah nah nah nah what

20:32 <geist> i almost exclusively use right shift, though i'm trying to mix in the left shift more dynamically

20:32 <heat> i don't think i've ever used the right shift

20:32 <geist> but years of typing has be pretty much fixed on right shift, left ctrl

20:32 <raggi_> i started using right shift more after i stopped flying between eurpean and us regions regularly

20:33 <raggi_> before that, right shift would change shape, which made it unreliable

20:33 <geist> is ther eno right shift on european layouts?

20:33 <heat> there is

20:33 <gamozo> There's a right shift key?

20:33 <geist> oh differnet shape, yeah

20:33 <raggi_> it's there, but there's often an extra key there, making it shorter

20:33 <heat> thank you gamozo

20:33 <geist> at least for US keyboards it's at least pretty symmetric. there's a right and left shift of all the modifiers

20:34 <raggi_> i rarely use right ctrl

20:34 <heat> my right shift and right ctrl are longer

20:34 <raggi_> mostly for a one handed solute

20:34 <heat> maybe this is my gamer hands talking

20:34 <geist> yah but when i do it it's nice

20:34 <heat> all the left buttons are closer to wasd

20:34 <geist> like when i dynamically mix in a left shift because i'm typing on the right side of the keyboard, etc

20:34 <gamozo> My right shift, right ctrl, right alt, right menu, and right super are all fully textured on my ~15 year old keyboard

20:34 <geist> it feels good when yuor muscle memory pulls off something complicated like that

20:35 <raggi_> i type quite poorly, but i'm ok with that, because i don't get bad rsi from typing

20:35 <raggi_> every time i try to type "correctly" i hurt myself

20:35 <heat> do you mean %rsi

20:35 <geist> I think i use right shift since for the most part i like to yse the opposite modifier for the thing i'm typing, and statiscally speaking i think more capitalized first letters (in english at least) are on the leftish side of the keyboard

20:35 <geist> except I of course

20:35 <heat> hehehehehehehe

20:36 <geist> but then i usually dont bother capitalizing I

20:36 <raggi_> i only capitalize Serious Business

20:36 <raggi_> and THREADRIPPER

20:36 <heat> srs biz

20:36 <geist> EL TORITO

20:36 <heat> EL TORITO

20:36 <raggi_> :)

20:36 <gamozo> raggi_: Yeah, I never formally learned how to type. Which I think means I learned naturally instead of what some person taught me. And I've never had RSI

20:37 <geist> though now that i think about it a lot of programming keys are shift on the rght side: ()_+", etc

20:37 <geist> and i think i justone hand those

20:37 <raggi_> yeah, i do too, it's probably the most dangerous part of my typing, and part of why i enjoy languages wtih punctuation elision grammars

20:38 <geist> bu it's also why i dont feel particular hatred for _ like a lot of folks do

20:38 <geist> to me it's just another key because i just one hand shift - it

20:39 <raggi_> where i have a choice i prefer snake case to camel case, i don't really hate _, but in cli's etc, i prefer -

20:39 <geist> 100% i really dont like camel case, but it's not a hill to die on

20:39 <raggi_> google style --foo_bar flags are just unpleasant to my eyes

20:40 <heat> oh yeah those are horrible

20:40 <zid> yea fuck that

20:40 <heat> they're just not right

20:40 <mrvn> __help__ any better?

20:40 <geist> tiny admission: i'm starting to not terribly hate the trailing _ for member stuff in C++ classes

20:40 <zid> I'm already on - from typing --

20:40 <heat> oh yeah I like the trailing _

20:40 <geist> i thought it was terrible at first, but honestly i'm starting to stockholm symdrome that

20:40 <zid> also good news I'm about to have an ocular migraine

20:40 <raggi_> geist: that's how it happens, that's why python is heavily infected now

20:40 <geist> it looks terrible with foo_->bar_->baz but

20:41 <heat> the good part of the trailing _ is that you can have a class c { private: char *data_; public char *data() {return data_;}}

20:41 <geist> i was working on an older project of mine where i ws using mFoo and i'd prefer foo_ over that now

20:41 <geist> or m_foo

20:41 <mrvn> geist: you don't use _ for arguments that confcit with members?

20:41 <geist> m_foo actually doesn't bug me too much but still

20:41 <geist> mrvn: yeah but those are leading

20:41 <heat> lpFoo

20:41 <mrvn> nah, leading is a big nono

20:41 <geist> sure but fuck the man

20:42 <raggi_> unpopular opinion puffin: if you need those sigils, there are other code problems

20:42 <geist> leading in global scope sure. but if it's some local thing it doesn't matter if you use leading

20:42 <heat> C standard: "nooo only the implementation can use leading underscores" geist: "I am the implementation"

20:42 <mrvn> this->foo for member variables sucks too

20:42 <geist> if it conflicts with global scope the compiler will tell you

20:43 <geist> but yeah in general i try to avoid args conflicting with things. sometimes if it's a single liner i dont care that much

20:43 <geist> void set_foo(int _foo) { foo = _foo; } etc

20:43 <mrvn> constructors and setters are a problem there

20:44 <geist> but usually if it's a trailing _ for members the problem goes away

20:44 <geist> void set_foo(int foo) { foo_ = foo; }

20:44 <mrvn> void set_foo(int foo_) { foo = foo_; }

20:44 <heat> the whole problem would go away if we STLified all our C++ code

20:45 <geist> nooooo!

20:45 <heat> int _Foo;

20:45 <heat> looks beautiful

20:45 <raggi_> then you just get constructor variance problems

20:45 <geist> honestly (unpopular opinion) what i really hate is over use of <algorithm>

20:45 <geist> where you get that kinda code that everything is an amalgamation of existing algorithms, instead of just writing what you mean

20:45 <raggi_> c++ stdlib is kind of apalling

20:45 <geist> i get it, i understand why that's 'better' in a lot of cases, but i dont think it helps with readability at all

20:46 <heat> or <functional>

20:46 <heat> or fucking <array>

20:46 <mrvn> I'm missing a range-for with index

20:46 <geist> yah

20:46 <heat> you also import a huge header full of crap

20:46 <heat> making your compile time slower

20:46 <geist> like i dunno i was staring at some code the other day that took me a while to grok because it was some complicated scheme where it was using std::optional and std::uh what is it that calls one of N functions based on what the input is?

20:46 <mrvn> or looping over 2 or more things in parallel

20:46 <geist> i get it, it's kinda neat. rust has shit like that

20:47 <geist> but... the codegen is atrocious, and it was used for a case where the number of std::optional cases is like 2 or 3

20:47 <mrvn> geist: std::variant

20:47 <mrvn> and lambda with overloads

20:47 <raggi_> geist: a friend roped me into helping them overcome issues in a codebase full of boost variant and boost reflect recently, and i wanted to smash things

20:47 <geist> std::variant yeah. so you end up with a lot of boilerplate to define all of this stuff so that you just end up with functionally a 3 entry switch statement

20:47 <raggi_> ^^

20:47 <raggi_> that, very much that

20:47 <heat> C++ codegen will never be as good as rust's on that stuff because all of C++'s codegen is built on classes and templates and all that crap

20:48 <geist> but it's hard to argue against it in a code review, because you're telling someone to undo some 'beautiful' code that they just spent this time on

20:48 <geist> to do something 'dumber'

20:48 <raggi_> the argument "this will reduce cost of extension in the future" is nebulous, and sadly often bs, but hard to resolve in a review

20:48 <geist> the best argument i have against it is std::variant *can* generate a vtable and a bunch of functions. there's no guarantee it'll inline it

20:48 <mrvn> c++ lacks a match statement and the function overloading needs too much boilerplate

20:49 <geist> what i really dont like is it's hard to grok precisely which variant will be called in any given case. it puts the onus on the reader of the code to manually match what goes where

20:49 <raggi_> it gets worse

20:49 <geist> but this is the sort of thing i feel like a stick in the mud about a lot, since i feel like i'm generally being outgunned at work with this sort of code being more the norm

20:49 <raggi_> if you use gcc, and there are conflicts, by default in some cases it'll "jsut pick one"

20:50 <geist> because it's better and more templateable and thus more unit testable, etc

20:50 <raggi_> clang at least will error on that by default

20:50 <mrvn> raggi_: it complains when it is ambiguos

20:50 <raggi_> not always

20:51 <mrvn> c++ has strict rules about it. Nobody understands them but they are there

20:51 <raggi_> i literally fixed one of these cases about two months ago

20:51 <gamozo> C++ has rules?

20:51 <raggi_> xD

20:51 <zid> I can't see a damn thing

20:51 <heat> that's assuming that compiler writers interpret the standard the same

20:51 <zid> thank god I can still type and annoy you all regardless

20:51 <geist> baiscally i like enough C++ to give me better type safety, better control over memory allocation (safe pointers, etc), and ability to structure object oriented code the way I was probalby already basically writing it in C

20:51 <gamozo> I thought C++ just allowed compiling of /dev/urandom. The fact that C++ "looks the way it does" is just random chance that the undefined behavior lines up that way

20:51 <geist> and that seems like a nice balance, especially for lower end hardware

20:52 <heat> geist, +1

20:52 <raggi_> geist: i know you don't like rust much, but you might actually enjoy nostd from that perspective

20:52 <heat> my C++ is just C with classes, some light templates and RAII wrappers

20:52 <raggi_> as that's mostly all it provides

20:52 <geist> when you use a lot of std:: just to use it i think it starts to quickly spiral into some other language that really does not help on the readability front

20:52 * kingoffrance adds "unpopular opinion puffin" to aethernet / unix "a gnome buffers your characters" / pixel faeries / daemon/dragon / pypy etc. list of technical terms

20:52 <geist> raggi_: yah i dont really dislike it i just need to put more effort into it

20:53 <geist> my main gripe is its really slow to compile large projects. feels like it exponentially scales up the compiling side of things

20:53 <raggi_> that's mostly choice side effects, and c++ does the same

20:53 <geist> yah but it feels like the exponential scale factor is somewhat higher

20:53 <raggi_> if you spent time wiht it in nostd with your kind of code style it'll mostly perform like c

20:53 <geist> like 3.5 instead of 1.5 or something

20:53 <raggi_> i think the exponential is a perception distortion

20:53 <gamozo> Rust is fine for build times, it's just really bad decisions from third party stuff. I have multiple 50-100k LoC rust codebases that build in 1-2 seconds

20:54 <gamozo> But the second I bring in a third party crate it goes to 30-60 seconds :sigh:

20:54 <mrvn> geist: mostly a problem of templates and sfinae

20:54 <geist> perhaps? we have tools within the tree now that are taking 35 *minutes* of compile time

20:54 <raggi_> e.g. slow parts in the fuchsia code base when i left was like ffx, which links like 50% of the fidl code in teh code base

20:54 <geist> and 8GB of memory

20:54 <geist> yep. ffx. it's gotten a lot worse, and i dont see how that can scale

20:54 <raggi_> no other program in the tree does that, and wall time wise it's doing pretty well

20:54 <gamozo> Yeah, that's absurd to me. That being said, I know it happens. I've never had a problem with my own code bases. I think there are a few big crates out there that are doing some nasty things that serialize the build and heavily use generics

20:54 <geist> i keep raising the flag but i dont think there's any will/way to fix it

20:54 <raggi_> if it was split into hundreds of programs, it woudl take similar time

20:55 <raggi_> you just wouldn't see it, because it'd be 32 one minute compiles, rather than one 32 minute compile

20:55 <geist> problem is it also needs to be recompiled/linked at the drop of a hat since it links with everything

20:55 <raggi_> right, so that's a build system/toolchain problem

20:55 <gamozo> I'd hazard rust build times being bad are less about rust build times, and more about the crate-culture which makes it so easy to pull in massive deps just to use 1-2 simple functions

20:55 <geist> and now the build system depends on it since it's used n a step, so yo ucan't just skip it either

20:55 <raggi_> treating it like a c compiler when you optimize it doesn't work out well

20:55 <gamozo> You can just invoke rustc and generate objects, and then link them together at the end like C

20:55 <geist> but anyway that's work, which is not weekend

20:55 <geist> but you're right, it'snot fair to penalize the language for a bad case

20:56 <raggi_> gamozo: fuchsia does that, but that's not entirely true

20:56 <gamozo> If you're willing to write rust like -ffreestanding C/C++, I have _never_ had build time issues at all

20:56 <geist> hell, zstd.c still takes like 2 minutes to compile, goma just helps with that

20:56 <geist> and that's *C*

20:56 <gamozo> Correct, you need forwards declarations

20:56 <heat> there was a change a few weeks back that started verifying some stuff after building fuchsia and that stopped 8GB + 8GB systems from compiling fuchsia

20:56 <heat> nuts

20:56 <geist> heat: it was that. ffx

20:56 <geist> it keeps accidentally bumping it's size

20:56 <raggi_> yeah, doesn't entirely surprise me

20:56 <raggi_> the fact that it's doing lto doesn't help

20:57 <geist> i had it blowing out my /tmp the other day, since at some phase of rustc apparently rustc creates a dir in /tmp and writes out copies of all the input rlibs

20:57 <geist> and it was 6.8GB

20:57 <raggi_> rust defers a lot to link time, and none of the fuchsia toolchain optimizations optimize that yet

20:57 <gamozo> That being said, I do think Rust could get 10-100x build time speedups without sacrificing codegen. It's just not super parallel. It also serializes pretty heavily at crate-levels when you hit generics, where in reality it should build as much as it can, until it needs it's generic dependencies to be compiled

20:57 <heat> being parallel doesn't help IMO

20:57 <raggi_> gamozo: that would bloat memory usage further, in a significant way

20:57 <heat> most build systems expect a single job to only use a single thread

20:57 <raggi_> gamozo: and make linking even more expensive

20:58 <gamozo> It is something I'm disappointed with. But, for my own OSdev stuff where I'm not using crates, I've never had problems with absurdly complex macros, generics, etc, causing any issues. My entire IL is generic across the size of the architecture, and it's still sub-second build times

20:58 <geist> someone at work explained to me how the cpu time can jump up a lot on rustc when using parallel lto. basically each job in the lto re-expands all the generics

20:58 <zid> hoorah it stopped, I can see again

20:58 <raggi_> right, so user choices count for a ton

20:58 <gamozo> raggi_: Not necessarily. Yes, if they bolted it on, sure. but in reality they could do a lot better job with that memory usage and linking

20:58 <geist> and can't share cpu/memory while they're running

20:58 <mrvn> stop making a single file use 8GB to compiler and you can use -j8 again

20:58 <gamozo> ^

20:58 <geist> so the more threads you add to the LTO it his some scaling point where it's diminishing returns time wise because the amount of duplicated work doesn't help

20:58 <raggi_> if you say, code generate generics that produce expensive and bloaty monomorphizations and then make that mandatory for every api on the system, you'll have poor compile times

20:59 <gamozo> There's really no reason Rust can't be "converted" into C-like-rust where you extern "rust" {} for your forward declarations and compile nearly everything in parallel

20:59 <mrvn> gamozo: nd inline nothing?

20:59 <raggi_> well there is, that severely limits optimizations

20:59 <gamozo> You can inline at the linker stage

20:59 <raggi_> you can't type fold there though

20:59 <heat> tldr reject modernity, embrace C99

21:00 <mrvn> gamozo: that's the part that takes all the time

21:00 <raggi_> so it breaks down fast

21:00 <gamozo> Keep in mind, compiling doesn't have to be to native arch, it can be to a simple IL that's easy to inline at

21:00 <geist> so -j1 it might lto in say 5 minutes of cpu time, -j2 may be like 8, -j3 may be like 12, etc. in that it doesn't just take the initial 5 minutes and subdivide it among cpus

21:00 <heat> in fact, C89 is ok too

21:00 <gamozo> linking takes long because linkers are written like absolute garbage. We've already seen that there's 100x gains in linker times

21:00 <geist> the overhead of addiing more cpus is not linearly dividing the time, since each job duplicately performs the same tasks

21:00 <gamozo> (with mold)

21:00 <heat> ld.lld is pretty good

21:00 <gamozo> Pretty much all our toolchains, and linkers were written for single-core CPUs. We're missing so much perf

21:00 <raggi_> heat: yeah, it's much better at hard tasks

21:01 <gamozo> https://github.com/rui314/mold/raw/main/docs/comparison.png

21:01 <mrvn> gamozo: linking is rather trivial compared with the optimizer

21:01 <geist> omg rant: someone has set my email as their notification for their ADT security system. i get notifications about windows being left open, door to the garage opened

21:01 <heat> i'm not convinced on doing things in parallel here

21:01 <psykose> multithreading can't give you magic performance though; mold is 100x faster not because of 100 threads, but because it's fast on one and then also scales

21:01 <gamozo> I disagree, I genuinely think the way we're doing last-minute inlining, generics, is just really poorly done. There's no reason we can't have a super simplifed IL that's designed for inlining at LTO (rather than more complex graph optimizations)

21:01 <zid> geist: Your fly is undone

21:01 <geist> i can't log into their account and shut it off because they have 2 factor

21:02 <j`ey> gamozo: just wish it did linkerscripts

21:02 <mrvn> gamozo: lat-minute inlining is whole program optimization, not linking

21:02 <heat> geist, complain to ADT?

21:02 <raggi_> geist: eugh, though i'd rather ahve that than the jerk who signs me up for hard right wing newsletters

21:02 <gamozo> j`ey: yeah, I think it's still only Rei working on it. It'll definietly be a while. I'm honestly impressed it can build chrome and clang lol

21:02 <geist> raggi_: oh i get that too.

21:02 <geist> i mean i can just filter it into the trash

21:03 <gamozo> mrvn: I'd hazard inlining can relax some of the optimizations. There's a lot of optimization passes that will just not really do anything when you're ready to inline

21:03 <gamozo> I think it could be a simplified optimization pass

21:03 <gamozo> I'm not talking about perfect optimization. Of course perfect optmization you have to pull everything into one compile unit

21:03 <gamozo> But I think we're using a massive hammer for a stage where you don't really need it

21:03 <j`ey> gamozo: yeah, but he has stated linkerscript as a non-goal

21:03 <raggi_> geist: yeah, i am gunna move to my own domains eventually, but i want a mailserver that does zk encryption on arrival that doesn't require insane amounts of weekly maintenance

21:04 <gamozo> j`ey: awhh. Well, I think he has plans for his own linker-script replacement. Which I approve of, as linker scripts are absolutely fail-open garbage

21:04 <geist> yah i remember back when i used to run my own spamassassin

21:04 <geist> that eventually became a losing battle

21:04 <raggi_> maddy is really close

21:04 <gamozo> I think he wants to make a stricter linker scripting environment? Which thus, won't be backwards compat

21:04 <mrvn> gamozo: I doubt it. The benefit of inlining isn't so much the saved function call anymore, it's that you optimize the code to the local usage. register allocation alone is a major cost factor and speed gain

21:04 <raggi_> maddy folds about 7 daemons into one binary, which helps, but i still want encryption and some spam stuff

21:04 <gamozo> mrvn: Yeah, you can do that really cheaply though. You can do that effectively without graph analysis. Source: Working on my own IL for whole-program optimization which is insanely fast

21:05 <heat> https://blog.cloudflare.com/announcing-route-to-workers/

21:05 <gamozo> So, my current view (which could be wrong, research eh?) is that a faster inlining allows more inlining, which is better than fewer inlinings with "better" optimizations

21:05 <mrvn> gamozo: great. seems like you solved P == NP.

21:05 <gamozo> I only do a single forward and reverse pass of the IL

21:05 <raggi_> gamozo: how do you deal with duplicate eliminiation?

21:05 <gamozo> No graph traversals

21:05 <raggi_> (and other forms of bloat)

21:06 <gamozo> raggi_: I can only remove/alias duplications if they happen above in teh graph, in a dominator. But that's really no different than any other IL

21:07 <raggi_> this is important to measure, in very large programs it ends up becoming geometric

21:07 <gamozo> But we'll see, I have a very special use case. So far it's been absolutely mindblowing for perf. I'm not sure if it's generically applicable yet though? Idk. Working on my OS right now, got sick of working on my IL on Linux

21:07 <mrvn> raggi_: don't write code with duplications, don't generate IL with duplication

21:08 <raggi_> mrvn: never call the same function twice xD

21:08 <raggi_> gamozo: nothign wrong with fit-for-purpose, as long as it's used for the purpose for which it fits

21:08 <mrvn> raggi_: not with the same argumenta at least if it's pure

21:08 <gamozo> So, my current IL is designed for treating all memory as constant until proven otherwise, and it's wild

21:08 <gamozo> If you define write(stdout) to be a no-op (eg. no console), it will bubble up and remove printf() in your code

21:09 <gamozo> it's so good

21:09 <gamozo> and it does that in microseconds

21:09 <raggi_> yeah, there's a lot of rust stuff that produces those patterns

21:09 <gamozo> To make this work, it _has_ to _compile_/JIT fast. I'm oaky with "sloppy" allocations, because the micro-optimizations are less important than the macro optimizations

21:09 <gamozo> Cause I end up re-compiling/optimizing the same code thousands or millions of times, and thus it has to be absurdly fast

21:10 raggi_ is now known as raggi

21:11 <gamozo> Thus the entire optimizer works on a Vec<Op>, and it does one linear forward pass, and one linear backwards pass, such that the CPU can prefetch during optimization. No graphs, no graph traversals, no maps, no dicts, etc

21:11 <gamozo> any of those things effectively make this unusably slow as you get 100-150x slowdowns. But, that limits what I can optmiize. But it's "worth" it

21:11 <raggi> i'm getting Factor vibes

21:11 xenos1984 has quit [Read error: Connection reset by peer]

21:11 <gamozo> it allows things like dynamic configuration (globals), config files, registry keys, etc. To be optimized into your language and constpropped

21:12 <gamozo> It's actually really really interseting and probablym y favorite work (and it's why I'm back to writing OSes so I have a better env to dev it in)

21:13 andreas303 has quit [Ping timeout: 248 seconds]

21:14 <gamozo> Any if(logging) code paths just get completely deleted, it's so cool

21:14 <gamozo> printf("%s", foo) turns into write(), etc, etc

21:15 <j`ey> I wrote a clang rewriter tool that rewrote if statements in clang to add '&& false' to conditions I knew could never happen

21:15 <j`ey> (similar kinda thing)

21:16 <gamozo> The main operating principal of my work is that it will const-prop things without knowing they're constant. When memory is treated as constant it is marked with metadata saying "this was used as constant". if a write to it is ever observed, then the JITs which made that assumption are invalidated and the JIT is re-entered

21:17 <gamozo> "constant until proven guilty"

21:18 <gamozo> And alas, you can see why this will re-JIT many many times (as it learns what is and isn't constant). And thus, why compile speed matters more. The faster I can compile, the wider I can inline things. My eggs are in the basket that removing massive amounts of code with shitty one-pass optimizations will out-perform doing micro optmiizations that are more robust

21:20 <gamozo> It's designed for snapshot fuzzing. With the idea that if you give constant input to a program (eg, the same file input to like a PNG parser). It will have _no_ conditional operations in the optimized program. It will only be memory side effects (if threads are enabled), or simply, entirely collapse to a 'ret' if you don't have threads (and thus writes aren't "volatile")

21:21 <gamozo> You'll just either get a massive function of millions of writes with no branches, or a ret. As you mutate the input, parts of the code flow will "come back", but only the things you have caused. Which is really interesting for static analysis, as you can "see" the program that you're observing rather than the original program

21:25 andreas303 has joined #osdev

21:28 xenos1984 has joined #osdev

21:47 <heat> i should get a domain but then I need a server

21:50 <gamozo> For the right price I'll host for you

21:50 <heat> 1 cent a day

21:50 <heat> deal?

21:50 <gamozo> I was thinking like $100/hour

21:50 <gamozo> Electricity is expensive right now

21:50 <heat> hmm

21:51 <heat> what's the hardware?

21:51 <heat> gameboy color?

21:51 <gamozo> I have a gameboy pocket

21:51 <heat> no deal

21:52 <heat> I'm not confident that it can run gerrit

22:02 _xor has joined #osdev

22:03 <gamozo> ugh fine

22:13 <heat> i wondered if I could run git on the edge but now im playing doom on the edge

22:13 <heat> go figure

22:14 <heat> does doom have its own software rasterizer?

22:20 xenos1984 has quit [Quit: Leaving.]

22:27 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

22:31 <heat> the arm arm keeps redirecting me to pages

22:31 <heat> why couldn't they just describe the whole MMU in one go

22:31 <gamozo> Section 1.3 MMU: vaddr -> paddr

22:31 <heat> "here's the descriptor format; for more information on attributes, see page 2151515151; page 2151515151: here's a summary of the attributes, for more information, see page 599999999999"

22:34 dude12312414 has joined #osdev

22:34 <heat> it's full of cross fucking references aaaaaaaaaaaaaaaaaaa

22:35 <sbalmos> heat: page 599999999999: If you've read this far, you have experienced a real-life example of our implementation of 4-level page tables

22:35 <heat> AF, bit[10] The Access flag, see The Access flag on page D4-2165.

22:35 <heat> they are fucking trolling me

22:35 <heat> sbalmos, PML5 at this point

22:36 <sbalmos> heh

22:38 <heat> geist, what does "Secure" mean in the ARM world?

22:38 <heat> kernel?

22:39 <geist> no it's a hardware feature. specifically the cpu is either in secure mode or not at any point in time

22:39 <geist> which doesn't really affect the performance of the cpu except that it tosses a secure bit on AXI transactions on the bus, so the SOC can accept/deny/modify it's behavior based on if the cpu is in secure mode or not

22:39 <heat> in which mode are you usually running?

22:40 <heat> non-secure?

22:40 <geist> then the arm arch has a complex set of rules regarding setting/clearing the secure bit (hint: it's at EL level transitions and follows the uusal logic of you can only drop when going down a level, and only raise going up)

22:40 <heat> (and then theoretically firmware or that trusted firmware thing would be on secure)

22:40 <geist> correct. if the cpu implements secure bit (all i know of do *Except* apple M1)

22:40 <geist> then the cpu boots with secure mode on implicitly

22:40 <geist> and it's up to the firmare to decide to drop it when going to a lower level. usually it does

22:41 <geist> such that secure mode is 'controlled' by EL3 firmware. it can choose to drop to a lower level with the bit set

22:41 <geist> but lower code cannot set the bit

22:42 <geist> and this is why you se references to running secure OSes alongside your insecure OS (say trusty (secure) alongside linux (insecure))

22:42 <geist> linux makes calls into EL3 firmware which switches modes and drops into the secure OS, which has the secure bit set

22:42 <clever> https://i.imgur.com/JBx0Rtg.png a diagram from the arm arm

22:42 <geist> when it switches back to linux (via EL3) it drops the secure bit

22:42 <geist> and thus memory transactions the secure OS makes can access hardware that is locked off behind the secure mode stuff, including the physical memory it's living in

22:43 <geist> fairly simple strategy, but fairly effective

22:43 <heat> interesting

22:43 <sbalmos> I couldnt' decide whether to say interesting, cute, or go full Spock fascinating

22:44 <clever> so EL3 is always in secure mode, and runs the "secure monitor"

22:44 <clever> EL2 can be both secure and non-secure, and runs either the normal or secure hypervisor

22:44 <clever> then EL1 is where either the normal or secure kernel lives

22:44 <clever> and EL0 can be either normal or secure userland

22:46 <geist> right, but the rules also follow that a lower EL can't run with a higher secureness than a higher one

22:46 <clever> and each time you transition to a lesser EL, you can optionally also reduce the bit width, but that lower EL can then never increase the bit width

22:46 <geist> without bouncing through a mode that has it

22:46 <geist> right, more or less the same rules as bitness

22:46 <geist> though i'm sure there are subtle details. i forget if you can unset your own secure bit for example, which you can't do with bitness (without an exception)

22:46 <clever> https://i.imgur.com/o3dDDD6.png this is the same diagram, but if your EL3 is running in aarch32 mode

22:47 <clever> so now everything is forced to be aarch32

22:47 <geist> note that apple M1 doesn't implement secure mode, and as a result they dont implement EL3

22:47 <heat> any advantage/disadvantage to that?

22:47 <geist> which is actually perfectly valid. thus the cpu boots directly into EL3 in 64bit mode, no secure bit set

22:47 <clever> so the cpu is running in EL2, secure or non-secure, when it comes out of reset?

22:48 <gamozo> huh

22:48 <geist> they just decided they didn't want/need it, so they left it out. and if you dont have secure mode there's really no point implementing EL3

22:48 <clever> heat: EL2 and EL3 are optional, and can just be omited from a design to save transistors

22:48 <geist> s/cpu boots directly in EL3/EL2/

22:48 <heat> why would you have secure mode? secret sauce for the SoC?

22:49 <geist> the arch allows for both of EL3 and EL2 to be optional. and the bitness at every level is optional, except higher ELs must implement at least the superset of the lower

22:49 <clever> heat: a way to access secret keys in a secure manner

22:49 <geist> yah basically, or the ability to build some sort of secure enclave of code that the insecure os can't see

22:49 <clever> heat: for example, secure mode could implement a TPM in software

22:50 <clever> which reminds me, netflix has various security levels

22:50 <geist> so you can (and do) implement say some address decoding logic in the memory controller that says (for example) 0-64MB is secure mode only. now when linux is running in insecure mode it simply can't decode that address. doesn't matter how hard it tries, it'll generate some sort of external sync abort

22:50 <clever> and 4k video streaming, is only authorized on devices with a proper secure enclave

22:50 <heat> but couldn't you just stick that in the hypervisor?

22:50 <geist> but if you have a trusted os you can load it in that range, and it can run just fine because secure bit is set

22:50 <clever> which makes stealing the 4k content far harder

22:50 <geist> the model is not to even trust the hypervisor

22:51 <clever> it also saves the cost of having to deal with a second set of paging tables

22:51 <geist> idea is whatever kernel/os/hypervisor/etc is loaded on the cpu is insecure, so treat it as such

22:51 <clever> you can just hard-wire the ram controller, such that non-secure can just never touch a defined range of ram

22:51 <geist> and put secure bits instead of inside some second processor (liek you get on intel or AMD) but in some trusted OS

22:51 <geist> that lives off in its own world

22:51 <clever> so its enforced by dedigated logic in the chip, rather then a paging table that could corrupt

22:52 <geist> there's more advanced stuff you can do with it, but i think there may be too many cooks right now. want to make sure heat groks it first

22:53 * clever backs off

22:54 <heat> how do you get to the secure mode though? considering nothing else is trusted

22:54 <geist> the cpu starts in secure mode

22:55 <heat> yes but you're down the chain

22:55 <geist> and thus if the firmware dropped secure mode right off the bat and remained in EL3 you cannot get it back

22:55 <geist> but once you're down the chain, in a lower EL without secure mode set, you hve to make some sort of call into a higher EL that has it

22:55 <geist> and then it's a software problem: does it 'bless' you with secure mode?

22:55 <heat> yes, my question is: how do you know you're actually calling it

22:55 <geist> or (more generally) does it do an operation based on its secureness

22:56 <geist> there's an instruction to make a EL3 (or EL2) call

22:56 <geist> HVC for EL2 and SMC for EL3

22:57 <geist> operates exactly like SVC (EL0 -> EL1) in that it just switches to that mode and calls the exception handler

22:57 <j`ey> and the DT will tell you what to call

22:57 <j`ey> (for stuff like PSCI)

22:57 <geist> so there's a standardized firmware call layout to request secure operations from your firmware

22:57 <geist> op code, args, etc

22:57 <geist> so *usually* what you do as an insecure os is make a SMC call to your firmware to send some message to a secure os implementation to go do something for you

22:58 <heat> oh I see

22:58 <geist> EL3 traps it, decides it's legit, saves cpu context, then loads secure OS context and drops to the secure OS with the secure bit set

22:58 <heat> EL3 always has secure mode if it exists

22:58 <geist> right

22:58 <geist> the secure bit is basically latched in every level in a register that can only be seen by that level and above

22:59 <heat> you smc, the secure monitor chucks that request down the secure mode stack, and it goes to trusty, trusty does its thing

22:59 <geist> i forget precisely what the bit is called in what register, but when yuo switch to the new mode it intrinsically is secure or not based on the bit being set

22:59 <geist> ie SECURE_BIT_EL3, SECURE_BIT_EL2, SECURE_BIT_EL1, etc. (that's not the name but it's something in one of the banked regs)

22:59 <heat> whats the point of having a non-secure bit in the page tables then?

23:00 <heat> "For memory accesses from Secure state, specifies whether the output address is in

23:00 <heat> the Secure or Non-secure address map"

23:00 <geist> oh gosh i totally forget. probably some ability to mask off the secureness for safety purposes

23:00 <geist> maybe a secure OS sets that when creating mappings in its own page tables to see pages from a non secure OS

23:01 <geist> so that it doesn't access them with the secure bit going across the bus for Reasons

23:01 <geist> most likely non secure OSes have no need for it

23:02 <geist> at the end of the day that's all the secure mode stuff is. it's just a bit that goes along with the AXI transaction, and its up to hardware to do something with it or ignore it. it doesn't get saved to memory or whatot

23:02 <geist> but i do think it has complex interactions with the cache and it's a bit in the TLBs

23:02 <geist> so there's some rules there to deal with nonsecure oses using cache lines generated from a secure os, etc

23:03 <geist> all told it's kinda like a 1 bit coarse granular iommu but for cpus. kinda.

23:04 <clever> and for dma, you could hard-wire the secure bit on axi to non-secure, so dma cant be used to read secure memory

23:05 <geist> yah and/or have special secure mode dma controllers that you can mask off their registers to program them from insecure oses

23:05 <heat> ARMv8.0 requires that software manages the Access flag. This means an Access flag fault is generated whenever

23:05 <heat> an attempt is made to read into the TLB a translation table descriptor entry for which the value of Access flag is 0.

23:05 <heat> oh wow

23:05 <geist> or even latch the secureness of the dma transactions based on the secure bit at the time the cpu accessed the registers

23:06 <geist> yeah, modified flag too, which is really strange and a head scratcher

23:06 <geist> v8.1 or so adds hardware support for A and D bit, but they're kinda a can of worms. frankly it's kinda nice to have software implemented A and D bits for reasons

23:06 <heat> i don't see anything about the software D bit

23:07 <heat> where's the READ bit?

23:07 <geist> yeah because it's implemented in a wonky way. it's not a D bit per se, but a bit that says IIRC 'its okay for hardware to modify the page table entry from RO -> RW'

23:07 <heat> for execute-only behavior

23:07 <geist> but there's a bit that controls that. it's a weird implementation

23:08 <geist> i dont know why they did it that way vs just having the cpu writeback to a D bit

23:08 <geist> probably some clever reason it's technically more efficient, though hardware for software to deal with

23:09 <geist> okay so what question are we on now?

23:09 <geist> we have overlapping questions, dunno where the current stack pointer for questions is at

23:10 <heat> <heat> where's the READ bit?

23:10 <heat> <heat> for execute-only behavior

23:11 <geist> that's kinda two questions. it's not implemented as RWX but a table of 3 bits that has a bunch of permutations

23:11 <geist> well, not that even. it's kinda wonky

23:12 <geist> i al;ways have to go back to the code to see, hang on

23:12 <geist> ah yeah, it's *two* bits to control 4 permutations of the RW permissions

23:12 <geist> but then a separate XN bit

23:12 <geist> https://github.com/littlekernel/lk/blob/master/arch/arm64/include/arch/arm64/mmu.h#L192

23:12 <bslsk05> github.com: lk/mmu.h at master · littlekernel/lk · GitHub

23:12 <heat> yes

23:12 <geist> tbut one of the permutations is --x

23:13 <geist> but due to the permutations can only be EL0. it's kinda a mess

23:13 <heat> aww crap that's new

23:13 <heat> wait, how is one of the permutations --x?

23:14 <heat> OH

23:14 <geist> uh. i'm about to point you at the manual because frankly i forgot

23:14 <geist> and i dont implement it in LK or zircon

23:14 <heat> i see

23:14 <geist> and Its Complicated

23:14 <heat> if 00, access from EL0 = none

23:14 <geist> some combination of the 2 permission bits + UXN and PXN i think

23:14 <heat> but if it's !uxn, it executes

23:14 <geist> right. yeah that's it

23:15 <heat> 1) cool! 2) horrendous

23:15 <heat> what the fuck

23:15 <heat> i thought you said arm64 was better

23:15 <geist> yeah it's pretty dumb. we actually had a bug in zircon that we fixed a while back, i fixed it here too

23:15 <geist> where we were accidentally leaving the kernel executable from EL0, but not readable/writable

23:15 <geist> because if you forget to set PXN on user bits, even if user space can't read/write it it can still execute it

23:15 <heat> you could execute the kernel?

23:16 <geist> yah, but not read/write, so it was dubious what you could actually *do* with it

23:16 <geist> i fixed it in LK too

23:16 <geist> basically the combinations in https://github.com/littlekernel/lk/blob/master/arch/arm64/mmu.c#L72

23:16 <bslsk05> github.com: lk/mmu.c at master · littlekernel/lk · GitHub

23:17 <geist> there's some actually not bad reason the XN bits are separate from RW, but it's sort of a head scratcher

23:18 <geist> so now that you've seen that you see how the D bit is implemented: https://github.com/littlekernel/lk/blob/master/arch/arm64/include/arch/arm64/mmu.h#L192

23:18 <geist> it sets bit 6, which intrinsically conversts a RO page to RW

23:18 <geist> so they arranged for the bit encoding to double as permissions and D bit

23:20 <geist> but *then* i forget how you arm this D bit mode, since you dont want it to just unilaterally convert RO pages to RW

23:21 <heat> what kind of horrible voodoo do I need to do for caching?

23:22 <geist> not to much. you just have to implement the standard cache flush routines

23:22 <heat> but the mappings seem to have attributes

23:22 <geist> and then apply them at appropriate times. generally only need to do it for dealing with various dma bits in hardware

23:22 SGautam has quit [Quit: Connection closed for inactivity]

23:22 <geist> yeah you just pre-can the MAIR and then you have 8 types. basically similar to PAT in x86

23:23 <geist> ie, a MAIR register has 8 fields of 4 bits with a bunch of permutiations, but only like 2 or 3 actually matter, and then you write the index into it for the page table entries

23:23 <heat> what happens if I don't set that up?

23:23 <geist> https://github.com/littlekernel/lk/blob/master/arch/arm64/include/arch/arm64/mmu.h#L208 are the 3 types i set

23:23 <bslsk05> github.com: lk/mmu.h at master · littlekernel/lk · GitHub

23:24 <geist> then MAIR is probably intrinsically zeroed (hopefully) and then when you have to write an index field in your page tables it selects one of the 0ed entries

23:24 <geist> which is probably looks like `Device-nGnRnE`

23:24 <geist> whichi s 'strongly ordered'

23:24 <geist> which means it'll run like shit

23:24 <geist> that's effectively Hella Uncached

23:24 <geist> see line 208

23:25 <heat> i see

23:25 <geist> in practice basically every OS on the planet sets the MAIR to the same 3 or 4 entries and then just uses it. since most of the permutations of the bits dont matter

23:25 <heat> I think I'll skip this for now since I don't quite understand it

23:26 <heat> I just want to get a flat identity mapping for now

23:26 <heat> full of 1GB pages

23:26 <heat> that seems nice

23:26 <geist> qemu probably wont emulate any of it so you'll probably be fine just leaving MAIR zeroed (you should explicitly zero it at least) and then just put zero in the index field of the page tables

23:27 <heat> oh yeah also what's with the shareable stuff

23:28 <geist> ah re: the dirty bit, it's bit 51. i just dont have a #define for it in my code. it's the DBM bit (dirty bit modifier)

23:28 <geist> just always mark everything sharable. it's legacy

23:28 <geist> from the arm32 days

23:28 <geist> iirc it means 'do you both implementing inter-cpu cache coherency on this page'

23:28 <geist> and the answer is yes. for some sort of per cpu mapping you could probably unset it and hypothetically it's faster

23:29 <geist> i have no idea if modern cpus implement that

23:29 <heat> inner or outer?

23:30 <geist> you only care about inner

23:31 <geist> outer is also kinda legacy, or at least so largely unused you dont really need it

23:32 <geist> oh god i just read the 5 or 6 pages of the ARM ARM talking about the dirty bit (and ingeneral writing back to page table entries by hardware, A bit included) and my eyes are bleeding already

23:32 <geist> it's so complicated. that's why we haven't implemented it in zircon. there's something very ordered and nice about just taking a page fault and doing it there

23:32 <heat> that's a common side effect of reading the ARM ARM

23:33 <heat> <heat> AF, bit[10] The Access flag, see The Access flag on page D4-2165.

23:33 <geist> though to be fair x86 is completely underspecified there WRT precisely how page tables are written back when setting A and D bits

23:33 <heat> wanna know a cool fact: x86 CPUs can possibly hang trying to write to ROM

23:34 <geist> arm goes into it with extreme detail, especially with regards to how precisely the writeback is ordered relative to the instruction that triggered it, and how it nests in a multi translation scheme (S2 + S1), etc

23:34 <heat> which is why firmware sets the accessed bit in the GDT for the ones in ROM: so the CPU doesn't try to writeback the access bit to ROM

23:34 <geist> and what kind of transaction can trigger it, including atomics and cache ops etc etc. this is the sort of detail that sometimes is nice to the ARM ARM but you kinda wish they'd have a TL;DR version of things and then a section that says 'if you really wanna know how it works XYZ'

23:35 <geist> the riscv manual on page tables is like 'theres an A and B bit you know what this does LOL'

23:35 <heat> the riscv manual annoyed me

23:35 <heat> but now I really appreciate it

23:35 <geist> yah it's like the trashy indie band version of architectures

23:36 <geist> they're the Melvins vs late stage Metallica

23:36 <geist> https://youtu.be/KzXDF4M26EM <-- risc-v

23:36 <bslsk05> 'Melvins - Honey Bucket (HQ)' by A Giant Sloar (00:02:44)

23:37 <heat> lol

23:38 <heat> tbh it's not bad

23:38 <heat> just like riscv

23:38 <heat> not bad, but amateur in comparison

23:38 <geist> https://youtu.be/q0ZrF7taMHA <-- armv8

23:38 <bslsk05> 'Animals As Leaders "CAFO" official music video' by ProstheticRecords (00:06:55)

23:52 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

23:52 <geist> been slowly adding opcodes to my 6800 emulator. basically runs all of the altair 680 monitor now

23:53 <geist> always kinda fun

23:57 <heat> why does linux want to be loaded at 0x80000?

23:58 <heat> (why that high up?)