klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books
<mrvn> if you later store the flag in memory then "m" should optimize that. But like this I have no idea what clang is thinking.
<mrvn> does that even work right? The pushf modifies the stack pointer so the "-8(%rsp)" could be the wrong offset
<heat> yes, it works right, it's 100% defined
<heat> <nbjoerg> it just prefers memory for complicated histerical reasons
<heat> from #llvm
<heat> which explains absolutely nothing
<mrvn> so popq first increments the %rsp and then -8(%rsp) is computed?
<gog> pop from memory to memory
<mrvn> https://godbolt.org/z/Kos1nGG6c still doing the right thing?
<bslsk05> ​godbolt.org: Compiler Explorer
<heat> that doesn't work
<heat> you can't move the stack pointer and hope clang notices
<mrvn> You can if the compiler doesn't throw in SP relative memory references into your arguments
<mrvn> I don't think pushf with "m" works.
<heat> it does
<heat> believe me
gxt has quit [Ping timeout: 255 seconds]
<heat> * "=rm" is safe here, because "pop" adjusts the stack before
<heat> * it evaluates its effective address -- this is part of the
<heat> * documented behavior of the "pop" instruction.
<heat> */
<bslsk05> ​elixir.bootlin.com: irqflags.h - arch/x86/include/asm/irqflags.h - Linux source code (v6.0.9) - Bootlin
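A minimal sketch of the pattern under discussion, modeled on the Linux native_save_fl() linked above (GCC-style inline asm, x86_64 assumed):

    /* Save RFLAGS. With "=rm" the compiler may pick either a register
     * or a stack slot for %0; popping straight to memory is safe
     * because "pop" adjusts %rsp before computing the effective
     * address of its operand. */
    static inline unsigned long save_flags(void)
    {
        unsigned long flags;
        asm volatile("pushf ; pop %0" : "=rm"(flags) : : "memory");
        return flags;
    }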
<mrvn> heat: yeah, in this special case
<mrvn> It would also work if clang would use a frame pointer instead of RSP
gxt has joined #osdev
<mrvn> heat: that code would normally be compiled with -mno-red-zone: https://godbolt.org/z/xdT1dsfjh
<bslsk05> ​godbolt.org: Compiler Explorer
<mrvn> looks even worth then
<mrvn> worse even
<heat> omg wtf is going on in f1
<heat> why is it saving %rax
<mrvn> heat: no red zone, so it first has to make space for "flags" on the stack.
<mrvn> same as "subq $8, %rsp"
<heat> wtf
<heat> so just sub?
<mrvn> sub is 4 bytes, push is 1
<heat> sub doesn't store, push does
<heat> gcc agrees with me
<bslsk05> ​godbolt.org: Compiler Explorer
<mrvn> probably because gcc keeps the stack 16 byte aligned
<mrvn> clang lines 5+6 could be just "popq %rax"
<mrvn> https://godbolt.org/z/nb78s6MWT What is going on in g? Why isn't it passing the memory address to the inline asm?
<bslsk05> ​godbolt.org: Compiler Explorer
<heat> if you static inline f you'll get cleaner code, but still not ideal
<mrvn> not really. it just removes the output for f()
<mrvn> I would expect "g()" to be "pushfq; popq 305419896"
<mrvn> isn't that possible?
<heat> that I'm not sure
<heat> you could do mov $0xdeadbeef, %rdi; pop (%rdi)
epony has quit [Quit: QUIT]
<mrvn> 8F /0 : POP r/m64, valid in 64-bit mode
<bslsk05> ​godbolt.org: Compiler Explorer
<heat> there we go
<mrvn> but that's not the optimizer doing the work
<mrvn> With the original the "m" makes no sense. If the optimizer isn't optimizing out the return and stores directly into the target memory then "m" can never be faster.
<heat> m can be faster if 1) the optimizer sees it 2) doing "r" spills a register for a return you're probably not actively using atm
<mrvn> it will always remain: pop to stack, read from stack into reg, store reg in memory
<mrvn> So you already need to spill the reg and then popping into that reg is faster
<mrvn> That was my test with "g": Does the optimizer see the store to memory and eliminate the temp register
<heat> right, but it's super possible you're just saving eflags, cli + disable stuff, and do something, restore eflags
<heat> in that case, storing to the stack is faster
<heat> (if you need to spill)
<heat> and it does indeed do it correctly on the stack
<mrvn> heat: but you aren't. You're popping into a temp place, reading into a reg, ..., storing reg into temp place, pushing temp place.
<heat> but you can also get depressingly bad codegen like
<heat> ffffffff801b945a: 9c pushf
<heat> ffffffff801b945b: 8f 45 d0 pop -0x30(%rbp)
<heat> ffffffff801b945e: 44 8b 65 d0 mov -0x30(%rbp),%r12d
<mrvn> heat: I don't see the optimizer eliminating that "return flags" that loads into a reg anywhere.
<heat> well, keep in mind this func gets inlined
<mrvn> yeah, badly :)
<heat> in other news, omg this codegen is horrible
<mrvn> sometimes compiler really disappoint.
<bslsk05> ​gist.github.com: pushwtf · GitHub
<heat> per objdump -d kernel/vmonyx-unstripped | grep -C 2 pushf
<heat> 99% of the inlines are pushf; pop (somewhere in the stack); mov (somewhere in the stack), <reg>
<heat> like WTF???
<geist> hmm, i dunno, that's strange, but it's hard to do the same thing with fewer instructions
<mrvn> heat: that's clang?
<heat> yup
<mrvn> geist: gcc just uses "pop reg" saving the extra opcode
<geist> sure but it might still need to save it to the stck for some other reason elsewhere
<heat> a quick grep on gcc-built vmlinux sees a lot of pop <reg> effectively
<geist> it's popping it off the stack onto the local frame, maybe something else needs it
<geist> but i could see rewriting it as 'pushf; pop reg; move reg,-x20(rbp)'
<mrvn> geist: clang does too. They both end up using a register, just clang shoves the value onto the stack and then into a reg.
<geist> yah but question is does that stack value get used anywhere else
<geist> maybe it does later in the routine
<heat> it's not that clang has an all seeing eye and knows i'm storing it for later
<geist> also it *could* be a misread of the inline asm
<mrvn> geist: you mean the register is just a copy that gets trashed and then the original gets read again? Not in any of my tests.
<geist> that i have seen
<heat> it's that clang literally just stores to the memory
<geist> okay.
* geist shrugs
<heat> clang for some reason prefers memory to registers and no one knows why
<geist> i do bet a modern cpu will elide that second mov and see it's a store and then load from the same address
<geist> i think for some time now modern intel and amd machines have that sort of load-after-store optimizations
<heat> having a standalone x86_save_flags(): sub $8, %rsp; pushf; pop (%rsp); mov (%rsp), %rax; ret is just really poor codegen
<mrvn> .oO(Our compilers are dumb, lets fix that in the hardware)
<geist> of course mrvn
<heat> geist, how does that play with smp and fences?
<\Test_User> just make CPUs that run on common programming languages already if you're going to do that :P
<heat> JAZELLE
<mrvn> I thought exactly such cases were the point of having SSA form. So this load-after-store would become trivial to recognize and optimize out.
<geist> i think it'd be fine right? a store and then immediately load back from it is fine as long as it's cached. doubleplus so in a weakly ordered machine
<geist> yah i think the pipeline could just elide that to a register move
<heat> i guess, since we're not using atomics?
Lumia has joined #osdev
<mrvn> geist: depends on how smart the cpu is. It's a register and memory dependency so it could totally serialize the operations and stall the pipeline
<geist> and it's cached. if it was uncached memory, ie an mmio register, that's absolutely not okay
<geist> yah i just know i've seen talk recently of much more sophisticated load/store eliding than this, so i suspect this is already a given
<mrvn> the cpu has to detect it as load-after-store and optimize it away
<geist> Zen 2 in particular had something somewhat more powerful than this, in particular
<geist> (though it was removed in zen 3 for some reason)
<heat> actually
<heat> movl $0x0,(%rbx)
<heat> this is atomic
<heat> with a release C11 memory ordering
<mrvn> I guess this is something useful if you have code that calls tiny functions a lot. registers get spilled to the stack, function call, ret, restore registers. If the function is small enough the cpu would elide the push/pop completely.
<heat> movl $0x0, (%rbx), <something else stores>, movl (%rbx), %rax <-- what value can %rax be?
<mrvn> 0 or something else
<heat> unless you need a proper atomic when writing with an acquire C11 memory ordering
<mrvn> atomic really only makes a difference when you have a second observer.
<geist> ugh my thermostat at home is starting to die i think
<mrvn> battery low?
<geist> it's more than once in the last few days suddenly in mid day read much higher temp than it is
<heat> mrvn, you do have two observers in this case
<geist> so it doesn't turn on, so then it gets colder and colder in the house
<mrvn> heat: a single thread is just one observer
<geist> but it dosn't seem to consistently read too high
<heat> to be clear, the <something else stores> is supposed to be another core
<mrvn> ahh, why don't you say so
<heat> I don't understand how you can ever elide that store
<heat> erm, load
<heat> particularly if the other cpu does a proper atomic cmpxchg or whatever
<mrvn> heat: there is no synchronizing event so the order os the movl and something else is undefined
<mrvn> if the other core does atomics then it's synchronizing. Could still execute as if the second "movl" was before something else.
<heat> I think movl $0x0, (%rbx); lock cmpxchg <...>; movl (%rbx), %rax is entirely defined
<mrvn> heat: when is the second movl decoded, added to the pipeline, optimized by the hardware, accesses the cache, ...
<heat> to be clear, you don't need any special fence to get that release memory ordering
<mrvn> If you execute that "lock cmpxchg" at just the right time then it will work. If you do it a quarter cycle later it might not.
<mrvn> mixing atomic (one core) and non-atomic access (other core) will have variable success.
<heat> but this is atomic
<mrvn> the 2 movl are not atomic. The lock can happen before both, in the middle or after.
epony has joined #osdev
<heat> the 1st mov is absolutely atomic
<mrvn> What x86 guarantees is that you won't get %rax filled half before the lock and half after.
<mrvn> each movl is atomic but the pair is not.
clever has quit [Ping timeout: 260 seconds]
<mrvn> for your code to be deterministic you would have to turn off interrupts on both cores, synchronize them and then execute all opcodes with perfect knowledge of the timing so the "lock cmpxchg" on the second core executes right in the middle of the two "movl" on the first core.
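For reference, the two-core scenario in C11 atomics (a sketch; x stands in for the location in %rbx):

    #include <stdatomic.h>

    atomic_int x;

    int core0(void)
    {
        /* plain movl $0x0,(%rbx): an atomic store, release on x86/TSO */
        atomic_store_explicit(&x, 0, memory_order_release);
        /* plain movl (%rbx),%rax: reads 0 or whatever core1 stored */
        return atomic_load_explicit(&x, memory_order_relaxed);
    }

    void core1(void)
    {
        int expected = 0;
        /* lock cmpxchg: sequentially consistent, a full barrier on x86 */
        atomic_compare_exchange_strong(&x, &expected, 1);
    }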
Lumia has quit [Remote host closed the connection]
dude12312414 has quit [Remote host closed the connection]
gildasio has quit [Remote host closed the connection]
gildasio has joined #osdev
clever has joined #osdev
<geist> i wonder. are there *any* circumstances by which multiple SMT threads are allowed to share TLB entries between cores, or are they all intrinsically tagged with the thread that generated them (when the TLB is being shared between them)
<geist> on x86 you could say that if the page is marked global then it could be, however that would be assuming the architecture states that you can only have one set of global pages, ie, they're intrinsically going to be referenced on all cores. not running multiple kernels with multiple sets for example
<geist> PCID is right out on x86, but perhaps ARM's ASIDs are allowed to share, because the arch pretty much states that all cores within an inner shared domain must have the same asids
<geist> same set that is
<geist> if so that might have a complication if you truly wanted to run multiple kernels side by side without using virtualization
<geist> ie, statically allocating cores to individual kernels
terrorjack has quit [Ping timeout: 256 seconds]
<Mondenkind> about the store thingy (movl $0x0, (%rbx); lock cmpxchg <...>; movl (%rbx), %rax), x86 has total store order. That means the initial store _has to_ happen before the cas
<Mondenkind> and you can't fold away the load. Suppose you write a zero. Somebody else reads the zero, writes something else in its place, and then writes something to the location you cas to
<Mondenkind> your store happens-before his load happens-before his store happens-before your cas happens-before your second load
terrorjack has joined #osdev
k8yun has quit [Ping timeout: 252 seconds]
<heat> i've been writing a qemu pflash driver tonight
<heat> kinda cute
<heat> but god is this all really poorly documented
<heat> there's no proper cfi spec I can see
<heat> there are some docs but they're all shitty and super incomplete
<zid> what's a pflash
<zid> it just sounds like someone who can't say the letter f right
<clever> zid: parallel flash
<clever> vs serial (spi) flash
<heat> pflash is qemu's flash devices
<heat> they speak CFI which is some kind of standard-ish language flash devices speak
<zid> imagine having more than 8 pins
smeso has quit [Quit: smeso]
smeso has joined #osdev
epony has quit [Read error: Connection reset by peer]
k8yun has joined #osdev
k8yun has quit [Ping timeout: 248 seconds]
wxwisiasdf has joined #osdev
LostFrog has joined #osdev
chartreuse has quit [Remote host closed the connection]
PapaFrog has quit [Ping timeout: 240 seconds]
bradd has joined #osdev
heat has quit [Ping timeout: 256 seconds]
<wxwisiasdf> iapx 432 osdev
smach has quit [Ping timeout: 260 seconds]
epony has joined #osdev
acidx has quit [Remote host closed the connection]
<geist> i always thought it'd be kind of interesting to actually try to do something with it, but iirc the documentation for it is incomplete
<geist> something like the programming docs are higher level and dont tell you precisely how the low level OO stuff works on the processor
<geist> so it's insufficient to implement an emulator or whatnot
<wxwisiasdf> it is, unfortunately
<wxwisiasdf> even though it was a dumpster fire it had some cool ideas
<geist> yah
<geist> and you can kinda see how some of it bled over into 286 i think
LostFrog is now known as PapaFrog
<wxwisiasdf> 286 TSS time
<geist> and in general the whole handle to segments with bytewise size, etc
<wxwisiasdf> i think the most weird thing is how instructions were bitstreams rather than bytestreams
<wxwisiasdf> like imagine getting an offset wrong
<geist> heh
<wxwisiasdf> whole program dead
<geist> i suppose it could resync fairly quickly, but depends on what the opcode layout is
<wxwisiasdf> i assume they just used nibbles instead of allowing 6 bit insn
<geist> yah good question. also depends on what units they address memory
<geist> i was fairly certain it was at least a full 32bit machine
acidx has joined #osdev
Burgundy has joined #osdev
wxwisiasdf has quit [Quit: leaving]
clever has quit [Ping timeout: 256 seconds]
Burgundy has quit [Ping timeout: 252 seconds]
simpl_e has quit [Read error: Software caused connection abort]
simpl_e has joined #osdev
acidx_ has joined #osdev
bauen1 has quit [Ping timeout: 256 seconds]
acidx_ has quit [Remote host closed the connection]
acidx has quit [Remote host closed the connection]
acidx has joined #osdev
wxwisiasdf has joined #osdev
MarchHare has joined #osdev
knusbaum has quit [Ping timeout: 248 seconds]
doppler has quit [Ping timeout: 248 seconds]
doppler has joined #osdev
knusbaum has joined #osdev
k8yun has joined #osdev
bauen1 has joined #osdev
k8yun has quit [Quit: Leaving]
Benjojo has quit [Read error: Software caused connection abort]
Benjojo has joined #osdev
gxt has quit [Remote host closed the connection]
gxt has joined #osdev
gxt has quit [Remote host closed the connection]
gxt has joined #osdev
gxt has quit [Remote host closed the connection]
gxt has joined #osdev
bauen1 has quit [Ping timeout: 255 seconds]
gildasio has quit [Remote host closed the connection]
gildasio has joined #osdev
bauen1 has joined #osdev
Vercas6 has quit [Quit: Ping timeout (120 seconds)]
gog has quit [Quit: byee]
wxwisiasdf has quit [Ping timeout: 256 seconds]
clever has joined #osdev
Vercas6 has joined #osdev
carbonfiber has joined #osdev
nyah has joined #osdev
gaze___ has quit [Read error: Software caused connection abort]
gaze___ has joined #osdev
ElementW has quit [Quit: -]
SanchayanMaity has quit [Read error: Software caused connection abort]
SanchayanMaity has joined #osdev
bauen1 has quit [Ping timeout: 252 seconds]
bauen1 has joined #osdev
diamondbond has joined #osdev
ElementW has joined #osdev
bauen1 has quit [Ping timeout: 260 seconds]
bauen1 has joined #osdev
Mutabah has quit [Ping timeout: 248 seconds]
Mutabah has joined #osdev
GeDaMo has joined #osdev
Stella is now known as theWeaver
heat has joined #osdev
Maja[m] has quit [Quit: Bridge terminating on SIGTERM]
Irvise_ has quit [Quit: Bridge terminating on SIGTERM]
chibill has quit [Quit: Bridge terminating on SIGTERM]
sakasama has quit [Quit: Bridge terminating on SIGTERM]
identitas has quit [Quit: Bridge terminating on SIGTERM]
Maja[m] has joined #osdev
Burgundy has joined #osdev
diamondbond has quit [Ping timeout: 260 seconds]
identitas has joined #osdev
Irvise_ has joined #osdev
chibill has joined #osdev
sakasama has joined #osdev
heat has quit [Remote host closed the connection]
heat has joined #osdev
<heat> your operating systems are mid at best
<heat> mine is super good numba one best operating system ever
Burgundy has left #osdev [#osdev]
eroux has quit [Ping timeout: 260 seconds]
Bitweasil has quit [Remote host closed the connection]
Bitweasil has joined #osdev
eroux has joined #osdev
GeDaMo has quit [Read error: Connection reset by peer]
CryptoDavid has joined #osdev
GeDaMo has joined #osdev
Burgundy has joined #osdev
bradd has quit [Ping timeout: 252 seconds]
gildasio has quit [Ping timeout: 255 seconds]
gildasio has joined #osdev
Vercas6 has quit [Remote host closed the connection]
Vercas6 has joined #osdev
<mjg> hue
<mjg> consider the following: you implement scalable inode number allocation for tmpfs
<mjg> 's all per-cpu 'n shit
<mjg> and then you run into a funny corner case: / has to have ino 2
<mjg> but your scheme does not *guarantee* you will have that
<heat> my inode allocation scheme for tmpfs is cur_inode_num++
<clever> i believe x86 allows that to be atomic with just 1 opcode and maybe stalling? but it also steals the entire cache line the var is in?
<mjg> well it is lock xadd now
<mjg> the point is for it to NOT be this way
<heat> yeah I use atomics here
<mjg> just sayin a funny problem
<mjg> one which probably would not show on a 2 thread vm for the test suite
<clever> but if you had per-cpu counts, and each count was in its own cache line, it wouldnt stall or need atomics
<mjg> would it
<mjg> clever: that is the entire point, yes
<clever> you could just `(core << 20) | counts[core]++` basically
<heat> are you just dividing the 64-bit inode space into NR_CPUs partitions?
<heat> I can see that working
<clever> heat: i was thinking, just slot the corenr into the upper bits, dont bother trying to partition it up better
<mjg> no. conceptually for_each_cpu(c) { per_cpu(c)->ino = c; }
<mjg> then allocation is you add MAXCPU
<mjg> to whatever per-cpu var you got
<clever> 8bit corenr, toss it into the top 8 bits, that leaves you with 56 bits for each cpu to count its own inodes
<mjg> guaranteed lack of conflict, but also huge gaps, which should not be a problem
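A sketch of that scheme in C (illustrative names, not the real tmpfs code): each CPU's counter starts at its own index and strides by MAXCPU, so no two CPUs can ever produce the same ino and no locking is needed, assuming callers stay pinned to their CPU.

    #define MAXCPU 256

    static unsigned long percpu_ino[MAXCPU];   /* boot: percpu_ino[c] = c */

    static unsigned long ino_alloc(unsigned int cpu)
    {
        /* first result for CPU c is c + MAXCPU, so inos 0..MAXCPU-1
         * (including 2 for /) are never produced here and can be
         * special-cased at mount time */
        return percpu_ino[cpu] += MAXCPU;
    }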
<heat> odd scheme
<heat> make CPU2 start at 2 + MAXCPU ig
<heat> while you're at it, maybe also reserve 0 and 1
<heat> well, in any case you'll need the allocation code to detect these reserved inode numbers and skip them
<clever> in my scheme, inodes 0/1 effectively belong to core 0
genpaku has quit [Read error: Connection reset by peer]
<clever> and if you want those to be for things like the root dir, you can just pre-initialize core-0's inode table and counter, before you allow other cores into the ball-pit
genpaku has joined #osdev
<clever> if core-0 cant possibly be touching the state, then you're free to initialize core-0's state however you like
<clever> you're going to initialize all of the other cores anyways, from the wrong core
<heat> yes, that layout also works well
<heat> cpu0 just skips the first 3 inodes
<mjg> there is 0 difficulty skipping some range in my scheme
<mjg> the entire point tho was that you may need to allocate something which goes against the general scheme
<mjg> and thus needs to be special-cased
<clever> oh, i just had another idea, just reserve core-255's range for special stuff
<clever> when do you expect to see 256 cores? :P
<clever> but that would drop the special ones near the top of the 64bit range
<mjg> there is no difficulty here per se
<mjg> just fucking around to get it done
<heat> it's mildly annoying to skip in your scheme given a low enough MAXCPU
<heat> does anything actually depend on root ino = 2?
<heat> that seems............ depressing
<clever> the only time ive ever had issues with the inode number being weird, was 64bit inodes and a 32bit userland
gildasio has quit [Remote host closed the connection]
Vercas6 has quit [Remote host closed the connection]
<clever> the 32bit readdir() returns EOVERFLOW if any inode is over 32bits in length
<mjg> heat: that's to make it cpu hotplug-proof
<clever> xfs spreads the inodes over the whole disk, and picks an inode near the first data block, so if the first data block is >2tb into the fs, the inode# is over 32bits
<mjg> heat: one can roll with current cpu count without conceptually changing anything
Vercas6 has joined #osdev
<clever> and much to my surprise, a number of core linux utils, used in initrd stuff, dont check the readdir() return code
<mjg> heat: but you may notice MAXCPU would be a macro known at compilation time, while current cpu count would have to be read every time
<clever> if it returns -1, it must be EOF!
<clever> so basic linux utils, claim file not found, when the file exists!!
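For reference, the check those utilities skip looks roughly like this; readdir() returns NULL for both end-of-directory and errors such as EOVERFLOW, and only errno tells them apart (a sketch):

    #include <dirent.h>
    #include <errno.h>
    #include <stdio.h>

    int list_dir(DIR *d)
    {
        struct dirent *de;

        errno = 0;
        while ((de = readdir(d)) != NULL) {
            printf("%s\n", de->d_name);
            errno = 0;
        }
        if (errno != 0) {       /* e.g. EOVERFLOW: 64-bit ino, 32-bit caller */
            perror("readdir");
            return -1;
        }
        return 0;               /* genuine end of directory */
    }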
<mjg> clever: is there no magic to misrepresent inos which don't fit?
<mjg> i would expect gnu to have a hack big time
<clever> mjg: you're supposed to rebuild your program with large file support, and libc will then change the size of off_t&friends, and call readdir64 behind the curtain
<mjg> e.g., masking off the overflowing part
<clever> and then everything just works
<heat> GNU's big time hack is a 64-bit version of getdents
<mjg> for example linux in kernel had a scalable scheme to allocate 32-bit inos
<heat> which is actually not a big time hack but a normal time hack
<mjg> except it just assumed there would be no duplicates
<mjg> or even if you get one, it wont matter
<clever> one extra complication in my case, was that the fs itself, was xfs on a 64bit machine, running an nfs server
<mjg> linux in a nutshell
gildasio has joined #osdev
<clever> the client was then purely 32bit, and an nfs client
<mjg> clever: you lose by exporting nfs man
<clever> mjg: what else would you use for sharing files?
<mjg> ootb i don't see good alternatives on linux
<mjg> assuming you need to pretend posixy
<clever> yeah, thats why i use nfs
<clever> i discovered this bug, when working on nfs based net-boot for an rpi
<clever> my work-around was iscsi, export a whole block device, run the fs client side
<clever> also, due to the block dev being smaller, the inode table isnt big enough to cause the problem in the first place
<clever> but iscsi is limited to a single client, so it cant replace nfs
<mjg> and you are stuck with 32-bit inos on the client no matter what?
<clever> it was running the arm in 32bit mode
<clever> and due to limitations in dozens of linux packages, i couldnt use readdir64()
<clever> so yes
<mjg> linux
<mjg> no such problems on onyx amiright
<clever> i was able to apply overrides to the packages, to force them to build with large file support
<clever> which did technically fix it
<clever> but it was like a game of whack-a-mole
<clever> fix one, and 2 more come out
<heat> ONYX BEST OPERATING OF SYSTEM
<mjg> solaris on powerpc == $$$
<clever> i used gentoo on a sparc machine as my nas for a while
<mjg> wut
<clever> but one day, it just randomly stopped booting and i couldnt figure out why
<clever> and then i discovered, xfs is super lazy, the journal is in native byte order only
<mjg> you know, i do find it disheartening when people boot !solaris on sparc
<clever> and there is no cross endian journal replay
<clever> so an LE machine cant replay a BE journal
<mjg> and the endiannes
<mjg> having BE/LE branchfest to translate as needed was the shit
<mjg> s/and/ah/
<clever> zfs has a neat trick to solve most endian issues, write all records in native order, include a magic# in the header
<clever> if the magic# is backwards, you're on a different endianness, swap all fields
<mjg> bugreport: magic is all 0s
<clever> if you dont change the host endianness, it will always be operating in native mode and never byteswap
<clever> but if you do mess with the host, it can still read things, but all new data will be in native order
<clever> however, there are a number of surprise fields, that are BE only
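A sketch of the trick clever describes (the magic value is ZFS's uberblock magic, quoted from memory; treat the code as illustrative):

    #include <stdbool.h>
    #include <stdint.h>

    #define UB_MAGIC 0x00bab10cULL   /* "oo-ba-bloc" */

    /* Records are written in the writer's native order; a byte-swapped
     * magic means the record came from the other endianness. */
    static bool check_magic(uint64_t on_disk, bool *swap)
    {
        if (on_disk == UB_MAGIC)                    { *swap = false; return true; }
        if (on_disk == __builtin_bswap64(UB_MAGIC)) { *swap = true;  return true; }
        return false;   /* neither: corrupt header */
    }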
bauen1 has quit [Quit: leaving]
* mjg adds midgetendian
<clever> one thing i want to get back to at some point, is BE linux on aarch64
<clever> both arm32 and aarch64 can run in BE mode
<clever> its just a config register in the cpu
<mjg> can you fuck with it any time?
<clever> on 32bit arm, yes, and i think it has even caused some bugs
<clever> on 64bit arm, i think its part of the EL switch system, so when you drop from say hypervisor to kernel, or kernel to userland, you can also change bit width, and endianness
<mjg> now that makes sense
<clever> but linux isnt able to handle a userland with a differing endianness
<clever> so kernel and userland must match
<clever> i dont know of any that allow it, but a hypervisor could support both LE and BE guests
<clever> a few months back, i was helping some guys on the rpi forums, at booting BE linux
<clever> and they then discovered, half of the drivers arent doing proper bit-flips
<mjg> :]]
<clever> while linux will automatically byte-swap MMIO writes
* mjg == shocked
<clever> the control structures you dump in ram, and then point at, dont get magically swapped
<mjg> i'm positively surprised with the half which does
<clever> and those kind of edge cases, are why i want a BE CI machine
<mjg> i'm guessing some of the drivers only do it some of the time
<mjg> ;>
<clever> yeah
<clever> another whacky case ive been involved in, is the pistorm guys
<mjg> there was a cpu which had errata: writing to a reg is expected LE, but reading gives BE
<mjg> LS
<mjg> :S
<clever> basically, emu68 is a JIT'ing emulator, to run m68k (BE) code, on an rpi3, in BE aarch64 mode
<clever> by keeping the arm in BE mode, it never has to deal with byte-swaps when emulating
<clever> it can just directly translate an m68k load into an aarch64 load, and not care about address or endiannes
<clever> but where it gets whacky, is that the rpi MMIO window, is mapped into the guest address space
<clever> you then write drivers for the rpi hardware, with the BE->LE byteswaps everywhere, compile the driver for m68k, then JIT it into aarch64 at runtime
gxt has quit [Ping timeout: 255 seconds]
<clever> mjg: the other crazy part, is that emu68, is mostly only emulating the cpu, you socket the whole rpi into the cpu socket of an amiga, and it bit-bangs the entire motherboard, lol
gxt has joined #osdev
Burgundy has quit [Ping timeout: 252 seconds]
<heat> yeah but like, realistically who cares about big endian these days? at least on modern-ish not-embedded stuff
<clever> heat: and thats why BE stuff is always broken :P
eroux has quit [Ping timeout: 252 seconds]
eroux has joined #osdev
Burgundy has joined #osdev
Vercas6 has quit [Quit: Ping timeout (120 seconds)]
Vercas6 has joined #osdev
Vercas6 has quit [Remote host closed the connection]
Vercas6 has joined #osdev
xenos1984 has joined #osdev
<heat> today's saga: rdrand detection code
<heat> or "god oh god why can't amd get this right"
<zid> check if the cpuid name contains a z
<heat> z for what
<heat> brokenz rdrandz
<zid> but no double l, actually
<zid> to check for ryzen, or zen, but not bulldozer
<heat> the other ones also have broken shit
<heat> amd can't get rdrand right
diamondbond has joined #osdev
epony has quit [Ping timeout: 268 seconds]
<bslsk05> ​lore.kernel.org: [PATCH] x86/CPU/AMD: Clear RDRAND CPUID bit on AMD family 15h/16h - Lendacky, Thomas
<heat> "There have been reports of RDRAND issues after resuming from suspend on some AMD family 15h and family 16h systems."
<heat> this is a crying emoji moment
<zid> yea amd has a LOT of bugs with suspend, exception etc recovery just from.. not being that mature as a setup
<zid> lots of bios bugs, lots of cpu bugs
<heat> how hard can rdrand be
<heat> 1. get entropy 2. magic cryptography 3. ??? 4. put it in a register
diamondbond has quit [Ping timeout: 268 seconds]
<heat> you'll notice there's no step called "screw up rdrand for 11 years"
<kof123> this is some really good gourmet coffee heat
gildasio has quit [Ping timeout: 255 seconds]
<heat> thanks
<heat> i try my hardest to give you the best gourmet coffee
diamondbond has joined #osdev
<heat> btw re: that clang thing: https://github.com/llvm/llvm-project/issues/20571 cc mrvn, geist
<bslsk05> ​github.com: inline asm "rm" constraint lowered "m" when "r" would be preferable · Issue #20571 · llvm/llvm-project · GitHub
<heat> <jyknight> heat: This is a really long-standing issue which is not easy to fix. At the time the decision about whether to use "r" or "m" is currently made, it doesn't yet know that a register can be allocated at all. That's why it always falls back to memory -- the alternative is sometimes failing to compile. I believe https://github.com/llvm/llvm-project/issues/20571 is the canonical issue for this.
gildasio has joined #osdev
diamondbond has quit [Remote host closed the connection]
diamondbond has joined #osdev
Matt|home has joined #osdev
gildasio has quit [Remote host closed the connection]
gildasio has joined #osdev
wxwisiasdf has joined #osdev
xenos1984 has quit [Ping timeout: 246 seconds]
xenos1984 has joined #osdev
CryptoDavid has quit [Quit: Connection closed for inactivity]
diamondbond has quit [Ping timeout: 256 seconds]
xenos1984 has quit [Ping timeout: 252 seconds]
gog has joined #osdev
xenos1984 has joined #osdev
xvmt has quit [Read error: Connection reset by peer]
<gog> hi
<Ermine> hi gog!
<gog> :)
xvmt has joined #osdev
heat_ has joined #osdev
heat has quit [Read error: Connection reset by peer]
wootehfoot has joined #osdev
crm has joined #osdev
orthoplex64 has quit [Ping timeout: 255 seconds]
<wxwisiasdf> Hii
epony has joined #osdev
dude12312414 has joined #osdev
elastic_dog is now known as Guest5023
Guest5023 has quit [Killed (zirconium.libera.chat (Nickname regained by services))]
elastic_dog has joined #osdev
dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]
heat_ has quit [Read error: Connection reset by peer]
heat_ has joined #osdev
kof123 has quit [Ping timeout: 268 seconds]
GeDaMo has quit [Quit: There is no spoon.]
kof123 has joined #osdev
<jbowen> o/
immibis_ has quit [Ping timeout: 260 seconds]
immibis_ has joined #osdev
<geist> heat_: ah so it was the lowering thing
<geist> now that i think about it i think i've seen something like this in zircon, grumbled about it as bad clang codegen, and moved on
<mrvn> heat_: premature optimization is the root of all evil
<mrvn> heat_: Also the point isn't whether a register is available but whether the result is going to go to memory.
gxt has quit [Ping timeout: 255 seconds]
gildasio has quit [Ping timeout: 255 seconds]
gxt has joined #osdev
gildasio has joined #osdev
<geist> but yeah i only late discovered you can *have* multiple constraints like that on inline asm, but then was dissapoint when i discovered it on zircon, using clang, since it's mostly broken there
<geist> now it makes sense why
<mrvn> It should not lower the asm until all the inputs and outputs have been determined or multiple options are viable.
heat_ is now known as heat
<heat> yeah
<heat> gcc handles it fine tbf
<heat> praise be the GNU operating system
<mrvn> heat: does it? doesn't it just default the other way?
<heat> no
<geist> yah gcc i've seen do it quite well. very pleasing when it handles in/out instructions with a constant, for example
<geist> look at the disassembly and all is well
<heat> maybe this warrants a #ifdef __clang__ #define RM_CONSTRAINT "=r" #else #define RM_CONSTRAINT "=rm" #endif
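Spelled out, that workaround would look something like this (RM_CONSTRAINT is hypothetical, not an existing macro):

    /* clang lowers "rm" to memory (llvm issue 20571), so only give it "r" */
    #ifdef __clang__
    #define RM_CONSTRAINT "=r"
    #else
    #define RM_CONSTRAINT "=rm"
    #endif

    static inline unsigned long arch_save_flags(void)
    {
        unsigned long flags;
        asm volatile("pushf ; pop %0" : RM_CONSTRAINT(flags) : : "memory");
        return flags;
    }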
<mrvn> heat: I think yesterday gcc did pop into a register and then mov to memory for the "to memory" test case
<heat> well, that's not what I'm complaining about
<geist> i mean all else held equal if you can't have both constraints, just keep the 'r' one
<geist> since it's probably cheaper to pop into that and push it, and there's some chance you'll want to use it anyway
<heat> pushf; pop <something> <-- I expect gcc to pop into a register OR spill onto the stack
<mrvn> clang is worse in that when you store to memory it will pop to the stack, load into a register and then store that to memory.
<geist> constants and R for inputs also
<geist> really i mean goddamnit intel and/or amd: add a stupid instruction to load flags into an integer
<geist> why have they not fixed this damn bug after all these years
<heat> becuz
<geist> they have done far more intrusive ISA fixes in the intervening years
<mrvn> geist: and why do they have so many pops? reg16, reg32, reg64, mem16, mem32, mem64,
<heat> CISC ftw
<geist> well the multi pops make sense
<mrvn> especially where mem is relative to a register
<mrvn> geist: sure, 16, 32, 64bit pop is needed. but all the address modes?
<geist> do you really mean 'why do they have pop to memory?"
<mrvn> ARM is better there. pop is just a mov with the "store" flag
<mrvn> geist: yeah
<geist> that's just risc vs cisc
<geist> why x86 has pop to memory i dunno, but since it *does* have a memory operand i think it supports most of the usual addressing modes, so that just comes along with the territory
<geist> though honestly i haven't looked at it. it may be a special snowflake
<geist> part of the very few instructions that can implicitly access two memory opeands at a time
<mrvn> I didn't even know "m" would mean picking a SP relative address.
<geist> but really mrvn, as a 68k person i'm quite dissapointed in you
<geist> i'd expect the opposite: why is pop so limited in its addressing modes!
<heat> yeah but this is not two memory operands, just one
<mrvn> geist: that too. either it should have one addressing mode or all
<geist> heat: yeah fair. one operand, two memory accesse
<heat> just like rep movsb has 0 memory operands
<heat> makes total sense lgtm ok heat@ reviewed-by: heat <heat@irc.libera.chat>
<mrvn> some are just implicit because 8bit opcodes just aren't big enough
<geist> but yeah even in that, there are only a handful of x86 instructions that can do two memory accesses, but none of them have two full memory operands i guess
<mrvn> pop probably uses the same logic path as mov
<geist> oh i seriously doubt it. i think push/pop is highhhhly specialized on x86
<mrvn> maybe nowadays
<heat> x86 really is a high quality very consistent architecture
<geist> though whether or not it always was
<mrvn> But really push/pop is just a mov with one argument being the SP and storing the address modification
<heat> I do really like that part where storing to the lower 16-bit half doesn't zero the upper bits but storing to the lower 32 bits zeroes the upper part
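A small sketch of that quirk (GCC inline asm; %w and %k are the 16- and 32-bit operand-size template modifiers):

    #include <stdint.h>

    uint64_t partial_regs(void)
    {
        uint64_t r;
        asm("movq $-1, %0\n\t"   /* r = 0xffffffffffffffff           */
            "movw $0,  %w0\n\t"  /* 16-bit write: upper 48 bits kept  */
            "movl %k0, %k0"      /* 32-bit write: upper 32 bits zeroed */
            : "=r"(r));
        return r;                /* 0x00000000ffff0000 */
    }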
<geist> i can't believe i have mostly forgotten the details of 68k and multiple operands. iirc it's far more consistent, but still limited to one operand except in a few cases (MOVEM comes to mind)
<mrvn> movem is 040 extension I think
<mrvn> or 030
<geist> 040 i think
<mrvn> Didn't make sense before you had cache I guess
<geist> however 68k has the indirect addressing modes though, so lots of stuff can implicitly do more than one access
<geist> my brain tends to smear 68k and VAX together, since they're clearly cut from the same cloth. in terms of ISA layout and addressing modes it's like VAX-lite
<geist> so i tend to lose track unless i fiddle with it a bit
<mrvn> 68k is quite regular except for some opcodes that limit regs to A
<geist> right. but for example you can't ust have both operands be any addressing mode, right?
<mrvn> think so
<geist> so there tends to be multiple opcodes for mem to reg or reg to mem
<geist> and then of course opcodes that implicitly deal with A or D regs (which is admittedly the weirdest quirk of the arch)
<geist> if there's one weird quirk that didn't work, it's the two sets of registers
<geist> but what does make those addressing modes much more powerful than x86 is theres so many of them
<geist> pre-post increment/decrement, indirects, indirects with pre-post increment/decrement, etc
<geist> all told i think there are 16 different addressing modes
<geist> (this is why there's no push/pop on 68k, since you can accomplish it with a move indirect with pre or post inc/dec)
<mrvn> same on ARM
<geist> yah until arm64 in which case you can't
<geist> one of the core, critical differences between arm32 and arm64
<geist> re: push/pop on x86. there are also some silly differences, like it always moves the stack in units of <defined by segment, mode, etc>
<geist> and it always implicitly uses the SS: segment
<mrvn> I guess they saw that languages don't use push/pop except potentially in the function entry and exit. But most often you have push + sub #x, %sp. Might as well drop the logic for pre/post inc/decrement and just make "x" larger.
<geist> (moves with BSP do too)
<geist> mrvn: well, i mean you can do push/pop on arm64, it's just more limited. basically you dont have the move multiple, but you have the 'load store pair' instruction
<geist> so you generally move 16 bytes, two registers, at a time
<geist> it does have pre/post inc/dec so basically you can do what you want, it's just more limited
<geist> so i guess in that case it's not really fundamentally different from arm32, just more limited
<mrvn> I like the move multiple.
<geist> yah those are the first thing you ditch when making high speed impls. they clearly wanted to remove any microcoded style instructions everywhere
<geist> and explicitly make things like PC hard to access and SP a special case so it can be treated specially. all tricks for high speed impls
<mrvn> geist: isn't that kind of similar to loading/storing a full cacheline? Same as SIMD registers doing large load/store?
<geist> sort of. mostly its because it fits with their model of 'stack pointer *must* be 16 byte aligned at all times'
<geist> so to make that useful they added double register load/stores
<mrvn> You could even limit it to the caller saved regs, or 4/8 regs.
<geist> sure but then you can't easily fit that in the instruction, since they have 32bit wide instructions and there are 31 regs
<mrvn> arm32 has double register load/store already. The register just got bigger.
<geist> one nice thing ldp/stp does in arm64 is it takes any two register pairs
<geist> including the same register
<mrvn> geist: first reg + log(size)
<geist> so it's not exactly the same thing as the arm32 double load/store
<geist> mrvn: yah that's what PPC does
<geist> you get a linear run of any registers, but only a run of them
dude12312414 has joined #osdev
<mrvn> which is exactly what a language needs
* geist nods
<geist> anyway you really should do some arm64
<mrvn> ahh, yeah, the good old time when they made RISC to implement just what a compiler needs.
<geist> yah POWER/PPC is clearly cut from a different cloth. its risc, but at its core it just has a different viewpoint, i guess
<geist> you can see that it has some heritage somewhere completely different
<mrvn> None of that fancy "solve this quadratic equation" opcode that you need once in every 10th program.
<geist> probably if i knew ibm 360 or whatnot i'd see the resemblance
<geist> yeah that's a distraction. i hate it when folks bring up the quadratic equation instruction, since a) it's not really what you think and b) it was a weird outlier even at the time
<mrvn> hence why they bring it up
<gog> wait solve quadratic equation instruction
<geist> to me the essence of CISC is more interesting and subtle
<geist> if you're *actually* interested in it i forget what it's called but can look it up
<geist> IIRC it's actually some sort of 'do math and do a table lookup' instruction
<gog> oh
<mrvn> If you comapre m68k and ARM I don't think there is that much difference in the addressing modes. The CISC/RISC line is totally blurred.
<geist> clearly intended to accelerate something, iirc
<geist> mrvn: except all the indirect addressing modes. none of those exist in ARM
<gog> is it in the main isa or is it x87?
<geist> but yes, minus the indirect stuff, it's clear that the ARM stuff took their inspiration from the motorola world.
<mrvn> geist: you mean base reg + offset reg + immediate?
<geist> gog: oh this is some famous VAX instruction
<gog> oh
<heat> aaa best instruction
<geist> mrvn: i mean 'base + something' then take that as an address and read the thing at that address
<mrvn> heat: better than sex?
<heat> yes
<geist> ie a double indirect
<heat> also better than eieio
<mrvn> geist: oehm, did m68k have that?
<geist> er i didn't phrase it right, but 68k has variations of most of the modes that double indirect
<geist> yeah, i think mostly 020+?
<mrvn> geist: sure it's not VAX?
<heat> gog, u still using x87
<geist> mrvn: VAX definitely has that
<heat> have you heard of streaming simd extensions
<heat> it's this new thing for floating point and simd
<mrvn> can't remember having double indirection on m68k but I learned 68000 asm and only little bits for 020+
<gog> i will never do floating point math
<mrvn> and it's been decades since I used it
<geist> FWIW the VAx instruction that mrvn was referring to is https://documentation.help/VAX11/op_POLY.htm
<heat> reject floating - it's the devil's way
<geist> mrvn: yeah and in fact i couldn't easily get the compiler to even emit it
<heat> oh wow, very cool
<bslsk05> ​www.thedigitalcatonline.com: The Digital Cat - Motorola 68000: addressing modes
<gog> oh neat
<geist> may be the vax stuff bleeding through
<mrvn> That POLY opcode is really powerful. You need that for all the trigonometric functions for example.
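What POLY computes is essentially Horner's rule over a coefficient table; a C sketch of the same evaluation:

    /* Evaluate c[0] + c[1]*x + ... + c[degree]*x^degree, one
     * multiply-add per step -- the core loop of libm-style
     * sin/cos polynomial approximations. */
    double poly_eval(double x, const double *c, int degree)
    {
        double r = c[degree];
        for (int i = degree - 1; i >= 0; i--)
            r = r * x + c[i];
        return r;
    }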
<heat> *heat
<gog> when you've programmed on too many architectures
<geist> in general vax has pretty much similar stuff to 68k there but then you can also take the result of it and use that as an address you then indirect once more
<geist> ie, compute this address as a table entry of pointers and then read whatever is at the pointer we just computed in memory
<mrvn> I like (d8,Dn,PC). You have an array of structs on the stack and access it with (off+label, array index, pc)
<mrvn> -on the stack
<geist> ah no. actually turns out 040 has more
<mrvn> or (d8,Dn,An) for the more general case
<geist> i think that may be where i was looking at it. trying to find a better list
<mrvn> geist: Not much more it can have. Everything up to 111 100 is in use.
<mrvn> So only 4 more to go
<geist> yeah in wikipedia article on 020:
<geist> Addressing modes added scaled indexing and another level of indirection"
<geist> yah that's what i remember seeing
<geist> i forget how it's encoded
<geist> but those seem highly exotic and special cased, probably removed in 060 and coldfire and whatnot
<mrvn> maybe opcode specific
<geist> possibly
<geist> for the indirect stuff you just need one more bit somewhere
<mrvn> The 040 was kind of a branch. When they made the 060 superscalar they went back a step and never implemented everything the 040 had.
<geist> yah though in this case these additional modes were added in 020
<heat> kinda offtopic but does arm64's RNDR also have a history of shitty impls?
wootehfoot has quit [Read error: Connection reset by peer]
<mrvn> like having a pattern?
scoobydoo_ has joined #osdev
scoobydoo has quit [Ping timeout: 260 seconds]
scoobydoo_ is now known as scoobydoo
<heat> having predictable results, or plain depressing impls like accidentally always returning all-1s
<gog> heh, oops, all 1's
<geist> heat: hmm, arm64s?
<geist> what machine are you on?
<mrvn> heat: bit 3 ^ bit 7 == 1
<heat> geist, none. just wondering
<geist> in general i dont think a lot of ARM machines have it yet. it's a new extension that i think is optional in most case
<geist> i haven't personally seen it on any machine. though M1 maybe?
<heat> hm
<j`ey> I dont think m1 has it either
<heat> I've just spent all day battling rdrand and negligence and was just wondering
<geist> j`ey: yeah i think it's probably going to show up on servers first is my guess
<mrvn> people don't trust hardware RND
<geist> gotcha
<bslsk05> ​github.com: edk2/RdRand.c at master · tianocore/edk2 · GitHub
<heat> look at this shit
<geist> wat!
<geist> all that aside i mean i think you're supposed to use it with care
<heat> a new patch wants to hook up this stuff automatically in OVMF
<heat> which means broken implementations return broken unsafe results and cpus that don't support it will just crash
<mrvn> .oO(and how is that different from the test function returning False?)
<heat> if the test function returns false you don't use compromised RNG OR crash
<geist> heat: guess it depends on precisely when it was added in the intel and AMD line
<geist> and does TCG properly implement it? i'd be worried about running a build of OVMF in a VM where it's not passed through
<heat> i don't know, does TCG even implement it?
<geist> right exactly. OTOH arguably even a broken implementation that returns all 1s is still entropy, it's just bad
<geist> one shouldn't use the instruction for a random number, but as entropy to feed into a pool
<geist> (i think at least, i usually consult experts when it comes to this stuff)
<heat> yeah except having a literal EFI_RNG_PROTOCOL tempts you to just use it for your early boot KASLR or whatever
<heat> and even if you mix entropy, you just got yourself a lot of predictable entropy. yay?
<mrvn> no, entropy can never get worse, only better
<mrvn> random bits XOR all-1s doesn't make it less random
<heat> I would genuinely prefer a rdtsc fallback over "100 shades of broken" rdrand
<mrvn> unless the rdrand is influenced by the existing pool it can't make it worse
<geist> anyway re: ARM and RNDR, it hasn't been as big of a deal since *most* soc hardware has a hardware block to read random from
<mrvn> just wastes cpu if it's really bad
<geist> or at least it's fairly common. i suspect the instruction spec is mostly for server stuff
<geist> lets a VM get entropy when it's super abstracted from hardware or whatnot
<geist> and also a way to standardize it
<mrvn> hardware block?
<geist> yah like a device
<mrvn> A one-time pad or a random register?
<geist> latter
<geist> and yes i know what you're about to say now
<geist> you're going to try to punch a bunch of holes in that. i am well aware
<geist> same thing: if some random vendor implements a hardware RNG is it any good? Good question! No Fucking Clue
<mrvn> no, wasn't going there. I thought it rather sensible to have an MMIO register for random numbers.
<geist> oh haha
<mrvn> waste of a perfectly good opcode to have that as an opcode.
<geist> okay. well yeah most reasonable ARM hardware i've seen has some sort of RNG device you can read from
<geist> However that means 'how good is the implementation if random vendor can make it?' no idea
<mrvn> it's usually just a diode or open gate that you read and it flutters.
<mrvn> And you are supposed to mix it into an entropy pool. It might be totally biased like giving only 1% 1s but as long as they have no pattern that adds entropy.
<geist> the hard part is something like getting a random number 1ms after booting
dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]
<mrvn> Physically it's not that challenging to build a random bit generator.
<geist> since you haven't had time to stir the pot. and in this case heat is saying EFI will potentially give you garbage
<heat> yup
<heat> particularly on AMD machines I guess
<mrvn> You probably need to init the pool by stiring it a few times first.
<mrvn> and only then pick your random number for KASLR
<heat> since they're saying this is already hooked up on Real World Implementations(tm), I'd be curious to see if there indeed is an RNG_PROTOCOL and if it returns entirely predictable results
<heat> for e.g Ryzen zen3
<geist> there is a random instrution on some AMD machines, i just dont know where it got started
<mrvn> heat: boot a kernel 1000 times and plot the random bits?
<geist> i do remember reading a good whitepaper from AMD on how their hardware works
<heat> AMD RDRAND has been broken since bulldozer
<geist> oh?
<heat> every family since then has had a broken rdrand!
<geist> how so?
<bslsk05> ​lore.kernel.org: [PATCH] x86/CPU/AMD: Clear RDRAND CPUID bit on AMD family 15h/16h - Lendacky, Thomas
<heat> "There have been reports of RDRAND issues after resuming from suspend on some AMD family 15h and family 16h systems."
<geist> that's family 15/16h
<geist> what about 17 and 19? those are zens
<heat> wikipedia says 15h is bulldozer and a few others
<geist> yes. zen 1-2 is 17, zen 3-4 is 19h
<mrvn> heat: sounds like a bios problem, not stiring the hardware pool on resume
<heat> 17h and 19h also had their issues, zen2 I think suffers from the same suspend problem and zen3 just plain returns all-1s ALWAYS (without the microcode fix)
<heat> firmware doesn't stir anything on resume, this is entirely a CPU thing
* geist nods
<mrvn> heat: see the comment onyour url
<mrvn> heat: returning all-1s must be a different problem
<heat> it sounds like "firmware didn't do this hacky solution so now resume is always broken"
<geist> well thats from 2019
<geist> it is from an amd employee, so they seemed to be just functionally giving up
<heat> per the spec rdrand with the proper flag set should return you cryptography-grade randomness
<geist> or at least at that snapshot in time, they might had a quirk MSR bit later that says 'actually it's okay' and then reenable it
<geist> which they have done from time to time
<geist> but not on bulldozers
<geist> since those are dead
<heat> oh yeah geist you might like this small thread: https://twitter.com/aionescu/status/1393728057005920263
<bslsk05> ​twitter: <aionescu> Playing around with my first AMD Ryzen system. Turns out the "AMD PCI Driver" isn't actually a PCI Driver... at all. ␤ ␤ Here's a few fun facts: ␤ ␤ 1) It registers a process creation notify routine, and checks all process names against a list of 19 hashed names.
<mrvn> heat: I assume the CPU has an entropy pool that rdrand returns values from and that only works if the pool is random to begin with. So the firmware is supposed to generate a million random numbers to stir the pool on boot to get it randomized.
<geist> oh haha will have to unblock twitter and read that
<geist> alas right now gonna do some work
<heat> The DRBG is re-seeded frequently from
<heat> an on-chip non-deterministic entropy source to guarantee data returned by RDRAND is statistically uniform, non-
<heat> periodic and non-deterministic.
xenos1984 has quit [Read error: Connection reset by peer]
<heat> In order for the hardware design to meet its security goals, the random number generator continuously tests itself
<heat> and the random data it is generating.
<heat> per Intel SDM volume 1 7.3.17.1
<heat> if amd rdrand requires this then they built a broken rdrand
<heat> s/this/manual stirring/
<mrvn> heat: see, and the firmware is supposed to make sure such a re-seeding happens at boot
<heat> no it's not
<geist> oh ugh. i even have that stupid AMD PCI driver installed
<heat> did you even read it? it's seeded on-chip, re-seeded on-chip, tested on-chip
<mrvn> heat: which would take time. It doesn't pop up random at boot.
<heat> rdrand can fail
<heat> it's explicitly defined that a rdrand "invocation" should call it up to 10 times to get a result
<mrvn> Maybe it's not what intel expected but I can totally see AMD saying firmware has to stir the pool at boot a bit for randomness to happen.
<mrvn> And by stir the pool I mean call rdrand a million times so the re-seeding and self testing and such triggers.
<heat> the AMD manual says nothing. maybe there's something in the confidential FW docs, but I doubt it
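The retry convention heat cites, as a sketch (up to 10 attempts, CF reports success; GCC-style inline asm assumed):

    #include <stdbool.h>
    #include <stdint.h>

    bool rdrand64(uint64_t *out)
    {
        for (int i = 0; i < 10; i++) {
            uint64_t val;
            bool ok;
            /* setc captures CF: 1 means val holds valid random data */
            asm volatile("rdrand %0 ; setc %1"
                         : "=r"(val), "=q"(ok) : : "cc");
            if (ok) {
                *out = val;
                return true;
            }
        }
        return false;   /* persistently failing RNG -- don't trust it */
    }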
netbsduser` has joined #osdev
k4m1_ has joined #osdev
TkTech6 has joined #osdev
<heat> AMD PCI driver is named like actual spyware, it's amazing
torresjrjr_ has joined #osdev
geist_ has joined #osdev
fkrauthan_ has joined #osdev
antranigv_ has joined #osdev
les has joined #osdev
pie__ has joined #osdev
vancz_ has joined #osdev
ElementW_ has joined #osdev
Effilry has joined #osdev
lanodan_ has joined #osdev
vin1 has joined #osdev
sprocket has joined #osdev
outfox_ has joined #osdev
Mutabah_ has joined #osdev
eck_ has joined #osdev
joe9_ has joined #osdev
<mrvn> heat: so any results on that tweet? Has anyone tried benchmarking the games with the original and changed name to see what it does?
kof123 has quit [Ping timeout: 268 seconds]
JTL1 has joined #osdev
<heat> *shrug*
identitas has quit [*.net *.split]
Mutabah has quit [*.net *.split]
ElementW has quit [*.net *.split]
fkrauthan has quit [*.net *.split]
antranigv has quit [*.net *.split]
vin has quit [*.net *.split]
pie_ has quit [*.net *.split]
TkTech has quit [*.net *.split]
vancz has quit [*.net *.split]
les_ has quit [*.net *.split]
qubasa has quit [*.net *.split]
lanodan has quit [*.net *.split]
geist has quit [*.net *.split]
JTL has quit [*.net *.split]
k4m1 has quit [*.net *.split]
torresjrjr has quit [*.net *.split]
sprock has quit [*.net *.split]
eck has quit [*.net *.split]
netbsduser has quit [*.net *.split]
joe9 has quit [*.net *.split]
FireFly has quit [*.net *.split]
outfox has quit [*.net *.split]
torresjrjr_ is now known as torresjrjr
geist_ is now known as geist
TkTech6 is now known as TkTech
fkrauthan_ is now known as fkrauthan
<zid> heat stop causing netsplits
<zid> your shrugs are too powerful
<geist> yeah heat!
<heat> thanos snap but shrug
<mrvn> "We can rebuild him. We have the technology."
elastic_dog has quit [Ping timeout: 246 seconds]
elastic_dog has joined #osdev
xenos1984 has joined #osdev
<zid> My friend made a db of intel cpus and is showing me silly stats :P
<zid> skylake has 7 sockets, LGA2011 has 114 SKUs, the cpu with the most FMA units is Intel® Core™ i7-11850HE Processor with 8
<zid> which is a mobile cpu
<mrvn> float-multiply-add?
<geist> oh as a side note i have found as definitive a list of microarches to cpuids as anywhere else i have seen: https://en.wikichip.org/wiki/intel/cpuid
<bslsk05> ​en.wikichip.org: CPUID - Intel - WikiChip
<bslsk05> ​en.wikichip.org: CPUID - AMD - WikiChip
<geist> in case were interested FWIW
eck_ is now known as eck
<zid> Sandy Bridge (Server) E, EN, EP 0 0x6 0x2 0xD Family 6 Model 45
<zid> checks out
<zid> I am a 0x6:0x2D
<heat> fused-multiply-add i guess
<mrvn> a*b+c is an extremly useful operation
<zid> So if you want some sick avx512 perf, get one of those mobile cpus, add an LN2 pot to the top
<zid> and get crunching
<geist> i remember there was some other weirdness where the early Bonnell atoms (first gen) were totally unbalanced and actually had far more vector processing than they could fill
<geist> one of those weird quirks of design
<zid> intel were doing weird things when they couldn't get 10nm working, saying "only comet lake mobiles get 10nm" and stuff
<zid> which I think is why the laptop cpus have all the avx
<zid> yield issues
kof123 has joined #osdev
<zid> smaller laptop dies in lower volumes = yes, huge 32 core desktop dies in high volumes = no
<mrvn> did intel ever build in more units than needed and then burn some fuses for the units that don't work?
maksy_ has joined #osdev
identitas has joined #osdev
sprocket is now known as sprock
Effilry is now known as FireFly
gxt has quit [Remote host closed the connection]
maksy_ has quit [Ping timeout: 246 seconds]
gxt has joined #osdev
maksy_ has joined #osdev
antranigv_ is now known as antranigv
<zid> so the results are in
<zid> buy W-3375 for your webserver, a 13900k to play dwarf fortress on, and a W-2225 for your desktop
gildasio has quit [Remote host closed the connection]
gxt has quit [Remote host closed the connection]
gxt has joined #osdev
gildasio has joined #osdev
Burgundy has quit [Ping timeout: 255 seconds]
[itchyjunk] has joined #osdev
Mutabah_ is now known as Mutabah
Ellenor has quit [Quit: Bye Open Projects!]
<kaichiuchi> clang scares me now
Vercas62 has joined #osdev
<kaichiuchi> https://godbolt.org/z/Tbcaqcsda <- switch compiler #2 to gcc and observe that the code generated will be the same as compiler #1, but observe now that clang generates what looks to be shit code for this
<bslsk05> ​godbolt.org: Compiler Explorer
Vercas6 has quit [Ping timeout: 255 seconds]
Vercas62 is now known as Vercas6