#osdev on 2022-02-25 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 <clever> geist: i assume the tlb shootdowns are fully automatic, and dont require the target core software to co-operate?

00:00 <geist> correct

00:01 <clever> yeah, no way to map them then

00:01 <geist> you basically use a specific form of the instruction that specifies ASID, VA, if it's a global shootdown, and if it's a terminal or inner node

00:01 JanC_ has joined #osdev

00:01 <clever> and i assume its chip wide, so you cant send a different shootdown to each core

00:01 <geist> and then do a DSB to wait for the other core to ack it

00:01 JanC is now known as Guest6602

00:01 JanC_ is now known as JanC

00:01 <geist> right

00:03 Guest6602 has quit [Ping timeout: 250 seconds]

00:04 <geist> side note: the new amd extensions are functionally identical except they also support flushing by range

00:04 <geist> VA + # pages

00:05 <mrvn> I don't follow that.

00:05 <mrvn> If the TLB entries are per core then on cross core shootdown you only have to translate the ASID from one core to the other.

00:06 <mrvn> or is there an opcode to shoot down a TLB on all cores?

00:07 <geist> yes

00:07 <geist> the latter

00:07 <mrvn> Ok, so if a process runs on many cores you want a global ASID so the shootdown is just one op and no IPI.

00:07 <geist> in which case the ASID you send through only makes sense if it maps to the same thing on the other core

00:07 <geist> right

00:08 <mrvn> This is so much simpler if you don't have threads. :)

00:10 <clever> so you could instead use an IPI and a local TLB clear, to have per-core ASID's

00:10 <clever> but that comes at the cost of IPI's having to interrupt every core, and ack the action

00:11 <mrvn> I don't have threads so I never have to do a shootdown to another core.

00:12 <clever> and even if you have shared memory between procs, the munmap removes it from that procs tables

00:13 <mrvn> no shared memory either

00:13 <clever> so only after both cores unmap a page, will it be free

00:13 <mrvn> My design is based on message passing.

00:13 <clever> if only one proc unmaps, only its pagetables need to update

00:13 <clever> what about the cost of copying the message between memory regions?

00:14 <mrvn> clever: messages are passed and not copied.

00:14 <clever> passed how?

00:14 <mrvn> the page(s) are mapped into the other address space.

00:15 <clever> that sounds like it needs some tlb shootdowns?

00:15 <mrvn> sure, locally by the sender to unmap the page

00:16 <clever> ah, and with no tlb-miss cache, the receiver will check the pagetables again, and discover it?

00:16 <mrvn> that's the idea.

00:17 <sonny> what happened before linker scripts?

00:18 Starfoxxes has quit [Ping timeout: 240 seconds]

00:18 <mrvn> sonny: single compilation units

00:18 <sonny> oh

00:18 <mrvn> hand calculated jump offsets

00:18 <sonny> well that makes sense

00:18 <mrvn> punch cards

00:18 <sonny> oof

00:18 <mrvn> hard wired programming

00:19 <gog> hot

00:19 <sonny> hand calculated ... D:

00:19 <mrvn> People used to programm in a hex editor at some point

00:19 Starfoxxes has joined #osdev

00:19 <sonny> it seems that gcc is responsible for linker scripts?

00:19 <clever> mrvn: ive seen a blog on bootstrapping a compiler

00:19 <klange> define "responsible for"

00:20 bgs has quit [Read error: Connection reset by peer]

00:20 <clever> step 1 was making a program that can convert hex to binary, i think using only opcodes within the ascii range

00:20 bgs has joined #osdev

00:20 <sonny> klange invented

00:21 <gog> klange invented linker scripts til

00:21 <mrvn> clever: I can't see anyone doing that unless it is the only computer they have access too.

00:22 <clever> mrvn: it was more of an example of how to bootstrap assuming you have no tools beyond the ability to run a binary and a text editor

00:22 <mrvn> clever: usualy you have some medium to transport data that you write on a nother system. Even if that means making punch cards.

00:22 <mrvn> or flashing an eprom

00:23 <mrvn> Where did you get the editor from? And if you have an editor why no basic?

00:23 <clever> just an arbitrary set of limits the author started from

00:23 <mrvn> "run a binary" is usualy not a given I think, an interpreter is more likely.

00:24 <clever> mrvn: https://web.archive.org/web/20180223072827/https://www.rano.org/bcompiler.html

00:24 <bslsk05> web.archive.org: Wayback Machine

00:24 <mrvn> Anyone know what it looked like on mainframes? Did they have some basic or something or did you have to load a binary from punch cards?

00:25 <clever> oh, it also relied heavily on a shell with working redirection

00:25 <clever> so he didnt have to parse argv and call open()

00:25 <mrvn> clever: writing a binary from shell with only 7 bit input is pretty hard but possible.

00:26 <mrvn> I mean "cat > mycc" and then type in 7bit opcodes.

00:26 <mrvn> or rather ascii only.

00:26 <clever> i lost the link right now, but i had seen a youtube vid of somebody bootstrapping an old pdp-11 machine i think

00:26 <clever> they had to load some code with both punchcards and a tape emulator i think

00:27 <clever> then compiled it, and wrote it to real tape reels

00:28 <mrvn> clever: you would implement something like forth or scheme in punch cards and then write your assembler in that.

00:28 <clever> yeah

00:28 <clever> the bcompiler above, migrated into an assemble that supported ever-increasing levels of complexity

00:29 <clever> starting out as just a hex->binary program with label support, so you dont have to count bytes

00:29 <clever> and evolving into a macro based compiler that pastes chunks of asm together

00:29 <mrvn> those old systems didn't have the immediate inside the opcode like modern cpus.

00:30 <clever> like the 6502, yeah

00:30 <mrvn> So you can write "hex for jmp" <label>

00:30 <clever> so you can put the immediate in more easily

00:30 <clever> yep

00:31 <mrvn> having a label is a step up from what I had on my C64. I had to hand count bytes.

00:31 <mrvn> On the other hand the C64 has basic so you can write a better assembler in that.

00:32 <clever> yeah

00:33 <mrvn> I never got around to writing a compiler on the C64.

00:35 sonny has quit [Quit: Client closed]

00:38 <clever> mrvn: ah, found it: https://www.youtube.com/watch?v=uFQ3sajIdaM

00:38 <bslsk05> 'The IBM 1401 compiles and runs FORTRAN II' by CuriousMarc (00:23:40)

00:44 blockhead has joined #osdev

00:50 <mrvn> WTF? The first example in the handbook is inverting a matrix? Not hello, world?

00:54 adder has quit [Read error: Connection reset by peer]

00:56 adder has joined #osdev

01:00 <cb> mothers milk for mathematicans

01:06 adder has quit [Read error: Connection reset by peer]

01:08 adder has joined #osdev

01:19 [itchyjunk] has quit [Ping timeout: 240 seconds]

01:23 sdfgsdfg has quit [Quit: ayo yoyo ayo yoyo hololo, hololo.]

01:23 [itchyjunk] has joined #osdev

01:30 masoudd has joined #osdev

01:34 [itchyjunk] has quit [Ping timeout: 240 seconds]

01:37 <klange> interestingly, I've been running this serial console now since last night with no freezes

01:37 <gog> aaay

01:37 <gog> nice

01:38 <gog> did you change adapters?

01:38 <klange> no, but I moved some cables around, maybe it was interface from a power cord ;)

01:38 <gog> aha

01:39 <klange> could also be my host desktop's USB situation just being an absolute mess; I have a hub here that crashes regularly, my Wacom tablet flakes out if I plug in my iPhone _to a wall adapter_ (cables run alongside each other)...

01:43 <clever> my ftdi uart disconnects randomly

01:43 <clever> [155288.462774] usb usb13-port2: disabled by hub (EMI?), re-enabling...

01:45 <klange> [6391889.064768] pl2303 ttyUSB1: usb_serial_generic_read_bulk_callback - urb stopped: -32

01:45 <klange> These probably align with my freezes

01:46 <clever> i keep a `dmesg -w&` open most of the time

01:46 <clever> and that makes it easier to see things lining up

01:47 [itchyjunk] has joined #osdev

02:15 wxwisiasdf has joined #osdev

02:15 <wxwisiasdf> hi

02:16 wxwisiasdf has quit [Client Quit]

02:16 wxwisiasdf has joined #osdev

02:18 srjek has quit [Ping timeout: 240 seconds]

02:20 <Mutabah> wxwisiasdf: Hello.

02:30 not_not has quit [Ping timeout: 272 seconds]

02:31 sdfgsdfg has joined #osdev

02:33 sdfgsdfg has quit [Read error: Connection reset by peer]

02:33 sdfgsdfg has joined #osdev

02:50 sdfgsdfg has quit [Quit: ayo yoyo ayo yoyo hololo, hololo.]

02:53 sonny has joined #osdev

02:56 pretty_d1 has quit [Quit: WeeChat 3.4]

03:11 Belxjander has joined #osdev

03:26 netbsduser has quit [Read error: Connection reset by peer]

03:27 netbsduser has joined #osdev

03:27 <geist> hi

03:30 <kazinsal> aloha

03:35 Belxjander has quit [Ping timeout: 245 seconds]

03:35 <wxwisiasdf> im making an os

03:35 <klange> well you're in the right channel

03:36 <wxwisiasdf> heh

03:43 sdfgsdfg has joined #osdev

03:51 gog has quit [Ping timeout: 250 seconds]

03:54 wxwisiasdf has quit [Quit: Client closed]

04:00 wxwisiasdf has joined #osdev

04:03 wxwisiasdf has quit [Client Quit]

04:07 wxwisiasdf has joined #osdev

04:07 Maka_Albarn has joined #osdev

04:07 <Maka_Albarn> do any of you know if gcc freestanding supports bit fields? and if so, how to use them?

04:08 <moon-child> freestanding should be no different than hosted

04:08 <moon-child> in that respect

04:08 <wxwisiasdf> it does support bitfields

04:08 wxwisiasdf has quit [Client Quit]

04:08 epony has quit [Ping timeout: 240 seconds]

04:08 <Maka_Albarn> so it seems like I just suck at using them.

04:08 <Maka_Albarn> any tips or exampls?

04:09 <moon-child> struct foo { int y: 4; }; makes 'foo' a struct which contains a bitfield 'y' taking up 4 bits

04:09 <moon-child> if you find them confusing, you do not have to use them

04:10 <moon-child> extracting bits directly from ordinarily addressible fields is equivalent

04:10 <moon-child> bitfields just provide an alternate syntax for doing the same thing

04:10 <clever> the main complaint ive heard about bitfields, is that gcc and ms's compiler, put the fields in a different order i think

04:10 <clever> so you run into compatability problems if you want to mix compilers or talk to hw

04:11 <moon-child> who compiles kernels with msvc though? :P

04:11 <clever> heathens that write nodejs in vscode, and still use file->save to save, even after being told about ctrl+s :P

04:12 <moon-child> sounds like an accurate depiction of the windows kernel team

04:13 <clever> another issue i can see with bitfields, is mmio passwords

04:13 <clever> a large chunk of sensitive registers in the rpi only have 24 usable bits, and you must `0x5a000000 | x` every value you write to the register

04:14 <clever> if the 5a is missing, the write is silently ignored, and those bits read back as 0

04:14 <Maka_Albarn> clever: I use vscode, but compile with GCC through WSL Ubuntu

04:14 <Maka_Albarn> muahahahaha

04:14 <clever> Maka_Albarn: you better at least use ctrl+s !

04:14 <Maka_Albarn> i do

04:15 <Maka_Albarn> file -> save is to slow

04:15 <sonny> llvm is available btw

04:15 <clever> and dont do right click -> copy, right click->paste!!

04:15 <sonny> dang, guess I'm a heathen then

04:15 * moon-child just uses :w

04:15 <Maka_Albarn> can you bit field a boolean?

04:16 <sonny> is this a cpp question?

04:16 * Maka_Albarn shrugs

04:16 <sonny> sounds like a trick question xD

04:16 <moon-child> clever: what would be wrong with that? struct { char x; unsigned y; } __attribute__((packed)) *foo; foo->x = 0x5a; foo->y = whatever

04:17 <clever> moon-child: but will the compiler merge the x+y writes together, and issue it to the hw as a single 32bit store?

04:17 <moon-child> oh you mean it might do one 1-byte store and then 1 2-byte store

04:17 <clever> yeah

04:17 <kazinsal> unlikely. it will in fact probably jam up the works as it attempts to do an unaligned access on the ->y

04:17 <moon-child> instead of reading back the x, oring with it, and writing back the result?

04:18 <moon-child> yeah makes sense. Though ^; depends on platform alignment restrictions

04:18 <clever> kazinsal: yep, un-aligned MMIO will misbehave in all kinds of fun ways

04:18 <clever> but with the PW missing, it will do less harm then usual

04:18 <moon-child> :D

04:18 <clever> basically, the bus coming out of the cpu is 32 bits

04:19 <clever> and if you do a mis-aligned 8bit load/store to aligned(32bit)+8bit, then it will present valid data on bits 8:15, with a bus-valid flag for that 8bit section

04:19 <clever> the other bits, may be anything

04:19 <clever> the far end must then match on the address, and map the whole 32bit bus to something

04:20 <clever> for example, if the address doesnt match any valid register in the gpio, then the literal string "GPIO" is presented on that 32bit bus

04:20 <clever> but the cpu was expecting a result on bits 8:15, so only 'P' comes back

04:21 <clever> so, if you do a 32bit read of an invalid register, that is 32bit aligned, you get back "GPIO"

04:21 <clever> if you do a mis-aligned 8bit read, you get 'P' 'I' or 'O', depending of the mis-alignment

04:21 <clever> and an 8bit read with 32bit alignment, gives you an 8bit slice of a real register

04:22 <moon-child> there was one chip that couldn't handle misaligned reads. But it didn't fault when you issued one either; rather, it used the low bits of the address to permute the result

04:23 <clever> but i have also seen other reports, that VPU side of things, is using a 64bit bus

04:23 <clever> somebody used the "load many" opcode, to load 4x32bit registers from MMIO

04:23 <clever> the first 2 cpu registers, got the same MMIO value, from the starting addr

04:23 <clever> and the next 2 cpu registers, got another MMIO value, from a 64bit offset ahead

04:24 <clever> which implies a 64bit bus, and then 8 bits of bus masking, to select what 8bit chunks to obey

04:24 <clever> and when you hit a 32bit MMIO reg, it just matches on the raw addr, and duplicates the reply to fill out the bus

04:24 Maka_Albarn has quit [Ping timeout: 240 seconds]

04:26 joe9 has joined #osdev

04:26 <clever> which now has me wondering

04:26 <clever> if i do a vector load, of 64 bytes (16 x 32bit), what will happen......

04:27 <clever> how will the hw malfunction when abused that hard!

04:29 <clever> > `v32ld HY(0++,0),(r1+=r2) REPx, 11 cycle startup (for L1 hit), plus 2*x, given that (r2%64)==0

04:29 <clever> from my notes

04:29 <clever> so there is a fixed overhead at the start, and then it can load 64 bytes in 2 clock cycles

04:30 <clever> that somehow implies that the bus is 32 bytes(256 bits) wide!?

04:37 k8yun has joined #osdev

04:44 <clever> oh!, its only able to hit those params when reading from the L1 cache

04:44 <clever> is there maybe a 256bit bus between l1 and the cpu, but then narrower going to L2 and dram?

05:04 dormito has quit [Ping timeout: 240 seconds]

05:06 dormito has joined #osdev

05:25 Vercas has quit [Ping timeout: 240 seconds]

05:25 Vercas has joined #osdev

05:29 rcvalle has quit [Quit: Leaving]

05:34 vdamewood has quit [Read error: Connection reset by peer]

05:35 vdamewood has joined #osdev

05:47 ElectronApps has joined #osdev

05:53 k8yun_ has joined #osdev

05:55 <geist> wouldn't be surprised

05:57 k8yun has quit [Ping timeout: 240 seconds]

05:57 <clever> geist: do you think the whole axi bus is 256bit, or just the cpu<->L1 part?

05:57 <geist> probably the cpu->L1. axi being that wide would be probably only in super high end things

05:57 <geist> since... lemme guess... you're talkiong about raspberry pi

05:57 <clever> yep

05:57 <geist> i am going to guesss it doesn't have a bus that wide

05:58 <clever> and given the way scalar load-many opcode winds up repeating a 32bit chunk, i think the axi bus may be 64bit

05:58 <geist> but having a 16 or 32 byte fetch from L1 i would think would be pretty standard

05:59 <geist> the cache line size is probably that wide anyway

05:59 <clever> that also gives me another thought, what if the cache-line yeah, is 32 bytes

05:59 <clever> so the cpu just puts part of the addr onto the bus, and the L1 gives you the whole damn cache-line at once

05:59 <geist> pretty standard, though this is probablyt e VPU you're talking about?

05:59 <clever> and the cpu can then detect the right bits it wants

06:00 <clever> yeah

06:00 <geist> if it was the a53 or a72 you can simply look it up in the manual

06:00 <clever> i can only dream of it being that easy :P

06:01 <clever> assuming a 256bit cacheline, and a 64bit axi bus, that means a burst of 4 transfers is needed to fill the cache-line

06:03 <clever> but i cant access MMIO thru L1, because the cpu knows what MMIO is

06:03 <clever> i'll have to just write some silly asm, abusing MMIO in ways it wasnt meant to, and see what it does

06:05 <clever> ive also been reading https://www.usenix.org/system/files/conference/usenixsecurity17/sec17-koppe.pdf

06:05 <clever> its about amd microcode

06:06 <clever> and now i can see that the https://www.bigmessowires.com/bmow1/ was using horizontal encoding for its microcode

06:06 <bslsk05> www.bigmessowires.com: BMOW 1 Computer | Big Mess o' Wires

06:07 <clever> and vertical encoding is what ive heard elsewhere, where the microcode just translates cisc into risc

06:09 eroux has joined #osdev

06:12 <geist> in general there's a heirarchy of speed as you go from L1 out, so makes sense that the pipe gets narrower

06:12 <clever> yeah

06:13 <geist> though could also be clock rate, etc

06:13 <clever> and if its both narrower and slower, thats more of an exponential speed loss

06:14 <clever> so with my above example, it may take 4 transfers to fill a 256bit cacheline with a 64bit bus

06:14 <clever> but if that 64bit bus is running at half the clock rate (edges aligned), it would take 8 clock cycles

06:14 Electron has joined #osdev

06:14 Electron has quit [Remote host closed the connection]

06:14 <clever> un-aligned, longer, clock domain crossing is hard

06:15 Electron has joined #osdev

06:15 ElectronApps has quit [Ping timeout: 240 seconds]

06:16 epony has joined #osdev

06:17 <geist> at some point you get to the memory bus which is probably 32bit at best on that device

06:17 <clever> *looks*

06:18 <clever> if i decode the ddr2 identification registers, i can see signs of a 32bit bus, your right

06:18 <clever> and the pi3 with 1gig of ram, is running a pair of 16bit 512mb chips in parallel, each on half the bus

06:21 <clever> geist: oh, and i'm also wondering how ddr2 on desktop differs, with all of those slots, and so many chips on each module!

06:22 <geist> well you can look up the width of it. i think in general DDR2 was probaby 32bit wide per chip?

06:22 <clever> just how fat is the bus on the controller?

06:22 <geist> er i mean 32bit per package

06:22 <clever> are any slots or chips sharing data lines, and needing chip-selects

06:23 <clever> yeah, every rpi ddr2 ram package has 32bit bus exposed to the BGA, but some (1gig) have a pair of 16bit die's inside a single package

06:23 <geist> right

06:24 <geist> that's one of the big things thats different about aple M1s

06:24 <geist> they have super wide busses. the M1 pro is 256 bit wide, M1 max 512

06:24 <geist> so it has crazy bandwidth (for a cpu) though more comperable to a high end gpu

06:24 <geist> which also have wide busses

06:24 <clever> if we switch gears for a moment, and think about a ddr2 x86 desktop

06:24 <clever> lets start by assuming i only have 1 memory module in the motherboard

06:24 <moon-child> ehhh

06:25 <moon-child> nice cpu w/4 memory channels will be like 100gb/s

06:25 <moon-child> gpu can get up to 1tb/s (tho probably more like 500-600gb/s)

06:25 <moon-child> m1 is in the middle

06:25 <clever> is every chip on the module accessible in parallel? or are they sharing the data bus, and you use a combination of chip-select and addr to select one row in a single chip?

06:25 <moon-child> definitely can't compete with nice gpus

06:26 <geist> clever: the latter. the odule itself has like a 32bit bus, but each chip is providing 4 bits or so

06:26 <clever> ahh

06:26 <geist> depending on the layout of the module

06:26 <clever> so its doing the exact same thing as the 1gig rpi's

06:26 <geist> i think it changed with DDR5 or so though. the width may have gone up

06:26 <clever> just reduce the bus width, and raid-stripe the 32bit bus over all of the chips

06:27 <clever> ok, so lets say i have 2 identical modules in the motherboard

06:27 <clever> i assume the tricky rules about matched modules, are because the controller is going into a sort of 64bit ddr2 mode? and driving both modules at once?

06:28 <clever> and they need to have the same cas timings?

06:28 <clever> and if you fail to meet those rules, it degrades into 2 32bit busses, possibly only allowing one active at a time?

06:29 <geist> ah looks like DDR2 dimms are 64bits wide. makes more sense because it has 240 pins

06:29 <clever> ah, but the same stripe thing

06:29 <geist> yah

06:29 <geist> 8 x 8 probably. also i'm sure if you have ECC version it's really 72 or dso

06:30 <clever> and then if i have a 4 slot motherboard, is that just a 256 bit bus into the ram controller?

06:30 <clever> and depending on configuration, it will operate in different modes (4 x 64, or 2x128)

06:30 <clever> ?

06:30 <geist> depends on how the motherboard does it. if it has two channels and 4 slots (common) it's two 64bit busses in this case

06:30 <geist> with two dimms per bus

06:30 <clever> yep

06:31 <clever> but if i mis-matched the dimms, it may degrade into 64 + 32 + 32

06:31 <clever> or even give up and just 32+32+32+32

06:31 <geist> doesn't mean the memory controller can't be interleaving things and selecting rows on one rank while another one is bursting etc

06:31 <clever> yeah

06:31 <geist> but that does mean when it's transferring data it's only pulling data off one dimm in a channel at a time

06:32 <geist> i think what M1 does that gives it more fflexibility is it just has 4 or 8 separate controllers at the same time

06:32 <clever> ive also heard rumors that the ddr4 in the pi4 has transparent ecc

06:32 <geist> so it's not that it has a really wide channel, it has a lot of channels

06:32 <geist> and it can stripe/etc however it wants

06:32 <clever> where the dram chip is internally doing ecc, and sending already repaired data down the bus

06:32 <clever> so the host controller isnt even aware of the ecc

06:33 <geist> that reminds me, now that i have a M1 pro i should try to write a memory benchmark

06:33 <geist> i think you can get close to 200GB/sec with it? or was it 100?

06:33 <geist> more so than the oribinal M1 which was still respectibly north of 50GB/sec

06:34 <clever> checking some other random numbers, even though the vector<->L1 is a 256bit bus, there is an 11 cycle overhead at the start of a vector op

06:35 <clever> so when moving max sized blocks, thats 235 bits/clock on avg

06:35 <clever> which comes out to about 13.67 gig/sec

06:36 <geist> that's geerally what i see in mid range single chip ARM devices

06:36 <geist> usually just a bit over 10GB/sec

06:37 <clever> thats at only 500mhz, the arm can get up to 1ghz, but i dont know its configuration as well

06:37 <geist> yah but DDR2 cant be clocked that high

06:37 <clever> yeah, so at some point, youll L1-miss, and performance will tank

06:38 <clever> L2 and uncached reads, are much harder to measure, because of clock domain crossing

06:38 <clever> the numbers are never the same, and depend on the clock ratio

06:41 <clever> geist: and one last question (i think), if ddr4 is clocked at 400mhz, is that 400 million 32bit transfers per second, or 800million (the ddr)?

06:41 <geist> the latter

06:41 <clever> ddr2 i mean

06:42 <geist> so that'd be probably 800 megatransfers * whatever the width is

06:42 <clever> yeah, thats what i thought

06:42 <geist> to get to 13GB/sec seems like you need 64bits there

06:42 <clever> but ive also heard that some companies now put the megatransfers on the spec sheet

06:42 <clever> because bigger numbers are better :P

06:42 pretty_dumm_guy has joined #osdev

06:43 <geist> well, actually makes sense, because its the rate at which its clocking bits off the bus, then you multiply the bus width

06:43 <clever> if one computer has 1600 ram, and another has 3200 ram, which would you buy?

06:43 <geist> actually no 400mhz would work

06:43 <geist> since that'd be 800MT/sec * 32 bits

06:44 <clever> > (800 * 1000 * 1000 * 32) / 8 / 1024 / 1024 / 1024

06:44 <clever> 2.9802322387695312

06:44 <clever> thats 2.9gig/sec?

06:44 <clever> vs the 13.6gig/sec the L1 cache can do

06:44 <geist> should be around 25GB/sec?

06:45 <kazinsal> 25.6 Gbps, yeah

06:45 <clever> > (800 * 1000 * 1000 * 32) / 1024 / 1024 / 1024

06:45 <clever> 23.84185791015625

06:45 <geist> though that's bits, so... anyway

06:45 <clever> ah, 2.9gigbyte, aka 23.8 gigabit

06:46 <geist> hmm, that doens't add up how you can get 13GB/sec out of it

06:46 <clever> > (500000000 * 235) / 8 / 1024 / 1024 / 1024

06:46 <clever> 13.67880031466484

06:46 <clever> 500mhz bus, 235 bits/clock on avg (overheads), to bytes, to gig

06:47 <kazinsal> should also be noted that MT/s and Gbps are always SI-prefixed

06:48 <clever> > (800 * 1000 * 1000 * 32) / 1000 / 1000 / 1000

06:48 <clever> 25.6

06:48 <clever> which makes the ddr2 400mhz bus capable of 25.6 gigabit/sec

06:49 <geist> all this aside raspberry pi 4 uses newer stuff

06:49 sonny has quit [Quit: Client closed]

06:49 <clever> > (500 * 1000 * 1000 * 235) / 1000 / 1000 / 1000

06:49 <clever> 117.5

06:49 <clever> and the L1 cache is 117.5 gigabit/sec

06:49 <clever> yeah, the pi4 has a ddr4 controller, running at much higher clocks, and support for up to 16gig of ram

06:50 <geist> looks like LPDDR4 at 3200mhz which comes out about 12.8GB/sec (not GiB) which is basdically what you observed

06:50 <geist> 3200 * 32 / 8

06:50 <clever> and given the major clock gating changes they did, they arent just pasting a pre-laidout set of gates for anything

06:50 <clever> like ive heard about with some esp? chips

06:51 <clever> i found a thread on twitter, where somebody was taking about a line of tiny MCU's, that where designed back before cad was as heavily involved

06:51 <clever> and you could visibly see where they just cut the cpu out of the chip design, and routed the bus over to a modern arm core

06:51 <clever> leaving a giant void in the middle, lol

06:53 <clever> oh, there was also a big forum thread a few years back, where people where trying to benchmark the pi4's ram, and they where getting rather poor numbers, and the rpi engineers basically could only say a few things

06:54 <clever> 1: your testing it wrong, its way faster

06:54 <clever> 2: due to NDA, we cant say how fast

06:54 * geist rolls eyes

06:54 <geist> just say it you dipshits your stuff is not that good

06:54 <geist> s/due to NDA/due to embarrasment/

06:55 <clever> yeah, i dont get what all this secrecy is for

06:55 <clever> who really benefits?

06:55 <geist> lawyers run the show

06:55 <clever> https://twitter.com/whitequark/status/1352335100424450052

06:55 <bslsk05> twitter: <whitequark> here's what happens if you upload the SiFive FU740 SoC manual somewhere. does this behavior remind you of someone? https://pbs.twimg.com/media/EsR1g2oW4AIdlVO.jpg

06:55 <geist> same thing with qualcomm. the default is to make stuff secret unless you expend energy to make it otherwise

06:55 <geist> and energy == money

06:56 <clever> ah, and here is the twitter thread i mentioned

06:56 <clever> wait no

06:56 <clever> wrong one

06:57 <clever> not sure where it went

06:57 <kazinsal> the rpi guys don't want to give concrete performance numbers because they know their whole architecture is lowest-bidder stuff held together with digital duct tape

06:57 <clever> there was also something similar recently

06:57 <clever> kazinsal: the graphics guy did admit on the forums, that the vc6 core is an unfinished and unreleased product

06:58 <clever> and they basically just shoe-horned the new 3d core into the old vc4 design

06:58 <clever> and that is where the bcm2711 came from

06:59 rcvalle has joined #osdev

06:59 <clever> geist: the new pi4 beta firmware, has a new bootmain.elf component, that is entirely accessing hw blocks that have open source drivers, so there arent any secrets left for them to hide, but this is the responce to asking for source: https://forums.raspberrypi.com/viewtopic.php?p=1975352#p1975352

06:59 <bslsk05> forums.raspberrypi.com: Network install beta test feedback - Page 5 - Raspberry Pi Forums

06:59 <clever> > Unfortunately, software license agreements don't work like that. .....

06:59 <kazinsal> it's amazing what happens when you accidentally create an extremely successful low-cost linux-compatible appliance board using whatever chip you could buy 100,000 of for the cheapest off of Digikey

07:00 <kazinsal> (you have to keep making newer and better low-cost linux-compatible appliance boards using cheap chips you can buy by the crateload from Digikey)

07:00 <clever> kazinsal: there was a time when small companies could buy bcm2835 directly from broadcom

07:00 <clever> but the first group to do that, violated the rpi firmware license, by running it on a non-rpi board

07:00 <clever> and the doors have been sealed shut ever since

07:01 <clever> but, with my understanding of the hw and the open firmware, a board could be designed that isnt compatible with the rpi firmware

07:01 <clever> so its then impossible to do that again

07:07 <clever> > (216 * 1000 * 1000 * 109) / 1000 / 1000 / 1000

07:07 <clever> 23.544

07:07 <clever> geist: oooo, interesting!, in past testing, i was getting about 109bits/clock from uncached ram, at 216mhz, which comes out to 23.54gbit/sec, and 400mhz ddr2 clocks in at 25.6gbit!

07:08 <clever> so i was getting the ddr2 bus to 91% of max load then

07:08 the_lanetly_052 has joined #osdev

07:09 <clever> from memory, i was loading the same 4kb array in a tight loop

07:16 <geist> seems about right

07:18 <clever> and if i raise the vpu to 432mhz, it now takes 593 clocks to do a 4kb load, avg of 55 bits/clock, which is 23.75gbit/sec

07:18 <clever> right in the same ballpark

07:19 <clever> confirming that the ddr2 was 400mhz/800mt, helps confirm those numbers

07:19 <geist> but what rpi was this?

07:20 <geist> one of the earlier ones because DDR2

07:20 <clever> probably a pi3

07:20 <clever> but the entire pi0-pi3 range has nearly identical performance, if you ignore the arm core

07:20 <clever> the same dram init code works on every model

07:26 Mutabah has quit [Ping timeout: 240 seconds]

07:26 Mutabah has joined #osdev

07:30 <clever> oh right, but its not 400mhz perfect, one min

07:33 <clever> its 398.4mhz ddr2 ram

07:33 <clever> so 25.4976gigbit, not 25.6gigbit, not that big of a loss

07:34 <kazinsal> yeah bus base clocks are usually somewhat variable between 99.6 and 100.4 MHz

07:35 <clever> > (19.2 * 0x53)/4

07:35 <clever> 398.4

07:35 <geist> 19.2 is a common crystal, so yeah makes sense

07:35 <clever> 19.2mhz crystal, 0x53 divisor in the driver source, /4 found by experimentation

07:37 <clever> but, now that i can compare vectorloads and expected ddr2 bandwidth, i could just raise the ram to 403.2mhz, and see if it still works, and if i get the expected performance increase

07:40 nitrix has quit [Ping timeout: 256 seconds]

07:40 nitrix has joined #osdev

07:47 theruran has quit [Quit: Connection closed for inactivity]

07:49 <clever> checking a random banana pi r1 (allwinner a20) board, i see it has a pair of 4Gb 256Mx16 1600Mbps modules from samsung

07:50 <clever> 1gig of ram total i believe

07:50 <geist> DDR2 is also pretty old

07:50 <clever> https://semiconductor.samsung.com/dram/ddr/ddr3/k4b4g1646d-byk0/

07:50 <bslsk05> semiconductor.samsung.com: K4B4G1646D-BYK0(4Gb) | DRAM | Samsung Semiconductor Global

07:51 <clever> yeah

07:52 <clever> oh, and youve mentioned seeing the bcm2711's pci-e core in other soc's, any chance its got a common name or public docs?

07:52 Payam has joined #osdev

07:54 <geist> design ware i think

07:54 <geist> DWC pci

07:54 <clever> ah, them again!

07:54 <geist> i've seen it over and over again. they seem to implement the basic PCIe goop that vendosr pick up

07:55 <geist> yah xhci controllers tend to be DWC too

07:55 <clever> which reminds me, is there a vendor string in xhci by chance?

07:55 <geist> good question, dunno

07:55 <geist> someone here that's written an xhci controller might know

07:55 <clever> given the root hub shows up in lspci, that string must come from somewhere...

07:56 <clever> 01:00.0 USB controller: VIA Technologies, Inc. VL805 USB 3.0 Host Controller (rev 01)

07:56 <clever> a pci-e xhci controller shows up as this

07:57 <clever> wait doh

07:57 <geist> that's probably just pci

07:57 <clever> Bus 001 Device 002: ID 2109:3431 VIA Labs, Inc. Hub

07:57 <clever> yeah, that first one was lspci, *doh*

07:57 <clever> the 2nd is lsusb

07:57 <clever> now, if i flip on the 2nd xhci controller...

07:58 xenos1984 has quit [Read error: Connection reset by peer]

07:59 <klange> lspci uses a database, no vendor/device description strings in pci

07:59 <clever> i now see an extra `Linux Foundation ?.0 root hub` that wasnt there before

07:59 xenos1984 has joined #osdev

07:59 <clever> this 2nd xhci isnt on a pci bus

07:59 <klange> isn't 001.002 on USB the hub that connects the USB2 ports?

08:00 <geist> probably the generic name the latter

08:00 <clever> i'm confirming which is which now

08:01 <clever> ah, there is both a `Linux Foundation 3.0 root hub` and a `Linux Foundation 2.0 root hub`, neither was there before!

08:02 <clever> let me try to exclude the pcie one from the mess

08:04 <clever> root@pi400:/sys/class/pci_bus/0000:01/device# echo 1 > remove

08:04 <clever> bit of a big hammer, the entire pci-e bridge vanished from lspci, lol

08:04 <klange> i have done that before

08:04 <clever> but this 2nd xhci and the gigabit are both non-pcie

08:04 <clever> so its fine, lol

08:05 <klange> as is the sd, wifi, bluetooth

08:05 <klange> video, of course...

08:05 <clever> yep

08:05 <clever> https://gist.github.com/cleverca22/ccf79400c6dc15098a6e5b7259dd193b this would be the 2nd xhci controller

08:05 <bslsk05> gist.github.com: gist:ccf79400c6dc15098a6e5b7259dd193b · GitHub

08:05 <klange> gods for a chipset that's promoted for being cheap, the hodpodge of interconnects and buses is _hilarious_

08:05 <clever> ive been refering to it as the broadcom xhci, because its directly in the soc, and its not the via-labs one

08:06 <clever> oh, you dont even know :P

08:06 <clever> there are even interconnects on the usb ph

08:06 <clever> phy*

08:06 <clever> 2 entirely different usb controllers, are sharing 1 usb phy!!

08:07 <klange> hm, cart-horse as it's not like I have an xhci stack yet anyway, but would my Apple C dongle work to provide power while giving me a USB-A port so I can poke that controller instead...

08:08 <clever> probably

08:25 Electron has quit [Remote host closed the connection]

08:26 Electron has joined #osdev

08:27 rcvalle_ has joined #osdev

08:30 rcvalle has quit [Ping timeout: 256 seconds]

08:46 [itchyjunk] has quit [Read error: Connection reset by peer]

09:04 k8yun_ has quit [Quit: Leaving]

09:09 zaquest has quit [Remote host closed the connection]

09:11 zaquest has joined #osdev

09:25 _xor has quit [Quit: brb]

09:45 Electron has quit [Remote host closed the connection]

09:58 ravan has joined #osdev

10:09 <gorgonical> I have just learned about intel's protection keys scheme. Stumbled upon it from a paper hacking the mechanism to allegedly improve ipc vs security tensions

10:13 GeDaMo has joined #osdev

10:22 Jari-- has quit [Ping timeout: 272 seconds]

10:36 gog has joined #osdev

10:44 _xor has joined #osdev

10:55 pretty_dumm_guy has quit [Ping timeout: 240 seconds]

10:58 <moon-child> gorgonical: have a link? Sounds interesting

10:59 <gorgonical> it's 4.6.2 in the sdm vol 3. kernel docs have a link here: https://www.kernel.org/doc/html/latest/core-api/protection-keys.html

10:59 <bslsk05> www.kernel.org: Memory Protection Keys — The Linux Kernel documentation

11:00 <gorgonical> basically introduces 16 regions that give read and write enable/disable in userspace by jamming a few bits into the page table addresses

11:00 <gorgonical> something like a hack that made it into hardware

11:00 <gorgonical> i'm very tired so that explanation may not make any sense

11:02 <moon-child> doesn't quite make sense, but I'm also tired, so math checks out :P

11:07 ElectronApps has joined #osdev

11:32 Payam has quit [Ping timeout: 256 seconds]

12:01 MrBonkers has quit [Quit: ZNC 1.7.5+deb4 - https://znc.in]

12:20 Bonstra has quit [Ping timeout: 250 seconds]

12:24 Bonstra has joined #osdev

12:32 not_not has joined #osdev

12:32 <not_not> Hi

12:32 <g1n> hi not_not

12:38 <klange> okay it's arguably terrible and not at all the fancy thing NTP defines, but I can at least set the clock on the RPi after booting... after discovering some terrible wrong math in 'mktime' that I guess I just wasn't using

12:39 <klange> also my CD FAT images weren't building _again_, and for some reason I am not catching the 'disk full' error the tool spits out so everything happily continues and I get an empty ramdisk...

12:39 <klange> because of course it's already put everything else in there and it's the 16MB ramdisk that is too much...

12:40 dennis95 has joined #osdev

12:40 DonRichie has quit [Quit: bye]

12:43 freakazoid333 has quit [Ping timeout: 245 seconds]

12:56 <not_not> Hi dennis95

12:57 <not_not> Ur name is Dennis?

12:57 <dennis95> yes

12:59 <not_not> Mine too

12:59 <not_not> I am dennis91

13:00 <not_not> Maybe people called Dennis Are more prone to os development like Dennis ritchie

13:10 not_not has quit [Ping timeout: 272 seconds]

13:19 ymwm has joined #osdev

13:27 Payam has joined #osdev

13:27 not_not has joined #osdev

13:39 Payam has quit [Quit: Client closed]

13:48 Payam has joined #osdev

14:16 gildasio has quit [Remote host closed the connection]

14:20 gildasio has joined #osdev

14:21 dude12312414 has joined #osdev

14:24 nyah has joined #osdev

14:31 sdfgsdfg has quit [Quit: ayo yoyo ayo yoyo hololo, hololo.]

14:35 <mrvn> moon-child: the layout of bitfields is implementation defined, more specifically it's in the C calling conventions. But generally bits are packed either high to low or low to high. Memebers of the struct will use the bit width of the type you use but get packed as long as they don't cross a boundary of the given type. The used type(s) determine the alignment and size. So { char a:5; char b:5; char c:5; } will

14:36 <mrvn> be 3 byte, alignment 1. { short a:5; short b:5; short c:5; } will be 2 bytes alignment 2. At least for any sane compiler/abi.

14:37 <mrvn> moon-child: also never use packed. On ARM a packed means the compiler MUST create byte access and your MMIO register will totaly break.

14:38 <mrvn> I'm not quite sure about what access the compiler is allowed to do with bitfields, e.g. { int a:8; int b:8; int c:8; int d:8; } Is the compiler allowed to do byte access or does it have to read/write int and mask?

14:41 <mrvn> https://godbolt.org/z/ns1MK5Gfd apparently gcc does byte access.

14:41 <bslsk05> godbolt.org: Compiler Explorer

14:44 <not_not> Nice always wondered that

14:52 <mrvn> moon-child: another point to your bitfield. You hope the compiler will merge multiple writes to bits into a single 32bit write. But MMIO register need to be volatile. The compiler is not allowed to merge the writes as you declared that each has a observable effect.

14:52 srjek has joined #osdev

14:55 <mrvn> moon-child: One thing I played with is a union of uint32_t and bitfield as temporary object. The read/write use the uint32_t of the union, the user uses the bitfield. So you read the register, toggle a few bits and then write it back, which you can do with RAII.

14:58 the_lanetly_052_ has joined #osdev

15:01 the_lanetly_052 has quit [Ping timeout: 256 seconds]

15:07 gildasio has quit [Ping timeout: 240 seconds]

15:10 gildasio has joined #osdev

15:11 not_not has quit [Read error: Connection reset by peer]

15:11 gwizon has joined #osdev

15:27 zaquest has quit [Read error: Connection reset by peer]

15:28 zaquest has joined #osdev

15:33 [itchyjunk] has joined #osdev

15:35 blockhead has quit []

15:37 ymwm has quit [Ping timeout: 272 seconds]

15:43 gxt has quit [Remote host closed the connection]

15:44 gxt has joined #osdev

15:55 freakazoid343 has joined #osdev

16:00 [itchyjunk] has quit [Read error: Connection reset by peer]

16:00 ElectronApps has quit [Remote host closed the connection]

16:09 [itchyjunk] has joined #osdev

16:18 sonny has joined #osdev

16:36 ymwm has joined #osdev

16:47 mctpyt has joined #osdev

16:50 freakazoid343 has quit [Read error: Connection reset by peer]

16:57 ymwm has quit [Quit: Leaving]

17:00 sonny has quit [Ping timeout: 256 seconds]

17:09 sonny has joined #osdev

17:30 sonny has quit [Quit: Client closed]

17:48 pretty_dumm_guy has joined #osdev

18:05 k8yun has joined #osdev

18:08 eroux has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

18:15 not_not has joined #osdev

18:16 <not_not> Wow

18:22 isaacwoods has joined #osdev

18:26 dennis95 has quit [Quit: Leaving]

18:36 xenos1984 has quit [Remote host closed the connection]

18:37 xenos1984 has joined #osdev

18:38 scoobydoob has joined #osdev

18:40 scoobydoo has quit [Ping timeout: 256 seconds]

18:40 scoobydoob is now known as scoobydoo

18:43 mctpyt has quit [Ping timeout: 256 seconds]

18:46 vdamewood has quit [Read error: Connection reset by peer]

18:46 vdamewood has joined #osdev

18:53 mctpyt has joined #osdev

19:07 not_not has quit [Read error: Connection reset by peer]

19:09 <geist> mrvn: yeah i've fiddled with that too. works pretty good *if* you can be sure that on all the arches you use the bitfields line up

19:09 <geist> depending on what arches you support, etc

19:19 pretty_dumm_guy has quit [Ping timeout: 240 seconds]

19:31 <mrvn> geist: doing this for the 16650 uart might be tricky and need a #ifdef around both orders to pick the right one.

19:32 gorgonical_ has joined #osdev

19:33 <mrvn> Hmm, kind of breaks freestanding. It has to pick an abi for the bitfields.

19:42 * mrvn likes: auto & avl() { return Bits<5,6,7>; } sort of things.

19:45 srjek has quit [Ping timeout: 240 seconds]

19:48 wootehfoot has joined #osdev

19:59 k8yun has quit [Quit: Leaving]

20:19 gorgonical_ has quit [Read error: Connection reset by peer]

20:38 the_lanetly_052_ has quit [Ping timeout: 260 seconds]

20:44 Teukka has quit [Read error: Connection reset by peer]

20:48 Teukka has joined #osdev

21:02 simpl_e has joined #osdev

21:02 gildasio has quit [Quit: WeeChat 3.4]

21:06 <moon-child> mrvn: I see. I haven't actually used bitfields in practice (never seemed worthwhile over manually fudging bits, and seemed rather twiddly). Guess I made the right choice!

21:06 <moon-child> though temp union with integer and bitfield is clever

21:07 <mrvn> not quite legal C though. You are only allowed to read the type out of a union you wrote to it. Using it to convert between uint32 and bitfield is not legal.

21:08 <mrvn> but it's such a nice way to access bits.

21:09 <mrvn> Saddly you also meet stuff like the x86 page tables where an address is split into multiple parts.

21:09 <kingoffrance> there's legal, lawful, grace, alchemy -- in that order ;D

21:09 <mrvn> you forgot magic

21:28 Payam has quit [Quit: Client closed]

21:29 xenos1984 has quit [Remote host closed the connection]

21:30 xenos1984 has joined #osdev

21:36 xenos1984 has quit [Remote host closed the connection]

21:36 xenos1984 has joined #osdev

21:39 <moon-child> mrvn: that's not true

21:39 <moon-child> it's only illegal in c++

21:39 <moon-child> it's fine in c

21:51 chigorin is now known as australopithecus

21:54 australopithecus is now known as chigoringrigorin

21:54 chigoringrigorin is now known as australopithecus

22:02 <sham1> > 23:07 <mrvn> not quite legal C though. You are only allowed to read the type out of a union you wrote to it.

22:02 <sham1> Not ever since C1999, you can do type punning with unions

22:03 GeDaMo has quit [Remote host closed the connection]

22:10 <mrvn> sham1: iirc if you have structs in the union that start the same then you can access the start. But as soon as they diverge you can only access the type you wrote.

22:11 <mrvn> Before C99 a union didn't even have to use the same address for all it's members.

22:13 <mrvn> anyway, the bitfield is already implementation defined and the implementation also says what happens when you type pune. So we are good.

22:13 <sham1> mrvn: oh but you can, although it's only on a footnote: http://www.open-std.org/jtc1/sc22/wg14/www/docs/n1256.pdf

22:14 <sham1> 82) If the member used to access the contents of a union object is not the same as the member last used to store a value in the object, the appropriate part of the object representation of the value is reinterpreted as an object representation in the ne w type as described in 6.2.6 (a process sometimes called "type punning"). This might be a trap representation.

22:16 srjek has joined #osdev

22:16 <mrvn> sham1: "the appropriate part of the object representation" is what?

22:16 <sham1> So for example, `union foo { uint32_t bits; float f; } f = { .f = 3.14195f };` and you can use `f.bits` to get the bit representation of the float, and that's completely valid in C1999 and beyond. At least that's how the footnote is interpreted

22:17 <sham1> Although I do feel that `float f = 3.14195f; uint32_t bits; memcpy(&bits, &f, sizeof(f));` feels nicer

22:18 <mrvn> sham1: assuming sizeof() is the same.

22:18 <sham1> Sure

22:19 <sham1> But we can essentially assume IEEE 754

22:19 <mrvn> nope.

22:19 <sham1> Especially in the context of osdev of course

22:19 <mrvn> It's implementation defined

22:20 <mrvn> The only thing you actually know is that uint32_t is 32 bit if it exists.

22:20 <sham1> Yes, standards sense it is indeed implementation defined, but you see what I'm driving at

22:20 <mrvn> so not even that strictly speaking.

22:20 <mrvn> sham1: If you accept implementation defined behavior then it's all good. Which we do.

22:22 <mrvn> sham1: My point was that if you have "union { struct Base { enum type; } base; struct Foo { enum type; ... } foo; struct Bar { enum type; ...} bar;} blub" then blub.base.type is always defined.

22:22 <mrvn> Before C99 that was implementation defined too.

22:23 <sham1> Yes, and it was in ANSI-C. It's just that now after 1999 that's possible for other stuff as well

22:24 <mrvn> In Ansi-C you could do #define union struct

22:25 [itchyjunk] has quit [Remote host closed the connection]

22:29 adder has quit [Read error: Connection reset by peer]

22:36 blockhead has joined #osdev

22:40 <mrvn> sham1: oh, one more nit-picking: memcpy(&bits, &f, sizeof(f)) is UB: The memory areas must not overlap.

22:41 wootehfoot has quit [Quit: Leaving]

22:41 <sham1> Yes, although I'd like to see an implementation which places two automatic storage duration variables in such a way that they'd overlap

22:42 <mrvn> oh, you didn't mean copying the members of the union, then never mind.

22:43 sdfgsdfg has joined #osdev

22:43 <sham1> Yeah. I do consider using memcpy with variables and such nicer than doing weird type punning with unions. My point was just that it's possible and in the standard

22:43 <mrvn> I wonder if memmove(&f.bits, &f.f, sizeof(f)}; becomes a nop

22:44 dmh has quit [Quit: rip]

22:47 <sham1> Apparently: https://godbolt.org/z/1GbfTTqnx

22:47 <bslsk05> godbolt.org: Compiler Explorer

22:49 <mrvn> and now I know

23:32 sdfgsdfg has quit [Quit: ayo yoyo ayo yoyo hololo, hololo.]

23:47 pretty_dumm_guy has joined #osdev