#osdev on 2022-05-28 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:03 <dasabhi> geist: probably explains why MIT ported xv6 to riscv, i am just getting started with riscv

00:03 <geist> yah it's almost certainly going to be the thing that lots of universities teach with now

00:11 <mrvn> oh yeah, lets teach on some hardware none of the students have and most can't afford to buy.

00:11 <zid> That was always the idea of riscv to begin with from what I understand

00:11 <zid> replace MIPS as the teaching arch

00:18 X-Scale has quit [Ping timeout: 256 seconds]

00:20 <mrvn> Anyone have and devs on a 64bit MIPS?

00:27 ripmalware_ has quit [Remote host closed the connection]

00:27 ripmalware_ has joined #osdev

00:31 Jari--- has joined #osdev

00:31 <Jari---> Morning

00:32 <mrvn> evening

00:32 <moon-child> afternoon

00:33 <Jari---> In Japan it is noon?

00:34 <zid> we just had one

00:34 <mrvn> noon? happens quite frequently.

00:35 <klange> it's 9:35am

00:35 <klange> lol

00:39 <mrvn> I vaguely remember there being a delete_iterator from my pre c++11 days. Was that ever a thing? What has replaced that?

00:43 netbsduser` has joined #osdev

00:44 X-Scale has joined #osdev

00:45 netbsduser has quit [Ping timeout: 246 seconds]

00:47 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

00:50 <dasabhi> you guys think we will get riscv machines soon enough in the cloud?

00:51 <dasabhi> i had some virtualization ideas for riscv and xv6, but i doubt i should mess around with virtualization on qemu

00:53 <klange> I see no real incentive for riscv-based cloud resources at this point in time. Hardware costs are the main factor at the moment, but ecosystem is also a key thing to keep in mind.

00:53 <geist> probably not for a while. i think the H extension only just got ratified

00:53 <geist> so need some hardware to actually exist to get real virtualization

00:54 <geist> i did think i remember seeing something about KVM support for it going in? I'm kinda curious now, should sit down to try to grok the H extension

00:54 <geist> i think qemu supports it now

00:54 <geist> mrvn: somewhere i have a mips dev board somewhere, can run linux. i think it's 32bit though?

00:54 <geist> creative something, is i think the board

00:54 <geist> fairly recent, like last 5 years

00:55 <geist> https://elinux.org/MIPS_Creator_CI20 ah, mips32. nope

00:55 <bslsk05> elinux.org: MIPS Creator CI20 - eLinux.org

00:55 <dasabhi> qemu can support the H extension even before the guys at berkley ratify it?

00:55 <dasabhi> so in theory i can run a really shitty riscv hypervisor with emulated ring -1 ? LOL

00:56 <geist> qemu sure why not? most likely it's basically stable for a long time before it gets ratified so qemu is probably pretty safe implementing it. may actually be the main development platform for the extension itself

00:56 <dasabhi> yeah that makes sense

00:57 <geist> though i thought it recently got ratified. lemme see

00:59 Likorn has quit [Quit: WeeChat 3.4.1]

00:59 <klange> I'm still in the "riscv seems neat, let me know when there's reasonable hardware for it" stage... perplexing to me that so many of the "budget" options are things with a dozen 'media engine' things bolted on.

01:01 <geist> also FWIW I just saw that the hypervisor extension is at version 1.0 and in the latest privileged spec 2.2

01:01 <geist> so i guess it's ratified

01:05 <mrvn> Does riscv beat out ARM on the MIPS-per-Watt?

01:06 <geist> i dont think that's an arch specific question

01:06 <geist> that has everything to do with implementation details

01:06 <geist> also depends on what the sector is. is this a deeply embedded machine? something with page tables? etc

01:06 <mrvn> geist: cloud service

01:07 <geist> in the super deeply embedded space i'd assume that an 'e' spec riscv core could get pretty close to a cortex-m0 in terms of implementation, since it's about as complex

01:07 <geist> oh who the hell knows. no such hardware exists

01:08 <heat> ask western digital

01:08 <heat> or nvidia

01:08 <heat> they're big fans of RISCV ;)

01:09 <heat> geist, issue with getting a real board rn is that all the CPUs are still crap

01:09 <geist> sure

01:09 <heat> paying 400 euro for a top notch SiFive board and getting half the performance of a rpi 4b is... underwhelming

01:10 <geist> it all depends on what you want to do with it. i fiddle with vaxen and 68ks and by definition those cpus are 'crap' too

01:10 <heat> yeah but those were designed a long time ago, not literally now

01:10 <geist> i hear you loud and clear

01:13 <geist> i assume some day hardware will catch up in which case i'll be ready to roll

01:13 <geist> but if you wanna use real hardware and only happy if it's performant, that's totally a valid call

01:15 <heat> right

01:15 <heat> I don't need performance, just decent

01:15 <mrvn> performant? That just means the kernel crashes sooner. :)

01:15 <geist> heat: then you have to define decent

01:16 <zid> It'll be interesting to see how quickly they can make a crap isa fast

01:16 <heat> but for 400 euro, yeah I definitely ask for performance at that price point

01:16 <geist> sure. what are your requirements? there are various boards out there i can point you at if you'd like

01:16 <geist> that are less than 400

01:16 <heat> geist, run linux/my OS decently, with like 2010 perf(?)

01:16 <geist> can you be more specific?

01:16 <geist> like 32 bit? 64bit? how much ram? does it require hard ethernet?

01:16 <geist> what about storage

01:17 <geist> 2010 perf what? an equivalent x86 at the time?

01:17 <heat> oh yeah 64-bit for sure, and like 512MB possibly (although I could probably get it lower)

01:17 <zid> 2010 perf puts you at ridiculous perf if you're willing to upclock the chips of the time..

01:17 <heat> storage should be rpi-like, ethernet would be nice

01:17 <geist> the 2010 perf thing is unreasonable, but aside from that i think i can find something

01:18 <mrvn> nvme would be nice

01:18 <geist> probably visionfive. it's a lot cheaper

01:18 <zid> https://www.cpubenchmark.net/cpu.php?cpu=Intel+Core+i7-980X+%40+3.33GHz&id=866 Imagine that at 5GHz

01:18 <bslsk05> www.cpubenchmark.net: PassMark - Intel Core i7-980X @ 3.33GHz - Price performance comparison

01:18 <geist> https://ameridroid.com/products/visionfive-starfive

01:18 <geist> i have one on order, actually wondering when that'll come in

01:18 <zid> 2500 is a good single core ranking at 5GHz *today*, fwiw

01:19 <geist> thats an actual sifive superscalar dual core + something < $400

01:19 <geist> it's still $200 of course

01:20 <heat> right, that's too much for me, for basically a toy board

01:20 <geist> hmm, unclear when it ships though, maybe that's why my preorder hasn't come through, but i think that's the best bet for something kinda rpiish

01:20 <geist> then okay, what are your pricing constraints?

01:20 <geist> keep in mind it's hard to get ahold of ARM boards of similar horsepower at that price either

01:21 <geist> except rpi, (when it's available)

01:21 <zid> They're low volume and that's damn near a custom part

01:21 <heat> around the price of a raspberry pi 4b/400

01:21 <zid> so $200 is good tbh

01:21 <geist> yeah, rpi has set the bar so low i dont think *anything* can get down there that's not a rpi

01:21 <geist> also you get what you pay for since the broadcom chip on it is a piece of crap

01:22 <geist> well some of the odroid stuff is fairly low too, but keep in mind most of those are shipping with 10 year old arm cores too

01:22 <geist> so it's all a tradeoff one way or another

01:23 <geist> ie, https://www.hardkernel.com/shop/odroid-n2-with-4gbyte-ram-2/ is a bit nicer, so yeah it has a price premium

01:23 <bslsk05> www.hardkernel.com: ODROID-N2+ with 4GByte RAM – ODROID

01:23 <kazinsal> I'm still waiting for a risc-v board with a bunch of ethernet on it

01:23 <geist> then keep waiting. finding a good *arm* board with a bunch of ethernet is difficult

01:23 <geist> or at least cheap

01:23 <kazinsal> yeah, it sucks

01:23 <kazinsal> hell the lead times on x86 boards with a bunch of ethernet are pretty intense now

01:23 <zid> pci-e and a load of ethernet on a stick? :P

01:24 <kazinsal> pcengines is now in the "yeah, we might have some this year" stage

01:24 <geist> gosh remember when intel tried to make that a thing? oh the computer on a stick yay

01:24 <zid> https://www.ipcdevice.com/pic/products/IEC-95X8.jpg bwahaha I love it

01:24 <kazinsal> ha, yeah, you can find a lot of neat weird cards like that

01:25 <geist> kazinsal: oh did you hear i got the server machine to fail with a 3900x in it? so it's not the cpu per se

01:25 <kazinsal> oh interesting

01:25 <zid> moar volts

01:25 <heat> can the CPU keep up with all the ethernet?

01:25 <zid> It's not about the perf it's about sending a message

01:25 <zid> 8 times

01:25 <geist> i just did a bios update and am running it some more to see if that magically fixes it, otherwise i'll get a new mobo or maybe switch the power supply

01:25 <kazinsal> wonder if you've got some bad vdroop on one of the power supply lines yeah

01:26 <geist> zid: well to be fair i have recently discovered ethernet bonding and how relatively nice it is

01:26 <geist> and linux does a pretty good job with it

01:26 <zid> ISDN was basically bonded dialup

01:26 <zid> an OC-48 is 48 T1 lines bonded together etc

01:26 <zid> it's always been a really normal thing

01:26 <geist> my synology nas for example has a 4 way bonded setup, and the unifi switch loves it

01:27 <geist> though of course it only really helps if you have a lot of clients and their connections tend to hash to separate nics

01:27 <kazinsal> yeah

01:27 <zid> Can you do it like RAID and set the mtu 4x as big and get 1/4 of a packet over each bitwise

01:27 <zid> then recombine them

01:28 <geist> no that's not how it works. basically the switch and the OS agree to the hashing algorithm and it does a L2 level hash of the connection so that ports from a single client arrive in order
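
The hashing geist describes can be sketched in a few lines of C. This is only an illustration of the idea (same flow, same link, so no reordering), not the code of any real bonding driver or switch; `struct eth_header` and `pick_link` are hypothetical names, and the mixing function is an arbitrary choice.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical frame header: only the fields the hash looks at. */
struct eth_header {
    uint8_t  dst[6];
    uint8_t  src[6];
    uint16_t ethertype;
};

/* Pick an outgoing link for a frame. Frames with the same source,
 * destination and ethertype always map to the same link, so a single
 * conversation is never spread (and reordered) across links. */
size_t pick_link(const struct eth_header *h, size_t link_count)
{
    assert(h != NULL && link_count > 0);
    uint32_t hash = h->ethertype;
    for (size_t i = 0; i < 6; i++)
        hash = hash * 31u + (uint32_t)(h->src[i] ^ h->dst[i]);
    return hash % link_count;
}
```

Because the choice depends only on header fields, one client talking to one server never gets more than a single link's worth of bandwidth, which is the limitation geist mentions about needing many clients.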

01:28 <zid> I know it isn't how

01:28 <zid> I'm asking if it's possible at all

01:28 <zid> It'd be funnier

01:28 <geist> oh i suppose

01:28 <kazinsal> I mean in theory it's possible but it wouldn't be clean

01:28 <kazinsal> and I've never seen any NICs that would allow you to do that

01:28 <geist> yah the reassembly across the nics would be hard

01:29 <zid> Yea reassembly would be hard unless you were like.. rtosing

01:29 <geist> anyway, thinking of replacing the random asus consumer board with a https://www.newegg.com/asrock-rack-x470d4u-amd-ryzen-2nd-generation-series-processors/p/N82E16813140023?Item=N82E16813140023&quicklink=true

01:29 <bslsk05> www.newegg.com: AsRock Rack X470D4U Micro ATX Server Motherboard AM4 Ryzen - Newegg.com

01:30 <zid> I like the idea of a server board but they all suck

01:30 <zid> in terms of features

01:30 <geist> it's a bit pricey, but gets reasonable reviews and has an actual IPMI thing and whatnot

01:30 <geist> main bummer is no 10gbe

01:30 <zid> no voltage twiddling, no stupidly overspecced VRMs, etc

01:30 <zid> but you can get 16 dimm slots...

01:34 <heat> https://wiki.osdev.org/Loading_files_under_UEFI gotta love the cheeky ad

01:34 <bslsk05> wiki.osdev.org: Loading files under UEFI - OSDev Wiki

01:34 <heat> i should do that too

01:35 <heat> "you could write an OS... but if you usE ONYX THE BEST OPERATING SYSTEM EVER GITHUB.COM/HEATD/ONYX IT WOULD BE A LOT EASIER"

01:35 <bslsk05> github.com: GitHub - heatd/Onyx: UNIX-like operating system written in C and C++

01:36 <klange> bzt wrote the whole article, so it's no surprise really

01:37 <kazinsal> naturally

01:38 <heat> klange, bzt isn't around anymore right?

01:38 <heat> i want to remove that and not cause a flamewar

01:38 <kazinsal> I could have sworn he was unbanned at some point

01:39 <klange> bzt had many poor interactions in violation of rules, was unbanned after some time, and then proceeded to do the same thing again

01:39 <kazinsal> some people just can't post good

01:40 <heat> "Sadly it is not so easy, considerably more complicated than reading sectors," <-- how is using a filesystem driver with Open(...), Read(), Write() harder than reading sectors?

01:40 <klange> It really is a shame because he was a solid technical contributor, when he wasn't shilling his own shit and starting fights.

01:40 <klange> heat: I think he means compared to the EFI interfaces for reading block devices directly, which aren't too bad

01:41 <klange> the filesystem APIs are full of stupid

01:41 <heat> no they're not what

01:41 <heat> tell me what stupids they have

01:42 arch_angel has joined #osdev

01:42 <geist> klange: yah always a bummer when you have someone that just sits there and ruins their whole thing they built

01:42 <mrvn> The switch could just fragment the frame and have the NIC or OS reassemble the frames.

01:42 <geist> by just being a complete asshat

01:43 <kazinsal> you can't fragment layer 2 frames

01:43 <mrvn> heat: read/write has to keep track of the position. Use pread/pwrite. :)

01:44 <geist> kazinsal: yah you'd probably need to define a new ethertype that just contains fragmented normal ethernet frames or something

01:44 <mrvn> kazinsal: but nobody is using that for large payloads

01:44 <geist> heat: maybe a compromise is to move the ad down to a new section

01:44 <geist> like some sort of link to projects that might help

01:44 <mrvn> IP fragments just fine

01:45 <geist> vs it just being an ad on the second sentence

01:45 <kazinsal> and IP doesn't care about NICs and bonding and all that

01:45 <heat> mrvn, it's a simple open() -> read/write() -> close() API with GetPosition and SetPosition to do lseek

01:45 <mrvn> geist: can you set the hash method to use the sequence number of IP packets?

01:45 <kazinsal> also IP fragmentation makes things just really goddamn complicated and it should be illegal

01:46 <geist> i think we had this discussion before but ipv6 *does* allow fragmentation or not?

01:46 <geist> or was it simply that routers cant do it because that's not the routers problem in v6?

01:46 <mrvn> kazinsal: modern NICs de-fragment unfragmented IP packets. Meaning with a MTU=1500 you get 10k frames and such.

01:46 <heat> yes they allow fragmentation on v6

01:46 <heat> but it needs to be explicit

01:46 <heat> routers can't do it

01:47 <heat> you actively need to know the path's MTU to send a larger IPv6 message (or fragment it into chunks)

01:47 <mrvn> geist: imho if you get fragmentation you are doing something wrong. It's not like the MTU varies in unpredictable ways across the internet.

01:47 <geist> well, yah

01:48 <mrvn> MTU = path MTU should be the default.

01:48 <heat> ?

01:48 <geist> and yeah that's right, nic based fragmentation and reassembly is probably the main reason it still exists

01:48 <geist> since it's very convenient for the stack to blat out 64k packets and let the nic deal with it

01:49 <mrvn> geist: I believe the NICs split the data into packets, not fragments.

01:49 <geist> well, they have to do it in a way thats valid to the other side, because there's no assumption the other end can reassemble it

01:49 <mrvn> exactly.

01:50 <heat> they split it into fragments as well

01:50 <mrvn> heat: your local NIC?

01:50 <heat> tcp segmentation offloading makes packets, the others need to make fragments

01:51 <mrvn> we are talking about the former

01:51 <heat> no we're not

01:51 <kazinsal> that is a completely different train of thought

01:52 <geist> yah but it segments using ip fragments or does it just keep making new tcp packets?

01:52 <mrvn> heat: at least I hope geist was when he mentioned dumping 64k blobs into the NIC.

01:52 <geist> probably the latter now that i think about it

01:52 <heat> geist, packets (proper RFC793 segments)

01:52 <geist> yah i guess that's more ideal: take a larger tcp packet and retransmit it (at the nic) as a series of smaller ones

01:53 <heat> I think it says somewhere that you shouldn't fragment a TCP segment (hence the need for path MTU)

01:53 <kazinsal> yeah, I think it's a SHOULD NOT

01:53 <kazinsal> not a MUST NOT

01:54 <mrvn> heat: you have to either handle the fragments correctly or send an ICMP_FRAGMENT. DSL routers do neither.

01:54 <kazinsal> aka "you're going to do it anyways but please for everyone's sanity consider not doing it"

01:54 <heat> you'll get killed if you ignore a MUST

01:54 <klange> heat: maybe just personal opinions, but BufferSize being in-out, file paths being UTF-16 (and no I will not accept "but everything is UTF-16 in EFI" as an excuse for this)...
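
For readers who have not met the in-out size convention klange mentions, here is a minimal C sketch of the pattern. It is not the UEFI interface itself: `copy_out` and the `STATUS_*` values are hypothetical, and only the shape (status in the return value, size passed in and updated on the way out) is the point.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

enum { STATUS_OK = 0, STATUS_BUFFER_TOO_SMALL = -1 };

/* On entry *buf_size is the capacity of buf.
 * On return it is the number of bytes written, or, if the buffer was
 * too small, the number of bytes the caller needs to provide. */
int copy_out(const void *src, size_t src_len, void *buf, size_t *buf_size)
{
    assert(buf_size != NULL);
    if (*buf_size < src_len) {
        *buf_size = src_len;          /* report the required size */
        return STATUS_BUFFER_TOO_SMALL;
    }
    if (src_len > 0)
        memcpy(buf, src, src_len);
    *buf_size = src_len;              /* report what was actually written */
    return STATUS_OK;
}
```

The caller has to remember to reset the size before every call, which is one reason the convention draws complaints.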

01:54 <heat> you'll just get shot if you ignore a SHOULD

01:55 <heat> klange, that's just MSFT API design

01:55 <heat> like literally the rest of UEFI is like this

01:55 <kazinsal> april fools RFC idea: introduction of the PLEASE DON'T requirement level keyword

01:55 <zid> Yea that's just winapi

01:55 <mrvn> klange: can your UEFI boot <rocketship><female><medium skin tone><2 children family>.pe?

01:56 <heat> no

01:56 <mrvn> (female astronaut with 2 kids)

01:56 <heat> it's not UTF-16 but UCS-2

01:56 <zid> UCS-2 is what *W is in winapi yea

01:56 <heat> and you clearly have an unhealthy obsession with unicode and emojis

01:57 <klange> As someone who works in internationalization, it is hard to pick which is worse between the two.

01:57 <zid> I work in internationalization in that I just use utf-8 and let god sort it out

01:57 <zid> aka freetype

01:58 <zid> It tells me how big of a texture to use or whatever and I say "yes daddy"

01:58 <klange> UCS-2 is outdated, but at least a codepoint is a codepoint. UTF-16 gets you the whole range, but surrogate pairs were a bigger mistake than Han unification.

02:00 <mrvn> UTF-16/UCS-2 are the worst of both worlds: Wasteful and insufficient to cover everything with constant size.

02:01 <heat> ok but

02:01 <heat> it's late 90s - early 2000s microsoft

02:01 <mrvn> no, sorry, the but emoticon is on a different code page

02:01 <heat> they literally had nt kernel people help out with UEFI

02:02 <heat> you were bound to get microsoft design and code and you got it

02:02 <mrvn> M$ just wanted to keep using their wchars.

02:03 <heat> let's not M$ this

02:04 <mrvn> heat: want to laugh? https://stackoverflow.com/questions/72412102/what-is-the-safest-way-to-cast-a-pointer-type-to-any-other-pointer

02:04 <bslsk05> stackoverflow.com: c++ - What is the safest way to cast a pointer type to any other pointer? - Stack Overflow

02:04 <zid> The winapi people are contractually obligated to do BIGSTRUCTP a; DWORD out; ThisFunc(a, &out);

02:05 <mrvn> .oO(auto *t = std::thread_safe_pointer_cast<T>(p);)

02:05 <heat> "Making the DLL thread-safe won't solve the issue, but it will help."

02:06 <mrvn> zid: hey, no returning structs. That would be something a compiler from this millennium would support.

02:06 <heat> you don't return structs in winapi/UEFI because the return type is reserved for EFI_STATUS

02:06 <heat> or HRESULT

02:07 <mrvn> god, let there be std::expected

02:07 <heat> it's cleaner than the errno global variable

02:07 <heat> that got pretty much #define'd into a TLS variable

02:07 <mrvn> can't wait for c++23 support

02:09 <mrvn> heat: modern languages can return multiple values.

02:10 <heat> do you really want to fight that battle?

02:10 <mrvn> even c++ has jumped on the wagon: auto [quotient, remainder] = divide(14, 3);
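
Plain C can get close to mrvn's example by returning a small struct by value. A minimal sketch follows; `struct div_result` and `divide` are made-up names for illustration, and the standard library's `div()` from `<stdlib.h>` offers the same thing with `div_t`.

```c
#include <assert.h>

/* Two results travel back together in one by-value struct. */
struct div_result {
    int quotient;
    int remainder;
};

struct div_result divide(int dividend, int divisor)
{
    assert(divisor != 0);
    struct div_result r = { dividend / divisor, dividend % divisor };
    return r;
}
```

The cost is that the return slot is no longer free for a status code, which is the trade-off heat describes for the EFI_STATUS and HRESULT style.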

02:10 <heat> modern languages have decently integrated language features

02:11 <heat> modern languages do things inside the compiler instead of building piles of language magic in header files and "oh no why is it slow to compile?"

02:11 <zid> The HRESULT thing deals nicely with the whole "is 0 failure or success" thing too tbh, you just enum BLAH_OK and BLAH_ERRn to whatever matches

02:12 <heat> i like 0 as a success, negative error values

02:12 <heat> but that's UNIX-me talking :)

02:13 <zid> yea but it's contra to the whole.. true false thing, unfortunately

02:13 <heat> mrvn, by the way if you want a modern language we've got a GSoC student working rust support for EDK2 :)

02:13 <heat> working on*

02:14 <klange> don't worry, i already have a modern language in efi

02:17 <zid> We've still not figured C out properly yet, geez

02:17 <zid> You guys are rushing ahead

02:21 <mrvn> zid: (-1)[&i[a]] = 666;

02:25 <mrvn> Little things I want to fix in C: void free(void **)

02:25 <clever> to null the ptr out automatically?

02:26 <mrvn> exactly

02:26 <clever> simplest option, make a safer_free with that api, and then make the real free harder to link to

02:26 <clever> either by hiding the symbol or warning about uses not in a whitelist
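
A minimal sketch of what mrvn and clever are describing, assuming nothing beyond the C standard library. `safer_free` is clever's name for the wrapper; the `FREE_AND_NULL` macro is an extra assumption added here, because a typed pointer such as `int *` cannot be passed as `void **` without a cast.

```c
#include <assert.h>
#include <stdlib.h>

/* Free the allocation and null out the caller's pointer, so a later
 * use-after-free or double free hits NULL instead of stale memory. */
void safer_free(void **pp)
{
    assert(pp != NULL);
    free(*pp);      /* free(NULL) is a no-op, so calling twice is safe */
    *pp = NULL;
}

/* Macro variant that works with any pointer type. */
#define FREE_AND_NULL(p) do { free(p); (p) = NULL; } while (0)
```

Neither form helps with other copies of the same pointer, which still dangle after the call.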

02:27 <mrvn> stop passing (uninitialized) buffers to functions, have them allocate and return buffers

02:28 <clever> but constructors!

02:28 <clever> and virtual constructors!

02:28 <mrvn> factory factories, jippey

02:29 <heat> how would you fread() a buffer into MAP_SHARED memory if you can't pass buffers to functions

02:29 <mrvn> simple: mmap() the data directly.

02:29 <mrvn> memcpy

02:29 <mrvn> splice

02:30 <heat> it's a pipe and I don't have splice

02:30 <mrvn> one more thing to fix :)

02:31 <mrvn> read() should just map the pipebuf into the user address space and return the address

02:33 <mrvn> There could also be a mread() for that special case

02:34 <heat> mread() -> you mean read()?

02:35 <mrvn> heat: no, read into an mmap()ed buffer that may copy data or remap pages.

02:35 <mrvn> the plain read should return a fresh buffer.

02:38 <mrvn> I kind of want write() to free the buffer but that's kind of problematic with EINTR.

02:43 <mrvn> readv/writev are underused

02:43 <dasabhi> geist: speaking of hardware, do you know if qemu emulates some shitty gpu?

02:43 <klange> It emulates many shitty GPUs.

02:44 <klange> Including some ATI cards!

02:44 <dasabhi> i know it emulates a NIC, because xv6 makes you build a tcp/ip stack for a shitty nic with documentation for the shitty nic

02:44 <dasabhi> interesting

02:44 <dasabhi> so i could technically write a gpu driver

02:44 <dasabhi> on xv6

02:44 <geist> well, i mean depends on what you mean by shitty gpu

02:44 <klange> QEMU emulates a wide range of hardware, some of which is _not_ shitty.

02:44 <mrvn> dasabhi: do you want an accelerated graphics card or actually do GPU calculations?

02:44 <klange> Like... several of the NICs.

02:45 <dasabhi> mrvn: i have zero clue how graphics works and i am looking to fix that with xv6

02:45 <dasabhi> so those are just words to me

02:45 <dasabhi> i guess both

02:45 <mrvn> dasabhi: then it totally emulates a ton of shitty and not so shitty stuff.

02:45 <dasabhi> accelerated graphics so like video codec stuff?

02:45 <geist> okay so maybe you mean video output?

02:45 <heat> virtio-gpu is decent

02:45 <heat> the rest... eh not so much

02:45 <geist> to many folks 'gpu' means something quite specific

02:45 <heat> mostly glorified framebuffers

02:45 <mrvn> dasabhi: could be something as simple as scrolling the screen

02:46 <geist> though it also tends to mean 'video device' or whatnot

02:46 <geist> or 'everything to do with putting crap on monitors'

02:46 <clever> in the case of the raspberry pi, the 2d subsystem (turning framebuffers into a video signal) is entirely separate from the 3d subsystem (turning polygons into a 2d image)

02:47 <geist> right, which is generally still kinda the case, even if it's implemented on the same physical device (ie a video card)

02:47 <dasabhi> so i am really looking for something with some nice documentation, xv6 points us to a shitty NIC with nice documentation that helps you learn network drivers

02:47 <clever> and the video encode/decode is also its own separate block

02:47 <mrvn> does qemu emulate the RPi 2D system now?

02:47 <dasabhi> and qemu emulates said nic

02:47 <heat> dasabhi, which nic?

02:47 <clever> mrvn: it only emulates the mailbox framebuffer api, not the true 2d hardware

02:47 <dasabhi> let me find out hold on

02:47 <geist> yeah my question

02:47 <clever> mrvn: so the emulation is of what the firmware wrapper around the hw exposes to linux

02:47 <geist> waiting for them to point out e1000 or something

02:48 <mrvn> clever: so only the bits that linux uses and nothing more :(

02:48 <dasabhi> xv6 makes you write code for an E1000

02:48 <heat> hahahaha

02:48 <clever> mrvn: even less!, some of the new stuff about the number of framebuffers is missing, so the modern linux drivers bail out and dont even drive the display

02:48 <geist> heat: ahahaha

02:48 <heat> that's like the best documented NIC you'll ever find ever

02:48 <dasabhi> i guess a cool thing to do would be to get xv6 to render a teacup

02:48 <geist> yah it's kinda the gold standard of nics right now

02:49 <zid> e1000e only has docs from like 2005 though, the newer chips are behind the intel insider thing still :(

02:49 <mrvn> clever: ever thought of writing a different API for the VC and linux?

02:49 <clever> mrvn: i have

02:49 <dasabhi> oh damn i didnt know NIC docs were that hard to get

02:49 <klange> Even the newest stuff has a degree of backwards compatibility, and the Linux drivers got split up mostly because supporting the whole range in one driver was becoming idiotic.

02:49 <clever> mrvn: with the open firmware, the VC firmware is currently in complete control, so linux loses its 2d acceleration

02:49 <geist> zid: not so sure, i got some fairly up to date docs, but indeed the 10gbe variants i think are harder to get docs for

02:50 <mrvn> clever: yeah, bring that back please.

02:50 <geist> right, e1000e seems to be the split where things started going multi-queue and heavily into MSI

02:50 <clever> mrvn: if i fix linux's access to the 2d hw (it has some bugs), then the VC becomes locked out, so i need to come up with an api for sharing the hw

02:50 <heat> dasabhi, good hw docs are hard to get because hardware people really don't bother

02:50 <geist> that i think was a line in the sand making it hard to support

02:50 <heat> the e1000/e1000e are like the best documented NICs you'll ever find

02:50 <zid> Thankfully these days hardware people at least drop a linux driver

02:50 <heat> and probably in the top 20 of general hardware documentation (barring CPUs)

02:50 <dasabhi> any gpu version of an e1000?

02:50 <geist> right, for opposite reasons than before: i think lots of hardware companies write a linux driver and then claim it's basically documentation

02:51 <geist> whereas say 20 years ago they just would be secretive about everything

02:51 <zid> Our cheap chinese clone card MUST be kept secret

02:51 <mrvn> clever: that's what I meant above. :) I remember the accelerated framebuffer interface had just a few simple functions. Maybe just implement an interface for that. Mostly scrolling stuff

02:51 <heat> geist, can they? what's the legality of using GPLv2 source code as docs?

02:51 <heat> dasabhi, right. depends on what you mean by GPU

02:51 <clever> geist: RPF has been doing pretty much that with every hw block they are allowing to have open drivers

02:51 <geist> no, but theres no legality about docs

02:51 <geist> it's more like they just say 'see linux driver' instead of bothering to release docs

02:52 <clever> mrvn: i did recently implement scrolling in LK's graphics console

02:52 <dasabhi> hmm i guess i could do rotating tea cup myself on a frame buffer

02:52 <geist> which is strictly speaking better than no docs or driver at all

02:52 <heat> the virtio-gpu is probably the gold standard of 3D acceleration in VMs

02:52 <dasabhi> but then i am just dumping memory to a sector of memory and calling it a day

02:52 <clever> mrvn: compare these 2 videos: https://ext.earthtools.ca/private/rpi/standard-gfxconsole.mp4 https://ext.earthtools.ca/private/rpi/faster-console-1.mp4

02:52 <heat> BUT the 3D acceleration parts aren't particularly well defined AFAIK, you need to dig for those

02:52 <dasabhi> virtio-gpu

02:52 <dasabhi> got it

02:53 <geist> heat: if what you mean is can you consider GPLv2 stuff to be docs for something that's not GPLv2

02:53 <heat> the rest are basically glorified framebuffers

02:53 <geist> that's a good question and i'd rather not push it

02:53 <heat> yup exactly

02:53 <geist> in general i avoid it because i dont know what my code will be used for downstream

02:53 <geist> and i'd rather not be dragged in front of a bunch of lawyers about it later on

02:53 <heat> geist, i've seen some FreeBSD code that looks like it was written looking at linux code

02:54 <mrvn> clever: that makes me miss my C64

02:54 <mrvn> faster-console-1.mp4

02:54 <mrvn> loads much slower

02:54 <clever> mrvn: the standard console, is a single dumb framebuffer, with memcpy() to scroll

02:54 <geist> also keep in mind getting dragged in front of a lawyer is less so about whether or not you broke the law or whatnot, it has everything to do with someone making money

02:54 <clever> mrvn: while the faster console, just treats the bitmap as a circular buffer, and just wraps around to the top and never scrolls

02:54 <mrvn> clever: hehe, the faster just flashes and it's all scrolled through.

02:55 <geist> so if you have some situation where you think your little cog in the wheel ends up being part of some thing that is worth something, be careful

02:55 <heat> dasabhi, for real hw you could look at the i915 but that's a whole different ballpark

02:55 <clever> mrvn: but it then uses the 2d composition unit to slice the framebuffer in 2, and render it at the right offsets

02:55 <klange> The closest thing in GPU land to an e1000 NIC as far as documentation goes (and which exists as real hardware) is, appropriately enough, probably Intel's i965 series? But then nothing emulates one of those.

02:55 <zid> 82810 for life

02:55 <zid> I have a physical one still somewhere

02:55 <heat> klange, these days you can get one in a VM

02:55 <mrvn> clever: yeah, you mentioned that before. Nice little trick to avoid copying anything.

02:55 <clever> mrvn: and it dynamically moves the slice, as the circular buffer's seam moves
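
The trick clever describes (treat the bitmap as a ring, then have the composition hardware show it as two slices) reduces to a little arithmetic. The sketch below is an assumption-laden illustration, not clever's LK code: `struct slice`, `ring_slices` and `ring_scroll` are hypothetical names, and the compositor that would consume the slices is left out.

```c
#include <assert.h>

/* One rectangle for a (hypothetical) compositor: show `rows` rows of the
 * bitmap starting at `src_row`, placed at screen row `dest_row`. */
struct slice {
    int src_row;
    int rows;
    int dest_row;
};

/* The bitmap has `height` rows and is used as a ring. `seam` is the
 * bitmap row that currently belongs at the top of the screen.
 * Writes 1 or 2 slices to out and returns how many. */
int ring_slices(int height, int seam, struct slice out[2])
{
    assert(height > 0 && seam >= 0 && seam < height);
    /* Rows from the seam to the end of the bitmap go at the top. */
    out[0] = (struct slice){ seam, height - seam, 0 };
    if (seam == 0)
        return 1;
    /* Rows that wrapped around to the start go underneath. */
    out[1] = (struct slice){ 0, seam, height - seam };
    return 2;
}

/* Scrolling one row: the oldest row gets overwritten with the new
 * line, and only the seam moves. No pixels are copied. */
int ring_scroll(int height, int seam)
{
    assert(height > 0 && seam >= 0 && seam < height);
    return (seam + 1) % height;
}
```

Compared with the memcpy() console, a scroll costs one integer update plus reprogramming two rectangles, independent of the framebuffer size.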

02:55 <geist> oh i740 or gtfo

02:55 <clever> mrvn: yep

02:55 <klange> What emulates a 965? Or do you mean via passthrough while running a different card for the host?

02:56 <heat> the VT-(insert weird intel letter here, it's g I think) emulates an intel GPU inside the kernel and passes it through using vfio

02:56 <clever> mrvn: under linux, the /dev/fb0 api uses a virtual resolution and x/y offsets, but you're going to hit the edge of the virtual screen eventually, and then what?

02:56 <zid> isn't i*** the chipset

02:56 <zid> the 82810 is the gpu on the i810

02:57 <heat> klange, not exactly an i965 but all the intel GPUs have great docs

02:57 <mrvn> clever: It's something I want to incorporate into my framebuffer interface. I already have slices so you can copy a subset of a framebuffer. But having 2 or N blobs would be even better.

02:57 <heat> (great for the boatload of shit they do)

02:57 <geist> i740 was the predecessor to the i810 and was up until recently the only discrete gpu intel ever made

02:57 <geist> pretty rare, kinda a collectors item

02:58 <clever> mrvn: to implement the above, i had to add viewport support into my sprite framework, so now every visible image has a true w/h, a dest xy, a dest w/h (scaling), and a viewport xywh (cropping)

02:58 <mrvn> clever: the usual way is to have a FB with double height and draw everything twice. Then when you hit the bottom you can reset to the top and it's the same data.

02:58 <mrvn> wasteful and slow
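
The double-height trick mrvn describes can be sketched in C roughly as follows. This is only a sketch against a plain memory buffer, not any real driver API: `struct scroll_fb`, `scroll_fb_write_row`, `scroll_fb_scroll` and `scroll_fb_get` are hypothetical names, the tiny dimensions are for illustration, and on real hardware `y_off` would be handed to the display as its pan offset.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define FB_W 4   /* visible width in pixels (tiny, for illustration) */
#define FB_H 3   /* visible height in rows */

/* Hypothetical double-height framebuffer: 2*FB_H rows back FB_H visible rows. */
struct scroll_fb {
    uint32_t pix[2 * FB_H][FB_W];
    unsigned y_off;   /* first visible row, always in [0, FB_H) */
};

/* Write visible row `row` (0 = top of the screen). The row is stored twice,
 * FB_H rows apart, so the window [y_off, y_off + FB_H) always contains it. */
static void scroll_fb_write_row(struct scroll_fb *fb, unsigned row,
                                const uint32_t *src)
{
    assert(row < FB_H);
    unsigned r = (fb->y_off + row) % FB_H;
    memcpy(fb->pix[r], src, sizeof fb->pix[r]);
    memcpy(fb->pix[r + FB_H], src, sizeof fb->pix[r]);
}

/* Scroll up by one row: advance the window, wrapping back to the top once it
 * would leave the top half. Both halves hold the same data, so the wrap is
 * invisible. The caller then overwrites the new bottom row. */
static void scroll_fb_scroll(struct scroll_fb *fb)
{
    fb->y_off = (fb->y_off + 1) % FB_H;
}

/* Read back the pixel currently shown at visible position (x, row). */
static uint32_t scroll_fb_get(const struct scroll_fb *fb, unsigned x,
                              unsigned row)
{
    assert(x < FB_W && row < FB_H);
    return fb->pix[fb->y_off + row][x];
}
```

Every row write costs two copies, which is the "wasteful and slow" part; what it buys is that scrolling never has to move existing pixel data.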

02:58 <clever> mrvn: yep, thats exactly how android does page-flipping on framebuffers

02:58 <clever> thats also how i found a nasty bug in the original rpi firmware

02:59 <clever> when you change the y-offset to flip, the firmware clears the buffer

02:59 <clever> so android only ever displayed solid black

02:59 <klange> heat: ah GVT-g

03:00 <heat> aww jeez I was close

03:00 <clever> mrvn: and the "fix" i found in the rpi-android fork of linux, was for the driver to double the virtual height, after the userland had already doubled it once

03:00 <heat> the naming scheme is so clear and simple to understand

03:00 <clever> mrvn: the quad height virtual size, was rejected by the rpi firmware, causing android to just disable pageflipping

03:01 <heat> klange, anyway, the i915 series cards (but you should better pick a gen or two) are the best bet to get open source 3D acceleration going

03:02 <heat> for real cards that is

03:02 <klange> yeah, and like the e1000 there's some degree of backwards compatibility on them

03:02 <heat> yes

03:02 <heat> that's not necessarily good though

03:02 <klange> which was the real sin of that powervr chipset they tried to push as an "intel" gpu

03:03 <heat> the backwards compatibility makes for really hard to read drivers

03:03 <heat> the e1000e and the i915 linux drivers are the hugest mess of spaghetti if's and "do this if you're one model, do that if you're another model, ..."

03:04 <clever> heat: ive seen the same mess in dwc, and genet drivers

03:04 <heat> but, ya know, over 20 years

03:04 <clever> the rpi's 2d hvs drivers are slightly better, given that there are only 2 variants

03:04 <clever> so far...

03:04 <klange> It's a trade off, and of course Linux - and any maintained OS with the resources to put into it - will want to make the most of every new feature.

03:05 <heat> yeah but it's not just new features

03:05 <klange> But from our side of things, building hobby systems where we're lucky to get even the bare minimum working, it's nice to have hardware where that's still going to work as new things get added.

03:05 <geist> yah exactly. you pick and choose what hardware to support, hopefully trying to get something to unlock that part of the os

03:06 <heat> klange, good chunks of the GPU get moved around between gens

03:06 <heat> for instance, the sequence to drive the display of the i915 GPUs changes so significantly that most functions have _<gen> variants

03:07 <heat> like i915_enable_display_hsw, i915_enable_display_skl, etc

03:07 <klange> Which is an example of _not_ having good backwards compatibility, not an example of backwards compatibility being bad.

03:08 <heat> it's just not backwards compatible, the design looks similar but you can't try to use your original i915 driver and expect it to work on newer stuff

03:08 <dasabhi> wait so how do you guys render shit to the screen

03:08 <dasabhi> a lot of you have solid working OSes

03:08 <heat> you plot to the framebuffer

03:09 <dasabhi> ah so you just do the math and drop shit to the framebuffer,

03:09 <heat> *(framebuffer + pixel_off) = 0xAA0000; <-- hey look it's red now

03:09 <dasabhi> never touching and hardware to do the math faster is that right?

03:09 <heat> it's slow

03:09 <dasabhi> any hardware*

03:09 <heat> hm?

03:09 <heat> making hardware do the math and draw is way faster

03:09 <geist> the key is to know what a 'framebuffer' is. in general it's a byte/word/etc per pixel

03:10 <geist> so it looks like memory that you simply drop colors into at particular offsets

03:10 <klange> There is roughly one "hobby OS" worth mentioning that has an actual accelerated graphics pipeline, and it notably implemented Linux APIs to assist with that and only has minimally functional drivers for actual hardware.

03:10 <geist> and that makes a pixel light up
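
What heat and geist describe ("drop colors into memory at particular offsets") can be sketched in C like this. It assumes a linear framebuffer with 32 bits per pixel in 0x00RRGGBB layout, which is a common but not universal format; `struct fb_info`, `fb_put_pixel` and `fb_fill_rect` are hypothetical names, and a real driver would normally mark the mapping `volatile` and take the base address and pitch from the bootloader or firmware.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical description of a linear 32-bit-per-pixel framebuffer. */
struct fb_info {
    uint8_t *base;    /* start of framebuffer memory */
    uint32_t width;   /* visible pixels per row */
    uint32_t height;  /* number of rows */
    uint32_t pitch;   /* bytes per row; may be larger than width * 4 */
};

/* Store one 0x00RRGGBB pixel at (x, y). */
static void fb_put_pixel(struct fb_info *fb, uint32_t x, uint32_t y,
                         uint32_t rgb)
{
    assert(x < fb->width && y < fb->height);
    /* Rows are `pitch` bytes apart, so step in bytes before indexing pixels. */
    uint32_t *row = (uint32_t *)(fb->base + (size_t)y * fb->pitch);
    row[x] = rgb;
}

/* Fill a rectangle by plotting each pixel in turn (simple, and slow). */
static void fb_fill_rect(struct fb_info *fb, uint32_t x0, uint32_t y0,
                         uint32_t w, uint32_t h, uint32_t rgb)
{
    for (uint32_t y = y0; y < y0 + h; y++)
        for (uint32_t x = x0; x < x0 + w; x++)
            fb_put_pixel(fb, x, y, rgb);
}
```

Using `pitch` rather than `width * 4` for the row stride matters because firmware often pads rows; getting that wrong is the classic cause of a sheared image.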

03:10 <dasabhi> so i guess qemu doesnt have anything that emulates the make hardware do the math part

03:10 <heat> klange, which one?

03:10 <klange> managarm

03:10 <heat> hmm

03:11 <heat> dasabhi, some qemu GPUs take a stab at it but they're usually pretty bad at it

03:11 <heat> virtio-gpu is the best stab you have

03:11 <heat> klange, porting DRM is possible. making a vulkan-only driver is easier

03:12 papaya has joined #osdev

03:12 <heat> mesa's Zink can do OpenGL on top of Vulkan, so you can still have OpenGL while only writing vulkan code, which is much easier

03:12 <klange> managarm implemented the DRM APIs; and I was mistaken, I thought they had more than one actual hardware driver - could have sworn they were working on an AMD one, but I only see the Intel stuff.

03:13 <heat> that's the BSD approach too

03:13 <heat> it's not trivial to write a graphics stack :)

03:13 <dasabhi> yeah not nearly as simple as networking

03:14 <dasabhi> where its, packets go brrr

03:14 <dasabhi> make a giant buffer with packet headers feed to NIC

03:14 <heat> networking also isn't easy

03:14 <heat> a good networking stack is _complex_

03:15 <dasabhi> yeah i am just having a laugh

03:15 <dasabhi> i just need something to learn off

03:15 <dasabhi> thankfully the e1000 exists

03:15 <dasabhi> so about this managram OS

03:15 <dasabhi> they have the GLES gears rendering

03:15 <heat> the rtl8139 is probably the most braindead nic you'll find

03:15 <dasabhi> did they just do that by dumping to frame buffers at the end of the day?

03:16 <heat> dasabhi, since they implemented the DRM apis I'm assuming they use mesa

03:16 <klange> They implement Linux's DRM interfaces, alongside lots of other Linux APIs, and ported Wayland.

03:16 <heat> which is the hard part that does the heavylifting in userspace

03:16 <heat> shader compilation, vulkan/opengl implementation, it's all done here

03:17 <heat> kernel drivers only drive the display, power management and submit commands

03:17 <dasabhi> so at the end of the day, are they talking to a gpu?

03:17 <heat> (and AFAIK commands usually are just buffers of opcodes the GPU executes)

03:17 <dasabhi> i dont know much about mesa

03:17 <heat> who's talking to a GPU?

03:18 <dasabhi> managram

03:18 <heat> managarm? probably, since they have a driver for it

03:18 <clever> heat: in some cases, the kernel driver has to perform relocation type patching of those opcodes, to populate physical addresses
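
The "relocation type patching" clever mentions might look roughly like the C sketch below. The record format is entirely made up for illustration: `struct reloc`, `struct bo` and `apply_relocs` are hypothetical and do not correspond to any particular driver's ABI. The idea is that userland writes buffer-relative offsets into the command stream, and the kernel rewrites them to bus addresses after checking them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical relocation record supplied by userland with a command buffer:
 * "the word at cmd[word_index] holds an offset into buffer `handle`". */
struct reloc {
    uint32_t word_index;
    uint32_t handle;
};

/* Hypothetical kernel-side table entry for one allocated buffer. */
struct bo {
    uint32_t bus_addr;
    uint32_t size;
};

/* Patch every relocated word from "offset within buffer" to "bus address".
 * Returns 0 on success, or -1 if any record is out of range, in which case
 * cmd is left unmodified (a real driver would reject the submission). */
static int apply_relocs(uint32_t *cmd, size_t cmd_words,
                        const struct reloc *relocs, size_t n_relocs,
                        const struct bo *bos, size_t n_bos)
{
    assert(cmd != NULL);
    /* Validate everything first so a bad record leaves cmd untouched. */
    for (size_t i = 0; i < n_relocs; i++) {
        if (relocs[i].word_index >= cmd_words || relocs[i].handle >= n_bos)
            return -1;
        if (cmd[relocs[i].word_index] >= bos[relocs[i].handle].size)
            return -1;
    }
    for (size_t i = 0; i < n_relocs; i++)
        cmd[relocs[i].word_index] += bos[relocs[i].handle].bus_addr;
    return 0;
}
```

The bounds check against the buffer size is the important part: without it, userland could aim the GPU at arbitrary memory simply by supplying a large offset.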

03:18 <dasabhi> Generic VBE graphics, Intel G45, virtio GPU, Bochs VBE interface, VMWare SVGA

03:18 <dasabhi> list of hardwares they support

03:18 <dasabhi> virtio gpu, probably my best bet at graphics

03:19 <dasabhi> intel g45 and vmware svga interesting

03:19 <geist> your best bet at graphics is a simple framebuffer

03:19 <geist> that's easy

03:19 <heat> the G45 is a 2009 chipset ;) it's old as ballz

03:19 <dasabhi> oh intel g45 thats integrated graphics right

03:19 <heat> yup

03:20 <dasabhi> wait doesnt intel drop docs on integrated graphics?

03:20 <heat> yes

03:20 <dasabhi> there entire gpu driver is open source

03:20 <heat> yes

03:20 <geist> keep in mind you're so far away from any of those docs making any sense

03:20 <dasabhi> yeah i understand

03:20 <geist> that's okay, just good to know where you are in the grand scheme of things

03:20 <dasabhi> LONG LONG away

03:20 <heat> right

03:20 <geist> i mean i dont even want to read those myself personally

03:20 <heat> i've looked at them for a good bit and they still don't make sense

03:20 <geist> yah

03:21 <dasabhi> exactly just dropping theory questions really, but this makes much more sense now

03:21 <clever> RPF dropped the 3d docs over a decade ago, only upon re-reading them on&off for years, have parts begun to make sense to me

03:21 <geist> but if you let the bios or uefi set up your framebuffer you can get by for quite a while without really needing to program the card directly

03:21 <geist> but you dont necessarily get the option of changing resolutions or detecting different monitors/etc

03:21 <heat> there are projects out there that do OpenGL/Vulkan in software to a framebuffer

03:21 <heat> Mesa has llvmpipe, Google has SwiftShader

03:22 <heat> you'll always need the LLVM for those though, not something you'll be able to do for xv6 AFAIK

03:23 <dasabhi> yeah graphics is out of the scope for now

03:24 <dasabhi> i was under the impression that it may not be too hard to run a simple game like doom

03:24 <dasabhi> when i say too hard, i mean less than 3 years

03:24 <clever> older dos era games like duke nukem 3d, were entirely software rendered

03:24 <clever> you just give it a framebuffer, and compiled c does the rest

03:24 <heat> xv6 isn't supposed to be a fully fledged OS

03:25 <dasabhi> ahh they never needed hw acceleration

03:25 <heat> it's naively simple

03:25 <klange> Even Quake had a software rasterizer, a very solid one.

03:25 <dasabhi> yes i am supposed to flesh out xv6, was just thinking what i could do on top of xv6

03:26 <heat> run normal programs first

03:26 <heat> improve the kernel part (it definitely needs some improvement)

03:26 <dasabhi> gonna try with generic pthreads first yeah implement that system call

03:27 <dasabhi> well maybe not first

03:27 <dasabhi> but yeah, am miles away

03:27 <dasabhi> thank you guys for the advice!

03:27 <klange> quake is great https://klange.dev/s/Screenshot%20from%202022-05-28%2012-27-14.png

03:27 <geist> yay sure!

03:27 <geist> glad we could help

03:28 <dasabhi> i cant believe all of that is just dumping to a framebuffer and doing math on your own

03:28 <heat> quake uses OpenGL

03:28 <dasabhi> game devs back then must have been math phds

03:28 <heat> klange is using an older version of OpenGL which did software rendering without LLVM

03:28 <klange> no

03:28 <heat> s/OpenGL/Mesa/g

03:28 <heat> no?

03:29 <klange> quake has a software rasterizer, the OpenGL backend was optional

03:29 <heat> oh TIL

03:29 <geist> it was during that period of time when it wasn't implied that you had a 3d card

03:29 <geist> and/or it was using things like GLide

03:29 <klange> The OpenGL backend adds better dynamic lighting, among other things, but the software rasterizer is still excellent on its own and quite fast.

03:30 <dasabhi> oh btw, what system calls does linux implement for graphics?

03:30 <dasabhi> opengl is just a shared library in user space

03:30 <dasabhi> what does it actually call to go to the kernel?

03:30 <dasabhi> ioctl?

03:30 <heat> ioctl with a bunch of IOCTLs

03:31 <geist> yah in general

03:31 <heat> and the IOCTLs are unique to each driver

03:31 <dasabhi> and then ioctl figures out which driver to call yeah?

03:31 <heat> there's no DRM_SUBMIT_COMMAND

03:31 <heat> no

03:31 <heat> the driver is already known because you opened the device

03:31 <heat> that's just how UNIX works

03:32 <dasabhi> right so opengl will call the nvidia /dev file?

03:32 <dasabhi> through ioctl

03:32 <dasabhi> ?

03:32 <heat> that's not how ioctl works?

03:32 <clever> heat: and if you issue the ioctl for one driver, against a different driver, you get a null pointer dereference in kernel space! https://gitlab.freedesktop.org/drm/amd/-/issues/2018

03:32 <dasabhi> yeah i am confusing talking to drivers through open()

03:32 <bslsk05> gitlab.freedesktop.org: amdgpu_cs_ioctl kernel null pointer when receiving v3d ioctl (#2018) · Issues · drm / amd · GitLab

03:32 <heat> it opens the nvidia DRM node, ioctls(fd, NVIDIA_IOCTL_SUBMIT_COMMAND)

03:33 <clever> in my case DRM_V3D_GET_PARAM and DRM_AMDGPU_CS happen to map to the same ioctl id

03:33 <papaya> \quit

03:33 papaya has quit [Quit: leaving]

03:33 <geist> you blew papayas mind

03:33 <clever> DRM_V3D_GET_PARAM is used to query the hardware ident registers, so userland knows what variant of v3d its dealing with

03:34 <clever> DRM_AMDGPU_CS is used to query something else, and having a count of zero causes it to not init some internal state and then fault out
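
The collision clever ran into can be illustrated with a small C sketch. The bit layout is the generic Linux ioctl encoding (nr in bits 0-7, type in bits 8-15, size in bits 16-29, direction in bits 30-31), re-declared with an `X_` prefix so the sketch builds without kernel headers. The DRM constants (base letter 'd', driver commands starting at 0x40, both commands being number 0x04) are assumptions recalled from the uapi headers and should be checked against `v3d_drm.h` and `amdgpu_drm.h`. The payload sizes used in any call are placeholders.

```c
#include <assert.h>
#include <stdint.h>

/* Generic Linux ioctl number layout, re-declared locally for portability. */
#define X_IOC_NRSHIFT   0
#define X_IOC_TYPESHIFT 8
#define X_IOC_SIZESHIFT 16
#define X_IOC_DIRSHIFT  30
#define X_IOC_RW        3u   /* read | write */

/* Assumed values, from memory of the DRM uapi headers. */
#define X_DRM_IOCTL_BASE    'd'
#define X_DRM_COMMAND_BASE  0x40u
#define X_DRM_V3D_GET_PARAM 0x04u
#define X_DRM_AMDGPU_CS     0x04u

/* Build a read/write ioctl number from its three fields. */
static uint32_t x_iowr(uint32_t type, uint32_t nr, uint32_t size)
{
    return (X_IOC_RW << X_IOC_DIRSHIFT) | (size << X_IOC_SIZESHIFT) |
           (type << X_IOC_TYPESHIFT) | (nr << X_IOC_NRSHIFT);
}

/* Extract just the command number field. */
static uint32_t x_ioc_nr(uint32_t cmd)
{
    return cmd & 0xffu;
}

/* True if two ioctl numbers would select the same slot in a table that is
 * indexed by command number only, ignoring the size field. */
static int same_driver_slot(uint32_t cmd_a, uint32_t cmd_b)
{
    return x_ioc_nr(cmd_a) == x_ioc_nr(cmd_b);
}
```

Under these assumptions the two full ioctl numbers differ only in their size field, so a dispatcher that looks at the command number alone would hand a v3d request to the amdgpu handler whenever the file descriptor belongs to amdgpu. That is consistent with heat's point that the driver is chosen when the device is opened, not by the ioctl.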

03:35 <heat> syzkaller should fuzz amdgpu I guess

03:35 <dasabhi> oh btw, you guys know by now that, nvidia dropped an open source graphics card

03:35 <heat> they have fuzzing stuff for the i915

03:35 <dasabhi> i noticed a bunch of people pushing pull requests with advanced features

03:35 <heat> dasabhi, it's an open source kernel driver

03:35 <heat> not an open source graphics card, and not an open source driver

03:35 <dasabhi> kernel driver sorry***

03:35 <heat> it's a bunch of people making pull requests fixing typos

03:36 <dasabhi> LOLOL

03:36 <clever> i think i saw a PR deleting everything and claiming its going back to being closed source

03:36 <Mutabah> Well... typos do kinda matter

03:36 <klange> it's mostly typos in comments last I checked

03:36 <dasabhi> yes i was just wondering how the hell would people know what to push?

03:36 <klange> which is 100% just people trying to get their names in the contributor list

03:37 <dasabhi> since they def didnt drop a document

03:37 <heat> for instance

03:37 <heat> https://github.com/NVIDIA/open-gpu-kernel-modules/pull/254 <-- "easy fix" that fixes nothing

03:37 <bslsk05> github.com: Fix null pointer derefs in src/nvidia-modeset and src/common/nvlink by amada95 · Pull Request #254 · NVIDIA/open-gpu-kernel-modules · GitHub

03:38 <dasabhi> i know amd has an open source driver

03:38 <dasabhi> and a mailing list for it, not sure if they dropped a doc for it

03:41 <heat> https://github.com/NVIDIA/open-gpu-kernel-modules/pull/178/commits/006ef239c85ba880c1dfac415293a46d574b6c80 <-- insightful pull requests

03:41 <bslsk05> github.com: Proper capital letters and full stops. by Turbonator0 · Pull Request #178 · NVIDIA/open-gpu-kernel-modules · GitHub

03:41 <dasabhi> https://gpuopen.com/learn/documentation/

03:41 <bslsk05> gpuopen.com: Docs - GPUOpen

03:41 <dasabhi> might just get an amd card

03:41 <dasabhi> bslsk05: just a second late ahaha

03:41 <clever> dasabhi: "how the hell would people", guess, test it on real hardware, and if it works and is "better", push!

03:42 <heat> hey bslsk05 is always late

03:42 <heat> dumbass

03:42 <klange> (bslsk05 is a bot posting link titles)

03:42 <klange> (she does other things, too)

03:42 <heat> it's a she?

03:43 <clever> heat: in the same way a ship is a she?

03:43 <heat> sorry bslsk05

03:43 <dasabhi> just another hypothetical here

03:43 <heat> dasabhi, getting an AMD card to write a driver... yeah good luck

03:44 <dasabhi> shit man, how am i supposed to get hired as a new grad :"(

03:44 <dasabhi> cant get experience without work

03:44 <dasabhi> cant get work without experience

03:44 <dasabhi> lmfao

03:44 <clever> dasabhi: ive seen job position forms before, asking for 10 years experience with a language that was only 4 years old

03:44 <klange> Internships, and big oof if you missed the chance on those.

03:44 <clever> the creator of the language wasnt even qualified for the job

03:45 <heat> klange, the big bois take interns of any age

03:45 <dasabhi> i actually did an internship at nvidia lol

03:45 <dasabhi> but didnt get drivers :(

03:45 <heat> then you're set

03:45 <klange> still hoping one day i experience that, but I think I need to make a web framework first

03:46 <dasabhi> i dunno, i think one could learn the complexities of an amd card, seeing all these docs

03:46 <heat> <clever> RPF dropped the 3d docs over a decade ago, only upon re-reading them on&off for years, have parts begun to make sense to me

03:46 <dasabhi> might take a few years

03:46 <dasabhi> RPF?

03:47 <clever> raspberry pi foundation

03:47 <heat> the raspberry pi's GPU is wayyyyyyyyyyyyyyyyyyyyyyyyyy smaller and simpler than an amdgpu will ever be

03:47 <clever> heat: yeah, originally, i got completely lost on vertex shaders, and the most i could do was slightly adapt existing baremetal examples

03:47 <heat> and mind you, clever is fucking obsessed with the raspberry pi

03:47 <dasabhi> hmm rpi, sounds like broadcom gpu?

03:47 <clever> only years later, after re-reading it multiple times, do i have a grasp on how to start doing vertex shaders

03:47 <clever> dasabhi: exactly

03:48 <heat> it's literally stockholm syndrome

03:48 <dasabhi> hearing the name broadcom makes me want to throw up

03:48 <dasabhi> lmfao

03:48 <clever> dasabhi: ive been working at getting the rpi hardware usable without any closed-source blobs

03:49 <dasabhi> isnt that next to impossible?

03:49 <dasabhi> well

03:49 <clever> dasabhi: https://www.youtube.com/watch?v=BQyyVtmmVg8

03:49 <bslsk05> 'rpi open firmware boot' by michael bishop (00:02:23)

03:49 <dasabhi> you have to reverse engineer the blob first yeah?

03:49 <heat> yeah

03:49 <dasabhi> ahh interesting

03:49 <heat> who'd be crazy enough to do that

03:49 <clever> dasabhi: this video shows linux booting with both 2d and 3d animations, on the open firmware

03:50 * clever raises hand

03:51 <heat> clever, how do you do memory training?

03:51 <heat> call a blob?

03:51 <dasabhi> i guess its better to just land different team at these companies and work your way to drivers

03:51 <dasabhi> than spending 5 years reading docs

03:51 <dasabhi> lmfao

03:51 <heat> what did you do at nvidia?

03:51 <clever> heat: ddr2, its far simpler: https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/rpi-ddr2/sdram.c

03:51 <bslsk05> github.com: lk-overlay/sdram.c at master · librerpi/lk-overlay · GitHub

03:51 <dasabhi> ordinary linux kernel stress testing

03:52 <dasabhi> wrote python scripts all day :(

03:52 <clever> heat: although, i didnt write this driver, and i can barely tell what its doing

03:52 <clever> heat: the pi4 and its ddr4 controller is more complex, and i have yet to RE the blob well enough to make a functional open source driver

03:52 <heat> you probably won't be able to

03:53 <heat> i've heard the modern memory training code is super complex

03:53 <clever> luckily, RPF recently changed the blobs, and removed the bootloader from the ram-init blob

03:53 <clever> so the blob is now only ram-init and nothing else

03:53 <clever> so its more acceptable than before

03:53 <heat> you might want to take the L for now and just use the blob

03:53 <clever> yep

03:54 rustyy has quit [Read error: Connection reset by peer]

03:54 <clever> originally, there was a single bootcode.bin blob, that did both ram-init, and loading of start4.elf from sd/usb/tftp/nvme

03:54 <clever> and it uses some other memsys blobs for ram-init

03:54 <heat> intel people said they literally have several megs of debug logs in memory training, they fuckin print histograms

03:55 <clever> i have found such debug messages in the memsys files

03:55 <clever> ELLI LAGDAOLOTS/E ERPECXNOIT:CP PE

03:55 xenos1984 has quit [Read error: Connection reset by peer]

03:55 <clever> also, the byte order of the memsys files is backwards, every 4 byte chunk has to be byte-swapped

03:56 <clever> my theory, is that there is a FIFO with a 32bit write port, for loading the memsys files into the ddr4 controller

03:56 <mxshift> meh, I know someone who reverse engineered Intel's memory training process for a few ddr4 generations

03:56 <clever> and with the host cpu being LE, the 4 bytes in that 32bit write port, must exist backwards in ram, for the copy loop to be simple

03:57 <mxshift> err ddr3

03:57 <mxshift> can't type today

03:57 <heat> "In the PC ecosystem a single chipset family can power thousands of unique designs. So the DRAM memory needs to be external, support lots of different chipset packages(signal integrity...), support the lowest cost through the highest cost DRAM and thousands of different board layouts. So programing DRAM takes a masters degree in antenna design. I’ve seen MRC (Memory Reference Code) with over a MiB of DEBUG prints in it, and it literally is

03:57 <heat> printing histograms of what it is tuning. So all this code has to run before the system has any DRAM, thus it is running using the cache as RAM. I’ve not looked at the x86 architecture specs form the vendors in a while, but back in the day they did not support page tables in ROM or pinned cached. Now it might work, but if it breaks your CPU vendor blames you so you don’t code PEI in X64…."

03:58 <clever> heat: ah found it, this is the byte-swapped version of the above message: https://gist.github.com/cleverca22/179454a343d9162b4c495852a1be1467

03:58 <bslsk05> gist.github.com: gist:179454a343d9162b4c495852a1be1467 · GitHub

03:58 <clever> ILLEGAL LOAD/STORE EXCEPTION PC:
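
The byte-swapping clever describes is easy to reproduce. A minimal C sketch, with `swap32_chunks` as a hypothetical helper name, that reverses each complete 4-byte group in place:

```c
#include <assert.h>
#include <stddef.h>

/* Reverse the bytes of every complete 4-byte group in buf, in place.
 * A trailing partial group (len % 4 bytes) is left untouched. */
static void swap32_chunks(unsigned char *buf, size_t len)
{
    assert(buf != NULL);
    for (size_t i = 0; i + 4 <= len; i += 4) {
        unsigned char t;
        t = buf[i];     buf[i]     = buf[i + 3]; buf[i + 3] = t;
        t = buf[i + 1]; buf[i + 1] = buf[i + 2]; buf[i + 2] = t;
    }
}
```

Run over the first 32 bytes of the scrambled string quoted earlier in the log, this yields the readable "ILLEGAL LOAD/STORE EXCEPTION PC:" text. Applying it a second time restores the original bytes.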

03:59 <mxshift> I've been looking at Zen3 ddr4 training logs to sus out margin in our server motherboard's design

03:59 rustyy has joined #osdev

04:00 <clever> heat: the recent change in the rpi firmware, is a double-edged sword, they added the ability for the bootloader firmware (held in internal spi flash) to download the next stage over https, over the public internet

04:00 <heat> mxshift, how do you get those?

04:01 <clever> and i can see how thats a major red flag for the paranoid

04:01 marshmallow has quit [Ping timeout: 256 seconds]

04:01 <clever> but the benefit there, is that https support was too big to fit within the cache-as-ram based binary

04:01 <clever> so they had to separate the bootloader from the ram-init

04:01 <heat> how is the firmware in cache-as-ram?

04:02 <mxshift> all zen parts do memory training via the PSP before the x86 cores are enabled. There are APCB tokens you can set to run MBIST which dumps out a bunch of timing/vref sweep data to show data eyes

04:02 <clever> heat: the .bin file gets loaded into a 128kb L2 cache, and executed from there, then normal cache-as-ram rules follow

04:02 <clever> dont cause a cache miss, dont cause a cache eviction

04:02 <heat> that's cray-cray

04:03 <clever> if you configure your linker script to stay within a 128kb block starting at 0, then your fine

04:03 <clever> the bootrom pre-fills that region with nulls

04:03 <heat> PC firmware just translates the firmware area up in the 32-bit space into SPI accesses

04:03 <clever> so the cache will have a hit on anything in that range

04:03 <clever> the rpi does have a boot rom mapped at 0x6000_0000

04:03 <clever> which is directly in the soc itself

04:03 <heat> also intel's cache-as-ram isn't as crazy as that

04:04 <clever> the boot rom is responsible for priming the cache, by using wide stores

04:04 <heat> you give it an address and it maps it as cache-as-ram for you

04:04 <clever> so the cache doesnt try to back-fill the half of a line you didnt write

04:04 <clever> the bootrom is also responsible for loading the .bin file into the cache, and passing off control

04:05 <clever> yeah, i suspect this cache-as-ram is much more of a hack

04:05 <heat> (also, something I found out about the other day: some server platforms use SRAM instead of cache-as-ram

04:05 <heat> )

04:05 <clever> it looks less like a proper mode, and more like just abusing the cache fill logic, so it never tries to contact dram

04:05 <clever> the rpi does also have some sram at 0x6001_0000

04:05 <clever> i'm not sure how big it is, and why the bootloader doesnt use it more

04:06 <clever> the bootrom runs from that sram, before the cache has been primed

04:06 <clever> and on some models, the bootrom leaves a copy of the boot signing keys in the sram!

04:06 <clever> oops :P

04:08 <clever> heat: the .bin stage also has a 0x200 byte hole at the start, the perfect size for a vector table, and the entry-point is always 0x8000_0200, with the whole .bin loaded at 0x8000_0000

04:08 <clever> that design holds true for the entire rpi line, both vc4 and bcm2711 era

04:09 gog has quit [Ping timeout: 246 seconds]

04:10 <clever> heat: and much like the PC platform, a flag can be set somewhere to make the rom/sram drop off the bus

04:13 xenos1984 has joined #osdev

04:14 <clever> https://github.com/librerpi/rpi-open-firmware/blob/master/docs/rom.txt#L11-L33

04:14 <bslsk05> github.com: rpi-open-firmware/rom.txt at master · librerpi/rpi-open-firmware · GitHub

04:15 <clever> heat: for the entire VC4 line (pi0 to pi3), the bootrom supports booting from 8 sources, 3 of them are just SD (different controllers, 4bit vs 8bit mode), nand flash, spi flash, usb, i2c-slave, and something called mphi

04:15 <clever> for the pi0/pi1/pi2, usb mode is in device only, like DFU on other boards

04:15 <clever> pi3/pi02 adds usb-host abilities, where it can drive MSD sticks, or tftp over the onboard USB NIC

04:16 <clever> in all cases, it loads a bootcode.bin into the cache-as-ram, and executes it

04:16 <clever> the pi4 scales that back, to just 3 sources, SD, SPI, and usb-device

04:16 <clever> and the pi4 renamed it from bootcode.bin to recovery.bin

04:21 <heat> yes its defo stockholm syndrome

04:21 <clever> lol

04:21 <clever> my claim is that its to free the other slaves :P

04:21 <heat> you enjoy that cursed thing too much

04:21 <clever> think of how many existing users there are, that i could set free!

04:22 <heat> that might've been your original motivation but you definitely enjoy the thing now

04:22 <clever> lol

04:22 <clever> the original reason i picked this code up, was because the pi4 had onboard spi flash

04:22 <clever> so i could have it boot from that, and do something like standard uefi, without any blobs on the SD card

04:22 <clever> but its ddr4 got in the way, and i wound up fixing older models first

04:23 <clever> the pi4 also had other roadblocks, the .bin file must have an hmac-sha1 signature appended to the end of the binary

04:23 <heat> ok here's a cute fact

04:23 <clever> and if its not signed correctly, the rom acts like it doesnt even exist

04:23 <heat> the rpi UEFI builds have a non-osi package

04:23 <heat> the freakin logo :)

04:23 <clever> heh

04:24 <clever> those builds also rely on the closed firmware to even start

04:24 <heat> yeah

04:24 <clever> https://github.com/pftf/RPi3 it may be simple for me to port this to my open firmware

04:24 <bslsk05> pftf/RPi3 - Raspberry Pi 3 UEFI Firmware Images (22 forks/197 stargazers/NOASSERTION)

04:24 <heat> see, Intel/AMD were smart in adding FSP/AGESA

04:24 <clever> i need to try it at some point

04:25 <heat> clever, that's still EDK2 UEFI

04:25 <clever> whats the difference?

04:25 <heat> https://github.com/tianocore/edk2-platforms/tree/master/Platform/RaspberryPi/RPi3

04:25 <bslsk05> github.com: edk2-platforms/Platform/RaspberryPi/RPi3 at master · tianocore/edk2-platforms · GitHub

04:26 <heat> the RPi4 ones are for rpi4-likes, and the rpi3 ones are for rpi3-likes

04:26 <clever> yep

04:26 <heat> if you go to the .fdf files you can see what things they include

04:26 <clever> pi3 is the only one of that pair, where i can boot to arm currently

04:27 <clever> i dont have the pi4 booting to custom arm code yet

04:27 <heat> how large is the SPI flash?

04:27 <clever> 512kb

04:27 <heat> ok thats small

04:27 <clever> and the bootrom enforces that its either 128kb or 512kb, and wont boot if its anything else

04:27 <heat> no way you could fit EDK2 in there

04:27 <clever> yeah, i would need a secondary spi flash

04:28 <clever> let me double-check something

04:29 <heat> the current RPi3 RPI_EFI.fd is 2MB long

04:29 <heat> and that's with compression

04:29 <clever> the official pi4's SPI firmware is also using compression

04:30 <clever> ok, found the code, in the bootcode.bin, not the rom

04:30 <clever> let me double-check against the rom

04:31 <clever> ah right, the rom is uber-dumb

04:32 <clever> heat: the SPI boot support in the rom is very dumb, it just writes a 3 (the spi read command), and then an infinite stream of 0's, and then checks (with many offsets) for the magic 32bit number

04:32 <clever> that variable offset, deals with variable address size, for many spi flash chips

04:33 <clever> and then the extra 0's, are just dummy ones, so spi can keep transferring

04:33 <clever> once it finds the magic#, it expects a 32bit size, and then the raw bootcode.bin
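
The scan clever describes (look for a magic word at several byte offsets, then read a 32-bit size, then the image) could be sketched in C as below. This is an illustration only: the real magic value is not given in the log, so `BOOT_MAGIC` is made up, as are `MAX_SKEW`, the little-endian field order, and the function names. The varying offset models flash chips that consume a different number of the leading zero bytes as their read address.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define BOOT_MAGIC 0x544F4F42u  /* made-up value (bytes "BOOT"), not the real magic */
#define MAX_SKEW   8            /* how many byte offsets to try; also made up */

/* Read a 32-bit little-endian value (endianness is an assumption here). */
static uint32_t rd32le(const uint8_t *p)
{
    return (uint32_t)p[0] | (uint32_t)p[1] << 8 |
           (uint32_t)p[2] << 16 | (uint32_t)p[3] << 24;
}

/* Scan the start of the received stream for the magic word. On success,
 * store the payload's offset and length and return 0; otherwise return -1. */
static int find_boot_image(const uint8_t *rx, size_t rx_len,
                           size_t *payload_off, uint32_t *payload_len)
{
    assert(rx && payload_off && payload_len);
    for (size_t off = 0; off <= MAX_SKEW && off + 8 <= rx_len; off++) {
        if (rd32le(rx + off) != BOOT_MAGIC)
            continue;
        uint32_t len = rd32le(rx + off + 4);
        if (len > rx_len - (off + 8))
            return -1;          /* header claims more data than we have */
        *payload_off = off + 8;
        *payload_len = len;
        return 0;
    }
    return -1;
}
```

A ROM working this way never needs to know the flash chip's address width, which fits clever's description of it as "very dumb" but tolerant of many SPI parts.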

04:33 <clever> bootcode.bin itself, is what enforces the 128kb/512kb size restriction

04:34 <clever> so i would need to patch it if i wanted to use a non-standard flash size like 2mb or 4mb

04:35 <heat> https://edk2.groups.io/g/devel/topic/90435699 has a lot of interesting UEFI design stuff if you're into that

04:35 <bslsk05> edk2.groups.io: devel@edk2.groups.io | [edk2-discuss] GSoC Proposal

04:35 <clever> https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/arm/payload.S

04:35 <bslsk05> github.com: lk-overlay/payload.S at master · librerpi/lk-overlay · GitHub

04:36 <heat> also note that "The ARM guys built LittleKernel" makes me physically cringe :)

04:36 <clever> heh

04:36 <clever> at the most basic level, this code has 3 payloads (raw arm code), and it picks the right payload at runtime

04:36 <clever> if i just replace one of those payloads with a tianocore build, i'm 90% done

04:36 <geist> awww

04:36 <clever> then i just need tianocore to not crash due to the missing rpi firmware

04:37 <clever> it might need to be a ATF+tianocore binary

04:37 <heat> there was also a horrible "We don't understand the security properties of ELF" take

04:39 knusbaum has quit [Quit: ZNC 1.8.2 - https://znc.in]

04:40 <clever> heat: i did also come up with a multiboot inspired protocol to solve some things

04:40 <clever> basically, when the arm comes out of reset, it begins executing whatever it found at 0 in ram

04:40 <clever> how do you pass it arguments?

04:40 <heat> tbf lots of intel FW engineers have been at the company for 20+ (intel is damn good at retaining talent I guess) so they don't have to work on stuff other than UEFI

04:41 <clever> https://github.com/librerpi/lk-overlay/blob/master/app/inter-arch/inter-arch.c#L18-L22

04:41 <bslsk05> github.com: lk-overlay/inter-arch.c at master · librerpi/lk-overlay · GitHub

04:41 knusbaum has joined #osdev

04:41 <clever> the VPU firmware searches the arm blob for this struct (identified by magic# and alignment), and then populates the fields

04:44 <clever> i could then put the same struct into uboot and tianocore

04:44 <clever> and then they can detect when the open firmware has loaded them, and accept config params
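
The multiboot-inspired handoff clever outlines, where the loader searches the payload for a marked struct and fills it in, might be sketched in C like this. The field names, magic value and alignment are all invented for illustration and are not taken from the linked inter-arch.c; `patch_boot_args` is a hypothetical name too.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define ARGS_MAGIC 0x41524753u  /* made-up marker value */
#define ARGS_ALIGN 16u          /* made-up alignment the struct is placed on */

/* Hypothetical parameter block embedded in the ARM payload. */
struct boot_args {
    uint32_t magic;      /* ARGS_MAGIC, set at build time */
    uint32_t dtb_addr;   /* filled in by the loader */
    uint32_t ram_size;   /* filled in by the loader */
};

/* Loader side: find the block inside the payload image and fill it in.
 * Only offsets that are multiples of ARGS_ALIGN are considered.
 * Returns the offset of the block, or -1 if the payload has none. */
static long patch_boot_args(uint8_t *blob, size_t len,
                            uint32_t dtb_addr, uint32_t ram_size)
{
    assert(blob != NULL);
    for (size_t off = 0; off + sizeof(struct boot_args) <= len;
         off += ARGS_ALIGN) {
        struct boot_args a;
        memcpy(&a, blob + off, sizeof a);   /* copy out; blob may be unaligned */
        if (a.magic != ARGS_MAGIC)
            continue;
        a.dtb_addr = dtb_addr;
        a.ram_size = ram_size;
        memcpy(blob + off, &a, sizeof a);
        return (long)off;
    }
    return -1;
}
```

Requiring both the magic and the alignment cuts down on false matches from data that merely happens to contain the magic bytes. A payload that lacks the struct simply gets no parameters, so one binary could still boot from other firmware.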

04:45 <dasabhi> hey btw

04:45 <dasabhi> i am guessing the development of the i915 intel driver isnt very open is it

04:45 <heat> huh?

04:45 <dasabhi> just the userland stuff is

04:46 <heat> it's all open

04:46 <heat> not sure what you mean

04:46 <dasabhi> https://github.com/intel/intel-vaapi-driver

04:46 <bslsk05> intel/intel-vaapi-driver - VA-API user mode driver for Intel GEN Graphics family (118 forks/242 stargazers/NOASSERTION)

04:46 <dasabhi> these guys have a slack page

04:46 <dasabhi> where any one can hop in and ask questions about the userland vaapi driver

04:47 <dasabhi> but i doubt they work on the actual i915 kernel driver

04:47 <heat> that's for video acceleration

04:47 <geist> vaapi is a... right

04:48 <heat> intel-gfx had IRC channels

04:48 <dasabhi> right different things

04:48 <heat> you also have #dri-devel for general Linux DRI stuff

04:48 <dasabhi> they still do

04:48 <dasabhi> and i found the irc

04:49 <heat> great

04:49 <dasabhi> but what i mean is, i think they discussion in the irc for intel-gfx is kept to userspace stuff

04:49 <dasabhi> the discussion*

04:49 <heat> that's most of the work done for graphics drivers

04:50 <dasabhi> so whats the i915 driver doing in the end?

04:50 <dasabhi> just sending op commands

04:50 <dasabhi> and managing power?

04:50 <dasabhi> yes oversimplification

04:50 <clever> i think it would also manage buffer allocations

04:51 <heat> just sending commands, managing the device, driving the display and connecting the userspace to it

04:51 <heat> buffer allocation is done in DRM

04:51 <clever> when i was writing a v3d driver, it also had an api to allocate chunks of physically contiguous ram, and map them into the client

04:51 <clever> and the ram was tied to the file-handle, so closing the handle for any reason (even segfault), would free it

04:52 <heat> (plus the usual stuff attached to the device operation like its internal page tables, etc)

04:52 <clever> did DRM exist back in 2014?

04:52 <heat> yes

04:52 <clever> guess i wasnt the first with that idea then

04:52 <heat> https://en.wikipedia.org/wiki/Direct_Rendering_Infrastructure

04:52 <bslsk05> en.wikipedia.org: Direct Rendering Infrastructure - Wikipedia

04:53 <clever> the bcm2711 v3d does have paging tables, the 3d core lives in a 32bit space, but the soc supports 16gig of ram

04:53 <heat> https://www.systutorials.com/docs/linux/man/7-drm-gem/

04:53 <bslsk05> www.systutorials.com: drm-gem: DRM Memory Management - Linux Man Pages (7)

04:53 <clever> but its currently one set of paging tables for the entire 3d core, shared by all 3d clients

04:53 <clever> so a stray pointer can let you peek at 3d state for another client

04:54 <clever> the vc4 era v3d lacks an mmu, and the 3d core just has direct access to the entire physical memory, and possibly even mmio

04:55 <clever> heat: do you know if most (say i915) drivers copy the drm buffers to gpu memory, or if the gpu is instructed to read directly from the buffer?

04:55 <dasabhi> i am staying away from graphics for a long time

04:56 <dasabhi> but hey why didnt the managarm guys just....port i915 and talk to the gpu themselves?

04:57 <dasabhi> i am guessing its probably a bigass driver

04:58 <heat> clever, I don't think the i915 needs to copy buffers since it has no VRAM

04:58 <clever> ahh

04:58 <heat> as far as I remember you just map stuff to the GART (can't remember the proper name they give it)

04:58 <clever> but directly sharing a buffer between userland and gpu can have risks

04:59 <clever> in the case of the rpi, the control structures contain physical addresses for things

04:59 <clever> so if the userland can still modify the buffer, then it can point the 3d core anywhere in ram

04:59 <clever> and now permissions are effectively dead

04:59 <heat> the i915 lacked per process protection for like 15 years :P

04:59 <dasabhi> learned a lot asking dumb questions here today, thank you for answering my stupid questions

05:00 <dasabhi> i am gonna call it a night

05:00 <dasabhi> goodnight every one

05:00 <clever> heat: i think the bcm2711 v3d still lacks process level isolation, but at least prevents you from gaining root and escaping your user

05:00 <heat> https://bwidawsk.net/blog/2013/8/i915-command-submission-via-gem_exec_nop/

05:00 <bslsk05> bwidawsk.net: i915 command submission via gem_exec_nop — Dumbing Things Up

05:00 <heat> dasabhi, night

05:01 <clever> https://github.com/raspberrypi/linux/blob/rpi-5.10.y/drivers/gpu/drm/v3d/v3d_mmu.c

05:01 <bslsk05> github.com: linux/v3d_mmu.c at rpi-5.10.y · raspberrypi/linux · GitHub

05:01 <clever> re-reading this again...

05:01 <clever> > we load all BOs into the same 4GB address space.

05:01 <clever> and from your links

05:01 <clever> > BO: Buffer Object. GEM uses handles to identify the buffers used as graphics operands in order to avoid costly copies from userspace to kernel space. BO is the thing which is encapsulated by that handle.

05:02 <heat> clever, https://bwidawsk.net/blog/2014/6/the-global-gtt-part-1/ and the other parts are very good references

05:02 <bslsk05> bwidawsk.net: The Global GTT [Part 1] — Dumbing Things Up

05:03 <clever> v3d_mmu_insert_ptes(struct v3d_bo *bo) looks to be the core api

05:03 <clever> https://github.com/raspberrypi/linux/blob/rpi-5.10.y/drivers/gpu/drm/v3d/v3d_drv.h#L145-L154

05:03 <bslsk05> github.com: linux/v3d_drv.h at rpi-5.10.y · raspberrypi/linux · GitHub

05:03 dasabhi has quit [Quit: Lost terminal]

05:03 <clever> it just contains a DRM GEM, and some other stuff

05:04 <clever> https://github.com/raspberrypi/linux/blob/rpi-5.10.y/drivers/gpu/drm/v3d/v3d_bo.c#L87-L123

05:04 <bslsk05> github.com: linux/v3d_bo.c at rpi-5.10.y · raspberrypi/linux · GitHub

05:04 <clever> v3d_bo_create_finish() will map every BO to the v3d core

05:04 <clever> heat: that implies only BO's created via v3d can be used by v3d?

05:04 <heat> yes I believe so

05:05 <heat> if you want to share buffers between devices you need dma_buf as far as I'm aware

05:05 <clever> v3d_prime_import_sg_table() and v3d_bo_create() can call finish

05:06 <clever> v3d_create_bo_ioctl() is then a wrapper around v3d_bo_create

05:06 <clever> that import function looks fishy

05:07 <clever> .gem_prime_import_sg_table = v3d_prime_import_sg_table,

05:07 <clever> its part of the main drm api

05:07 <clever> https://www.kernel.org/doc/html/v4.10/gpu/drm-mm.html#c.drm_gem_cma_prime_import_sg_table

05:07 <bslsk05> www.kernel.org: DRM Memory Management — The Linux Kernel documentation

05:08 <clever> > produce a CMA GEM object from another driver’s scatter/gather table of pinned pages

05:08 <clever> > This function imports a scatter/gather table exported via DMA-BUF by another driver.

05:08 <clever> heat: yep bingo, this is how a dma_buf gets converted into a BO!

05:09 <heat> an important thing to note is that dma_buf is only available for GPLv2 licensed drivers

05:09 <heat> (meaning nvidia couldn't use it prior to open-sourcing their driver)

05:09 <clever> things like that make no sense to me

05:10 <clever> you're just drawing arbitrary lines in the sand, and blocking people out

05:11 <clever> https://www.youtube.com/watch?v=OwHGE7uhjco

05:11 <heat> https://blog.ffwll.ch/2018/08/no-2d-in-drm.html

05:11 <bslsk05> 'The Simpsons - No Homers Club' by kentuckyfriedpanda1 (00:00:45)

05:11 <bslsk05> blog.ffwll.ch: Why no 2D Userspace API in DRM?

05:11 <heat> probably interesting to you

05:11 <clever> just like in this clip

05:12 <clever> ah, i have been wondering that exact question!

05:12 <clever> the general answer ive gotten before, is that most GPU's arent very capable in the 2d realm

05:13 <clever> and linux itself then enforces a max of 16 planes in DRM

05:13 <clever> and with some GPU's only supporting 3, the main framebuffer, an xvideo overlay, and the cursor

05:13 <clever> so nobody bothers implementing code that uses more

05:13 <clever> which makes the rpi weird, having support for ~292 planes on-screen at once

05:14 <clever> thats enough that you could implement your entire compositing window manager in hardware, just tell DRM the dma_buf of every back-buffer, and the xy to display it at

05:14 <clever> you're done!

05:15 wxwisiasdf has joined #osdev

05:15 <wxwisiasdf> my os now can load programs

05:15 <wxwisiasdf> unfortunately it's all physical so i have to live with the caveats of not having paging

05:15 <clever> no-mmu linux works around that by only having vfork()

05:15 <wxwisiasdf> i could implement paging if i figure out the ibm docs from 1998

05:16 <clever> in the really old days, fork() would actually copy the userland memory (and used mmu to not collide), and that was a major performance cost

05:16 <wxwisiasdf> unfortunately "figuring out" means no gdb and only an emulator from 2003 to aid in the quest

05:16 <wxwisiasdf> hmm yes, fork()

05:16 <clever> vfork() was the hack to fix that, the parent would suspend until the child execve()'d, and the child would temporarily steal the parents ram

05:17 <heat> https://www.microsoft.com/en-us/research/uploads/prod/2019/04/fork-hotos19.pdf

05:17 <clever> but then fork() gained copy-on-write support, so it could share the parents ram cheaply, and vfork "died"

05:17 <wxwisiasdf> my os of course has multitasking but it's all on a single 16MiB address space :(

05:17 <clever> but without an mmu, vfork can return from the dead!

05:17 <wxwisiasdf> hmm yes, let's go, vfork

05:17 <heat> see, this pretty much shows that OS documentation is crap

05:17 <clever> suspend the parent, while the child runs

05:18 <clever> when the child execve()'s, map it to an unused region of ram, and resume the parent

05:18 <heat> i know a lot of stuff from random collections of links

05:18 <wxwisiasdf> why don't archive.org it?

05:18 <heat> idk

05:18 <wxwisiasdf> :p

05:19 <heat> just as an FYI

05:19 <heat> you can't touch anything with vfork

05:19 <wxwisiasdf> why

05:19 <heat> undefined

05:19 <wxwisiasdf> fair

05:19 <heat> meaning shit will blow up in your face

05:19 <clever> in the no-mmu case, the parent and child are literally using the exact same ram

05:19 <clever> anything you modify in the child, will remain modified in the parents view

05:19 <heat> and in the mmu case as well

05:19 <wxwisiasdf> clever: literally everything in my OS is in the same RAM

05:20 <wxwisiasdf> but the IBM mainframe has physical protection

05:20 <wxwisiasdf> so it's beautifully like, "hey i am in the same ram as the kernel... but with less priv :("

05:20 <clever> yeah, thats called an MPU in the cortex-m region

05:20 <clever> and a sandbox in the VPU

05:20 <wxwisiasdf> oh

05:20 <clever> more common when you lack an mmu

05:21 <heat> how do you use vfork() in a defined way inside C?

05:21 <clever> rather than mapping memory, you just limit what addresses can be touched

05:21 <wxwisiasdf> hmmm yes, time to spam skeys()

05:21 <wxwisiasdf> (storage protection keys)

05:21 <clever> heat: good question, any local vars in the stack frame could be trampled, kinda need to rely on barriers and the compiler not re-ordering things

05:22 <wxwisiasdf> is vfork like setjmp?

05:22 <clever> kinda

05:22 <clever> the function has to return twice

05:22 <clever> on a no-mmu kernel, the first time it returns is as the child

05:22 <clever> and once the child does execve, it returns a second time, as the parent

05:23 <heat> i guess you just use it and pray it doesn't do much stuff

05:23 <clever> and the execve args are used to setup a new proc at a new addr

05:23 <heat> like literally the only defined use of vfork is: "pid_t pid = vfork(); if (!pid) execve(...);"
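
A minimal sketch of the vfork() pattern heat describes, assuming a POSIX system. The helper name `run_and_wait` is hypothetical and not from the discussion; the `_exit(127)` after a failed exec is an addition here, since the child must not return or call exit() while it borrows the parent's memory.

```c
#define _DEFAULT_SOURCE   /* assumption: glibc, so vfork() is declared under strict -std= modes */
#include <assert.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

extern char **environ;

/* Hypothetical helper: run `path` with `argv` and return its exit status,
 * or -1 if the child could not be created or did not exit normally. */
int run_and_wait(const char *path, char *const argv[])
{
    assert(path && argv);

    pid_t pid = vfork();
    if (pid < 0)
        return -1;
    if (pid == 0) {
        /* Child: it is running on the parent's memory, so it does nothing
         * except exec, or _exit if the exec fails. */
        execve(path, argv, environ);
        _exit(127);
    }

    /* Parent: only resumes once the child has exec'd or exited. */
    int status;
    if (waitpid(pid, &status, 0) < 0)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Calling `run_and_wait("/bin/sh", argv)` with `argv` set to `{"sh", "-c", "exit 7", NULL}` should hand back the shell's exit status.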

05:23 <clever> and if everybody follows the rules (only touching what malloc gave it), they wont notice other procs sharing the addr space

05:24 <clever> yeah, due to the parent suspending, there isnt much else you can do with vfork

05:24 <clever> fork on the other hand, is a lot more flexible for abuse

05:24 <wxwisiasdf> ok can i fork on nommu

05:24 <heat> no

05:24 <wxwisiasdf> like copy paste the whole thing

05:24 <clever> factorio for example, can fork() out a clone of the entire game engine, and let the child serialize it to disk, while the parent continues to run
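
A small sketch of the fork() snapshot idea clever mentions, assuming a POSIX system with an MMU. It is not how any particular game does it; `snapshot_demo` and the single `state` integer are hypothetical stand-ins for a real engine state. The child reports the value it sees through a pipe while the parent keeps changing its own copy.

```c
#include <assert.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Returns the value of `state` as seen by the child, or -1 on error. */
int snapshot_demo(void)
{
    int state = 1;              /* stands in for the whole program state */
    int fds[2];
    if (pipe(fds) < 0)
        return -1;

    pid_t pid = fork();
    if (pid < 0) {
        close(fds[0]);
        close(fds[1]);
        return -1;
    }
    if (pid == 0) {
        /* Child: has a private copy of memory as it was at fork time. */
        close(fds[0]);
        ssize_t n = write(fds[1], &state, sizeof state);
        _exit(n == (ssize_t)sizeof state ? 0 : 1);
    }

    /* Parent: keeps running and mutating; the child's copy is unaffected. */
    state = 2;
    close(fds[1]);
    int seen = -1;
    ssize_t n = read(fds[0], &seen, sizeof seen);
    close(fds[0]);
    waitpid(pid, NULL, 0);
    assert(state == 2);
    return n == (ssize_t)sizeof seen ? seen : -1;
}
```

The child sees the value from before the parent's write, which is the property that makes "serialize in the child, keep playing in the parent" work.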

05:24 <heat> if you swap everything, yes

05:24 <wxwisiasdf> :/

05:25 netbsduser` has quit [Read error: Connection reset by peer]

05:25 <wxwisiasdf> so basically i have to do paging yes or yes

05:25 <geist> indeed, the entire mechanism of fork is generally assuming there's at least some sort of way to set the current cpu context to some other memory

05:25 <geist> paging or segmentation or whatnot

05:25 <wxwisiasdf> oh sorry, paging, IBM calls it "Dynamic Address Translation" :/

05:26 <geist> *or* you as heat says, literally swap everything. copy the process somewhere else, copy the new one in, run it, swap back

05:26 <wxwisiasdf> that could be a solution yeah

05:26 <geist> the minicomputers unix was designed to run on at least had some sort of address segment register to let you keep a few things in ram that thought they were 'at 0'

05:26 <geist> even if it wasn't full paging (that came along a few years later)

05:27 <wxwisiasdf> yeah IBM had like DAT

05:27 <geist> i dont know precisely what pdp7 had, but pdp11 had reasonable segmentation scheme

05:27 <heat> wxwisiasdf, that would be painfully slow

05:28 <geist> even something like pdp8 which eventually grew the ability to run multiple tasks (though generally didnt) had an address extension register that extended 12 bits of address space out to 8 x 12 bits. (ie a 3 bit 'what page am i on' register)

05:28 <geist> with that you could run basically 8 processes at once, each with 4K of ram

05:29 <wxwisiasdf> so unfortunate the emulator doesn't print page tables

05:29 <geist> what emulator and what arch?

05:29 <wxwisiasdf> hercules - s390(x)

05:30 <geist> ah

05:30 <wxwisiasdf> qemu has s390x but it doesn't like css devices as much

05:30 <wxwisiasdf> plus it's meant to run z/linux and not the "cool" things like mvs3.8

05:30 <heat> on that end it has feature parity with qemu-system-riscv64 :)

05:30 <heat> s/riscv/aarch/g

05:31 <wxwisiasdf> i used to do riscv64 osdev :p

05:31 <wxwisiasdf> it's amazing because you actually have documentation and stuff

05:31 <wxwisiasdf> and of course i can hook gdb to it

05:31 <heat> yes I meant aarch64

05:31 <wxwisiasdf> ah

05:38 <clever> > Gem-buffers cannot be created with a generic API. Each driver provides its own API to create gem-buffers.

05:38 <clever> heat: ah, so while GEM is a common support framework in the kernel, the api to actually create a gem object isnt exposed to userland

05:39 <clever> and every driver must create its own wrapper around it

05:39 <clever> makes sense, since some share host ram, while others exist in gpu ram

05:39 <clever> and different drivers will have different choices on syncing the object to gpu ram, when it exists

05:39 <clever> and different rules about where in ram the buffer must live, its page size, and stride

05:42 <heat> yes

05:42 <heat> that's the rule of thumb in DRM

05:42 <heat> no common API, but a common framework

05:43 <clever> vc4 v3d lacks an mmu, so all textures/control-lists must be contiguous

05:43 <clever> rpi hvs (display scanout) lacks an mmu, so all framebuffers must be contiguous, and in the lower 1gig of the physical space

05:44 <clever> combine those 2, and now the bcm2711 v3d has to deal with more complications

05:44 <clever> texture and control-list can exist anywhere in ram, thanks to the v3d mmu

05:44 <clever> but the final 2d output frame, must both be contiguous, and in the lower 1gig

05:45 <clever> ive seen a use-case flag in some of the drm docs, that i think covers this

05:45 <clever> intermediate textures, such as render to texture dont have to follow that rule, so it would be beneficial to allocate them from the larger pool

05:49 heat has quit [Ping timeout: 244 seconds]

05:49 <clever> heat: another thing i notice, is that the dumb-buffer api, expects a width*height bitmap image, so how are things like compressed textures and control-lists handled?

05:52 wxwisiasdf has quit [Quit: leaving]

06:05 ThinkT510 has quit [Quit: WeeChat 3.5]

06:07 crm is now known as orthoplex64

06:09 ThinkT510 has joined #osdev

06:18 Mikaku has quit [Quit: server maintenance ...]

06:31 Mikaku has joined #osdev

06:33 <Griwes> hmm. so I'm still trying to figure out how to square "I want the syscall code to not have to deal with interrupts" and "a syscall needs to issue tlb shootdowns and not deadlock", and just had what I am fairly certain is an insane idea - what if I made the memory unmapping syscall spin off the part that modifies the page tables into a temporary thread that steals the time of the thread invoking the syscall? this way I'd be able to have interrupts

06:33 <Griwes> enabled while that is happening (to be able to handle external tlb shootdown requests) without having to rework everything about task switching that I have right now

06:33 <Griwes> I'm absolutely certain there's very fundamental technical issues with this idea, but I guess we will see once I start mocking it up on

06:57 Likorn has joined #osdev

07:00 <geist> hmm, so i'm not entirely sure what you're trying to work around here

07:00 <geist> you want the syscall not to have to deal with interrupts. what precisely do you mean there?

07:03 <Griwes> I'd like to not have to ever need to handle an interrupt while inside a syscall handler. I have this perfect imagined system where the mechanism for context switches is just iret/sysret, but having to do ipis to do tlb shootdowns is really hurting this goal

07:03 <geist> so basically you want to leave interrupts disabled the entire time you're in a syscall?

07:04 <Griwes> yes

07:04 <geist> thus the kernel is non reentrant?

07:04 <geist> well, non-preemptible that is

07:04 <Griwes> the syscall handlers, yes

07:04 <geist> okay. so given that, are you planning on supporting SMP?

07:05 <Griwes> there's parts that'd run as their own threads in kernel space that would be preemptible

07:05 <Griwes> yes

07:05 <geist> so the issue is cross cpu tlb shootdown you have a deadlock to avoid?

07:05 <geist> since two cpus cross tlb shooting down can wait forever for the other to complete?

07:05 <Griwes> yes

07:05 <geist> (really you can extend that to any cross cpu synchronous IPI, but TLB shootdowns are generally the big ones)

07:06 <geist> so. here's what i'd suggest, and you really need it if you're preemptible or not: when a cpu is doing a cross cpu IPI, it also spins and looks for any incoming messages

07:06 <geist> and handles them

07:07 arch_angel is now known as arch-angel

07:07 <geist> if it's a synchronous IPI, then there's generally a per cpu mailbox of incoming events (or something thereabouts). so you put together a message for the other cpu(s), fire the IPI, and then spin waiting for either a complete on their message or an incoming message, which you then handle locally

07:07 <geist> that avoids a cross cpu deadlock

07:08 <geist> can be tricky business, but that lets you send IPIs from essentially any context, even with interrupts disabled. in your case interrupts are already disabled because you're in a syscall

07:08 <geist> note synchronous ipis are different than async ones (like, 'cpu go reschedule yourself')

07:08 <geist> synchronous ones you're sending a message to the other cpu (do a TLB sync, etc) and waiting for it to ack it

07:09 <Griwes> hmm, doing this with per-cpu inboxes like that is an interesting idea that I need to ponder on

07:09 <geist> doesn't necessarily have to be per cpu. you could have a global queue of pending IPIs that has a cpu bitmask on it or something, but the gist is the same

07:09 <Griwes> I have a mechanism for doing concurrent execution on all the cores, but it has slots that are per request and not anything that's per cpu

07:09 <geist> key is you have to wait for the IPI to complete inside a shared piece of code that generically handles ipi

07:10 <Griwes> but that's too limited for a thing as dynamic as this

07:10 <geist> mp_send_sync_request(tlb_sync); etc

07:10 <geist> and inside that it spins and handles incoming ones
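
A rough sketch of the "spin and service incoming requests while waiting" idea geist describes, using C11 atomics. This is not the zircon code: every name here (`ipi_post`, `ipi_wait`, `ipi_handle_incoming`, `ipi_count_hit`, `MAX_CPUS`) is hypothetical, the actual interrupt is left out, and the one-slot-per-sender inbox assumes each CPU has at most one synchronous request in flight per target.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

#define MAX_CPUS 4   /* hypothetical, kept small for the sketch */

struct ipi_request {
    void (*fn)(void *arg);   /* work to run on the target CPU */
    void *arg;
    atomic_int done;         /* set by the target once fn has run */
};

/* inbox[target][sender]: one pending request per sender, so senders never contend. */
static _Atomic(struct ipi_request *) inbox[MAX_CPUS][MAX_CPUS];

/* Run and acknowledge every request currently pending for CPU `self`. */
void ipi_handle_incoming(int self)
{
    for (int from = 0; from < MAX_CPUS; from++) {
        struct ipi_request *req =
            atomic_exchange(&inbox[self][from], (struct ipi_request *)NULL);
        if (req) {
            req->fn(req->arg);
            atomic_store(&req->done, 1);
        }
    }
}

/* Queue `req` for `target`. A real kernel would also fire the IPI here. */
void ipi_post(int self, int target, struct ipi_request *req)
{
    atomic_store(&req->done, 0);
    atomic_store(&inbox[target][self], req);
}

/* Wait for `req` to be acknowledged, draining our own inbox on every pass.
 * Servicing others while waiting is what breaks the "A waits for B while
 * B waits for A" cycle, even with interrupts disabled. */
void ipi_wait(int self, struct ipi_request *req)
{
    for (;;) {
        ipi_handle_incoming(self);
        if (atomic_load(&req->done))
            break;
    }
}

/* Example handler standing in for "flush this TLB entry". */
void ipi_count_hit(void *arg)
{
    ++*(int *)arg;
}
```

With two CPUs posting to each other at the same time, each one's `ipi_wait` loop ends up running the other's request, so neither spins forever. The request structs can live on the sender's stack because the sender does not return until `done` is set.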

07:10 <Griwes> yeah I think you gave me the right idea

07:11 <Griwes> I'll need to rework my higher level ipi api

07:11 <Griwes> because it clearly doesn't pass muster for anything non-trivial

07:11 <geist> https://fuchsia.googlesource.com/fuchsia/+/refs/heads/main/zircon/kernel/kernel/mp.cc#184 is basically what i was talking about

07:11 <bslsk05> fuchsia.googlesource.com: zircon/kernel/kernel/mp.cc - fuchsia - Git at Google

07:11 <geist> (i need to back port that to LK now that i have mentioned it)

07:12 <Griwes> (it's a bunch of slots that each has its own irq number, but that doesn't work for having a part of sending them do polling on incoming requests)

07:12 <kazinsal> some days I wonder if I should just bolt all my shit onto LK instead of writing a new kernel core

07:12 <kazinsal> but the allure of NIH keeps me from actually doing it, heh

07:13 <geist> kazinsal: there are serious gaps on the x86 side of things that need to be addressed

07:13 <geist> though most of them were addressed in zircon, so it's a matter of pulling some of those back and making sure they're appropriate for LK

07:13 <Griwes> hmm, unsure if you're doing that in zircon (I'm going to try to figure it out without looking), but since most of the use cases I can see for this are going to be synchronous, I could do the trick of building an intrusive linked list out of state objects on the stack of the caller

07:14 <geist> yep, that's precisely what zircon is doing

07:14 <Griwes> it's such a great technique that only becomes obvious once you see it

07:14 <geist> however, there's a caveat there: that means that you're effectively limiting the max number of cores you can send

07:14 <geist> there's a TODO item there for zircon to move the local state *off* the stack, because of that

07:15 <geist> ie, if you have 31 cores to send an IPI to, you have to make 31 copies of the message to put in 31 queues

07:15 <geist> *or* you have some sort of notion of a broadcast queue that all cpus look at, etc. can get complicated

07:15 <geist> the zircon code does the 31 copies thing

07:16 <Griwes> okay, thanks for as usual pointing me into a sane direction

07:17 <geist> cool! also you can use a sane arch like anything that's not x86 so you dont have to do this cross cpu shootdown

07:17 <geist> or get a Zen 3+

07:17 <geist> i hope intel picks that up eventually

07:18 <Griwes> for async things I'll just carve out an irq number (can't really think of any async thing other than "do your preemption thing", and I think I can spare an irq number or two) and for sync things I'll try to figure this out

07:18 <geist> yep, that's precisely what i do too

07:18 <Griwes> I mean I'm sure tlb shootdown isn't the only thing that'll end up using this mechanism

07:19 <geist> in my experience there's not a lot aside from some initial set up stuff (synchronizing MTRRs, etc) or shutdown (stop the other core RIGHT NOW)

07:19 <geist> most IPIs are fire and forget

07:19 <geist> which is good of course

07:20 <Griwes> yeah I have some init steps that use the slots thing, there it works fine because the whole thing isn't particularly dynamic

07:20 <Griwes> I also wish intel had done the sane thing from the start, but oh well

07:21 <geist> ugh looking at this code now someone came along at some point and 'optimized' it and seriously nerfed it

07:21 <Griwes> heh

07:21 <geist> the structure that is allocated on the stack is a) no longer a POD and thus needs a constructor run and b) is now always constructed SMP_MAX_CPUS even if only one is needed

07:21 <geist> that was a case where it was initially *not* a class or struct so that it would not require construction

07:21 <geist> but someone moved things around and made it fancy because we have to have fancy things

07:22 <Griwes> sounds like OOP brain worms

07:22 <geist> general usual 'make things safer' mentality. which is generally not bad, but should be carefully watched sometimes

07:23 <geist> that aside i think the compiler would garbage fill it anyway because we also have that feature on

07:23 <Griwes> my SMP_MAX_CPUS is currently happily set to 1024, I don't think allocating that many on the stack will work so well :'D

07:23 <geist> so need to deoptimize this a bit

07:23 <geist> no, exactly. you'll need another scheme

07:23 <geist> in zircon i think this is one of a handful of places where things are statically sized based on it. need to remove it some day, but its not a huge priority

07:24 <Griwes> I'll ponder on it while it's raining around my camping spot for the whole day tomorrow

07:24 <geist> yay fun

07:24 <Griwes> (you may notice a pattern of me being more active here when I'm out in the wilds lol)

07:40 psykose has quit [Remote host closed the connection]

07:41 psykose has joined #osdev

07:41 psykose has quit [Remote host closed the connection]

07:42 psykose has joined #osdev

07:47 <Griwes> Hmm, since a core can only be doing a single outgoing request like this, maybe the solution is to just bite the N*N bullet and store a linked list entry table per core per target core within the cpu core object array, and store the actual state that those entries duplicate by pointing to it on the stack

07:48 kingoffrance has joined #osdev

07:55 the_lanetly_052_ has joined #osdev

08:07 diamondbond has joined #osdev

08:08 Likorn has quit [Quit: WeeChat 3.4.1]

08:14 floss-jas has quit [Remote host closed the connection]

08:39 zaquest has quit [Remote host closed the connection]

08:40 zaquest has joined #osdev

09:03 GeDaMo has joined #osdev

09:05 the_lanetly_052 has joined #osdev

09:07 the_lanetly_052_ has quit [Ping timeout: 246 seconds]

09:07 <geist> Griwes: yeah that's probably a good idea

09:08 <geist> was just thinking about having on the local cpu's struct, N entries, so that each cpu can have up to N other ones out there

09:08 <geist> where N is the total number of cpus in the system

09:09 <geist> since any given cpu can only be sending one round of IPIs at a time

09:50 Jari--- has quit [Ping timeout: 260 seconds]

10:05 the_lanetly_052_ has joined #osdev

10:08 the_lanetly_052 has quit [Ping timeout: 246 seconds]

10:23 pretty_dumm_guy has joined #osdev

10:29 floss-jas has joined #osdev

10:59 Jari--- has joined #osdev

11:08 Likorn has joined #osdev

11:13 mahmutov has joined #osdev

11:24 jafarlihi has joined #osdev

11:28 <jafarlihi> I decided the only way I'm going to finish these compiler books is if I read a chapter of each per day

11:35 lanodan has joined #osdev

11:39 diamondbond has quit [Ping timeout: 258 seconds]

11:45 jafarlihi has quit [Ping timeout: 240 seconds]

11:55 arch-angel has quit [Ping timeout: 272 seconds]

12:06 arch_angel has joined #osdev

12:10 Jari--- has quit [Remote host closed the connection]

12:13 Jari-- has joined #osdev

13:04 gog has joined #osdev

13:17 diamondbond has joined #osdev

13:30 <ckie> afternoon, #osdev

13:50 gog has quit [Ping timeout: 258 seconds]

13:54 jafarlihi has joined #osdev

13:59 <mrvn> jafarlihi: why are you reading compiler books when you are designing a kernel?

14:00 <ckie> god the gdt packing is hurting my brain

14:00 <mrvn> geist: If the message is too big to have 31 copies why can't you have a shared_ptr?

14:01 jafarlihi has quit [Quit: WeeChat 3.5]

14:03 the_lanetly_052_ has quit [Ping timeout: 246 seconds]

14:04 <mrvn> ckie: that descriptor with struct { uint16_t pad; uint16_t high; uint32_t low; }?

14:04 <ckie> size matches, sure

14:04 <ckie> the one on pg 103 https://www.intel.com/content/dam/www/public/us/en/documents/manuals/64-ia-32-architectures-software-developer-vol-3a-part-1-manual.pdf

14:04 <mrvn> geist: If you have the message in the per cpu struct aren't you effectively sending a broadcast to other cores to look there?

14:05 <mrvn> ckie: It's what happens when you extend a 16bit struct by 32bit.

14:06 <ckie> mrvn: i'm dealing fine, just being annoyed at the structure

14:07 arch_angel has quit [Ping timeout: 258 seconds]

14:12 <ckie> oh wait, you don't have to set base and limit on long mode!
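
For reference, a sketch of the legacy 8-byte segment descriptor packing ckie is fighting with, written as one 64-bit value. The function name `gdt_entry` and the split into an `access` byte and a 4-bit `flags` nibble (G, D/B, L, AVL from high to low) are conventions assumed for this example, not something from the discussion.

```c
#include <assert.h>
#include <stdint.h>

/* Pack a code/data segment descriptor. `flags` is the high nibble of
 * byte 6: bit 3 = G, bit 2 = D/B, bit 1 = L, bit 0 = AVL. */
uint64_t gdt_entry(uint32_t base, uint32_t limit, uint8_t access, uint8_t flags)
{
    uint64_t d = 0;
    d |= (uint64_t)(limit & 0xFFFF);              /* limit 15:0   -> bits 15:0  */
    d |= (uint64_t)(base & 0xFFFFFF) << 16;       /* base 23:0    -> bits 39:16 */
    d |= (uint64_t)access << 40;                  /* access byte  -> bits 47:40 */
    d |= (uint64_t)((limit >> 16) & 0xF) << 48;   /* limit 19:16  -> bits 51:48 */
    d |= (uint64_t)(flags & 0xF) << 52;           /* flags nibble -> bits 55:52 */
    d |= (uint64_t)(base >> 24) << 56;            /* base 31:24   -> bits 63:56 */
    return d;
}
```

For a 64-bit code segment, `gdt_entry(0, 0, 0x9A, 0x2)` sets only the access byte and the L bit, which matches the observation that base and limit can be left at zero in long mode.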

14:12 <ckie> i am freee~

14:14 <zid> yea long mode is mostly like that

14:14 <zid> "This is hardcoded as 0 or will #GP"

14:14 <zid> Go look at the long mode TSS when you get a chance

14:14 <ckie> zid: i am trying to bootstrap as little as possible so i can start porting tcc and then writing my compiler above it

14:15 <zid> Well you'll need a TSS if you want IRQs

14:15 <j`ey> ckie: you want to use tcc to build your OS or want to run tcc on your OS?

14:15 <mrvn> I'm always a bit annoyed that in the docs they tell you all about this huge collection of bits and then at the end: In long mode a,b,c,d,e,f must be 0.

14:16 <ckie> j`ey: yes

14:16 <mrvn> Would be nice to have a switch "I'm in long mode" and all the cruft gets hidden.

14:16 <ckie> (i am writing just enough with my host compiler to make tcc run and compile the rest of the OS when it starts)

14:16 <zid> https://cdn.discordapp.com/attachments/417023075348119556/980112245155565598/unknown.png

14:16 <zid> Big scary 32bit TSS

14:17 <zid> https://cdn.discordapp.com/attachments/417023075348119556/980112405059239966/unknown.png 64bit TSS, of which only 'RSP0' is used in practice.

14:18 <ckie> what pdf reader is that?

14:18 <j`ey> ckie: quite a bit is needed for tcc to run still

14:18 <zid> sumatra, why?

14:18 <zid> too ugly or too pretty? :P

14:18 <zid> pdf's are supposed to look identical

14:18 <ckie> looked nice for some reason

14:19 <ckie> huh

14:19 <ckie> * legacyPackages.x86_64-linux.sumatra (1.0.34)

14:19 <ckie> Fast and exact comparison and clustering of sequences

14:19 <zid> I wonder what the word with the most results in portage is

14:19 <mrvn> Why is it always upper 32 bit / lower 32 bit? It's a 64bit arch, just say 64 bit.

14:20 <ckie> mrvn: humans will human

14:20 <mrvn> ckie: do you have a libc for tcc to use?

14:20 <ckie> mrvn: no, so i guess i'll be hitting it with many sticks until it doesn't need as much

14:21 <mrvn> ckie: you need malloc/free/open/read/write/close and a bunch of libm.

14:21 <j`ey> and you need processes and stuff.. oh.. are you embedding tcc?

14:22 <ckie> mrvn: malloc/free i am gonna get soon anyway, file io is going to get monkeypatched out, libm is ughh why would it need that much of libm

14:23 <mrvn> ckie: fabs, cos, sin, pow, log, ... whatever else gets used by the source you compile.

14:23 <sham1> Just remove maths

14:23 <mrvn> or use BSD's libm like everyone else

14:24 <ckie> mrvn: why would tcc need to care about that? i can define my own libm inside the stuff i wanna compile

14:24 <mrvn> Like crypto libm isn't something you want to write yourself.

14:24 <ckie> anyhoo

14:24 <ckie> that is for Future Me

14:24 <mrvn> ckie: it might not use any of it. It also might.

14:25 <ckie> right now, there is a stupid segment descriptor for me to figure out

14:25 <sham1> Just do sines and such with Taylor series and Newton's method, you'll be fine

14:25 <ckie> sham1: no trig yet, but i wrote a ln and various other logs

14:26 <sham1> Anyhow, why does the 64 bit TSS separate the lower and upper 32 bits? Well, it's not aligned at 8 byte boundary now is it‽

14:26 <ckie> also wow, it is *very* easy to start getting sucked into the chat instead of coding (you all seem to be decent humans)

14:26 <Jari--> Hey OSDEV guys, does a standard (64-bit) UEFI as a default support booting up a 32-bit protected mode or Real Mode OS in general?

14:27 <mrvn> sham1: because people, see above

14:27 <mrvn> Jari--: since when is 64bit uefi standard?

14:27 <zid> yea, it's unaligned

14:28 <zid> and saves you having to think about endian if they write it like this :P

14:28 <zid> you just do a shift and a write

14:28 <sham1> Although endianness shouldn't matter because you know the endianness because it's AMD64 and thus by definition little-endian

14:29 <mrvn> unaligned doesn't matter either since you know it's x86

14:29 <Jari--> I am wondering also whether I should keep a 32-bit compatibility mode kernel for 32-bit apps.

14:29 <zid> still annoying to have to split a qword into two dwords properly in the middle of trying to think about something else
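
One way to sketch the 64-bit TSS without any packing attribute is to follow the manual and store every 64-bit field as a low/high dword pair, so each member is naturally 4-byte aligned. The struct and helper names below (`tss64`, `tss_set_rsp0`, `tss_get_rsp0`) are made up for this example; the layout is the 104-byte one with RSP0 at byte offset 4.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

struct tss64 {
    uint32_t reserved0;
    uint32_t rsp[3][2];      /* RSP0..RSP2 as {low, high} dword pairs */
    uint32_t reserved1[2];
    uint32_t ist[7][2];      /* IST1..IST7 as {low, high} dword pairs */
    uint32_t reserved2[2];
    uint16_t reserved3;
    uint16_t iopb_offset;
};

/* The compiler adds no padding because nothing needs more than 4-byte alignment. */
static_assert(offsetof(struct tss64, rsp) == 4, "RSP0 starts at byte 4");
static_assert(sizeof(struct tss64) == 104, "64-bit TSS is 104 bytes");

/* Split the qword once, here, instead of at every call site. */
void tss_set_rsp0(struct tss64 *tss, uint64_t rsp0)
{
    tss->rsp[0][0] = (uint32_t)rsp0;           /* lower 32 bits */
    tss->rsp[0][1] = (uint32_t)(rsp0 >> 32);   /* upper 32 bits */
}

uint64_t tss_get_rsp0(const struct tss64 *tss)
{
    return ((uint64_t)tss->rsp[0][1] << 32) | tss->rsp[0][0];
}
```

Declaring the fields as `uint64_t` instead would make the compiler insert padding after `reserved0` unless the struct is packed, which is the trade-off being argued about here.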

14:29 <mrvn> Jari--: no, just support 32bit userland

14:30 <mrvn> Jari--: there is also x86_32 to support. You can easily do all 3 in one kernel.

14:31 <Jari--> makes sense

14:43 yewscion has joined #osdev

14:45 <mrvn> zid: What stops you from just defining it as uint64_t?

14:54 Jari-- has quit [Read error: No route to host]

14:57 heat has joined #osdev

15:00 <ckie> what's the struct { u8 just_one_bit : 1; } C syntax called?

15:01 <j`ey> bitfield

15:01 <zid> structure declaration

15:01 <zid> first member is a bitfield on a typedef

15:01 <mrvn> #include <stdbool.h>

15:02 <mrvn> ckie: it's called implementation defined behavior. You don't know if it's the LSB or MSB

15:02 <ckie> compiler implementation?

15:02 <mrvn> more ABI

15:02 <ckie> i thought it meant "this struct is packed, represents just one bit"

15:03 <zid> no

15:03 <mrvn> Nothing packed there. you can't pack a char.

15:03 <zid> packed isn't a part of C, it's a part of the extensions of some compilers, meaning not align members

15:03 <zid> if they would require padding bytes inbetween to stay aligned, don't bother

15:03 <mrvn> Note that the bitfield still takes up 1 char in memory. It just uses only 1 bit of it.

15:04 * ckie sets out to check that

15:04 <zid> It isn't needed, you can always implement it with an array of char instead, it's just much nicer syntax wise

15:04 <zid> for gcc you can either __attribute__((packed)) or do the.. #pragma pack with some pushes and pops

15:04 <ckie> annnd emacs hanged when i made a new file. nice

15:04 <mrvn> never use __attribute__((packed)). It's basically always a bad idea and the wrong thing.

15:05 <zid> hung

15:05 <zid> hanged is when you kill someone with a noose

15:05 <ckie> pssh

15:05 <ckie> go away grammar witch

15:05 <zid> It's my language you have to pay extra for me to let you ruin it

15:05 <mrvn> zid: you never felt like throwing a rope over the light fixture and tying it around emacs?

15:08 <zid> america still owes me 14 million dollars

15:13 <ckie> okay, how the hell is this 0

15:13 <ckie> https://godbolt.org/z/9TxzaE3c6

15:13 <bslsk05> godbolt.org: Compiler Explorer

15:14 <zid> <source>:8:20: warning: overflow in conversion from 'int' to 'signed char:1' changes value from '12' to '0' [-Woverflow]

15:14 matrice64 has joined #osdev

15:14 <zid> because 12 is even

15:14 <ckie> oh that.. overflowed my screen

15:14 <ckie> zid: so it is just a bit?

15:15 <zid> that's what 1 bit often means yes, 1 bit
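
A small sketch of why storing 12 into a 1-bit field reads back as 0. The warning above shows the original snippet used a `signed char : 1` field; this version assumes an `unsigned int` field instead so the result is defined by the C standard (value modulo 2) rather than by the implementation. `struct one_bit` and `store_in_one_bit` are hypothetical names.

```c
struct one_bit {
    unsigned int bit : 1;   /* only the lowest bit of a stored value survives */
};

/* Store `value` into a 1-bit unsigned bit-field and read it back. */
static inline unsigned int store_in_one_bit(unsigned int value)
{
    struct one_bit b;
    b.bit = value;          /* conversion to a 1-bit unsigned field: value mod 2 */
    return b.bit;
}
```

With this definition, even inputs such as 12 come back as 0 and odd inputs come back as 1.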

15:15 <zid> also you don't need to pack chars

15:15 <zid> and that's a union

15:15 <zid> and you didn't compile with optimizations

15:15 <ckie> zid: it wasn't initially, then i wanted to test More Things

15:15 <zid> so literally none of this makes sense, grats :P

15:16 <ckie> zid: )^:

15:17 <mrvn> <source>:8:20: warning: overflow in conversion from 'int' to 'signed char:1' changes value from '12' to '0' [-Woverflow]

15:17 <heat> the best part of a bool is how it literally just isn't a bit

15:17 <ckie> mrvn: read above

15:17 <heat> so you waste 7 other bools you could've stored there in the process

15:17 <zid> heat: Bool should be 8 bits and have massively different bitpatterns and invariant checks for bitflips

15:18 <zid> int isRoot :1; much less secure :p

15:18 <zid> google had a machine hacked because of that, scale + bitflip rate is a bitch

15:18 <ckie> the odds

15:19 <mrvn> ckie: hopefully can only ever be 0 and -1

15:19 <mrvn> or 0 and 1 on other archs

15:20 <mrvn> Tip: never ever use char for anything but `const char *` C-style strings.

15:20 <ckie> huh, the struct *does* get ceiled to byte

15:20 <zid> ceiled?

15:20 <zid> as in, ceil()? floating point round up to nearest integer?

15:21 <ckie> yes, my brain works in weird ways

15:21 <mrvn> ckie: it's a char. it's always going to be a char. Union members since C99 must share memory.

15:21 <ckie> bit=0.125

15:21 <mrvn> sizeof(union) == max(sizeof...(member))

15:21 <heat> you mean rounded?

15:21 <ckie> heat: yes, but always up, so ceil

15:21 <zid> well there's no way to address a single bit

15:21 <zid> it HAS to live in a byte

15:21 <heat> but these are integers

15:22 <heat> so rounded up

15:22 <mrvn> zid: and doesn't that make you wonder how std::vector<bool> works? :)

15:22 <zid> and the struct which contains the bitfield is 1 byte long, so making an array of those structs.. is an array of bytes

15:22 <zid> not an array of bits

15:23 <ckie> my actual situation is the segment descriptor type, i want a struct with four `u8 thing : 1;`s

15:23 <kingoffrance> i found my action replay and ran your psx cd zid. i get a purple gradient background, the picture, and static-ish sound :/

15:23 <zid> so why not look up how bitfields work

15:23 <mrvn> ckie: question: shat size does A and B have: struct A { char x : 5; char y : 5; char z : 5; }; struct B { short x : 5; short y : 5; short z : 5; };?

15:23 <zid> instead of guessing

15:24 <mrvn> s/shat/what/

15:24 <zid> but the answer is just that consecutive bitfields pack together naturally

15:24 <ckie> zid: i mean, the reason i came here was because i didn't realize that that syntax also used the same term. i figured it'd have a fancy word like rust's ::<>

15:25 <ckie> mrvn: -O?

15:25 <mrvn> ckie: doesn't matter what -O you use.

15:25 <ckie> okay then A, 3 chars, 3 bytes

15:25 <heat> you're overengineering this

15:25 <ckie> short 6

15:26 <mrvn> ckie: no, B is 2
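
The quiz can be checked directly with `sizeof`. This is a sketch that assumes GCC or Clang targeting x86-64, where a bit-field may not straddle a storage unit of its declared type; `char` and `short` bit-field types are themselves an implementation extension, so other compilers or ABIs may answer differently. `size_of_A` and `size_of_B` are hypothetical helper names.

```c
#include <stddef.h>

/* Three 5-bit fields declared as char: none can share a byte with the next. */
struct A { char  x : 5; char  y : 5; char  z : 5; };

/* Three 5-bit fields declared as short: 15 bits fit in one 16-bit unit. */
struct B { short x : 5; short y : 5; short z : 5; };

static inline size_t size_of_A(void) { return sizeof(struct A); }
static inline size_t size_of_B(void) { return sizeof(struct B); }
```

Under those assumptions A occupies 3 bytes and B occupies 2, matching the answers given in the channel.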

15:26 <heat> it's a segment descriptor

15:26 <heat> you can totally hardcode it

15:26 <ckie> heat: i am a sucker for satisfying things like this

15:26 <heat> that's the big stupid

15:26 <heat> you'll get lost in the details

15:26 <psykose> it's the big fun

15:26 <ckie> yeah!

15:26 <heat> also bitfields << shifts and masks

15:27 <heat> you can't write them atomically without a union

15:27 <ckie> heat: am i supposed to parse that message with << as shift

15:27 <heat> no

15:27 <psykose> bitfields << (shifts & masks)

15:27 <mrvn> ckie: I would just define the bits in an enum and then you can declare the descriptor as NX | BIT64 | KERNEL or something.

15:27 <zid> bitfields are an extension and are backwards wrt various compilers

15:27 <zid> they're not very usable

15:27 <zid> some compilers treat them as ABCD -> ABCDxxxx some as xxxxABCD some as xxxxDCBA etc

15:27 <heat> right

15:27 <mrvn> ckie: or do you plan to use lgdt?

15:28 <ckie> mrvn: well, what else would i do?

15:28 <heat> if you need to make sure they're in the correct order, don't use them

15:28 <zid> [16:04] <zid> It isn't needed, you can always implement it with an array of char instead, it's just much nicer syntax wise

15:28 <ckie> heat: but the syntax is so preeeettty

15:28 <mrvn> ckie: wait, ldt? what's the acronym for the local gdt?

15:28 <zid> That was 25 mins ago fwiw

15:28 <ckie> ldt?

15:28 <ckie> yeah

15:29 * ckie goes to check

15:29 <heat> ckie, the syntax is "pretty" but also horrific because YOU CANT SET TWO FIELDS AT ONCE

15:29 <mrvn> ckie: the gdt you install once and never change. so making that fancy is wasted time. But some people use the local descriptor table dynamically.

15:29 <zid> gotta rely on the compiler being clever to a large degree is always fun

15:29 <heat> i'm also not sure what &s.bitfield returns and I'm scared to know

15:30 <ckie> okay, you got me, i'll write it out by hand
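
Writing the descriptor out by hand with shifts and masks might look like the sketch below. It assumes the standard x86 8-byte segment descriptor layout; all of the `SEG_*` names and `make_descriptor` are made up for this example and are not from anyone's kernel in the channel.

```c
#include <stdint.h>

/* Access byte, descriptor bits 40..47. */
enum {
    SEG_ACCESSED   = 1u << 0,
    SEG_RW         = 1u << 1,  /* readable (code) or writable (data) */
    SEG_DC         = 1u << 2,  /* direction (data) or conforming (code) */
    SEG_EXECUTABLE = 1u << 3,
    SEG_CODE_DATA  = 1u << 4,  /* S bit: 1 = code or data segment */
    SEG_PRESENT    = 1u << 7
};
#define SEG_DPL(n) (((n) & 3u) << 5)   /* privilege level, bits 5..6 */

/* Flag nibble, descriptor bits 52..55. */
enum {
    SEG_FLAG_LONG = 1u << 1,   /* L: 64-bit code segment */
    SEG_FLAG_DB   = 1u << 2,   /* D/B: 32-bit default operand size */
    SEG_FLAG_GRAN = 1u << 3    /* G: limit counted in 4 KiB units */
};

/* Pack base, 20-bit limit, access byte and flag nibble into one descriptor. */
static inline uint64_t make_descriptor(uint32_t base, uint32_t limit,
                                       uint8_t access, uint8_t flags)
{
    uint64_t d = 0;
    d |= (uint64_t)(limit & 0xFFFFu);              /* limit 15:0  */
    d |= (uint64_t)(base & 0xFFFFFFu) << 16;       /* base 23:0   */
    d |= (uint64_t)access << 40;                   /* access byte */
    d |= (uint64_t)((limit >> 16) & 0xFu) << 48;   /* limit 19:16 */
    d |= (uint64_t)(flags & 0xFu) << 52;           /* flags       */
    d |= (uint64_t)((base >> 24) & 0xFFu) << 56;   /* base 31:24  */
    return d;
}
```

A 64-bit kernel code segment would then be `make_descriptor(0, 0xFFFFF, SEG_PRESENT | SEG_CODE_DATA | SEG_EXECUTABLE | SEG_RW, SEG_FLAG_GRAN | SEG_FLAG_LONG)`, and the bit positions stay fixed no matter how the compiler orders bit-fields.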

15:30 <heat> thanks

15:30 <mrvn> heat: <source>:9:12: error: cannot take address of bit-field 'hopefully'

15:30 <ckie> (but just for this though)

15:30 <heat> don't use them for paging either

15:30 <mrvn> ckie: I have it defined in my boot.S

15:30 <ckie> once i get tcc working i can do whatever i want to it

15:30 <heat> just a tip

15:30 <heat> the "I can't set two fields at once" thing really hurts there
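
For paging, the point about setting several fields at once can be sketched with plain masks: the whole entry is composed in a register and published with one 64-bit store. This assumes x86-64 4-level paging bit positions; the `PTE_*` names, `make_pte` and `set_pte` are hypothetical.

```c
#include <stdint.h>

#define PTE_PRESENT   (1ull << 0)
#define PTE_WRITABLE  (1ull << 1)
#define PTE_USER      (1ull << 2)
#define PTE_NX        (1ull << 63)
#define PTE_ADDR_MASK 0x000FFFFFFFFFF000ull   /* physical address bits 12..51 */

/* Compose an entry from a page-aligned physical address and flag bits. */
static inline uint64_t make_pte(uint64_t phys, uint64_t flags)
{
    return (phys & PTE_ADDR_MASK) | flags;
}

/* Publish the entry with a single store, so no half-updated state is visible. */
static inline void set_pte(volatile uint64_t *slot, uint64_t phys, uint64_t flags)
{
    *slot = make_pte(phys, flags);
}
```

With bit-fields, each member assignment may become its own read-modify-write of the entry, which is exactly what a page table that the MMU is walking should not see.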

15:30 <ckie> i can make it horribly not standard

15:30 <zid> does tcc even support bitfields

15:31 <mrvn> zid: it claims to be fully C99

15:31 <ckie> zid: idk, but it will if i want it to

15:32 <heat> mrvn, thankfully it errors out

15:32 <GeDaMo> "TCC not only supports ANSI C, but also most of the new ISO C99 standard and many GNUC extensions including inline assembly." https://www.bellard.org/tcc/tcc-doc.html

15:32 <heat> ckie, is this scope creep?

15:32 <ckie> heat: no, the plan has always been to hit everything i don't like with hammers until it's how i want it to be

15:33 <ckie> tcc is just a convenient starting point

15:33 <ckie> for a while i was considering learning FPGA and starting from there but i recently rerealized that i hate hardware very much

15:33 <heat> that sounds horrific good luck

15:33 <ckie> (^:

15:34 <ckie> heat: there's also a bit more of a plan than that FWIW

15:35 <ckie> but the actual things i've thought a lot about are only *after* i get a working tcc

15:35 <ckie> i've been mostly working on userspace C++/Rust things before this, and raw C is actually kind of nice

15:36 <GeDaMo> https://github.com/rswier/c4/blob/master/c4.c

15:36 <bslsk05> github.com: c4/c4.c at master · rswier/c4 · GitHub

15:36 <ckie> except inline asm and weird register allocation bugs

15:37 <ckie> GeDaMo: that's neat, but i need to be able to read it ideally

15:37 <ckie> without a headache*

15:37 <GeDaMo> It only implements a subset of C anyway :P

15:38 Mikaku has left #osdev [Leaving]

15:38 <ckie> also, some more context, i won't be writing the runtime-compiled code in C, i want it to just be an IR since it's C and it's easy to read

15:39 * ckie unless its a cursed bitfield

15:39 <j`ey> i dont understand that sentence

15:40 <mrvn> ckie: at least with older gcc bitfields aren't optimized well. Using your own bit masks gives better code.

15:40 <kingoffrance> using C as intermediate representation

15:40 <ckie> j`ey: i am writing barebones bootstrapping code rn so I can port tcc. then I will write a compiler that outputs C code which I'll feed to tcc, then jump to the asm

15:40 <j`ey> oh ok

15:40 <mrvn> ckie: and if you add volatile you really don't want bitfields as the compiler must not optimize those.

15:40 matrice64 has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

15:40 <j`ey> i got confused by the 'i want it', i thought it refered to what you would write the code in

15:42 diamondbond has quit [Quit: Leaving]

15:48 <heat> why C?

15:48 <heat> llvm bitcode does what you want, but in a proper way

15:49 <j`ey> harder to generate maybe?

15:49 <ckie> heat: i don't wanna port llvm

15:49 <j`ey> tcc is tiny

15:49 <ckie> and i know c

15:49 <heat> are you trying to port tcc to run on your os?

15:49 <ckie> heat: later, yes

15:49 <heat> what do you mean with "port tcc" then?

15:51 <ckie> well, once i hit enough things to make the CPU happy, i'll go get the tcc code and hit it with enough hammers until it links

15:52 <ckie> then i'll make it compile a hello world on startup and go from there

15:52 <ckie> (and run)

15:52 <j`ey> heat: ckie wants to compile their code at boot time

15:52 <j`ey> with tcc

15:52 <heat> hmm

15:52 <heat> odd but ok

15:53 <ckie> the end goal is not having to trust anything but the little bootstrap runtime

15:53 <ckie> (by adding memory safety things to tcc so you can't just do whatever you want)

15:58 <ckie> okay, and maybe some drivers

16:11 heat has quit [Remote host closed the connection]

16:11 heat has joined #osdev

16:13 * ckie did something really dumb

16:14 <ckie> god i should really write notes about where i left off

17:04 heat has quit [Ping timeout: 244 seconds]

17:28 alpha2023 has quit [Ping timeout: 240 seconds]

17:31 alpha2023 has joined #osdev

17:33 floss-jas has quit [Remote host closed the connection]

17:34 jimbzy has joined #osdev

17:49 dude12312414 has joined #osdev

17:53 nsmb has joined #osdev

17:54 puck has quit [Excess Flood]

17:55 puck has joined #osdev

18:37 pretty_dumm_guy has quit [Ping timeout: 246 seconds]

18:51 pretty_dumm_guy has joined #osdev

19:09 lanodan has quit [Ping timeout: 258 seconds]

19:11 lanodan has joined #osdev

19:19 kingoffrance has quit [Ping timeout: 265 seconds]

19:26 <geist> good afternoon folks

19:26 <geist> hows your weekend?

19:31 rustyy has quit [Quit: leaving]

19:32 rustyy has joined #osdev

19:41 <geist> okay, after bios the server machine still crashed. so that's pretty much proof that it's the motherboard/ram/power supply. i guess i can swap the PS next before writing off the mobo

19:42 <zid> I still call mobo

19:42 <zid> ps is unlikely to be like, *slightly* flakey, it'll either die under load every time, or not work

19:42 <zid> ram would cause ram errors that wouldn't MCE it'd just make things crash

19:48 <Griwes> okay, I think I now have the new IPI handling code in place and it... appears to work, yay

19:48 <Griwes> together with a thing that batches the invlpgs into batches of 32 to somewhat reduce the ipi pressure

19:50 <Griwes> at some point I'll need to figure out how to do heuristics that decide to switch between that and just telling other cores using the modified page tables to just reload cr3 instead of doing invlpg, but that day is not today

19:51 <mrvn> Griwes: If you collected 32 pages to invlpg isn't it faster to reload the page table?

19:52 <Griwes> no idea

19:52 <Griwes> I'll do some experiments at some point

19:52 <mrvn> if you have to send an IPI every 32 pages then I would say that's your cutoff.

19:52 <j`ey> Griwes: when you batch them.. that's just a loop right? I mean there's no hw support

19:52 <Griwes> but right now I need this to be functional, not necessarily as optimized as possible

19:53 <Griwes> j`ey, yeah I have an object that collects them and sends an IPI every 32 or on destruction

19:53 <mrvn> collect pages and if the buffer overflows send a "reload CR3"
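
The "collect pages, fall back to a full reload on overflow" idea could be sketched as below. This follows mrvn's variant rather than Griwes's actual code (which sends an IPI every 32 pages), and it only decides what to do; the IPI and the `invlpg`/CR3 reload themselves are left out. All names here are hypothetical.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define TLB_BATCH_MAX 32

enum tlb_action { TLB_NOTHING, TLB_INVLPG_LIST, TLB_FULL_FLUSH };

struct tlb_batch {
    uintptr_t pages[TLB_BATCH_MAX];  /* virtual addresses to invalidate */
    size_t    count;
    bool      overflowed;            /* more pages were queued than fit */
};

static inline void tlb_batch_init(struct tlb_batch *b)
{
    b->count = 0;
    b->overflowed = false;
}

/* Queue one page; once the buffer is full, remember that and stop recording. */
static inline void tlb_batch_add(struct tlb_batch *b, uintptr_t vaddr)
{
    if (b->overflowed)
        return;
    if (b->count == TLB_BATCH_MAX) {
        b->overflowed = true;
        return;
    }
    b->pages[b->count++] = vaddr;
}

/* Decide what the remote cores should be asked to do. */
static inline enum tlb_action tlb_batch_action(const struct tlb_batch *b)
{
    if (b->overflowed)
        return TLB_FULL_FLUSH;
    return b->count ? TLB_INVLPG_LIST : TLB_NOTHING;
}
```

Whether 32 is the right cutoff is the open question from the discussion; the constant is just the number mentioned above and would need measuring.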

19:53 <Griwes> more ideas stolen from geist :'D

19:54 <Griwes> we'll see

19:54 <Griwes> I'll tinker with it in the future and try to collect like actual data to make an informed design decision there

19:55 <j`ey> Griwes: so the object is shared between all cores?

19:56 <Griwes> no, it's local to a call to unmap()

19:56 Likorn has quit [Quit: WeeChat 3.4.1]

19:56 <geist> yah something like 16 or 32 is probably about right

19:56 <geist> i think we do 16 in zircon, but can probably test it

19:56 <j`ey> Griwes: i mean once you send the IPI, how do the other cores know what to invlpg

19:56 <geist> the other optimization you'll want to do is store some sort of bitmap per address space of which cpus have it active

19:56 <geist> and then only shoot TLB shootdowns to cpus that currently have that aspace active. unfortunately the kernel aspace is always active

19:57 <geist> which is why it's generally expensive to mess around with kernel mappings
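
The per-address-space bitmap could be as simple as the sketch below, assuming at most 64 CPUs and ignoring the atomics a real kernel would need when cores switch address spaces concurrently. `struct aspace` and the helper names are hypothetical, not Zircon's.

```c
#include <stdint.h>

struct aspace {
    uint64_t active_cpus;   /* bit n set = CPU n currently runs this aspace */
};

/* Called on context switch when `cpu` starts running this address space. */
static inline void aspace_activate(struct aspace *as, unsigned cpu)
{
    as->active_cpus |= 1ull << cpu;
}

/* Called when `cpu` switches away from this address space. */
static inline void aspace_deactivate(struct aspace *as, unsigned cpu)
{
    as->active_cpus &= ~(1ull << cpu);
}

/* CPUs that need a shootdown IPI: everyone active except the sender. */
static inline uint64_t shootdown_targets(const struct aspace *as, unsigned self)
{
    return as->active_cpus & ~(1ull << self);
}
```

For the kernel address space the mask would effectively be all ones, which is the expensive case geist describes.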

19:57 <Griwes> j`ey, the task inserted into the ipi work queue of any given core has a pointer back to the object

19:57 <geist> j`ey: it's part of the message. you put together a list of things to flush, then send a *synchronous* (ie, wait for it to complete) message to all the affected cores

19:57 <mrvn> microkernel, much fewer mapped pages. :)

19:57 <geist> when they've all marked that they finished, you can move on

19:58 <j`ey> Griwes: right, that's what I meant with 'so the object is shared between all cores?'

19:58 <geist> note this is generally an x86ism. ARM has a much better solution

19:58 <geist> riscv is curious: they basically make it a firmware call to do this

19:58 <Griwes> geist, yeah, I know there's things to limit how many cores I notify; I'm planning to experiment with logical apic destinations to see where I can get

19:59 <geist> but it does avoid the interrupts-disabled issue, because the firmware (at machine mode) is free to interrupt other cores when it wants to, even if they have interrupts disabled

19:59 <geist> Griwes: frankly by the time you get to that many cores there are generally much bigger issues to fight

19:59 <geist> but it's good to think about it

19:59 <Griwes> hmm, so riscv effectively also has SMM?

19:59 <geist> well, SMM *kinda*? it's much closer to EL3 in ARM

20:00 <geist> but yeah the x86 analogy is SMM, except the higher modes are much more transparent on riscv and arm

20:00 <geist> it's simply another privilege level, and largely symmetric

20:00 <geist> SMM is a bit more specialized and wonky

20:00 <Griwes> I see

20:00 <geist> but yes, functionally it's similar to having a piece of SMM firmware that just does whatever magic is necessary to sync tlb

20:02 <geist> https://github.com/riscv-non-isa/riscv-sbi-doc/blob/master/riscv-sbi.adoc#rfence-extension-eid-0x52464e43-rfnc etc

20:02 <bslsk05> github.com: riscv-sbi-doc/riscv-sbi.adoc at master · riscv-non-isa/riscv-sbi-doc · GitHub

20:03 <geist> i dunno if this is a good idea in the end, but riscv has generally hard tilted on putting complex stuff in firmware. i think the idea being that if you're being virtualized it turns into largely a paravirtualization interface for you

20:04 <geist> as a side note i read the riscv virtualization extensions yesterday. looks basically like ARM64 + VHE extension

20:05 <geist> j`ey might be interested in that

20:05 <mrvn> On the other hand in hardware it turns into every SOC doing its own thing and you have to support them all.

20:05 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

20:05 <j`ey> geist: im not looking at riscv at all!

20:06 <geist> ie, it's not really another mode, but supervisor mode when the V extension is enabled (by writing an enable bit in misa) has a second set of control registers to deal with, that are largely banked

20:06 <Griwes> huh, so it uses the exact same instruction as syscalls?

20:06 <Griwes> that's very elegant tbh

20:07 <geist> so you're kinda running in EL2 at that point. when you then eret to supervisor mode with the 'V' bit set in the main control registers, the banking happens and suddenly the guest code (in supervisor mode) is seeing the banked copies of stuff

20:07 <geist> so effectively the 'V' bit in sstatus enables nested virt and switches core control regs to a shadow copy

20:07 <geist> ie, sstatus is now the hypervisors status, and vsstatus is what the guest sees as sstatus, etc

20:08 <geist> so it's pretty clean. the entire thing is basically a banked copy of about 6 control registers and the additional functionality of specifying a second level page table

20:09 <geist> j`ey: well, i figured you might be interested only in that you probably know arm64 the most of folks here that are active right now

20:09 <geist> anyway, think i'll fiddle with this a bit today. need to toy with riscv some more

20:10 <Griwes> huh, IPIs are also an ECALL

20:10 <Griwes> playing with this will be interesting

20:10 <j`ey> im interested in virtualisation, working on a hypervisor type thing

20:10 <geist> right. yeah, they basically say 'hey dont worry about how to do that, that's machine specific, we'll hide it behind firmware'

20:11 <geist> Griwes: you'll also notice that timers themselves are implemented in machine mode. i think there's some effort underway to provide a supervisor level timer, since that may have been a bridge too far in 1.0 spec

20:11 <Griwes> I like how they already have a "legacy extensions" section lol

20:11 <geist> but, basically as it stands each cpu gets *1* timer, and its machine mode only, so you actually use SBI calls to set a timer for you

20:12 <geist> and that works by firing the timer irq in machine mode, and then that code reflects it down to supervisor mode.

20:12 <Griwes> huh

20:12 <geist> which at least arch wise is fairly easy to do, you could write a handful of assembly to look at the irq reason (in machine mode) and then decide, oh i'll just bounce this down to S mode by setting the corresponding timer irq bit in the status register and then eretting

20:13 <geist> it'll then instantly fire in S mode

20:13 <geist> basically s mode timers are virtual

20:13 <Griwes> so effectively on riscv you're always virtualized

20:13 <Griwes> neat

20:14 <geist> yeah

20:15 <geist> of course if you're some embedded cpu you probably just don't have supervisor mode, so you write your code to directly run in machine mode, so then you have to drive everything manually

20:15 <geist> but that's fine, it's embedded

20:15 <Griwes> yeah, and I don't have a particular desire to deal with truly embedded stuff myself

20:16 <geist> it's kinda fun, but mostly because you put on a different hat and have to deal with a different set of constraints

20:19 <Griwes> huh, armv9 is armv8 compatible? I guess this means that neither "arm64" nor "aarch64" is going to become a horribly imprecise name any time soon

20:20 <j`ey> yes armv9 isnt a break in compatibility like that

20:21 <geist> yah it's roughly armv8.5 + SVE mandatory

20:22 <geist> they didn't even spin it out into its own manual. the armv9 manual is simply the current version of the v8 manual

20:23 <geist> i dunno how that's going to continue long term though. i've watched v8 in the last 10 years just grow exponentially more complex

20:23 <geist> i'm still mostly dealing with v8.0 and v8.1 stuff, but once you get into all the optional bits post v8.1 it gets really really really hard to follow

20:23 <geist> need a cheat sheet of extensions

20:23 <Griwes> is it still risc? *hides*

20:23 <geist> very little fiddling with the ISA, virtually all of these are kernel level things

20:24 <geist> actually that's probably why they switched to v9: it's the first place where the user space ISA has changed fundamentally (with the addition of SVE)

20:26 <geist> much to my chagrin what they didn't do in v9 was remove a lot of stuff, like arm32. it's still there, just only EL0

20:26 <j`ey> still optional

20:26 <geist> but, since the manual still covers v8 it means it still has this huge pile of chapters dealing with legacy mode arm32 supervisor mode

20:27 <geist> which i think if they made a hard v9 manual you could at least delete a good 1000 pages out of it

20:27 <j`ey> yeah a 64-bit only manual would be nice

20:27 <geist> yah some sort of feature on a pdf viewer to just functionally knock out a range of pages would be nice

20:27 <geist> like, just skip these thousand pages, dont let them show up in searches, etc

20:28 <geist> i guess some sort of editor and re-save it with the 32bit chapters missing would work

20:29 <geist> reminds me of a little mini battle i've been fighting with my coworkers the last few years: seems that lots of folks have the idea that when you add #defines/etc for registers and whatnot out of the manual that you should give all the fields 'nice' names

20:29 <geist> such that they're descriptive of what they do. i get the idea

20:29 <geist> but... it means it's really hard to cross reference them from the manual

20:30 <geist> i have a generally hard rule that you *must* use the exact same name as the manual. and if it's not clear you add some comments at the place it's defined

20:30 <geist> that way you can just do a pdf search for 'what does this bit mean?'

20:31 <geist> but also accordingly it's really helpful if vendors use fairly unique monikers for registers and bits. ARM is good about it. SCTLR_EL1 only shows up as a single thing, etc

20:31 wand has quit [Remote host closed the connection]

20:31 <geist> its worst when vendors use generic names like ENABLE

20:31 wand has joined #osdev

20:32 <geist> some folks, for example, will define SCTLR_EL1 as something like SystemControlRegister with bits like EnablePaging

20:32 <geist> nice. i get it, but that's really hard to figure out what it means in the ARM ARM

20:32 <Griwes> I do rename some registers in code but I try to do that only when it's either very obvious where the docs are, or very obvious what the bits are

20:34 Bonstra has quit [Ping timeout: 260 seconds]

20:35 <geist> yah you *could* make an argument that you rename it but then make sure at the place of definition you document what the manual calls it

20:36 <j`ey> I'd rather have the defines as named by the spec, and just helper functions named better
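
One way to read that suggestion: keep the `#define`s spelled exactly like the Arm ARM field names so a PDF search finds them, and put the friendly names on helper functions. The sketch below assumes the SCTLR_EL1 bit positions for M, A, C and I, and works on a plain value rather than touching the real system register; the helper names are hypothetical.

```c
#include <stdint.h>

/* Names match the Arm ARM fields SCTLR_EL1.M, .A, .C and .I. */
#define SCTLR_EL1_M (1ull << 0)    /* MMU enable for EL1&0 stage 1 translation */
#define SCTLR_EL1_A (1ull << 1)    /* alignment check enable */
#define SCTLR_EL1_C (1ull << 2)    /* data cacheability control */
#define SCTLR_EL1_I (1ull << 12)   /* instruction cacheability control */

/* Descriptive helper: returns the value with MMU and both caches enabled. */
static inline uint64_t sctlr_el1_enable_mmu_and_caches(uint64_t sctlr)
{
    return sctlr | SCTLR_EL1_M | SCTLR_EL1_C | SCTLR_EL1_I;
}

/* Descriptive helper: is stage 1 translation turned on in this value? */
static inline int sctlr_el1_mmu_enabled(uint64_t sctlr)
{
    return (sctlr & SCTLR_EL1_M) != 0;
}
```

Call sites get readable names, while anyone asking "what does this bit mean?" can still search the manual for `SCTLR_EL1` and `M`.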

20:36 <geist> that's of course hard to do if the way you define things involves a bunch of templates and whatnot such that the editor can't 'find' the definition

20:36 <geist> but thats another battle.

20:36 <mrvn> geist: I want that with much more flexibility. Every optional thing should have an on/off switch in the reader.

20:37 <mrvn> Give me an interactive manual.

20:37 <geist> well we have this very complicated and flexible thing in fuchsia called hwreg which i'm not a fan of

20:37 <geist> but it does do basically what you want

20:37 <geist> you define a set of fields and it creates all these accessors and whatnot

20:39 <geist> https://fuchsia.googlesource.com/fuchsia/+/refs/heads/main/zircon/kernel/lib/arch/include/lib/arch/arm64/system.h#69 is for example what i'm talking about

20:39 <bslsk05> fuchsia.googlesource.com: zircon/kernel/lib/arch/include/lib/arch/arm64/system.h - fuchsia - Git at Google

20:39 <geist> that builds a whole class for that register with all the accessors and whatnot

20:39 <geist> with some trait class thing that i guess stamps out 3 copies of it (for different ELs)

20:40 <mrvn> geist: I was referring to the magic pdf reader :)

20:40 <geist> ah yeah

20:41 GeDaMo has quit [Quit: There is as yet insufficient data for a meaningful answer.]

20:41 <mrvn> I have a template mess too for describing registers for the RPi that deal with all the cases: bit, bits, mbz, ign, mbo, scattered bits, read-only, write-only.

20:42 <geist> yeah. i guess i'll have to get used to it, but i'm still a bit grouchy about it

20:42 <geist> the boilerplate to use it is pretty nasty

20:42 <geist> but, it is efficient: this hwreg stuff does generate exactly what you'd want

20:42 <mrvn> Not using it much because it's too much boilerplate as you say.

20:43 <mrvn> enum class { NX = 1<<17 .... } is just so much simpler to write.

20:44 <mrvn> What was the new way for volatile read/write again?

20:44 <geist> right. the usage of it is something like `arch::ArmSctlEl1::Get().FromVal(0).SetFoo().ClearBar().SetBaz(54).WriteTo(system_register);` or something

20:45 <geist> i forget how the writeTo part works, i think there's some factory to generate an accessor for real hardware register or something

20:45 vdamewood has joined #osdev

20:45 <Griwes> OOP brain worms, smh

20:45 <geist> i have always hated chaining things together like that, but that is very Rusty and i've seen that pattern make its way more heavily into work C++

20:45 <Griwes> what's wrong with good old fashioned bitwise math

20:46 <geist> well, as security folks will say it's very error prone, etc. which in my experience really isn't the case

20:46 <mrvn> geist: That looks horrible.

20:46 <geist> but that's a hard sell

20:46 <geist> https://fuchsia.googlesource.com/fuchsia/+/refs/heads/main/zircon/kernel/arch/arm64/arch.cc#164 here's a good one

20:46 <bslsk05> fuchsia.googlesource.com: zircon/kernel/arch/arm64/arch.cc - fuchsia - Git at Google

20:46 <geist> using a lambda even!

20:46 <mrvn> geist: What's the semantic of that anyway? Does it do one after the other? Or does it optimize it into a single read-modify-write?

20:47 <geist> that's right, there's a Modify() call you can make that gives you a place to run a lambda with the a variable already filled out that you then fiddle with and it'll write it back

20:47 * Griwes barfs a little

20:47 <geist> but. as i said the code actually compiles to precisely what you want, at least

20:48 <mrvn> geist: My syntax is: reg_foo |= NX | MODE(3) | KERNEL_RW | USER_R;

20:48 <geist> so strictly speaking it's 'better' because it efficiently implements a thing more safely and with some side debugging (there's a .dump() style routine you can call on it)

20:48 ripmalware_ has quit [Ping timeout: 240 seconds]

20:48 <geist> but i'm being dragged kicking and screaming into this world

20:49 <Griwes> it's like people looking at .unwrap().unwrap().unwrap() and saying that it's beautiful

20:49 xenos1984 has quit [Read error: Connection reset by peer]

20:49 <Griwes> hang on, I'm not even 29 yet, I'm too young to be a grumpy old man

20:49 <geist> Griwes: heh

20:49 <geist> i mean yeah you kinda get used to it, i guess

20:50 <geist> it's easier for me to accept when it's a new language that has that idiom

20:50 <geist> since you have to sort of brain context switch

20:50 <mrvn> geist: what happens when you forget the .WriteTo(system_register)?

20:50 <geist> then it just hangs around

20:50 <mrvn> geist: or write to the wrong one?

20:50 <geist> that's how you read from it

20:50 ripmalware has joined #osdev

20:51 <geist> you can do an `auto foo = blahblahbgetregister; something = foo.field();`

20:51 <geist> doesn't require you write it back

20:51 <mrvn> geist: modifications should be RAII

20:51 <geist> that's probably why that `Modify()` helper routine exists, it does the read and writeback for you

20:51 <geist> there's probably some feature for that. keep in mind my example above was just off the top of my head

20:52 <geist> the hwreg stuff has *lots* of helper routines and whatnot. wouldn't be surprised if there was some RAII wrapper that does precisely what you want

20:52 <geist> OTOH, I'd kinda rather not have writebacks happen at arbitrary times

20:52 <geist> generally the writeback is sort of important, so i'd rather it be explicit, or an error if you forgot to (if you want to have safety rails)

20:53 <mrvn> I'm still trying to get to something like this: WITH(reg_bla) { blub = 1; if (bla) baz = 2; ... };

20:53 <geist> well, that's exactly what that Modify() thing above does

20:53 <mrvn> The closing } writes back.

20:53 <geist> takes a functor of some type. maybe thats your solution

20:53 <mrvn> And the WITH opens up the namespace or scope or whatever so blub/bla/baz have meaning.

20:54 <geist> that might be difficult

20:54 <mrvn> yeah. that's the part I'm stuck with

20:54 <geist> anyway my brain is running out of my nose now, thinking about c++ right now

20:54 <geist> think i'm going to do something productive with my afternoon

20:54 <mrvn> Can't do "using namespace REG_BLA { ... }"
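
In plain C, the "closing brace writes back" half of that idea can be approximated with a `for`-loop macro. This is only a sketch under assumptions: it does not solve the scoping problem (the field names below are ordinary global macros), it assumes a 32-bit memory-mapped register, and `WITH_REG`, `REG_ENABLE`, `REG_MODE` and `configure` are all hypothetical names.

```c
#include <stdint.h>

/* Read *reg into a local named `tmp`, run the body once, then write it back. */
#define WITH_REG(reg, tmp)                                  \
    for (uint32_t tmp = *(reg), tmp##_pending = 1;          \
         tmp##_pending;                                     \
         *(reg) = tmp, tmp##_pending = 0)

#define REG_ENABLE   (1u << 0)
#define REG_MODE(n)  (((uint32_t)(n) & 3u) << 4)

static inline void configure(volatile uint32_t *reg, int fast)
{
    WITH_REG(reg, r) {
        r |= REG_ENABLE;
        if (fast)
            r |= REG_MODE(2);
    }   /* leaving the block performs the single write-back */
}
```

The register is read once and written once, however many fields the body touches. A `break` or `return` inside the body skips the write-back, so this has none of the safety rails discussed above.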

20:57 <geist> unrelated: i did my first surface mount chip solder yesterday. wasn't that hard

20:58 <geist> but it was a SOP-14, so still fairly large

20:58 <geist> going to try to solder some small ass resistors in a bit

20:58 Bonstra has joined #osdev

20:58 <mrvn> ahh, nothing like the smell of hot led in the morning.

20:58 <mrvn> lead

20:59 <mrvn> geist: per hand or with an oven?

20:59 <geist> by hand

20:59 <geist> used a drag soldering method, though each pin would have also been doable

21:00 <zid> for resistors my advice is give up now

21:00 <zid> or use paste

21:00 <zid> either works

21:01 <geist> well, i think the only reason any of this works is i have a decent stereo binocular scope

21:01 <geist> otherwise i wouldn't be able to see any of it

21:01 <geist> and reasonably steady hands

21:01 <zid> paste is gooey and then the positioning and the heating are diff steps

21:01 <zid> you just pray it doesn't tombstone

21:01 <mrvn> the quality of the board and pads is important too

21:02 <geist> yah but you can pop that back off though if it tombstones (assuming my assumption is what it hin)

21:02 <zid> yea if it does you just go again

21:02 <geist> but you're right, i need to do some resistors next. i have a batch of 0805s which are pretty big

21:02 <zid> What about a heat gnu?

21:02 <zid> When a heat elk just won't cut it

21:03 <Griwes> gnu/heat

21:03 <geist> i have some rework station gun somewhere

21:03 <geist> but yes i should get some skills there too

21:03 <geist> like maybe use it to remove the chip i soldered down yesterday

21:03 <zid> microsoldering is just not any good with an iron

21:04 <Griwes> I think the time has come for me to start figuring out a DSL for userspace IPC, I'm expecting that to be quite the journey

21:05 <geist> domain specific language?

21:05 <zid> it's either that or dick sucking lips

21:05 <geist> <gasp>

21:06 <zid> I was about to ask though if that meant "A formal grammar" or just "a protocol"

21:06 <geist> but you blew it right?

21:06 <geist> now the conversation can never recover

21:06 <Griwes> lol

21:06 <zid> not for free

21:06 <Griwes> Yes a domain specific language for protocol definitions, you filthy osdevvers

21:07 xenos1984 has joined #osdev

21:07 <geist> btw i may have linked this the other day but this is a great description of drawing with atari 2600

21:07 <geist> https://youtu.be/sJFnWZH5FXc

21:07 <bslsk05> 'Racing the Beam Explained - Atari 2600 CPU vs. CRT Television' by Retro Game Mechanics Explained (00:38:25)

21:07 <mrvn> I want segments for my IPC. I want to pass a blob of memory and any pointers should be relative to the message start and limited to the message size.

21:07 <zid> I thought it was just "count cycles"

21:07 <geist> this guy does pretty great deep dives into old stuff

21:07 <geist> it's very much count cycles, but he really has a great example and good visualizations of it

21:07 wootehfoot has joined #osdev

21:21 <mrvn> I'm waiting for the TFT just being a device with RDMA for the framebuffer you write to directly.

21:22 <mrvn> It's kind of stupid for the cpu to generate a video format for the output just so the monitor can parse it back into memory, scale to fit, and drive the display.

21:23 floss-jas has joined #osdev

21:30 <geist> well that was no sweat. just soldered a couple of 0805s and that was ez

21:33 <zid> how much cheating did you need

21:35 <geist> zero. just went down fine

21:35 <geist> put a solder blob, then tweezered it over and heated it up, stuck fine

21:35 <zid> tweezers, pre-tinned iron, magnifiers, robot arms?

21:35 <geist> then added solder to the other end, works fine

21:35 <geist> well aside from robot arms, i already had the other stuff

21:35 <zid> so 3/4 cheating

21:36 <geist> sure i mean i didn't do it entirely by willing it to be

21:36 <geist> but i don't see how you could do it without tweezers at the minimum

21:36 <zid> a lot of prodding? :p

21:37 <geist> with what? your finger?

21:37 <zid> tip of the iron while it's cold? some solder?

21:37 <zid> fingee

21:37 <zid> watch out with fingee though

21:37 <zid> https://cdn.discordapp.com/attachments/136321206260334593/980012968840404992/unknown.png

21:38 <geist> anyway, yes. this somewhat unlocks future designs. i've been putting this off for a long time

21:39 heat has joined #osdev

21:43 Likorn has joined #osdev

21:44 kingoffrance has joined #osdev

21:55 wootehfoot has quit [Quit: Leaving]

22:09 mahmutov has quit [Ping timeout: 258 seconds]

22:30 thatcher has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

22:30 thatcher has joined #osdev

22:39 <mrvn> I think I need a summer cleaning:

22:39 <mrvn> Filesystem Size Used Avail Use% Mounted on

22:39 <mrvn> rpool/home/mrvn 1.5T 1.3T 244G 84% /home/mrvn

22:44 mahmutov has joined #osdev

23:33 <mrvn> There are currently -639987340 people living on the Earth. Thank you `int`. Much more living space now.
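
A small sketch of how a negative head count like this can arise, assuming the real figure was stored in a 32-bit signed `int` with two's-complement wraparound. The input 7949947252 is an assumption: it is one value that wraps to the number quoted, not a figure given in the log, and `to_int32` is a hypothetical helper.

```python
def to_int32(n: int) -> int:
    """Reduce an arbitrary integer to the range of a 32-bit two's-complement int."""
    return (n + 2**31) % 2**32 - 2**31


# Assumed input: a world population of roughly 7.95 billion, which exceeds
# the int32 maximum of 2147483647 and wraps past it.
wrapped = to_int32(7_949_947_252)
print(wrapped)  # -639987340
```

A 64-bit integer type holds the assumed value without wrapping.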

23:41 <moon-child> that's it?

23:41 <moon-child> Filesystem Size Used Avail Use% Mounted on

23:41 <moon-child> d 3.5T 3.3T 185G 95% /d

23:41 <zid> Filesystem D, the hot new benchmark racing anime

23:46 <moon-child> hmm looks like only 10-15% anime (depending on how you count)

23:47 <zid> does it have initial D

23:47 <moon-child> some of which I don't care about and should probably delete...

23:47 <zid> god knows how much of my 4T drive is anime rn, I've not deleted anything since I got the drive, it's almost full now

23:47 <moon-child> zid: ( ͡° ͜ʖ ͡°)

23:55 <zid> I am going out at 8am tomorrow, it's 1am rn and I've been going to bed at 4am.

23:55 <zid> I am hosed

23:56 mahmutov has quit [Ping timeout: 240 seconds]

23:56 <moon-child> lol

23:56 <moon-child> good luck

23:56 <zid> There's always the RTA method, sleep skip

23:57 <zid> any% going outside (no sleep)

23:57 <moon-child> sleeping a couple hours is probably better

23:57 <moon-child> but it doesn't always _feel_ better

23:57 <zid> sometimes, depends when you wake up in your sleep cycle