#osdev on 2022-05-21 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:01 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

00:03 <zid> clever: Yea the pic I linked was a thumbnail from one of these vids

00:04 <clever> zid: that exact vid, i just took the vid ID from your link

00:04 <mrvn> zid, clever: Now I will have nightmares about some helicopter pilot going postal and flying through NY City with that thing. :)

00:04 <clever> :D

00:07 mrlemke has joined #osdev

00:35 gog has joined #osdev

00:36 nyah has quit [Ping timeout: 246 seconds]

00:37 <heat> how do you set the LBA size in nvme? is it when you're formatting the drive (namespaces)?

00:49 <heat> woah linux can get 13M IOPS on a single core, 12900K, optane

00:51 Likorn has quit [Quit: WeeChat 3.4.1]

01:31 <zid> I wouldn't be surprised if you were the only one who knew nvme at all

01:31 <zid> we regularly discuss the pros and cons of 1979 intel

01:32 <geist> heat: i think it's when formatting

01:33 <geist> or at least when i did on the one drive i had it had to reformat it

01:33 <heat> ah right

01:33 <heat> that makes sense

01:34 <zid> which is sort of odd

01:34 <zid> a layer separation of sorts, but trim kinda murders that

01:34 <geist> well, also nvme has the notion of multiple namespaces and you can create/delete/resize/etc them individually

01:34 <geist> so it sort of makes sense that that's when you specify the block size

01:35 <geist> unclear if you can mix/match block sizes on any firmware or if all namespaces have to have the same

01:35 <geist> and/or if different namespaces are allowed to mix the underlying flash blocks

01:35 <geist> presumably no and yes

01:36 <heat> different namespaces can have different lba formats

01:37 <zid> nvme is silly and should feel silly

01:37 <heat> it's part of the identify command for the individual namespace

01:37 <geist> yah whether or not real hardware allows that i dunno

01:37 <geist> see `nvme id-ns`

01:37 <heat> fun silly(tm) fact: the identify command can list up to 64 lba formats

01:37 <heat> you then choose one

01:38 <heat> they even have a 2-bit performance rating on them
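
[Editor's sketch] The LBA format entries heat describes can be unpacked as below. This is a minimal sketch, assuming the Identify Namespace layout in which each LBA Format entry is one 32-bit word holding the metadata size (bits 15:0), the LBA data size as a power of two (bits 23:16), and the 2-bit relative performance rating (bits 25:24). The struct and function names are illustrative and do not come from any driver mentioned in the log.

```c
#include <stdint.h>

/* One LBA Format (LBAF) entry, unpacked. Names are illustrative. */
struct lba_format {
    uint16_t metadata_size;   /* MS: metadata bytes per LBA */
    uint8_t  lba_data_shift;  /* LBADS: LBA data size is 2^LBADS bytes */
    uint8_t  relative_perf;   /* RP: 0 = best ... 3 = degraded */
};

static struct lba_format decode_lbaf(uint32_t raw)
{
    struct lba_format f;
    f.metadata_size  = (uint16_t)(raw & 0xffffu);
    f.lba_data_shift = (uint8_t)((raw >> 16) & 0xffu);
    f.relative_perf  = (uint8_t)((raw >> 24) & 0x3u);
    return f;
}

/* LBA size in bytes, or 0 when the entry is unusable.
 * Shifts below 9 (under 512 bytes) are rejected, matching the
 * "no LBA smaller than 512 bytes" rule mentioned later in the log. */
static uint64_t lba_size_bytes(struct lba_format f)
{
    if (f.lba_data_shift < 9 || f.lba_data_shift > 63)
        return 0;
    return (uint64_t)1 << f.lba_data_shift;
}
```

A driver would typically walk the list of entries, skip those for which lba_size_bytes() returns 0, and use the rating only as a hint.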

01:38 <zid> ARGB_8_8_8-8 please

01:38 <geist> yah i think thats what i remember with this one too

01:39 <geist> alas most of my nvmes dont support more than one format

01:39 <zid> I don't have any :P

01:39 <zid> sata is life

01:39 <geist> but i think it makes sense, most of mine are samsungs, and the samsungs all have onboard dram

01:39 <heat> me neither :|

01:39 <heat> i need to get one though

01:40 <zid> yea I was surprised, my 850 which is.. ancient now, has 512MB of DDR3

01:40 <geist> so, i dont think there's a fundamental reason why they would bother supporting less than the 512 byte lbas

01:40 <geist> because they have enough ram to hold that table in memory

01:40 <heat> less?

01:40 <geist> but, the WD blue i have that *does* support reformatting

01:40 <heat> like 256 byte lbas?

01:40 <geist> also has the dram-sharing 'feature' because it doesn't have any ram on board

01:41 gog has quit [Ping timeout: 244 seconds]

01:41 <zid> When are we rooting an SSD and running ponyos on it

01:41 <geist> so in that case it actually does make some sense to support 4K because it then doesn't need as big of a translation table, and it just asks for correspondingly less stolen ram

01:41 <geist> heat: less blocks is what i really mean (via larger LBAs)

01:41 <heat> the nvme spec explicitly disallows < 512 byte lba

01:41 <heat> ah yeah

01:41 <geist> since the size of the translation table is inversely proportional to the block size

01:41 <clever> geist: what about 512 byte virtual block sizes, and 1mb physical block sizes, smaller translation table, at the cost of bigger rmw cycles

01:42 <geist> i remember doing some math and it kinda lines up if you think of there being a large in memory table with say 4 bytes per block

01:42 <geist> clever: that's basically what you get with SD cards

01:42 <heat> my 870 evo (sata) has 1GB of dram

01:42 <heat> pretty impressive

01:42 <geist> and fundamentally why they're slower

01:42 <zid> yea I was super surprised by how much mine had

01:42 <clever> ah

01:42 <zid> 70 is 2 gens lower and only has 1GB

01:42 <zid> newer*

01:42 <clever> geist: that could also explain why SD cards deal with sequential write loads better, it's not a rmw if you write the physical block in one shot

01:43 <geist> right

01:43 <clever> i have been thinking about testing a log based fs on the rpi

01:43 <clever> so all write load is sequential

01:43 <heat> f2fs!

01:44 <clever> there have been many mentions on the rpi forums, about the warranty on your SD card being void if you use it as an OS disk on any system

01:44 <clever> because they are designed for highly sequential loads like photo or video

01:44 <geist> right, my general experience with that is samsung SD cards are pretty good for random io

01:44 <clever> but a log based fs would entirely solve that, and possibly increase both lifetime and performance

01:45 <geist> whether or not that's because they're intrinsically less safe i dunno

01:45 <geist> but sandisks, for example, run like ass

01:45 <clever> i have a sandisk that i managed to kill with one too many gcc builds

01:46 <clever> it went internally read-only

01:46 <geist> heat: my math is off so i dont remember what i was thinking but consider something like 1TB

01:47 <geist> 2^40. that has 2^(40 - 9) 512 byte blocks (2^31)

01:47 <geist> so that is already 2GB of memory if you have one byte per, so i dunno how a translation table could completely hold in ram

01:47 <geist> answer is it probably doesn't and it swaps it in and out as it's in use

01:47 <heat> yeah it can't

01:47 <geist> for say 4 bytes per block that'd be 2^33

01:48 <heat> my 870 evo is 1TB, 1GB of ram

01:48 <geist> with 4K blocks it's a little nicer though: 40 - 12 + 2

01:48 <geist> 2^30

01:48 <clever> and now it fits within a 32bit int!
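
[Editor's sketch] geist's exponent arithmetic can be checked with a few lines of C. This is only a back-of-the-envelope model, assuming a flat table with one fixed-size entry per logical block; real controllers may lay out their mapping data quite differently. The function name and parameters are made up for this note.

```c
#include <assert.h>
#include <stdint.h>

/* Bytes needed for a flat logical-to-physical table:
 * (capacity / block size) entries of entry_bytes each.
 * Sizes are passed as log2 values to mirror the exponent arithmetic
 * in the discussion (1TB = 2^40, 512B = 2^9, 4K = 2^12). */
static uint64_t table_bytes(unsigned capacity_shift, unsigned block_shift,
                            uint64_t entry_bytes)
{
    assert(block_shift <= capacity_shift);
    assert(capacity_shift - block_shift < 64);
    return ((uint64_t)1 << (capacity_shift - block_shift)) * entry_bytes;
}
```

With these inputs, table_bytes(40, 9, 1) is 2^31 (2GB), table_bytes(40, 9, 4) is 2^33 (8GB), and table_bytes(40, 12, 4) is 2^30 (1GB), which lines up with the 1GB of DRAM on the 1TB drive heat mentions.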

01:48 <heat> I wonder what NVMEs run as a CPU

01:49 <heat> I think western digital has been using riscv

01:49 <heat> they contribute quite a lot to riscv code

01:49 <geist> right, i had read somewhere that one of the samsung controllers a few years ago was something like 3 arm cortex-r5s

01:49 <geist> one acts like the interface and the other two run the wear levelling/drive the flash or something

01:50 fwg has joined #osdev

01:50 <geist> it was in some anandtech review

01:50 <heat> 3 SoCs or 3 cores?

01:50 <clever> https://spritesmods.com/?art=hddhack

01:50 <bslsk05> spritesmods.com: Sprites mods - Hard disk hacking - Intro

01:50 <geist> https://www.anandtech.com/show/16636/the-inland-performance-plus-2tb-ssd-review-phisons-e18-nvme-controller-tested has something like that mentioned too

01:50 <bslsk05> www.anandtech.com: The Inland Performance Plus 2TB SSD Review: Phison's E18 NVMe Controller Tested

01:51 <geist> 3 cores. i dont think any nvme has remotely any space to fit more than one soc

01:51 <clever> > This confused me for a bit... I expected a single tap, for the single ARM core that's inthere... but instead, I found three taps... does that mean this chip has three ARM-cores?

01:51 <clever> > There's two Feroceons, which are quite powerful arm9-like cores, and a Cortex-M3 core, which is a bit smaller, more microcontroller-ish core

01:51 <clever> this guy also found 3 cores on the same jtag ring, in a mechanical hdd

01:51 <geist> sure. not surprising at all

01:52 <clever> he then modified the firmware, so the drive lied about the contents of /etc/shadow when you write a magic string to any block

01:52 <geist> maybe odd to folks that haven't looked at this sort of thing before, but it's totally not odd to find piles of dissimilar cores on a single soc, even on the same jtag chain

01:52 <clever> and boom, free root

01:53 <heat> "free" root

01:53 <clever> heat: well, yeah, you need to mitm the supply chain first, lol

01:53 <clever> so not really free

01:53 <geist> unclear what each of these would even run anyway, but it's entirely possible you have 3 cores like that so each one of them simply runs a big loop of command processing

01:53 <clever> a checksummed fs like zfs would also prevent this, as would luks

01:53 <heat> and you need root to know where /etc/shadow is stored

01:54 <geist> kinda makes sense doing that than trying to build some sort of RTOS and then deal with it

01:54 <clever> heat: pattern match any block that starts with root::something

01:54 <heat> geist, maybe the new stuff runs lk :)

01:54 <clever> you could also expose a full api, where you can talk to the backdoor by writing blocks to disk

01:55 <geist> possibly, but as far as i know no one is using LK for riscv stuff

01:55 <clever> ive even seen a flash->floppy adapter earlier

01:55 <clever> where you are basically emulating a serial port, by reading/writing to sector 0

01:55 <clever> that thing is crazy

01:56 <clever> https://youtu.be/5g3afPmSnbY?t=2511

01:56 <bslsk05> 'The first thing that ever used MPEG4' by Cathode Ray Dude [CRD] (00:50:59)

01:56 <clever> thats basically like those cd->tape adapters

01:56 <clever> but it cant emulate tracks, and it cant know the movements of the head

01:57 <clever> so its instead a kind of mmio over the floppy interface, where the tracks are dynamically changing their own data

02:03 <zid> It's 3am, so I made egg fried rice.

02:03 <zid> Made sense at the time

02:03 <klange> I think I had one of those at some point, I recognized it in the video when I watched it earlier.

02:04 <clever> zid: been there, made pizza at 5am, lol

02:04 <zid> I'm going out for coffee with a friend "saturday morning"

02:04 <zid> guess that means.. stay awake until lunch?

02:05 <zid> Either that, or try to bed soon and then set a possibly needlessly early alarm and hate myself all morning. Decisions

02:05 <heat> ditch the friend and sleep

02:05 <heat> antisocial behaviour 101

02:06 <zid> no, I begged him to pay attention to me because he brushed me off the last 2 weekends because he was busy :P

02:06 <zid> we usually go out every sunday to window shop antique shops and stuff

02:07 floss-jas has joined #osdev

02:07 floss-jas has quit [Excess Flood]

02:07 <zid> This egg fried rice isn't bad, I am an amazing chef.

02:09 <geist> yah i think ripping on old video recording things from early 2000s is a bit much

02:09 <geist> yeah it doesn't look great, but damn, any sort of camera that records digital video back then is pretty great

02:10 <geist> i'd have been pretty chuffed to have something like that

02:10 <clever> around that time, i had a toy photo camera for kids

02:10 <clever> it was horrible resolution, and it downloaded over serial

02:11 <clever> still got it

02:11 <geist> i think i had a Kodak DC260 or something like that in around 1999 that was like VGA rez and it was pretty cool. 16MB card i think

02:11 <zid> when I was in hs one of the teachers took pictures of our projects with a camera that saved to a floppy disk, I thought it was pretty rad

02:11 <zid> must have been old as fuck at the time though

02:11 <clever> mine is from nickelodeon

02:11 <zid> floppies were on the way out when I was in hs

02:12 <geist> by 2005 i think i had gotten a tiny little olympus camera that took the then new SD card format. still have that somewhere, was a neat little camera

02:13 <clever> https://www.youtube.com/watch?v=wfFeCfp_xPk ah, LGR did a vid on my camera

02:13 <bslsk05> 'Nick Click: The 90s Nickelodeon Digital Camera Experience' by LGR (00:14:24)

02:13 <geist> yah late 90s they were starting to get slightly better than toys.

02:13 <clever> no removable storage, less for a kid to break

02:13 <geist> or at least in the toy to maybe usable range

02:14 <geist> ah yeah the DC260 was actually 2 megapixel. was farther along than i thought

02:15 <clever> i think mine was 640x480 max

02:15 <clever> at best

02:16 <zid> The 384kbit video looks a lot better than he's shitting on it

02:16 <zid> the sensors and stuff back then were garbage anyway

02:16 <clever> zid: some of that is the re-encoding improving it

02:16 <zid> anything pre mobile phone cameras was pretty garbage

02:17 <clever> half way thru the video, he says that re-encoding to h264, then upscaling, improves the quality

02:17 <geist> but yeah i sort of like watching him discover things that were fairly commonplace when i was younger

02:17 <clever> if he first upscales, then re-encodes, it looks far worse, the same as just playing it directly

02:17 <geist> like zip disks and whatnot. sort of interesting to see how someone presumably younger sees stuff like that

02:17 <clever> i found the video editing deck that CRD played with rather interesting

02:18 <zid> I view a camera like that as basically like a dictaphone but with some crappy video attached, but you don't have to pay for film

02:18 <zid> film is expensive to take notes with :p

02:18 <clever> zid: some of the cameras CRD talked about in that vid, only handle 2 or 3 minutes of video

02:18 <zid> I suppose magnetic tape would have just been better though

02:18 <clever> before the storage is entirely full

02:18 <geist> reminds me i always kinda wanted one of those tiny tape recorders

02:18 <zid> magnetic tape is surprisingly good

02:18 <geist> with the little micro tape. i dunno why

02:19 <geist> though i guess the real auteurs of tape recorders all go for that particular 'portable' model that folks used back in i guess the 70s

02:19 <geist> it's like a little mini reel to reel that you carry out when recording wildlife or whatnot

02:19 <zid> What about a wire recorder? :P

02:20 <zid> Who needs magnetic tape when you can just use a spool of steel wire

02:20 <zid> 2.2km/hr

03:10 heat_ has joined #osdev

03:11 heat has quit [Ping timeout: 248 seconds]

03:18 heat has joined #osdev

03:18 heat_ has quit [Read error: Connection reset by peer]

03:31 <heat> geist, trippy: the NVMe queues can be non-contiguous

03:31 <geist> oh yeah?

03:31 <heat> yeah, it's optional to support but it's a thing

03:32 <geist> wonder if that has something to do with mapping them into virtual machines

03:32 <geist> ie a guest thinks it's giving you a contiguous run of physical but the host says 'aww shit that's actually discontig'

03:34 heat_ has joined #osdev

03:34 <heat_> love m'internet

03:34 <heat_> <heat> i was thinking that the contiguous bit in the capabilities meant that completion queues and submission queues needed to be contiguous (one after the other in physical memory)

03:34 <heat_> <heat> it actually means that if !contiguous, you pass in a sg-list of queue pages
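
[Editor's sketch] The capability bit heat is describing can be read as below. This is a minimal sketch, assuming the controller capabilities (CAP) register layout in which bits 15:0 hold the maximum queue entries as a zero-based value and bit 16 is the "contiguous queues required" flag. The helper names are invented for this note, and queue_pages() only illustrates how many page pointers a non-contiguous queue would need; it is not taken from heat's driver.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* CAP bit 16: when set, queues must be physically contiguous. */
static bool cap_contiguous_queues_required(uint64_t cap)
{
    return ((cap >> 16) & 1u) != 0;
}

/* CAP bits 15:0 are zero-based, so add one to get the entry count. */
static uint32_t cap_max_queue_entries(uint64_t cap)
{
    return (uint32_t)(cap & 0xffffu) + 1u;
}

/* Pages needed to back a queue of `entries` slots of `entry_size` bytes.
 * For a non-contiguous queue, this is how many page addresses the
 * driver would have to hand to the controller. */
static size_t queue_pages(size_t entries, size_t entry_size, size_t page_size)
{
    assert(page_size != 0);
    return (entries * entry_size + page_size - 1) / page_size;
}
```

For example, a 1024-entry submission queue with 64-byte entries spans 16 pages of 4K, while a completion queue of the same depth with 16-byte entries spans 4.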

03:34 heat has quit [Read error: Connection reset by peer]

03:34 heat_ is now known as heat

03:35 <heat> yeah maybe passthrough

03:35 <heat> good point

03:35 <heat> because I don't see how this can be fast at all

03:35 <heat> but it's still way faster than not having NVMe inside the VM

03:35 <geist> though i think for that to work i guess the iommu would solve that

03:53 <Griwes> iommu is such a nice thing for so many reasons

03:56 Likorn has joined #osdev

04:01 <heat> https://kernel.dk/blk-mq.pdf

04:01 zaquest has quit [Remote host closed the connection]

04:05 theruran has joined #osdev

04:10 No_File has joined #osdev

04:31 rustyy has quit [Quit: leaving]

04:36 vinleod has joined #osdev

04:37 vinleod is now known as vdamewood

05:07 Likorn has quit [Quit: WeeChat 3.4.1]

05:10 rustyy has joined #osdev

05:11 rustyy has quit [Client Quit]

05:13 rustyy has joined #osdev

05:31 <heat> ✅

05:31 <heat> this was fun

05:31 <heat> i'm still missing some stuff (SGLs, all the fancy features I'll never want to support, multiple MSI vector support)

05:31 <heat> but I got basic multiqueue read/write working

05:38 <heat> all in all I think this was simpler than AHCI

05:50 heat has quit [Ping timeout: 244 seconds]

05:52 troseman has quit [Ping timeout: 260 seconds]

06:27 crm has joined #osdev

06:30 orthoplex64 has quit [Ping timeout: 260 seconds]

06:37 ripmalware has quit [Ping timeout: 276 seconds]

07:00 ThinkT510 has quit [Quit: WeeChat 3.5]

07:04 ThinkT510 has joined #osdev

07:57 jafarlihi has joined #osdev

07:57 zaquest has joined #osdev

07:57 lg has quit [Read error: Connection reset by peer]

07:58 <jafarlihi> I'm making x86 OS and have implemented GDT, IDT, ISRs, IRQs, keyboard. What is the logical next step? PMM/VMM/Paging?

07:58 lg has joined #osdev

07:59 <ThinkT510> neato, do you have a public repo?

08:05 <jafarlihi> https://github.com/jafarlihi/meaty-skeleton

08:15 jafarlihi has quit [Ping timeout: 244 seconds]

08:16 <geist> task switching

08:16 <geist> i'd implement some sort of context switching so you can run more than one thread

08:16 <geist> oh they left, okay

08:19 mahmutov has joined #osdev

08:21 the_lanetly_052 has joined #osdev

08:21 <kazinsal> yeah, I'd say memory management -> tasking -> user mode -> syscalls

08:22 <kazinsal> at that point you're basically just hooking bits up to the user space as you go

08:24 <geist> i'd generally do tasking first since you dont really need any more than a heap (if that)

08:24 <geist> you can switch threads you're basically at a useful embedded rtos level

08:24 <geist> then you can keep going

08:24 <kazinsal> true

08:24 <kazinsal> then your threads can mmap in additional address space

08:25 <geist> you can run without paging, etc and work on cpus that dont have a mmu, etc

08:25 <geist> it's one of the reasons it's easy to port LK to a bunch of different arches: mmu is optional

09:09 GeDaMo has joined #osdev

09:22 mahmutov has quit [Ping timeout: 260 seconds]

09:32 <mrvn> you don't even need mmap, just give every process 4GB heap and implement demand paging.

09:35 <mrvn> zid: aren't wire recorders more the precursor to vinyl?

09:35 Burgundy has joined #osdev

09:37 zid has quit [Ping timeout: 250 seconds]

09:45 Likorn has joined #osdev

10:03 wolfshappen has joined #osdev

10:27 wootehfoot has joined #osdev

11:37 No_File has quit [Quit: Client closed]

11:49 zid has joined #osdev

11:50 <zid> I did syscalls first but that's because I've got nothing useful to actually run so I just wanted to test I had set the descriptor tables and stuff up properly for them to work seeing as the code was in the repo :P

11:52 <mrvn> making an echo syscall is useful

11:52 <mrvn> or log

11:57 <kazinsal> yeah, "blast sev + message into syslog" is a good initial syscall

11:57 <kazinsal> lets you test both argument passing and make sure your userspace code for formatting strings etc works nicely

11:57 <cookie> hey, here once more to request someone tells me to "just fucking start it"

11:58 <cookie> i have all my ~~ducks~~ ideas in order, but starting is hard

11:58 <kazinsal> starting is hard

11:58 <kazinsal> I plan to eventually restart work this weekend! hopefully my undiagnosed unmedicated adhd brain can accomplish that

11:58 <kazinsal> (I say, for the nth weekend in a row)

11:59 <cookie> i did another project recently that happened to be a good test for rust and i don't think it'll work unfortunately

12:00 <cookie> (it was a very over engineered static site generator, and i've written enough rust to not fall in the common pitfalls, i just had a loooot of types and it got unsustainable to fix multi page type errors)

12:00 <kazinsal> my largest barrier to entry for rust is rust people

12:01 <cookie> the rrir crowd?

12:01 <j`ey> you mean the trolls that pretend to be rust people? :P

12:02 <cookie> the actual rust people i've talked to are okay-nice, though there's a fair amount of variance

12:02 <j`ey> as with any community

12:11 gog has joined #osdev

12:14 <zid> that's why you should trick yourself and not actually start

12:14 <zid> just prototype a bunch of pieces

12:14 <zid> and accidentally end up compiling them all together

12:15 <gog> mew

12:15 * kazinsal gives gog headpats

12:15 * gog prrs

12:15 * cookie scritch gog

12:16 * kazinsal scritchies

12:16 <gog> o:

12:16 <gog> my gsoc proposal was accepted now wtf do i do

12:16 <cookie> uh, code?

12:16 <gog> idk if i can work full time and do that

12:16 <kazinsal> learn go

12:16 <kazinsal> or whatever googlers are into these days

12:17 <gog> i'm kinda on a sigma grindset and i don't know if i can add anything to my grinding set

12:19 <cookie> zid: i like that

12:48 gog has quit [Ping timeout: 246 seconds]

12:54 <cookie> stupid printer issue came up

12:54 <cookie> android refuses to connect to CUPS because i'm not serving it with a TLS cert

12:54 <cookie> so now i have to add a lot of jank to serve a valid tls cert.. in my home network

13:04 blockhead has quit []

13:30 nyah has joined #osdev

14:23 dude12312414 has joined #osdev

14:26 dude12312414 has quit [Remote host closed the connection]

14:35 gog has joined #osdev

14:43 <zid> gog: what was the proposal?

14:43 <gog> code sharing for EDK core

14:44 <gog> every firmware image has a bunch of duplicated code and they want to not have that

14:44 <zid> Buy bigger hard drives, done, that'll be $8000 consultation fee.

14:45 <gog> true

14:52 doorzan has joined #osdev

14:56 ethrl has joined #osdev

15:06 heat has joined #osdev

15:07 <heat> congrats gog

15:13 tomaw has quit [Quit: Quitting]

15:20 tomaw has joined #osdev

15:37 mahmutov has joined #osdev

15:38 puck has quit [Excess Flood]

15:38 puck has joined #osdev

15:44 Teukka has quit [Read error: Connection reset by peer]

15:46 Teukka has joined #osdev

15:52 dude12312414 has joined #osdev

15:55 dude12312414 has quit [Client Quit]

15:59 freakazoid333 has joined #osdev

15:59 <freakazoid333> https://betanews.com/2022/05/20/hp-pop_os-linux-system76-dev-one/

15:59 <bslsk05> betanews.com: HP chooses Ubuntu-based Pop!_OS Linux for its upcoming Dev One laptop -- could System76 be an acquisition target?

16:04 No_File has joined #osdev

16:08 the_lanetly_052 has quit [Ping timeout: 276 seconds]

16:20 No_File has quit [Quit: Client closed]

16:27 Likorn has quit [Quit: WeeChat 3.4.1]

16:36 <mrvn> yeah, why change a distribution with long time security support? Lets pick something obscure that will disappear next year?

16:37 <mrvn> s/change/choose, what am I typing?

16:37 <heat> pop_os isn't obscure

16:37 <heat> but you're right, they should've used hp-ux

16:40 cookie is now known as ckie

16:41 <mrvn> 4 years old, better than I thought. But very much looks like made by a manufacturer to sell its own systems because the marketing division couldn't have something labeled Ubuntu on the system.

16:42 doorzan has quit [Remote host closed the connection]

16:42 <mrvn> I wish people would call these clones something else than an OS. It's more like a theme for Ubuntu.

16:43 <heat> it's not just a theme

16:45 <j`ey> they said they're going to write a new DE to replace GNOME

16:45 <mrvn> No, not just. But quite like it. How many extra packages does it add? Maybe 10? Some patched packages for a different look&feel? 99% is pure Ubuntu I bet.

16:46 <mrvn> A new DE to replace GNOME is like a new theme to an app.

16:46 <heat> no

16:46 <heat> it's new software

16:46 <heat> it's not "like a theme to an app"

16:46 <mrvn> so is a theme

16:46 <heat> GNOME is super complex

16:47 <mrvn> It's also super simple: It shows me icons and when I click them something starts.

16:47 <mrvn> New desktop. Oh now it's green background instead of blue. I still click at an icon and something starts.

16:48 <heat> you do realise GNOME isn't just the desktop right?

16:48 <mrvn> Might be a million lines of code that changed but it's still just a desktop.

16:50 <j`ey> so all OS's with a desktop are themes of ubuntu?

16:51 <mrvn> if it's based on Ubuntu then I would say so pretty much.

16:51 <heat> ubuntu is just a theme of debian

16:52 <mrvn> That's how it started. But they make a lot of their own packages now with their own security support, their own kernels, ...

16:53 <mrvn> That's where I would draw the line. When they have their own security support for packages not developed in-house then they've completed the fork to OS / distribution status.

16:55 <j`ey> but at the end of the day, it's still an OS heh

16:55 <heat> arch doesn't really have security support for packages

16:55 <heat> is it a theme?

16:55 <heat> also no custom kernels

16:57 <j`ey> well they still package it / have a custom config

16:57 <mrvn> no idea what arch is. A random collection of unmaintained binaries?

16:58 <heat> arch linux

16:58 <heat> which i do in fact use

16:58 vai has joined #osdev

16:59 <heat> j`ey, that's nothing compared to the patch fest redhat and canonical have on their kernels

16:59 <j`ey> true

17:00 <mrvn> I know hat arch linux is. I just don't know what to call it. It's not a "theme based on xyz". If it has no security support it would fail my "distribution" threshold.

17:00 <mrvn> +w

17:00 <heat> they're maintained

17:00 <heat> but there's no security support

17:00 <heat> no backporting of patches, etc

17:00 <j`ey> it's clearly a distro

17:00 <j`ey> (btw)

17:01 <mrvn> security support can mean updating to newer versions. backporting isn't needed.

17:01 <heat> your definition of distro is clearly skewed by the far more corp-y redhat and canonical distros

17:02 gog has quit [Ping timeout: 260 seconds]

17:03 <mrvn> heat: except my point is there are no canonical distros (except maybe a handful), they are just something like a theme is to an app but for distributions.

17:03 <heat> it's a distribution if you're distributing the packages

17:04 <heat> they distribute the packages from their own servers

17:04 <mrvn> so every Debian mirror is a distribution?

17:04 <j`ey> im not really sure what is gained by saying a certain linux-thing is not a distro

17:04 gog has joined #osdev

17:04 <heat> you're being deliberately obtuse

17:05 <mrvn> I just wish for a better name that reflects the difference between maintaining thousands of packages and just adding 10 packages to someone elses work.

17:06 <j`ey> remix

17:07 <heat> I just wish for a better name that reflects the difference between remembering to update a package/filing bugs and actually reviewing code

17:08 <heat> :)

17:12 Likorn has joined #osdev

17:13 <mrvn> j`ey: remix is a good word for it. Now make everyone else use it please.

17:17 troseman has joined #osdev

17:23 ethrl has quit [Quit: Textual IRC Client: www.textualapp.com]

17:23 terminalpusher has joined #osdev

17:42 No_File has joined #osdev

17:54 eryjus has joined #osdev

18:00 No_File has quit [Quit: Client closed]

18:02 <dzwdz1> is it safe to assume that (on x86) PUSHAL will never push more than 32 bytes?

18:02 dzwdz1 is now known as dzwdz

18:04 <mrvn> it will push as many as specified in the docs.
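
[Editor's sketch] For 32-bit code with the default operand size, the documented behaviour is that PUSHAD (PUSHAL in AT&T syntax) stores eight 32-bit registers, so 32 bytes. A struct like the one below captures that layout and lets the compiler check the size. The struct name is illustrative and not from dzwdz's code.

```c
#include <stddef.h>
#include <stdint.h>

/* Registers as PUSHAD leaves them in memory, lowest address first.
 * PUSHAD pushes EAX, ECX, EDX, EBX, the original ESP, EBP, ESI, EDI
 * in that order, so EDI ends up at the lowest address. */
struct pushad_frame {
    uint32_t edi;
    uint32_t esi;
    uint32_t ebp;
    uint32_t esp;   /* value of ESP before the instruction executed */
    uint32_t ebx;
    uint32_t edx;
    uint32_t ecx;
    uint32_t eax;
};

/* Fails to compile if the layout ever stops being 8 x 4 bytes. */
_Static_assert(sizeof(struct pushad_frame) == 32,
               "PUSHAD stores eight 32-bit registers");
_Static_assert(offsetof(struct pushad_frame, eax) == 28,
               "EAX is pushed first, so it sits at the highest offset");
```

An interrupt stub that starts with PUSHAD can then pass the stack pointer to C as a pointer to this struct. The 16-bit form (PUSHA with a 16-bit operand size) pushes 16 bytes instead, which this struct does not describe.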

18:09 <mrvn> isn't pusha avoided by everyone because it's faster to push register one by one?

18:09 xenos1984 has quit [Read error: Connection reset by peer]

18:09 sortie has quit [Quit: Leaving]

18:09 gog has quit [Ping timeout: 260 seconds]

18:09 <dzwdz> huh, til

18:10 <dzwdz> i think it still makes sense for my use case, though

18:10 sortie has joined #osdev

18:12 <GeDaMo> No pusha on 64-bit

18:12 <dzwdz> my os is 32-bit

18:13 <dzwdz> which might be a bit short-sighted but oh well

18:16 <mrvn> yeah, doing x86 32bit really is a bit *ARGS*

18:16 <zid> It's just kinda.. overly complicated and not very useful

18:17 <zid> long mode is simpler and better

18:17 <dzwdz> wait, are you saying x64 osdev is easier?

18:17 <dzwdz> i kinda assumed otherwise

18:18 <j`ey> yeah, just switch to 64 bit, no reason not to!

18:18 jafarlihi has joined #osdev

18:18 <jafarlihi> Can someone please tell me what is the purpose of if-else statement here? https://gitlab.com/sortie/meaty-skeleton/-/blob/master/libc/string/memmove.c

18:18 <bslsk05> gitlab.com: libc/string/memmove.c · master · Jonas Termansen / Meaty Skeleton · GitLab

18:18 <sortie> Hey jafarlihi!

18:18 <heat> jafarlihi, memmove handles overlapping

18:19 <sortie> jafarlihi, the interesting case is when the source and destination memory overlaps, in which case you don't want to lose the original data, so you need to be careful. memcpy(3) has undefined behavior in that case (it can assume the memory does NOT overlap), but memmove(3) has to handle that case per the standard :)

18:20 <sortie> The easy trick I went with here is to simply copy backwards

18:20 <sortie> Inefficient to only do byte copies though

18:20 <sortie> But hey it's an initial skeleton, it's meant to be obviously correct and simple :)

18:21 <jafarlihi> How does copying backwards matter for overlapping?

18:22 <GeDaMo> [---src---]

18:22 <GeDaMo> [--dest---]

18:22 <mrvn> Line 6 should check if src + size < dst (with overflow protection)

18:22 <sortie> Think about it for a bit :) I imagine a source buffer that is before the destination buffer, and they overlap, so you copy one byte at a time forward, and then you overwrite the data you were supposed to copy, so you have to do it backwards which works in that case

18:22 <GeDaMo> If you copy from the beginning of src to the beginning of dest, you would overwrite the middle of src

18:23 <jafarlihi> Oh, I get it now. Thanks!

18:23 <GeDaMo> But copying from the end of src to the end of dest is safe

18:23 <zid> which is why memcpy on overlapping regions is ill advised

18:23 <zid> you don't know if it copies forwards or backwards

18:23 <zid> hence memmove
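
[Editor's sketch] The forwards-or-backwards choice explained above can be written as a byte-wise routine like the one below. This is a sketch in the spirit of the linked skeleton code, not a copy of it. It is named sketch_memmove to avoid clashing with libc, and it compares addresses through uintptr_t, which assumes a flat address space.

```c
#include <stddef.h>
#include <stdint.h>

/* Byte-wise memmove sketch: picks the copy direction so that source
 * bytes are always read before they can be overwritten. */
void *sketch_memmove(void *dstptr, const void *srcptr, size_t size)
{
    unsigned char *dst = dstptr;
    const unsigned char *src = srcptr;

    if ((uintptr_t)dst < (uintptr_t)src) {
        /* dst starts below src: copying front to back is safe. */
        for (size_t i = 0; i < size; i++)
            dst[i] = src[i];
    } else {
        /* dst starts at or above src: copy back to front. */
        for (size_t i = size; i != 0; i--)
            dst[i - 1] = src[i - 1];
    }
    return dstptr;
}
```

Moving 6 bytes of "abcdefgh" from offset 0 to offset 2 gives "ababcdef"; a naive forward copy of the same region would instead repeat "ab" across the buffer, which is the overwrite GeDaMo describes.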

18:23 <sortie> s/ill advised/undefined behavior that WILL blow up on your face/g :)

18:24 <mrvn> I would just have memcpy call memmove

18:25 <heat> adding needless branches 101

18:26 pretty_dumm_guy has joined #osdev

18:26 <mrvn> heat: you think that matters with that implementation of memmove?

18:26 xenos1984 has joined #osdev

18:27 <heat> *shrug*

18:27 <heat> memcpy is the thing you use 99% of the time

18:27 <heat> memmove isn't

18:28 <mrvn> if (dst < src | src + size < dst) should branch predict very well.

18:28 <mrvn> and in user space it can't overflow.

18:28 <heat> now, making memmove call memcpy would be a good idea

18:28 <sortie> memcpy implemented using memmove is fine

18:28 <sortie> Honestly all that matters is what performance you want, how actual machines perform, and how simple / complicated / large / small you want the code

18:29 <sortie> YMMV

18:29 <mrvn> if (dst < src | src + size < dst) memcpy(); else reverse_memcpy(); works too

18:30 <mrvn> Just stick an asm("memcpy:") label after the if. :)
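
[Editor's sketch] mrvn's dispatch test can be written so that the `src + size` addition never happens, which sidesteps the overflow concern raised earlier. This is a sketch assuming a flat address space where uintptr_t ordering matches pointer ordering; the function name is invented for this note. It treats buffers that merely touch (src + size == dst) as safe, since they do not overlap, whereas the strict `<` in the chat is slightly more conservative.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* True when a front-to-back byte copy of `size` bytes from src to dst
 * cannot overwrite source bytes it has not read yet. */
static bool forward_copy_is_safe(const void *dst, const void *src, size_t size)
{
    uintptr_t d = (uintptr_t)dst;
    uintptr_t s = (uintptr_t)src;

    if (d <= s)
        return true;       /* destination at or below source */
    return d - s >= size;  /* same as src + size <= dst, minus the addition */
}
```

A memmove built on this would take the plain forward path when the predicate holds and fall back to a reverse copy otherwise.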

18:39 terminalpusher has quit [Remote host closed the connection]

18:40 terminalpusher has joined #osdev

18:40 terminalpusher has quit [Remote host closed the connection]

18:40 <heat> in this edition of dense musl code: https://github.com/heatd/Onyx/blob/master/musl/src/string/memmove.c

18:40 <bslsk05> github.com: Onyx/memmove.c at master · heatd/Onyx · GitHub

18:41 <j`ey> hm: while (n) n--, d[n] = s[n];

18:42 <geist> cookie: oh hey cookie

18:42 <geist> aww they left

18:42 <geist> was going to tell them to just start it

18:42 <heat> you mean ckie?

18:43 <Ameisen> Is there a way to determine if a CPU will downclock itself when using AVX2 or AVX512

18:43 <geist> Ameisen: it's probably fair to assume it will, eventually, because of heat?

18:43 <heat> what did I do

18:43 <ckie> geist: i is cookie

18:44 <Ameisen> geist: possibly, but a number of CPUs _explicitly_ downclock with it. Like skylakes.

18:44 <geist> ckie: oh was just going to say start it

18:44 <ckie> (the other nick gets a looot of mentions)

18:44 <Ameisen> if they detect avx2/512 instructions, they downclock

18:44 <ckie> geist: snooze, few hours, attempting repair of stereo system

18:44 <ckie> but thanks :P

18:44 <geist> sure. and also similarly it was well known that some xeons would take some amount of time to 'spin up' the AVX512 engine

18:44 <heat> 12:01 <j`ey> you mean the trolls that pretend to be rust people? :P

18:44 <heat> i feel attacked

18:44 <geist> ie, actually turn on that part of the silicon

18:45 <geist> i have no idea if that's still the case

18:45 <j`ey> heat: ;)

18:45 <geist> Ameisen: possible there's a cpuid bit somewhere in the fairly dense cpuid bits concerning AVX features that says something like this?

18:46 <heat> the cpu needs some time to import the avx512 crate man

18:46 <j`ey> :<

18:46 vdamewood has quit [Quit: Life beckons]

18:48 <heat> https://github.com/heatd/Onyx/blob/master/musl/src/string/memcpy.c

18:48 <bslsk05> github.com: Onyx/memcpy.c at master · heatd/Onyx · GitHub

18:49 <Ameisen> geist: I don't remember seeing one, but it's possible that there's one somewhere.

18:49 <geist> but if there were it'd probably be something like the delayed spin up thing i was talking about

18:49 <geist> downclocking i think is more implicit, due to TDP

18:50 <heat> a quick ctrl+f on wikipedia says no, there's no such bit

18:51 <geist> yah was thinking it'd be a similar bit to ERMS: basically warning so you can tune your code

18:55 <heat> it may be one of those details that compilers just know about

18:56 <geist> it's also possibly a general notion that if you're using avx512 it's probably for some sort of long running calculation so you generally don't just use it for one off things

18:57 <geist> https://travisdowns.github.io/blog/2020/01/17/avxfreq1.html looks like an interesting page

18:57 <bslsk05> travisdowns.github.io: Gathering Intel on Intel AVX-512 Transitions | Performance Matters

18:58 <heat> yup

18:58 <heat> llvm-mca could probably tell you more if you feed it some code

19:02 <geist> this is a really interesting blog as i get into reading it. it seems to directly assert that the instant you run one of the 512s (or even the 256s) it instantly starts to transition to a lower clock while spooling up the 512 units

19:02 <geist> at least on that class of skylake he's testing on

19:02 <geist> dunno what the current state of the art is on modern cores

19:05 <geist> the interesting thing is its actually throttling the dispatch of all ALU ops while it's waiting for the voltage regulator to stabilize, so it's actually running the instruction but it just runs at a reduced rate

19:08 vai has quit [Ping timeout: 272 seconds]

19:12 <jafarlihi> Is there any resource out there for implementing terminal scrolling?

19:12 <heat> no

19:13 <heat> it's just something you need to do

19:13 <heat> imagine, you need to scroll things up

19:13 <heat> so move every line upwards, discard the top one and empty out the bottom one

19:15 <geist> right. it may be possible to use hardware scrolling in some very specific situations, but that also usually involves eventually running out of framebuffer and needing to reset to the top

19:15 <geist> depends on hardware, etc etc. but it's an optimization

19:15 <clever> ive also done this just recently

19:16 <clever> in the case of the rpi, you can specify multiple bitmap/w/h/x/y pairs to display

19:16 <clever> so i never run out and have to reset to the top of the bitmap

19:16 <heat> geist, right, this is text mode

19:16 <clever> so i treat the bitmap like one big circular buffer, and then i just split it at the wrap point, and tell the hw to render the 2 halves
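
The split clever describes can be sketched as below, assuming a bitmap of `total_lines` scanlines in which `top_line` is the line currently shown at the top of the screen. `struct scanout_span` and `split_ring` are hypothetical names for illustration, not the rpi display API.

```c
#include <assert.h>
#include <stddef.h>

/* One rectangle handed to the display hardware (hypothetical layout). */
struct scanout_span {
    size_t first_line;  /* first scanline inside the bitmap */
    size_t line_count;  /* number of scanlines to show */
    size_t screen_y;    /* where the span starts on screen */
};

/* Split a circular bitmap at its wrap point into two on-screen spans. */
void split_ring(size_t total_lines, size_t top_line, struct scanout_span out[2])
{
    assert(total_lines > 0 && top_line < total_lines);

    /* From the current top of the ring to the physical end of the bitmap. */
    out[0].first_line = top_line;
    out[0].line_count = total_lines - top_line;
    out[0].screen_y = 0;

    /* The wrapped part: from the physical start up to the current top. */
    out[1].first_line = 0;
    out[1].line_count = top_line;
    out[1].screen_y = total_lines - top_line;
}
```

Scrolling by one text row then means advancing `top_line` by the glyph height (modulo `total_lines`), clearing the rows that just wrapped, and handing the two spans to the hardware again.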

19:16 <heat> specifically the "exercise is left to the reader" in the meaty skeleton page

19:17 <geist> ah that's odd, seems like writing a little routine that uses a memcpy + memset would be easy enough to leave in

19:17 <geist> since that's basically what it boils down to

19:17 <geist> copy everything up a line, clear the last line
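
A minimal sketch of "copy everything up a line, clear the last line", assuming VGA-style 16-bit text cells (character in the low byte, attribute in the high byte) held in an ordinary RAM buffer. `scroll_up_one_line` and `BLANK_CELL` are hypothetical names. Real video memory would be accessed through a `volatile` pointer with an explicit loop, so treat this as the shape of the routine, not a driver.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Space character with attribute 0x07 (light grey on black). */
#define BLANK_CELL ((uint16_t)((0x07u << 8) | ' '))

/* Scroll a rows x cols grid of text cells up by one line. */
void scroll_up_one_line(uint16_t *cells, size_t rows, size_t cols)
{
    assert(cells != NULL && rows > 0 && cols > 0);

    /* Source and destination overlap, so memmove rather than memcpy. */
    memmove(cells, cells + cols, (rows - 1) * cols * sizeof *cells);

    /* memset cannot write a 16-bit pattern, so clear the last line by hand. */
    for (size_t x = 0; x < cols; x++)
        cells[(rows - 1) * cols + x] = BLANK_CELL;
}
```

For an 80x25 screen the call would be `scroll_up_one_line(buffer, 25, 80)`, followed by copying the buffer out to the display if it is an off-screen copy.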

19:18 <heat> you can't use those, it's video memory

19:18 <heat> but yeah, it's like a "let's see if you can do more than copying code from the wiki"

19:18 <geist> sure you can, it's just not great

19:19 <geist> it's a good starting point, then you optimize (by keeping off screen buffer and dirty lines, etc) later

19:19 <heat> not if it's virtualized

19:19 * geist shrugs

19:19 <heat> (well, you can try but things may crash)

19:19 <geist> if they crash your VM is broken

19:20 <heat> i've crashed qemu before when doing a popcnt on ahci mmio memory

19:20 <geist> right, using exotic instructions on mmio may do it

19:20 <geist> i've personally found bugs with HVF with that

19:21 jafarlihi has quit [Quit: WeeChat 3.5]

19:21 <geist> anyway, it's a general observation that you should avoid reading from the framebuffer, but it's not *illegal*

19:21 <geist> well they left anyway

19:22 <heat> yup

19:23 <heat> possibly not in text mode though

19:23 <heat> a much smaller buffer

19:25 <mrvn> geist: memmove, the src+dst overlap

19:25 <geist> yah my experience is that text mode buffers are small enough that the slowdown isn't bad

19:25 <geist> mrvn: indeed, but not in the direction that matters, though you're strictly right

19:26 <mrvn> With a framebuffer on old hardware it's also worth checking if the char in one line is the same as the char in the next line and then not move that part.

19:26 <geist> ie a sane memcpy with a stride less than a single line of the text mode would still work fine if it's copying lower to high

19:26 X-Scale has quit [Ping timeout: 240 seconds]

19:26 X-Scale` has joined #osdev

19:27 <mrvn> geist: scrolling down could be a problem, scrolling up the size you move at once shouldn't matter.

19:27 X-Scale` is now known as X-Scale

19:27 <geist> yah, tis true. i'm assuming scrolling up

19:27 <mrvn> if the stride is larger than the width you've already read the data you overwrite. no harm done.

19:30 <mrvn> Does scrolling the console on x86_64 turn into a single rep movs?

19:30 <geist> probably + a clear at the end for the new line

19:32 <mrvn> just copy an extra 80 bytes that you memset to ' ' at boot.

19:32 <geist> i suppose you could store just off the end of the framebuffer a blank line of ' '

19:32 <geist> and then do a memcpy that reaches off the end

19:32 <geist> scrolls one extra line back

19:33 <mrvn> Soon you want to have a scroll back buffer though and then it's just copying blocks from the ring buffer to the screen.

19:33 <geist> especially since clearing a line on vga text mode is really 0x2000 (or 0x0020 i forget)

19:33 <geist> or something like that

19:33 <mrvn> or framebuffer and it all changes

19:36 <clever> geist: and if you were doing scrollback in LK's gfxconsole, would you maintain the scrollback as text or graphics?

19:36 <geist> text

19:37 <clever> text would need more complex cpu and re-rendering, while graphics needs more ram and is just a pure memcpy bandwidth job

19:37 wootehfoot has quit [Quit: Leaving]

19:37 <clever> and then there is the question of the scrollback just being flat char[80]'s or if its just a mess of "foo\nbar\n" and you have to re-parse it to find the lines

19:38 <geist> though actually no that's not what it does: https://github.com/littlekernel/lk/blob/master/lib/gfxconsole/gfxconsole.c#L86

19:38 <bslsk05> github.com: lk/gfxconsole.c at master · littlekernel/lk · GitHub

19:38 <geist> it errs on the side of assuming the hardware knows how to copyrect faster than you do

19:38 <geist> but that's one of those could have an either or path depending on different parameters of the surface

19:38 <clever> and thats assuming the copyrect of the surface was populated?

19:38 <clever> in my case, it was just a memcpy via the default copyrect

19:39 <geist> i think it always is, and the fallback is a memcpy

19:39 <clever> but the 2d mode in the dma block can do the same job, i need to play with that

19:39 <geist> there are all sorts of possibilities of optimization in the lib/gfx stuff. it's designed to just work and not be particularly complex

19:39 <geist> but yes you can create custom surfaces with hooks that do better stuff

19:39 <clever> my understanding is that dma in 2d mode, will copy N bytes, but increment by stride, and repeat M times

19:40 <clever> oh right, there is a bug ive ran into a number of times

19:40 <clever> the cache flush function in the surface, isnt initialized

19:41 <clever> so the gfx_flush in there, calls a random pointer

19:41 <clever> https://github.com/littlekernel/lk/blob/master/lib/gfx/gfx.c#L516

19:41 <bslsk05> github.com: lk/gfx.c at master · littlekernel/lk · GitHub

19:42 <clever> this line randomly crashes the system

19:42 <geist> patches welcome

19:42 <geist> would initialize it to zero and then in the wrapper function test for null

19:43 <clever> its already testing for null

19:43 <clever> it just isnt initialized

19:43 <clever> i'll see about filing a PR for that tonight

19:43 <geist> change the malloc to a calloc then
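
The failure mode and the calloc fix can be sketched as follows. `struct surface`, `surface_create` and `surface_flush` are hypothetical stand-ins, not LK's actual `gfx_surface` code. The sketch relies on all-bits-zero being a null function pointer, which holds on mainstream ABIs but is not promised by the C standard.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical surface with an optional cache-flush hook. */
struct surface {
    unsigned width, height;
    void (*flush)(unsigned starty, unsigned endy);  /* may stay unset */
};

struct surface *surface_create(unsigned width, unsigned height)
{
    /* calloc instead of malloc: any hook we forget to assign reads as NULL
     * (on mainstream ABIs) instead of whatever the heap last held. */
    struct surface *s = calloc(1, sizeof *s);
    if (!s)
        return NULL;
    s->width = width;
    s->height = height;
    /* flush is deliberately not assigned here, mirroring the reported bug. */
    return s;
}

void surface_flush(struct surface *s)
{
    assert(s != NULL && s->height > 0);
    /* The NULL test is only meaningful if the field was initialized. */
    if (s->flush)
        s->flush(0, s->height - 1);
}
```

With `malloc` the same `if (s->flush)` test passes or fails depending on stale heap contents, which matches the random crash clever reports.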

19:43 <mrvn> clever: I like having 64k text scrollback buffer or so. Doing it as gfx would cost too much ram.

19:43 <geist> i'm much better about zero initializing things nowadays due to doing that a lot in zircon

19:44 <geist> especially with C++ where you can easily zero initialize

19:44 <mrvn> I really like that you can write "Foo *bla{CAFEBABE};" for class members now.

19:45 <clever> geist: ehhh, every single byte in the surface (except flush) is written to in create_surface, so calloc feels like a waste of bandwidth

19:45 <mrvn> geist: Have you tried out the new compiler options to initialize (zero fill) padding in structs?

19:45 <geist> yeah, it's one of those compromises for future expansion

19:45 <mrvn> and stack

19:45 <geist> mrvn: yes we use it in zircon kernel, much to the detriment of performance

19:46 <geist> but all is not lost, there's an attribute you can use judiciously to turn it off in particular situations

19:46 <mrvn> You have that much padding that it is noticeable?

19:46 <geist> we actually measured a fair amount of performance loss in some benchmarks as a result of the compiler folks just turning it on one day and not telling us

19:46 * geist grumbles

19:46 <geist> mostly little temporary objects on the stack that now suddenly go through a more complicated setup

19:47 <geist> objects with buffers in them etc

19:47 <heat> linux also uses it

19:47 <mrvn> well, buffers are not padding. Sounds more like a force zero fill of everything than just padding.

19:47 <geist> for example, we have a neat little object that you pass around when doing mmu operations that tracks up to N pending invlpgs so you can queue up some pending TLB invalidations and then flush at the end

19:48 <geist> so it ends up with an array of uintptr_ts and a count

19:48 <mrvn> so basically a vector with fixed capacity

19:48 <geist> it's intended to be very fast to initialize, since we create one preemptively before diving into the mmu code for any reason

19:48 <geist> but now with the zero fill its dumping down 300 bytes or so on the stack every time

19:49 <geist> so that's a good case where it actually shows up in benchmarks. we dont need it zero filled, because the inner array is totally safe because of the counter (which is constructed with 0)
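
A rough sketch of the pattern geist describes: a fixed-capacity batch whose array is only ever read below `count`, so only the counter needs initializing. All names (`struct pending_tlb`, `PENDING_MAX`, `flush_page`, `unmap_range_sketch`) are hypothetical and this is not zircon's code. The opt-out uses the `uninitialized` variable attribute that Clang and recent GCC accept alongside `-ftrivial-auto-var-init`; whether that is the attribute zircon uses is not established in the log.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Opt a local out of automatic stack initialization where supported. */
#if defined(__has_attribute)
#  if __has_attribute(uninitialized)
#    define NO_AUTO_INIT __attribute__((uninitialized))
#  endif
#endif
#ifndef NO_AUTO_INIT
#  define NO_AUTO_INIT
#endif

#define PENDING_MAX 32  /* hypothetical capacity */

struct pending_tlb {
    size_t count;                 /* number of valid entries in addr[] */
    uintptr_t addr[PENDING_MAX];  /* only addr[0..count) is ever read */
};

static size_t pages_flushed;  /* stand-in for the real invlpg / TLBI */
static void flush_page(uintptr_t va) { (void)va; pages_flushed++; }

void pending_flush(struct pending_tlb *p)
{
    assert(p->count <= PENDING_MAX);
    for (size_t i = 0; i < p->count; i++)
        flush_page(p->addr[i]);
    p->count = 0;
}

void pending_add(struct pending_tlb *p, uintptr_t va)
{
    if (p->count == PENDING_MAX)  /* batch full: drain it first */
        pending_flush(p);
    p->addr[p->count++] = va;
}

/* The outer function owns the batch and passes it down. */
void unmap_range_sketch(uintptr_t base, size_t pages)
{
    struct pending_tlb batch NO_AUTO_INIT;
    batch.count = 0;  /* the only field that must be set up front */
    for (size_t i = 0; i < pages; i++)
        pending_add(&batch, base + i * 4096u);
    pending_flush(&batch);
}
```

Without the attribute, a build with automatic variable initialization fills the whole array on every call, which is the cost geist measured.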

19:49 <mrvn> alloca() it?

19:50 <geist> a) we absolutely forbid all forms of local stack allocations and b) the point is it doesn't know the size so it allocates a thing up front in the wrapping function

19:50 <geist> that's the point, you create it in the outer stack and pass it into the inner functions so it can accumulate a list of pages to flush

19:50 <mrvn> sometimes I wish for "Bla * foo[[uninitialized]]" for function arguments to signal the function will initialize the data.

19:50 <geist> there are a few patterns like that in the kernel that

19:51 <geist> but again there's an attribute you can put on things that tells the compiler to not do the zero fill, so all is not lost

19:51 <mrvn> do you happen to remember what it's called?

19:51 <geist> and actually we dont do a zero fill, it does some sort of pattern fill. with a pattern that's designed to trigger exceptions if you deref it etc. i forget it

19:51 <geist> mrvn: i think it's uninitialized or something like that

19:51 <geist> lemme see if i can find it

19:53 <geist> hmm, actually we *dont* mark it as uninitialized: https://fuchsia.googlesource.com/fuchsia/+/refs/heads/main/zircon/kernel/arch/arm64/mmu.cc#1374

19:53 <bslsk05> fuchsia.googlesource.com: zircon/kernel/arch/arm64/mmu.cc - fuchsia - Git at Google

19:53 <geist> i should actually look into that next week. we did it elsewhere, maybe this was a missed case

19:53 terminalpusher has joined #osdev

19:57 <mrvn> geist: careful though. I believe the initialize everything was designed as a security mitigation so previous data from the stack doesn't leak. Don't compromise safety for speed. :)

19:58 <geist> right! SECURITY > *

19:58 <geist> but yeah basically my experience is generally initializing everything in the object is a good idea except in cases where you can prove there's no need

19:59 <geist> and it's a performance thing, so it's usually objects with arrays in them where some other variable controls access to the array

19:59 <geist> i am usually okay with leaving out initialization for that

20:02 <geist> mrvn: anyway i cant find it offhand, but it's something like uninitialized or unused or one of those attributes

20:02 <geist> easy enough to figure it out in godbolt

20:02 <mrvn> Maybe this should be solved from the other side. Mark everything security relevant and wipe the stack when leaving the function instead of initializing

20:03 <geist> we do also use safe stack and/or shadow call stack in the kernel

20:03 <geist> which is kinda a pain, since each thread now has two stacks, but it gives you a lot of that benefit

20:03 <mrvn> shadow stack is where you place return addresses on one stack and stack frames on another one, right?

20:03 <geist> yah i dont think it's implemented on x86, but we use it on arm

20:03 <geist> basically it's an upwards growing stack of 8 byte values, simply the return addresses

20:04 <mrvn> is it predictable where the two are?

20:04 <geist> x18 points to it at all times

20:04 <geist> no. we randomly allocate them in the kernel

20:04 <geist> x86 it's harder to do something like that, but x86 has the whole safe stack/regular stack thing, and the return addresses are on the safe stack

20:05 <mrvn> yeah, can't push/pop anymore with shadow stack

20:05 <mrvn> unless you swap the SP between the two I guess

20:06 <geist> right. off the top of my head the idea is you leave RSP looking at the safe stack, which still has a traditional stack frame, but you also use TLS to store an unsafe stack where you put any locals that have any possibility of escaping

20:06 <geist> so the compiler knows what locals have no pointers to them, etc and can still put them on the regular safe stack

20:06 <mrvn> doesn't rust have this implicitly since the stack frame is on the heap? Or am I confusing something there?

20:06 <heat> the safe stack is the stack where you put big vulnerable stuff right?

20:06 <geist> but anything you get a pointer to goes on the unsafe stack

20:07 <heat> or yeah the unsafe stack

20:07 <heat> ah

20:07 <geist> or has any sort of possibility of you doing an overflow, etc

20:07 <geist> arrays of things, etc

20:07 <mrvn> every buffer needs to be on the unsafe stack so over/underflows can't change the return address

20:07 <geist> it'll still registers on the safe stack too, so even if you could overflow on the unsafe stack you can't trash the register/return state

20:08 <geist> s/still/spill

20:08 <mrvn> yeah, anything not accessed with pointer arithmetic should be safe.

20:09 <geist> iirc the shadow call stack is basically superior in the sense that it's simpler and avoids the ROP exploits, but is only really useful on architectures where there's not any real cost to it (ie, risc machines that return from functions via register indirection)

20:09 <mrvn> just an odd though: can you prefix push/pop with %fs or %gs?

20:09 <mrvn> +t

20:09 <mrvn> 21:58 < geist> right! SECURITY > *

20:10 <geist> yah so it's a more freebie one

20:10 <geist> but only on arches where it's a freebie

20:10 <mrvn> isn't push/pop on x86 extra fast? accessing a second stack is slower, right?

20:10 <geist> probably

20:11 <geist> SECURITY > *!

20:11 <geist> just repeat that every time your silly reptile brain starts to consider performance as a consideration

20:11 <geist> otherwise the beatings will recommence

20:12 <mrvn> geist: my whole kernel is designed kind of like that. KISS, must work > fast

20:12 * geist gets out the taser

20:13 <geist> i guess that's what i get for working at google for 10 years. hardware is free! security > *!

20:14 <heat> no, you get p r e b u i l t s

20:14 <heat> :P

20:14 <geist> (i'm just being snarky, obviously security is important, and its part of the tension of engineering to work with multiple constraints)

20:15 <mrvn> My experience is that improving your algorithm will gain you so much more speed than any little security or unoptimized loops or such will cost.

20:15 <mrvn> Saving 5% on memcpy can't beat not calling memcpy at all etc.

20:15 <geist> heat: actually was reading something that i had never considered before: idea is that all tools should be rebuilt at least every say 6 weeks. reasoning is it wont be able to pick up new compiler features, etc without that

20:15 <geist> ie, leaving old tools around built last year is another path for security sploits, etc

20:15 <geist> and kinda makes sense, i had never considered it before

20:16 <mrvn> geist: it also tests the compiler for bugs and makes sure the source still compiles and conforms to modern syntax.

20:16 <geist> vs conventional wisdom of finding a solid release of some thing and sticking with it and then roll when new release is out, features, etc

20:16 <heat> everyone updating tools is nice but it only works if most people are on the same page (i.e same company)

20:16 <geist> oh 100%

20:16 <GeDaMo> Don't forget to recompile the compiler :P

20:16 <mrvn> Try compiling a 6 year old c++ source with todays clang

20:16 <geist> GeDaMo: absolutely

20:17 <heat> tianocore still supports like VS2010 and GCC 4.8

20:17 <mrvn> And when you have to fix a security bug is not the time to try to update sources to the current compiler.

20:17 <geist> but i think it's even further. it also means *all* libraries and all applications you use internally for your company/etc should be constantly rebuilt

20:17 <geist> and something older than N units of time is a cause for alarm

20:17 <mrvn> geist: I would consider that the test suite for every (major) compiler update.

20:18 <geist> and this is versus the notion of only rebuilding something if the source changes

20:18 <geist> anyway, a thing i hadn't really thought about, but i'm just a low level kernel person. i dont think about those things much

20:18 <mrvn> Some years back there was a group that would rebuild all of Debian every month.

20:19 <geist> so here's a completely unrelated x86 question

20:19 <geist> if i were to say breadboard up a 386 or whatnot, would it be possible to wire up the address space such that there is *no* ram below 1MB

20:19 <geist> idea being that you put some rom at the start address (0xfff0... something)

20:19 <GeDaMo> Interrupt vectors?

20:20 <geist> and the first thing it does is immediately bounce into protected mode

20:20 <geist> so question is can you get to protected mode with no ram and no cache

20:20 <mrvn> geist: and go straight to 32bit mode in the bios? Or do you have an UEFI for that?

20:20 <geist> i think so, since you can pre-can a GDT in the rom and then load it

20:20 <geist> mrvn: right. but not a regular bios. i'm saying something you build yourself, not attempting to make a PC clone

20:21 <geist> and it'd be simpler if you say started RAM at some higher address and just left < 1MB to rom or whatnot

20:21 <mrvn> don't see why you can't constexpr the whole boot process up to starting your kernel in long mode and make that your bios image.

20:21 <geist> so question is can you write code that uses no ram at all and gets to protected mode. i think so

20:21 <geist> again. you're missing the point

20:21 <geist> did you read the problem statement?

20:21 <mrvn> hardcoded GDT, page tables, ... for 32bit and 64bit

20:21 <geist> no. i said 386

20:22 <geist> like literally an 80386

20:22 <mrvn> ups, drop the 64bit. The answer remains though. nothing needs ram there.

20:22 <geist> yes but really? i'm worried something implicitly needs a stack or whatnot

20:22 <geist> since you'd have to operate in a few instructions that literally only have the registers or read only memory to operate

20:23 <mrvn> geist: only thing that needs stack would be building page tables dynamically with some recursive function.

20:23 <geist> yah if you wanted to get to 64bit you'd have to at least pre-can a few page tables in the rom

20:23 bxh7 has quit [Quit: ZNC 1.8.2 - https://znc.in]

20:23 <geist> which is a bummer, since it'd use up a few pages, but so it goes

20:24 <mrvn> recursive page table. only needs 1 page.

20:24 <geist> but once you're in protected mode you can run >1MB and then you have ram

20:24 <mrvn> And the parts you don't use you can put other stuff in.

20:24 <geist> the idea is if you're wiring up your own x86 you have no need to reproduce any of the 640k memory hole or bios or whatnot

20:25 <mrvn> Did 386 have 2MB pages?

20:25 <geist> and i'd just re-layout memory to do something similar to arm64 qemu or whatnot: put hardware in low addresses and start ram somewhere higher

20:25 <geist> no not at all. large pages came at least 10 years later

20:25 <mrvn> what about graphics memory? Your own gfx card with dedicated ram?

20:25 <geist> of course

20:25 <mrvn> So that could still be at A000/B000

20:25 <geist> no. the point is not to do that

20:26 <mrvn> I thought hardware in low addresses

20:26 <geist> the whole point is you are breadboarding something that has *no* backwards compatibility

20:26 <geist> well i'm thinking low addresses as in say < 1GB

20:26 <geist> no need to cram it low there. can really leave say a gigantic run of memory for framebuffer. no reason to think so small

20:26 psykose has quit [Remote host closed the connection]

20:27 <mrvn> Just don't put the ram at 2GB. that makes going higher half more complicated.

20:27 <mrvn> start of the ram

20:27 psykose has joined #osdev

20:27 <geist> anyway i think the answer is yeah, the only thing it'd really need is a pre-canned GDT and an LGDT pointer in rom to get up to protected mode
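
A sketch of what the pre-canned table could contain, assuming a flat 4 GiB code segment and data segment. `gdt_entry`, `rom_gdt`, `struct gdt_ptr` and `make_gdt_ptr` are hypothetical names, the packed attribute assumes GCC or Clang, and the ROM address is whatever the board design puts there. The helper is meant to run on the build host to generate or check the constants; the ROM itself only holds the resulting bytes.

```c
#include <assert.h>
#include <stdint.h>

/* Pack base, limit, access byte and flags nibble into an 8-byte descriptor. */
uint64_t gdt_entry(uint32_t base, uint32_t limit, uint8_t access, uint8_t flags)
{
    assert(limit <= 0xFFFFFu && flags <= 0xFu);
    uint64_t d = 0;
    d |= (uint64_t)(limit & 0xFFFFu);             /* limit bits 15:0  */
    d |= (uint64_t)(base & 0xFFFFFFu) << 16;      /* base bits 23:0   */
    d |= (uint64_t)access << 40;                  /* type, S, DPL, P  */
    d |= (uint64_t)((limit >> 16) & 0xFu) << 48;  /* limit bits 19:16 */
    d |= (uint64_t)(flags & 0xFu) << 52;          /* AVL, L, D/B, G   */
    d |= (uint64_t)(base >> 24) << 56;            /* base bits 31:24  */
    return d;
}

/* Null, flat 32-bit code, flat 32-bit data: constants suitable for ROM. */
static const uint64_t rom_gdt[3] = {
    0x0000000000000000ull,
    0x00CF9A000000FFFFull,  /* base 0, limit 0xFFFFF, 4K granular, code */
    0x00CF92000000FFFFull,  /* base 0, limit 0xFFFFF, 4K granular, data */
};

/* The 6-byte operand LGDT reads: 16-bit limit, then 32-bit linear base. */
struct gdt_ptr {
    uint16_t limit;
    uint32_t base;
} __attribute__((packed));

/* rom_gdt_addr: linear address of rom_gdt in the ROM (board specific). */
struct gdt_ptr make_gdt_ptr(uint32_t rom_gdt_addr)
{
    struct gdt_ptr p = { (uint16_t)(sizeof rom_gdt - 1), rom_gdt_addr };
    return p;
}
```

From real mode the remaining steps are an `lgdt` on that operand, setting CR0.PE, and a far jump to the code selector (0x08 here). None of those need a stack or writable memory.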

20:27 <geist> and then you can use ram

20:28 <mrvn> nod

20:28 <geist> idea is a 386 is pretty easy to breadboard

20:28 <heat> geist, offtopic but I genuinely don't know if current chipsets can use those tables if they're in ROM

20:28 <mrvn> Are there any 386 clones that can power up into 32bit mode?

20:28 <geist> and nothing really there aside from the cpu starting in real mode has any real PC legacy

20:29 <mrvn> Another stray thought: Have you ever put a 386 into an FPGA?

20:29 <geist> could

20:29 <geist> assuming something like that exists and intel hasnt squashed it

20:30 <mrvn> should make it simple to modify it to go right to 32bit on power up.

20:30 <geist> that would be interesting. if a 386 compatible machine started in 32bit mode it'd have to contend with the fact that there's no GDT. yet

20:31 <geist> so i guess it'd have to define the starting state as 'already in 32bit mode, no GDT pointer, but segment registers are set up with 32bit mode as if they had been loaded'

20:31 <heat> you could hardcode the segment bases and limits

20:31 <geist> so presumably the very first thing you do is load a real GDT so that it can continue

20:31 <mrvn> straight 1:1 memory map, all segment registers loaded with 0-4G.

20:31 <geist> i guess you'd have a NMI hazard right off the bat, since you have no IDT or whatnot loaded on instruction #1

20:31 <geist> whereas that problem doesn't exist for real mode since the vectors are implicitly 0

20:31 <mrvn> interrupts disabled. Why would you get an NMI?

20:31 <geist> because that's the point of NMI

20:32 <geist> one cannot disable it

20:32 <mrvn> yes, but what do you expect to throw one?

20:32 <geist> not the cpu's problem

20:32 <geist> still have to consider the design, since nothing prevents a system designer from doing it

20:33 <mrvn> true

20:33 <mrvn> Nothing stops you from pre-loading an IDT that points into the rom though.

20:33 <mrvn> (on FPGA)

20:33 <geist> so you'd probably have to do something like have some sort of window where nmis can't fire, or require the system designer deal with the hazard by putting an external NMI gate (a-la a20)

20:34 <geist> yeah in an fpga you can do what you want. i'm thinking more like if intel had designed a 386 back in the day that started in protected mode. say via a pin strapping

20:34 <mrvn> How does this work on ARM?

20:34 <geist> ARM has no nmi, so it avoids the problem

20:34 <mrvn> .oO(take a wire cutter and cut the NMI pin)

20:34 <geist> and i assume when it starts in EL3 (or highest EL) it starts with everything masked

20:35 <geist> irq/fiq/serror/debug

20:37 <heat> geist, you can mask NMIs if you really want to

20:37 <heat> it's in one of the RTC registers

20:37 <heat> you would just start with that pre-masked, easy solution

20:38 <geist> yah but that assumes a RTC exists and is wired up the way PCs are

20:39 <geist> from the point of a raw early x86 (before half of the PC arch got integrated into it) RTC/PIC/etc were all just part of the syscal

20:39 <geist> system

20:40 <geist> nmi on first instruction could be like, there's an nmi button on the board that some person holds down when releasing reset

20:40 <geist> boom, cpu starts nmi is asserted on first instruction

20:40 terminalpusher has quit [Remote host closed the connection]

20:41 <heat> yes, you'd need to make an NMI register a platform detail

20:42 <geist> z80 has some logic dealing with nmis and whatnot, but the key there is x86 post real mode has all this *state* that implicitly relies on stuff already existing in memory

20:42 <geist> hence why in general it's hard to start the cpu > real mode

20:42 <geist> and perhaps why it has never been removed thus far even though it'd make a lot of sense

20:44 <geist> whereas no other modern (or even contemporary architecture to x86 in the 80s) had all this in-memory state

20:44 <heat> there's not that much state, just the IDT

20:44 <geist> ie, data structures that live in memory that the cpu reads when it feels like it

20:44 <geist> not true: the IDT refers to segments that must live in a GDT/LDT

20:44 <heat> you can hack your way to a memoryless GDT if you change the way the CPU works at startup

20:44 <geist> but the cpu re-fetches data from the GDT upon exception

20:45 <heat> hmm

20:45 <heat> right

20:45 <geist> note other arches like 68k or whatnot had a table (VBAR) but they were just addresses

20:45 <geist> and that can safely just start at an implicit address (0)

20:45 <heat> well, you curl up in a ball and cry

20:45 <geist> yah further 'x86 sucks'

20:45 <heat> maybe you just put it in ROM and hopefully it works

20:46 <geist> and pretty much 100% of this can be traced to the 286. that's when they had been really affected by the apx432 koolaid that had been flowing in the breakrooms at intel

20:46 <geist> which was like this except on steroids

20:46 <mrvn> What does the IDT point to on power up?

20:46 <heat> nothing

20:46 <geist> nothing, cpu starts in real mode, which doesn't use it

20:46 <mrvn> it still has the register with some context

20:46 <mrvn> contents

20:46 <geist> probably either 0 or UNDEFINED

20:46 <heat> ^^

20:46 <geist> since its not used until software makes it so

20:47 <mrvn> yeah, but that is an important difference.

20:47 <geist> not really, since the cpu doesn't use it

20:47 <mrvn> You can map rom to 0 but not to UNDEFINED

20:47 <geist> doesn't matter, because it's not used until you enter protected mode, which software is required to do, and software is supposed to set up the IDT before doing so

20:48 <geist> prior to 286 x86 simply existed and could run meaningful stuff at instruction 0. no additional state to set up

20:48 <mrvn> but you want to go straight to 32bit with a pin. And then you could have an IDT at 0 for the NMI.

20:48 <geist> mrvn: oh sure. that was my point of the whole discussion. there's all this extra shit you have to set up before going to protected mode

20:49 <geist> anyway i think we may have beaten this horse

20:49 <geist> more of a thought experiment in the challenges of starting an x86 in > real mode

20:50 <heat> ok right so

20:51 <heat> all the descriptor tables can be in ROM, and are in ROM when switching from 16 to 32-bit in firmware

20:51 <geist> yah would just have to define what the starting addresses of the tables are

20:51 <geist> an ugly hack but so it goes

20:51 <heat> what's a platform without ugly hacks

20:51 <geist> like 'entry point is at X, IDT is at Y, GDT is at Z, suggested contents for GDT is WWWW'

20:52 <geist> heat: riscv!

20:52 <heat> yet!

20:54 <heat> how's lk-user coming along?

20:55 <mrvn> heat: all my hacks are beautiful :)

21:01 <geist> heat: nothing today, going to get LK working with gcc 12.1 first, and debating what to do with the shared riscv thing but i dont need it yet

21:01 <geist> i had wired up a little file descriptor table though

21:02 <geist> probably the next thing to do is either decide to go the musl route or continue to newlib for now

21:02 <geist> probably the second for a bit, since i can wire up more stuff that way

21:02 <geist> musl is a huge task and it's highly linux centric

21:02 <heat> if you call it a handle table you're 50% less UNIX-y :P

21:02 GeDaMo has quit [Quit: There is as yet insufficient data for a meaningful answer.]

21:03 <geist> yah trouble is really it's the whole 'how does the table allocate and pack its values'

21:03 <geist> posix has very specific first pack which is pretty bad honestly, but its fairly baked into the design
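
The POSIX rule geist refers to is that a newly allocated descriptor is the lowest-numbered one not currently in use. A minimal sketch of a table with that policy, assuming a small fixed capacity and opaque object pointers; `struct fd_table`, `fd_alloc`, `fd_free` and `FD_MAX` are hypothetical names, not lk-user's.

```c
#include <assert.h>
#include <stddef.h>

#define FD_MAX 16  /* hypothetical capacity */

struct fd_table {
    void *slot[FD_MAX];  /* NULL means the descriptor is free */
};

/* Return the lowest free descriptor bound to obj, or -1 if the table is full. */
int fd_alloc(struct fd_table *t, void *obj)
{
    assert(t != NULL && obj != NULL);
    for (int fd = 0; fd < FD_MAX; fd++) {
        if (t->slot[fd] == NULL) {
            t->slot[fd] = obj;
            return fd;
        }
    }
    return -1;
}

/* Release a descriptor so a later fd_alloc can hand it out again. */
void fd_free(struct fd_table *t, int fd)
{
    assert(t != NULL && fd >= 0 && fd < FD_MAX);
    t->slot[fd] = NULL;
}
```

The linear scan is the price of the lowest-free guarantee. A handle table without that rule is free to hand out values in any order, for example from a free list.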

21:03 <mrvn> geist: 1342 endianness

21:03 <geist> i'm really debating in my head what i'm trying to build aside from hello world

21:04 <mrvn> mandelbrot? computing PI?

21:04 <mrvn> frogger

21:04 <j`ey> qemu

21:04 <heat> irc client

21:04 <mrvn> tetrinet

21:05 <heat> by the way, uclibc-ng is a thing, although I've heard it's of lesser quality than musl (and LGPL licensed)

21:15 <heat> geist, by the way why do you want to switch from newlib?

21:15 <heat> is it that bad?

21:15 <geist> it's fine for embedded, but it seems to be lacking

21:15 <geist> things like internal locking, it's pretty unclear how well it does any of that

21:15 <geist> i should work with it a bit more though, to be fair. it's more of a basic libc in the stdio + heap sense, it seems

21:27 Likorn has quit [Quit: WeeChat 3.4.1]

21:32 <heat> have you tried building some simpler packages with it

21:32 <heat> klange used it for quite a bit of time so it must be capable of something

21:35 <geist> yah i probably should stick with it honestly

21:35 <geist> the musl thing was more of a 'lets see how hard this would be' and it seems like it's at least straightforward, just non trivial

21:38 <geist> haha looking at the fragmentation of some of my VM disk files on my nas server

21:38 <geist> 401k fragments for my windows 10 img file!

21:41 <heat> i think that if you want to avoid having a posixy interface, you should avoid musl, at least for now

21:41 <geist> i think so too to be honest

21:41 <heat> it did influence quite a bit of my design

21:41 <heat> you could also go full posix and make the perfect svr4 clone

21:41 <geist> i'm torn between 'lets just do posix so i can be like sortix' and 'meh more fun to build a lower level more embeddedy but user space thing'

21:42 <geist> and the latter is probably an actually useful thing

21:42 <heat> how lower level?

21:42 <geist> but since i'm keeping the lkuser stuff a separate project, there's nothing that keeps someone from building something else on it

21:42 <geist> ie, still maintaining a kernel vs user space implementation as separate layers

21:43 <geist> oh i dunno, i mean lower level as in doesn't need fork() signals, etc. more of 'here are some files, here's a way to access the network, here's a way to create more processes and threads'

21:43 <geist> 'here's a way to get to devices other than /dev nodes'

21:43 <geist> 'here's some ipc and futexes to build stuff out of'

21:43 <mrvn> like an exokernel or lib kernel?

21:44 <geist> mrvn: what are those?

21:44 <geist> or more specifically what do you mean when you say those (everyone has different definitions of that)

21:44 <mrvn> Basically the whole kernel as a library you link against.

21:44 <geist> and run in user space?

21:44 <mrvn> kernel space

21:45 ozarker_ has joined #osdev

21:45 <geist> i guess? basically? depends on if you consider linking a bunch of .o files as 'linking against' or whatnot

21:45 <mrvn> I think the big point is that you replace system calls with function calls

21:45 <heat> well, that's not it

21:45 <geist> lk build system is very modular, so in this case what i'm doing is providing another module called 'lkuser' that implements the syscall layer

21:46 <mrvn> hmm, so the equivalent of libc but not posix-y

21:46 <geist> so it's basically building a veneer routine, an 'executive' so to speak that acts as a user space interface to the kernel that is largely unconcerned with user space

21:47 <mrvn> I kind of envision that as ldso segment every user space program gets in my kernel.

21:47 <geist> so no it's not at all the equivalent of a libc, it's more of adding a module to an existing modular system

21:47 <geist> no i'm talking a layer below that as in LK currently has no concept of user space. it doesn't switch out of protected mode. you can write embedded stuff with that

21:47 <mrvn> vdso even

21:48 <geist> but you can simply add another layer that adds a user space and then provides syscalls with a 'personality' that interfaces with the inner LK code

21:48 <geist> and you could build multiple kinds of these if you want, even at the same time

21:48 <mrvn> Would you still link against .o files for that personality in the user space progs?

21:48 <geist> this is functionally what we did for zircon, we started with LK and then started building up the zircon syscall interface

21:48 <geist> no... i think you're thinking wayy too hard about this

21:49 <geist> user space is user space. just like any other

21:49 <geist> 100% of what i've been talking about the last 5 minutes is how the kernel code is organized

21:49 <mrvn> just wondering how the user space side is going to talk with the kernel side

21:49 <geist> via syscalls

21:50 <geist> of which the kernel side is implemented in the lkuser module

21:50 <geist> which is linked with the LK kernel

21:50 <heat> via lcall 0x7, per i386 svr4 abi

21:50 <geist> (just to be clear, not what heat just suggested)

21:51 <mrvn> seems to fit with what linux / bsd have as personality then

21:51 <heat> nooooooooooooooooooooooooooooooo

21:51 <geist> right

21:51 <geist> basically add a particular user space personality to LK as a module you add to LK kernel

21:52 <geist> where it'll get nasty is when i want to start implementing process termination and i need to be able to unblock threads in the LK kernel, which you currently can't do, i think

21:52 <heat> anyway geist I think you're overthinking it

21:52 <heat> both of those two designs can be useful

21:52 <geist> which two designs?

21:52 <heat> the posix thing and the light embedded userspace thing

21:53 <geist> oh sure. exactly. it's mostly which ones i want to fiddle with first

21:53 <geist> i think a point i was trying to make a while ago and got lost in the noise is i can do *both*

21:53 <geist> because it's a personality that should be logically separate from the core kernel

21:53 <heat> damn right

21:53 <heat> suck it people that write budget linux

21:54 <geist> at the expense of perhaps an added layer of abstraction

21:57 <mrvn> don't you have that layer anyway because you have to copy the syscall args from user space memory to kernel and then call the actual kernel function that does the job?

21:57 <heat> what if you write an nt executive layer?

21:57 <mrvn> heat: then you will get stoned

21:57 <mrvn> The L4 microkernel has an optional linux portability layer.

21:57 <geist> mrvn: sure but i mean a layer as in user space will say treat handles or file descriptors this way or use futexes for blocking, but the inner kernel may not have any of those concepts

21:58 <geist> but in general good modular designs work that way anyway

21:58 <geist> so it's not that much of a stretch

21:59 <mrvn> geist: sure. My point is you already have that for a posix layer. Your kernel won't implement everything posix and the layer has to emulate it already. Adding an lkuser layer just changes what you have to emulate or even reduces it because it will be finetuned to lk

21:59 <geist> right

21:59 <geist> this is versus some sort of designs where the kernel *is* posix

21:59 <geist> you could really bake a lot of that all the way down

21:59 <geist> if you didn't care to implement anything else

22:00 <geist> like say directly testing for pending signals inside interrupt handlers, instead of at least making some sort of veneer routine that separates those two layers

22:00 <geist> thread_check_pending(...) -> generic return kinda stuff
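
The veneer idea above can be sketched as a single generic check that the core kernel calls, with whichever personality is loaded deciding what "pending" means. This is a toy model in Python, not LK code; `set_pending_hook`, `PendingAction`, `posix_pending_hook` and the dict-based thread are hypothetical names invented for illustration.

```python
from enum import Enum


class PendingAction(Enum):
    """Generic answers the core kernel understands (hypothetical)."""
    NONE = 0  # nothing pending, return to user space normally
    EXIT = 1  # the thread should be torn down


# Installed by a personality module; the core kernel never looks inside it.
_pending_hook = None


def set_pending_hook(hook):
    """Register (or clear, with None) the personality's pending check."""
    global _pending_hook
    _pending_hook = hook


def thread_check_pending(thread):
    """Called by the core at its check points; knows nothing about signals."""
    if _pending_hook is None:
        return PendingAction.NONE
    return _pending_hook(thread)


def posix_pending_hook(thread):
    """A posix-like personality: a fatal signal maps to the generic EXIT."""
    if thread.get("fatal_signal"):
        return PendingAction.EXIT
    return PendingAction.NONE
```

With no hook installed the core always sees `PendingAction.NONE`, so a build without any user space personality pays only for one `None` test.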

22:01 <geist> i think that's generally one of my good engineering practices i'm pretty good at: building separation of layers and concerns between layers

22:02 <mrvn> it's layers, all the way down. No, wait, that was turtles.

22:04 <dh`> depends what you mean by "posix"

22:04 <dh`> that is, any kernel that's supposed to be able to do complex things needs a way to interrupt stuff in progress

22:04 <geist> usually the big standouts are: signals, fork, file descriptors that work in a particular way, notion of everything being in a fs namespace

22:04 <dh`> and asynchronous notifications

22:05 <mrvn> dh`: why?

22:05 <geist> signals i think tend to infect a bunch of the core kernel pretty quickly, though not usually inexorably

22:05 <geist> if nothing else because you have to test for various things at various points

22:05 <mrvn> geist: is signalfd cleaner?

22:05 <geist> but that can usually be abstracted with a layer of 'test for whatever the other layer wants here' sort of things you sprinkle around

22:06 <geist> which is kinda a layering violation, but at least it's abstracted

22:06 <dh`> mrvn: because in production uses you end up in situations where something's spending time/resources doing something useless and you want to stop it

22:06 <dh`> you can get away without any kind of kill/interrupt mechanism in a special-purpose system but it's a significant limitation

22:06 <mrvn> dh`: so check a signalfd every now and then. No need to interrupt the process.

22:07 <dh`> what does "interrupt" mean in this context? that you poke the process while it's busy and it responds

22:07 <dh`> there are a lot of ways to implement that

22:07 <mrvn> dh`: stopping it and changing the IP/SP

22:07 <clever> mrvn: i found signalfd handy to easily deal with ctrl+c in a select() loop, without having to deal with volatile vars and checking them on every iteration

22:07 <dh`> no signal implementation I know of does that directly

22:08 <dh`> (then again, I'm sure there are lots I don't know of)

22:08 <clever> epoll actually

22:08 <mrvn> dh`: that's basically the posix way. you set a function to be executed when interrupted

22:08 <clever> https://github.com/librerpi/rpi-open-firmware/blob/master/uart-manager/uart-manager.cpp#L162

22:08 <bslsk05> github.com: rpi-open-firmware/uart-manager.cpp at master · librerpi/rpi-open-firmware · GitHub

22:08 <dh`> uh

22:08 <dh`> *userland* needs a function to be executed when interrupted

22:09 <dh`> that has little effect on the kernel internals

22:09 <mrvn> dh`: signalfd says otherwise

22:10 <dh`> how? signalfd is just an alternate mechanism for delivery to userland

22:10 <mrvn> I think there are many way to implement it both for user and kernel.

22:10 <clever> depends on what you want done with the signal

22:10 <heat> sigqueue also works

22:11 <dh`> the part of signals that actually affects the kernel architecture is the machinery that causes a blocked process to unblock and bail out

22:11 <geist> yah and really you need that the instant your user space has the notion of a forceful termination of a thread or process

22:11 <clever> signalfd doesnt do that, and just converts the SIGINT into a write on an FD

22:11 <geist> ie, thread A in process A calls proc_exit() and takes out thread B

22:11 <mrvn> dh`: so start fixing the underlying issue: blocking. Most kernels do async IO internally nowadays.

22:11 <geist> that's the part i'll have to plumb through LK that i'm not looking forward to

22:12 <clever> forceful termination doesnt really need userland signals

22:12 <geist> but... already done it at least once, for zircon

22:12 <clever> it just needs a way to wake the thread and have the kernel side clean up after itself

22:12 <geist> right

22:12 <dh`> I don't claim to understand what signalfd does and doesn't do, but in order to deliver a signal you still need to interrupt the target process

22:12 <clever> forcing the userland to temporarily run a sig-handler is entirely separate

22:12 <geist> functionally it means every place you block on an event_t or whatnot (in LK terminology) has to handle it returning with a specific error code

22:12 <geist> like ERR_UNBLOCKED
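
A minimal sketch of that pattern, modeled with Python threads rather than real kernel code: a blocking wait that returns either success or an "unblocked" error when the waiting thread has been marked for termination. `KThread`, `Event`, `kill_thread` and the numeric values of `NO_ERROR` / `ERR_UNBLOCKED` are assumptions made for illustration, not LK's actual API.

```python
import threading

NO_ERROR = 0
ERR_UNBLOCKED = -1  # hypothetical value; only the name comes from the discussion


class KThread:
    """Stand-in for a kernel thread structure."""
    def __init__(self):
        self.killed = False
        self.blocked_on = None  # the Event this thread is currently waiting on


class Event:
    """Stand-in for an event_t style blocking primitive."""
    def __init__(self):
        self._cond = threading.Condition()
        self._signaled = False

    def signal(self):
        with self._cond:
            self._signaled = True
            self._cond.notify_all()

    def wait(self, thread):
        """Block until signaled, or bail out if the thread is being killed."""
        with self._cond:
            thread.blocked_on = self
            try:
                while True:
                    if thread.killed:
                        return ERR_UNBLOCKED  # caller must unwind and clean up
                    if self._signaled:
                        return NO_ERROR
                    self._cond.wait()
            finally:
                thread.blocked_on = None

    def _wake_all(self):
        with self._cond:
            self._cond.notify_all()


def kill_thread(thread):
    """Mark a thread as dying and kick it out of whatever it is blocked on."""
    thread.killed = True
    ev = thread.blocked_on
    if ev is not None:
        ev._wake_all()
```

Every caller of `wait()` then has to check the return value and unwind on `ERR_UNBLOCKED` instead of assuming the event fired, which is the plumbing cost being described.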

22:12 <clever> dh`: signalfd basically just routes certain signals to an fd, which you then read/poll/select/epoll as a normal fd, and it no longer interrupts your process

22:13 <clever> you just block on it along with all of your other inputs

22:13 <dh`> mrvn: you can write your kernel so _nothing_ blocks, but that's very expensive from a code structure standpoint

22:13 <geist> oh yeah that's another posixy thing that's a pain in the ass: select()

22:13 <geist> poll() is a little easier, but ugh. select

22:13 <clever> geist: why not just do epoll only?

22:13 <dh`> doing some bulk I/O ops asynchronously is not like e.g. doing ops like mkdir asynchronously

22:13 <geist> because if you're doing posix you gotta do it all

22:14 <clever> geist: userland wrapper around epoll?

22:14 <geist> possibly

22:14 <mrvn> dh`: with signalfd you no longer block on select/poll/epoll because they return activity on the signalfd instead of your program being interrupted

22:14 <heat> poll and select aren't easy to emulate with epoll

22:14 <heat> definitely isn't fast
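
To illustrate why a naive userland wrapper tends to be slow, here is a rough sketch of select()-style semantics built on a throwaway epoll instance, using Python's `select.epoll` (Linux only). `select_via_epoll` is a hypothetical helper covering only the read and write sets; it is not how any particular libc does it.

```python
import os
import select


def select_via_epoll(rlist, wlist, timeout=None):
    """Emulate a subset of select() with a per-call epoll instance."""
    rset, wset = set(rlist), set(wlist)
    wanted = {}
    for fd in rset:
        wanted[fd] = wanted.get(fd, 0) | select.EPOLLIN
    for fd in wset:
        wanted[fd] = wanted.get(fd, 0) | select.EPOLLOUT

    ep = select.epoll()  # one epoll_create per select() call
    try:
        for fd, mask in wanted.items():
            ep.register(fd, mask)  # one epoll_ctl per fd, on every call
        events = ep.poll(-1 if timeout is None else timeout)
    finally:
        ep.close()  # and the whole set is thrown away again

    # select() reports errors and hangups as readiness on the requested sets.
    err = select.EPOLLERR | select.EPOLLHUP
    readable = [fd for fd, ev in events
                if fd in rset and ev & (select.EPOLLIN | err)]
    writable = [fd for fd, ev in events
                if fd in wset and ev & (select.EPOLLOUT | err)]
    return readable, writable
```

Each call pays for creating the epoll object and registering every descriptor, where a native select() just scans the sets once. epoll also refuses regular files, which select() accepts, so a complete emulation needs special cases beyond this sketch.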

22:14 <geist> yah hadn't looked into it

22:14 <heat> and most software out there uses poll/select and not epoll

22:14 <clever> heat: slap any program doing that and tell them to get with the times :P

22:14 <dh`> mrvn: I don't understand what you mean

22:14 <geist> yah maybe i'll just build another message passing kernel :)

22:14 <mrvn> dh`: the interruption of a signal turns into a normal return of the syscall.

22:15 <dh`> application calls poll, goes to sleep, signal comes in, you still need to wake up the sleeping process

22:15 <mrvn> dh`: yes, you wake up. But with activity on the FD, not with EINTR

22:15 <dh`> but also, application is in the middle of its select loop doing something else and blocked in say mkdir, you still need to wake it up

22:16 <dh`> and as I've been trying to say, EINTR is a userland-facing phenomenon

22:16 <mrvn> dh`: no, that gets redirected into the FD. you don't get interrupted anymore.

22:16 <mrvn> dh`: if you block in mkdir you are screwed.

22:17 <dh`> if you do that, then the signal maybe never gets delivered

22:17 <dh`> and at least for unix signals, that's not part of the expectation

22:17 <dh`> maybe you don't care, but in that case you also need a way to unstick a process that's sitting in mkdir forever on a dead network volume

22:18 <heat> that's generally not interruptible in linux, only interruptible for kill signals

22:18 <mrvn> dh`: it's for processes that use a select/poll/epoll loop. And you pick which signals you want to keep as interrupts and which to redirect to the signalfd

22:18 <dh`> heat: that's a matter of nfs mount options

22:18 <dh`> at least in nfs

22:18 <dh`> it's also linux-specific from what I can see

22:19 <clever> i had my first thundering herd in years yesterday

22:19 <clever> turns out, my nas was hard nfs mounted to my irc box

22:19 <clever> and when the nas hung, the df's from cacti hung

22:19 <clever> and when the nas recovered, some 300 df's came to life at once

22:19 <geist> noice

22:20 <dh`> > it's also linux-specific from what I can see <-- that is, signalfd

22:20 <heat> write a 4.4BSD clone

22:20 <heat> the peak of unix

22:21 <heat> everything went downhill from there on

22:21 <geist> dh`: oh hey you know a thing about binutils and riscv

22:21 <dh`> maybe

22:21 <geist> https://github.com/bminor/binutils-gdb/blob/master/ld/emulparams/elf32lriscv-defs.sh#L19

22:21 <bslsk05> github.com: binutils-gdb/elf32lriscv-defs.sh at master · bminor/binutils-gdb · GitHub

22:21 <geist> as far as i can tell riscv (along with microblaze) are the only arches i've tried that have that particular nerf in place

22:21 <geist> i hit it the other day when trying to make a shared lib with my -elf toolchain

22:21 <geist> nuking that test seems to have no ill effect. i wonder why that's there?

22:21 <dh`> that looks useless and broken

22:22 <heat> that settles it

22:22 <geist> as someone on another discord put it it's another case of discrimination against embedded elves

22:23 <dh`> it's not like shared libraries don't work on riscv or something

22:23 <dh`> nor are elf shared libraries os-specific; I mean, it _says_ "elf"

22:23 <geist> exactly. it seems to work just fine, and though sure maybe the gcc defaults for -elf are not useful (though they seem to be) but you can drive ld with all the switches manually if you really want

22:23 <geist> except when it doesn't support -shared

22:24 <geist> i guess i should file a bug about it then

22:24 <geist> i tried about 10 other arches i maintain toolchains for and i think the only other one i saw that had a similar thing is microblaze

22:24 <clever> something else i was thinking about, relocations and shared vs static libraries

22:24 <geist> but that tends to be a *highly* embedded target

22:24 <dh`> yeah I would file a bug on it

22:24 <geist> yay validation!

22:25 <clever> for say a static userland binary, would relocations usually be missing, and the kernel just loads it to a fixed addr and job done?

22:25 <dh`> that kind of thing creeps in by accident and then nobody will notice it's there unless they happen to step on it themselves

22:25 <heat> geist, maybe just send a patch

22:25 <mrvn> clever: before security mitigations, yes

22:25 <heat> from my experience, things go slowly in GNU toolchain land

22:25 <dh`> (also, binutils configury is very extra so it's very hard to avoid having stuff like this happen)

22:25 <clever> mrvn: and how would i convince the linker to include relocation data anyways?

22:25 <mrvn> clever: PIE

22:26 <geist> or, it has relocations but if the loader puts it where it's natively linked, the relocations all end up evaluating to a NOP

22:26 <geist> that gives you the best of both worlds: something you *could* relocate but if you dont have to you dont do any of the work and dirty COW pages
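
A small model of that idea, assuming REL-style "relative" fixups on a 32-bit little-endian image (each listed word already holds its link-time absolute address). `apply_relative_relocs` and its arguments are hypothetical; a real loader would read the fixup list from the ELF relocation sections, whose format varies by architecture.

```python
import struct


def apply_relative_relocs(image, relocs, link_base, load_base):
    """Patch address words in place; return how many words were written.

    image     -- bytearray holding the loaded segment
    relocs    -- byte offsets of 32-bit words that contain absolute addresses
    link_base -- address the image was linked to run at
    load_base -- address the loader actually placed it at
    """
    bias = load_base - link_base
    if bias == 0:
        # Loaded where it was linked: every fixup would add zero, so skip
        # the writes entirely and leave the pages untouched.
        return 0
    for offset in relocs:
        (value,) = struct.unpack_from("<I", image, offset)
        struct.pack_into("<I", image, offset, (value + bias) & 0xFFFFFFFF)
    return len(relocs)
```

The early return is the point: when the bias is zero nothing is written, so a copy-on-write mapping of the file stays shared and clean.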

22:26 <heat> clever, ld has switched to keep relocations

22:26 <dh`> traditionally ever since virtual memory first appeared statically linked programs have a fixed load address

22:26 <heat> s/switched/switches/g

22:26 <geist> clever: generally i think just -dynamic i think? or something like that

22:27 <clever> geist: but wont that also then require an interpreter and a runtime linker?

22:27 <clever> and its not static anymore

22:27 <mrvn> clever: who else would do relocations?

22:27 <clever> even if you have 0 DT_NEEDED

22:27 <geist> well not necessary. depends on definition of static here

22:27 <geist> i'd say static is if you have no external references

22:27 <geist> but that's independent of relocations

22:27 <clever> mrvn: trying to extrapolate into a static kernel, with relocations, and the bootloader patching it

22:27 <dh`> it used to be that it was impossible with elf binutils to generate an executable image that still had ordinary relocations in it

22:27 <geist> however, ELF combines the two mechanisms, so it's sometimes confusing

22:27 <mrvn> clever: do you have binaries that don't use ANY dynamic libs?

22:27 <heat> "--emit-relocs" or -q

22:28 <heat> clever, ^^

22:28 <geist> all an external patch looks like to ELF is a relocation that refers to a non-local symbol

22:28 <dh`> if that has changed, that's a nice plus

22:28 <mrvn> clever: that's what geist is doing with some magic

22:28 <clever> heat: *checking*

22:28 <dh`> because it made it extremely painful to build executables for nommu systems

22:28 <geist> in the past i used to just manually drive the toolchain, specifically ld, to make what i want

22:29 <geist> vs using a more higher level pre-canned notion of what gcc wants me to do

22:29 <mrvn> clever: So far I'm just writing my boot.S by hand to be 100% PC relative and setup the kernel to run in higher half and then the kernel is linked to a fixed address.

22:29 <geist> but that tends to be nasty business, but usually doable since ld usually lets you get what you want if you try hard enough

22:29 <geist> though that doesn't apply necessarily to gold or lld alas.

22:30 <clever> mrvn: in my case, there is no mmu, and the top of the address space moves based on how much ram i have

22:30 <geist> haha oh that's the classic CP/M problem

22:30 <geist> SYSGEN.COM to fix that!

22:30 <clever> either i put it at a fixed addr and create a hole in the middle of my ram

22:30 Likorn has joined #osdev

22:30 <mrvn> clever: then go write an elf loader with relocations and add that as a stub before the kernel (or equivalent). That also lets you do kernel address space randomization.

22:30 <clever> or i apply relocation patches to it

22:31 <geist> actually apple ][ DOS 3.3 had that problem too: if you booted a disk on a 48k machine that was formatted at 64k it wouldn't boot because the dos it loaded off the disk was linked at the wrong address

22:31 <mrvn> clever: or run the kernel at a fixed low address and the user space higher up

22:31 <clever> mrvn: ive already got an elf loader in the previous stage, which i can extend to support relocations, so i just need the linker to emit them

22:31 <geist> at least x86 DOS didn't have this problem because segments FTW

22:32 <mrvn> clever: look at the PIC/PIE output to see if that suits you. Be aware that the relocation format changes from arch to arch and depending on flags.

22:32 <clever> yeah, segmentation was basically a cheaper MMU

22:32 <zid> I was about to joke just add segments

22:32 <geist> they were trashy, but actually kinda helpful when you think about it. lets you easily build relocatable stuff

22:32 <geist> for that particular era of hardware/software at least

22:33 <zid> I mean that was always the point right

22:33 <mrvn> geist: you mean in 16bit mode?

22:33 <zid> it solves the issue of how to run multiple programs very neatly

22:33 <geist> yah. it served two purposes: extend the address space and also build relocatable stuff

22:33 <zid> virtual memory is way too galaxy brain for that era (too many gates if nothing else)

22:33 <geist> yah and drivers too. DOS TSRs and drivers just got loaded somewhere modulo 16 bytes and had segments for their base/etc
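
The arithmetic behind "loaded somewhere modulo 16 bytes" is the real-mode address calculation: linear address = segment * 16 + offset. A short sketch, assuming 8086-style 20-bit wraparound; the function names are made up for illustration.

```python
def linear_address(segment, offset):
    """Real-mode translation: segment shifted left 4 bits, plus the offset."""
    return ((segment << 4) + offset) & 0xFFFFF  # 20 address lines on the 8086


def segment_for(load_addr):
    """Segment value that makes offset 0 land on load_addr."""
    if load_addr % 16:
        raise ValueError("load address must be 16-byte aligned")
    return load_addr >> 4
```

A driver built to run at offset 0x100 works unchanged wherever it lands: loaded at 0x23450 it gets segment 0x2345, and its code at offset 0x100 resolves to 0x23550 without patching a single byte of the image.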

22:33 <mrvn> I think they used segments in 32bit just because they already had them.

22:34 <geist> or 16 bit protected mode

22:34 <dh`> the 80286 segments were clearly intended to run multics but done by people who didn't read the directions adequately

22:34 xenos1984 has quit [Read error: Connection reset by peer]

22:34 <heat> zid, virtual memory is the galaxy brainiest idea known to man

22:34 <dh`> and the 80386 segments were basically structurally identical

22:34 <heat> even now

22:34 <geist> yah. i was saying earlier they had too much iapx432 koolaid in the water coolers around Intel at the time

22:35 <heat> "imagine memory, but it's not really there, until it is, but then it might not be, and if you write to it, it might switch to something else"

22:35 <geist> which was segments but galaxy brain version

22:35 <zid> "First, imagine a full 32bit lookup table to map any integer to any other integer"

22:36 <geist> 'why dont we just implement OO in the hardware directly' <mind blown> 'lets throw in garbage collection too!'

22:36 <heat> Jazelle was also great

22:37 <geist> but when you think about it, for example, a non mmu 68k machine (say a mac) would have to deal with relocating binaries and whatnot

22:37 <geist> since there was no hardware to do it for you. presumably when they loaded apps either it was at a fixed spot (one app on early macs) and/or had to be relocated on load. and the OS and drivers probably did

22:37 <clever> amiga is also non-mmu 68k and nearly everything is relative addressing

22:37 <j`ey> heat: BXJ!

22:37 <heat> "java is slow in our processors" "what if we <hits bong> run java bytecode directly on the CPU"

22:37 <geist> yah amiga too

22:37 <clever> there is a magic pointer to the description of a library

22:38 <zid> heat: I cry every time

22:38 <clever> negative offsets give you a function pointer table, but i think the table is relative to that root pointer

22:38 <zid> Every time someone mentions sim cards I cringe internally

22:38 <clever> positive is a struct describing the library

22:38 <dh`> in amigaos the call to load and relocate an executable was "LoadSeg"

22:38 <dh`> not sure why I remember this

22:38 <geist> yah i remember looking at jazelle once at a company i was at, but at the time it was heavily restricted

22:38 <geist> was some binary blob you weren't really allowed to look at, and it would trap at the drop of a hat

22:38 <clever> isnt that the java on arm thing?

22:38 <j`ey> yeah

22:38 <geist> yah was an actual mode bit in the CPSR and everything

22:39 <geist> i think it ran some subset of java bytecodes and would trap out as soon as something tricky happened

22:39 <clever> from what ive read, hw support for every single java bytecode is optional

22:39 <clever> and its possible for it to support not a single opcode, and still technically support the mode

22:39 <geist> but it was very hidden under layers of licenses and whatnot, so we didn't go for it and just wrote our interpreter in assembly

22:39 <geist> this was back when ARM was much harder to deal with. they were being hyper secret about everything like it mattered

22:39 <clever> explains why i couldnt find much info online

22:40 <heat> could you write a bootloader in java

22:42 <zid> Should you though

22:42 <heat> obviously yes

22:42 <heat> if the cpu supports it

22:42 <heat> I mean, it's right there

22:42 <heat> why would you not use it

22:43 <zid> heat are you feeling okay

22:44 <geist> it's bad idea jeans day

22:46 <kazinsal> also might be a bad idea to wear jeans today. it's actually above 20 C for the first time this year :toot:

22:46 <geist> it's getting up there here!

22:46 <geist> though it's like 17 here

22:47 <clever> "2022-05-21 19:47:18 bedroom temp: 21.44c(70.59f), kitchen: 22.31c(72.16f), living room: 21.88c(71.38f), outdoor: 15.06c(59.11f), server: 22.56c(72.61f) VCC: over 4.5 volts portb: 00000000"

22:48 <clever> i just finished wiring that software into mqtt and home-assistant, so now i can write automation rules to change the temp

22:48 <clever> just in time for summer, so i can not use any of that for a few months, lol

22:49 mahmutov has quit [Ping timeout: 246 seconds]

22:52 xenos1984 has joined #osdev

23:10 <geist> you know i'm starting to think this 3950x is simply unstable. now that i think about it it was always vaguely crashy when it was my main desktop cpu

23:11 <zid> shame cus that's a nice cpu

23:11 <zid> give it more voltage?

23:11 <zid> might just be a crappy bin

23:11 <geist> i generally assumed it was the usual graphics stuff, etc

23:12 <zid> does prime95 exist on linux

23:12 <geist> absolutely

23:12 <geist> but it doesn't seem to crash under load

23:12 <geist> generally just sort of sitting there. after a few days

23:13 <geist> i had a while back replaced the 3950x with a 5950x on my main desktop, and the latter is rock solid. that's when i had moved the 3950x down to my server

23:13 <geist> and then it started getting more unstable over the next 6 months or so

23:13 <geist> but it doesn't run hot or anything

23:14 <zid> sounds like a clear candidate for MOAR VOLTS

23:14 <heat> microcode?

23:14 <zid> does microcode give you more volts

23:14 <heat> no but it might fix bugs

23:15 <geist> yah have that updated, but... part of that problem may be it's a relatively old AM4 mobo that hasn't gotten a bios update in a few years and probably wont any more

23:15 <geist> so it'll be out of date except what linux uploads

23:15 <geist> though i guess linux will do the right thing there

23:19 <dh`> how much ram and is it ecc?

23:21 <geist> 64GB and yes

23:22 <dh`> not that then

23:25 <geist> i do have a 3900x i can stuff in and see though, will probably try that next

23:25 <geist> but if it doesn't fail it doesn't mean a lot honestly

23:26 <geist> though it's closer, power profile, to the 3950x so if it's a vreg on the mobo or whatnot i'd expect it to push it

23:26 <geist> yay consumer level hardware being pressed into server usage. sigh.

23:27 heat has quit [Remote host closed the connection]

23:27 andreas303 has quit [Quit: fBNC - https://bnc4free.com]

23:29 <raggi> geist: they should be still publishing update bios for the amd stuff I would have thought

23:29 <geist> yah but if motherboard vendor doesn't actually put out a patch you dont get it

23:30 <geist> but linux should have its own microcode patches

23:30 <clever> yeah, ive heard that windows relies on the bios to patch microcode

23:30 <clever> while linux just prepends the microcode to the initrd, and linux uploads it on boot

23:31 <raggi> My gigabyte one I had to go root around in their "ftp" servers to find the more recent ones, they're just terrible at putting them where they're supposed to be

23:32 <raggi> I swear the mobo manufacturers have some really weird software practices, no idea why but they just can't write and manage software in any normal ways

23:33 <raggi> Yeah, my am4 even has an update from this month, and they finally published on the main site again

23:33 <clever> my motherboard is capable of reaching out to the internet on its own, and updating its own firmware

23:33 <clever> its just a button in the bios config

23:35 <raggi> Yeah, that should be standard now, but alas. Mine still has this weird thing where when it boots in UEFI mode the GUI is super laggy, 1s pause then .5s run, repeating

23:36 <clever> my gui has bloody twinkling stars in the background artwork, lol

23:36 <geist> oh actually there is an updated bios, i should grab it

23:38 <raggi> My am4 board is stable tho, which is better than the one my haswell is in, so they successfully drove my bar down to "MVP"

23:38 <geist> yah honestly i'm really really sad if it turns out this cpu is fundamentally unstable

23:39 <geist> since i'm pretty much team AMD at the moment. dont let me down!

23:39 <geist> i'm trying hard to blame everything else. actually the fact that they've stuck with the same socket for at least 5 years makes things really nice to debug

23:39 <raggi> Yeah, my 3850 I got on your recommendation and *touches wood 3 times* is still working real nice

23:41 <raggi> You did reseat everything I presume?

23:41 <geist> well, i swapped cpus

23:41 <geist> so i should put the 3950x back and see again

23:41 <geist> the annoying thing is the MTBF is like 3 days so it's a long term solution

23:42 <raggi> I mean reseat the ram, and the gpu, etc

23:43 <raggi> At this point could be arbitrary electrical fault

23:45 <geist> yah i did that too

23:45 <geist> i thought i was onto something by removing the vid card and it appearing stable, but it blew up eventually

23:45 <geist> note it blowing up is a hard lockup. everything just stops. only a power cycle or reset fixes it

23:49 andreas303 has joined #osdev

23:50 <dh`> I have an (older) machine that does that, eventually figured it was the motherboard not really supporting ecc dram

23:51 <geist> possible. yes. maybe it's an ecc fault that the mobo then explodes on

23:52 <geist> though in the past i've had ecc faults from time to time that showed up at least in linux's dmesg log, but maybe the 3950x (or zen 2 in general) has a different fault mechanism that the mobo can't deal with

23:52 <dh`> characteristic symptom was that it would lock cold under load, including when rebooting into memtest86+, but a power cycle would clear the problem for weeks

23:52 <dh`> or months

23:53 <geist> whereas before i was using a zen 1 era cpu... that may be onto something

23:53 <geist> i have generally found that ECC ram does generate recoverable errors fairly frequently

23:53 <geist> though not usually every few days

23:54 <dh`> in my case what appears to happen is that the error gets generated (might only be an unrecoverable error?) but nothing happens until you access that memory, so the system dies, usually under load, at intervals of days

23:54 <gamozo> My Piledriver box would throw like 2-3 ECC errors a day (cheap ram, many sticks). Was kinda wild

23:54 <zid> my friend's an ecc nut, he links me machines that have 500 ecc errors logged since they last booted

23:55 <geist> hmm, also possible this 1500x i put back in it doesn't support ECC

23:55 <geist> and thus doesn't have the problem

23:56 <geist> i dont see any mention of it in the dmesg, but that may not be meaningful. but i thought linux would at least mention it

23:56 <zid> I'm still on the hunt for those 32GB of 933MHz ECC UDIMMs

23:56 <dh`> anyway there's all manner of possible similar problems

23:57 <geist> yah that's a good suggestion

23:59 <zid> I like to crash hardware so that it's broken through reboots

23:59 <zid> I kept doing it when I was testing my CPU with prime95