#osdev on 2021-08-18 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:02 thinkpol has quit [Remote host closed the connection]

00:03 thinkpol has joined #osdev

00:13 dbana has joined #osdev

00:14 <geist> Ah yeah that LK code is *old*

00:14 <geist> Actually predates LK somewhat, i think i wrote it in mid 2000s sometime

00:14 <geist> Really was only ever written against plain ext2. Needs some more features and cleanup.

00:15 <klange> You probably don't need to do anything with some of those? I think some of the compat features are "hey this thing is here, but if you're read only you can ignore it"

00:15 <geist> Iirc they have multiple flag fields

00:15 <geist> One for ‘you need these features to read/write’ and another one for ‘you need this feature to just read’

00:15 <geist> Like, yeah the different i node sizes, and whatnot is a read only mount breaker

00:16 <clever> geist: i also notice, its not respecting the s_feature_incompat field

00:17 <clever> it just doesnt even check it

00:17 <clever> ext2_mount:124: incompat features 0x2c2

00:17 * geist nods

00:17 <clever> INCOMPAT_FILETYPE, INCOMPAT_EXTENTS, INCOMPAT_64BIT, INCOMPAT_FLEX_BG

00:17 <clever> those flags from the wiki i linked above, are all set

00:17 <geist> In general there’s a question as to whether or not you want to be able to RO mount a journaled fs with an unplaced journal

00:17 <clever> (on ext4)

00:18 <geist> Un played

00:18 <klange> RO_COMPAT_HUGE_FILE → Files (some?) have sizes in logical blocks rather than 512-byte sectors.

00:18 <clever> > This filesystem has files whose space usage is stored in i_blocks in units of filesystem blocks, not 512-byte sectors. Inodes using this feature will be marked with EXT4_INODE_HUGE_FILE. (RO_COMPAT_HUGE_FILE)

00:18 <clever> according to the wiki i'm reading

00:18 <klange> RO_COMPAT_DIR_NLINK → The i_links_count in a directory can be reset to 1 if it has >=65k links.

00:18 <clever> so its a per-inode thing

00:19 <geist> Hmm, somewhere along the way they must have found space for the top 32bits of the file size too

00:19 <geist> Since iirc early ext2 was 32bit only

00:19 <clever> geist: thats probably INCOMPAT_64BIT

00:19 <geist> At the time it was one of those things FreeBSD did better

00:19 <clever> if you dont support INCOMPAT_64BIT, then you cant even do a ro mount, because youll mis-parse the huge files

00:19 <geist> Yah

00:20 <geist> Except the code currently ignores it, so it probably still mounts but will be off if its a large file

00:20 <clever> my main goal is just to read boot files out of /boot

00:20 <clever> without forcing the user to use ext2 for all of / (or a second /boot)

00:20 <geist> Yah, traditionally for a long time lots of distros would format /boot as plain ext2 + SYNC mount options

00:20 <geist> Presumably for boot loader compatibility

00:20 <clever> ext2 doesnt get you much benefit over fat32

00:21 <clever> one bad shutdown at the wrong time, and you dont boot anymore

00:21 <geist> Well, that’s debatable, but yeah

00:21 <clever> a journal should prevent that

00:21 <geist> That’s why traditionally you mounted /boot with sync option

00:21 <clever> if your bootloader can deal with the journal

00:21 <geist> That’s the problem: read only mounting with a journal is problematic. What do you do with it?

00:21 <clever> i dont know about the ext3/4 journal specifically

00:22 <graphitemaster> Okay doing the whole mmap as a memcpy optimization actually is really fast after 128 KiB of data.

00:22 <clever> but with sqlite, the journal has a backup copy of any blocks it was about to overwrite

00:22 <geist> About the best thing you can do is parse it, build an in memory representation of blocks in flight, and then use a look aside list to consult journal blocks as you read things

00:22 <clever> so if you re-route all reads, to the journal version, like an overlay

00:22 <geist> Right

00:22 <clever> then youll be viewing the rolled back version of the state, without actually applying it

00:22 <geist> Generally it’s the rolled forward state in ext4

00:23 <clever> ext4 writes the "new" block to the journal first?

00:23 <geist> Essentially ‘imma bout to write these blocks’ (goes to write the blocks) (marks the transaction as complete)

00:23 <clever> ahh

00:23 <geist> Yah, it’s a forward journal

00:24 <geist> That’s fairly standard and traditional. Not even that slow, since you can cache up a fairly large journal, and fairly lazily write it out and the real data

00:24 <clever> sqlite has a complex dance, write the journal block without a header, sync, write the journal header, sync, overwrite the original blocks in the db, sync, destroy the journal header, sync, destroy the journal file entirely

00:24 <geist> You just have these sync points where you have to ensure the journal is fully written before entering the next phase, etc

00:24 <clever> sqlite also assumes that extending a file, creates garbage in the holes

00:24 <clever> and other funky assumptions

00:25 <geist> Yep. I left out the ‘update the journal superblock’ thing

00:25 <clever> sqlite also combats the raw sectors, writing from either end

00:25 <geist> I dunno exactly where ext4 puts that, but it’s probably just a single sector that says what the head of the journal is, since you can write that single sector atomically

00:25 <clever> so if a partial write occurs, and the raw drive writes the sectors tail-first, sqlite handles it

00:26 <clever> both the head and tail of a sector must agree, or the block is considered corrupted

00:26 <geist> But anyway, the journal is a pain for embedded things. Even if you don’t play it back on mount you still have to track a fairly large amount of pending metadata

00:26 <geist> Which may be difficult in a constrained memory situation

00:26 <clever> the main case where i want ext4, is after i bring dram online

00:27 <clever> so i'll have between 256mb and 1gig of ram

00:27 <geist> Word.

00:27 <clever> https://sqlite.org/atomiccommit.html is where i got most of that sqlite info from, if you want to read up on it

00:27 <bslsk05> sqlite.org: Atomic Commit In SQLite

00:27 <geist> Gotta go, boat is coming into dock

00:27 <geist> I’m on a boat!

00:27 <clever> lol :D

00:28 <clever> klange: step 1, is to just falsely claim LK can read everything just fine, that is done

00:28 <clever> step 2, realize LK lacks readdir() for ext2/3/4, so i have no way to test the next phase easily :P

00:29 <klange> directories are for losers, Real Programmers know where the inodes for their files are

00:30 <klange> in fact why do you even need a filesystem, REAL PROGRAMMERS know all the blocks constituting their files and just use them directly!

00:30 <clever> klange: you dont use butterflies? :D

00:30 <geist> Haha that was the LILO strategy

00:30 <graphitemaster> inodes are for losers, Real Programmers know the physical location on the platter and can read it with a tiny magnet on the end of a needle without any drive controller.

00:30 <clever> and also what a lot of ARM chips do for the bootup code

00:31 <clever> rpi is a rare beast, where it handles MBR and fat32 in the bootrom, and loads from a file

00:31 <geist> A fairly common strategy is to have a fixed run of sectors on NAND that you store the loader, etc. But since NAND can have bad sectors, it skips any bad ones and just reads the next

00:32 <geist> But that’s easy to detect. As long as you overprovision a bit and probably store multiple copies of the binary, you’re cool

00:32 <clever> geist: i think the bootcode.bin was just in the first 128kb of sectors on the raw nand, when booting from that

00:32 <clever> ive not read that code closely, because i have no chips to wire up

00:32 <geist> Yep. If it’s well designed it overprovisions, or has multiple copies

00:32 <geist> Like 192KB to store 128KB, in case it has some bad nand sectors in the middle

00:32 <clever> when booting from SPI, its instead a 32bit magic, 32bit size, and then the raw bootcode.bin body

00:33 <clever> the sd card code, was meant more as a recovery option, to un-brick things

00:33 <clever> and it was using a file, so you could make the recovery media from a normal pc

00:33 <geist> That Microsoft format, i forget what it was called, used in the rpi nano has a offsets and whatnot in the block itself

00:33 <clever> then the rpi came out, and used the recovery route as the main route

00:33 <geist> Very wasteful, but pretty flexible in the end

00:34 <geist> Since you can scatter something all over the place and reconstruct it later, with checksums on each block

00:34 <clever> nice

00:34 <clever> you mean the pico, and uf2?

00:34 <zid> Don't allow updating the firmware, verify it in the factory, gg ;)

00:34 <geist> Downside is it’s about 2:1 overhead

00:34 <geist> UF2 is yeah i think it

00:34 <clever> yeah

00:34 <geist> Doesn’t have to store it that way on flash but you *could*

00:34 <clever> uf2 uses 512 byte packets, so it nicely sends one packet per MSD write command

00:35 <clever> and then it sends data in blocks of flash write size

00:35 <geist> Yah and you can fake out a FAT partition without actually parsing FAT

00:35 <clever> it could be >512, if your flash chip supports that

00:35 <geist> By just scanning the faked out block device for fragments of the file

00:35 <geist> Since each block is self-identifying

00:35 <clever> but since the pico uses 256 byte flash write blocks, its got the 2:1 waste

00:36 <clever> ive programmed a pico with cat before, cat foo.uf2 > /dev/sda

00:36 <geist> Anyway at first i was kinda grossed out and then started to appreciate the elegance of UF2

00:36 <clever> all it cares about is the uf2 packets being written

00:36 <geist> Yah

00:36 <clever> its abusing MSD as a packet transfer protocol

00:36 <geist> Anyway, gotta go

00:36 <clever> *waves*

00:36 <clever> get out of here!

00:36 gog has quit [Quit: bye]

00:37 dbana has quit [Quit: Lost terminal]

00:37 <clever> zid: for the roku2, there was per-device keys burned into the OTP

00:37 <clever> zid: and bootcode.bin had to be correctly signed, or it would just not boot

00:37 <clever> the board was capable of reflashing itself in the field to update things, but i can see there being a major bricking risk, if it lacks a fallback

00:38 <clever> pi1/pi0 use the exact same soc, but never enabled that feature

00:38 <zid> The wii had a couple of stages, one of which was fixed inside the cpu itself, and had the keys for the one on flash, way fancier

00:38 <clever> zid: thats pretty much exactly what the rpi has

00:38 <clever> there is a boot rom in the cpu

00:38 <clever> half of the key is in the rom, fixed for the entire batch of chips

00:38 gog has joined #osdev

00:38 <clever> half of the key is in OTP, unique to the device

00:39 <clever> the 2 halves xor together, and then hmac-sha1 sign the bootcode.bin file

00:39 <zid> It meant the console was a brick if you never dumped your keys out though and the nand died

00:39 <clever> zid: for the bcm2835, there is a timing exploit in the final signature compare method

00:39 <zid> wii's was better

00:39 <zid> they used strcmp instead of memcmp

00:39 <clever> the more signature bytes you get right in a row, the longer the check takes

00:39 <clever> *facepalm*

00:39 <zid> so you needed 256 attempts to bruteforce a lead zero

00:39 <zid> and you were done

00:40 <clever> for the bcm2711, you just append the signature to the end of the bootcode.bin file

00:41 <clever> zid: like this: https://github.com/librerpi/rpi-tools/blob/master/signing-tool/sign.js#L34

00:41 <bslsk05> github.com: rpi-tools/sign.js at master · librerpi/rpi-tools · GitHub

00:41 <zid> NDS was better, all the hashing worked, but it didn't cover the entry point in the header

00:41 <clever> for the rpi, the entry-point is hard-coded into the rom

00:41 <clever> bootcode.bin gets copied to 0x8000_0000, and if the sig is valid, jump to 0x8000_0200

00:42 <clever> cache-as-ram

00:42 <clever> pi0-pi3 all have the sig checks disabled, so "invalid" and "non-official" files just work, without any fuss

00:42 <clever> roku2 has proper per-device keys, so you need to gain execute first to dump them

00:43 <clever> bcm2711 (pi4/pi400/cm4) has the sig checks half enabled, but every unit has the same keys

00:44 <clever> zid: at some point, broadcom realized what a mistake an hmac was, the bcm2711b1 and further, also have proper RSA signature support, but its not enabled on any boards i know of

00:48 <clever> zid: do you know why an hmac is so bad for security?

00:52 gog has quit [Quit: byee]

00:58 AssKoala has joined #osdev

01:07 freakazoid343 has quit [Ping timeout: 245 seconds]

02:06 sts-q has quit [Ping timeout: 240 seconds]

02:19 sts-q has joined #osdev

02:20 mahmutov has quit [Ping timeout: 240 seconds]

02:21 freakazoid333 has joined #osdev

02:24 ZipCPU has joined #osdev

02:44 srjek has quit [Ping timeout: 240 seconds]

02:49 kulernil has joined #osdev

02:50 nyah has quit [Quit: leaving]

02:54 flx-- has joined #osdev

02:57 flx- has quit [Ping timeout: 245 seconds]

03:30 justyb11 has quit [Quit: Leaving]

03:56 shlomif has joined #osdev

04:19 Izem has joined #osdev

04:36 <Izem> what is required to boot a kernel on (sea) bios?

04:38 <clever> Izem: a bootloader of some kind

04:38 <zid> two bytes

04:38 <clever> legacy bios will only load 512 bytes of MBR from disk, and execute it, if it has that 2 byte magic

04:39 <Izem> I know I need a kernel imagine but I don't know what goes between that

04:39 <Izem> s/imagine/image/

04:39 <zid> Try grub, it's easiest

04:39 <clever> simplest answer, use something like grub, and then throw in something like a multiboot header

04:39 <Izem> yeah I am on windows that's why I was wondering if I could just use virtualbox

04:40 <Izem> I suppose grub is available on cygwin

04:40 <zid> i'd just do it in a linux VM but i am way more comfortable there than windows

04:41 <clever> zid: same, i dont know how i would even make a disk image on windows, with tooling i'm familiar with

04:41 <Izem> ok

05:04 <fedorafan_altern> morning

05:08 johnjay has quit [Ping timeout: 248 seconds]

05:11 johnjay has joined #osdev

05:57 paulman has joined #osdev

05:58 kulernil has quit [Ping timeout: 244 seconds]

06:14 devcpu has quit [Ping timeout: 268 seconds]

06:14 devcpu has joined #osdev

07:58 superleaf1995 has joined #osdev

08:01 Izem has quit [Quit: Connection closed]

08:07 mctpyt has quit [Ping timeout: 268 seconds]

08:20 rubion has joined #osdev

08:23 superleaf1995 has quit [Quit: leaving]

08:40 <klange> thinking about kernel logging suddenly, mostly because I glanced at a printf in my ELF loader and it sparked some further thought

08:41 <klange> toaru32 had 'debug_print' which did a bunch some macro magic, and had different log levels, and would output to nothing, serial, or even certain VFS files (particularly TTYs)

08:43 <klange> misaka has... essentially nothing, there's a few places where code calling debug_print was ported and it's macro'd to nothing, and there's a `printf` that spits to serial / vbox debug log but it's only used in strict debugging situations, and there's an fprintf that can print to userspace TTYs that I've been using in my in-progress modules (ahci, xhci) mostly so I can see the output on my laptop

08:44 <klange> oh there's also a framebuffer bitmap font printer for that, but it's a relic from early bringup of the kernel and its results will happily be wiped away by moving the mouse or the clock ticking

08:44 <klange> I was thinking I need something better for early boot... like, maybe a little reserved area for some log messages, then off to the heap, and then maybe expose that as a character device that something can read in userspace to flush it out to a file? how does Linux do this?

08:45 <klange> I know Linux does some silly thing where log messages are just strings and if you want log levels there's a special low byte at the beginning to say "there's a header here, not just text"?

08:45 sortie has joined #osdev

08:46 <klange> it's a sortie

08:46 <sham1> A ring-buffer of messages kinda like dmesg

08:46 <sham1> Well, whatever dmesg reads at least on Linux. It basically has a message log storage buffer thing

08:46 <kazinsal> I should start working on my project again

08:47 <kazinsal> been playing too much FFXIV to do so lately

08:47 <klange> I usually put in a couple of daily roulettes and then crack open the ol' editor.

08:48 <kazinsal> I stopped playing after Heavensward and have spent the last month and a half banging out SB and ShB

08:48 <kazinsal> just finished the 5.0 MSQ last night

08:49 <klange> I can't get the new expansion's benchmark tool to run in Wine, I've tried a half dozen different builds but it's crashing trying to show EULA text (and the config option that suggests it skips that... does not)

08:49 <klange> I've been told there's no actual notable engine changes happening, so hopefully things just keep working fine.

08:50 <kazinsal> yeah, from what I've heard it's effectively the same. I was kind of hoping they'd add some more high end graphics features in for endwalker but apparently not

08:51 <kazinsal> TAA instead of SMAA, etc

08:52 <kazinsal> that being said I get a consistent 120fps even in alliance raid boss pulls on a two-generation-old card so

08:52 <sortie> It's the klange!

08:52 <sortie> ka-lang-gah

08:53 <klange> I usually get a solid 60@1080p but I do experience some dips... when I ran natively in Windows, though, I had no trouble running at 5760x1080 on this card.

08:53 <klange> sortie: how goes the sortixes, I hear your website is self-hosted now? that is exciting!

08:53 <kazinsal> the worst framerate I get on my home world is in the limsa lominsa aetheryte plaza, because of all the reflections and a hundred catgirls (myself included) and a dozen bards playing five different off-tempo renditions of A Cruel Angel's Thesis

08:53 <klange> meanwhile I have neglected my network stack, but somehow my SMP support is stable

08:54 <sortie> It is quite the thing to have GoogleBot crawling my OS

08:55 <sortie> klange, I had soo many worries about things happening if I did this (trouble with my old host, hacking, logs, reliability) but eventually I just went fuck it I'm doing it

08:56 <sortie> I should insert a banner “You may already be a Sortix user.”

08:56 <sortie> klange, wow you did SMP?

08:56 <sortie> ACIPCA or what it's called?

08:56 <klange> Technically, as someone reminded me on the forum, I did SMP _during my lunch break_ (a couple months ago)

08:57 <klange> There's only one ACPI table you need to find the other cores, and it's not got any AML to worry about.

08:57 <sortie> Neat

08:57 <klange> And my bootstrap is this is absolutely awful little straight-to-longmode thing and every step of this has been "I am honestly amazed this works".

08:58 <sortie> Oh wow

08:58 <klange> Like the first time I got secondary cores firing up and scheduling tasks, I got to a functioning GUI before it crashed.

08:58 <sortie> Makes me want to give this a ago

08:58 <sortie> Not for this release cycle (decade) tho

08:58 <klange> Turns out those useless locks I implemented in toaru32 weren't so useless? Though some of them were wrong-but-only-in-the-actually-parallel case.

08:59 <sortie> How much stuff did you do to handle IPIs and such for page invalidation?

08:59 <sortie> I suppose one address space one CPU is a reasonable initial limitation

08:59 <sortie> Aah but the kernel

09:00 <klange> Page invalidation only really matters when you start paging things out, so I've somehow skirted a lot of it - but there is an IPI implemented for it that seems to work.

09:01 <klange> The last couple of bugs were a race condition in my Unix pipes that was mostly presenting as the package manager failing to unpack tarballs (it pipes my gunzip into itself for compressed archives, and it kept getting stuck; many minutes were spent poking at things in gdb)...

09:02 <klange> And a lock ordering issue in my poll-alike, which _appears_ to be the last kernel crash.

09:06 <geist> First time you tear down a thread is probably whenyou’ll start seeing it

09:07 <geist> Since you’ll probably want to unmap the heap, user and kernel

09:07 <geist> And that’s an unmap

09:10 <klange> I think the one place it was clearly necessary for me was unmapping shared memory, but there's just so many places I don't unmap or reuse address ranges that I manage to stupidly avoid the problem.

09:11 fedorafan_altern has quit [Remote host closed the connection]

09:13 <sortie> So cool :)

09:15 <sham1> Smart quotes

09:15 <sham1> Yay

09:20 Burgundy has joined #osdev

09:21 Brnocrist has quit [Ping timeout: 248 seconds]

09:22 <klange> < sham1> Smart quotes ← I support those, now, too! https://klange.dev/s/Screenshot%20from%202021-08-18%2018-21-54.png

09:22 <klange> (* just the rendering of them)

09:25 GeDaMo has joined #osdev

09:27 <sham1> Nice

09:28 Brnocrist has joined #osdev

09:28 <sham1> Meanwhile on this old netbook I am in the Linux VTY. No smart quotes for me :(

09:45 Arthuria has joined #osdev

09:49 kingoffrance has quit [Ping timeout: 245 seconds]

09:58 dennis95 has joined #osdev

10:05 warlock has quit [Remote host closed the connection]

10:17 warlock has joined #osdev

10:20 pretty_dumm_guy has joined #osdev

10:21 x_ has joined #osdev

10:21 x_ is now known as kingoffrance

10:38 <sortie> Sometimes I worry I am quietly judged by iOS people for using '

10:53 paulman has quit [Remote host closed the connection]

10:53 paulman has joined #osdev

11:14 rubion has quit [Ping timeout: 240 seconds]

11:16 rubion has joined #osdev

11:24 Vercas has quit [Remote host closed the connection]

11:24 Vercas has joined #osdev

12:05 ElectronApps has joined #osdev

12:14 NeoCron has joined #osdev

12:21 isaacwoods has joined #osdev

12:52 heat has joined #osdev

12:57 dormito has joined #osdev

13:03 Izem has joined #osdev

13:03 rubion has quit [Ping timeout: 240 seconds]

13:05 <Izem> the problem I see with capabilities is that you still have to define which process get's to do what. On the other hand if you have capabilities do you need a micro kernel?

13:10 ahalaney has joined #osdev

13:15 rubion has joined #osdev

13:19 <sham1> Well, capabilities are indeed very much orthogonal to microkernels, although microkernels might be interested in having capabilities since it's a nice and unified way to make sure that only the programs that are supposed to be drivers have direct access to hardware

13:20 <sham1> If you have a permission/capability system for that, might as well use it for other things as well

13:20 <Izem> I thought in a micro kernel that is getting restricted anyways due to the design?

13:21 <sham1> How could the microkernel tell what process is supposed to be the keyboard driver and what is not?

13:21 <sham1> The kernel still needs to grant the driver access to the hardware. Stuff like IO ports and/or MMIO

13:22 <Izem> ok

13:22 <sham1> Having a "driver capability" of some sort could be used to discriminate this access in such a way that only the driver gets access

13:23 <sham1> There are other ways as well, ofc

13:37 gog has joined #osdev

13:46 rorx_ has joined #osdev

13:46 rorx has quit [Ping timeout: 245 seconds]

13:56 rorx_ is now known as rorx

14:14 srjek has joined #osdev

14:16 MrBonkers has quit [Quit: The Lounge - https://thelounge.chat]

14:16 CWiz has joined #osdev

14:18 MrBonkers has joined #osdev

14:22 varad has joined #osdev

14:23 <varad> What's the RIP-relative equivalent of `mov %ax, ptr+2(%rbx)` on x86_64 GAS?

14:29 <gog> lea ptr(%rip), %rdx; add %rbx, %rdx; mov %ax, 2(%rdx) ?

14:29 <gog> maybe

14:38 freakazoid333 has quit [Ping timeout: 256 seconds]

14:39 <zid> I don't understand why you'd need to add two pointers together like that for

14:40 <gog> i think there's a way to do it with lea

14:42 <gog> nope you can with rip-relative

14:43 <gog> can't

14:51 <GeDaMo> Maybe lea ptr+2(%rip), %rdx; mov %ax, (%rdx, %rbx)

15:00 Izem has quit [Ping timeout: 252 seconds]

15:02 <gog> yeah that seems to work

15:03 <GeDaMo> Do you mean "it assembles"? Or did you check it produces the required result? :P

15:04 <gog> the former lol

15:05 Izem has joined #osdev

15:05 <zid> don't see why it wouldn't

15:07 ElectronApps has quit [Remote host closed the connection]

15:12 CWiz has quit [Quit: WeeChat 3.2]

15:14 <Bitweasil> "It compiles, ship it!"

15:14 <Bitweasil> :p

15:16 <gog> yes

15:20 <kazinsal> warnings are just reminders that life is fleeting

15:21 <GeDaMo> Warnings are just reminders that you didn't turn off warnings :P

15:21 nismbu has quit [Ping timeout: 258 seconds]

15:27 <gog> maybe mov %ax, 2(%rdx, %rbx) ?

15:27 <gog> i like that better than adding in the rip-relative lea

15:27 <zid> depends where the +2 is

15:27 <zid> if it's on the rip GeDaMo's is right

15:27 <zid> pointers are hard

15:28 <gog> well ptr+2(%rbx)

15:28 <gog> is the original

15:29 <zid> right but does "making it rip rel" mean ptr+rip, or rip[all that crap]

15:29 <gog> what i don't understand is adding an offset to a local

15:30 <varad> this did what I wanted : `lea ptr(%rip), %rcx; add %rbx, %rcx; mov %ax, 2(%rcx);`

15:30 <sham1> Warnings are just a reminder of the fact that your code isn't perfect

15:32 <gog> i have no code, therefore all of my code is perfect

15:32 <sham1> nice

15:33 <heat> my ext4 driver got committed with the wrong author so if anything breaks it's technically not my fault!

15:34 Arthuria has quit [Ping timeout: 240 seconds]

15:34 nismbu has joined #osdev

15:40 <gog> git blame

15:40 <heat> git dont-blame-me

15:48 <klange> ars article on serenity... says they're nearly up to 500 contributors... still doesn't make me feel less shit making comparisons...

15:49 <zid> why do you feel shit

15:51 <Izem> they cited discord in helping the uptake :P

15:51 <klange> partly *because* of that 500 contributor count, I guess... I've been doing this for a decade now and haven't gotten anything close to that kind of attention

15:52 <zid> that's not how fads work and you know it though

15:52 <Izem> that is quite a bit yeah

15:52 <Izem> how would you manage it?

15:52 <gog> step one of emotional self-care: don't compare yourself to others :p

15:53 <gog> (she said unironically)

15:53 <Izem> yeah

15:53 <klange> instructions unclear, stopped comparing myself to baseline health norms and have withered away in my bed

15:53 <heat> how many of those contributors have made actual high quality contributions?

15:53 <j`ey> https://github.com/SerenityOS/serenity/graphs/contributors

15:53 <bslsk05> github.com: Contributors to SerenityOS/serenity · GitHub

15:53 <GeDaMo> Just get all the voices in your head to start contributing :|

15:53 <j`ey> there's a lot of people with big +-s

15:54 <sham1> klange: I don't know if you've pushed Toaru for publicity as much as Andreas has with Serenity

15:55 <sham1> That's probably a major contributing (heh) factor

15:56 <klange> just makes me want to reject any new contributions to toaru, so i can grasp at the idea that i'm still better because I did this myself... that's what I did with redox after all!

15:57 <j`ey> lol

15:58 <klange> and we have SMP and actual x86-64 support and I'm not an arrogant ass who thinks not having a downloadable pre-made build is somehow a feature

15:58 <zid> do we have nakiri ayame theme pack

15:59 <gog> who?

15:59 <zid> you heard me

15:59 <gog> oh a vtuber

16:00 <gog> i wanna steal her look

16:00 <klange> maybe instead of a PonyOS release, 2.0 stuff should get a VTuber-themed release

16:01 <zid> A YA ME

16:01 <klange> gura

16:01 <sham1> The simps are gonna love it /s

16:01 <zid> :(

16:01 <zid> https://cdn.discordapp.com/attachments/417023075348119556/862703794068127775/unknown.png ayame or riot

16:01 <klange> nice mojibake vomit

16:02 <zid> I fixed it since but the other screenies aren't as good

16:03 <gog> zid: are you familiar with tohou project

16:03 <gog> you want demon girls? tohou project is all demon girls :p

16:03 <zid> only through doujinshi

16:03 <sham1> I see you are a man of culture as well

16:03 flx- has joined #osdev

16:05 flx-- has quit [Remote host closed the connection]

16:05 <klange> fantastic wallpaper, 10/10 https://klange.dev/s/fantastic_wallpaper.png

16:06 <zid> hahahah

16:06 <gog> lol

16:09 <klange> that's what I really need, none of this usb stuff - gotta get my own TLS implementation so I can compete...

16:10 <gog> thread-local storage or transport-layer security

16:10 <sham1> Why not both

16:10 <zid> transport-local storage

16:10 <gog> thread-layer security

16:10 <sham1> Thread-local security

16:11 <heat> thread-local storage

16:12 <klange> gog: the latter, I have thread-local storage - it was a necessity for supporting threads on my bytecode VM.

16:12 <zid> pirate libressl?

16:12 <klange> For the encrypted stream standard, I've been using mbedtls.

16:13 <heat> argh

16:13 <klange> [formerly PolarSSL]

16:13 <zid> [formerly formerly the software project known as polar ssl]

16:13 <heat> [somehow formerly openssl]

16:14 <gog> lol

16:15 <heat> I'm still traumatised by my openssl port

16:15 <heat> the build system is a big ??????????

16:15 <gog> didn't it have-- yeah i wwas gonna say it had a wacky build system

16:37 <sham1> Speaking of unf-ing build systems, my investigation towards redo seems to pay dividends

16:38 <sham1> Helps that it's actually just shell scripts and thus it can do a lot more than what standard make can do

16:39 <Izem> gnu make can be extended by guile or python if you wish :P

16:41 <sham1> But that's GNU make

16:41 <gog> i use guile for everything

16:42 <gog> just kidding i'm a socially awkward mess hahahahahaha

16:42 <gog> :'(

16:42 <Izem> lol

16:42 <Izem> sham1 "everything" seems to use gnu make

16:44 <sham1> Indeed they seem to, and they probably do. However, I like not relying on implementation-specific stuff if I can avoid it. And besides, I just find the ideas behind redo to be saner than make

16:44 <Izem> what are the major differences?

16:46 <heat> use something that can generate ninja

16:46 <heat> make is shitty

16:47 <sham1> Well as said, redo build files are shell. Also redo is inherently recursive, and avoids a lot of the problems with "recursive make considered harmful"

16:48 <sham1> The way target regeneration is also different since it's not just based around mtime like with make

16:49 <heat> do you need shell scripts all the time to build stuff?

16:49 <heat> that's not a good idea...

16:49 <GeDaMo> gcc *.c -o myprog

16:51 <sham1> Dependencies also don't require hacking around like Makefiles do, if you have gcc -MMD or whatever. In fact, the dependencies can be declared "after" the build according to the information gathered during the build

16:53 <heat> most modern build systems take care of -MMD and whatnot

16:53 <sham1> With 10x the complexity

16:54 <heat> and 10x the usefulness

16:54 dennis95 has quit [Ping timeout: 240 seconds]

16:54 dennis95 has joined #osdev

16:55 <gog> considering things harmful considered harmful

16:56 <heat> make and ninja are just low level build systems, they're good for the actual backend

16:56 <heat> but if you're a developer and you want to write a build thingy that does what you want and how you want it to, you should go for something higher-level

16:57 <sham1> A lot of these build systems make assumptions about the compilation, whether it's meson, cmake, or whatever. Make, redo and ninja all are just taking a list of dependencies, usually files, and doing stuff with the targets

16:57 <heat> that's why most other build systems output ninja or make

16:57 <heat> what assumptions?

16:58 wgrant has quit [Ping timeout: 248 seconds]

16:59 <Izem> doesn't redo use crypto hashes instead of filesystem timestamp?

16:59 <Izem> that's probably a better idea

17:00 <sham1> Yeah. My own implementation uses sha-512 for example, although others may use a combination of things like mtime and the content hash digest

17:00 <Izem> otherwise the idea of make is great

17:00 <Izem> how to make it general and access os features is the trick part

17:00 <heat> why is that a better idea?

17:00 <sham1> Well this is basically recursive make without Makefiles

17:01 <sham1> Because just because your mtime changes, that doesn't mean that you necessarily need a rebuild

17:01 <heat> an mtime check is exponentially faster than calculating a file's checksum

17:01 <sham1> Also things like builds being distributed over multiple machines

17:01 <Izem> it's a tradeoff for sure

17:01 <heat> if you're changing a file and undoing it, well, it's your fault that you're doing nonsense

17:03 <Izem> but I think folks have been happy to have make in their own tools due to the specific tools the need

17:03 <Izem> rake, shake etc

17:04 <Izem> so then you get to what ninja does which is taking instructions and the generation / program is someone else job

17:04 <sham1> Anyway, about assumptions. With things like cmake and meson, you need to teach quite a bit about your system to cross-compile. And "quite a bit" ranges from maybe just creating just a stub file to creating a more substantial file that does stuff

17:04 amine has joined #osdev

17:04 <heat> cmake needs a file that's literally, what, 10-20 lines?

17:05 <sham1> That's 10-20 lines more than I want to write

17:05 <Izem> comes with the territory

17:05 <Izem> have you seen auto tools? :P

17:05 <heat> those 10-20 lines save you 200 in other files

17:05 freakazoid343 has joined #osdev

17:06 <sham1> Well most targets in programming probably can just be built by one target file, like how you'd use one rule in make

17:08 <sham1> And things like cmake and meson aren't as specific. Meanwhile redo, like make and ninja for example, can be used straight away to build things like LaTeX documents. Of course you can teach cmake and meson those things, but it requires more effort than I'm at least willing to put in

17:08 <sham1> Err, aren't as general

17:09 <sham1> That of course comes from the fact that they're specifically made for programming, but that still feels like a slight limitation

17:17 shlomif has quit [Ping timeout: 245 seconds]

17:24 isaacwoods has quit [Quit: WeeChat 3.2]

17:26 isaacwoods has joined #osdev

17:27 mahmutov has joined #osdev

17:31 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

17:56 tacco has joined #osdev

18:03 mctpyt has joined #osdev

18:09 varad has quit [Ping timeout: 245 seconds]

18:24 Izem has quit [Ping timeout: 252 seconds]

18:35 srjek has quit [Ping timeout: 240 seconds]

18:39 rubion has quit [Remote host closed the connection]

18:43 Izem has joined #osdev

18:43 rubion has joined #osdev

18:56 <Santurysim> Hello, do I understand correctly that if I plan to use VESA VBE, I need to do all things that require int 10h in real mode?

18:56 <zid> or emulate your own bios

18:59 mctpyt has quit [Ping timeout: 252 seconds]

18:59 <geist> but yeah quickly you find that you can't do much with VBE once you get into protected mode

18:59 <geist> hence why most Real OSes have native drivers for the chipset

19:00 mctpyt has joined #osdev

19:02 <zid> you could also use VT-x I guess

19:02 nyah has joined #osdev

19:02 <zid> unrestricted guest runs the bios code

19:04 <Santurysim> Why? All I want for now is to display some text

19:08 <moon-child> 0xb8000

19:08 <Santurysim> ... with bigger screen size

19:09 <Santurysim> I'm working with already existing os, and it has 0xb8000 tty driver already

19:09 <moon-child> grab framebuffer

19:10 <Santurysim> vbe gives framebufer, doesn't it?

19:11 <moon-child> I don't know, as I've only used efi; but I expect so. https://wiki.osdev.org/Getting_VBE_Mode_Info

19:11 <bslsk05> wiki.osdev.org: VESA Video Modes - OSDev Wiki

19:11 <zid> so do that, you asked for alternatives to doing that

19:12 <zid> either you use the bios, emulate the bios, run the bios as a guest in a vm, or write a driver

19:18 GeDaMo has quit [Quit: Leaving.]

19:43 rubion has quit [Ping timeout: 252 seconds]

19:47 mhall has quit [Quit: Connection closed for inactivity]

20:00 <sortie> Santurysim, the easy solution is to use an existing bootloader like GRUB and having it give you the best framebuffer it can arrange

20:00 rubion has joined #osdev

20:00 <sortie> GRUB has drivers for virtual machines, VBE, and various things, plus EFI support, so it can do a decent job getting you a framebuffer on a real machine

20:01 <sortie> Of course if you do the use-your-own-bootloader route you're on your own

20:01 <sortie> Personally I went with this route and implemented a simple mode setting graphics driver for common virtual machines

20:01 <gog> it's pretty straightforward with EFI

20:01 <gog> the API already has everything you need

20:01 <sortie> *this route being GRUB

20:01 <gog> (if you're rolling your own)

20:02 <sortie> Yeah so messing with VBE from protected mode usually means you're going down a bad path

20:02 <sortie> Since you can use tech like GRUB or EFI or your own bootloader to solve a basic framebuffer for you

20:03 <sortie> BIOS calls are just massively unsafe from a kernel

20:04 <gog> remember the time before linux had kms and you had to use v86d to use vesa calls in x86_64

20:04 <gog> bad times :p

20:04 <gog> ah yes uvesafb needed it i remember

20:05 <gog> apparently it's still around

20:05 <zid> lynx -g pls

20:05 <gog> yes exactly

20:05 <gog> ok ttg

20:06 gog has quit [Quit: bye]

20:07 Izem has quit [Ping timeout: 268 seconds]

20:25 Izem has joined #osdev

20:27 <Izem> sortie: there's no way to isolate those calls?

20:27 <sortie> Izem, it's worse than that

20:27 <Izem> ah man

20:27 <sortie> Izem, these calls fundamentally program the GPU and whatever hardware specifics

20:27 <sortie> You don't really know what they do

20:28 <Izem> yeah that's tough

20:28 <sortie> Once you're a kernel you're in control of the PCI devices and so on and caching bits and memory layouts etc.

20:28 <zid> I mean, for vbe, they just run the rom supplied by the vga device

20:28 <sortie> Yeah in practice

20:28 <Izem> I guess efi has an upside despite the complexity :P

20:28 <zid> which is why it works for "any" card

20:28 <zid> becuse they all support that interface

20:29 <sortie> But it's hard to know what's going to happen when you do a BIOS call / use VBE, and do so in a safe way

20:29 <sortie> I suppose if you can access that ROM directly, one might be able to use that (mostly?) safely

20:29 <zid> yea who knows if your VBE impl. has a bug that makes it write to a random address nobody cares about from DOS

20:30 <zid> That's why I like the emulation strat

20:30 <sortie> Izem, absolutely EFI has that big advantage

20:30 <zid> Just run it in a sandbox and only do the io port accesses it wants

20:33 <vin> Is there a place where cache latency (cycles) is published per cpu? like wikichip/cpu-world

20:34 <Santurysim> sortie: thank you! As I mentioned, I'm playing with already existing os, and it has its bootloader (however, it is multiboot-capable). Also, it supports bios only at the moment. If I worked on my own os, i would definitely use grub or efi

20:39 <sortie> Santurysim, ah cool, I missed that part :)

20:39 <sortie> Absolutely, you know your constraints :)

20:40 air has quit [Ping timeout: 268 seconds]

20:43 dormito has quit [Ping timeout: 258 seconds]

20:49 <amine> Hey everyone, I'm following along the xv6 book and came across https://github.com/mit-pdos/xv6-public/blob/master/bootasm.S#L79 in the bootloader code. It seems the global descriptor table is 4 bytes aligned when in the manual and looking around in the internet that's supposed to be 8 bytes aligned. Any idea what I'm missing there ?

20:49 <bslsk05> github.com: xv6-public/bootasm.S at master · mit-pdos/xv6-public · GitHub

20:50 <amine> also the structure of an entry in https://wiki.osdev.org/LGDT shows that the total is 8 bytes (assuming a packed struct) so that made sense to me that the alignement should follow

20:50 <bslsk05> wiki.osdev.org: Global Descriptor Table - OSDev Wiki

20:50 <zid> I think it works with 4 but 16 is prefered for shpeed reasons

20:50 <zid> I could check the manual

20:52 <amine> 16 ? now I'm more confused :p

20:52 <zid> The

20:52 <zid> base address of the GDT should be aligned on an eight-byte boundary to yield the best processor performance.

20:52 <zid> 8 apparently

20:52 <zid> 3.5.1 below figure 3-10

20:53 <zid> (gdt is 16 bytes in long mode so it made sense to me that it'd want 16)

20:53 <amine> yes that's what I was reading

20:53 <sham1> Well the processor can indeed read unaligned data, it's just not that fast

20:53 <zid> I'd align it to 64 cus why not

20:54 <sham1> 64 bits or 64 bytes, because those are two very differnet things

20:55 <zid> bytes

20:55 <zid> cache lines are fun

20:56 <amine> yeah but for that link (the xv6-public repo), they set it to 4 bytes which is smaller, doesn't that mean it wouldn't work ?

20:57 <sham1> Future-proofing for the 512-bit CPUs I see

20:57 <zid> no, it says 'for performance'

20:57 <zid> not 'required'

21:00 <amine> hm true. I guess my confusion is once it's set to 4, how is that organized in memory exactly. I just assumed once you have an address say 0x00 you would need 8 contiguous bytes which would fit the struct, but if we set 4 bytes that's almost like it would be truncated as the next address 0x04 would have the next entry.

21:00 <amine> I'm sure Im getting it all wrong :p

21:00 <zid> 00 is aligned to EVERYTHING

21:01 <zid> we're talking about alignment, meaning, what the address ends in, think multiplication tables

21:01 <zid> so if your table is at 0xDEADFEE4 then it's aligned to 1 2 and 4

21:01 <zid> if your table is at 0xBEEF0000 then it's aligned to 1 2 4 8 16 32 64 128 256 512 1024 2048 and 4096 :P

21:01 <sham1> Think modular arithmetic

21:02 <sham1> Modular arithmetic in hexadecimal even, because looking at addresses makes it easier to reason about like that

21:03 <zid> 0x00007 is aligned to.. 7, stupid primes

21:03 <sham1> And it of course happens that for powers of two specifically, there is a very neat way of testing if something is divisible by said power

21:03 <sham1> Let x be 2^n where n is some positive integer (could also be zero, but that wouldn't be interesting)

21:04 <sham1> Now, addr mod x can be represented as addr & (x - 1), that is, you grab the n least significant bits of the addr and check if they're zero. If they're all zero, it's divisible

21:05 lg has quit [Ping timeout: 258 seconds]

21:06 <sham1> And of course, because this is such a well-known optimization, compilers will turn code like `num % 16` into bitwise ANDing num with 15, or 16-1

21:06 <sham1> On that note, I do slightly cringe when people do something like (num & 1) to check if something is divisible by two, for this very reason. Your compiler is smart. Use it

21:07 <zid> num & 1 is just as idiomatic to me

21:12 lg has joined #osdev

21:15 <amine> yeah I think I get that now. Here is my confusion: say I have a `foo` array of a packed struct of 64 bits ( the gdt entries), I assumed that by declaring it 4 bytes aligned, the memory addresses would be chosen in a way that it's exactly 4 bytes between each entry contiguously. So | entry1 (4bytes)| entry 2 (4bytes)|...| entry N | where address of

21:15 <amine> entry1 could be 0x0, entry2 0x4, entry3 0x8, etc .. in that exact order. But I think I get it now, that's not necessarily the case and it's just about what the address would end up with, the rest would be figured out automatically in terms of where exactly each element is ?

21:15 wgrant has joined #osdev

21:17 <sham1> Well the GDT entries should just fit snugly next to each other without any padding. Indeed, that's what the CPU requires and why you have to use packed structs if you want to represent the GDT in C for some reason

21:18 <zid> or use chars :P

21:18 <amine> right so choosing 4 bytes alignement or 8 would essentially have the same result in terms of how things are in memory right ?

21:18 <zid> but no, that alignment directive is just a directive to align the current output location to n

21:18 <zid> so if you were on byte 0x123 of your output and you told it to align to 8, it'd output 5 bytes of padding to get you to 0x128

21:19 <zid> then your data follows, completely unrelated

21:19 <sham1> Yeah. The initial address of the array just has the alignment. The elements then get their natural alignment which in this case is ignored in favour of being packed tightly

21:19 <amine> ohhh

21:19 <amine> that makes sense

21:19 <zid> which has the effect of.. aligning your data

21:19 <amine> thanks zid and sham1 :-)

21:20 <zid> it doesn't change the padding in your struct, just changes its alignment

21:20 <zid> your 37 bytes are still 37 bytes, regardless of whether they start with an address ending in a 4 or a 7

21:20 <sham1> But yeah. chars or uint64_t. After all, GDTs are specific to x86 and AMD64 so the undefined behaviour of potentially reading unaligned data should still work

21:20 <zid> idb

21:20 srjek has joined #osdev

21:20 <heat> Izem, EFI's GOP is still running a rom supplied by the GPU

21:21 <heat> there's no advantage

21:21 <heat> it just supports more stuff, but that's about it

21:21 <Izem> ok

21:49 ahalaney has quit [Quit: Leaving]

21:53 dennis95 has quit [Quit: Leaving]

22:02 heat has quit [Remote host closed the connection]

22:11 Izem has quit [Quit: Connection closed]

22:22 mahmutov has quit [Ping timeout: 248 seconds]

22:28 mahmutov has joined #osdev

22:36 gog has joined #osdev

22:39 NeoCron has quit [Remote host closed the connection]

22:53 AssKoala has quit [Ping timeout: 258 seconds]

22:55 isaacwoods has quit [Quit: WeeChat 3.2]

22:56 nismbu has quit [Ping timeout: 252 seconds]

22:57 rubion has quit [Ping timeout: 258 seconds]

23:06 sortie has quit [Quit: Leaving]

23:18 mahmutov_ has joined #osdev

23:21 mahmutov has quit [Ping timeout: 252 seconds]

23:23 mahmutov_ has quit [Ping timeout: 248 seconds]

23:44 freakazoid343 has quit [Read error: Connection reset by peer]

23:58 Burgundy has quit [Ping timeout: 248 seconds]