#osdev on 2022-04-29 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:05 bradd has quit [Ping timeout: 240 seconds]

00:06 bradd has joined #osdev

00:12 <geist> bada-bum-tiss

00:15 heat has quit [Remote host closed the connection]

00:19 <klys> how's that FAT driver working out for ye?

00:20 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

00:22 <gog> thank you i'll be here all night

00:22 <gog> but not really i'm gonna go to bed soon

00:35 Likorn has quit [Quit: WeeChat 3.4.1]

01:12 psykose has quit [Remote host closed the connection]

01:13 psykose has joined #osdev

01:22 gog has quit [Ping timeout: 276 seconds]

01:23 <geist> klys: basically have more or less full RO working. Been pondering how to deal with more structural open file/dir tracking in prep for RW support

01:23 <geist> https://github.com/littlekernel/lk/tree/master/lib/fs/fat is the current state

01:23 <bslsk05> github.com: lk/lib/fs/fat at master · littlekernel/lk · GitHub

01:23 <geist> oh also started to write some more comprehensive unit tests

01:28 bradd has quit [Quit: No Ping reply in 180 seconds.]

01:30 bradd has joined #osdev

01:33 mahmutov has quit [Ping timeout: 272 seconds]

01:40 <clever> geist: i was just thinking, how do you know what blocks are even free in fat?

01:40 <geist> they have a zero in the FAT

01:40 <clever> my understanding, is that the FAT table still has the singly linked list for deleted files?

01:40 <geist> that's probably the primary reason cluster 0 (and 1) are unused by definition

01:40 <clever> or are those slots zerod for the entire file?

01:40 <geist> slots zeroed

01:40 <clever> ahh

01:40 <geist> so finding a free block is basically a linear scan of the FAT

01:41 <clever> yeah

01:41 <clever> semi related, i had an old hdd that suffered a drop, and dos wouldnt boot anymore

01:41 <geist> FAT32 has an extra secondary data structure called FSINFO after the BPB that stores some hints to bootstrap the next mount instance, like first free cluster, etc

01:41 <clever> but one day while playing with the "known dead" drive with the covers off, i discovered "dir" still could list files!

01:41 <clever> and only when dir prints the free space, does it go into the click of death

01:42 <geist> so to delete a file in the dir you just mark the first character of the name with 0xe5

01:42 <clever> and having learned more about fat by that point, i knew free space was just iterating over the entire FAT table

01:42 <geist> and presumably you dont touch the rest of it. or at least that's how DOS behaved

01:42 <clever> so i just turned off displaying free space with a flag, and was able to recover some files

01:42 <geist> since the FAT was zeroed out, presumably what would happen is if you wanted to undelete a file you hoped it was in one segment and would recover the clusters started where it was

01:42 <geist> since you would have lost the chain

01:42 <geist> and yeah total free space is one of the hints in the FSINFO

01:43 <clever> this hdd originally ran win 3.11 i believe

01:43 <clever> so it probably predates FSINFO?

01:43 <geist> what's kinda dumb about it is the data structure 'helps' the next mount instance, but then in FAT.pdf it says the next mount must validate the info anyway

01:43 <geist> yes, FSINFO is a FAT32 only thing

01:43 <geist> so if you cannot assume it's up to date then what purpose does it serve?

01:44 <clever> perhaps if it was cleanly unmounted?

01:44 <geist> yah that's my thought. there's another cheezy hack where FAT cluster 1 has basically a mounted/clean unmounted bit you're supposed to set and clear

01:44 <geist> i dunno when that shoed up, but looks like one of those 'oooh we have another bit that previously was ignored, lets use this)

01:44 <geist> the whole design is full of that

01:44 <geist> layers of backwards compatible hacks

01:45 <clever> yeah

01:45 <clever> another fun bug i had

01:45 <clever> one of my mp3 players, lacked directory support

01:45 <clever> all music must be in the root directory

01:45 <geist> yah that was pretty common back then

01:45 <clever> also, the root directory table is of a fixed size

01:45 <geist> but the even worse FAT12 and FAT 16 is fixed size, yeah

01:45 <clever> so the SD card runs out of space way too quickly

01:46 <geist> the default is i think 512 entries

01:46 <clever> i couldnt even use 10% of the card

01:46 <geist> FAT32 fixes that too, actually responsible for a lot of the hackery in the dir parsing code i wrote

01:46 <geist> having to deal with the root dir being a special case (in fat12/16) vs a normal looking file with a cluster list

01:46 <clever> that mp3 player also had a weird quirk with its random play

01:46 <clever> it was prng, with the seed saved somewhere to flash

01:46 <clever> if the battery runs dead mid playback, the seed isnt updated

01:47 <geist> hah

01:47 <clever> so upon starting up again, it plays the same "random" songs again

01:47 <geist> random on embedded is actually kinda a tricky problem

01:47 <clever> i think it ran ~2 days on 2 non-chargable AAA's

01:47 <clever> ~1 day on rechargable ones

01:47 <clever> then i said "screw it" and taped 4 D cells together, in both parallel and series

01:48 <clever> it ran for 6 months :P

01:48 <geist> yah i have some old Olympus voice recorder/m3p player thing that basically has all these limitations

01:48 <geist> from the early 2000s

01:48 <clever> 6 month battery life!!, at the cost of the batteries being 3x the size of the mp3 player

01:48 <geist> like those old lantern style flash lights. kinda miss em

01:48 <geist> had a handle and a large square box under them

01:48 <geist> usually took 2 of those square lantern batteries

01:49 <clever> i think ive got one of those in the garage

01:49 <geist> very 70s

01:49 <clever> my mp3 adapter was very ugly :P

01:49 <clever> 2 D's end to end, tapped to the side of another 2 D's, to form a 2x2 square

01:49 <geist> i never tried to see how the car handles mp3s. i should try it some day

01:49 <clever> paperclips tapped to the ends as a bus bar

01:49 <geist> i'm sure it can deal with an mp3 stick, or even a cdrom with mp3s on it

01:49 <geist> but who carries those around, right?

01:50 <clever> and some speaker wire going to dowels that act as AAA's

01:50 * geist heads out

01:50 <clever> laters

01:54 <klys> night

02:38 Belxjander has quit [Ping timeout: 276 seconds]

02:49 srjek has quit [Ping timeout: 250 seconds]

03:01 knusbaum has quit [Quit: ZNC 1.8.2 - https://znc.in]

03:02 knusbaum has joined #osdev

03:09 nyah has quit [Quit: leaving]

03:55 Gooberpatrol66 has quit [Quit: Leaving]

03:55 Gooberpatrol66 has joined #osdev

04:22 Likorn has joined #osdev

04:26 raggi has quit [Ping timeout: 252 seconds]

04:26 raggi has joined #osdev

04:48 Ali_A has joined #osdev

04:55 <Ali_A> So I had to read 3-4 chapters of intel's manual, reading CSAPP Book chapter 7, reading a couple of books about C, a couple of books about assembly (there were for RISCV so the transition was a bit hard at first) reading a 4-5 chapters about 64 bit assembly, reading SYSV ABI , Linker and Libraries guide from Oracle, gas (gnu assembler) guide, tons

04:55 <Ali_A> 20~30 blogs about the process of linking, relocations, and sort of stuff, (and many other blogs for understanding bootloading from a floppy disk using real mode + many other things), and gdb and IDK, it is been 2-3 month since I started to make my own OS and all of that, was just to get a bootloader running, that loads an image from a compiled C

04:55 <Ali_A> file into qemu, and being able to debug it. I knew OS dev was no walk in the park, but that feels like a bit too much? like shouldn't there be a better way? (tho, I like the result of it, I managed to learn how to not be surprised by how big manuals are and just read them, googling for hours, and learned new tools for sure) but I feel, considering

04:55 <Ali_A> how many OS devs are out there (or at least people who knows what happens at the level of OS) I assumed there is like a standard set of things where u acquire before starting to build your own OS

04:56 <Mutabah> Well... you don't need to write a bootloader really

04:57 <Mutabah> I usually suggest using multiboot and getting grub/pxelinux to load your code

04:57 <Mutabah> (which starts you in protected mode with a working framebuffer)

04:57 <Mutabah> But, there is still a lot of other information you need to know to properly write an OS.

04:58 <Mutabah> There are quiet a few tutorials that will either give you the rough outline, or just show you a away to do it that you just need to copy

05:00 <Ali_A> I mean, I am just writing an OS from scratch, so I at least wanted to be able how loading is happening.

05:01 <Ali_A> what is the natural next step for me in this case? there are many things, different tutorials continue differently after this step, my ultimate goal is to have a shell and port tinyCCompiler or any other C compiler and just probably have a hello world program from my own OS

05:01 <Mutabah> Knowing abstractly how it works is useful, but bootloaders (especially BIOS ones) are lots of effort for not much gain.

05:02 <Mutabah> Protected mode, text output, keyboard input, ...

05:02 <Mutabah> then maybe a filesystem and usermode

05:02 <Ali_A> is usermode necessary?

05:03 <Ali_A> well, I guess it is necessary u don't want to write an app that corrupt your kernel so that was a silly question

05:04 nur has quit [Remote host closed the connection]

05:04 <Ali_A> Mutabah thanks! will look at how to move to protected mode (I assume it is just a few asm instruction) and then will look how to output text from my kernel.

05:07 nur has joined #osdev

05:08 <moon-child> Ali_A: usermode is not necessary if you restrict your applications to a capability-safe (ie memory-safe) language

05:08 <moon-child> if your apps are all written in java, they won't be able to corrupt your kernel no matter what you do, so there is no point in doing hardware context switching

05:09 <moon-child> if you want to support capability-unsafe languages, then yes, you should use hardware privilege controls

05:16 hwdyki has joined #osdev

05:16 <Ali_A> Thanks

05:24 hwdyki has left #osdev [#osdev]

05:33 Likorn has quit [Quit: WeeChat 3.4.1]

05:33 xenos1984 has quit [Read error: Connection reset by peer]

05:33 <geist> right basically most of what you listed off is complexities that exist on the x86 platform because of 40 years of backwards compatibility and lots of hardware flexibility

05:33 <geist> you can skip most of that up front by letting firmware and bootloaders do that for you

05:34 <geist> you may eventually want to figure it out, but it's not a good use of your time getting started

05:34 <geist> and also a great way to really get discouraged

05:40 <moon-child> indeed

05:51 xenos1984 has joined #osdev

05:57 bliminse_ has quit [Quit: leaving]

05:58 lg has quit [Read error: Connection reset by peer]

06:00 lg has joined #osdev

06:04 bliminse has joined #osdev

06:34 <Ali_A> just to be sure on the terminology, firmware refers to the hardware and its ReadOnlyMemory that ships with it right ?

06:34 <zid> not the hardware no

06:34 <zid> just the software that sits on it

06:37 <Ali_A> (y) thanks

06:48 Ali_A has quit [Quit: Connection closed]

06:48 Likorn has joined #osdev

06:55 Likorn has quit [Quit: WeeChat 3.4.1]

07:59 bauen1 has quit [Ping timeout: 256 seconds]

08:43 PotatoGim has joined #osdev

09:09 GeDaMo has joined #osdev

09:42 ptrc has quit [Ping timeout: 240 seconds]

09:42 ptrc has joined #osdev

09:51 bauen1 has joined #osdev

09:56 bradd has quit [Ping timeout: 246 seconds]

09:56 bradd has joined #osdev

10:09 pretty_dumm_guy has joined #osdev

10:39 gog has joined #osdev

10:40 Ali_A has joined #osdev

10:48 divine has quit [Ping timeout: 248 seconds]

10:49 divine has joined #osdev

11:05 divine has quit [Ping timeout: 240 seconds]

11:08 knusbaum has quit [Quit: ZNC 1.8.2 - https://znc.in]

11:08 knusbaum has joined #osdev

11:11 divine has joined #osdev

11:28 nyah has joined #osdev

11:53 <stephe> I see a lot of bootloaders using 16 bit code when stored on the MBR on an i386 machine. So i386 can't run 32 bit code right away or is it just due to trying to keep it within the 512 byte limit of the MBR?

11:53 <zid> cpu turns on in real mode

11:54 <zid> You'd struggle to boot your existing software if it booted in the new fancy incompatible mode

11:55 Burgundy has joined #osdev

11:55 <zid> and by 'new' I mean '1982'

11:58 <stephe> Hmmm ok

11:59 <gog> you can indeed run 32 bit instructions off the MBR during real mode but they add an extra byte of override prefix

11:59 <stephe> Right, so partially backwars compatibility and partly space savings?

11:59 <gog> basically

12:00 <gog> but it's also not necessecary because you can't even address >1MiB until A20 is enabled

12:01 <gog> but getting into protected mode involves some time in spooky mode

12:01 <gog> with A20=1 but CR0.PE=0

12:08 dennis95 has joined #osdev

12:14 <stephe> gog: hmm i see

12:14 <stephe> why is it spooky?

12:15 <stephe> so with a20 enabled you can address 2MB?

12:17 <gog> i was mistaken, spooky mode is when you have a segment selector loaded and then disable CR0.PE

12:18 <gog> and it's spooky because you have the full range of address space for 32 bits but you're still technically in real mode

12:19 <stephe> aha

12:24 <zid> I'd say unreal mode is a cool mode, but then I remembered long mode existed

12:26 Ram-Z has quit [Ping timeout: 276 seconds]

12:27 <gog> long mode best mode

12:27 <zid> all other modes are dead to me

12:27 <gog> any mode without IP-relative addressing is cringe

12:27 <gog> so that leaves only one

12:28 <zid> any mode where you have to fill out more than one tss value is right out, any mode where you don't get at least 32MB of memory is also right out

12:28 <zid> really narrows it down

12:28 <gog> yes

12:29 <gog> it is unfortunate that the design of long mode means you can't really just start with the CPU already in it

12:29 <gog> but that's what UEFI is for

12:29 <zid> that's what my tiny bootstrap program is for

12:29 <gog> also that

12:32 <vdamewood> stephe: You can access more than 2MB with a20 disabled, you just can't access odd-numbered megabytes of memory, (using a 0 index).

12:33 <vdamewood> So, megabytes 0, 2, 4, and 6 would work fine, but 1, 3, 5, and 7 would access the megabyte one lower instead.

12:33 <zid> heh hadn't thought about it that way, but it makes sense

12:33 <zid> it doesn't disable address lines >= 20, just.. 20

12:33 <zid> you have a stuck bit for the parity of megabytes

12:35 <stephe> vdamewood: ahh right

12:35 <stephe> that makes sense

12:48 Ram-Z has joined #osdev

13:05 laocid has joined #osdev

13:14 dude12312414 has joined #osdev

13:19 dude12312414 has quit [Client Quit]

13:24 laocid has quit [Ping timeout: 248 seconds]

13:41 srjek has joined #osdev

13:53 Ali_A has quit [Quit: Connection closed]

14:09 doorzan has joined #osdev

14:14 Ali_A has joined #osdev

14:18 gwizon has joined #osdev

14:23 <Ali_A> regarding bootloaders, I just remembered, I saw a video once about a person who used grub to load his kernel (which does nothing more than an infinite loop)

14:23 <Ali_A> however, neither in linker script nor in the assembly he mentioned where should grub entry's point be? like in my own bootloader, I load my kernel at some sector, and then jump to a specific address within the loaded kernel to start executing the code, that section contains the first byte of my entry point. So my question, does grub have a defined

14:23 <Ali_A> entry point that operating systems should follow or something?

14:23 xenos1984 has quit [Remote host closed the connection]

14:25 <Ali_A> https://www.youtube.com/watch?v=1rnA6wpF0o4&list=PLHh55M_Kq4OApWScZyPl5HhgsTJS9MZ6M&ab_channel=WriteyourownOperatingSystem

14:25 <Ali_A> I just found the video for the reference, (it set's ` .= 0x100000` then puts the `.text` sections together, but that address is never mentioned again)

14:25 <bslsk05> www.youtube.com <no title>

14:26 <vdamewood> Ali_A: https://wiki.osdev.org/Multiboot

14:26 <bslsk05> wiki.osdev.org: Multiboot - OSDev Wiki

14:29 <Ali_A> vdamewood thanks, that explains it, it just uses the .multiboot section to define the type of the architecture, and I believe it then knows that it is an elf, and just jumps to the elf's entry point, thanks!

14:32 <vdamewood> This guy is saying some things I find kind of fishy

14:33 <vdamewood> Like he talks about loading the first two megabytes from the hard drive, no. BIOS systems load the first 512 bytes from the heard drive.

14:33 <vdamewood> hard*

14:33 xenos1984 has joined #osdev

14:35 <Ali_A> I didn't follow his videos (tho I watched the first 3 or so), but after setting my kernel to be an elf (for making debugging with gdb possible) and I know elf format loads the entry point at 0x18 offset from the start of the image, I had to jump to it to start executing my entry point so I remember something like that never happened in this video

14:35 <Ali_A> and wanted to ask

14:50 <gog> you can tell ld where the entry poitn is on the command line, use _start, or it'll complain and default to the start of the first executable section

15:00 Likorn has joined #osdev

15:06 <Ali_A> I usually just use the ENTRY command or -e

15:10 doorzan has quit [Quit: Leaving]

15:13 gwizon has quit [Quit: leaving]

15:42 dude12312414 has joined #osdev

15:43 <gog> yeh

15:54 k8yun has joined #osdev

16:07 Ali_A has quit [Quit: Connection closed]

16:07 Ali_A has joined #osdev

16:11 knusbaum has quit [Quit: ZNC 1.8.2 - https://znc.in]

16:12 knusbaum has joined #osdev

16:16 dennis95 has quit [Quit: Leaving]

16:28 Ali_A has quit [Quit: Connection closed]

16:40 srjek has quit [Ping timeout: 240 seconds]

16:44 srjek has joined #osdev

16:51 doorzan has joined #osdev

17:20 hgoel[m] has joined #osdev

17:47 k8yun has quit [Quit: Leaving]

17:48 vdamewood has quit [Quit: Life beckons]

17:50 hgoel[m] has quit [Quit: Reconnecting]

17:50 hgoel[m] has joined #osdev

17:57 Likorn has quit [Quit: WeeChat 3.4.1]

17:57 ptrc has quit [Remote host closed the connection]

17:57 ptrc has joined #osdev

18:03 Celelibi has quit [Ping timeout: 272 seconds]

18:11 Celelibi has joined #osdev

18:18 genpaku has quit [Ping timeout: 276 seconds]

18:19 <zid> Well that's fun

18:19 <zid> Windows' sound mixer crashed, and now my audio has automatic volume supression somehow

18:22 genpaku has joined #osdev

18:23 <zid> so if any of ya'll get as far in your osdev to get a sound mixer, don't do that pls

18:41 srjek has quit [Ping timeout: 240 seconds]

18:41 xenos1984 has quit [Read error: Connection reset by peer]

18:46 dennis95 has joined #osdev

18:48 Likorn has joined #osdev

18:51 Likorn has quit [Client Quit]

18:57 zaquest has quit [Remote host closed the connection]

18:58 xenos1984 has joined #osdev

19:26 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

19:41 <nomagno> Do conditional branches in ISAs usually just skip the next instruction?

19:42 <zid> nope

19:42 <nomagno> Then what do they usually do?

19:42 <zid> usually a small relative offset added to pc

19:42 <GeDaMo> ARM used to have conditional instructions

19:42 <zid> but still weren't usual

19:43 <nomagno> zid: right so 'jcz 12' usually skips 12 if nonzero?

19:43 <zid> ... if(x) { printf("meow"); } ... -> jz skip1; do exceptional case; blah; blah; skip1: do normal crap

19:43 <zid> you jump to the } effectively

19:44 <nomagno> zid: I'm talking about machine code, not assembly, if that makes sense

19:44 <nomagno> I care about the microops

19:44 <zid> Microop cache does none of this stuff

19:44 <zid> on a modern isa

19:44 <zid> It just completely eliminates the instructions it won't run from the stream

19:45 <nomagno> Why does trying to get sane design lessons from modern CPUs always fail?

19:45 <zid> so if a; b; c; d; e and there's a jump from b to e and it predicts it will be taken

19:45 <zid> it just decodes a; b; e; into the micro-op cache

19:45 <GeDaMo> Sanity is over-rated :P

19:45 <zid> and if the jump really wasn't taken, it'll stall and refill it correctly instead

19:45 <nomagno> That's messed up

19:46 <zid> It's incredibly fast, though.

19:46 <zid> The code itself should still be in L1, so the penalty isn't that big, and the successful prediction is a HUGE win.

19:46 <zid> It's like rewriting the program to have no jumps or conditionals in it ahead of time

19:47 <zid> It turns things like a 'loop over 4 things adding them together' into 4 straight adds, clearing all the overhead of the compare test and jump

19:48 <zid> With a sufficiently clever computer: mov eax, 4; L1: add ebx, 1; dec eax; jnz L1 can just turn into "add ebx, 4"

19:49 <zid> That's why we have a billion transistors implementing the same thing as 30k transistors did in the 80s :p

19:50 <zid> and why they use 250W of power

19:52 <nomagno> I'd take a decrease-in-ops-per-Mhz if it means all of a sudden there's more than three CPU vendors.

19:52 <zid> for what reason

19:52 <zid> nobody else would

19:52 <nomagno> On the grounds of not liking oligopolies

19:52 <zid> and you can always use an 8051 if you're serious about that

19:53 <GeDaMo> There are more than three vendors but none of the others make anything you actually want to use :P

19:53 <geist> sure or use some arm or riscv cpu. tons of vendors there

19:53 <geist> but anyway, modern superscalar design is complex but not that different between vendors. it's pretty standard stuff

19:54 <geist> just varying levels of tradeoffs and design ability of the different teams at different companies is all

19:54 <nomagno> I'm trying to use RISC-V yes. But I also want to buy a patent-free one, which means being picky

19:55 <geist> you can't always get what you want

19:55 <nomagno> Starlight got close but it clearly wasn't good enough to get regular runs

19:55 <nomagno> BeagleBoard should release something in the next few years that is decent

19:56 <geist> anyway i dunno how you're getting superscalar design and patent free and whatnot intermixed. you're saying you only want cpus that are sufficiently old that they dont use modern, patented techniques?

19:57 <geist> if so, as zid says. use something 20+ years old

19:58 <nomagno> geist: No, I'm saying on the one hand I think superscalar design gatekeeps the industry, and on the other that when I buy RISC-V it'll be something patent-free, which really isn't that far fetched

19:58 <mrvn> zid: why should it include c, d? Should it do that for jumps covering 1GB? That's the same as predicting the jump not being taken (or predicting both branches being taken)

19:59 <geist> i think it's more like the branch predictor will up front decide which way the branch is going to go, and the cpu will start running that path. it should be correct most of the time

19:59 <j`ey> nomagno: are you sure its patent free? i thought it was one of those things where people with patents allpw them to be used by riscv

20:00 <zid> risc-v will be slow as fuck without patented implementation techniques

20:00 <nomagno> j`ey: RISC-V is specifically not patented

20:00 <zid> might as well use an 8 051

20:01 <nomagno> zid: I'm trying to support open development in general, thank you for expressing your disillusionment.

20:01 <zid> reality*

20:01 <zid> If you braindead implement risc-v like we did in the 80s it will.. perform like an 80s chip

20:01 <nomagno> Well you just add more hertz!!!!

20:02 <zid> it isn't some magic isa with easy performance gains, it's actually a really crap isa that clever implementation may make fast enough to be usable

20:02 <mrvn> Here's some food for thought: All the pipelining and branch prediction and such is because waiting for stuff to finish before doing the next takes way too long. But why not extend hyperthreading way way more? Run 8, 16 or even 64 threads and waiting for the condition of a branch to compute is a none issue.

20:02 <zid> power.

20:03 <mrvn> you save power by eliminating the pipeline gates

20:03 <zid> It's the literal answer.

20:03 <zid> The reason you don't, is it's incredibly power wasteful

20:04 <zid> You could just pack in twice the clockrate and then waste 20% on branch prediction, instead of using 200% the power

20:04 <mrvn> You can also better utilize functional units. Instead of 6 cores with 3 adders each you have 1 core with 18 adders used by 64 threads.

20:04 <zid> branch prediction is a win in 99% of cases, that means you're running the wrong code 99% of the time.

20:05 <nomagno> zid: It's pretty clear RISC-V gets destroyed by ARM and AMD64 any day of the week, doesn't really change my sentiment that supporting ISAs where it's reasonable for new players to enter the market at speedy rates is a decent idea

20:05 <mrvn> zid: branch prediction is only a win if you don't have any other work to do inbetween.

20:05 <zid> go for it, but it isn't practicable

20:06 <zid> chip fab is expensive, designing cpus is expensive. The ISA is the easy boring bit.

20:06 <zid> The reason you use an existing ISA is to cover the costs of the former, you get to be a VIA selling chips into a market you don't own.

20:06 <zid> or a cyrix

20:06 <mrvn> The thing is that the average PC now has something like 4-6 cores and software (and games) are utilizing it more and more. Getting 8-64 threads do actual work is not impossible.

20:07 <zid> *only* using my fpu units already breeches my power budget.

20:10 <mrvn> zid: try using >80% of your cpus functional units

20:10 <zid> okay give me 400W more cooling and I'll try

20:11 <mrvn> just let it throttle down

20:11 <raggi> I don't always wear my power budget, but when I do I stay warm

20:11 <zid> sounds identical to doing nothing to me, but you get worse latency

20:12 <zid> I get to run the genuinely single thread things at 5GHz too instead of 1GHz

20:12 <mrvn> zid: nah, you have to do something useful, not just dummy ops to keep units busy.

20:12 <zid> so now you owe me 400W and a unicorn program

20:13 <nomagno> I don't know, while it may very well be faster to do CPUs the modern way with their speculative execution and OOE and all of that, on the other side why does AMD64 have like 3 layers of abstraction between almost-unnavigable machine code and dead-simple microcode?

20:13 <zid> yes, an amd64 'may be faster' than an 8051.

20:13 gwizon has joined #osdev

20:14 <nomagno> I get legacy compat, but the 90/10 rule is an understatement with x86

20:14 <mrvn> nomagno: backward compatibility

20:14 <zid> nobody cares about how complicated the machine code is, concentrating on decode speed is a relic from the 80s before CPUs were faster than memory.

20:15 <zid> As soon as memory is slower than the cpu your considerations completely invert.

20:15 <zid> At that point you care about keeping caches filled and speculation for read-ahead etc

20:16 <nomagno> zid: A monster ISA is not really better by any possible heuristics ever tough

20:16 <zid> Except the one that matters, performance.

20:16 <mrvn> Would be fun to see a massively SMP 6502, like 1024 core.

20:17 <nomagno> zid: what's the logic behind a monster ISA being faster than a small one?

20:18 <zid> nomagno: hardwre is faster than software.

20:18 <mrvn> nomagno: the ISA isn't the bottleneck

20:18 <zid> a magic complicated crc32 instruction

20:18 <zid> is VASTLY faster than a software implementation of crc32

20:18 <nomagno> Not being the bottleneck wasn't my point. It was x86_64 is ugly as fuck

20:18 <zid> ngl it very much appears to me that you're just here to troll

20:19 <moon-child> nomagno: oh yeah, well

20:19 <moon-child> you're ugly as fuck

20:19 <mrvn> nomagno: but it runs win95.

20:20 <nomagno> I'm not trolling, but that's unironically what a troll would say so fuck. I'm just frustrated at modern machines

20:20 <nomagno> The complexity is justified but still not-nice

20:20 <mrvn> nomagno: and nobody is arguing that x86 is ugly as fuck

20:20 <mrvn> why do you think so many work on ARM instead?

20:21 <nomagno> Well I think so. It isn't very objective mind you

20:21 <nomagno> mrvn: nobody is or isn't?

20:21 <nomagno> I'm not so sure nobody is arguing for either side honestly :P

20:21 <mrvn> nomagno: it's a 64bit system based on a 32bit extension of a 16bit cpu build out of a 8 bit design copied from a 4 bit crap.

20:22 <zid> and as mentioned, it's *completely* irrelevent

20:22 <sbalmos> finish out the line

20:22 <mrvn> nah, I think the 4 bit was an original design

20:22 <nomagno> mrvn: Yeah well I'm sure you can find some people that like it, but anyways

20:22 * moon-child grabs popcorn

20:22 <sbalmos> by a 2 bit company that can't take 1 bit of criticism

20:22 <zid> it'd be relevent only if decode speed became a major factor again somehow, and if we lost access to C compilers

20:22 <GeDaMo> nomagno: https://archive.computerhistory.org/resources/text/CDC/cdc.6600.thornton.design_of_a_computer_the_control_data_6600.1970.102630394.pdf

20:22 <mrvn> sbalmos: oh that part :)

20:22 <sbalmos> yup

20:23 <mrvn> nomagno: the ISA is for legacy code and since it takes less than 1% of power / gates nobody cares.

20:24 <mrvn> (except us poor programmers)

20:25 <mrvn> Is there any x86_64 CPU yet that has dropped 16bit support?

20:25 <nomagno> On a related note, the ME/PSP are like, total security holes

20:25 <zid> I believe someone mentioned there was an embed range that didn't have it

20:25 <kingoffrance> +with software from a 2 bit company <insert vendor you dislike here> that cant stand 1 bit of competition

20:25 <zid> if you consider 'being able to use your PC' a security hole, then yes

20:26 <zid> There was always a ME, just now you know some vague details about it.

20:26 <mrvn> And if you don't want to mess with ME/PSP you can always put your rootkit on the NIC.

20:26 <zid> or literally anywhere

20:27 <mrvn> Run a nice little ARM OS on the WiFi card.

20:27 <zid> the point of these firmwares is to run, it doesn't matter which you modify

20:27 <zid> the ME is at least signed

20:27 <nomagno> The capabilities of the ME and PSP are kinda not the same as those of any kind of predecessors

20:27 <mrvn> I would love to be able to run a tiny webserver on the wifi card.

20:27 <nomagno> Doesn't it also have network capabilities?

20:28 <zid> always has done

20:28 <zid> wake-on-lan isn't new

20:28 <mrvn> nomagno: it has access to the NIC

20:28 <mrvn> zid: Question then is: Can it send network traffic while asleep?

20:28 <zid> Yes

20:28 GeDaMo has quit [Remote host closed the connection]

20:29 <mrvn> I could imagine the NIC powering down the sending part of the hardware when in standby

20:29 <nomagno> Well that's not very nice is it. Signed firmware that can't be turned off and likely will never be patched on most computers, and has network capabilities

20:29 <nomagno> At that point I'd prefer the firmware not be signed by intel, and it requires you to enter a DNA sample to swap it

20:29 <nomagno> :P

20:30 <mrvn> nomagno: worse: you can't patch it because you can't sign it.

20:30 <mrvn> And the firmware is controlled by foreign states

20:30 <nomagno> mrvn: Well intel releases patches

20:30 <nomagno> So not exactly true

20:30 <mrvn> nomagno: intel is not you

20:31 <mrvn> They and the US can patch in a backdoor into your system. They will never ever patch one out.

20:31 <zid> You don't

20:31 <zid> have to plug anything into the system NIC port

20:31 <mrvn> zid: doesn't help if it's a shared port

20:31 <zid> The one on the mobo is the special system access and control port

20:32 <zid> which the ME uses for remote management in the server room

20:32 <zid> if you give a shit about it being connected to the internet.. don't

20:33 <nomagno> mrvn: well you can patch it

20:33 <nomagno> You are applying the patch

20:33 <nomagno> Yes, I know semantics is irrelevant here, but it irked me :P

20:33 <mrvn> I always hate the servers where eth0 and management share a physical connector. It's insecure and I haven't seen one that doesn't break and need a power cycle every now and then.

20:34 <mrvn> nomagno: by "you can't patch it" I obviously mean you can't fix the code yourself.

20:35 <mrvn> Apropo hiding code somewhere: The x86 MMU is turing complete. Just compute stuff on it.

20:35 <nomagno> Is there any part of x86 that isn't turing complete?

20:35 gwizon has quit [Quit: leaving]

20:36 <mrvn> floppy controller?

20:36 <nomagno> I bet the list of turing-complete instruction mnemonics is greater than that of non-turing complete

20:36 <nomagno> has anyone actually taken a look at this!? :P

20:37 <mrvn> sure. there are lots of papers on minimal opcodes required for a cpu

20:37 <zid> turing completelness is a pretty low bar

20:37 <zid> the problem is the infinite tape

20:37 <nomagno> No no, I'm talking about specifically x86 instruction mnemonics

20:37 <nomagno> the ones that depending on the arguments and context change opcode

20:37 <mrvn> well, the infinite tape part is always ignored for a finite tape large enough to do at least something

20:38 <nomagno> the MOV instruction/mnemonic is LBA-complete by itself, by a long shot, for instance

20:38 <zid> "x86 mmu is turing complete, you just need 128TB of ram per conventional kilobyte" isn't as impressive :p

20:38 <mrvn> nomagno: nand is. add I believe isn't.

20:38 <zid> "and it runs at 4IPC, instructions per century"

20:38 <mrvn> zid: but I have 128TB :)

20:39 <nomagno> Is there actually any way to actually build a computer with only non-conditional NAND applications on a turing machine?

20:39 <mrvn> nomagno: yes

20:39 <nomagno> Because it seems to me like you'd need a NAND tree, rather than a NAND list

20:40 <mrvn> so?

20:40 <nomagno> mrvn: yes to the first or second message?

20:40 <nomagno> so what?

20:40 <nomagno> urgh I'm confused now

20:40 <mrvn> a turing machine can compute anything that is computable. Evaluating a nand tree certainls is computable.

20:41 <nomagno> mrvn: No, I'm saying if a NAND tree is turing-complete

20:41 <nomagno> asking*

20:41 <mrvn> nomagno: you can build a turing machine out of nand gates

20:42 <mrvn> connect both inputs and you have NOT. That gives you AND, OR, NOR, XOR and every other gate you usualy have.

20:42 <nomagno> mrvn: yes, I played the little game where you do it, but the thing is I'm not sure which data structure would be able to represent the arrangement of nand gates necessary

20:42 <nomagno> a tree? a directed graph?

20:42 <mrvn> s/tree/graph/ by the way

20:42 <mrvn> a directed cyclic graph

20:43 <nomagno> mrvn: Yes, I know that, there's a little web game somewhere where you put it in practice, neat

20:43 <nomagno> but what kind of graph?

20:43 <nomagno> Necessarily cyclic?

20:43 <mrvn> yeah. Has to loop somewhere

20:43 <nomagno> HM... yes

20:44 <mrvn> You can compute anything with a tree or non cyclic graph but a turing machine needs to loop

20:44 <mrvn> you don't want to build one tree per problem.-

20:44 gwizon has joined #osdev

20:44 <nomagno> I'm not actually sure you'd be able to make useful memory out of this tough, don't the weird toy NAND flip-flops that keep state need essentially to be async to work?

20:45 <mrvn> nomagno: you can build all the gates out of NAND and then just implement the common flip flops

20:46 <mrvn> I believe some of the common flip flop designs (inclusing master/slave) even use NAND naturally.

20:46 <nomagno> mrvn: they do

20:46 <nomagno> mrvn: yes but the issue now is that is the nand cyclic graph can only be executing one NAND at a time, you can't keep alivethe signal can you?

20:47 <nomagno> you need multiple NAND 'tape heads' going trough the graph at the same time to achieve persistent state, which I guess makes sense

20:47 <mrvn> heah? every NAND gate in the graph evaluates every tick

20:48 <nomagno> mrvn: Hm...

20:48 <nomagno> you're right

20:48 gwizon has quit [Client Quit]

20:49 <mrvn> if you want to simulate this digitally then you double buffer all gates so signals take 1 tick to propagate through the gate.

20:49 <mrvn> Same if you simulate it analog and want it accurate

20:50 Likorn has joined #osdev

20:51 <nomagno> I don't see how double buffering will help simulate proper propagation

20:52 <mrvn> you compute all the outputs from the inputs into an internal buffer. None of the inputs change during the computation. Then when it's all done you propagate the internal results to the output and all the inputs change.

20:52 <geist> re: riscv and patents. the riscv *design* is patent free but that doesn't mean you can't build an implementation that uses all sorts of tricks

20:52 <geist> and i dont think the riscv license forbids you from using patented implementation techniques

20:53 <nomagno> geist: It doesn't

20:53 <nomagno> that's why I didn't say I'm buying a RISC-V CPU

20:53 <nomagno> I said I'll try to wait for a patent-free RISC-V CPU

20:53 <geist> sooo..... that doesnt leave a lot

20:53 <nomagno> which is a narrower subset

20:53 <nomagno> anyways mrvn: I see, thanks, useful insight!

20:53 <geist> how do you know something is patent free?

20:53 <geist> because it says it is doesn't mean it doesn't infringe on patents. the creator just may not know it

20:53 <zid> geist: That's what I mentioned earlier, might as well use an 8051 if you don't want patents, risc-v will be just as slow without patented implementation techniques.

20:54 <nomagno> geist: Same can be said of copyright

20:54 <geist> right. something so old its bound to have not touched any modern patents

20:54 <mrvn> nomagno: it's something that you need for any cellular automaton and that most game designers aren't capable of using.

20:54 <zid> Plus who gives a fuck if something was *created* using patents.

20:54 <zid> Everything is

20:54 <nomagno> you can make a copyright license and it's considered safe. Why wouldn't a patent license be safe?

20:54 <zid> coke, bread, water

20:54 <geist> nomagno: sure, but one can claim something is patent free but it doens't mean anything when the lawyers find it infringes

20:54 <geist> that just means the creator is saying they dont knowingly use any of their own patents

20:55 <geist> that doesn't give you some sort of legal protection

20:55 <geist> though IANAL

20:55 <nomagno> mrvn: Do you really need it for a cellular automaton? ... Oh yeah I see what you mean, my implementation of rule 110 a while ago definitely wasn't stupid enough to edit the buffer live

20:55 <geist> whereas if you use an implementation from a company that says 'we license this for use in X and we will give you patent protection if you use it' you're in basically a better place

20:55 <nomagno> So that's what you mean with double buffering

20:55 <nomagno> Makes sense

20:56 <mrvn> nomagno: it's also something you want to multithreading. Because with this simple buffering trick you can compute all elements in any order on any number of cores.

20:56 <nomagno> mrvn: Yeah I'm pretty sure I actually apply this everywhere without calling it double buffering :P

20:57 <mrvn> nomagno: it's what you call it in graphics. :)

20:57 doorzan has quit [Remote host closed the connection]

20:57 <mrvn> you have it a lot on the hardware level too. Just think how many latches your CPU has.

20:58 <nomagno> mrvn: Yeah well my game's software renderer is underway, but mentally I was just thinking 'on my side I have a buffer, then I blit it to the screen when the frame is done'

20:58 <mrvn> screen being the second buffer hence double buffering

20:58 <nomagno> Yep

20:59 <nomagno> Is anyone actually not actively doing?

20:59 <mrvn> https://www.youtube.com/watch?v=xP5-iIeKXE8

20:59 <bslsk05> 'Life in life' by Phillip Bradbury (00:01:30)

20:59 <nomagno> doing this*?

20:59 <mrvn> nomagno: a ton of games

20:59 <nomagno> 'oh yeah I'll just copy stuff to the screen live, makes perfect sense'

20:59 <nomagno> I guess the atari 2600 games DID do this, which is bonkers

21:00 <nomagno> they actually blitted just in time for the CRT scanning beam to copy the stuff over

21:00 <nomagno> shit tier machine

21:00 <mrvn> A lot of games don't buffer the game state and then you can't tirivially multithread the code or at all.

21:01 <nomagno> mrvn: I think my game state isn't actually buffered, thinking about it. My reasoning was since the output is the only way to know when it changes, and that is obviously buffered, it's fine

21:01 <nomagno> But yeah that's a fair point

21:02 <nomagno> Thanks, actually!

21:02 <nomagno> I'll write this down in my design notes

21:02 <mrvn> nomagno: as soon as the output is the input of something else you introduce a dependency there

21:03 <nomagno> mrvn: is that to say my strategy is fine or I should make it double buffered to make it more modular?

21:03 <mrvn> nomagno: i would always double buffer to break the dependency

21:04 <nomagno> Was thinking so too, I'll add it to my notes then for sure

21:04 <mrvn> Otherwise you end up with e.g. items moving down faster than up.

21:05 <mrvn> Or things depend on the order you place them on the board. Worse, saving a loading a game can change the order of items and suddenly the timing changes.

21:05 <mrvn> and as said, the big one: double buffered you can trivial multithread stuff.

21:07 <mrvn> nomagno: another tip: you don't have to copy the data in the propagation phase. Just have a "Foo output[2];" and a global "int phase" that switches between 0 and 1 every tick.

21:08 <mrvn> output[phase] is last turns output (this turns input) and output[1-phase] is this turns output

21:09 <nomagno> http://halfworld.nomagno.xyz/specs/HWDESIGN.html

21:09 <bslsk05> halfworld.nomagno.xyz: Half-World Design N otes

21:09 <nomagno> last bullet point at the bottom of the page

21:09 <nomagno> thanks!

21:09 <nomagno> wait NOOO

21:09 <nomagno> I accidentally added a space in the notes of the page description

21:09 <mrvn> N otes?

21:09 <nomagno> mrvn: RIP me

21:10 <nomagno> My thumbs and pinkies are very trigger-happy sometimes

21:10 <mrvn> otes are a kind of cereal, right?

21:10 <mrvn> .oO(and now I'm hungry)

21:11 <nomagno> I think it wasn't a good idea to store the docs in the same public repo as the code

21:11 <nomagno> my commit count gets inflated a lot every time I make typos

21:11 <nomagno> http://halfworld.nomagno.xyz/specs/HWDESIGN.html

21:11 <geist> does that really matter though?

21:11 <mrvn> nomagno: why not? https://github.com/mrvn/pi-by-hand

21:12 <mrvn> nomagno: I like documenting stuff in .md files

21:12 <nomagno> Should have been fixed, if the bot didn't hate me

21:12 <nomagno> mrvn: I like it too!

21:13 <mrvn> nomagno: I would even say code and docs should be in the same repo so you always have the code and docs match. Even more have it in the same file.

21:13 <nomagno> mrvn: Hm... fair

21:13 <nomagno> Well I try to make everything be self-documenting... and then I add tons of comments anyways because verbosity cool

21:13 <mrvn> saddly, that still doesn't garantee people will update the docs when they update the code.

21:14 <mrvn> a++; decrement a by 1.

21:14 <mrvn> +//

21:14 <nomagno> Any suggestions to improve my website workflow? I guess my current approach is safe. I commit to the repo, push, log in manually to the webserver box, then git pull

21:15 <nomagno> An automatic git pull would risk someone hacking some less secure device and uploading malware to the website trough the docs/ folder

21:16 <mrvn> why not push to the webserver when you commit/merge to the stable branch as commit hook using an ssh key?

21:16 <mrvn> or when you sign a commit

21:16 <nomagno> commit signing is a good idea, the ssh hook IDK

21:17 <mrvn> ssh just takes care of the manual part

21:17 <nomagno> I guess

21:17 <nomagno> But wouldn't this require the source forge to have some kind of CI/CD?

21:18 <mrvn> You can have the webserver fetch, check the signature and only then merge

21:18 <nomagno> Aaah yes! Put that crontab to use

21:18 <nomagno> currently I only really run a cronjob for http://halfworld.nomagno.xyz/traffic.html

21:18 <bslsk05> halfworld.nomagno.xyz: Half-World Traffic

21:18 <mrvn> but a commit hook is nicer as you don't have to wait for cron to fire.

21:19 <nomagno> I really need to find a better system to log web traffic, with apache every few hundrer visits the logs get rotated

21:20 <nomagno> so that page isn't as useful as it could be

21:20 <mrvn> rotate by size, not time

21:21 dequbed has quit [Quit: bye!]

21:21 fwg has joined #osdev

21:23 <nomagno> mrvn: it's not me rotating it

21:23 <nomagno> It's apache2, no idea how to change it

21:23 <nomagno> ooooh search engines, smart!

21:29 <nomagno> Oh, I'm dumb

21:29 <nomagno> I'm using nginx, not apache

21:31 <nomagno> Hm... I'll make them rotate weekly, actually

21:36 dennis95 has quit [Quit: Leaving]

21:57 Burgundy has quit [Ping timeout: 276 seconds]

22:01 immibis has quit [Ping timeout: 256 seconds]

22:13 fwg has quit [Quit: .oO( zzZzZzz ...]

22:14 immibis has joined #osdev

22:33 lainon has joined #osdev

22:37 RAMIII has quit [Quit: WeeChat 2.8]

22:48 Ali_A has joined #osdev

23:14 <Ali_A> https://wiki.osdev.org/Stack#Stack_example_on_the_X86_architecture

23:14 <Ali_A> I was reading this and it says, "ake care when implementing your kernel. If you use segmentation, the DS segment should be configured to have its base at the same address as SS does. Otherwise you may run into problems when passing pointers to local variables into functions, because normal GPRs can't access the stack the way you might think." I

23:14 <Ali_A> don't quite get that, any ideas?

23:14 <bslsk05> wiki.osdev.org: Stack - OSDev Wiki

23:14 <Ali_A> what does it mean, "normal registers can not access the stack the way u think"

23:19 <zid> ds and ss not being in the same range puts locals and statics/dynamics into different address spaces effectively

23:19 <zid> so if you copied the value of esp into esi for example

23:19 <zid> *esi would no longer refer to the same place *esp does

23:19 <zid> which will make a C compiler *very* unhappy

23:23 <Ali_A> I am not very familiar with segmentation mode, but I thought `data segment` points to `.data` in the assembler? and that is a segment (at least on ELF format) points right after `.text segment` , while I know that the stack usually is just below the kernel and grows downward, so they are very far from each others no?

23:23 <zid> no

23:24 <zid> ds is a selector that determines the attributes of regular memory accesses

23:24 <zid> mov eax, [esi] and such

23:24 <zid> permissions, base, range, etc

23:24 <zid> ss is a selector that determines the attributes of stack accesses

23:24 <zid> push, pop, etc

23:25 <zid> (and cs for code, instruction fetches)

23:25 <Ali_A> I might need to re-read chapter 3 from intel's manual, because I thought it was stack segment (SS), code segment (CS) data segment (DS)

23:26 <zid> You could for example, load your code to 0x0, your data to 0x10000 and put your stack at 0x20000

23:26 <zid> and then set cs, ds and ss's bases to 0, 10000 and 20000 respectively

23:26 <zid> and then they all see their respective data at '0'

23:28 <zid> so esp of b00 would give 'push blah' -> base 20000 offset b00 -> access address 20b00

23:28 <zid> and esi of b00 would give 'mov [esi], blah' -> base 10000 offset b00 -> access address 10b00

23:28 <zid> That's called a segmented memory model and it's awful please don't

23:29 <Ali_A> yeah, this is the first time I hear or this, '=D , besides chapter 3, which part I should read to understand this in a bit more detail + (yeah I never used segmented memory mode, so no wonder)

23:29 <zid> You will never see non-zero bases

23:29 <zid> so read it if you get bored

23:29 <zid> but only then

23:31 <Ali_A> thanks!