#osdev on 2021-07-14 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:01 <vin> I guess it's something most people want to achieve but the world is only making it harder with 100s of distractions.

00:02 Arthuria has quit [Remote host closed the connection]

00:04 sortie has quit [Quit: Leaving]

00:12 zoey has quit [Ping timeout: 255 seconds]

00:14 silverwhitefish has quit [Quit: One for all, all for One (2 Corinthians 5)]

00:20 dutch has quit [Quit: WeeChat 3.2]

00:23 dutch has joined #osdev

00:23 <heat> can osdev generally be weighed in as "relevant work experience" on a kernel/embedded/os dev job?

00:27 silverwhitefish has joined #osdev

00:33 <klysm> vin, how did you arrive at your above question? are you a disciplined person? why so/not?

00:38 <klysm> and, I am generally working on a problem until the inevitable peek at irc, which usually takes me away from my work. as an aside, others have generally not expressed an interest in solving problems that I create. I've gone towards several projects in the last several years, and done so mostly solo. Talking can be educational, just that not everyone can see things the same way. This would be required for collaboration.

01:00 <vin> klysm: I started reading "deep work" (I could have unconsciously picked it up because I was not happy with the amount of work I do in a day). Especially when working alone on a project with no deadline and no schedule to put pressure on me, I realized I constantly seek instant gratification although I really want to do a good job on the project.

01:03 <vin> klysm: Am I a disciplined person? Right now I don't think so. Which is why I believe creating silos is important. DNS blocking, throwing my phone away, are some things I am resorting to. I wonder if this normal?

01:03 Oli has joined #osdev

01:03 <vin> *this is normal

01:11 gog has quit [Ping timeout: 265 seconds]

01:21 srjek|home has quit [Ping timeout: 255 seconds]

01:37 <doug16k> omg. two triangles 640x480 32bpp 24bit-z opengl 3.3 core simplistic shader render is 11-13 microseconds per frame, lol

01:38 <doug16k> ah. it's hex. ok not that impossibly good

01:38 <doug16k> oops

01:39 <doug16k> 71-73us per frame

01:39 <doug16k> debug context though

01:52 vai has quit [Ping timeout: 255 seconds]

01:58 <klange> if any graphics masters [*cough*] want to take a poke at my shoddy little bilinear filter transformation system and help make it faster...: https://github.com/klange/toaruos/blob/master/lib/graphics.c#L764

01:58 <bslsk05> github.com: toaruos/graphics.c at master · klange/toaruos · GitHub

02:11 freakazoid333 has joined #osdev

02:12 ids1024 has quit [Ping timeout: 256 seconds]

02:30 ids1024 has joined #osdev

02:41 sts-q has quit [Ping timeout: 256 seconds]

02:44 sts-q has joined #osdev

02:44 ElectronApps has joined #osdev

03:03 isaacwoods has quit [Quit: WeeChat 3.2]

03:53 nyah has quit [Ping timeout: 250 seconds]

03:56 Izem has joined #osdev

03:58 <Izem> what's an interesting idea for a graphics layer if you are only interested in ascii?

03:59 <Mutabah> A framebuffer-backed terminal you mean?

03:59 <Izem> yeah, but specifically I don't want to go the terminal route with escape codes and all that

04:00 <Mutabah> You could make your API use out-of-band signalling

04:00 <Izem> graphics are not gonna be important to me for a while

04:00 <Mutabah> or be stateless (and calls that provide the rendering particulars)

04:01 <Izem> out of band signalling sounds like a terminal?

04:01 <Izem> oh right, I messed up

04:01 <Izem> I meant english text

04:01 <Izem> not ascii

04:02 <moon-child> Izem: look at bearlibterminal and libtickit

04:02 <moon-child> good libs for text-based graphics

04:03 <moon-child> tickit has a concept of a 'pen', which is associated with some set of style information (fg/bg colour, bold/italic/underline, ...), and you can say 'draw using this pen'. It keeps the good parts of palettes (a la curses) without the bad

04:06 <Izem> thanks

04:07 <klange> Curses was more of a solution to a problem that no longer exists than it was a good approach to managing TUIs.

04:07 <moon-child> at the lowest level, the simplest thing is double buffered grid of cells, where each cell contains style information and text (or a note that the current cell is the continuation of a double-width character in the previous cell). There is more you can do, but past a certain point it stops making sense to call it 'text' (or, it stops making sense to put all that in a text-only framework)

04:07 <klange> ^ This is pretty much how my terminal works.
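
The double-buffered cell grid moon-child describes can be sketched roughly as follows. This is a minimal illustration in Python, not code from klange's terminal or from libtickit; the names `Cell`, `CellGrid`, `put`, and `flip` are hypothetical, and the style fields (`fg`, `bg`, `bold`) are just one assumed choice of "style information".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """One character cell: text plus style, or a continuation marker."""
    text: str = " "
    fg: int = 7                 # assumed palette index; could equally be 24-bit RGB
    bg: int = 0
    bold: bool = False
    continuation: bool = False  # right half of a double-width character


class CellGrid:
    """Hypothetical double-buffered grid: draw into `back`, then flip()."""

    def __init__(self, width, height):
        self.width = width
        self.height = height
        self.front = [[Cell() for _ in range(width)] for _ in range(height)]
        self.back = [[Cell() for _ in range(width)] for _ in range(height)]

    def put(self, x, y, text, wide=False, **style):
        """Write one character into the back buffer."""
        self.back[y][x] = Cell(text=text, **style)
        if wide and x + 1 < self.width:
            # Mark the next cell as belonging to the wide character.
            self.back[y][x + 1] = Cell(text="", continuation=True, **style)

    def flip(self):
        """Return the cells that changed and make them the visible state."""
        dirty = []
        for y in range(self.height):
            for x in range(self.width):
                if self.back[y][x] != self.front[y][x]:
                    dirty.append((x, y, self.back[y][x]))
                    self.front[y][x] = self.back[y][x]
        return dirty
```

Only the cells returned by `flip()` need to be redrawn, whether that means blitting glyphs to a framebuffer or emitting output to a serial line.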

04:08 <moon-child> klange: talking specifically about curses's colour pairs; partly they're part of the hardware (afaik?), but they're also meaningful at the application level as a way to create a set of associated widgets

04:09 <moon-child> e.g. alpine uses them this way

04:09 <klange> Color pairs aren't really a thing, nothing does paletized fg/bg colors like that that I'm aware of, and frankly I think that's one of curses' biggest sins that did not adapt as terminals got better.

04:10 <klange> Now that we can throw 24 bit colors at each of the fg and bg, having a limited set of pairs to pick from is an antiquated relic of the days when memory was measured in kilobytes.

04:10 <moon-child> yes, I hate colour pairs with a passion

04:10 <Izem> conversely, have people gotten to do osdev without a ui? I can't picture that

04:11 <moon-child> even tried to get them to let me fix it for ncurses (https://lists.gnu.org/archive/html/bug-ncurses/2019-08/msg00019.html)

04:11 <bslsk05> lists.gnu.org: A colouring api that doesn't suck

04:14 <klange> Plenty of kernel projects that don't touch anything front-end-related, but I think without at least some attempt at a user interface you're not doing "operating system" development if you're _just_ doing a kernel with no user-facing way to do stuff with it.

04:16 <klange> As the classic GNU/Linux copy-pasta implies, a kernel alone is not an operating system.

04:17 <Izem> that made me think of a server focused os, but yeah without a ui how do you poke about? :P

04:17 <Izem> this will be interesting

04:18 <moon-child> you don't need much more than dumb tty to run a shell. Can even punt on the actual display part if you do net and expose telnet (or ssh even!) instead

04:20 <moon-child> s/much//

04:20 <klange> Serial or bust.

04:21 <Izem> do emulation tools support serial?

04:21 <klange> Of course.

04:21 <klange> https://klange.dev/s/Screenshot%20from%202021-07-14%2013-21-05.png

04:21 <Izem> like vbox and vmware

04:21 <Izem> oh ok cool

04:23 <klange> You'll find options such as serial-over-TCP, serial-over-Unix-socket, serial to a file, and qemu has both serial on stdio and also has options for tabs in the GUI frontends though I have no idea what sort of terminal emulation it supports.

04:24 <klange> This clock reminds me that I should really implement timezone stuff so I can do UTC RTC...

04:24 <kazinsal> vmware's serial-over-tcp on ESXi requires a proper full monty vSphere license

04:24 <kazinsal> but yeah, serial over whatever you want is common

04:24 <klange> Workstation supports pretty much everything but hides virtually everything in a config file.

04:26 <klange> Quick explanation of the QEMU options in that screenshot since it's kinda odd:

04:26 <moon-child> don't you have to pay for all the versions of vmware?

04:26 <klange> No, Workstation is free-for-noncommercial-use.

04:26 <moon-child> I mean, month-long free trial (that you can reset at will with questionable legality), but

04:26 <moon-child> ah, hmm

04:27 <klange> It's a pain in the ass for actually debugging, with intentional misfeatures like having to restart the whole application to get back to the machine configuration.

04:28 <klange> `-nographic` pretty self-explanatory, `-no-reboot` quits on restart and exiting the shell triggers restart, so `exit` does what it should, `-audiodev none,id=id` shuts up pulse to keep the output clean as it inevitably complains about something

04:29 <klange> `-serial null -serial mon:stdio` this one is fun; this disables "COM1", which in this case doesn't mean much but in UEFI boot keeps OVMF from spamming crap to the terminal; the monitor and stdio serial running "COM2", which I map as /dev/ttyS1

04:30 <Izem> does COM{1,2} predate windows?

04:30 <klange> Those names for them are DOS-era and not really used in x86 Unix-likes.

04:31 <Izem> I see, I wondered at that since I remember qemu is an open project

04:31 <klange> The fw_cfg options: opt/org.toaruos.gettyargs gets passed to the 'getty' app that manages serial consoles, -a like in Linux getty means "autologin", and /dev/ttyS1 is COM2.

04:32 <Izem> thanks

04:32 <klange> opt/org.toaruos.bootmode - my bootloader parses this to pick a boot mode without the UI, supports a few different strings for quick boot.

04:33 <klange> opt/org.toaruos.term - the Makefile is actually setting this to $TERM, gets read by one of my init apps, possibly getty? I don't even remember! but ensures the hosted terminal knows what it's running on

04:33 <klange> getty also does one other fun little hack where it rams the cursor into the lower right corner and does a position report request, so it can get the size of a remote terminal.

04:34 <klange> Or more correctly, shells out to a tool that does that: https://klange.dev/s/Screenshot%20from%202021-07-14%2013-34-08.png

04:34 * kingoffrance .oO( "if any graphics masters [*cough*] " ) *coughs* and sticks magnets under graphics master

04:34 <kingoffrance> ive done all i can do

04:35 <klange> Just in case anyone thought I was a GUI hardliner, heck no, I provide first-class experience over serial and in VGA text mode.

04:35 <Izem> klange: I don't get that bit about the cursor

04:36 <klange> Terminals have a size. If you are attached directly to a terminal emulator there is a signalling mechanism where the terminal emulator can tell the TTY layer how big it is.

04:36 <klange> This is important for running any TUI app, of course.

04:36 <Izem> yeah

04:36 <klange> And if you are using ssh, and even telnet, there are mechanisms for those to pass this information between endpoints.

04:37 <klange> Serial does not have this, you have to manually configure sizes.

04:37 <Izem> but can't you do that without putting the cursor in the corner?

04:38 <klange> Weirdly, there is no standardized escape sequence for "tell me how big you are". Not sure why, just never seemed to happen. But there's a silly workaround: There are cursor movement sequences, very standard, been around for ages, and there is "cursor position report" that shoves data into the input buffer.

04:38 <Izem> oh I see

04:38 <Izem> makes sense

04:38 <Izem> kingoffrance: did you ever read graphics gems?

04:38 <klange> And the standard handling of a position that is too big is to 'trap' the cursor in the bottom right corner. So you ask for a ridiculous position like 10000,10000 and then ask where the cursor is and bam, you know the size of the terminal.
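
The trick klange describes uses two standard sequences: cursor position (`ESC [ row ; col H`) and the device status report `ESC [ 6 n`, which the terminal answers with `ESC [ row ; col R`. A minimal sketch of the two halves, assuming the caller handles the actual serial I/O (raw mode, reading until the `R`); `QUERY` and `parse_size` are hypothetical names, not taken from ToaruOS.

```python
import re

# Move to an absurdly large position (the terminal clamps it to the
# bottom-right corner), then ask where the cursor ended up.
QUERY = b"\x1b[10000;10000H\x1b[6n"

# Cursor position report: ESC [ <row> ; <col> R
_CPR = re.compile(rb"\x1b\[(\d+);(\d+)R")


def parse_size(reply):
    """Return (rows, cols) from a cursor position report, or None."""
    match = _CPR.search(reply)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))
```

In use, the program would write `QUERY` to the terminal, read bytes back until it sees the trailing `R`, and pass them to `parse_size`.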

04:38 <moon-child> you can also use the 'cursor report' sequence to do other fun things (http://nethack4.org/blog/portable-terminal-codes.html)

04:38 <bslsk05> nethack4.org: Towards being able to ignore $TERM

04:39 <klange> You can also use cursor report to figure out a remote terminal's wcwidth, but it's, uh, messy.

04:40 <kingoffrance> Izem no, i know nothing, just ...theres ways to summon certain channel members...

04:41 <kingoffrance> i used to have https://www.jagregory.com/abrash-black-book/ but never went through it

04:41 <bslsk05> www.jagregory.com: Michael Abrash’s Graphics Programming Black Book, Special Edition

04:42 <Izem> thanks, seems to have a good bit about the vga

04:43 <kingoffrance> if i ever got that far, i will start with serial port and "at the lowest level, the simplest thing is double buffered grid of cells, where each cell contains style information and text"

04:44 ElectronApps has quit [Remote host closed the connection]

04:44 <Izem> sounds like what I'm gonna do :P

04:44 <Izem> but I'm also going to have to answer important questions about what an OS is so I don't end up making emacs

04:44 <kingoffrance> and then "client" whatever can decide how many of "style" stuff it can honour/display, else fall back is ignore them all i suppose

04:45 ElectronApps has joined #osdev

04:45 <kingoffrance> even that, "text" still means "charset" or utf or whatever, so itself needs defined

04:45 heat has quit [Ping timeout: 276 seconds]

04:46 <kingoffrance> anyhow, i have no idea if that is good idea, just i envision the "style" stuff that you could still have "client" display portions, even if it cant handle the full deal

04:47 <klange> Things like `screen` and `tmux` do that, they can take in their own particular dialect of the standard escape sequences and output through a variety of dialects.

04:48 <kingoffrance> "Implementing and Optimizing Bresenham’s Line-Drawing Algorithm" i do have a very crude that, not optimized, but i dont consider that anything except "this is how you normalize/fudge a line to square pixels"

04:48 <kingoffrance> (mentioned in book, i used that elsewhere, but not really hooked up to anything)

04:48 <kingoffrance> i think that is very basic/famous/simple, just thats maybe as far as graphics i have got
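
For reference, the unoptimized integer form of Bresenham's line algorithm that kingoffrance mentions looks roughly like this. It is the common all-octant error-accumulation variant, written as a sketch rather than copied from the Black Book; the function name is arbitrary.

```python
def bresenham(x0, y0, x1, y1):
    """Return the list of integer pixel coordinates on the line (x0,y0)-(x1,y1)."""
    points = []
    dx = abs(x1 - x0)
    dy = -abs(y1 - y0)
    sx = 1 if x0 < x1 else -1   # step direction along x
    sy = 1 if y0 < y1 else -1   # step direction along y
    err = dx + dy               # running error term
    while True:
        points.append((x0, y0))
        if x0 == x1 and y0 == y1:
            break
        e2 = 2 * err
        if e2 >= dy:            # error says: step in x
            err += dy
            x0 += sx
        if e2 <= dx:            # error says: step in y
            err += dx
            y0 += sy
    return points
```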

04:49 <klange> https://klange.dev/s/Screenshot%20from%202021-07-14%2013-46-00.png

04:49 <klange> I can report it is sunny and nice on my side of Tokyo, so these thunderstorms must be on the south side...

04:49 <klange> The little bugfix I did to my rounded rectangle renderer has a noticeable effect on the corners of these bubbly popups.

04:52 <klange> I wonder if my fuzzy unhinted text would be improved with a gamma curve or whatever it's called?

04:55 <moon-child> try it. Just do clr_val = pow(clr_val, 2.2)

04:55 <moon-child> (where the value is in [0,1])
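
moon-child's suggestion, spelled out as a small sketch: the value is normalized to [0,1], raised to the power 2.2, and (for 8-bit coverage or colour values) scaled back. The function names and the 8-bit wrapper are assumptions for illustration; where in the text renderer the curve should be applied is not specified in the discussion.

```python
def apply_gamma(value, gamma=2.2):
    """Apply a power-law curve to a value in [0, 1]."""
    return value ** gamma


def apply_gamma_8bit(level, gamma=2.2):
    """Same curve for an 8-bit value in [0, 255]."""
    return round(255 * (level / 255) ** gamma)
```

With an exponent above 1, the endpoints stay fixed while midtones are pushed darker, so partially covered edge pixels lose weight.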

04:58 <Izem> when doing the cross compiler does that mean all the binutils have to be prepared the same way?

04:59 lucf117 has quit [Remote host closed the connection]

05:03 <klange> standard 2.2 way too wiry, but might look into other gamma curves...

05:06 Izem has quit [Quit: Izem]

05:51 MarchHare has quit [Ping timeout: 255 seconds]

05:52 MarchHare has joined #osdev

06:07 ^[ has quit [Ping timeout: 276 seconds]

06:29 ^[ has joined #osdev

07:05 ElectronApps has quit [Read error: Connection reset by peer]

07:08 ElectronApps has joined #osdev

07:49 MarchHare has quit [Ping timeout: 255 seconds]

07:50 vdamewood has joined #osdev

07:53 sortie has joined #osdev

07:56 Burgundy has joined #osdev

08:09 elastic_dog has quit [Ping timeout: 255 seconds]

08:16 elastic_dog has joined #osdev

08:18 mhall has joined #osdev

08:24 elastic_dog has quit [Ping timeout: 255 seconds]

08:25 elastic_dog has joined #osdev

08:47 gmacd has quit [Remote host closed the connection]

08:54 zaquest has joined #osdev

09:02 dennis95 has joined #osdev

09:11 gog has joined #osdev

09:22 z_is_stimky has quit [Read error: Connection reset by peer]

09:22 z_is_stimky_ has joined #osdev

09:26 GeDaMo has joined #osdev

09:32 elastic_dog has quit [Ping timeout: 245 seconds]

09:39 elastic_dog has joined #osdev

09:51 dormito has quit [Ping timeout: 255 seconds]

10:12 Skyz has joined #osdev

10:16 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

10:17 <Skyz> I think it would be interesting to make an OS that starts off with only two colors, black and white

10:23 dormito has joined #osdev

10:23 <Skyz> https://cdn.arstechnica.net/wp-content/uploads/2018/04/DSC07486.jpg

10:24 <Skyz> Something that can resemble the game boy original feel

10:34 <klange> 1) That's a hardware thing, not really an OS thing. 2) The original GameBoy, rather famously, was 2-bit grayscale, and its LCD's color contrast and green backdrop meant it was really more "4 shades of olive" than even "gray". Not at all "black and white".

10:35 <Skyz> 2-bit grayscale huh

10:36 <Skyz> It could be self-imposed on a software level

10:37 <Skyz> I'm not looking to follow the tutorial to make the same thing

11:09 elastic_dog has quit [Ping timeout: 255 seconds]

11:15 elastic_dog has joined #osdev

11:17 silverwhitefish has quit [Quit: One for all, all for One (2 Corinthians 5)]

11:46 scaleww has joined #osdev

12:02 elastic_dog has quit [Ping timeout: 245 seconds]

12:04 ahalaney has joined #osdev

12:09 elastic_dog has joined #osdev

13:10 xenos1984 has quit [Remote host closed the connection]

13:11 xenos1984 has joined #osdev

13:13 vai has joined #osdev

13:13 <vai> hi all :)

13:14 <vai> sortie: hi! :D

13:14 <sortie> Hi

13:14 <Skyz> Hi

13:21 iorem has joined #osdev

13:22 isaacwoods has joined #osdev

13:28 johnjay has quit [Ping timeout: 255 seconds]

13:30 silverwhitefish has joined #osdev

13:30 heat has joined #osdev

13:35 shikhin has quit [Quit: Quittin'.]

13:36 zgrep has quit [Quit: It's a quitter's world.]

13:37 zgrep has joined #osdev

13:38 shikhin has joined #osdev

13:39 johnjay has joined #osdev

13:43 <gog> hi

13:52 Skyz has quit [Quit: Client closed]

13:53 <jimbzy> Sup gog

13:56 Skyz has joined #osdev

13:56 <Skyz> Okay, so I'm doing some research into reflection-oriented programming

13:58 <Skyz> I think if you use a language to construct the os that has a bytecode interpreter it could reflect on itself

14:00 <jimbzy> Why?

14:01 srjek|home has joined #osdev

14:01 <Skyz> self-awareness

14:01 nyah has joined #osdev

14:01 <Skyz> In computer science, reflection is the ability of a computer program to examine (see type introspection) and modify its own structure and behavior (specifically the values, meta-data, properties and functions) at runtime.[1]

14:02 <jimbzy> I know what reflection is, but why are you interested in it?

14:02 <Skyz> It can be used for hacking games

14:04 <jimbzy> That's cheating, tho.

14:05 <Skyz> ¯\_ (ツ)_/¯

14:05 <jimbzy> I never used reflection for that. I used a debugger and a hex editor.

14:06 <jimbzy> I also got an angry letter from Hasbro because they didn't like my RCT trainer.

14:09 <Skyz> I only cheat at games that you can't beat

14:09 <jimbzy> I can beat any game, except one. I have never beaten the original Battletoads on the NES.

14:11 <Skyz> I hear that game is hard to beat

14:12 <gog> i can't beat any games

14:12 <Skyz> I played frogger as a kid for ps

14:12 <jimbzy> I love frogger, too.

14:13 <jimbzy> gog, I have to play offline because the kids beat my ass and make fun of me :(

14:17 <gog> :(

14:19 <heat> i love getting told what to do in online games by Polish 9 year olds

14:19 <Skyz> lol

14:20 <heat> it's the cherry on top of the cake

14:20 <jimbzy> heat, I had a great team when CoD:BO first came out.

14:20 <jimbzy> We were all over 30 and very organized.

14:21 <jimbzy> At one point, we were ranked like 5000th overall in zombie mode on Kino Der Toten.

14:22 <jimbzy> It's funny, too, because to this day when my son sees a Nazi in uniform on TV he calls them "Zombzis"

14:25 <Skyz> I haven't played CoD since black ops

14:25 <Skyz> Never got into it

14:25 <Skyz> Was more of a fan of Halo

14:26 <jimbzy> Never tried Halo

14:26 <jimbzy> Hell, I've never actually played Half-Life.

14:26 <heat> i've never played COD in my life lol

14:27 <jimbzy> I played the hell out of that one, but that was about it.

14:27 <heat> all I play is counter strike and rocket league

14:28 <jimbzy> I started playing the original FFVII again on PS4.

14:28 <heat> the toxicity is truly part of the experience

14:28 <jimbzy> It's too much like going to a family reunion for me, heat :p

14:29 <heat> :D

14:29 <gog> i'd rather get called f-g by somebody whose face i can't see than my cousin :p

14:30 <jimbzy> That happened at the last one I went to in 2009.

14:30 <jimbzy> My cousin called my younger brother that. It didn't end well for him.

14:30 freakazoid333 has quit [Read error: Connection reset by peer]

14:30 <gog> good

14:31 <jimbzy> Yeah, they're pretty ignorant.

14:32 MarchHare has joined #osdev

14:34 <jimbzy> I think I'm going to walk down to the store and get a cup of coffee before it heats up out there. I'll catch you all later.

14:40 <gog> byee

14:41 heat has quit [Read error: Connection reset by peer]

14:51 kingoffrance has quit [Ping timeout: 252 seconds]

15:00 mahmutov has joined #osdev

15:01 iorem has quit [Quit: Connection closed]

15:03 kingoffrance has joined #osdev

15:12 kingoffrance has quit [Ping timeout: 255 seconds]

15:27 freakazoid333 has joined #osdev

15:30 kingoffrance has joined #osdev

15:35 <Skyz> What if Hollywood is right about skynet :o

15:35 scaleww has quit [Quit: Leaving]

15:41 Oli has quit [Quit: Lost terminal]

15:51 <Skyz> Well it looks like skynet is built

15:52 <Skyz> https://en.wikipedia.org/wiki/Skynet_(satellite)

15:52 <bslsk05> en.wikipedia.org: Skynet (satellite) - Wikipedia

16:01 ElectronApps has quit [Ping timeout: 272 seconds]

16:02 mahmutov has quit [Ping timeout: 245 seconds]

16:08 <Skyz> geist: there's even a satellite called Zircon https://en.wikipedia.org/wiki/Zircon_(satellite)

16:08 <bslsk05> en.wikipedia.org: Zircon (satellite) - Wikipedia

16:15 <Skyz> G2G, that's some food for thought

16:15 <Skyz> https://en.wikipedia.org/wiki/John_von_Neumann

16:15 <bslsk05> en.wikipedia.org: John von Neumann - Wikipedia

16:15 <Skyz> His wiki got updated significantly

16:18 Skyz has quit [Quit: Client closed]

16:27 <nur> would I be a bad osdevver if I started poking at another hardware platform while also doing another

16:27 <sortie> It will change your alignment from lawful to chaotic, but no change to your moral standing

16:28 srjek|home has quit [Ping timeout: 255 seconds]

16:33 <gog> chaotic neutral gang

17:15 <nur> I always wanted to be more Han Soloesque

17:15 <nur> what the hell

17:15 <nur> let's do this

17:22 <kingoffrance> as ive said months ago, han solo is ship of theseus "ive made some upgrades"

17:22 <kingoffrance> so, i see no conflict...

17:30 zoey has joined #osdev

17:33 <nur> is raspi4 a valid qemu-arm target machine

17:33 <nur> I feel like it's been merged yet

17:34 <nur> $ qemu-system-aarch64 -machine raspi4

17:34 <nur> qemu-system-aarch64: -machine raspi4: unsupported machine type 'raspi4'

17:34 <nur> lol nope it doesn't work

17:34 <clever> nur: what about `-machine help` ?

17:34 <nur> yeah it's not there

17:34 <nur> $ qemu-system-aarch64 --version

17:34 <nur> QEMU emulator version 4.2.1 (Debian 1:4.2-3ubuntu6.16)

17:35 <nur> maybe it's the version

17:35 <clever> hw/arm/raspi.c

17:35 <clever> it's defined in this file of the source

17:36 <clever> https://github.com/qemu/qemu/blob/master/hw/arm/raspi.c#L369-L399

17:36 <bslsk05> github.com: qemu/raspi.c at master · qemu/qemu · GitHub

17:36 <clever> nur: yep, 4 is missing from master

17:36 <j`ey> no rpi4

17:36 zoey has quit [Ping timeout: 255 seconds]

17:36 <nur> someday

17:36 <clever> nur: what pi4 specific feature are you wanting to emulate?

17:36 <nur> not there yet :)

17:37 <nur> just wondering if it's very different

17:37 <clever> behind the scenes, a lot has changed

17:37 <clever> but qemu didnt emulate 90% of the stuff that has changed

17:37 <clever> the only real difference qemu supports, is >1gig of ram

17:37 <clever> and pci-e support

17:38 <clever> everything else is missing from qemu, even on the pi3 machine

17:38 <nur> I guess I can start working on my arm OS on RPI3 Qemu edition and worry about it when I can afford to buy a real one

17:39 <j`ey> nur: or just target the 'virt' machine

17:39 <nur> so it'll be like a "generic ARM OS"?

17:39 <clever> nur: ive been doing baremetal on the rpi as well, but not using the arm core

17:39 <j`ey> nur: ish

17:40 <nur> It still boggles me how I can just "not worry about it too much"

17:41 <clever> nur: do you want to support the rpi specifically, or just do generic arm dev? does it have to be arm?

17:41 <nur> I want to buy a RPI so I can boot my OS on it

17:41 <nur> so yes?

17:42 <nur> it doesn't _have_ to be ARM but it has to be something right

17:42 <clever> nur: there is also the much less traveled road, of doing VPU development on the rpi

17:42 <j`ey> dont do that

17:42 <j`ey> lol

17:42 <clever> nur: the rpi has 2 separate cpu clusters in the same chip

17:42 <clever> j`ey: why not? :D

17:43 <nur> yeah I just wanna target ARM so that when I can get my hands on a real machine I can try it out

17:43 <j`ey> clever: you know why P

17:45 Skyz has joined #osdev

17:45 <clever> https://github.com/librerpi/lk-overlay/blob/master/arch/vpu/start.S#L8-L20

17:45 <bslsk05> github.com: lk-overlay/start.S at master · librerpi/lk-overlay · GitHub

17:45 <clever> nur: an example of VPU asm, line 12 turns IRQ's off, 13 sets up the stack, 15 may enable the uart very early, 17 clears .bss, 18 passes control to C, and 20 will loop forever if C somehow returned

17:46 <clever> from then on, you can just use C, and any rpi peripheral, same as if you were on the ARM core

17:46 <Skyz> Nur: I would do virtu or you can target the "hack" platform from NandToTetris

17:47 <Skyz> https://onatm.dev/2019/04/05/anatomy-of-a-hack-assembly-program-part-1/

17:47 <bslsk05> onatm.dev: Anatomy of a Hack assembly program - Part 1 | Extremely random blog posts from Onat

17:49 zoey has joined #osdev

17:49 srjek|home has joined #osdev

17:54 <nur> thanks :)

18:07 <Skyz> Np

18:09 dennis95 has quit [Quit: Leaving]

18:12 <Skyz> I think you can put the hack computer on an FPGA

18:12 <Skyz> https://hackaday.io/project/160759-nand-to-tetris-in-verilog-part-1-icarus

18:12 <bslsk05> hackaday.io: Nand to Tetris in Verilog Part 1 - Icarus | Hackaday.io

18:15 <immibis> hey guys, hey guys, guys, hey. What if the processor could speculate BOTH sides of the branch?

18:15 <GeDaMo> I think that some do

18:39 tacco has joined #osdev

18:40 Vercas has quit [Remote host closed the connection]

18:40 Vercas has joined #osdev

18:48 Mooncairn has joined #osdev

18:54 <Skyz> Found something working on FPGA

18:54 <Skyz> https://gitlab.com/x653/nand2tetris-fpga/-/raw/master/09_Hack9-SD/Hack9.jpg

18:58 <Skyz> Full project is here

18:58 <Skyz> https://gitlab.com/x653/nand2tetris-fpga/

18:58 <bslsk05> gitlab.com: Michael Schröder / nand2tetris-FPGA · GitLab

19:00 freakazoid333 has quit [Read error: Connection reset by peer]

19:06 Skyz has quit [Ping timeout: 246 seconds]

19:08 <jimbzy> NAND to Tetris, eh?

19:09 <GeDaMo> NAND 2 Tetris is good

19:09 srjek|home has quit [Ping timeout: 245 seconds]

19:09 <GeDaMo> At least the hardware part is, I lost interest when it got to the software part :P

19:09 * geist yawns

19:10 <geist> good afternoon folks

19:10 <jimbzy> What hardware does it support?

19:10 <jimbzy> Hey g.

19:10 <jimbzy> What's up?

19:10 <GeDaMo> It's all done through simulators

19:10 <geist> oh not much

19:11 <jimbzy> If I'm not mistaken, Tetris was originally written for the Electronika 60, which was similar to the PDP-11.

19:31 asymptotically has joined #osdev

19:32 <gog> yes

19:32 <gog> LSI's clone

19:33 Skyz has joined #osdev

19:33 <Skyz> Yes, you're right. Tetris was made in Russia

19:34 <Skyz> I'm actually surprised how advanced Russia is with technology

19:34 <Skyz> Germany too

19:35 <gog> it was made in the soviet union, tovarisch

19:36 <j`ey> linus tovarisch

19:38 <Skyz> Well, there are a lot of places technology is being produced

19:39 <Skyz> The course was written by someone in Israel, they have some affinity with the Soviet Union

19:44 freakazoid333 has joined #osdev

19:45 immibis has quit [Killed (NickServ (GHOST command used by immibis_!~immibis@2a02:3032:404:1b60:a9d2:4a7a:60e:e127))]

19:45 immibis has joined #osdev

19:46 <moon-child> immibis: wait'll you hear about hyperthreading. If the cpu runs into a branch (or cache miss, just generally runs out of things to speculate), it can just pick some other instructions to execute instead

19:46 <Skyz> I've recently seen a lot with China, there is an interesting look on tech on Bloomberg's YT

19:47 <moon-child> ;o

19:47 <clever> moon-child: from what ive heard, x86 hyperthreading is a complex blend, of having say 2 opcode decoders, and 2 very basic opcode execution units, but then sharing more expensive things like sse and fpu cores

19:48 <Skyz> https://www.youtube.com/watch?v=SUfjtKtkS2U

19:48 <bslsk05> 'Inside China's Accelerating Bid for Chip Supremacy' by Bloomberg Quicktake (00:19:49)

19:52 <clever> moon-child: on the VPU for example, each core has 2 scalar units and 1 vector unit, so its able to cheat and run 2 scalar opcodes in the same clock cycle (if conditions are right)

19:52 <clever> moon-child: and if you start a vector opcode, but dont try to read its result right away, the scalar units can run dozens of opcodes, in parallel to the vector unit doing one opcode

19:53 <clever> x86 hyperthreading, is probably just sharing some of those units between multiple cores

19:57 <geist> clever: not really the right terminology

19:57 * sortie works on Unix socket file descriptor passing support in sendmsg/recvmsg

19:57 <geist> x86 hyperthreading is a marketing term for SMT (simultaneous multithreading). been around a while, lots of architectures have had it

19:58 <geist> think of it as a single core that is simply maintaining state of N separate hardware threads

19:58 <geist> and internally switching between instructions between threads

19:58 <sortie> Damn this convoluted 2017-era prototype code of mine scares me and there's reasons why it's not simpler and lots of unhandled edge conditions

19:58 <geist> so it's less that multiple cores are sharing hardware (except maybe AMD's bulldozer) and more that a single core holds multiple hardware thread state at the same time

19:59 <clever> geist: ah, maybe i'm thinking of something else

19:59 YuutaW has quit [Ping timeout: 240 seconds]

19:59 <geist> AMD's somewhat ill conceived CMT (bulldozer) is a different story

19:59 <clever> so hyperthreading is more about just having say double the registers, and context switching when it would have stalled?

19:59 <geist> clever: basically right

19:59 <kingoffrance> not that i know anything but another approach? https://en.wikipedia.org/wiki/Digital_signal_processor DSPs are usually optimized for streaming data and use special memory architectures that are able to fetch multiple data or instructions at the same time, such as the Harvard architecture or Modified von Neumann architecture, which use separate program and data memories (sometimes even concurrent access on multiple data buses). all i know is, "MIPS" you must compare apples and oranges to have any semblance of meaningfulness. even that ancient black book graphics programming "As I was writing my last game, I discovered that the program ran perceptibly faster if I used look-up tables instead of shifts and adds for my calculations. It shouldn’t have run faster, according to my cycle counting, but it did. In truth, instruction fetching was rearing its head again, as it often does, and the fetching of the shifts and adds was taking as much as four times the nominal execution time of those instructions." under the heading "assume nothing"

19:59 <bslsk05> en.wikipedia.org: Digital signal processor - Wikipedia

20:00 <clever> and i can see how each stage in the pipeline could be doing a different thread

20:00 <clever> so it can weave them together

20:00 <geist> there are all sorts of schemes to decide when to context switch, but i think in the steady state most designs will just toggle back and forth

20:00 <geist> clever: right, and all of the usual dependency tracking hardware Just Works in a SMT case

20:00 <geist> since instructions from different threads are intrinsically not dependent on each other

20:01 <geist> so you already have this highly out of order cpu that just tosses a bunch of unrelated instructions at it, it is even more efficient at it

20:01 <clever> https://en.wikipedia.org/wiki/Bulldozer_(microarchitecture)

20:01 <bslsk05> en.wikipedia.org: Bulldozer (microarchitecture) - Wikipedia

20:01 <clever> model name : AMD FX(tm)-8350 Eight-Core Processor

20:01 <clever> and i'm on the list in that wiki page!

20:02 <clever> so my cpu is the exception to that rule, and isnt the same kind of hyperthreading?

20:02 <geist> note that bulldozer and amd's CMT was distinctly different from standard SMT, since it shares far less than the whole cpu

20:02 <geist> right. CMT turned out to be a Bad Idea

20:02 <geist> zen switched to full SMT

20:02 <clever> what was a bad idea with it?

20:03 <geist> didn't work worth a crap

20:03 <geist> or more to the point SMT simply does a better job

20:03 ccx has quit [Ping timeout: 272 seconds]

20:03 <clever> ah

20:04 <geist> hard to tell if CMT was torpedoed mostly by bad implementation or was a fundamentally bad idea

20:04 <immibis> clever: you are describing the modern out-of-order-execution system, not sure if it has a name, but "tomasulo's algorithm" may come close

20:04 <geist> but on paper SMT should generally be superior in every way

20:04 <immibis> the decoder issues instructions as fast as it can; they wait in some kind of buffer until their dependencies are satisfied; then they get allocated to any available execution unit (or wait further if none is available)

20:04 <geist> my understanding is CMT simply shares the fpu/vector back end between pairs of cores, but otherwise they're standalone

20:04 <immibis> any available execution unit which can execute that instruction*
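
A minimal sketch of the issue scheme immibis describes above, assuming a toy machine: instructions sit in a buffer until their dependencies have produced results, then go to any free execution unit of the right kind. The `Instr` fields, unit names, and latencies are made up for illustration and do not model any real CPU. The loop would never finish if an instruction named a dependency that is not in the program.

```python
from dataclasses import dataclass

@dataclass
class Instr:
    name: str
    unit: str            # kind of execution unit required, e.g. "alu" or "mem"
    deps: tuple = ()     # names of instructions whose results this one needs
    latency: int = 1     # cycles spent in the execution unit

def simulate(program, units):
    """Return {instruction name: cycle in which it started executing}."""
    waiting = list(program)      # the buffer the decoder has filled
    finish = {}                  # name -> cycle its result becomes available
    start = {}
    # One "free at cycle" entry per physical unit of each kind.
    free_at = {kind: [0] * count for kind, count in units.items()}
    cycle = 0
    while waiting:
        for instr in list(waiting):
            ready = all(d in finish and finish[d] <= cycle for d in instr.deps)
            if not ready:
                continue                      # keep waiting in the buffer
            slots = free_at[instr.unit]
            for i, t in enumerate(slots):
                if t <= cycle:                # found an idle unit of this kind
                    slots[i] = cycle + instr.latency
                    start[instr.name] = cycle
                    finish[instr.name] = cycle + instr.latency
                    waiting.remove(instr)
                    break
        cycle += 1
    return start

program = [
    Instr("load", "mem", latency=3),
    Instr("add", "alu", deps=("load",)),   # stalls until the load completes
    Instr("mul", "alu"),                   # independent, so it runs early
    Instr("sub", "alu", deps=("mul",)),
]
start = simulate(program, {"mem": 1, "alu": 1})
```

Here `mul` and `sub` start before `add` even though they come later in program order, which is also why instructions from a second hardware thread slot in so easily: they never depend on the first thread's results.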

20:05 <geist> yah one way of thinking about modern designs is they're really a pile of little cpus all talking to each other asynchronously

20:05 <clever> immibis: and my understanding is that the bulldozer CMT, is sharing those execution units between cores?

20:05 <kingoffrance> *must compare apples and apples

20:05 <clever> geist: so if 2 cores are doing non-vector ops, they can run in parallel, and do more work than a SMT design? not counting the stalls wasting time

20:06 <geist> clever: right, thats the idea

20:06 <clever> or only one core in the pair limits itself to fpu/vector ops

20:06 <geist> OTOH that'd use more space than a SMT design, so you could potentially cram two SMTs in the same space, etc etc

20:06 <immibis> it appears yes although i hadn't heard of it until today. The FPU units are shared. Though Wikipedia's diagram also only shows one decoder, which is odd...

20:06 <clever> while SMT will have the 2nd core suffer far more, if your compiler was smart enough to avoid stalls

20:06 <geist> it's all about what tradeoffs you get. in the bulldozer case they seemed to bet that fpu/vector performance wasn't as important and they could save space by sharing one between cores

20:07 <immibis> which makes sense unless you are doing machine learning. Most code is boring. But, maybe most code that cares about performance is not so boring.

20:07 <geist> immibis: it got complicated. there were 4 cores in the family 15h: bulldozer, piledriver, steamroller, excavator

20:07 <geist> in the later revisions they split the decoders out, etc

20:07 <geist> by the end of it (excavator) it was fairly decent, but still outclassed by equivalent intel cores

20:07 <immibis> i guess you only really need one core-equivalent to run windows and word. Extra cores are for gaming and machine learning. And you'd better be able to use them for that

20:07 <geist> it was okay, just not good enough

20:08 <clever> immibis: a lot of the games i play tend to be single-threaded :(

20:08 <geist> it was a case where the whole family 15h was not a complete dumpster fire, just middling performance

20:08 <mjg> your web browser makes up for it

20:08 <geist> and thus no reason to get it

20:08 <geist> and/or they sold it as bargain bin low end, with no margins

20:09 <geist> by the end of the line they weren't even making new desktop cpus, since no one would buy it. excavator was a reasonably good design but i think it only made it to laptops, since that was the only market AMD could sell it into (at cheap prices)

20:10 <clever> > The longer pipeline allowed the Bulldozer family of processors to achieve a much higher clock frequency compared to its K10 predecessors. While this increased frequencies and throughput, the longer pipeline also increased latencies and increased branch misprediction penalties.

20:10 <geist> anyway. Zen completely destroyed it, since they went back to the drawing board and made a much more proper (re: more like intel) core design

20:10 ccx has joined #osdev

20:11 <geist> kinda recommend, there's a good interview with Jim Keller on anandtech. doesn't talk about a lot of tech details, but he was the guy they brought in to fix AMD's problem

20:11 <geist> seems like a real smart guy and a straight shooter, so to speak

20:11 YuutaW has joined #osdev

20:12 <geist> also side note he straight up talks about there being two front ends in development for what became the zen core: K12 (arm decoder) and the x86 decoder

20:12 ZetItUp has quit [Read error: Connection reset by peer]

20:12 <geist> presumably the K12 project is parked, but i've always had the strong suspicion that the zen backend has a lot of ARMisms in it because it was designed to also support arm front end

20:15 <clever> would the frontend be set in stone when fabbed, or is there any chance of context switching it at runtime?

20:15 <geist> you could, though it'd be complicated

20:15 <geist> to a certain extent apple doing what they do with the M1 (having a strongly ordered mode) is probably the right way to go about it

20:16 <geist> make it so that the cpu runs more or less the same 'way' memory order wise, then you can do a fairly straightforward binary translation between the two fully capable ISAs

20:16 <clever> from what ive read, the M1 is just an arm frontend with the memory ordering mode being toggleable

20:16 <geist> right, but that memory order thing is a Big Deal

20:16 <clever> so you still have to translate the x86->arm, in the raw binary, and fixup the addressing

20:16 <geist> since aside from ISA the two architectures do approach memory order completely differently

20:17 <clever> having 2 frontends, would eliminate the need for the translation step

20:17 <geist> sure, but translating between two ISAs like that is a solved problem

20:17 <geist> sure, but like all things its not free, so you can make it a software problem, which can get better over time (ie, can be upgraded in the field) and you can even cache the translations, etc

20:17 <geist> plus in their case the idea is to eventually not run x86 anymore, so there's little point investing in hardware to do it

20:17 <clever> yeah

20:18 <clever> the dalvik stuff on android is doing similar

20:18 <clever> at one time, it was interpreted bytecode, with only an install-time linker patching

20:18 <geist> itanium, for example, had an x86 decoder built in (could run it in x86 compatibility mode) but it ran so terribly it was hard to use

20:18 <clever> but now its using llvm to translate it into native at install time

20:18 <geist> another one of those failures of the itanium design. they probably should have gone in with a good SW translator instead of worrying about wasting hardware on it

20:19 <geist> though it was also designed to be dropped in later itaniums, which it was IIRC

20:19 <clever> i'm also reminded of the BMOW1 and its micro-code flash chip

20:19 <clever> basically, a 4 bit micro-code PC, the 8bit opcode latch, and a condition var, are used as address lines into the micro-code flash chip

20:19 <immibis> seems about right. That's just a slightly upgraded PLA

20:20 <clever> and the raw data lines out, control all of the latches/buffers in the cpu, to route data betweenregisters/alu

20:20 <immibis> A ROM is really just a fully decoded PLA and NOR flash is a reprogrammable ROM

20:20 <clever> but, if you just have an opcode set register as a few more addr bits

20:20 <clever> you could context switch to an entirely different microcode table, at runtime
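
A sketch of the addressing clever describes above: the micro-PC, opcode latch, and condition flag are concatenated into one ROM address, and an extra opcode-set register simply adds high address bits that select a whole different table. The bit layout is an assumption for illustration, not the BMOW1's actual wiring.

```python
def microcode_address(step, opcode, condition, opcode_set=0):
    """Pack the control inputs into one microcode ROM address.

    Assumed bit layout (illustrative only):
      bits 0-3   micro-PC step (4 bits, so at most 16 steps per opcode)
      bits 4-11  opcode latch (8 bits)
      bit  12    condition flag
      bits 13+   opcode-set register, selecting one complete microcode table
    """
    assert 0 <= step < 16 and 0 <= opcode < 256 and condition in (0, 1)
    return (opcode_set << 13) | (condition << 12) | (opcode << 4) | step

# Size of one complete table; each opcode set occupies its own block of this size.
TABLE_SIZE = 1 << 13
```

Writing a new value to the opcode-set register moves every later lookup into a different `TABLE_SIZE`-sized block, which is the runtime switch between microcode tables.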

20:20 <geist> oh side note, replacement PLA for my C64 came in. fixed it right up

20:21 <clever> immibis: yeah, ive seen die shots of the 6502, and it has a maskrom for the microcode

20:21 <geist> not really. that's not microcode so much

20:21 <clever> immibis: the BMOW1 is just a 6502 compatible cpu, made out of raw logic gates and wire-wrap, with a pinch of standard flash/ram added in

20:21 <immibis> wait the BMOW is 6502 compatible? heh I didn't realize that

20:22 <clever> immibis: i think it was

20:22 <clever> https://www.bigmessowires.com/bmow1/

20:22 <bslsk05> www.bigmessowires.com: BMOW 1 Computer | Big Mess o' Wires

20:22 <geist> the https://gigatron.io/ is also 6502 compatible now with a new rom

20:22 <bslsk05> gigatron.io: Gigatron – TTL microcomputer

20:22 <clever> > The high-level instruction set that’s implemented in this microcode is a close cousin to 6502 assembly language.

20:22 <immibis> <clever> but, if you just have an opcode set register as a few more addr bits <- you can also do that with spaghetti logic, or whatever you're using, you could even power off a whole decode unit and start up a new one

20:23 <geist> similar idea. simple 8 bit microcode class cpu that is furiously running an emulator for a larger 16 bit cpu, and now an optional 6502 mode

20:23 <immibis> not sure if the gigatron really counts, isn't it using software emulation?

20:23 <geist> same thing

20:23 <geist> that's the point, microcode looks like software at a particular level

20:23 <immibis> by that logic my gameboy can be switched to the x86 instruction set

20:23 <immibis> just by changing a rom

20:23 <geist> sure. also.... remember transmeta?

20:24 <immibis> in fact I don't even need soldering because the rom comes on a user-replaceable cartridge

20:24 <clever> immibis: for the 6502 in the c64, the maskrom was inside the cpu itself

20:24 Mooncairn has left #osdev [Leaving]

20:24 <geist> the lines are blurred when you get into deep microcode. in general i think you can tend to call it microcode if it looks a particular way

20:24 <geist> ie, if the instruction indexes directly into a rom, which starts a sequence of control logic

20:25 <immibis> clever: I believe that kind of "microcode" will be tightly integrated with the CPU. In fact, even the BMOW1's probably is. It can't be reprogrammed arbitrarily, it can only run instruction sets that look enough like the one it was designed for

20:25 <clever> https://www.bigmessowires.com/block_diagram.png

20:25 <immibis> which could be quite wide in the case of the BMOW1

20:25 <clever> immibis: i think it has 2 main limits

20:25 <clever> 1: the number of raw registers

20:25 <clever> 2: it relies on a design where you have 8 bits of opcode all by itself, followed by operands and immediates in the next bytes

20:25 <immibis> things like ALU flags will be hard-wired. You want a half-carry flag and you don't have one in hardware? too bad

20:25 <immibis> ah yes that too

20:25 <clever> 2: the current design is also limited to 16 steps for an opcode

20:26 <clever> 3*

20:26 <clever> 1 can easily be solved, by just throwing more registers into the design, and having a wider output from the micro-code rom

20:26 <geist> immibis: but yeah i think you're right in that the low level microcode on the gigatron is a bit more cpu like. i guess to me the real question is whether or not it runs a code that fetches the next instruction and then looks in a table

20:26 <clever> 2 gets tricky.....

20:26 <geist> vs if the microcode itself directly dispatches. it's a detail and probably not important

20:26 <clever> 3, just add more bits to the micro-code PC counter

20:27 <geist> anyway, too many conversations at once

20:27 * geist bows out

20:27 <immibis> IMO real microcode is designed with hardware integration; direct instruction dispatch is part of that. It's hardware and "software" designed together, not just software running on hardware

20:27 <immibis> if it's easy to implement and makes the CPU faster then you do it, even if it limits the microcode you can write

20:28 <clever> immibis: in the case of the bmow1, there isnt really a pipeline

20:28 <clever> so if an opcode takes 4 microcode steps to run, then it takes 4 clock cycles to run, and cant share a cycle with anything

20:29 <immibis> didn't say anything about pipelining

20:34 GeDaMo has quit [Quit: Leaving.]

20:35 <geist> immibis: you're right though, the gigatron's microcode is more of an interpreter: https://github.com/kervinck/gigatron-rom/blob/master/Core/dev.asm.py#L1458

20:35 <bslsk05> github.com: gigatron-rom/dev.asm.py at master · kervinck/gigatron-rom · GitHub

20:35 <geist> that seems to be the core loop for the vcpu

20:35 <clever> that sounds a lot more powerful, at the cost of spending more cycles to do a given task

20:35 <geist> it basically runs as many instructions as it can before a vblank interrupt comes along, in which case it runs the logic to bit bang the video output, and back to interpreting instructions

20:36 <clever> is the bit-banging written in the micro ops or the interpreted ops?

20:36 <geist> micro ops. what you're looking at there is the micro op assembler

20:37 <geist> they did something clever that i wouldn't have thought of: instead of writing an assembler, they simply defined all of the asm instructions as a bunch of python functions

20:37 <geist> and then implement all of their assembly as python itself

20:37 <clever> ah, so they are cheating a bit, and having the high performance (bit-banging video) stuff skip the interpreter, and run directly on the raw microops

20:37 <geist> when you run this .py it spits out the rom
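
A minimal sketch of that approach, for a made-up machine: each mnemonic is a Python function that appends an encoded word to a list, so the "source file" is just Python and running it yields the ROM image. The opcodes, the 16-bit encoding, and the helper names are hypothetical and are not taken from the Gigatron sources.

```python
rom = []  # the ROM image being built, one 16-bit word per instruction

# Hypothetical opcodes for an imaginary machine.
OP_LD, OP_ADD, OP_JMP = 0x0, 0x1, 0x2

def emit(opcode, operand):
    """Append one encoded instruction: high byte opcode, low byte operand."""
    assert 0 <= operand < 256
    rom.append((opcode << 8) | operand)

def ld(value):
    emit(OP_LD, value)

def add(value):
    emit(OP_ADD, value)

def jmp(addr):
    emit(OP_JMP, addr)

def label():
    """Address of the next instruction, usable as a jump target."""
    return len(rom)

# The "assembly source" is ordinary Python calling those functions.
ld(1)
loop = label()
add(2)
jmp(loop)

# Serialize the words big-endian to get the bytes that would be burned to ROM.
image = b"".join(word.to_bytes(2, "big") for word in rom)
```

Labels, macros, and loops come for free from the host language, which is presumably much of the appeal.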

20:38 <geist> yah the whole point is it's just TTL and there's no dedicated sound or video hardware

20:38 <geist> the TTL itself context switches between interpreting instructions and bit banging hardware

20:38 <clever> the original VPU assembler was just C macro's, compiling and running it would generate a rom image

20:38 <geist> actually not too unlike the xerox alto

20:38 <clever> same basic idea

20:39 <geist> xerox alto did the same thing: had like 16 levels of microcode, hardware switched, and the different tasks did various hardware keeping

20:39 <geist> and then the lowest priority task just interpreted data general nova instructions which is what application code ran on

20:40 <clever> https://github.com/hermanhermitage/videocoreiv/blob/master/dumpbootrom/dumpbootrom.s

20:40 <bslsk05> github.com: videocoreiv/dumpbootrom.s at master · hermanhermitage/videocoreiv · GitHub

20:40 <clever> an example of that macro based assembly code

20:47 silverwhitefish has quit [Quit: One for all, all for One (2 Corinthians 5)]

20:48 Skyz has quit [Quit: Client closed]

20:50 dormito has quit [Ping timeout: 255 seconds]

20:59 <sortie> This 2017 Unix socket recvmsg/sendmsg code of mine is in dire need of comments explaining wtf is going on :)

21:01 <sortie> It's tricky because the control data also goes in the actual buffer so I need to maintain headers describing the layout and there are reference pointers (file descriptors) in there too (that can cause reference counting cycles!)

21:04 <immibis> anyone here looked at capability-based operating systems deriving from GNOSIS?

21:04 <immibis> KeyKOS, EROS, CapROS

21:06 <immibis> is L4 similar?

21:07 PapaFrog has quit [Ping timeout: 258 seconds]

21:07 <immibis> these are nanokernels where the kernel cannot even allocate memory; all kernel state is held in pages given to it by userspace and persisted alongside userspace data

21:09 <moon-child> l4 doesn't allocate because dynamic memory allocation is hard to verify :^)

21:10 <immibis> presumably it does something similar then, stores data about capabilities in pages accounted to userspace

21:11 <moon-child> the main thing that's interesting about keykos (imo) is persistence, which l4 doesn't have

21:11 <immibis> wikipedia lists fuchsia in the same category :)

21:12 <immibis> but fuchsia doesn't look like it fits into this class, it's just also a microkernel

21:12 <immibis> persistence is also interesting. Apparently they demoed it by ripping the power cord out of a running computer, then plugging it back in

21:13 <immibis> i'm not convinced a single-level store is efficient, but it certainly is interesting for that reason

21:13 <moon-child> the cpu does pretty well at synchronizing a virtually single-level store across 4-5 different levels of actual storage

21:14 <moon-child> (l1,l2,l3,ram, maybe virt. registers if your cpu is fancy enough)

21:15 <graphitemaster> https://twitter.com/cmuratori/status/1415376457036009475

21:15 <bslsk05> twitter: <cmuratori> This is a great diagram from Anandtech ( <anandtech.com/show/16805/amd… https://t.co/c2j2THqgcW> ). It uses color to show the relative cost of communicating between any two cores of a 64-core Threadripper. The physical layout of chips is becoming increasingly important to performance-oriented programming! https://pbs.twimg.com/media/E6RtVi1VkAIS3iS.jpg

21:15 <immibis> and yet we have all these tricks to try and trick it into being efficient. I suppose it's a standard flexibility/efficiency tradeoff. If you had to allocate cache lines, either nobody would ever bother, or you'd run out of cache lines sometimes and slow everything else down

21:15 <immibis> increasingly important? I thought NUMA was already important

21:15 <moon-child> perf beyond that can be taken care of with hints (the cpu has hints too--prefetch and such), and the programmertime/computertime tradeoff applies

21:15 <graphitemaster> Hate to say it but realtime requirements of video games are soon going to be looking into actual physical distance between CPU cores

21:16 <immibis> doubts on "soon"

21:17 <moon-child> immibis: allocating cache lines is workload-sensitive. If you did that manually, (adjusted cache allocations in response to workload) you'd basically be duplicating work in every application/lib, and get a negligible perf benefit

21:17 <immibis> this is latency, right? gamedev is moving in the direction of storing and processing big streams of data

21:17 <immibis> I think

21:18 <immibis> with minimal interactions between the different streams

21:18 <immibis> each time you join two streams you have to pay an inter-core latency, but... that's 0.1us per join and you have 16000us and not so many joins
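
The arithmetic behind that estimate, using the two figures from the message above (0.1us per join, a 16000us frame). The count of 1000 joins per frame is an assumed example, not a number from the discussion.

```python
FRAME_BUDGET_NS = 16_000_000   # 16000 us per frame, in nanoseconds
JOIN_COST_NS = 100             # 0.1 us of inter-core latency per join

# Number of joins that would consume the entire frame on latency alone.
max_joins = FRAME_BUDGET_NS // JOIN_COST_NS

def join_share(joins):
    """Fraction of the frame budget spent on inter-core latency."""
    return joins * JOIN_COST_NS / FRAME_BUDGET_NS

share_for_1000 = join_share(1000)   # assumed workload of 1000 joins per frame
```

Under these numbers it takes 160000 joins to fill a frame, and 1000 joins cost well under one percent of it.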

21:19 <moon-child> monitors are getting faster, and kernels are as greedy for throughput as they've ever been. You gotta keep up with the monitor and not drop frames cause the scheduler decided to jitter you a little

21:19 <immibis> hopefully

21:19 koolazer has joined #osdev

21:19 <immibis> rather than caring about physical distance they might start caring about keeping data on the same node that processes it.

21:20 silverwhitefish has joined #osdev

21:20 <moon-child> yah, I was thinking about ways to model video games as actors

21:20 <immibis> i can see physical distance mattering if they want to parallelize further - then they want to run 8 parallel threads on the same dataset on the 8 tightly-coupled CPUs if they have some auxiliary data structure they all share

21:20 <moon-child> then you basically do graph partitioning to try to put actors that like to talk with each other on the same cores

21:20 <moon-child> similar space for numa opts

21:22 <immibis> i don't think that's new. Well it's relatively new, but you're not coming up with it right now, and they call them "systems" rather than "actors"

21:22 dormito has joined #osdev

21:22 <immibis> don't have a good reference other than "something i saw on a gdc presentation on youtube once"

21:22 <moon-child> systems as in ec/ecs?

21:23 <immibis> yes. But ECS is a vague term with many specific variants

21:23 <immibis> and many unrelated ideas some of which are incompatible with each other

21:23 <moon-child> concept is very different from that

21:23 <immibis> but one of the ideas is decomposing your game loop into transformations on arrays of components

21:24 <moon-child> I later found out that carmack had experimented with largely the same thing 10 years ago. Impossible to beat him :P. https://www.youtube.com/watch?v=1PhArSujR_A 16:25, I think

21:24 <bslsk05> 'John Carmack's keynote at Quakecon 2013 part 4' by Kostiantyn Kostin (00:29:59)

21:24 <immibis> the trivial canonical example being `position += velocity * timestep;` -> `all_positions += all_velocities * timestep;` -> run this on the core that has all_positions and all_velocities in local memory, while another core culls bounding boxes or something
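
A small sketch of that transformation, assuming one scalar position and velocity per entity. The names `all_positions` and `all_velocities` come from the message above; `integrate` and the sample values are illustrative. The standard-library `array` type is used only to suggest contiguous storage.

```python
from array import array

def integrate(all_positions, all_velocities, timestep):
    """Advance every position in place: position += velocity * timestep."""
    for i, velocity in enumerate(all_velocities):
        all_positions[i] += velocity * timestep

# One contiguous array per component instead of one object per entity.
all_positions = array("d", [0.0, 10.0, -5.0])
all_velocities = array("d", [1.0, -2.0, 0.5])

integrate(all_positions, all_velocities, 0.5)
```

The whole pass touches two flat arrays and nothing else, which is what makes it easy to pin to the core or node that already holds them.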

21:24 <moon-child> everything is immutable, frame data is double buffered. Very good for multicore because you don't have any contention on your writes

21:26 <moon-child> and things like interpolation, time travel (cf multiplayer/rollback, or braid-style as a game mechanic) become much easier as an added bonus

21:35 <immibis> immutability is not necessary, but clear data flow is. Allocation and garbage collection when you know the old thing isn't used any more is a waste of time

21:35 ahalaney has quit [Remote host closed the connection]

21:35 <immibis> if you prefer, think of it as compile-time garbage collection. "This isn't used any more but we want a new buffer of the exact same size, so overwrite it"

21:38 asymptotically has quit [Quit: Leaving]

21:46 superleaf1995 has joined #osdev

21:50 <moon-child> call it sophisticated manual memory management (where malloc/free, naive manual memory management, is rarely of use)

21:51 <moon-child> immutability is the only mechanism I know of for guaranteeing clear dataflow at the architectural level

21:51 <immibis> proper prior planning prevents piss poor performance

21:51 <immibis> you update the thing. now every bit of code that accesses the thing is accessing the new thing

21:51 <moon-child> okayyyyy, but I want to evolve my designs...

21:52 <immibis> then you'd better evolve the plan

21:52 <moon-child> I can't redesign my entire application every time I think of a new feature I want to add

21:52 <immibis> but you can update your dataflow graph

21:53 <immibis> are we still talking about game loops?

21:55 <moon-child> I was about to say, I think we're talking at cross purposes :P

21:56 <moon-child> I was talking about game loops. I think I'm not quite sure what point you're making, though

22:00 <immibis> you can certainly write games in haskell, and most games have stuff at the periphery that can benefit from it, but what do you get from making the core game loop functional?

22:01 <immibis> you mentioned past states for interpolation

22:01 <immibis> but you can design that as a ping-pong or circular buffer for example. You don't need to involve the garbage collector.
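
A minimal sketch of the ping-pong idea: two buffers are allocated once, and each frame overwrites whichever one holds the older state, so the previous frame stays readable for interpolation without any per-frame allocation. The class name and the "add 1.0" update are placeholders.

```python
class PingPong:
    """Two preallocated buffers; each frame overwrites the older one."""

    def __init__(self, size):
        self.buffers = [[0.0] * size, [0.0] * size]
        self.current = 0            # index of the most recently written buffer

    def begin_frame(self):
        """Swap roles and return (previous, target) for this frame."""
        previous = self.buffers[self.current]
        self.current ^= 1
        return previous, self.buffers[self.current]

state = PingPong(2)
original_ids = {id(b) for b in state.buffers}

for _ in range(3):
    previous, target = state.begin_frame()
    for i in range(len(target)):
        target[i] = previous[i] + 1.0   # placeholder update rule

newest = state.buffers[state.current]
older = state.buffers[state.current ^ 1]
```

A circular buffer is the same pattern with more than two slots, for when interpolation needs to reach further back.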

22:02 <immibis> if different entities interpolate from different time points it may destroy the cache, but i'm not sure how you would fix that or how immutability would help it

22:04 <moon-child> ok, then let me explain from scratch, because I don't think you need to involve garbage collection

22:07 <moon-child> proposed model is that 'game state' (call it S) is a collection of entities, each of which can be transformed, updating them from one frame to the next

22:07 <moon-child> every entity has r/w access to its own state as of frame n+1, and ro access to the state of every other entity as of frame n

22:09 <moon-child> doing things this way lets you update all the entities in parallel and gets you (imo) an architecture which enforces separation of concern
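
A sketch of the model moon-child outlines, under stated assumptions: each entity's update function receives a read-only view of frame n and returns its own state for frame n+1. The entity names and rules are invented for illustration. `MappingProxyType` only blocks writes to the outer mapping (a shallow guard), and CPython threads here show the structure rather than real speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from types import MappingProxyType

def step_entity(name, previous):
    """Compute one entity's state for frame n+1 from the read-only frame n."""
    me = dict(previous[name])                 # private, writable copy of own state
    if name == "chaser":
        target = previous["runner"]["x"]      # other entities: frame n only
        me["x"] += 1 if target > me["x"] else -1
    else:
        me["x"] += 2
    return name, me

def tick(state):
    """Build frame n+1; no entity ever writes into frame n."""
    previous = MappingProxyType(state)        # shallow read-only view of frame n
    with ThreadPoolExecutor() as pool:
        results = pool.map(lambda n: step_entity(n, previous), list(state))
        return dict(results)

frame0 = {"runner": {"x": 10}, "chaser": {"x": 0}}
frame1 = tick(frame0)
```

Because every update reads only frame n, the order in which entities are processed cannot change the result, and keeping `frame0` around gives interpolation or rollback a past state to work from.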

22:10 nur has quit [Remote host closed the connection]

22:10 <moon-child> you don't need garbage collection in a general sense; you do need to deal with entity creation/destruction/reference somehow, but that's a general problem which gc doesn't even help with

22:11 <immibis> I am not thinking of "entities" having access to anything. Rather processing steps, or systems

22:11 <immibis> (I have tried writing a game in this style and it is practical for some systems, impractical for others. DoorSystem, really??)

22:12 <immibis> (I hear that some games have a catch-all "scripting system" for little events like "click mouse to open door")

22:13 <immibis> (on the other hand, there's a differential equation that updates the world state, and evaluating that as one step on big vectors is much better than trying to evaluate it for every grid square in parallel)

22:14 <immibis> (in sequence*)

22:15 <moon-child> yeah, physics is something I struggled with coming up with a sensible design for

22:16 <moon-child> one thought I had is--currently there's one sync point, which is 'tick', and graphics are centralized. Could add multiple sync points and do centralized physics as well. But that's getting dangerously close to an explicit dependency graph (a la make), which is something I'm trying to stay away from

22:17 <immibis> an explicit dependency graph is exactly what you want to maximize parallelism

22:17 Skyz has joined #osdev

22:17 <immibis> you can do any step as soon as all its input steps have completed
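
A sketch of that rule using Python's standard `graphlib` module (3.9+): a step becomes ready once every step it depends on is marked done, and all steps that are ready together could be dispatched in parallel. The step names in the graph are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical frame steps; each maps to the set of steps it must wait for.
steps = {
    "input": set(),
    "physics": {"input"},
    "ai": {"input"},
    "animation": {"physics", "ai"},
    "render": {"animation"},
}

def run_in_waves(graph):
    """Return batches of steps; steps within one batch have no mutual dependencies."""
    sorter = TopologicalSorter(graph)
    sorter.prepare()
    waves = []
    while sorter.is_active():
        ready = sorter.get_ready()     # steps whose inputs have all completed
        waves.append(sorted(ready))    # sorted only to make the output stable
        sorter.done(*ready)            # a real scheduler would mark each as it finishes
    return waves

waves = run_in_waves(steps)
```

Here `physics` and `ai` land in the same batch, so they are the pair that could run on separate cores.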

22:20 <Skyz> KeyKOS came out when systems research had relevance, interesting

22:21 <Skyz> http://cap-lore.com/CapTheory/KK/Patent/Patent.html

22:21 <bslsk05> cap-lore.com: U.S. Patent 4,584,639

22:22 <immibis> it is odd because you were not online when I mentioned capability-based systems

22:22 <moon-child> https://libera.irclog.whitequark.org/osdev/2021-07-14 logs

22:22 <bslsk05> libera.irclog.whitequark.org: #osdev on 2021-07-14 — irc logs at whitequark.org

22:22 <Skyz> Yeah I looked at the logs

22:23 <immibis> apparently GNOSIS was designed for a time-sharing system, their equivalent of cloud computing

22:23 <immibis> Rather than many identical servers, they wanted to share a smaller number of computers among a large number of users, and software would be written specifically for their system

22:24 <immibis> if programs can be ported to ARM because $megacorp said so, they could be ported to KeyKOS because $othermegacorp said so

22:25 <Skyz> Not true

22:26 <Skyz> I guess it works with M$ now and windows 11

22:26 <immibis> programs have already been ported to a Mach because of $megacorp, but it's not entirely fair because they actually use the unix emulation layer

22:26 <immibis> on mac is unix built on top of mach or is it a peer with mach?

22:27 <immibis> i think i read darwin implements both unix and mach and the mach stuff is mostly vestigial

22:28 <Skyz> Mach-o? or Mach?

22:28 <Skyz> Because GNU has a project called Mach

22:29 <Skyz> https://developer.apple.com/library/archive/documentation/Darwin/Conceptual/KernelProgramming/Mach/Mach.html

22:29 <bslsk05> developer.apple.com: Mach Overview

22:29 <Skyz> I've never actually read apple's documentation

22:31 <Skyz> Mach is built on top of unix I believe

22:33 <immibis> well in any case the point is you use what $bigmegacorp says or else

22:33 <immibis> if you're a $bigmegaos programmer

22:33 <immibis> because your customers are using $bigmegaphones

22:35 <Skyz> Yeah

22:38 <Skyz> They are who pay in the end

22:38 <Skyz> So the $megacorp does what they say

22:39 <Skyz> But $megacorp can also impose new things

22:43 sortie has quit [Quit: Leaving]

22:48 Skyz has quit [Quit: Client closed]

22:49 <geist> immibis: i see what you did there with $

22:58 <immibis> users never paid for ARM hardware. That was entirely the corp's decision. They wouldn't pay for a new kernel, similarly

22:58 <immibis> well they're paying for hardware, but they don't care whether it's ARM or not

23:01 immibis_ has joined #osdev

23:01 Burgundy has quit [Ping timeout: 265 seconds]

23:03 immibis has quit [Ping timeout: 268 seconds]