#osdev on 2023-02-10 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:14 bch has quit [Quit: quitter]

00:22 bnchs has quit [Remote host closed the connection]

00:32 bauen1 has joined #osdev

00:36 bauen1 has quit [Remote host closed the connection]

00:37 bauen1 has joined #osdev

00:41 dutch has quit [Quit: WeeChat 3.8]

00:45 dutch has joined #osdev

00:48 Matt|home has quit [Quit: Leaving]

00:53 nyah has quit [Quit: leaving]

01:00 Left_Turn has quit [Ping timeout: 264 seconds]

01:02 thinkpol has quit [Remote host closed the connection]

01:03 thinkpol has joined #osdev

01:17 gog has quit [Ping timeout: 268 seconds]

01:24 epony has joined #osdev

01:26 slidercrank has quit [Ping timeout: 265 seconds]

01:29 Burgundy has quit [Ping timeout: 260 seconds]

01:40 _xor has quit [Quit: brb]

01:42 divine has quit [Quit: leaving]

02:17 heat has quit [Ping timeout: 256 seconds]

02:31 Starfoxxes has quit [Ping timeout: 248 seconds]

02:33 Starfoxxes has joined #osdev

02:49 fedorafansuper has quit [Ping timeout: 248 seconds]

02:52 fedorafan has joined #osdev

03:23 srjek has quit [Ping timeout: 252 seconds]

03:27 <immibis> what if instead of shoehorning everything into a handful of file verbs, the correct solution is to make verbs cheaper

03:31 <zid> we decided spectre was more fun

03:32 <Mutabah> APIs are hard yo

03:35 <zid> I do think we should break out ioctl to some more general verbs though

03:35 <zid> it's hard though

03:35 <immibis> No, more specific. Windows has verbs like EnumerateNetworkAdapters

03:36 <zid> which lives in userspace

03:36 <zid> and that's what linux does too, but leaves it to 3rd party userspaces

03:37 <immibis> How to enumerate network adapters on windows: call EnumNetworkAdapters or something like that. On Linux: open a netlink socket, send a request for network adapter list, receive messages until you get the last one, ignore messages not related to your request

03:37 <zid> windows has cm/ex/hal/io/ke/mm/ob/po/tm and nt/zw

03:37 <zid> which is more than linux has to be sure

03:38 <zid> but it doesn't *actually* have 99% of the winapi as syscalls

03:38 fedorafan has quit [Ping timeout: 248 seconds]

03:40 <immibis> who said syscalls

03:41 <immibis> which should really be called kernel calls, because all those DLLs are definitely part of the system

03:41 <zid> It's a paradigm difference, windows provides its 'verbs' by bundling them with the kernel inextricably as a userspace dll

03:41 <zid> linux provides nothing

03:41 fedorafan has joined #osdev

03:43 fedorafan has quit [Client Quit]

03:44 <zid> and on linux, you just ls /proc/net

03:44 <zid> cat /proc/net/dev

03:44 <zid> that's what ifconfig does I just checked

03:45 <Mutabah> For system information like that, files seems like a pretty appropriate method

03:46 <zid> yea files is actually really workable for that specific case

03:46 <Mutabah> (You're just reading a blob of information)

03:46 <zid> There's a whole shit load of 'desktopy' stuff that windows does provide that doesn't work that great as files though

04:02 gildasio1 has quit [Remote host closed the connection]

04:03 gildasio1 has joined #osdev

04:04 <immibis> that is an outdated interface zid. You're supposed to use netlink

04:05 <immibis> netlink is equivalent to device files but with a different interface for some reason that makes everything more complicated

04:05 <immibis> instead of opening /dev/netsystem you connect a netlink socket to the networking system using a hardcoded address number

04:05 <immibis> which goes in the protocol field and not the address field because they said so

04:07 _xor has joined #osdev

04:14 <sham1> It's even worse than I imagined

04:18 wand has quit [Ping timeout: 255 seconds]

04:19 <zid> so the problem is that the api moved *away* from files :P

04:20 [itchyjunk] has quit [Remote host closed the connection]

04:23 <sham1> Something something "those who do not understand UNIX are bound to recreate it, badly" something

04:28 dude12312414 has joined #osdev

04:29 dude12312414 has quit [Remote host closed the connection]

04:34 wand has joined #osdev

04:35 AFamousHistorian has joined #osdev

05:03 mehdix has quit [Quit: ZNC 1.7.5+deb4 - https://znc.in]

05:08 x8dcc has quit [Ping timeout: 246 seconds]

05:11 bradd has joined #osdev

06:04 fkrauthan has quit [Quit: ZNC - https://znc.in]

06:05 fkrauthan has joined #osdev

06:07 fkrauthan has quit [Client Quit]

06:09 fkrauthan has joined #osdev

06:10 <sham1> And this is why computing has been stagnant. We peaked in the 70s

06:12 sebonirc has quit [Remote host closed the connection]

06:12 sebonirc has joined #osdev

06:14 <zid> 2011*

06:16 <sham1> UNIX isn't from 2011

06:19 <zid> no but sandy bridge is

06:20 <zid> there has to be a dip afterwards for there to be a peak, rather than a plateau

06:20 <zid> and we definitely went up until sandy, then down

06:20 <kazinsal> maybe intel did

06:20 <kazinsal> the rest of the computing world moved on

06:21 <kazinsal> no one else is to blame for the most modern intel processor continuing to be a microwaved sandy bridge with the marketing flavour of the week bolted on for 1d4 generations before being dumped

06:23 <zid> hey that's unfair, they also roll the 1d4 for how many pci-e lanes to remove and how many memory channels too

06:25 <zid> 13xxx was semi-decent though, a good tock to sandybridge

06:26 <kazinsal> I ended up just building a 7700X machine because either way I'd need to jump to DDR5 so I figured I'd go with a fresh socket that's guaranteed to get at least two generations of tangible CPU improvements

06:27 <zid> if I had to give up SB for something before 13xxx came out it would have been a 3600x or something

06:30 <immibis> zid: files were insufficiently extensible; netlink allows structured requests and extensible TLV requests and responses and update notifications

06:31 <immibis> isn't sandy bridge also a slight update of whatever was before that, and isn't that also the case for all processors back to the first Core 2 which was a major microarchitectural change?

06:32 <sham1> This is why devices and such need to be file trees instead of single files

06:33 <immibis> or it's why memory needs to be an SQL database

06:33 <moon-child> how many avxes does sandybridge have?

06:33 <zid> 1

06:33 <moon-child> exactly

06:34 <moon-child> you can track the quality of intel cpus by how many avxes they have

06:34 <zid> sandy is where they figured shit out

06:34 <moon-child> hence why zen4 is the best zen

06:34 <zid> nahlem is missing stuff and is tuned worse etc

06:35 <moon-child> I mean, sure. But skylake > broadwell > haswell > ivybridge > sandybridge. It's not like it ever got _worse_

06:35 <zid> except that's not what chips they actually released

06:35 <zid> if you look at the SKUs, all they did was remove pci-e lanes and memory channels for 10 years

06:35 <zid> you got a couple of percent ipc, though, woo

06:36 <moon-child> idk man

06:36 <moon-child> I have a bunch of pcie lanes and memory channels

06:36 <zid> what cpu?

06:37 <moon-child> skylake

06:37 <zid> what cpu?

06:37 <moon-child> actually cascadelake now

06:37 <moon-child> w-2295

06:37 <zid> lol no shit

06:37 <zid> that's one of the best cpus they ever made

06:37 <zid> and is firmly outside that 10 year window

06:37 <zid> SB got uop cache(!), dual port memory, doubled branch prediction targets, avx, integrated graphics, etc

06:38 <zid> lots of other various internal hidden buffers doubled

06:38 <zid> it's *way* better than nahalem

06:38 <zid> it also clocks better by like.. gigahertz

06:38 <sham1> Memory need not be an SQL database. A filesystem however could be

06:40 <zid> biggest missing thing between SB and skylake (actual skylake, not fucking mega-rocket-cascade-ice-lake, actual skylake) is the cache clocks got massively improved, and avx2

06:40 <moon-child> cascade lake is literally skylake

06:40 <moon-child> icelake is different

06:41 <kazinsal> yeah most improvements have on the intel side since SB have been "we can get it to clock 125 MHz higher on stock voltage" every gen

06:41 <zid> cascade lake is skylake-sp++ or skylake-x++

06:41 <moon-child> well

06:41 <kazinsal> it was kinda neat just setting up my 7700X and telling Ryzen Master "yeah go nuts homie" and getting 5.3 GHz all-core without having to try

06:41 <moon-child> they also increased the number of avxes

06:42 <moon-child> which must count for something

06:42 <zid> kazinsal: I might get near that if I had enough fans :(

06:42 <zid> wtb more fans

06:42 <kazinsal> 360mm rad here

06:42 <zid> yea I'm still using the evo 212 from my q6600 :D

06:42 <kazinsal> it still maxes out at 95C but apparently they just do that

06:42 <zid> I hit 95C at like.. 4.5GHz, but I still have a shit load more voltage I can throw at it

06:42 <kazinsal> temps only matter if you can't actually dissipate the load

06:42 <zid> just don't have a cooler that'd let me

06:43 <kazinsal> and thing thing just goes BRRRRRRRRRRRRRRRRRR

06:43 <zid> maybe the silicon fails at 4.55GHz idk

06:43 <zid> but it seems like it won't given how much headroom I have left

06:44 <zid> The single core OC record for this chip is like.. 6.5GHz

07:05 <immibis> sham1: wrong. Everything should be SQL. How much time have you spent writing code to maintain multiple indexes of the same data?

07:06 <zid> I agree with S

07:06 <zid> disagree with QL

07:09 <moon-child> no one likes QL

07:11 <immibis> update dudes set position=position+velocity*0.03

07:20 AFamousHistorian has quit [Remote host closed the connection]

07:25 fedorafan has joined #osdev

08:04 knusbaum has quit [Ping timeout: 248 seconds]

08:08 knusbaum has joined #osdev

08:11 LostFrog has joined #osdev

08:11 PapaFrog has quit [Ping timeout: 256 seconds]

08:13 knusbaum has quit [Ping timeout: 255 seconds]

08:14 danilogondolfo has joined #osdev

08:16 knusbaum has joined #osdev

09:00 shinbeth has joined #osdev

09:05 jjuran has quit [Ping timeout: 260 seconds]

09:06 jjuran has joined #osdev

09:07 epony has quit [Remote host closed the connection]

09:10 gog has joined #osdev

09:17 bauen1 has quit [Ping timeout: 248 seconds]

09:18 slidercrank has joined #osdev

09:28 remexre has quit [Read error: Connection reset by peer]

09:35 nyah has joined #osdev

09:50 small has joined #osdev

09:56 GeDaMo has joined #osdev

10:08 joe9 has quit [Quit: leaving]

10:24 fedorafan has quit [Ping timeout: 252 seconds]

10:25 les has quit [Quit: Adios]

10:25 les has joined #osdev

10:28 Burgundy has joined #osdev

10:30 fedorafan has joined #osdev

10:38 bauen1 has joined #osdev

10:45 bauen1 has quit [Ping timeout: 265 seconds]

11:02 bauen1 has joined #osdev

11:06 slidercrank has quit [Ping timeout: 252 seconds]

12:07 dutch has quit [Quit: WeeChat 3.8]

12:10 spikeheron has joined #osdev

12:36 gog has quit [Ping timeout: 252 seconds]

12:38 gog has joined #osdev

12:39 bradd has quit [Ping timeout: 264 seconds]

12:43 [itchyjunk] has joined #osdev

12:58 DynamiteDan has quit [Excess Flood]

12:59 DynamiteDan has joined #osdev

13:00 bgs has joined #osdev

13:13 heat has joined #osdev

13:35 Burgundy has quit [Ping timeout: 256 seconds]

13:44 shinbeth has quit [Remote host closed the connection]

14:07 heat has quit [Read error: Connection reset by peer]

14:08 heat has joined #osdev

14:56 slidercrank has joined #osdev

15:03 srjek has joined #osdev

15:20 Left_Turn has joined #osdev

15:38 srjek has quit [Ping timeout: 265 seconds]

15:43 <mrvn> *gaehn*

15:46 <nikolar> *agheh*

15:51 grange_c0 has quit [Quit: The Lounge - https://thelounge.chat]

15:52 grange_c0 has joined #osdev

16:33 Burgundy has joined #osdev

17:03 gog has quit [Quit: Konversation terminated!]

17:14 Piraty has quit [Quit: -]

17:16 Piraty has joined #osdev

17:20 zhiayang has quit [Quit: oof.]

17:22 zhiayang has joined #osdev

17:27 xenos1984 has quit [Ping timeout: 248 seconds]

17:28 xenos1984 has joined #osdev

17:30 dude12312414 has joined #osdev

17:35 bauen1 has quit [Ping timeout: 248 seconds]

17:43 bch has joined #osdev

17:45 x8dcc has joined #osdev

17:51 spikeheron has quit [Quit: WeeChat 3.8]

18:04 dutch has joined #osdev

18:13 Terlisimo has quit [Quit: Connection reset by beer]

18:16 Terlisimo has joined #osdev

18:20 xenos1984 has quit [Ping timeout: 246 seconds]

18:23 <sham1> ping

18:25 <zid> dong

18:26 janemba has joined #osdev

18:34 xenos1984 has joined #osdev

18:34 bauen1 has joined #osdev

18:36 FreeFull has joined #osdev

18:39 craigo has joined #osdev

18:39 <nikolar> bang

18:44 <mrvn> I don't like greek pop-duos

18:48 <zid> and the dirt is gone

18:49 <geist> kazinsal: oh you got a 7700x? how you liking it? do you have an opportunity to single core bench it vs a 5000 or 3000 series?

18:49 <geist> curious how the zen 4s perform single core

18:50 <zid> looking it up, very good

18:50 <geist> i can look up benchmarks of course, but always nice to see it confirmed on the street

18:51 <zid> like, I am surprised

18:51 <zid> ryzen 5 was pretty average, my SB competed

18:51 <zid> his 7700 destroys us

18:52 <nikolar> It was like 13% IPC uplift

18:52 <nikolar> Add to that significantly higher clocks

18:52 <zid> and it's running like a gigahertz faster yea

18:55 <geist> you mean ryzen 5xxx when you say ryzen 5?

18:55 <zid> 3xxx

18:55 <zid> at least

18:55 <zid> is what I checked for a comparison

18:55 <zid> I'm not that into zen3

18:56 <geist> yah that'd be zen 2 vs zen 4, good jump

18:56 <zid> zen3 looks to be rouhgly.. the exact midpoint

18:56 <zid> intel went SB, crap crap crap crap, 13900k, amd went zen2, zen3, zen4 in equal steps

18:57 <nikolar> 13900k was less efficient than amd

18:57 <zid> now get one of those nice epycs with 768MB of L3

18:58 <nikolar> You can store a whole os in the cache :)

19:06 small has quit [Ping timeout: 252 seconds]

19:11 <mrvn> Now I wonder how long a syscall will take on a "AMD Ryzen 5 2400G with Radeon Vega Graphics" with kvm.

19:12 <mrvn> I should measure raw, kvm and nested kvm.

19:14 danilogondolfo has quit [Remote host closed the connection]

19:18 zhiayang has quit [Quit: oof.]

19:18 zhiayang has joined #osdev

19:19 dude12312414 has quit [Remote host closed the connection]

19:20 dude12312414 has joined #osdev

19:35 epony has joined #osdev

19:37 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

19:38 xvmt has quit [Remote host closed the connection]

19:40 xvmt has joined #osdev

19:40 fedorafan has quit [Ping timeout: 248 seconds]

19:40 bgs has quit [Remote host closed the connection]

19:43 fedorafan has joined #osdev

19:44 AFamousHistorian has joined #osdev

19:55 k0valski18891 has quit [Quit: Peace out !]

20:12 AFamousHistorian has quit [Ping timeout: 248 seconds]

20:28 bnchs has joined #osdev

20:28 <bnchs> hi osdevelopers and other developers alike :3

20:29 <zid> That's fair, I make messes and stupid jokes mainly

20:31 <nikolar> Hello

20:32 <bnchs> what are you all up to?

20:38 <zid> dark souls mainly?

20:42 <kof123> a "filesystem" thing. its important to have a plan/design and stay motivated .oO( ♫ and we'll save terrance and phillip too, cuz that's what brian boitano'd do ♫ )

20:42 brocellous has left #osdev [#osdev]

20:45 <mrvn> hey, me too. I need a cachefs for fuse.

20:46 k0valski18891 has joined #osdev

20:51 <mrvn> Anyone know if copy_file_range will pre-allocate the output file?

20:54 <moon-child> zid: that's a lotta l3 ;o

20:55 <nikolar> kof123: wanna share some details :)

20:56 <kof123> you are assuming i am a good influence/idea. ls -d */function.c | wc -l 1624 that is one function per file

20:57 <kof123> skeletons, havent written a line. i dunno, hundreds of fields but most are optional. so code will be very branchy

20:57 <kof123> just doing docs, for my own sake, to keep track of things

20:58 <mrvn> you are doing it wrong for sure

20:58 <kof123> eh, its simple. real stuff will call down to these, these will be very simple few liners

20:58 <moon-child> meh just style stuff. I wouldn't do it like that, but if it works, eh

20:58 <kof123> and a large amount is "abstractions"

20:59 <kof123> well, that is just fascist directory layout for other reasons

20:59 <moon-child> sun libm is one function per file and no one complains about that :)

20:59 <kof123> the declaration "headers" will be autogenerated

21:00 <kof123> this is kind of a....meant to bootstrap other things. so is kind of a grab bag everything and the kitchen sink goes here

21:00 <mrvn> moon-child: those are independant and highly complex functions though.

21:00 <kof123> normally i would split into many tiny libraries

21:00 bnchs has quit [Quit: Lost terminal]

21:01 <mrvn> kof123: you should have a macro for unimplemented functions and one file just listing them all.

21:01 <kof123> i have a whole #pragma thing planned. but this code is special because it is "boostrap" lol

21:01 <mrvn> todo.c /* All the function stubs I still need to implement */ :)

21:01 <kof123> like real code will eventually all be pragmas lol

21:02 <kof123> this is the tip of my horrible ideas lol

21:02 <mrvn> what's a #pragma?

21:03 <mrvn> kof123: I'm writing a fuse filesystem that will store up to 1TB of data in ram and has an ioctl to make a snapshot and sync that back to a network filesystem in the background.

21:04 <kof123> yeah, i need this to load a "kernel" in, so my "bootloader" will jump to this, and then can get to real "kernel" stuff.....and then try to bridge with prior "userland pseudo-oo stuff". point being, you are way ahead of me there

21:04 <moon-child> mrvn: why?

21:04 <kof123> i just have lots of scattered pieces eventually have to "merge" them all

21:06 <mrvn> moon-child: because user apps are exceedingly stupid. Like creating 4 byte files or overwriting data over and over and that's just horrible slow over the network.

21:06 * kof123 observed bnchs left in horror

21:06 <moon-child> oh, I missed 'network'

21:06 <moon-child> so nfs can't do what you want because coherency

21:06 <mrvn> even on local disks it's painfull

21:06 <moon-child> makes sense

21:07 <mrvn> moon-child: worse, lustre. so 1MB block size for files.

21:07 <moon-child> cache not aggressive enough?

21:07 <mrvn> moon-child: can't cache create(), flush() and fsync()

21:07 <moon-child> mmmm

21:07 <mrvn> because coherency

21:09 <moon-child> if I were a computer I would simply not crash

21:09 <moon-child> or lose power

21:10 <mrvn> But there are some really stupid DNA programs out there. They take a 50MB DNA sequence and split it into 3 base pairs long files with 1 extra char info. YOu end up with a working dir with 00000, 00001, 00002, 00003, 00004, ...

21:10 <moon-child> wat

21:10 <moon-child> why

21:10 <mrvn> exactly

21:11 <mrvn> Imagine running that on ext2 where file creation is O(n^2)

21:12 <mrvn> well, files creation, each file is O(n)

21:18 <geist> yah was just thinking if there's much of an optimization you can do there but make an in memory copy of all the dirnames and use it to speed up the lookup

21:18 <geist> the standard dir_cache obviously helps for positive lookups

21:18 <geist> but for negative lookups where you want to know if something already exists you have to have guaranteed a complete cache

21:19 <geist> but all told you still have to search it to find a slot to add the new entry too

21:20 <mrvn> geist: dir_hash helps tons.

21:20 <geist> sure, but that's an on disk structure

21:20 <mrvn> once the dir is cached it's all O(1) in memory lookups.

21:20 <geist> or at least whatever the hash collision stuff is

21:20 <mrvn> true.

21:21 <geist> but yeah was thinking you could read it in and keep a fairly compressed hash of all the entries

21:21 <mrvn> But networking still kills it because the op just takes freaking long.

21:21 <geist> still basically a full cache, but just more compressed for simple hit detection

21:21 <geist> am now kinda curious exactly what the structure of the dir_hash is on ext*

21:22 <mrvn> geist: I think you have an array of pointers into an array or file names.

21:22 <mrvn> First is indexed by the hash the other just concatenation of all names with potential holes.

21:23 * geist nods

21:24 <mrvn> Not sure if it does chaining or using the next (+hash2()) slot

21:24 <kof123> this sounds less bad: ls -d hash_algorithm* | wc -l 45 (looks like 3 functions per alg.) ls -d *is_list_set_to_* | wc -l 187 (query if option is set) ls -d *set_list_to_* | wc -l 374 (set/unset toggle option) 196 "fields" (beyond header), x2 if we assume get/set function for each, some are like "arrays" or "structs" with "subfields" etc., so this is simplified . 45+187+374+400 1006

21:26 <mrvn> Some distributed filesystems can give a host "ownership" over a directory contents. So all file creation by that host is basically local unless some other host contents it. (Hint: never have 2 hosts use the same working/temp dir)

21:26 <kof123> in some cases i just didnt want giant argument list

21:27 <mrvn> Wasn't there some kernel module for Linux that allows running a list of syscalls with a single call?

21:27 <mrvn> or something writing syscalls to an uring?

21:27 <moon-child> io_uring?

21:27 <moon-child> :P

21:27 <moon-child> yea

21:28 <mrvn> moon-child: isn't that just IO calls?

21:30 <moon-child> I think the idea is you should be able to do any syscall. I don't know if that's been fully implemented yet, though

21:30 <mrvn> I think that was what the extra module did

21:33 <kazinsal> geist: yeah, it's great! I'm running windows on it so if you have something I can single core bench on that for a comparison for you I can run one

21:37 <mrvn> moon-child: One thing that would be really cool would be if io_uring would allow reading data into a buffer and writing it back without passing it through user space. But I see no copy_file_range support. Best you can do is add a pipe and splice.

21:38 <moon-child> at some point, you end up writing your entire application in ebpf

21:39 * kazinsal . o ( x86 emulator in eBPF, running eBPF hello world in Linux )

21:43 <mrvn> Looks like there is some work towards offloading copy operations: https://lore.kernel.org/lkml/cd772b6c-90ae-f2d1-b71c-5d43f10891bf@nvidia.com/

21:43 <bslsk05> lore.kernel.org: Re: [PATCH v5 00/10] Implement copy offload support - Chaitanya Kulkarni

21:49 <mrvn> clever: Do you have some simple barebones examples that do flat shading and texture mapping in 3D for the RPi?

21:49 bch has quit [Ping timeout: 265 seconds]

21:50 <mrvn> just a rotating cube or equally simple 3D stuff.

22:09 <kof123> nikolar: https://0x0.st/s/HS9qzWb6hAPM0w2ILUIB6A/HrPG.c just a .h file with planned "fields" . the reason i am not worried, is anything i screw up, those are all optional, so if i find a better way to do something, add a new field, maybe remove one of those, etc. higher level "logic" functions will call get/set functions basically.

22:09 <kof123> then there is like: caching, locks, ....

22:09 <kof123> ask mrvn :D

22:10 <kof123> *higher level things will do all the real logic

22:13 <kof123> some things are just silly like "Inode" i think nfs wants a unique inode. it doesnt really do anything but maybe someday makes "exporting" easier

22:13 <kof123> i mean, i may only actually implement like 1/5 of that or less lol

22:13 <kof123> more of a brainstorm at this point

22:15 <clever> mrvn: texturing yes, shading based on angles, not currently

22:15 <nikolar> Oh that's a lot of defines lol

22:15 <nikolar> And very long names

22:16 <clever> mrvn: https://github.com/cleverca22/gl/blob/master/texture.s this is a fragment shader for doing texture lookup and alpha blending

22:16 <bslsk05> github.com: gl/texture.s at master · cleverca22/gl · GitHub

22:16 gog has joined #osdev

22:17 <clever> mrvn: it expects 5 varyings per vertex, the texture UV, and then an RGB color to mix in (the source texture in this case is just solid white on transparent)

22:18 <clever> mrvn: the hw leaves the varyings half interpolated, so each time you read the vary FIFO, you have to then add r5 to that, lines 3-6 will fetch UV, finish that add, and then 8/9 passes the UV off to the texture lookup hardware

22:18 <clever> 14/15 then blocks until the texture lookup is complete

22:19 <clever> 17-27, will pop the R/G/B tint off the varyings, and store them into r1, and 29 sets r1's alpha to 100% (opaque)

22:20 <kof123> i say "filesystem" in quotes because it is not defined where those might live (in ram, on disk...just a "stream" somewhere). thus, its like a "build your own" kind of. you might enable some fields for something stored in RAM, other fields stored on disk (or swap or something) and link them together, etc. so really, for anything practical, i will have like "templates" and some DSL to specify at "creation" time what "fields" you

22:20 <kof123> t

22:20 <nikolar> That's an interesting idea

22:20 <clever> mrvn: then it gets a bit more fuzzy, it loads the existing color from the framebuffer, and does alpha blending between them, and writes back

22:21 <kof123> so i mean, theoretically it will have more friendliness on top. that is what i mean by "kitchen sink" too....and where the are "registers for process" there ...

22:21 <kof123> *why there are

22:21 GeDaMo has quit [Quit: That's it, you people have stood in my way long enough! I'm going to clown college!]

22:21 <gog> hi'

22:22 <clever> mrvn: each time you feed a UV pair into the texture hardware, it will also pop a uint32_t[2] off the uniform fifo, this generates that data: https://github.com/cleverca22/gl/blob/master/core.c#L499-L500

22:22 <bslsk05> github.com: gl/core.c at master · cleverca22/gl · GitHub

22:22 <clever> mrvn: that contains the phys addr of the texture, and the size and other params

22:24 <clever> mrvn: so the big unknown in your question, is how to do the shading/lighting, my example code entirely ignores lighting

22:27 bradd has joined #osdev

22:29 <mrvn> clever: 3D without a light source looks rather bad.

22:31 <mrvn> do I have to assemble that for th vc?

22:31 <clever> mrvn: yeah, let me find the assembler...

22:32 <clever> https://github.com/hermanhermitage/videocoreiv-qpu has some more notes/examples

22:32 <bslsk05> hermanhermitage/videocoreiv-qpu - Fun and Games with the Videocoreiv Quad Processor Units (34 forks/238 stargazers)

22:32 <clever> where was it...

22:34 <clever> 18 19:46:14< clever> /usr/bin/node /media/videos/4tb/rpi/videocoreiv-qpu/qpu-tutorial/qpuasm.js [--showbits] [--dumpglobals] [--dumpsymbols] [--verbose] [--ignore-errors] [--strict-match] [--in]filename

22:34 <clever> mrvn: aha, found the filename

22:34 <clever> https://github.com/hermanhermitage/videocoreiv-qpu/blob/master/qpu-tutorial/qpuasm.js

22:34 <bslsk05> github.com: videocoreiv-qpu/qpuasm.js at master · hermanhermitage/videocoreiv-qpu · GitHub

22:35 <clever> mrvn: there, thats the original assembler i was using, back before mesa was properly ported

22:35 <clever> the end result of running that assembler, looks like: https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/v3d/v3d.c#L143-L153

22:35 <bslsk05> github.com: lk-overlay/v3d.c at master · librerpi/lk-overlay · GitHub

22:36 <kof123> for something very simple, you could enable, say 8.3 filename field, dos "permissions", say like 200 entries max (all these are fixed-size -- configurable, but header has sizes...many more get/set functions lol), and have like: <header> <this stuff for "entries", say 200 spots reserved> and use the rest of a 1.44M floppy to store file data. so, you might only need like a tiny amount of fields for something simple like that

22:36 <clever> https://docs.broadcom.com/doc/12358545 is where you can find VideoCoreIV-AG100-R.pdf, that tells you how the whole 3d core works

22:37 <kof123> thats not really a priority, but should be possible

22:37 <clever> mrvn: so the question then, is how does the math for shading work, how can we implement it?

22:39 <mrvn> clever: for flat shading it's just a cross product of 2 sides to get a normal (normalize by length) and then vector product to get the angle to the light source.

22:39 tejr has quit [Remote host closed the connection]

22:39 <mrvn> clever: that gets mulitplied to the color.

22:40 <clever> mrvn: does this value vary over each pixel in a polygon, or is the entire polygon sharing one value?

22:40 <mrvn> If you interpolate the normals for the 3 vertexes you get more smooth lightning.

22:40 <mrvn> flat shading has every polygon as uniform color

22:41 <clever> so you can either pre-compute the normal map, and feed it in as a second texture

22:41 <clever> or you can feed just a few points in as varyings on the vertices

22:41 <clever> and interpolate across the polygon

22:42 <clever> but, from just the XYZ of a vertex, how do you know what angle the polygon is at?

22:42 <mrvn> clever: you need 3 points, 2 sides and then the cross produce gives you the normal

22:42 xenos1984 has quit [Read error: Connection reset by peer]

22:42 <mrvn> 90° to both sides.

22:43 <clever> i'm not sure the vertex shader can do that, so you would have to pre-compute it in cpu first

22:43 <clever> i still need to get a working demo of vertex shaders as well

22:44 <clever> https://github.com/cleverca22/gl/blob/master/core.c#L231-L244 this is a very primitive opengl implementation, where each vertex has the 5 varyings i showed earlier, UV + RGB

22:44 <bslsk05> github.com: gl/core.c at master · cleverca22/gl · GitHub

22:44 <mrvn> the multiply vector ALU should do it

22:45 <clever> https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/v3d/v3d.c#L349-L356 and a far simpler shader, with just RGB in vary and nothing else

22:45 <bslsk05> github.com: lk-overlay/v3d.c at master · librerpi/lk-overlay · GitHub

22:45 <clever> the vertex shader, basically loads an `uint32_t attributes[attr_count][16]` into the vector registers, and then runs your vertex shader on a 16-lane vector core, computing 16 vertices in parallel

22:46 <clever> the shader must then fill that vector register bank, with x[16], y[16], w[16]?, vary[vary_count][16]

22:46 <mrvn> clever: you could compute 5 normals in parallel 5 * XYZ = 15.

22:46 <clever> but the shader isnt aware of which polygons are using each vertex

22:47 <clever> and a triangle it made up of 3 verticies, which come from 3 sets of attributes

22:47 <clever> the hw scheduler expects the 16 lane vector core, to produce 16 shaded vertices

22:48 <mrvn> yeah, first you have to get the 3 vertices for each triangle in clockwise rotation. Then compute 2 sides and build the cross product and last normalize.

22:48 <clever> but due to the primitive list, the vertices can be in any order

22:48 <mrvn> Might make sense to compute 16 triangles at a time.

22:48 <clever> https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/v3d/v3d.c#L407-L411

22:48 <bslsk05> github.com: lk-overlay/v3d.c at master · librerpi/lk-overlay · GitHub

22:49 <clever> a triangle is made by putting the index of 3 verticies into this primtiveList

22:49 <clever> and you can just use any 3 vertices

22:49 <mrvn> that's normal. each vertice is also part of 3 or more triangles.

22:49 <clever> i dont know what the hardware does when your polygon is fragmented over multiple "pages" of 16

22:50 <clever> it might run the shader 3 times, producing 48 vertices, and throwing 45 of them into /dev/null

22:50 <clever> and then how can the vertex shader know what the other corners are?

22:51 <clever> enless you bake that into the attributes, and dont share a vertex

22:51 <mrvn> is the index only 0-15?

22:51 <clever> *looks*

22:51 <clever> the primitive list, is passed to the renderer thread as opcode 32

22:52 bnchs has joined #osdev

22:52 <clever> https://docs.broadcom.com/doc/12358545 page 68, opcode 32

22:52 <clever> index primitive list

22:52 <clever> you give it a 32bit maximum index, so it can fault upon corrupt data

22:52 <clever> a 32bit phys addr for the primitive list

22:53 <clever> a 32bit length, and a type of either 8bit or 16bit

22:53 <clever> so the index can be a 16bit int, 0 to 65535

22:53 <clever> and you also give it a primitive mode, points, lines, line_loop, line_strip, triangles, triangle_strip, and triangle_fan

22:54 <clever> ah yes, and this happens in the binning thread, not the rendering thread

22:55 <mrvn> So you can do 21k triangles without shared vertexes at a time.

22:55 <clever> the binning thread, is going to run your coordinate shader over things (that produces XY but no varys)

22:55 <clever> the binner will then figure out which on-screen tile the polygon is in, and generate some draw commands for those tile(s)

22:56 <clever> then the renderer thread, renders 1 tile at a time, operating on a subset of the polygons

22:56 <clever> where it will vertex shade, and fragment shade

22:56 <clever> and a tuning param, controls how far ahead the vertex shader stays

22:56 <clever> too far ahead, and you have fewer vector regs available

22:56 <mrvn> Note: Normaly your shapes are fixed so you compute normals once. If your light / object isn't moving you can even compute the brightness statically.

22:56 <clever> too far behind, and it keeps stalling to vertex shade

22:57 <clever> yeah, ive heard of some games baking the normal map into a dedicated texture

22:57 <mrvn> SO I guess you make each triangle have 4 vertexs: v0, v1, v2, normal

22:57 <clever> so you just feed the fragment shader 2 textures and a UV

22:57 <mrvn> 16k triangles at a time then

22:58 <clever> the vertex data itself...

22:58 <clever> is held within a shader record, https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/v3d/v3d.c#L186

22:58 <bslsk05> github.com: lk-overlay/v3d.c at master · librerpi/lk-overlay · GitHub

22:58 <kof123> i should call it duckfs -- does this field exist? ok you can use that feature

22:58 <clever> https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/v3d/v3d.c#L237-L250

22:58 <bslsk05> github.com: lk-overlay/v3d.c at master · librerpi/lk-overlay · GitHub

22:59 <mrvn> where it gets real fun is shadows and mirrors.

22:59 <clever> mrvn: so line 240-241, sets up the whole vertex array (and shaders), then 245-249 says how to make polygons from that

22:59 <clever> in theory, you could just repeat those 2 blocks, for every 16k triangles

22:59 <mrvn> Z sorted

23:00 heat has quit [Remote host closed the connection]

23:00 <clever> there is depth testing going on somewhere

23:00 <clever> but i'm fuzzy on the details

23:00 <clever> i had also heard how mirrors work in a YT vid a few weeks ago, just draw the scene from the viewpoint of the mirror, using a transformation matrix, and a stencil

23:01 <mrvn> clever: and then set that as texture. That's the general hack.

23:01 heat has joined #osdev

23:01 xenos1984 has joined #osdev

23:01 <clever> the method i saw didnt use it as a texture, but drew right into the final framebuffer

23:01 <clever> and used the stencil to limit where it could draw, and let depth testing do the rest

23:01 <clever> also, using the 3d output as a texture, requires an extra step

23:02 <mrvn> clever: that just combines the 2 passes into one then

23:02 <clever> yeah

23:02 <clever> https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/v3d/v3d.c#L282-L294

23:02 <bslsk05> github.com: lk-overlay/v3d.c at master · librerpi/lk-overlay · GitHub

23:02 <mrvn> But you might want the mirror to also be a (textured) light source.

23:02 <clever> this sets up the format and physical addr of the output frame, this code is doing a linear bitmap image

23:02 <clever> but the texture core cant accept linear images!

23:02 <mrvn> or you transform all the light sources to their mirrored equivalents.

23:03 <clever> if you then lookup opcode 113 in the pdf, youll find that bit 70:71, is the memory format, linear, t-format, lt-format

23:03 <clever> change that to t-format, and then you can happily use the output as a texture

23:05 <clever> oh, and there was another thing i have yet to try using...

23:05 <clever> page 72 in the pdf

23:06 <clever> "vg inline primitives"

23:06 <clever> you can skip the entire vertex index layer, and just put vertex data directly into the control list

23:06 <clever> for every 3 vertices, it makes 1 triangle, feed it vertex data until you exaust all ram, lol

23:16 <clever> the hardware also accepts 3 types of shader records

23:16 <clever> the "gl shader state record" contains the addresses for the coordinate shader, vertex shader, fragment shader, and the strides and bitmasks for all the attributes

23:17 <mrvn> I want my blitter back from my Amiga. That was so much simpler.

23:17 <clever> the "nv shader state record" takes a fragment shader, and the shaded vertex data addr

23:17 <clever> if you just want to blit, the dma core on the rpi can already do that, it has 2d dma modes

23:18 <clever> its basically just a memcpy in a loop, copy X bytes, increment src by Y, increment dst by Z, do it I times

23:18 <mrvn> draw the edges with bresenham, run the blitter over the rect to fill the polygon and then again to copy it into the bitmaps as the right color.

23:19 <mrvn> clever: the blitter could fill an area turning on/off every time it hits a set bit.

23:19 <clever> ah, thats something that the pi cant really do, that i know of

23:20 <nikolar> what sort of hardware accelleration is there for 2d anyway

23:21 <mrvn> nikolar: copying. everything else is 3D

23:21 <nikolar> yeah though so

23:21 <nikolar> basically just blitting

23:21 <clever> nikolar: the 2d core on the rpi can composite a large number of sprite like layers, and do scaling, alpha blending, and pixel format conversions

23:21 <mrvn> you have DMA and GPU

23:21 <nikolar> and video decode too i imagine

23:21 <clever> but the 2d core cant do skew or rotation, only axis flips

23:22 <mrvn> nikolar: no, codecs are extra code. video is too complex.

23:22 <clever> yeah

23:22 <nikolar> clever: yeah compositing is really useful for windowing stuff

23:23 <mrvn> nikolar: not powerfull enough though to composite on the fly every frame.

23:23 <nikolar> scratch that then

23:23 <mrvn> nikolar: you give every window a framebuffer and then composit them into a global framebuffer and that you display.

23:23 <clever> mrvn: are you sure? https://www.youtube.com/watch?v=JFmCin3EJIs

23:23 <bslsk05> 'Chaos, 13 sprites randomly bouncing around' by michael bishop (00:00:12)

23:23 <nikolar> i am really new to gpu accelleration so sorry for dumb questions

23:24 <mrvn> and hopefully not too much changes every frame.

23:24 <mrvn> clever: I have more than 290 windows.

23:24 <clever> mrvn: for the rpi's 2d core, there is no global framebuffer, it composites on the fly, and is constantly racing the (virtual) electron beam

23:24 <clever> ah yeah, with that many, you would need to use the offline composition and global framebuffer options

23:24 <mrvn> clever: yes, but as said, not powerfull enough.

23:24 <clever> the 2d core can just be copied back to ram with dma

23:25 <clever> and then you can do multiple passes

23:25 <clever> that 290 limit, is also assuming you want to pageflip between 2 frames, each of 290

23:25 <nikolar> you can use the gpu to draw only the differences i imagine

23:25 <mrvn> Usualy you also don't have many windows visible and changing. So you only have to composite small reactanges of the overall screen each frame.

23:25 <nikolar> is there a way you can save the output

23:25 <clever> if your displaying 1 frame, and then doing offline composition, the limit is more like 500

23:25 <mrvn> nikolar: yes

23:26 <mrvn> 00:24 < clever> the 2d core can just be copied back to ram with dma

23:26 <clever> nikolar: thats what the offline composition is doing

23:26 <nikolar> ah sorry, missed the message

23:26 <clever> offline composition is also the only way to do a 90 degree (axis swap) rotation

23:28 <nikolar> is virtio-gpu representative of how actual hardware works

23:28 <mrvn> nikolar: you could also do it dynamically. Check out how many non overlapping (or just a few overlapping) rectangles you have and if it's < 290 you composite dynamically direct to the video out.

23:28 <nikolar> mrvn: but if it's more then you'd have to cache

23:29 <mrvn> nikolar: yes. and then you would sort windows by how much they change and combine the not chaning ones first.

23:29 <clever> mrvn: but if you want to use offline composition, you can still have the hw do ~500 layers in a single batch, and then you still have enough for 40+40 to pageflip between

23:29 <clever> so it could draw all of the idle windows in one batch, then use that like a wallpaper behind the 39 most active windows

23:29 vdamewood has joined #osdev

23:30 <mrvn> excatly what I said

23:30 <clever> yep

23:30 <nikolar> yeah that's what i was thinking

23:30 <mrvn> Splitting windows into exposed rectangles in some clever way probably gets you below 290 in almost all cases.

23:30 <nikolar> isn't that what xorg does

23:31 <clever> splitting like that, also drasticaly reduces the resource usage for drawing

23:31 <clever> drawing over a pixel multiple times is costly, and can bring the limit below 20

23:31 dutch has quit [Quit: WeeChat 3.8]

23:31 <mrvn> nikolar: xorg does have exposed ractangles. It tells you what parts of your window become visible.

23:32 <mrvn> But if you have transparency or shape extension this gets rather ugly.

23:32 <nikolar> yeah that's why you have xorg compositors as an external things :)

23:32 <mrvn> You can easily stack 100 terminals with transparency on top of each other. Have fun rendering that.

23:33 <mrvn> .oO(if you have 50% transparency you can stop after 8 though)

23:34 <nikolar> i have transparency on my terminal and there's actually a noticable difference in battery life when it's enabled and when it's disabled

23:34 <mrvn> I find that horrible to read. Doesn't gains you anything.

23:35 <mrvn> I don't even like a background image. Too easy to reduce the contrast of the text.

23:35 <clever> the 2d core can draw 4 pixels per clock, and runs at ~500mhz, 2 billion pixels/second, 1280x1024@60 is 78 million pixels/second, so the rpi hardware can composite ~25 frames, each 1280x1024, with per-pixel alpha, and do it at 60fps

23:35 <nikolar> it wasn't that noticable, and i did like how it looked, but i wanted to tune my laptop for maximum efficiency :)

23:36 <mrvn> turn down brightness. :)

23:36 <nikolar> that helps too

23:36 <clever> the only reason i have the compositor enabled, is so i dont notice the laggy redraws in chrome

23:36 <mrvn> also keep the desktop which with little black.

23:36 <nikolar> hardware decode also helps, which i didn't realise wasn't enabled

23:36 <mrvn> s/which/white/

23:36 <clever> due to every window having its own off-screen buffer, i can switch to it instantly, without having to wait for a repaint

23:36 <clever> that, and the preview alt+tab shows

23:36 <nikolar> mrvn: can't really do light theme though lol

23:36 <clever> all the fancy effects are disabled

23:37 <mrvn> nikolar: why not?

23:37 <nikolar> can't stand it lol

23:37 <mrvn> nikolar: too bad. TFT work by having a bright light and then blocking it to make pixel darker. That blocking takes power.

23:38 <nikolar> i know that, but there are some things i just find easier to look at

23:38 <nikolar> one of those being dark themes

23:38 <mrvn> I think there are some screens that can control the LED light for regions of the screen. So if a large area is dark it dimms the LEDs.

23:38 <clever> mrvn: my cellphone has an oled display, and its freaky how dark it can go

23:38 <clever> every time a loading screen makes it go black, it looks like the phone just died

23:39 <mrvn> clever: sucking in light?

23:39 <clever> there is no difference between "black screen" and "just off"

23:39 <clever> so it can be hard to tell if its even on sometimes

23:39 <mrvn> clever: are you sure black screen isn't detected and really is just off?

23:39 <mrvn> ever had a black screen with 1 white pixel?

23:39 <nikolar> maybe oled phones need a small led somewhere so you know it's not off

23:40 <mrvn> Hmm, aren't oleds actually producing light in the right color?

23:40 <clever> i believe this is just 3 LED's in each pixel, r/g/b

23:40 <clever> so black is just turning all of those LED's off

23:40 <mrvn> eaxcty. So black is just power off.

23:40 <clever> yep

23:41 <clever> the control circuits are still on, but you cant see those

23:41 <mrvn> On TFTs white is power off.

23:41 <clever> also on lcd, black is a dim grey

23:41 <clever> so you can tell that its on, even when its "black"

23:41 <mrvn> And the pixels can't block 100% of the light so black is actually just dark.

23:41 <clever> yep

23:43 <mrvn> When you setup the wall for the home projector or in the cinema before the film starts what color does it have?

23:44 <clever> ideally, something pure white?

23:44 <clever> and decently reflective

23:44 <mrvn> Nope. that's black. You aren't putting any color on it yet. :)

23:44 <clever> but retroreflective is bad

23:44 <clever> ah yeah, you mean when the projector light is off

23:45 <mrvn> Black in the film is just the color of your ambient light. Doesn't get really black. Just like TFTs.

23:45 <clever> and thats where you want to control the lighting in the room

23:45 <clever> avoid pointing it towards the screen

23:45 <clever> and make the floors/chairs absorb all light

23:46 <mrvn> Are there any oled cinemas yet?

23:46 <clever> not heard of any

23:47 <mrvn> What I have seen though is "green screens" for movies that are actually displays. So they display the background in real time while filming and not in post processing.

23:47 <mrvn> Avoids the border effects where the actors fade into the green screen.

23:47 <clever> i was watching a kyle hill livestream a few days ago

23:47 <clever> and i noticed a green tint to his hair

23:48 <clever> the hair is semi transparent, and the greenscreen was bleeding thru between the strands

23:48 <mrvn> and because it's a mix of hair and green it doesn't trigger the replacement.

23:48 <clever> yep

23:48 <clever> ideally, you should detect shades of green, and make those pixels partially transparent

23:48 <mrvn> or when it gets too thin it replaces and you have hair loss

23:48 <clever> but then you cant have any shades of green on the actor

23:49 <mrvn> clever: now the fern on the desk is transparent.

23:49 <clever> those projector setups avoid that issue, and also fix lighting issues

23:49 <clever> you dont want the actor to have a green face, because of all of the green light the screen is reflecting

23:49 <clever> and you want the actor to be lid up by the lights in the scene

23:49 <mrvn> clever: also produce lighting issues since you see the shadows on the wall.

23:50 <mrvn> https://www.molton-markt.de/images/slider_images/greenscreen-kaufen.jpg

23:51 <mrvn> you have to be carefull that you still replace the shadowed green on the floor

23:51 <clever> yeah, you need to adjust the thresholds, so it deletes all of the screen

23:51 <clever> but the underside of the girrafe also looks a tad green?

23:51 <mrvn> Bad if you get those on screens on the walls. Haven't heard of screens for the floor yet.

23:52 <clever> yeah, then you dont have to deal with it as much

23:52 <mrvn> indeed.

23:53 <mrvn> .oO(green screen was also something Amigas did in hardware.)

23:53 <clever> yep

23:53 <clever> genlock i think it was called?

23:53 <clever> ive heard of even the c64 doing it?

23:54 <mrvn> You had 6 (later 8) bitmaps forming a palette index. Or you could split that into 2 parts and one color of the first lookup is transparent.

23:54 <clever> i recently figured out why they are doing that

23:54 dutch has joined #osdev

23:54 <mrvn> what?

23:55 <clever> why it used bit planed images

23:55 <mrvn> easier to deal with different color depths that way.

23:55 <clever> compresses far better

23:55 <clever> https://i.imgur.com/o2Px3Qq.png this from a YT channel talking about gameboy games and how the encoding works

23:55 <mrvn> that works for image files but rather irrelevant for display hardware.

23:55 <clever> and with all of the variation in shading, that doesnt compress nicely

23:56 <clever> https://i.imgur.com/HppxKEy.png but this is then bit0 and bit1 as seperate images

23:56 <clever> and now it can RLE encode much more easily

23:57 <mrvn> clever: How would you build image in memory if you have 3 bit per pixel?

23:57 <clever> 3 seperate 1bpp images

23:57 <mrvn> Would be ugly to deal with (2,0) having 2 bit in the first byte and 1 bit in the second.

23:58 <mrvn> So yeah, bitmaps are far easier to deal with.

23:59 <clever> do you remember the list of pixel formats the rpi 2d core accepts?

23:59 <clever> https://github.com/librerpi/lk-overlay/blob/master/platform/bcm28xx/hvs/include/platform/bcm28xx/hvs.h#L48-L72

23:59 <bslsk05> github.com: lk-overlay/hvs.h at master · librerpi/lk-overlay · GitHub

23:59 <clever> its this whole enum, and the palette format accepts the entire 1bpp to 8bpp range

23:59 <mrvn> The Amiga hardware had another cool feature. You could set 6 bitmaps and 00xxxx would select from the palette. 01xxxx would take the last pixel and replace the red component with xxxx. 10 for gree and 11 for blue.

23:59 <mrvn> Later it was 8 bitmaps with a 64 colors palette for the same encoding.