#osdev on 2021-07-02 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:01 <klange> Oh, stb_truetype doesn't do instruction processing beyond basic shape extraction... if I can get that my lib might actually be useful outside of just me...

00:01 nyah has quit [Read error: Connection reset by peer]

00:02 <geist> it has a little interpreted bytecode? i guess i knew that but never looked at it

00:03 <geist> is it a stack based thing?

00:04 vdamewood has joined #osdev

00:05 <klange> yeah

00:06 <klange> i think it has some registers, and they are adorably font-specific things

00:09 vinleod has joined #osdev

00:10 vdamewood has quit [Ping timeout: 240 seconds]

00:14 vinleod is now known as vdamewood

00:15 Skyz has quit [Quit: Client closed]

00:16 Skyz has joined #osdev

01:05 <Skyz> Klange: Font is an OS specific format

01:05 <Skyz> So you have to have some way to implement it in the OS

01:09 <klange> Fonts are not an "OS specific format" and I do not have the patience this morning to explain to you just how off-base saying that to me is.

01:10 <Skyz> I'm just looking through the files in graphics right now

01:14 <vdamewood> Is 'Font' the name of a format I'm not familiar with, or is this actually about fonts such as .otf, .ttf, and such?

01:15 <Skyz> .ttf .otf and such

01:16 <vdamewood> Freetype supports both .otf and .ttf. So, if your OS supports libraries, your OS supports .ttf and .otf.

01:16 <kazinsal> I feel like I put someone on ignore and made the right decision because this conversation is missing chunks

01:16 richbridger has joined #osdev

01:17 <klange> vdamewood: somehow you've managed an even worse take than Skyz :P

01:17 <kazinsal> Oh. OH. That explains it

01:17 <vdamewood> klange: And I wasn't even trying that hard.

01:18 aquijoule_ has quit [Read error: Connection reset by peer]

01:18 <vdamewood> klange: I'm actually kind of curious what's bad about my take.

01:21 <vdamewood> Did I miss something?

01:22 <klange> You missed that you were talking to me, king of NIHing :)

01:23 <klange> [hacked my ongoing work into an SDL app because waiting for VMs to boot to iterate was annoying] That's a pretty nice looking 'd', right? https://klange.dev/s/Screenshot%20from%202021-07-02%2010-22-46.png

01:23 <gog> oh

01:24 <kazinsal> FreeType also happens to be designed to cover basically every use case whereas in a hobby OS it makes more sense to cover the use cases that you and your software actually need

01:24 <klange> I want more than just basic glyph shapes (stb_truetype) but not necessarily all of the features of FreeType, and most importantly... I want to write my own.

01:24 <vdamewood> https://xkcd.com/974/

01:24 <bslsk05> xkcd - The General Problem

01:25 <vdamewood> Well, I didn't mean to imply that an osdever should actually use FreeType, just point out how positively trivial it is to support the most common formats.

01:27 <gog> i feel attacked

01:27 <gog> i get nothing done because i need to cover literally every scenario

01:27 <vdamewood> gog: Want a fishy?

01:28 <gog> yes.

01:28 * vdamewood gives gog a fishy.

01:28 * gog chomps

01:34 <klange> https://klange.dev/s/Screenshot%20from%202021-07-02%2010-33-37.png okay I'm reasonably content with this so far...

01:36 <gog> snowman!

01:36 <gog> what's that green line?

01:37 <pony> klange: oh, that's pretty!

01:37 <klange> it's a drawing tool that's loading up basic glyph points from a font to start off, the green line indicates next edge to mouse cursor; right click does a move-to, left click does a line-to, I hacked up curves with fixed subdivisions for now

01:38 <klange> (normally you'd do something smart like tessellate until the midpoint is within an error range)
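
The "tessellate until the midpoint is within an error range" idea can be sketched as a recursive split of a quadratic Bezier curve (the curve type TrueType outlines use). This is a minimal illustration, not klange's code: the function name, the `tolerance` default and the `max_depth` guard are all assumptions made for the example.

```python
def flatten_quadratic(p0, p1, p2, tolerance=0.25, depth=0, max_depth=16):
    """Approximate a quadratic Bezier curve with line segments.

    p0 and p2 are on-curve points, p1 is the off-curve control point.
    Returns the segment end points; p0 itself is not included, so
    consecutive curves can be chained.
    """
    # Point on the curve at t = 0.5.
    mid_curve = ((p0[0] + 2 * p1[0] + p2[0]) / 4.0,
                 (p0[1] + 2 * p1[1] + p2[1]) / 4.0)
    # Midpoint of the straight chord from p0 to p2.
    mid_chord = ((p0[0] + p2[0]) / 2.0, (p0[1] + p2[1]) / 2.0)
    dx = mid_curve[0] - mid_chord[0]
    dy = mid_curve[1] - mid_chord[1]
    # Stop when the chord is close enough to the curve (or we are too deep).
    if depth >= max_depth or dx * dx + dy * dy <= tolerance * tolerance:
        return [p2]
    # de Casteljau split at t = 0.5 into two smaller quadratics.
    left_ctrl = ((p0[0] + p1[0]) / 2.0, (p0[1] + p1[1]) / 2.0)
    right_ctrl = ((p1[0] + p2[0]) / 2.0, (p1[1] + p2[1]) / 2.0)
    return (flatten_quadratic(p0, left_ctrl, mid_curve, tolerance, depth + 1, max_depth)
            + flatten_quadratic(mid_curve, right_ctrl, p2, tolerance, depth + 1, max_depth))
```

Compared with fixed subdivision, flat curves collapse to a single segment while tight curves get more vertices, and a smaller `tolerance` raises the segment count.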

01:39 <klange> so if I click a few times it'll insert more vertices: https://klange.dev/s/Screenshot%20from%202021-07-02%2010-38-42.png

01:39 <gog> ah neat so a crude vector drawing thing

01:39 <klange> it also selects a new random color each time

01:39 <klange> yeah

01:41 ^[ has joined #osdev

01:41 <klange> The antialiasing can still use some work, but I'm happy with it for now - grid fitting / hinting will have a far more visible effect.

01:43 <gog> :)

01:43 <klange> Then I'll need to actually parse metrics, kerning pairs, etc. and we'll have a nice little text rendering system and I can finally throw away the SDF renderer and get some semblance of basic Unicode support back~

01:45 <moon-child> sdf--signed distance fields?

01:45 <moon-child> isn't that a gpu rendering technique?

01:46 isaacwoods has quit [Ping timeout: 268 seconds]

01:48 <kazinsal> iirc the GPU rendered SDF font technique was somewhat pioneered by Valve but you can do it in regular software too

01:48 isaacwoods has joined #osdev

01:48 <klange> It's pretty easy to implement and produces pretty solid results, so I banged one out as my first in-house antialiased text renderer.

01:49 <klange> take any of my random screenshots from the last couple of years and the text there is with the SDF renderer and some baked ASCII-only DejaVus: https://klange.dev/s/Screenshot%20from%202021-06-23%2021-04-23.png

01:51 <klange> But baking is expensive, the final textures are still much larger per-glyph than the original sources, and I don't really care to add all of the features necessary to make it a full general-purpose text system.

01:51 <klange> So it's time to sunset it and get this TrueType implementation in there instead.

01:52 <clever> klange: ignoring the scaling requirements, how would you compare one big texture like https://gallery.earthtools.ca/v3d2/arial.png ?

01:53 <klange> SDF is a significant improvement over a baked bitmap texture.

01:53 <klange> You can use a much smaller texture with distance fields to get the same quality _and_ with SDF you get viable scaling.

01:53 <clever> yeah, vector vs bitmap

01:54 <clever> in this image, i think its 100% solid white

01:54 <clever> the only information, is in the alpha layer

01:54 <clever> which is only 1 bpp, essentially (havent confirmed the actual encoding)

01:55 <klange> let me get one of my SDF bakes into a visible format - they're PNGs but github is confused by my choice of file extension.

01:56 <klange> https://klange.dev/s/sdf_thin.png this is a horribly wasteful bake of DejaVu. It's at a slightly bigger size than that Arial bitmap, but this is the sole source of all of the different sizes of text in my last screenshot [except the "Jun 23" that's the bold font]

01:56 <klange> With SDF we can scale _down_ and still get viable shapes and do nice anti-aliasing.
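
A rough sketch of why a distance field scales well: each texel stores a distance to the glyph edge rather than a coverage value, so the edge can be re-derived at whatever size is being rendered. The code below is a generic illustration, not klange's renderer. It assumes distances are stored in texel units and are positive inside the glyph; `sample_bilinear` and `render_scaled` are hypothetical names.

```python
def sample_bilinear(field, x, y):
    """Bilinearly sample a 2D list of distances at fractional texel (x, y)."""
    max_x = len(field[0]) - 1
    max_y = len(field) - 1
    x = min(max(x, 0.0), max_x)
    y = min(max(y, 0.0), max_y)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, max_x), min(y0 + 1, max_y)
    fx, fy = x - x0, y - y0
    top = field[y0][x0] * (1 - fx) + field[y0][x1] * fx
    bottom = field[y1][x0] * (1 - fx) + field[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


def render_scaled(field, scale):
    """Render at `scale` output pixels per texel; returns rows of alpha in [0, 1]."""
    out_h = int(len(field) * scale)
    out_w = int(len(field[0]) * scale)
    rows = []
    for oy in range(out_h):
        row = []
        for ox in range(out_w):
            distance = sample_bilinear(field, ox / scale, oy / scale)  # in texels
            # Convert to output pixels, then ramp alpha across one pixel at the edge.
            row.append(min(1.0, max(0.0, distance * scale + 0.5)))
        rows.append(row)
    return rows
```

The same small field yields a one-pixel-wide antialiased edge at every scale, which is what makes a single bake usable for several text sizes.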

01:56 <clever> for context, i found that arial.png in wowmapviewer

01:57 <clever> when your rendering multi-gigabit 3d map files, the size of a font matters much less

01:57 <clever> gigabyte*

01:58 <clever> https://gallery.earthtools.ca/v3d2/out.png

01:58 <clever> this is a final render coming out of it

01:59 <klange> A good implementation would do rect-packing, a very good implementation would do multi-dimension distance vectors which make for super crisp hard edges, and if I were to invest time in this the next steps would have been to add glyph mappings for Unicode, x-advance and kerning tables, etc.

02:01 <klange> (The x-advance is currently a separate config file with each letter and its width for the base size written out, by hand)

02:01 <klange> (With some obvious mistakes, especially in the bold font)

02:02 <clever> https://github.com/cleverca22/wowmapviewer/blob/master/src/font.cpp#L100

02:02 <bslsk05> github.com: wowmapviewer/font.cpp at master · cleverca22/wowmapviewer · GitHub

02:02 <clever> looks like wowmapviewer's x-advance, was simply width + 2

02:02 <klange> That works fine if you have the width, which you do if you're rect-packing :)

02:03 <klange> My thing only did grid layout; easy lookup, but no width information stored in it.

02:03 <clever> there is a text file, giving the xywh of each glyph in the .png

02:03 <clever> but with a vector file, its more fuzzy, and you can allow overlap

02:04 <clever> a vector file also allows you to not store gaps

02:04 <clever> with what wowmapviewer is doing, if you want more padding on a glyph, you need to include dead space in the texture and width

02:16 iorem has quit [Quit: Ping timeout (120 seconds)]

02:26 isaacwoods has quit [Quit: WeeChat 3.2]

02:38 Skyz has quit [Quit: Client closed]

02:44 freakazoid333 has joined #osdev

03:02 sts-q has joined #osdev

03:11 paulusASol has quit [Read error: Connection reset by peer]

03:11 medvid has quit [Remote host closed the connection]

03:12 paulusASol has joined #osdev

03:14 hgoel[m] has quit [Ping timeout: 250 seconds]

03:21 medvid has joined #osdev

03:21 hgoel[m] has joined #osdev

03:33 ElectronApps has joined #osdev

03:35 paulusASol has quit [Quit: node-irc says goodbye]

03:36 tenshi has quit [Quit: WeeChat 3.2]

03:39 mctpyt has joined #osdev

03:40 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

03:40 medvid has quit [Quit: node-irc says goodbye]

03:41 ElectronApps has quit [Ping timeout: 272 seconds]

03:41 ElectronApps has joined #osdev

03:42 <doug16k> clever, I used an alpha-only texture format for my bitmap font renderer texture atlas. 8 bits per pixel

03:42 <doug16k> https://github.com/andryblack/fontbuilder

03:42 <bslsk05> andryblack/fontbuilder - Bitmap font generator (89 forks/393 stargazers/MIT)

03:43 <doug16k> then just instanced rendering to draw all glyphs on the screen in one call

03:45 <doug16k> makes text take a negligible amount of gpu time

03:45 iorem has joined #osdev

03:47 <clever> thinking about the rendering cost some....

03:47 hgoel[m] has quit [Quit: node-irc says goodbye]

03:48 <clever> lets say we are working with a fixed-width font, like a VGA console, 8 pixels wide, 16 pixels tall

03:48 <doug16k> you can boil it down to just giving it an actual array of characters and positions, then the vertex shader looks up the texcoord and passes them down. fragment shader does trivial texture fetch and multiplies that value with the color

03:49 <clever> ahh, you're going one step further than ive done

03:49 <doug16k> or does color*pixel+existing_color*(1-pixel)

03:49 <doug16k> then the alphas are really just pixel coverage values
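
The `color*pixel+existing_color*(1-pixel)` blend, written out for 8-bit channels with the glyph texel used as a coverage value. This is a small sketch. The rounding term and the function names are choices made for the example.

```python
def blend_channel(text_value, background_value, coverage):
    """Blend one 8-bit channel; coverage is the 8-bit glyph texel (0..255)."""
    # color * pixel + existing_color * (1 - pixel), in integers with rounding.
    return (text_value * coverage + background_value * (255 - coverage) + 127) // 255


def blend_pixel(text_rgb, background_rgb, coverage):
    """Blend an (r, g, b) text colour over an (r, g, b) background pixel."""
    return tuple(blend_channel(t, b, coverage)
                 for t, b in zip(text_rgb, background_rgb))
```

Coverage 0 leaves the background untouched, 255 replaces it with the text colour, and values in between give the antialiased edge.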

03:49 <clever> if i was to do it without the gpu, then each glyph on screen, involves a 1 byte read (the char itself), then a 128 pixel copy

03:50 <moon-child> doug16k: if you're on gpu, you have unlimited memory b/w, so does squishing to 8bpp make sense?

03:50 <moon-child> I guess you can lut so it's not a big deal

03:50 <clever> lets assume i'm using RGB565, so 16bit color

03:50 <doug16k> it tells you what proportion, from 0.0 to 1.0, of a mix

03:50 <doug16k> you broadcast the value you got. why make it waste memory bandwidth?

03:50 <clever> that means each glyph needs 256 bytes of writes!!
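
The arithmetic so far, written out. The 8x16 glyph and RGB565 figures come from the discussion, while the 80x25 console size in the usage note is only an example.

```python
GLYPH_WIDTH = 8        # pixels
GLYPH_HEIGHT = 16      # pixels
BYTES_PER_PIXEL = 2    # RGB565 is 16 bits per pixel

PIXELS_PER_GLYPH = GLYPH_WIDTH * GLYPH_HEIGHT            # 128
BYTES_PER_GLYPH = PIXELS_PER_GLYPH * BYTES_PER_PIXEL     # 256


def full_redraw_bytes(columns, rows):
    """Framebuffer bytes written when every character cell is redrawn."""
    return columns * rows * BYTES_PER_GLYPH
```

For an 80x25 console, `full_redraw_bytes(80, 25)` is 512000 bytes per full redraw, against 2000 bytes of character data read.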

03:51 <clever> yeah, i can see how things scale rapidly, and need gpu accel

03:51 <doug16k> do you mean software rendering?

03:51 <clever> did the math on software rendering first, so i have something to compare to

03:51 <doug16k> to go fast with software text rendering, the key is to go as far across each scanline as possible and make the stores contiguous bursts

03:52 <doug16k> let the reads be more scattered so the stores can be more contiguous

03:52 <clever> yeah, because thats not just 256 bytes worth of writes, thats 16 separate bursts of 16 bytes each

03:52 <doug16k> I mean go across glyphs as much as you can

03:52 <clever> i could improve that burst by doing it one scanline at a time, yeah

03:53 <doug16k> Hello would render scanline 0 of H e l l o then scanline 1 of H e l l o, etc

03:53 <clever> then i have to read each glyph from the text buffer 16 times, but the writes are better

03:53 <clever> and write combining will like that

03:53 <doug16k> yeah, the idea is to make it do bursts
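
A sketch of the scanline-major order described here: the outer loop walks scanlines and the inner loop walks across every glyph of the text, so each output scanline is produced as one contiguous run. The font layout (one integer per glyph row, most significant bit leftmost) and the function name are assumptions for the example.

```python
def render_text_row(text, font, fg, bg, glyph_width=8, glyph_height=16):
    """Render one row of text and return glyph_height scanlines of pixels.

    font maps a character to glyph_height integers; the most significant
    of the glyph_width bits is the leftmost pixel.
    """
    scanlines = []
    for y in range(glyph_height):            # outer loop: one scanline at a time
        line = []
        for ch in text:                      # inner loop: across all glyphs
            bits = font[ch][y]               # scattered read from the font
            for x in range(glyph_width):
                mask = 1 << (glyph_width - 1 - x)
                line.append(fg if bits & mask else bg)
        scanlines.append(line)               # one contiguous store run
    return scanlines
```

The cost is that each character is looked up `glyph_height` times instead of once, which trades extra reads for longer contiguous writes.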

03:53 <clever> now, if we switch gears, to using the 3d core of the rpi (since i know its internals well)

03:54 <clever> https://github.com/cleverca22/wowmapviewer/blob/master/src/font.cpp#L57-L86

03:54 <bslsk05> github.com: wowmapviewer/font.cpp at master · cleverca22/wowmapviewer · GitHub

03:54 <clever> lets assume we are using triangles, so we have to draw 2 tri's to cover an 8x16 glyph

03:54 <doug16k> ya then do what I said

03:54 <doug16k> use instanced rendering

03:54 <doug16k> earlier I mean

03:55 <clever> your technique is even more advanced than what i'm mathing out now

03:55 <clever> depending on how well you implement it, you need either 4 or 6 vertex points

03:55 <doug16k> then it is a very simple vertex shader to figure out the atlas coords

03:55 <doug16k> it passes down hello-world level texcoord+vertex

03:55 <clever> 6 is the dumb way, 4 reuses 2 vertex's between the tris

03:56 <doug16k> yeah but the idea of instancing it is, you are actually just drawing a bunch of instances of two triangles

03:56 <doug16k> and the vertex shader can tell what instance this is

03:56 <clever> id like to finish the math first, and see how costly a lack of instancing is

03:56 <doug16k> so it knows how to lookup the texture atlas coords

03:56 <doug16k> it gets the glyph

03:57 <doug16k> if you are really close to the hardware like that, just immediate mode doing simple pair of triangles is probably good enough

03:57 <doug16k> the instancing is to save API call overheads

03:57 <clever> https://github.com/cleverca22/gl/blob/master/core.c#L296-L303

03:57 <bslsk05> github.com: gl/core.c at master · cleverca22/gl · GitHub

03:58 <clever> setting the UV of a vertex involves 2 floats, but this code is cheating and storing them in a global var for later use

03:58 <clever> https://github.com/cleverca22/gl/blob/master/core.c#L231-L244

03:58 <bslsk05> github.com: gl/core.c at master · cleverca22/gl · GitHub

03:58 <clever> this then inputs the XY of the vertex, and copies the UV from the global var

03:59 <doug16k> yes, opengl is a bunch of state

03:59 <doug16k> it leaves most things unsaid, assumed you selected it and that state is set already

03:59 <clever> https://github.com/cleverca22/gl/blob/master/core.c#L14-L17

03:59 <bslsk05> github.com: gl/core.c at master · cleverca22/gl · GitHub

03:59 <clever> and ultimately, the vertex data is just a flat array of this struct

04:00 <clever> 16 bits of x, 16 bits of y, 32bits of z/w/u/v/r/g/b
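
Taking that description literally (two 16-bit integers followed by seven 32-bit values), the record can be packed with Python's `struct` module to check its size. The field order, signedness, little-endian byte order and absence of padding are assumptions for this sketch. The real struct in `core.c` may differ.

```python
import struct

# x, y as signed 16-bit integers, then z, w, u, v, r, g, b as 32-bit floats.
# "<" selects little-endian with no padding (assumed, not taken from core.c).
VERTEX = struct.Struct("<hh7f")


def pack_vertex(x, y, z, w, u, v, r, g, b):
    """Pack one vertex record into bytes."""
    return VERTEX.pack(x, y, z, w, u, v, r, g, b)


def pack_quad(vertices):
    """Pack a flat array of vertex records, as the hardware would read them."""
    return b"".join(pack_vertex(*vertex) for vertex in vertices)
```

Under that reading `VERTEX.size` is 32 bytes, so four vertices for one glyph quad occupy 128 bytes.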

04:00 <doug16k> it can do instancing though, right?

04:00 <doug16k> you can set a divisor?

04:01 <doug16k> if you use instancing, it is insanely fast

04:01 <clever> so 8 bytes per vertex, times 4 (best case), gives 32 bytes per glyph

04:01 <doug16k> it just converts everything into a texture lookup that it can do trivially easy

04:02 <doug16k> even getting the glyph and positions are using the same mechanism as texture lookup

04:02 <clever> so if i dont do instancing, and i rebuild the vertex data on every frame, its ~1/8th the writes, not counting some other misc overheads

04:02 <doug16k> with instancing you don't keep saying the vertices

04:02 <clever> vertex shading can probably do instancing too, if i knew the GPU better

04:03 <clever> there are also tricks i could do, if i was rendering a vga text console

04:03 <doug16k> ya go ahead and immediate mode do it for sure

04:03 <clever> i can assume the tri's are stationary

04:03 <doug16k> it'll be close enough

04:03 <clever> and only update the UV in each vertex

04:03 <doug16k> instancing gives the implementation the power to do it extremely well. it's shocking how fast my nvidia drivers do instanced text render

04:04 <clever> if i was to cheat in such a matter, updating the text would just involve 8 bytes worth of writes per glyph, with a bit of scatter

04:04 <doug16k> it is using the hottest case that they optimize heavily

04:04 <clever> updating the color would involve 12 bytes of writes

04:05 <clever> and this isnt even taking into account dirty status's

04:05 <doug16k> yeah which still amounts to very little compared to how much video memory store bandwidth it generates

04:05 <clever> i'm mostly thinking about the main cpu core's store bandwidth

04:06 <doug16k> can you make it a strip or fan?

04:06 <doug16k> then it would be 4 glVertex for 2 tris

04:06 <clever> its index based

04:07 <clever> you create an array of raw vertex data, then you create an array containing sets of 3 index's into the 1st one

04:07 <doug16k> being index doesn't mean it isn't fan or strip

04:07 <clever> how does fan and strip work?

04:08 <doug16k> fan keeps repeating the 0th vertex in every triangle. the other two points are the last two vertices

04:08 <doug16k> a strip uses the last 2 vertices plus the new one for each new triangle
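
Those two rules can be written as small functions that expand an index list into explicit triangles. The names are hypothetical. The swap on odd strip triangles follows the OpenGL convention for keeping the winding order consistent.

```python
def triangles_from_fan(indices):
    """Every triangle reuses the first vertex plus the two most recent ones."""
    return [(indices[0], indices[i], indices[i + 1])
            for i in range(1, len(indices) - 1)]


def triangles_from_strip(indices):
    """Every new vertex forms a triangle with the previous two."""
    triangles = []
    for i in range(len(indices) - 2):
        a, b, c = indices[i], indices[i + 1], indices[i + 2]
        # Swap the first two on odd triangles so all faces wind the same way.
        triangles.append((a, b, c) if i % 2 == 0 else (b, a, c))
    return triangles
```

Either way, N vertices produce N - 2 triangles, so a quad needs 4 indices instead of the 6 that a plain triangle list uses.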

04:08 <clever> oh, i just noticed that in the docs...

04:08 <clever> > primitive mode: 0,1,2,3,4,5,6 = points, lines, line_loop, line_strip, triangles, triangle_strip, https://github.com/cleverca22/wowmapviewer/blob/master/src/font.cpp#L57-L86

04:08 <clever> ack, mis-paste

04:08 <clever> triangle_fan was the last one

04:09 <clever> so the hardware does support strip and fan modes, but i'm not using it yet

04:09 <clever> and i was thinking of simulating that, by manually reusing vertex data in the index list

04:10 <clever> in plain triangles mode, i would tell it to make 2 tris, with vertexes 0,1,2 and 1,2,3

04:10 <doug16k> ya or with fan you would go 0, 1, 2, 3 and get both tris

04:11 <clever> so i have to feed it 4 vertex structs (32 bytes each) plus 6 indexes (8bit or 16bit, depending on vertex array size)

04:11 <doug16k> strip would also work in case that simple

04:11 <doug16k> I think

04:11 <doug16k> fan definitely

04:11 <clever> i can picture why its called fan, if you are mapping out pizza slices of a circle

04:12 <doug16k> ya exactly

04:12 <clever> reusing the center vertex

04:12 <doug16k> same with clipped polygons. if you just make sure you emit all the clipped vertices in the same order around the edges, you can just toss N verts at a fan call and it does it right

04:12 <doug16k> not ideal but works

04:12 <clever> so if i change this to triangle_fan mode, then i can implement glVertex2squad better, and switch the font code over to using a quad

04:13 <doug16k> yeah

04:13 <doug16k> there is also the possibility of a "restart index"

04:13 <clever> but, all tris in the shader, must be in the same mode

04:14 <doug16k> also, you can make a degenerate triangle with two points the same to start a new strip or fan

04:14 <clever> rendering with a different mode, requires a second vertex index list, and another shader record

04:14 <doug16k> in one big index list

04:14 <clever> how would the GPU know how many slices in the fan?

04:14 <doug16k> verts - 1

04:14 <doug16k> by definition

04:14 <doug16k> er - 2

04:15 <clever> i dont see how that can work, with the v3d

04:15 <clever> there is no tri count

04:15 srjek|home has quit [Ping timeout: 256 seconds]

04:15 <doug16k> so?

04:15 <clever> you just give it an array of vertex indexes, and when in triangles mode, it knows that its sets of 3

04:15 <doug16k> why does it care?

04:16 <doug16k> fans and strips only look at the last one or two verts

04:16 <clever> but if i want to render quads, i need sets of 4, each fanning out from a new center point

04:16 <doug16k> if the verts just keep coming it just keeps using the last and second_last verts it is keeping

04:16 <clever> but then how do i reset it, to a new center point?

04:16 <doug16k> and each vert is pushed down that little sliding window of recent verts

04:16 <doug16k> you say the same vert twice

04:17 <doug16k> in a strip

04:17 <doug16k> that makes it have 0 area

04:17 <clever> ah, so it would be like 0/1/2/3/3 then 4/5/6/7/7 ?

04:17 <clever> to render 2 quads

04:18 <doug16k> just look at how the strip indexes into the history of verts, and emit the appropriate pair of identical verts to make a 0 area degenerate triangle that has a 3rd point in the new place

04:18 <doug16k> OR

04:18 <doug16k> the hardware might support a "restart index" you can set

04:19 <doug16k> then when you say that magic number in an index array, it knows to start a new strip/fan
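
A sketch of the restart-index idea, emulated in software: the index list is cut wherever the marker value appears and each run is expanded as its own fan. The marker value `0xFFFF` and the function names are assumptions for the example. Whether the v3d offers such a feature is not established in this discussion.

```python
RESTART = 0xFFFF  # hypothetical marker value for this sketch


def split_on_restart(indices, restart=RESTART):
    """Cut one index list into separate runs at each restart marker."""
    runs, current = [], []
    for index in indices:
        if index == restart:
            if current:
                runs.append(current)
            current = []
        else:
            current.append(index)
    if current:
        runs.append(current)
    return runs


def fans_to_triangles(indices, restart=RESTART):
    """Expand every run as a triangle fan around its own first vertex."""
    triangles = []
    for run in split_on_restart(indices, restart):
        for i in range(1, len(run) - 1):
            triangles.append((run[0], run[i], run[i + 1]))
    return triangles
```

Two quads then fit in one list as `[0, 1, 2, 3, RESTART, 4, 5, 6, 7]`, which is nine indices for four triangles.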

04:19 <clever> either way, thats 5 indexes per quad

04:19 <clever> vs 6 indexes, with 2 being repeated

04:21 <clever> i can see how fan would help more, the more slices you have

04:21 <clever> with 2, its barely a benefit

04:21 <doug16k> the true benefit is how much more it can do in one call

04:21 <doug16k> it's not primarily about saving bandwidth

04:22 <doug16k> it throws that in as a bonus

04:22 <doug16k> you want to deluge a gpu with work it can overlap

04:22 <doug16k> one thing at a time is hideous

04:22 <doug16k> you are so close to the hardware though, it should be fine to just do it manually

04:23 <doug16k> most of the cool opengl speed stuff is about saving api call overheads

04:23 <doug16k> throwing blocks of work at it

04:24 <clever> i think VBO's were about exposing this vertex array directly to the app?

04:24 <doug16k> vbo is about remembering how you configured it to look up stuff in arrays

04:24 <doug16k> how you did binding between arrays and shader variables

04:24 <clever> ah

04:26 <clever> i think ive also heard instancing being mentioned, in doing things like copy/pasting a 3d object like a tree

04:27 <clever> so you just give the xyz and rotation/scaling params for a tree, and then the ?? shader will paste a duplicate copy of the tree, with those params applied

04:27 <doug16k> sooner or later, you run into a case where, you have many copies of the same thing, but just a couple of things differ

04:27 <doug16k> maybe different color, position, orientation

04:27 <doug16k> everything else is identical

04:27 <clever> yeah, thats why my vertex has an RGB on it

04:28 <clever> the pixel shader will tint the texture to that color

04:28 <doug16k> so with instancing, you can have an array of those things that differ. an array of colors, an array of positions, an array of orientations. then on those you also set a divisor. typically 1 so each 1 instance advances to next color/position/orientation

04:29 <doug16k> then you draw it once and it renders one for each instance

04:29 <clever> and is that all handled by the vertex shader?

04:29 <doug16k> shader can be oblivious

04:29 <doug16k> you setup the array binding for the color already

04:29 <doug16k> it is bound to some vertex shader input, regardless

04:30 <doug16k> you just happened to put an instance divisor on it

04:30 <doug16k> then it means, do the whole draw, advance the instance things to the next one appropriately, respecting the divisor, and render it again

04:30 <doug16k> and again, until the instance count

04:31 <doug16k> in one draw!

04:31 <doug16k> what would have been uniform becomes an attribute

04:32 <doug16k> but that attribute behaves like a uniform - but changes with the instance id

04:32 <doug16k> it does the lookup into the array that has the divisor

04:32 <doug16k> array[instance index divided by divisor]
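
The `array[instance index divided by divisor]` lookup can be emulated on the CPU to show what one instanced draw expands to. This is a sketch with hypothetical names, not a real graphics API. A GPU performs the equivalent fetch while feeding the vertex shader.

```python
def fetch_instanced(attribute_array, instance_index, divisor):
    """Per-instance attribute fetch: advance once every `divisor` instances."""
    return attribute_array[instance_index // divisor]


def draw_instanced(base_vertices, offsets, colors, instance_count,
                   offset_divisor=1, color_divisor=1):
    """Emulate one instanced draw; returns (x, y, color) per emitted vertex."""
    out = []
    for instance in range(instance_count):
        ox, oy = fetch_instanced(offsets, instance, offset_divisor)
        color = fetch_instanced(colors, instance, color_divisor)
        for vx, vy in base_vertices:          # same base mesh every time
            out.append((vx + ox, vy + oy, color))
    return out
```

For text, `base_vertices` would be the four corners of one glyph quad and `offsets` the per-character positions. With a divisor of 2 on `colors`, each colour is shared by two consecutive instances.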

04:34 <clever> doug16k: https://i.imgur.com/6rlXGm6.png

04:34 <doug16k> think of it like a uniform that magically sets itself across instances in one big draw call

04:35 <doug16k> yes that is exactly what you want

04:35 <clever> that is what line 14-17 was having to generate

04:35 <clever> and i suspect the vertex shader has to create these?

04:35 <doug16k> even if you were software rendering, there is a place in the code just like that link

04:35 <doug16k> no

04:35 <doug16k> this is the output of your clipping

04:36 <doug16k> this is the edge data

04:36 <doug16k> x y are screenspace coords, 16 bit integers?

04:36 <doug16k> z is 32 bit float

04:37 <clever> yeah

04:37 <doug16k> 1/wc is the reciprocal of the 4d clip-space coordinates

04:37 <doug16k> reciprocal of w

04:37 <doug16k> you interpolate down the edge and have one of those per scanline

04:37 <doug16k> then another sequence of those for right edge

04:38 <doug16k> this isn't how you deal with it in an opengl api

04:38 <doug16k> this is you scan converting the triangles and just using fragment shader

04:38 <doug16k> this is you doing the rasterization step by hand and just letting it do shading in gpu

04:38 <clever> there is a clipping step that still happens

04:39 <clever> the v3d has a "binning" phase, where it will figure out which tri's are in a given tile

04:39 <doug16k> hideous one that only a fool would use

04:39 <clever> the v3d can only render one tile at a time

04:39 <doug16k> you must clip your homogeneous coordinates

04:39 <doug16k> it expects everything to be in the -1<x<1 range

04:39 <doug16k> and z in 0<z<1

04:40 <doug16k> you need to project them

04:40 <doug16k> there's no other way

04:40 <clever> the tiles are all 64x64 pixels

04:40 <doug16k> see that 1/wc there?

04:40 <doug16k> how you going to 1/0 ?

04:41 <doug16k> answer: you don't have to worry, clipping will make it always in range and lowest z ever is 1

04:41 <clever> nextVertex->w = 1;

04:41 <doug16k> trust me, you must clip this: https://i.imgur.com/6rlXGm6.png

04:41 <clever> i never figured out what that did, and was just hard-coding it

04:42 <doug16k> if you make the input vertex be ,1 that means make the position {x/1, y/1, z/1}

04:42 <doug16k> now it should be obvious why 1

04:42 <doug16k> if you said ,2 and divided x,y,z by 2, nothing changed

04:42 <doug16k> er multiplied x,y,z by 2
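
The point about `,1` and `,2` in a few lines: a homogeneous vertex (x, y, z, w) stands for the 3D position (x/w, y/w, z/w), so scaling all four components by the same factor changes nothing. The function name is made up for the example.

```python
def perspective_divide(x, y, z, w):
    """Turn a homogeneous (x, y, z, w) vertex into the 3D point it represents."""
    return (x / w, y / w, z / w)
```

`perspective_divide(1, 2, 3, 1)` and `perspective_divide(2, 4, 6, 2)` both give `(1.0, 2.0, 3.0)`, which is why hard-coding `w = 1` is a safe default for plain positions.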

04:43 <doug16k> I wish I knew that 20 years ago

04:43 <doug16k> I was in the "wtf w?" camp for quite a while :D

04:43 <clever> ah, so this would scale the xyz coords

04:44 <doug16k> it is for projection

04:44 <clever> ooo, does this handle things in the distance appearing to be a diff size?

04:44 <doug16k> the projection matrix can use that 4th column to make it divide

04:44 <doug16k> exactly

04:44 <doug16k> it ends up with a number that seems like it is 1/z but offset weirdly to account for the z input range being 1 to whatever and output z range mapping to 0 to 1

04:45 <clever> how does rotation of the camera work then?

04:45 <doug16k> think of the three rows as three arrows

04:45 <clever> if i had a proper vertex shader, and the camera was moving in 3d space, should the XYZ's change, or does the vertex shader compute that for me?

04:45 <doug16k> they tell you "which way" each axis goes

04:45 <doug16k> for each dimention

04:45 <doug16k> s

04:46 <doug16k> so you take the input and map it onto those directions by dotting the distance along each dimension with that direction in the output

04:46 <doug16k> the matrix is actually the inverse of that though. to look right 45 degrees you actually rotate the whole world left 45 degrees

04:46 <doug16k> camera is always at 0,0 looking down z axis

04:46 <clever> my rough guess, on how this all works

04:47 <clever> is that the vertex shader will first translate every vertex, by adding a constant to the x/y/z coords (because of the camera moving)

04:47 <clever> then it will do some funky math to rotate everything around that center point

04:47 <doug16k> oh you mean how do you do it

04:47 <clever> then it will compute the W, so the gpu will scale distance objects

04:48 <clever> W being the distance to the object

04:48 <doug16k> you set one or two uniform matrices

04:48 <doug16k> then in the vertex shader, you just multiply the incoming vertex by the matrix

04:48 <doug16k> write the output to the position and set any varyings

04:49 <clever> yeah, ive heard about how matrix mult can do both the rotation and translation in one step

04:49 <clever> i still have no clue how

04:49 <doug16k> and projection

04:49 <clever> so if its doing projection, do i even need W ?

04:49 <doug16k> you can do any number of projections, scales, translations, shears, and rotations in one
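
A sketch of how one 4x4 matrix carries both translation and rotation, and of the "rotate the whole world the other way" view matrix. It uses column vectors (matrix times vertex), which is a convention chosen for this example. Writing vertex times matrix is the same idea with everything transposed. All function names are hypothetical.

```python
import math


def mat_mul(a, b):
    """4x4 matrix product; the combined matrix applies b first, then a."""
    return [[sum(a[r][k] * b[k][c] for k in range(4)) for c in range(4)]
            for r in range(4)]


def transform(m, v):
    """Multiply a 4x4 matrix by a column vector (x, y, z, w)."""
    return [sum(m[r][k] * v[k] for k in range(4)) for r in range(4)]


def translation(tx, ty, tz):
    # The offsets live in the 4th column, so they are scaled by the vertex's w.
    return [[1, 0, 0, tx], [0, 1, 0, ty], [0, 0, 1, tz], [0, 0, 0, 1]]


def rotation_y(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, 0, s, 0], [0, 1, 0, 0], [-s, 0, c, 0], [0, 0, 0, 1]]


def view_matrix(camera_pos, camera_yaw):
    """Inverse of the camera's own transform: undo its position, then its yaw."""
    undo_move = translation(-camera_pos[0], -camera_pos[1], -camera_pos[2])
    return mat_mul(rotation_y(-camera_yaw), undo_move)
```

With the camera at (10, 0, 5) and yawed a quarter turn, a world point three units ahead of it at (13, 0, 5) lands at roughly (0, 0, 3) in view space: straight down the z axis from the origin.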

04:49 <doug16k> you need w, you want w. you love w

04:49 <doug16k> without w, don't waste your time

04:50 <doug16k> it's genius

04:50 <doug16k> the clipping is easy and fast too

04:50 <doug16k> and the clipping doesn't vary - clipping is the same no matter how the engine does stuff

04:51 <doug16k> wipes all divide errors off the map

04:51 <clever> what about the gpu just ignoring tri's behind a tri?

04:51 <clever> how does that clipping happen? and how does it know if a tri is transparent or not

04:51 <doug16k> you are skipping way past what I mean

04:52 <doug16k> what if you draw a triangle that is behind the camera

04:52 <doug16k> did you think it would magically give you sensible coordinates?

04:52 <clever> yeah, those should be dropped hard

04:52 <doug16k> it will happily give you total crap

04:52 <clever> yeah

04:52 <doug16k> no

04:52 <clever> something will have to filter them out

04:52 <doug16k> if you think you can cheat out of w use, you are wasting your time

04:52 <clever> i think i see what you mean now

04:53 <doug16k> if you want to do 2d stuff, then make w 1 always, and it works

04:53 <doug16k> you can set up the projection matrix so w ends up 1 no matter what

04:53 <doug16k> that is what happens in isometric projection

04:53 <doug16k> it has no idea how far away anything is

04:54 <doug16k> you can keep the z around and see

04:54 <doug16k> but I mean, the z gets incorporated into the x y

04:55 <doug16k> and you have 1/w = 1 always, the z has no effect unless it is being used to do z fill/test

04:57 <clever> need to take a break now, but i need to re-implement this under LK some time soon

04:57 <clever> LK can render a 2d buffer to the screen, so having tris would be some insane performance boosts

04:59 <clever> doug16k: how insane would it be, for a 128kb binary, to boot an rpi to a spinning teapot, without any blobs? lol

05:01 <doug16k> this will make a projection matrix for you: https://github.com/doug65536/qemu-rom/blob/master/vec.h#L715

05:01 <bslsk05> github.com: qemu-rom/vec.h at master · doug65536/qemu-rom · GitHub

05:02 <doug16k> just multiply a few rotations and translations by that and use that in vertex shader

05:02 <doug16k> when I said you must clip, I meant that thing you linked you know

05:03 <clever> first step, is just re-creating this 2d render, without any blobs helping out

05:03 <clever> then i need to figure out vertex shaders

05:03 <doug16k> of course if you just drawindexed then part of pipeline is doing that clipping I said

05:03 <clever> then i can actually implement that

05:04 <doug16k> for that it is a one liner

05:05 <doug16k> two liner if texcoord

05:05 <doug16k> position = vertex * mvpMatrix

05:05 <doug16k> varyingTexcoordIMadeUp = input_texcoord

05:06 <doug16k> at hardware level, you might have to put in derivatives

05:06 <doug16k> for interpolation

05:08 <doug16k> the beauty of it is, you interpolate the 1/w, then per-pixel it is a multiply to do projection divide

05:08 <doug16k> has the right weird non-linearity to look right

05:09 <doug16k> pixels expand way out close up, and squish way down far away

05:09 <doug16k> but the more far away, the less smaller it seems to make it appear

05:09 <doug16k> that nonlinearity
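
The non-linearity from interpolating 1/w can be shown with a single attribute along one edge. This is a generic perspective-correct interpolation sketch. The function and parameter names are made up for the example.

```python
def interpolate_perspective(u0, w0, u1, w1, t):
    """Attribute value at screen-space fraction t between two projected vertices.

    u0, u1 are the attribute values and w0, w1 the clip-space w of the ends.
    """
    inv_w = (1 - t) / w0 + t / w1                 # 1/w is linear on screen
    u_over_w = (1 - t) * u0 / w0 + t * u1 / w1    # so is u/w
    return u_over_w / inv_w                       # recover u for this pixel
```

With the near end at w = 1 and the far end at w = 3, the screen midpoint maps to u = 0.25 rather than 0.5: the near part of the surface takes up more of the screen than the far part.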

05:10 <doug16k> if flying away from something at a constant rate

05:10 <doug16k> or the other way, as you get closer, the scale of it gets exponential

05:11 <doug16k> clipping prevents numeric explosion or weird flipped projection of point behind you

05:11 <doug16k> faces behind the camera that you didn't clip will render as their backface

05:11 <doug16k> because you are dividing by negative w
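
A small demonstration of the negative-w problem, assuming the simplest possible projection where w is the distance along the view axis. The names and the `near=1.0` default are assumptions for the example. Real clipping splits triangles against the near plane rather than testing single vertices.

```python
def project(x, y, w):
    """Perspective divide to 2D; raises ZeroDivisionError when w is 0."""
    return (x / w, y / w)


def in_front_of_near_plane(w, near=1.0):
    """Only vertices with w >= near are safe to hand to project()."""
    return w >= near
```

`project(1, 1, 2)` gives `(0.5, 0.5)`, while the same point behind the camera, `project(1, 1, -2)`, gives `(-0.5, -0.5)`: it still lands on screen, mirrored through the centre, unless it is clipped first.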

05:12 <clever> there is also an option in the v3d, on which side of a face can render

05:12 <doug16k> I mean if you used https://i.imgur.com/6rlXGm6.png you must prevent negative w

05:13 <doug16k> with glvertex, sure, throw whatever at it, it does that clipping part implicitly

05:13 <doug16k> I suppose you could emit those in the vertex shader assembly

05:14 <doug16k> you scan convert in the vertex shader too? that doesn't sound right

05:14 <clever> i still need to figure out how vertex shaders work on v3d

05:14 <doug16k> could be that it keeps giving you an edge

05:14 <clever> thats one step i never got thru

05:16 <doug16k> it's a shame that you can't copy a boot image into a pi4 across usb

05:16 <doug16k> makes it borderline useless. hanging on by a thread

05:16 <clever> what do you mean?

05:17 <clever> there are several ways to boot over usb

05:17 <doug16k> I can't just update the bootloader over usb

05:17 <clever> you can, but there are 2 things blocking it

05:17 <geist> happy belated canada day

05:17 <clever> doug16k: do you want the pi4 usb to be in host or device mode?

05:18 <doug16k> geist, good day off :D

05:18 <geist> toss off, eh?

05:18 <doug16k> would need to be device to appear as serial or block storage

05:19 <clever> doug16k: 3 options then, 1: erase the spi first, 2: adjust the BOOT_ORDER to co-operate, 3: short a pin out to force it into device mode

05:19 <clever> doug16k: once those are done, it shows up as a vendor usb device, and you must use rpiboot to push firmware over

05:20 <doug16k> firmware?

05:21 <clever> doug16k: if doing 1 or 3, you can send recovery.bin over the usb port

05:21 <clever> and it will then request pieeprom.{bin,sig}, and re-flash the SPI

05:21 <doug16k> that requires me to know all about rpi4 hardware. I couldn't care less about rpi4 hardware though

05:22 <clever> https://github.com/raspberrypi/usbboot/tree/master/recovery

05:22 <bslsk05> github.com: usbboot/recovery at master · raspberrypi/usbboot · GitHub

05:22 <doug16k> it would be fun to play with the opengl-like gpu though

05:22 <clever> you just run `rpiboot -d recovery` and it reflashes

05:22 <doug16k> is that just a kernel elf boot thing?

05:22 <doug16k> or is that wake up with no ram and doornail mode pci?

05:23 <clever> when using mode 1/3, no ram at all

05:23 <clever> when using 2, ram can be online, and you can boot a full linux

05:24 <clever> https://www.raspberrypi.org/documentation/hardware/raspberrypi/bcm2711_bootloader_config.md

05:24 <bslsk05> www.raspberrypi.org: Raspberry Pi 4 bootloader configuration - Raspberry Pi Documentation

05:24 <clever> if BOOT_ORDER hits a 3, then it will use the rpiboot protocol to request start4.elf from a host

05:24 <doug16k> request how

05:24 <clever> and then it can pull config.txt/kernel.img/initrd over that

05:25 <clever> rpiboot has 2 protocols, dumb, and file-server

05:25 <clever> https://github.com/raspberrypi/usbboot/blob/master/main.c#L536

05:25 <bslsk05> github.com: usbboot/main.c at master · raspberrypi/usbboot · GitHub

05:25 <clever> when in file-server mode, you read a `struct file_message` from an endpoint, that contains a command and a filename

05:26 <clever> the host must then respond correctly, and wait for another file_message

05:26 <clever> it supports 3 commands, query file size, read file, quit
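
A rough sketch of the host side of the file-server loop clever describes, with the USB transport left out. It assumes `struct file_message` is a 4-byte little-endian command followed by a 256-byte NUL-padded filename, and that the three commands are numbered 0, 1, 2 in the order listed above. Both are assumptions to check against main.c, and `serve` is a made-up name.

```python
import struct

# Assumed wire layout: int command; char fname[256];
FILE_MESSAGE = struct.Struct("<i256s")

# Assumed numbering, in the order the commands are listed in the chat.
CMD_GET_FILE_SIZE, CMD_READ_FILE, CMD_DONE = 0, 1, 2


def parse_file_message(raw):
    """Split one raw message into (command, filename)."""
    command, fname = FILE_MESSAGE.unpack(raw)
    return command, fname.split(b"\0", 1)[0].decode("ascii")


def serve(messages, files):
    """Answer requests until the device says it is done.

    messages: iterable of raw file_message bytes read from the device.
    files: dict mapping filename to contents.
    Returns the list of replies the host would send back.
    """
    replies = []
    for raw in messages:
        command, name = parse_file_message(raw)
        if command == CMD_DONE:
            break
        data = files.get(name, b"")
        if command == CMD_GET_FILE_SIZE:
            replies.append(("size", name, len(data)))
        elif command == CMD_READ_FILE:
            replies.append(("data", name, data))
    return replies
```

The point of the shape is that the pi4 drives the conversation: the host only ever reacts to the next `file_message`.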

05:26 <doug16k> who reads

05:27 <clever> the pi4 initiates the reads

05:27 <clever> while acting as a usb device

05:27 <doug16k> what usb device

05:27 <doug16k> nothing? just some hacked together hack using libusb?

05:27 <clever> when the bootloader firmware hits a 3 in BOOT_ORDER, the usb-c port of the pi4 goes into device mode

05:28 <clever> and talks to that libusb code

05:29 <clever> when using rpiboot correctly (or my webusb re-implementation), you can push over the start4.elf/kernel.img/initrd, and boot linux up

05:29 <clever> linux can then take over the usb controller, and gadget mode anything it wants to

05:29 <doug16k> so it can't just be a compile step that updates it

05:30 <doug16k> I have to spoon feed it each time from some daemon

05:30 <clever> yeah

05:30 <doug16k> screw that

05:31 <clever> you can always make something better, if you boot something from SD

05:31 <clever> and then use a protocol of your own choosing

05:31 <geist> hmm, not too bad. wanted to benchmark how fast a arm64 core can crc32 check a buffer of moderate size

05:31 <doug16k> the only thing keeping my rpi4 out of the garbage can is use as a 3rd fallback if none of my 3 better machines work

05:31 <geist> it just did 16GB of crc32 in units of 64k (so the cache is pretty hot) in about 5.7 seconds

05:31 <geist> so somewhere around 3GB/sec
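
For reference, the arithmetic behind that figure, plus a sketch of the same style of benchmark. This uses Python's `zlib.crc32`, which is not what geist ran (the chat does not say how the CRC was computed), so absolute numbers will differ. `crc32_throughput` is a hypothetical helper.

```python
import time
import zlib


def gb_per_sec(total_gb, seconds):
    """Throughput from a total size and a wall-clock time."""
    return total_gb / seconds


def crc32_throughput(total_bytes, chunk_bytes=64 * 1024):
    """CRC the same hot 64k buffer repeatedly; return (crc, bytes per second)."""
    buf = bytes(chunk_bytes)
    crc = 0
    start = time.perf_counter()
    for _ in range(total_bytes // chunk_bytes):
        crc = zlib.crc32(buf, crc)  # running CRC carried across chunks
    elapsed = time.perf_counter() - start
    return crc, total_bytes / elapsed


quoted = gb_per_sec(16, 5.7)  # about 2.8, which rounds to the quoted "around 3"
```

Reusing one 64k buffer keeps the data in cache, so this measures the CRC loop rather than memory bandwidth, which matches the setup described in the chat.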

05:32 <geist> this is a rpi4

05:32 <clever> doug16k: thats part of why i want to get proper source for the ddr4 controller

05:32 <clever> doug16k: so i could then put custom code into the SPI flash, and it could boot directly into a usb gpu, for example

05:32 <clever> no daemon involved

05:36 mctpyt has quit [Ping timeout: 258 seconds]

05:38 <klange> I think... https://klange.dev/s/Screenshot%20from%202021-07-02%2014-37-20.png

05:38 <klange> This should work quite nicely... https://klange.dev/s/Screenshot%20from%202021-07-02%2014-38-20.png

05:49 elastic_dog has quit [Ping timeout: 256 seconds]

05:52 <clever> https://github.com/mesa3d/mesa/blob/main/src/gallium/drivers/vc4/vc4_context.h#L208-L224

05:52 <bslsk05> github.com: mesa/vc4_context.h at main · mesa3d/mesa · GitHub

05:52 <clever> doug16k: this is the source for driving the 3d pipeline on the whole pi0 to pi3 range

05:52 <clever> if you want to look at vertex shaders in more depth on that

05:52 <clever> i'm currently trying to find out where in the source it generates one..

05:55 elastic_dog has joined #osdev

06:03 <clever> found some relevant code in vc4_draw.c

06:09 <doug16k> ah you have to do stuff for tiled render too eh?

06:09 <clever> doug16k: there is a dedicated control list for the binning process, and the gpu somehow sorts tri's into tiles for you

06:10 <clever> it will generate an array of functions that does something to render a tile

06:10 <clever> the renderer control list, is then an unrolled for loop, to call each function in that generated array

06:11 <clever> https://github.com/cleverca22/gl/blob/master/core.c#L452-L475

06:11 <bslsk05> github.com: gl/core.c at master · cleverca22/gl · GitHub

06:11 <clever> doug16k: opcode 115 sets the coordinates for the destination tile, opcode 17 calls some code the binner generated, rendering one tile worth of tris, and opcode 24/25 commit that tile to ram
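
A sketch of the "unrolled for loop" shape of that render control list. The opcode numbers are the ones quoted in the chat. The payload layouts (one byte each for tile x and y, a 32-bit address for the call) and the 32-byte spacing of the binner's per-tile lists are assumptions for illustration, as is the name `build_render_list`.

```python
import struct

TILE_COORDINATES = 115   # sets the destination tile
CALL_TILE_LIST = 17      # calls the code the binner generated for that tile
STORE_TILE = 24          # commits the tile to ram
STORE_TILE_AND_EOF = 25  # same, and marks the end of the frame


def build_render_list(tiles_x, tiles_y, tile_list_base, bytes_per_tile_list=32):
    """Emit one (coords, call, store) group per tile, in row-major order."""
    out = bytearray()
    last = (tiles_x - 1, tiles_y - 1)
    for y in range(tiles_y):
        for x in range(tiles_x):
            out += struct.pack("<BBB", TILE_COORDINATES, x, y)
            # Assumed layout: the binner's list for tile (x, y) sits at a
            # fixed stride from a base address.
            addr = tile_list_base + (y * tiles_x + x) * bytes_per_tile_list
            out += struct.pack("<BI", CALL_TILE_LIST, addr)
            store = STORE_TILE_AND_EOF if (x, y) == last else STORE_TILE
            out += struct.pack("<B", store)
    return bytes(out)


render_list = build_render_list(2, 2, 0x1000)
```

Under these assumptions each tile costs 9 bytes of control list, and only the final tile uses the end-of-frame store.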

06:12 <clever> kernel/vc4_packet.h: VC4_PACKET_TILE_COORDINATES = 115,

06:12 <clever> aha, and thats how mesa refers to it

06:15 <clever> 27 * In the VC4 driver, render command list generation is performed by the

06:16 <clever> 28 * kernel instead of userspace.

06:16 <clever> doug16k: aha, that could explain why some code appears to be missing!

06:16 <clever> the shader code can basically write to any ram it wants to, all security is out the door

06:16 <clever> and validating that the configuration is sane is cpu intensive

06:17 <clever> far safer for the kernel to just generate a sane config, based on your requirements

06:18 <doug16k> dma can usually write to whatever it wants

06:18 <clever> yep

06:19 <clever> in the case of the pi4, there is an extra MMU between the 3d hw and ram

06:19 <clever> which kinda acts like an IOMMU

06:19 <clever> but it was added more for ram size reasons, the 3d hw is still 32bit based, and cant address beyond 4gig

06:20 <clever> but the bcm2711 can support up to 16gig of ram

06:20 <clever> so the extra MMU lets the dma write to 64bit addresses, while only dealing with 32bit internally

06:20 <clever> its 3am, i should get to bed now

06:39 Phibred has quit [Quit: Leaving]

06:49 klysm has quit [Quit: Lost terminal]

07:16 <doug16k> I made a few performance fixes in gnuchess

07:16 <mjg> :)

07:17 <doug16k> it's hilarious how strong it is if you let it think during your turn, with 16GB transposition table and PGO LTO build

07:17 <doug16k> it now spends more time on chess than select spin

07:18 <mjg> did it migrate to a neural network?

07:18 <geist> gosh i was getting owned by sargon chess on z80 the other day

07:18 <mjg> afair stockfish did

07:18 <geist> even downclocked it to 1mhz and still got my butt kicked

07:18 <doug16k> 3950x on heavily optimized gnu chess is a monster against me

07:18 <mjg> geist: https://www.youtube.com/watch?v=Ytkf3qZTj74

07:18 <bslsk05> 'Grandmaster Naroditsky Chess Speedrun Pt. 1' by Daniel Naroditsky (00:13:59)

07:19 <geist> though it may be possible that the original 8080 sargon is actually pretty good

07:19 <geist> at least for the time. looking at the page on wikipedia it was no slouch

07:20 <doug16k> if I enable post, the thinking just jumps straight to depth 18 in a flash, then it is getting hard to get to 19, 20, 21, etc

07:21 <doug16k> how can you beat that?

07:21 <mjg> ask stockfish

07:21 <doug16k> ya stockfish would destroy a grandmaster though

07:23 <doug16k> grandmaster games are weird

07:23 <doug16k> they will just put some good piece right where a pawn can take it, and if you take it, it's a guaranteed loss

07:24 <geist> ah Sargon 2.1

07:24 <doug16k> he's seeing if you are so stupid that you will take it

07:25 paulusASol has joined #osdev

07:26 <doug16k> it is a tiny bit creepy though, how you can make a move and it "knows" what you are doing and starts defending against that line or whatever

07:27 hgoel[m] has joined #osdev

07:27 medvid has joined #osdev

07:30 <doug16k> stockfish doesn't work for me

07:30 <doug16k> xboard just stops working if I try to use it

07:31 <doug16k> stockfish is uci or nothing

07:31 <mjg> lol

07:31 <mjg> xboard is incredibly buggy

07:31 <doug16k> yeah

07:31 <mjg> i used to play with a coworker using xboard

07:31 <mjg> it would crash every single time there was a checkmate

07:32 <mjg> also you could promote a pawn to a king

07:32 <doug16k> there are weird variants where you can

07:32 <doug16k> but ya not normal chess

07:32 <mjg> we played regular chess

07:33 <doug16k> suicide chess allows promotion to king

07:34 <doug16k> you win by losing all your pieces, there is no check and losing king is no big deal. if you can attack, you must attack. you get to pick which attack to do if multiple available

07:35 <mjg> personally i like anti-chess

07:35 <mjg> at least that's how i know it by name

07:36 <doug16k> that is gentler name

07:36 <mjg> i tried setting up leela locally but it's just too painful

07:36 <doug16k> kid friendlier

07:36 <mjg> i tried several frontends and they all keep fucking up in some manner

07:36 <doug16k> ikr? what is wrong with linux chess ui code?

07:37 <doug16k> broken!

07:37 <mjg> ye that's an example of something i assumed would be on lock

07:37 <mjg> but no

07:37 <mjg> it's fucking worse than your typical webstack

07:38 <geist> huh seems like it'd be trivial

07:39 <geist> at least the interface. isn't it just ascii based moves?

07:39 <mjg> it's better than that, there is a clear text protocol

07:39 <geist> yah that's what i figured

07:39 <mjg> which makes the entire thing even more perplexing
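
To show how small the text side is, here is a sketch assuming the protocol in question is UCI (the one stockfish speaks, per the earlier message). `position startpos moves ...`, `go depth N` and `bestmove ...` are real UCI lines; the helper names are made up, and a real frontend would also need the `uci`/`isready` handshake and process plumbing.

```python
def position_command(moves):
    """Build the UCI line that sets the board from the start position."""
    cmd = "position startpos"
    if moves:
        cmd += " moves " + " ".join(moves)
    return cmd


def go_command(depth):
    """Ask the engine to search to a fixed depth."""
    return f"go depth {depth}"


def parse_bestmove(line):
    """Pull the chosen move out of an engine reply, or None if it is not one."""
    parts = line.split()
    if len(parts) >= 2 and parts[0] == "bestmove":
        return parts[1]
    return None


to_engine = [position_command(["e2e4", "e7e5"]), go_command(18)]
reply = parse_bestmove("bestmove g1f3 ponder b8c6")
```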

07:41 <mjg> eh, programming classic: write a patch in 5 seconds, fuck with tooling for 50 minutes to test it

07:43 <mjg> https://www.youtube.com/watch?v=AbSehcT19u0

07:43 <bslsk05> 'Hal fixing a light bulb (from Malcolm in the Middle S03E06 - Health Scare)' by Vincent Verschuren (00:00:42)

07:43 <doug16k> where is the terminal chess client?

07:43 <doug16k> unicode has all the pieces
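
A minimal sketch of what a terminal board could look like using the Unicode chess symbols (U+2654 to U+265F). The board representation, one string per rank with FEN-style letters, is just a convenient assumption.

```python
# White pieces are uppercase, black pieces lowercase, as in FEN.
PIECES = {
    "K": "\u2654", "Q": "\u2655", "R": "\u2656",
    "B": "\u2657", "N": "\u2658", "P": "\u2659",
    "k": "\u265a", "q": "\u265b", "r": "\u265c",
    "b": "\u265d", "n": "\u265e", "p": "\u265f",
}

START = [
    "rnbqkbnr",
    "pppppppp",
    "........",
    "........",
    "........",
    "........",
    "PPPPPPPP",
    "RNBQKBNR",
]


def render(board):
    """Return the board as text, rank 8 at the top, with file letters below."""
    lines = []
    for rank, row in zip(range(8, 0, -1), board):
        lines.append(f"{rank} " + " ".join(PIECES.get(c, ".") for c in row))
    lines.append("  a b c d e f g h")
    return "\n".join(lines)


if __name__ == "__main__":
    print(render(START))
```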

07:43 <mjg> on c64! :)

07:45 <doug16k> mjg, that video describes my OS project right now too

07:48 <mjg> btw https://www.youtube.com/watch?v=DpXy041BIlA

07:48 <bslsk05> '30 Weird Chess Algorithms: Elo World' by suckerpinch (00:42:35)

07:48 <mjg> the guy is a genius

07:49 paulusASol has quit [Quit: Client limit exceeded: 20000]

07:53 hgoel[m] has quit [Quit: Client limit exceeded: 20000]

07:54 tacco has joined #osdev

07:55 medvid has quit [Quit: Client limit exceeded: 20000]

07:55 tacco has quit [Client Quit]

08:00 paulusASol has joined #osdev

08:09 medvid has joined #osdev

08:09 sortie has joined #osdev

08:09 hgoel[m] has joined #osdev

08:22 dennis95 has joined #osdev

08:23 paulusASol has quit [Quit: Client limit exceeded: 20000]

08:28 _mrlemke_ has joined #osdev

08:28 scaleww has joined #osdev

08:31 mrlemke has quit [Ping timeout: 258 seconds]

08:32 medvid has quit [Read error: Connection reset by peer]

08:32 hgoel[m] has quit [Read error: Connection reset by peer]

08:33 paulusASol has joined #osdev

08:40 paulusASol has quit [Remote host closed the connection]

08:40 sortie has quit [Quit: Leaving]

08:46 mrlemke has joined #osdev

08:49 _mrlemke_ has quit [Ping timeout: 252 seconds]

09:06 GeDaMo has joined #osdev

09:06 paulusASol has joined #osdev

09:07 KidBeta has joined #osdev

09:14 medvid has joined #osdev

09:14 hgoel[m] has joined #osdev

09:17 <sahibatko> Hi, I have a question about qemu and uefi: I run qemu with ... -accel hvf -device virtio-vga ..., that results in one "virtio-vga" entry in the qemu's "view" menu. I see a single window, but detect 2 GOP devices with a framebuffer (plus one "protocol" without a framebuffer, but that is expected) - where did the second framebuffer come from? Next issue is that writing to the framebuffer directly does not

09:17 <sahibatko> light up any pixel in the active qemu window, but more on that later.

09:23 xenos1984 has quit [Ping timeout: 256 seconds]

09:25 xenos1984 has joined #osdev

09:36 <sahibatko> * same result with -device bochs-display btw

09:38 janemba has quit [Ping timeout: 272 seconds]

09:41 janemba has joined #osdev

09:48 _mrlemke_ has joined #osdev

09:51 mrlemke has quit [Ping timeout: 246 seconds]

09:55 <klange> I think you need to disable the default display device, but doug16k probably knows better.

10:03 <geist> also i think you might be able to do something like -vga virtio?

10:03 <geist> instead of -device

10:03 <geist> but possibly you want -vga none -device virtio-vga

10:03 <geist> lots of combos, sometimes you gotta just try em all until something sticks

10:08 <sahibatko> will try those, as soon as I finish some refactoring, but that could be it

10:53 ElectronApps has quit [Ping timeout: 258 seconds]

10:54 ElectronApps has joined #osdev

11:00 <sahibatko> neither works, but perhaps this is setting a different thing - in the real world, there is a GPU (or more), that has some outputs and some outputs can have a monitor connected. Question is, does UEFI firmware enumerate a dedicated GOP protocol (handle?) for each GPU? Output? Monitor? I will probably just take the first one as default and be done with it... but I don't like that attitude

11:28 silverwhitefish has quit [Ping timeout: 265 seconds]

11:45 sortie has joined #osdev

11:54 iorem has quit [Quit: Connection closed]

12:05 pieguy128 has quit [Ping timeout: 272 seconds]

12:06 pieguy128 has joined #osdev

12:49 freakazoid333 has quit [Read error: Connection reset by peer]

13:01 vai has joined #osdev

13:01 <vai> oh well, sharing int 0x80 with PIC controller and system calls

13:02 <vai> tons of lines of assembly to figure it out

13:02 <vai> which is it

13:02 <vai> definitely you don't want it to probe the PIC every it makes a system call :))

13:03 <vai> *everytime

13:03 <vai> using int 0x7F actually for system calls
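
A small sketch of the conflict being described: if the PIC's 16 IRQ vectors overlap the system call vector, the handler has to ask the PIC which one it got. The base values below are hypothetical, since the chat does not say where vai's PIC is mapped, and the function names are made up.

```python
def pic_vectors(master_base, slave_base):
    """The 16 interrupt vectors used by a master/slave 8259 pair."""
    master = set(range(master_base, master_base + 8))
    slave = set(range(slave_base, slave_base + 8))
    return master | slave


def syscall_vector_conflicts(syscall_vector, master_base, slave_base):
    """True if a hardware IRQ can arrive on the system call vector."""
    return syscall_vector in pic_vectors(master_base, slave_base)


# Hypothetical layout with the PIC remapped to start at 0x80.
shared = syscall_vector_conflicts(0x80, 0x80, 0x88)    # True: must probe the PIC
moved = syscall_vector_conflicts(0x7F, 0x80, 0x88)     # False: syscalls are unambiguous
```

Either moving the system call vector (as vai did) or remapping the PIC elsewhere removes the need to probe on every call.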

13:08 KidBeta has quit [Ping timeout: 250 seconds]

13:34 springb0k has joined #osdev

13:43 SGautam has joined #osdev

13:50 srjek|home has joined #osdev

14:06 archenoth has joined #osdev

14:14 wootehfoot has joined #osdev

14:19 ElectronApps has quit [Read error: Connection reset by peer]

14:27 SGautam_ has joined #osdev

14:29 SGautam__ has joined #osdev

14:30 SGautam has quit [Ping timeout: 272 seconds]

14:33 SGautam_ has quit [Ping timeout: 268 seconds]

14:59 SGautam__ has quit [Quit: Leaving]

15:03 Terlisimo has quit [Quit: Connection reset by beer]

15:09 theruran has quit [Quit: Connection closed for inactivity]

15:10 mahmutov has joined #osdev

15:12 Terlisimo has joined #osdev

15:29 Skyz has joined #osdev

15:33 Skyz has quit [Client Quit]

15:38 freakazoid333 has joined #osdev

15:44 mahmutov has quit [Ping timeout: 272 seconds]

15:51 Brnocrist has quit [Ping timeout: 256 seconds]

16:09 Brnocrist has joined #osdev

16:15 Gravis has joined #osdev

16:15 Gravis has left #osdev [Murdered]

16:22 Skyz has joined #osdev

16:28 nick8325 has quit [Ping timeout: 256 seconds]

16:30 nick8325 has joined #osdev

16:40 mlugg has joined #osdev

16:42 <mlugg> Hi, I've read about a 46-bit limit on physical addresses on AMD64. From what I've seen online, this is a limitation in 4-level paging (resolved by Intel's 5-level page table proposal), but I can't for the life of me figure out where the limit actually comes from; the addresses in standard page tables (as described in vol 2 of the architecture

16:42 <mlugg> programmer's manual) all extend up to 52 bits, so where's 46 come from?

16:44 <GeDaMo> Is it not 48?

16:45 nick8325 has quit [Quit: Leaving.]

16:45 <GeDaMo> "Similarly, the 48-bit virtual address space was designed to provide 65,536 (2^16) times the 32-bit limit of 4 GB (4 × 1024^3 bytes), allowing room for later expansion and incurring no overhead of translating full 64-bit addresses." https://en.wikipedia.org/wiki/64-bit_computing#Limits_of_processors

16:45 <bslsk05> en.wikipedia.org: 64-bit computing - Wikipedia

16:46 <mlugg> That's virtual

16:46 <mlugg> And that's a limit I understand

16:50 <mlugg> https://www.kernel.org/doc/html/v5.6/x86/x86_64/5level-paging.html here are kernel.org docs for instance, saying "Original x86-64 was limited by 4-level paing [sic] to [...] 64 TiB of physical address space" (64 TiB = 46 bits)

16:50 <bslsk05> www.kernel.org: 22.4. 5-level paging — The Linux Kernel documentation

16:51 <mlugg> And I saw a few random tech news bits online also suggesting that 64 TiB was a physical limit which 5-level paging resolves although it's very possible they were just quoting those kernel.org docs

17:10 mlugg has quit [Ping timeout: 240 seconds]

17:10 asymptotically has joined #osdev

17:16 Brnocrist has quit [Ping timeout: 258 seconds]

17:18 <geist> aww they're gone

17:18 <geist> the physical limit is fairly arbitrary and different on different cores

17:19 <geist> some are 36, lots are 40, some are 46

17:19 <geist> it's described in a ID register

17:19 <geist> same as on x86

17:19 <j`ey> they said AMD64, not ARM64 :P

17:21 <geist> ah you're right. just woke up, eyes aren't seeing well

17:21 <geist> but it works same way on x86 really

17:22 <geist> cpuid describes the size and you get up to 52. and it can vary between cores

17:22 <geist> my take is they just dont include more TLB bits than the market that particular core targets needs

17:22 <geist> since at the end of the day it's basically tag width

17:22 <geist> also tag width in the caches too, but those are invisible
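
To put numbers on this: on x86-64, CPUID leaf 0x80000008 reports the physical address width in EAX bits 7:0 and the linear (virtual) width in bits 15:8. The sketch below only decodes a value you would have read elsewhere. The EAX values used are examples, not taken from any particular CPU.

```python
def decode_address_sizes(eax):
    """Return (physical_bits, linear_bits) from CPUID 0x80000008 EAX."""
    return eax & 0xFF, (eax >> 8) & 0xFF


def addressable_tib(bits):
    """Size of a 2**bits byte address space, in TiB."""
    return (1 << bits) >> 40


# Example value: 46 physical bits, 48 linear bits.
phys_bits, linear_bits = decode_address_sizes(0x302E)

limit_46 = addressable_tib(46)   # 64 TiB, the figure in the kernel.org docs
limit_52 = addressable_tib(52)   # 4096 TiB, the page table format's ceiling
```

So 46 bits matches the 64 TiB quoted earlier, while the 52-bit fields in the page tables are an architectural ceiling that a given core may not implement.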

17:24 zoey has joined #osdev

17:40 <Skyz> I'm not sure I'm gonna continue programming, there is too much math involved and solving these problems takes a long time and it's very mentally taxing

17:41 <Skyz> I don't know how you know so much about OS now that I started programming

17:44 silverwhitefish has joined #osdev

17:54 <Skyz> Publishers working on software for a homebrew OS would be cool

17:54 <Skyz> I have ideas for an OS I just can't implement them

18:09 vai has quit [Remote host closed the connection]

18:15 dennis95 has quit [Quit: Leaving]

18:20 Brnocrist has joined #osdev

18:25 klysm has joined #osdev

18:33 tenshi has joined #osdev

18:41 Arthuria has joined #osdev

18:53 Arthuria has quit [Ping timeout: 265 seconds]

19:00 <immibis> Skyz: publishers? Publishers are people who want to make money. Homebrew OSes do not.

19:15 mahmutov has joined #osdev

19:28 <Skyz> Maybe this time around homebrewers can make money, enough to sustain the OS

19:28 <j`ey> https://awesomekling.github.io/I-quit-my-job-to-focus-on-SerenityOS-full-time/

19:28 <bslsk05> awesomekling.github.io: I quit my job to focus on SerenityOS full time – Andreas Kling – I like computers!

19:32 freakazoid333 has quit [Read error: Connection reset by peer]

19:32 <immibis> nope.

19:41 freakazoid333 has joined #osdev

19:47 freakazoid333 has quit [Read error: Connection reset by peer]

19:49 tenshi has quit [Quit: WeeChat 3.2]

19:50 freakazoid333 has joined #osdev

19:52 freakazoid333 has quit [Read error: Connection reset by peer]

19:57 dutch has quit [Quit: WeeChat 3.0.1]

20:00 dutch has joined #osdev

20:02 warlock_ has joined #osdev

20:02 warlock_ is now known as doubletoker

20:03 doubletoker has left #osdev [#osdev]

20:10 Skyz has quit [Quit: Client closed]

20:11 Skyz has joined #osdev

20:20 offlinemark has quit [Quit: Connection closed for inactivity]

20:23 Skyz has quit [Quit: Client closed]

20:24 Skyz has joined #osdev

20:37 GeDaMo has quit [Quit: Leaving.]

21:02 robert_ has joined #osdev

21:21 <kazinsal> the serenityos guy is also a dickhead who thinks that operating system hobbyist purity tests should be conducted and include ideas such as "no precompiled images because if you're not building it yourself you're not a real osdev hobbyist"

21:25 <j`ey> I still hope they'll budge on that one day heh

21:41 SanchayanMaity has quit [Ping timeout: 272 seconds]

21:42 paulbarker has quit [Read error: Connection reset by peer]

21:42 jakesyl has quit [Read error: Connection reset by peer]

21:43 <Skyz> Serenity seems pretty significant

21:45 asymptotically has quit [Quit: Leaving]

21:45 freakazoid333 has joined #osdev

21:52 <Skyz> If only to show that there is still interest in people having a new pc experience

21:52 <Skyz> I think win11 will be interesting once it comes out

21:53 SanchayanMaity has joined #osdev

21:53 jakesyl has joined #osdev

21:54 paulbarker has joined #osdev

21:58 <Skyz> Seems difficult to understand all the specifications for an OS

21:58 <j`ey> yup

21:59 <Skyz> I'm looking at the APIC now, and can only grasp a small amount of it, something about IRQs

21:59 <Skyz> I see that spoken about a lot

22:21 pony has quit [Quit: WeeChat 2.8]

22:22 pony has joined #osdev

22:24 CryptoDavid has joined #osdev

22:27 pony has quit [Quit: WeeChat 2.8]

22:28 pony has joined #osdev

22:29 pony has quit [Client Quit]

22:30 mrlemke has joined #osdev

22:30 _mrlemke_ has quit [Read error: Connection reset by peer]

22:32 freakazoid333 has quit [Read error: Connection reset by peer]

23:35 zoey has quit [Remote host closed the connection]