#osdev on 2022-07-10 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 <mrvn> amd64 has 2 calling conventions but you can follow both at the same time.

00:00 SpikeHeron has quit [Quit: WeeChat 3.5]

00:01 <mrvn> Or looking at it another way: The compiler is free to optimize away the 'al' when it knows no varargs are involved.

00:02 <mrvn> But back to my problem: How do I allocate uninitialized memory for a T[] in c++20?

00:02 <doug16k> ahh so this means if you injected a library that took ... for twofold it would break

00:03 <doug16k> but you declared a prototype, so I guess that is ok

00:03 <mrvn> doug16k: if you declare twofold(float) and then twofold(...) that violates the one definition rules

00:03 <mrvn> (in C), in c++ that would simply be 2 separate functions

00:03 <dh`> if you inject libraries things will ~always break

00:04 <mrvn> doug16k: This falls unter the heading of: don't lie to your compiler

00:05 <doug16k> mrvn, std::aligned_storage, placement new

00:06 <mrvn> (deprecated in C++23)

00:06 <mrvn> already on the way out of the standard

00:06 <doug16k> how do you allocate nodes in a container implementation then?

00:07 SpikeHeron has joined #osdev

00:07 <mrvn> doug16k: other than "new T(args)" I have no idea.

00:07 <doug16k> if you can't slap an aligned_storage of T in the node struct

00:08 <mrvn> doug16k: why do you think I'm asking what the way for that is?

00:08 <doug16k> it is deprecated because you might as well alignas(T) char thing[sizeof(T) * N]

00:09 <mrvn> doug16k: and how do I heap allocate that?

00:10 <mrvn> doug16k: note: I need something that can be freed with delete[]

00:10 <doug16k> are there allocators in 23?

00:11 <doug16k> placement new[][

00:11 <mrvn> placement new[] doesn't allocate and there is construct_at instead

00:11 <mrvn> struct allocator<void>; (removed in C++20), hence my question.

00:12 <mrvn> There is template< class T > struct allocator; it seems.

00:12 <mrvn> Looks like I just have to use the T flavour instead of the void specialization.

00:16 <mrvn> doug16k: is it save to call `delete[]` on something allocated by the default allocator?

00:16 <doug16k> yeah, you have to construct with new[] though

00:17 <mrvn> My feeling is that if I use std::allocator<T>::allocate I also have to use std::allocator<T>::deallocate

00:17 <doug16k> compiler will use the count it sneaked in for destructs then it will call your delete[]

00:17 <doug16k> yes, in you delete[]

00:17 <mrvn> how does delete[] know to call the allocator?

00:18 <doug16k> you wrote it

00:18 <doug16k> a class new[] and delete[] will make it use those

00:19 <mrvn> doug16k: no class new involved

00:19 <mrvn> std::allocator<T> alloc; T *t = alloc.allocate(10); /* init t[10] */ return std::unqie_prt(t);

00:20 <mrvn> unique_ptr even

00:20 <doug16k> it forgot the 10

00:20 <vdamewood> Did you bring us presents unqie_ptr?

00:20 <mrvn> using std::unique_tr<T[]>

00:21 <doug16k> yeah that would do it

00:21 <vdamewood> Let's ust pretend that mrvn types everything correctly.

00:21 <mrvn> It feels like this requires a destructor that calls alloc.deallocate(t)

00:21 <vdamewood> Let's pretend I type everything correctly too.

00:21 <mrvn> vdamewood: you are the great pretender

00:21 <vdamewood> I'm the master of pretending!

00:21 <mrvn> s/destructor/deleter/

00:21 <doug16k> mrvn, unique_ptr<T[]> will placement new[] and delete[] for you

00:22 <vdamewood> Yesterday, I pretended I was a functional human being!

00:22 <mrvn> doug16k: the question is whether delete[] is even the right thing.

00:22 <vdamewood> Today, I'm pretending that I'm an imperative human being.

00:22 <doug16k> it needs to remember to destruct 10. how would it?

00:22 <mrvn> doug16k: delete[] is potentially different from alloc.deallocate(t)

00:23 <doug16k> delete[] gets a pointer just like free(), it couldn't care less how many items there are - it destructed them already!

00:23 <doug16k> you can just free(p) if the new[] return malloc(p)

00:24 <mrvn> doug16k: that isn't the problem. The problem is that alloc.allocate() might do something sifferent

00:25 <doug16k> er malloc(sizeof(*p) * N) I mean

00:26 <mrvn> doug16k: "Allocates n * sizeof(T) bytes of uninitialized storage by calling ::operator new(std::size_t) or ::operator new(std::size_t, std::align_val_t) (since C++17), but it is unspecified when and how this function is called." I know it can't call malloc.

00:29 <mrvn> "The "unspecified when and how" wording makes it possible to combine or optimize away heap allocations made by the standard library containers" is the part I'm worried about.

00:29 <doug16k> you expect the optimizations to break it?

00:29 <mrvn> no, just that it is unspecified how it behaves in detail

00:32 <doug16k> unique_ptr can take a deleter if you want to do something funny to free it

00:32 <mrvn> but then it has a different type. I really want to avoid that

00:37 <geist> Oh no i look away and it has once again turned into a C++ channel

00:38 <geist> Though i guess that’s better than a ‘hack Linux to install rootkit’ channel

00:38 <zid> geist: Got a decent reason why amd64 sysvabi uses regs for varargs?

00:38 <geist> Versus what?

00:38 <zid> stack I guess

00:39 <zid> which has the benefit of being linear rather than piecemeal

00:39 <geist> It’s fairly common for most systems I’ve seen to do varargs as pretty much regular calling convention

00:39 <moon-child> zid: so you can declare void f(); in one TU and void f(actual params here) { ... } in another TU

00:39 <zid> moon-child: what's that have to do with anything?

00:39 <moon-child> zid: afaik apple arm is the only abi that passes varargs on the stack

00:39 <geist> In the case of piecemeal one solution the callee can do is just allocate the space on the stack linearly, and dump the regs such that they line up with the ones that are pushed

00:39 <doug16k> zid, it is why varargs are called the same - you can't mindread if it is varargs

00:39 <zid> x86 pmode did

00:40 <zid> () is varargs already

00:40 <zid> or rather, unspecified

00:40 <geist> Though i guess that’s hard on x86 because it’s already pushed the return address

00:40 <zid> so it actually emits code, xor rax, rax

00:40 <mrvn> moon-child: but only in C

00:40 <geist> Yah the ram-holds-number-of-float thing is unique to x86-64 AFAIK

00:40 <geist> I haven’t seen anything like that in other arches

00:40 <geist> S/ram/rax (stupid autocorrect)

00:40 <moon-child> mrvn: where else?

00:40 <zid> geist: oh right yea didn't think about just doing a big reg dump if you *wanted* the linear case, right

00:41 <moon-child> topic of discussion was c abis, presumably, not for other languages

00:41 <moon-child> zid: pushad go brrr

00:41 <mrvn> moon-child: in C++ void f(); means no arguments.

00:41 <geist> Yah that’s what arm 32/64 does in general

00:41 <geist> And probably riscv

00:41 <geist> But those are easier because the callee can control where the return address goes

00:41 <zid> I'm of the opinion that 99% of the time you *are* in a linear situation (printf etc) rather than a 'counted' `int fmt, ...` style

00:41 <geist> So it can just up front allocate space on the stack for the args to get slammed down on the stack, prior to pushing anything else

00:41 <zid> so it seemed saner to me to just form the linear dump in the caller

00:42 <mrvn> hmm, what does this declare in C++: extern "C" void f();?

00:42 <zid> where it saves a mov to a reg

00:42 <moon-child> zid: I agree, it is more sensible

00:42 <moon-child> but, compat

00:42 <moon-child> mrvn: probably no-args

00:42 <geist> I think the general idea is that you can call a varargs function without a varargs declaration, but that’s probably not legal

00:42 <geist> Because of zero rax if nothing else

00:42 <zid> It's not, eax will be corrupt

00:42 <mrvn> moon-child: probably. the extern is just about linage

00:42 <mrvn> linkage

00:42 <geist> Or argument widening implications of varargs (float -> double, etc)

00:42 <moon-child> mrvn: consider that you can say extern "C" void f() { ... }. extern "C" is just for link--yeah, exactly

00:42 <doug16k> geist, gcc emits eax changes if it doesn't have a prototype and you pass a float

00:43 <mrvn> geist: it's legal due to implicit prototypes

00:43 <doug16k> and not in same unit

00:43 <geist> doug16k:yah makes sense. It’s at worst just a waste of time to null rax

00:43 skipwich has joined #osdev

00:43 <geist> Since it otherwise doesn’t participate in the call

00:43 <zid> so I guess my takeaway is it isn't *as* wasteful as I thought

00:43 <zid> but I'm still not sold on regs being the default over stack

00:43 <mrvn> geist: do you know any other arch where the first argument register isn't the first return value register?

00:44 <mrvn> zid: it's generally faster

00:44 <geist> Well, would only apply to arches with args in registers (vs say x86-32 or 68k which are stack based)

00:44 <geist> Register window ones dont do that

00:44 <geist> Ie, sparc/etc

00:44 <geist> Alas getting kicked out of cofeee shop

00:45 <geist> But in general no. Most of the time the return is the same as the arg regs

00:45 <mrvn> arm, alpha, mips don't do it. powerpc?

00:45 <doug16k> oh yeah, I use fastcall in my 32 bit bootloader code (eax edx ecx parameters) - guessing it does something similar to x86_64 to get 1st varargs from edx ecx

00:47 <doug16k> mregparm=3

00:47 <mrvn> here is another C++ question (sorry geist): If I have `<=>` with strong ordering why doesn't that provide `bool operator==(const T &other) { return (*this <=> other) == 0; }`?

00:48 <zid> I keep telling geist to wear pants but he won't listen

00:49 <mrvn> zid: don't make giest wear pants or I have to put on some too

00:49 <moon-child> zid: why can't you just be normal and take off your pants like everybody else?

00:49 <klys> lewd

00:50 <moon-child> pants are just an imposition by society, man

00:50 <doug16k> you can be sure I have pants on if you see me in irc

00:50 <klys> haven't see you for a while

00:51 <klys> except in #asm and #glibc

00:51 <zid> moon-child: It's not normal it's barbaric

00:51 <heat> #glibc exists?

00:51 <zid> heat exists? :(

00:51 <moon-child> zid: come over tonight, and I'll show you barbaric

00:51 <heat> zid, yes

00:51 <zid> moon-child: 'Come over, the goths aren't home'

00:52 <moon-child> Legend has it that #glibc is just where ulrich drepper goes to let out his frustrations. Not recommended.

00:52 <moon-child> (with apologies)

00:52 <heat> #musl > #glibc

00:52 <heat> deal with it

00:53 <klys> because onyx runs musl dude

00:53 <heat> and alpine

00:53 <zid> and javascript, so instantly discounted

00:53 <heat> actually I'm working on porting v8 right now

00:53 <heat> so, not yet

00:53 <doug16k> heat, how awful is it to port v8?

00:53 <mrvn> doug16k: I don't see you on irc, no webcam.

00:54 <zid> I bet heat is cheating and not actually porting it, and just hosting it

00:54 <heat> doug16k, I don't know? hope that's its not much

00:54 <moon-child> mrvn: even if there were a webcam, you wouldn't be seeing doug16k, just a representation of him

00:54 <moon-child> mrvn: as is, you still see a representation of him (unless you are using a screenreader?), just a lower-resolution one

00:54 <doug16k> heat, the multiprocess stuff looked terrifying when I took a peek

00:54 <mrvn> moon-child: unless he is a weeping angle. The image is the thing.

00:54 <heat> doug16k, what multiprocess stuff?

00:55 <doug16k> tons of IPC - it hosts each domain in separate process etc

00:55 <heat> isn't that all chrome?

00:55 <doug16k> V8 has workers

00:55 <doug16k> webworkers

00:55 <doug16k> isn't whole multiprocess mess in there too?

00:55 <heat> I don't know?

00:55 <heat> we don't use that in workers

00:55 <zid> They could be green threads and it's all uniprocess uniaddress space cus javascript is UNHACKABLE

00:56 <clever> doug16k: ive found that a lot of the multi-process stuff in chrome is kind of pointless, there are too many things that require a single thread in the master process

00:56 <heat> I thought you only have isolates

00:56 <clever> doug16k: very often, that one thread takes too long to answer, and all of chrome hangs

00:56 <zid> clever: I still use a single thread main browser

00:56 <mrvn> clever: it also only creates a limited number of sandboxes

00:56 <zid> (firefox went multiprocess at the same time they fucked all the addons and stuff, so there isn't an ff fork with the old plugins that's multiprocess)

00:56 <clever> mrvn: in think in the pre spectre/meltdown days, there was a lower limit, but since then, the rules have changed, and its now one domain per renderer process

00:56 <doug16k> heat, I think you might pull it off though :D

00:57 <mrvn> firefox has the same problem with a single blocking main thread.

00:57 <clever> so if you do exploit the system and access other stuff in the pid, its only your own domain at risk

00:57 <clever> so example.com can never scrape data from paypal.com

00:57 <heat> doug16k, I hope so, from a very quick node strace it seemed pretty basic

00:57 <heat> i was only lacking epoll but I assume there's a poll fallback

00:58 <mrvn> clever: the bigger problem I have is stopping domains. I don't want invisible tabs to run javascripts for days and days

00:58 <zid> You were supposed to properly port it heat

00:58 <zid> Implement your *own* stupid syscalls, and make it use them

00:58 <clever> mrvn: yeah, i can only manage that by checking cpu usage in the chrome task manager, and killing child procs

00:58 <heat> zid, what stupid syscalls

00:59 <mrvn> clever: I want the javascript engine to pause when a tab hasn't been active for a while

00:59 <mrvn> preferably configurable with exceotions per domain

00:59 <clever> mrvn: most recently, i discovered that one of my bugs, involves chrome adding/removing elements from a std::vector<ObserverStorageType>, which is very cpu costly

00:59 <heat> mrvn, that's possible in v8

00:59 <clever> yeah, that would be lovely

00:59 <zid> heat: WaitForObjectWithMonadFungibleTokenEx

01:00 <heat> poll

01:00 <mrvn> e.g. I want my mega tabs to run to completion but pretty much nothing else.

01:01 <heat> why does v8 use depot fucking tools

01:01 <doug16k> in chrome you can open chrome's task manager and blow away stuff deemed to be hogging resources

01:01 <mrvn> doug16k: to much work

01:01 <clever> personally, only discord, facebook messenger, and the tabs that are focused in each window, should get JS time

01:01 <heat> there's a way to kill v8 isolates

01:01 <mrvn> doug16k: and as said, I wantr to pause them, not kill them

01:01 <clever> and everything else should grind to a screeching halt the instant it looses focus

01:01 <heat> they get an uncatcheable exception and die

01:01 <zid> chrome has suspended out of focus shit for ages now

01:02 <zid> unless you use webworkers or whatever

01:02 <clever> zid: perhaps my version 96 build is to blame

01:02 <mrvn> zid: which naturally every dirty site does

01:02 <zid> which just meant everybody switched to being abusive via webworkers or whatever

01:02 <zid> and I have to leave a pixel of my idle games exposed and in a sep. window

01:03 <heat> i use firefox in linux because im a hipster

01:03 coelho has joined #osdev

01:03 <heat> i use chrome in windows and android because I'm a normie

01:04 <doug16k> I was playing a game on my main machine from my kitchen machine, then ended up using my desktop from there, so I just sent sigstop to the game and froze it so I could just use the remoteplay for remote control for a bit

01:04 <doug16k> I wonder if you could sigstop things

01:04 <mrvn> how do you know what to stop?

01:04 <heat> that seems like a poor idea

01:05 <doug16k> that becomes the new problem yeah

01:05 <mrvn> if you stop something that's calling the main thread everything stops

01:05 <heat> im fucking dying

01:05 <heat> its 25C here

01:05 <heat> 2am

01:05 <doug16k> air conditioner

01:05 <mrvn> that's not too bad

01:06 <heat> my air conditioner is no bueno

01:06 <doug16k> they are giving them away nowadays

01:06 <zid> It is 15.5C here, 25C was the peak at 2pm

01:06 <doug16k> it's amazing how much cheaper they are now

01:06 <heat> it was 38C here around 2pm

01:07 <zid> thirty WHAT

01:07 <zid> have you considered not living on the surface of the sun?

01:07 <doug16k> drastically more efficient now too - uses the condensation to wet the condenser

01:08 <heat> zid, sunny southern europe weather be like

01:08 <doug16k> not even one drop comes out of my AC. it's all boiled off on the condenser

01:08 <zid> heat: Portugal is a monument to man's hubris and an affront to god

01:09 <heat> agreed

01:09 <heat> lets nuke it

01:11 <zid> sink it into the atlantic

01:11 <heat> where will british people lose their children then

01:12 <zid> That's a sacrifice I am willing to make

01:12 <zid> They can abuse them and hide the body at home like the rest of us I guess

01:13 <heat> magaluf maybe?

01:14 <zid> I'm picturing like, wile e. coyote sawing portugal off iberia

01:15 <zid> and it sinking into the sea

01:16 <heat> it wouldn't sink

01:16 <heat> we'd just end up in the american east coast

01:16 <heat> if we're lucky, canada

01:17 <zid> Nope landmasses absolutely sink, ask pikamee

01:18 pretty_dumm_guy has joined #osdev

01:19 <mrvn> doug16k: getting a bit of extra cooling from evaporating the water?

01:19 <zid> https://www.youtube.com/watch?v=7oL5qf3mwu8

01:19 <bslsk05> 'Pikamee Made me Lose Faith in the Japanese Education System...' by ClipChama (00:02:15)

01:20 <doug16k> getting a huge amount. it takes an enormous amount of energy with it when it evaporates

01:20 <doug16k> it's amazing how much energy it takes to change the state of water

01:21 <clever> zid: is this the clip about whats under japan?, ah yep, it is

01:22 <zid> Imaging being *famously* dumb

01:25 <clever> zid: at least others are asking how fire works in water, lol

01:25 <zid> WATER IN THE FIRE? HOW!?

01:25 <clever> exactly

01:26 <clever> oops, not

01:26 <clever> it was why, not how

01:26 <clever> https://www.youtube.com/watch?v=VbwsZ2D3wto

01:26 <bslsk05> 'Korone Inugami - Water in the Fire' by Omega Kun (00:00:43)

01:26 <zid> ah close enough

01:26 <clever> and that reminds me...

01:26 <clever> https://www.youtube.com/watch?v=AK47S

01:26 <bslsk05> www.youtube.com: - YouTube

01:27 * clever slaps chrome

01:27 <clever> https://www.youtube.com/watch?v=AK47SC6kr_A

01:27 <bslsk05> '42" Water Main Repair' by Underwater Marine Contractors (00:06:58)

01:27 <clever> zid: in this video, they are fixing a tear in the side of a water main by welding it

01:27 <clever> note, they did not dig it up

01:27 <zid> yea underwater welding has some great safety videos

01:27 <clever> they sent a bloody diver, 300 feet down the pipe, to weld it from the inside, while its full of water

01:27 <clever> this isnt just underwater welding

01:28 <clever> this is welding a pipe from the inside

01:28 <zid> Including things like "So, he put a pinhole leak into the pipe, the diver is now on the other side of the pipe completely atomized"

01:28 <zid> *slurrrrp*

01:28 <zid> There's a video of a crab just going *poof* to that somewhere

01:28 <clever> yeah

01:28 <clever> https://www.youtube.com/watch?v=PXgKxWlTt8A

01:28 <bslsk05> 'A crab getting sucked into a underwater pipeline | Delta P' by mchvll (00:00:12)

01:28 <zid> DELTA P

01:28 <clever> the exoskeleton doesnt do jack

01:29 <zid> Yea this is the safety vid :D

01:29 [itchyjunk] has quit [Ping timeout: 244 seconds]

01:29 <zid> DELTA P

01:29 <clever> i also watch a guy that works on water features at a park

01:29 <clever> and he also mentions delta-p

01:29 <clever> specifically, how the intakes have large grates, to spread out the flow, to produce a lower delta-p

01:29 <clever> so you cant get stuck to the grate

01:29 <zid> I also have a large delta P

01:30 <clever> that crab is an extreme case, he isnt just stuck, he is being ripped to shreds

01:30 <zid> His insides are now his outsides.

01:30 <clever> https://www.youtube.com/c/ThePoolGuy

01:30 <bslsk05> 'ThePoolGuy - Home' - 'Here with ThePoolGuy, i'm posting content related to water parks, swimming pools and aquatics in general. I've been in water safety, operations, management and maintenance of water parks and aquatic facilities. I want to share what I know and of course I learn as I go.

01:31 <clever> on this channel, you can see all of the behind the scenes stuff about maintaining a water park

01:31 <clever> and about the only thing he doesnt show, is what buttons to hit on a VFD to turn the motor on/off

01:32 <zid> I assume water park people talking about delta P is like rocket surgeons talking about delta V, totally normal

01:32 <zid> There's also sort of the opposite delta P issue one of my friend works with, diesel leaks

01:32 <zid> go google high pressure injection injury if you're brave

01:33 <clever> i already know of inflation and degloving :P

01:33 <zid> hpij is weird, you feel fine

01:34 [itchyjunk] has joined #osdev

01:34 <zid> then your hand swells up and turns purple

01:34 <clever> there was an episode of NCIS i saw many years ago, where there was a loud bang in a sub, somebody dropped to the floor, and then a loud constant noise

01:34 <zid> and you die of toxic shock and 'dirty diesel in my blood disease'

01:34 <clever> and gibbs stopped somebody from going to help the guy

01:34 <zid> which I suppose are basically the same thing

01:34 <clever> he then waved a broom handle across the room, and half of the broom just fell off

01:35 <clever> a high pressure air line ruptured, and was creating an instant-death laser across the room

01:35 <zid> high pressure jets are the closest things we have to sci-fi lasers :P

01:35 <clever> NCIS turned the sci-fi up to 11, and made it invisible as well

01:35 <zid> diesel injection shit is infact, invisible

01:35 <zid> it won't cut you in half, but you're not supposed to run your fingers along the pipes etc, because of the injection injurys tuff

01:36 <zid> if you 'seal' the hole with your fingertip or whatever

01:36 <clever> ive also been worried about the same thing, if somebody is messing with a pressure washer

01:36 <zid> pressure washers are really low pressure, comparitively

01:37 <zid> diesel injection on engines is 30kPSI, pressure washers cap out at single digit thousands

01:37 <clever> how much do you know about hvac?

01:37 <zid> I don't even know what it spells

01:37 <clever> air conditioners

01:37 <clever> and heat pumps, fridges, freezers

01:38 <zid> I know the princples, no prctical exp

01:38 <clever> got 2 good sources on them, both channels i watch a lot

01:38 <zid> I don't really care about the subject

01:38 <clever> i would have said the same before, but i cant stop watching these 2 guys :P

01:38 <zid> I care that the person isn't a "youtuber"

01:39 <zid> that's not what I said

01:39 <zid> I said I don't care what the subject is, not I don't care about *that* subject

01:39 <clever> ah

01:39 <zid> They have to not be a youtuber, not have awful audio, not have certain vocal ticks or inflections etc

01:39 <zid> and I'm fairly good to go

01:39 <clever> https://www.youtube.com/watch?v=z_Ti4GP0ntE

01:39 <bslsk05> 'Adam Savage's One Day Builds: Refrigerated Cooling Suit!' by Adam Savage’s Tested (01:07:08)

01:39 <zid> adam savage is a youtuber

01:40 <clever> in this one, he works with an AC tech to build a cosplay AC unit, that chills a liquid, and then runs that thru a body suit

01:40 <zid> I watched a great video the other day, nobody spoke.

01:40 <clever> and they explain some of the mechanics of how it works

01:40 <zid> It was close to the platonic ideal youtube video.

01:40 <clever> https://www.youtube.com/c/HVACRVIDEOS

01:40 <bslsk05> 'HVACR VIDEOS - Home' - 'Hello my name is Chris and I will be posting videos about the cool stuff I see out in the field of HVACR. A little bit about my self, I've been involved in the trade for over 15 years and I will tell you right now that I do not know it all, this trade is constantly evolving and new technology is coming out every day. I am based out of Southern California and I mainly work in restaurants. If you have any questions or even suggesti

01:40 <zid> It had the least amount possible of annoying youtubers

01:41 <clever> and this guy just does AC repair work in californaia, and origianlly was recording the work as training for his own employees

01:41 <clever> and decided to just make the videos public

01:46 <mrvn> clever: what would you expect to see? refraction due to different air density? fog due to the air cooling when expaning?

01:46 <mrvn> (in the air jet)

01:46 <clever> mrvn: i would have expected a water jet, not an air jet

01:47 <clever> water would do a lot more damage at lower velocities

01:47 <zid> Turns out water is heavy, who knew

01:47 <mrvn> clever: but it's not a high pressure water pipe, it's the air pipe

01:47 <doug16k> wouldn't there be refraction along the density gradient where the high pressure jet met the surrounding air?

01:47 gog has quit [Ping timeout: 244 seconds]

01:48 <mrvn> doug16k: sure. You can see that with rockets hitting the sound barrier.

01:48 <doug16k> can't be invisible can it?

01:48 <zid> I wonder if you can create shock diamonds

01:48 <zid> by moving water fast enough

01:48 <clever> mrvn: i'm also not sure what a sub would need with such high pressure air

01:48 <mrvn> But that assumes you actually have such an air jet in the scene.

01:48 <mrvn> clever: blowing out the balast tanks

01:49 <mrvn> Doing CGI to show the refraction from an air jet is probably costly.

01:49 <clever> i would think those would be pressure tanks near the balast tanks, and not a pipe bomb running thru half the ship

01:49 <clever> mrvn: also, part of the script was that the danger was supposed to be invisible, and gibbs saved a dude from becoming the 2nd victim

01:50 <clever> just to make him more knowledgable

01:50 <mrvn> clever: but where would that leave the plot? I bet you have such pipes (unpressurised normaly) as backup when you have to blow out the aft tank with the forward air.

01:50 <doug16k> I suppose the background could be unluckily shaded to make the refracted light similar enough

01:50 <mrvn> clever: I'm pretty sure they wouldn't be totaly invisible. But could be hard to spot.

01:51 <clever> mrvn: i would have thought the volume of the pipes would greatly reduce how much pressure and water volume you can displace then

01:51 <zid> clever: I need you to make me some shock diamonds in a stream of water, stat

01:51 <mrvn> clever: that's what pumps are for

01:51 <zid> If you wanna keep it as liquid water I assume it's going to need to be moving *FAST*

01:51 <zid> but maybe at that point some other process just takes over, that seems likely

01:51 <zid> like.. relativity

01:52 <mrvn> zid: want to make a water yet shooting water at fractions of c?

01:53 * clever heads off to bed

01:53 <doug16k> you mean a hydrogen-oxygen plasma right?

01:54 <mrvn> doug16k: how much energy would it take to accelerate 1g of water to .5c?

01:54 <doug16k> 1/2mv^2 right?

01:54 <mrvn> doug16k: no, that only works for low speeds

01:55 <mrvn> there is some 1/(c-v) factor in there.

01:55 <doug16k> .5 won't start having dilation

01:55 SpikeHeron has quit [Quit: WeeChat 3.5]

01:55 <doug16k> small as hell anyway

01:55 <mrvn> c/(c-v) I mean

01:56 <mrvn> m isn't constant

01:56 <doug16k> I played around with making a space travel sim game where it did the whole time dilation / length contraction thing

01:56 <doug16k> seems I should remember better :P

01:56 <mrvn> doug16k: how does that matter?

01:57 SpikeHeron has joined #osdev

01:57 <mrvn> the time dilation only matters to the person on the rocket, the length contraction only looks like it changes something

01:57 <doug16k> trying to remember the term for the length contraction, I forget. nevermind

01:58 <zid> lorentz

01:58 <doug16k> yeah thanks

01:59 <doug16k> you really can accelerate forever, you just shorten the distance to the target if your relative speed is too high

01:59 <mrvn> doug16k: I made a game with the speed of light limit for communication and dead slow rockets (<.2c) but I never made up a good research tree to make it interesting.

01:59 <doug16k> otherwise you could tell you were reaching absolute c

01:59 <mrvn> doug16k: you can accelerate forever but you get less and less out of it

02:00 <doug16k> so you can measure when you reach c then

02:00 <mrvn> doug16k: you never reach C

02:00 <doug16k> what does the apparatus do when you are almost at c

02:00 <mrvn> accelerate very very slowly

02:01 <mrvn> (to an outside observer)

02:01 <doug16k> I mean in the apparatus frame

02:01 <mrvn> nothing

02:01 <doug16k> it looks like the pendulum has no trouble swinging toward and away from forward, right? even though swinging forward is exceeding c

02:02 * klange refreshes irssi statusbar

02:02 <mrvn> doug16k: it's not exceeding c though

02:02 <klange> hm, must be broken, still says this is #osdev...

02:02 <doug16k> bnecause it is shortened to compensate outside that frame

02:03 <mrvn> it also takes a lot longer to swing

02:03 <mrvn> outside that frame

02:04 <heat> why does v8 assume bleeding edge compilers

02:04 <heat> for fuck sake

02:05 <klange> why are you building v8 and what for?

02:05 <heat> this is annoying

02:05 <heat> klange, for nodejs

02:05 <heat> for my OS

02:05 <klange> ew

02:05 <heat> maybe I'll port chromium one day

02:06 <heat> its being relatively smooth since I have the latest LLVM release but... annoying to disable warnings that got introduced 1 month ago

02:07 <heat> and they suffer from googleitis so much

02:07 <heat> prebuilts all the way baby

02:07 <heat> and they also try to build libc++ in tree for some reason

02:08 <doug16k> you sure it isn't such that the people inside the ship see foreshortening of the space in front of them as they accelerate, and they can keep accelerating forever, and they begin to zip forward in time compared to the people outside that see them simply getting squished on forward axis going almost c?

02:09 <doug16k> so they traversed all the time to go that far, but inside didn't see it elapse

02:09 <sbalmos> RelativityOS

02:11 gog has joined #osdev

02:17 <heat> at this point compiling google projects must draw as much power as a small country

02:22 <klys> no header bug yet heat?

02:25 <dh`> this is a way back now, but: <mrvn> geist: do you know any other arch where the first argument register isn't the first return value register?

02:25 <dh`> mips

02:25 <dh`> (arguments are a0-a3 or a0-a5 depending on abi, return is v0 or v0/v1)

02:29 <doug16k> the people inside the ship said they didn't go over the speed of light because the distance was shorter than you saw from outside

02:30 <klange> I ported Pyodide's JS object wrapper thing, "hiwire", and improved the JS integration for the WASM build of Kuroko.

02:30 <klange> And replaced a bunch of eval()s with direct object manipulation and calls.

02:31 <klange> I can even pass Kuroko functions as callbacks to JS things.

02:34 pretty_dumm_guy has quit [Quit: WeeChat 3.5]

02:39 <mrvn> doug16k: time runs slower inside so 1m/s^2 inside becomes less and less from the outside.

02:41 <mrvn> doug16k: the squishing is just an illusion because the photons from the front take a different time to reach an observer than the photons from the back. Or do you mean another effect?

02:42 <doug16k> length contraction

02:44 <doug16k> as you approach infinite velocity in your frame, you become increasingly flattened along your dimension of movement

02:44 <doug16k> from my frame

02:44 <doug16k> when infinite, you are a plane in my frame

02:45 <mrvn> when you reach c you are everywhere at once.

02:46 <doug16k> ...going c

02:46 <doug16k> the lorentz factor affects movement through time and dimensions of space

02:47 <doug16k> when you want to translate from one frame to another

02:48 <doug16k> the infinite speed plane person says they feel fine

02:48 <mrvn> is that stronger or weaker than the contraction or lengthening from photons having to travel different length to reach an observer?

02:49 <doug16k> I think it mostly says that you won't see changes propagate through the universe at a different rate

02:49 terrorjack has quit [Quit: The Lounge - https://thelounge.chat]

02:51 <mrvn> but that is affected by the time dilation

02:51 <doug16k> as if when you move too much distance, it has to make less time elapse

02:51 <doug16k> to the people outside your frame

02:51 zaquest has quit [Remote host closed the connection]

02:51 terrorjack has joined #osdev

02:53 zaquest has joined #osdev

02:54 <mrvn> Like when you send a IP packet through 1foot of cable it takes 1ns. But from outside the second NIC has moved 9 foot in the same time so it takes 10ns outside for the IP paket to reach the destination.

02:54 <doug16k> so you see them slowed down, and they zip to the destination in a week, and it's years later when they get there, they said it was a week and weirdly got closer as we sped up

02:55 <mrvn> And now you are saing that cable will be only half a foot seen from the outside.

02:56 <doug16k> so they didn't go superluminal to go that far in a week, they just burn all the time to get there and skipped forward through it in their frame

02:57 <mrvn> that makes no sense, that's just time dilation

02:59 <doug16k> I don't think the packet thing is right. the other observer would see the start of the transmission a bit later and measure the same time

02:59 <mrvn> doug16k: no, they see the packet move through the cable and move with the frame.

03:00 <mrvn> longer distance seen from the outside. That's why the clock slows down as you move faster.

03:00 <doug16k> you agree that if the sun vanished altogether, we would be in orbit and not know for 8 min right?

03:01 <mrvn> depends on how fast you are going

03:01 <doug16k> earth would keep following the curved space for 8 min after it fanished

03:01 <mrvn> and what mass is near you

03:02 <mrvn> the gravity wave experiments seem to indicate that that is true

03:02 <doug16k> because changes propagate through the universe at c

03:02 <mrvn> on the other hand it can't vanish. Even if it goes supernova e=mc^2 so it's still there

03:03 <mrvn> On that note: I want a condensator which capacity is measured in gramm

03:03 <doug16k> let's say you applied a trillion 800 yotta newton pulses of acceleration per femtosecond to the sun

03:04 <mrvn> then 8s later you would see it start to move

03:04 <mrvn> 8 min

03:04 <doug16k> lol

03:06 <doug16k> I wasn't sure I was saying enough force, the sun is ridiculous

03:07 <doug16k> I showed my nephew a drawing that compares the sun and earth. I drew a little circle for earth. then a straight line down the side showing the side of the sun in comparison :P

03:07 <doug16k> he understood

03:10 <doug16k> earth is totally mindblowing to comprehend, both size and mass

03:10 <mrvn> Ever wonder how time machines work? You are here one second and then you are at the same spot yesterday the next second. Do you realize how many km that spot has travelled in that time ina rather complex direction?

03:10 <doug16k> 32 bit coordinate gets you withing over a foot?

03:11 <doug16k> because earth is so big, 32 bit is not big enough

03:11 <mrvn> 32bit is just 65536 units on each axis. That must be way more than a foot

03:13 <doug16k> you could use it for coordinate system of whole planet, for zoning up sectors or something

03:13 <doug16k> it's precise enough to do that with it

03:13 <doug16k> then data inside the sector is relative to 0, s you have good precision

03:13 <mrvn> you want to use variable length encoding for longitude and latitude though

03:13 CYKS6 has joined #osdev

03:15 <mrvn> Like first 4 bit of the latitude say how many bits latitude uses.

03:15 <mrvn> No pint having 16 longitude at the poles.

03:15 <doug16k> yeah, the true range doesn't go very far north and south

03:15 <doug16k> it becomes uninhabitable pretty quickly

03:15 CYKS has quit [Ping timeout: 255 seconds]

03:16 CYKS6 is now known as CYKS

03:16 <doug16k> maps make it seem like civilization goes way further north

03:17 <mrvn> Well, you do want to be able to place MacMurdoc station (or however it's spelled)

03:17 <doug16k> that is part of the problem though, because most of the range is where longitude is huge full distance, it minimizes precision

03:17 <mrvn> might also go further north/south in the near future

03:17 <doug16k> way up north the longitude is precise

03:19 <mrvn> Just release 4 billion numbered beach balls that repulse each other and let them spread out over the globe. Then after a while record where each ball is and that's yoir 32bit coordinate system.

03:19 <doug16k> longitude becomes excessively precise as you approach the poles

03:20 <doug16k> which is cool when you wasted bits getting it close to 90 deg

03:21 <doug16k> not that many left

03:22 <doug16k> it would be radians but same problem

03:24 <mrvn> it would be 180/2^n and 360/2^n or PI/2^n and 2PI/2^n. doesn't make one bit of difference.

03:24 <doug16k> it would cancel out yeah

03:27 <mrvn> doug16k: here is something fun for you to play with: Build a VR world where c=100m/s or so.

03:28 <mrvn> Cars moving at 0.5c should be fun.

03:40 <doug16k> have home clock and ship clock

03:40 <doug16k> how long it takes to go to work even though it takes a lot less on your car clock

03:41 <doug16k> exactly. that's a scaled down version of what I was thinking floating through space

03:41 heat has quit [Remote host closed the connection]

03:41 <doug16k> probably a better idea because you can see the effects easier

03:41 heat has joined #osdev

03:42 <doug16k> better if you scale c down and put it at ground level

03:44 <doug16k> that is where I saw that 2060 super is 12000 fps with a huge earth mesh with real world topology heights and practically no cpu with sdl+opengl 3.3

03:45 <doug16k> so utterly OP compared to what I need

03:45 <doug16k> topology data from SRTM project

03:47 <doug16k> whole thing is one beautiful triangle strip but come on, I should have had to partition earth

03:47 <doug16k> throw whole earth at it in one drawarrays, poof, utterly effortlessly

03:55 <mrvn> I wonder what that would do to relativistic effects due to mass. The effect would get stronger with lower c, right? And black holes form far earlier too.

03:56 <doug16k> yeah

03:57 <doug16k> your face and the object in your hand could have significantly different lorentz factor

03:59 <doug16k> it would seem hard to push the object fast relative to your chest and face. the people inside the object said oh no all that pushing worked, the space in front of us contracted

03:59 <doug16k> no?

04:02 <doug16k> the people in the object said it took a few seconds, but you pushed it for 25 seconds

04:03 <doug16k> but they observed traversing less space right?

04:03 <doug16k> foreshortening

04:08 <doug16k> they got there impossibly fast because the distance contracted to balance it out

04:09 <doug16k> penalty was more time elapsing in the other frame

04:10 <doug16k> they think they went under c the whole time because the distance was short enough for ludicrous speed to say < c

04:12 <doug16k> but ages elapsed

04:26 frkzoid has quit [Read error: Connection reset by peer]

04:41 <doug16k> speed of light equals mach 1 would probably be survivable

04:42 <doug16k> I doubt much physiology relies on past mach 1

04:43 [itchyjunk] has quit [Read error: Connection reset by peer]

04:43 <doug16k> or maybe mach2 actually, so it is in the mostly linear region most of the time

04:43 <doug16k> or mach 1.414 if you want to be fancy

04:45 <doug16k> 1/sin(45 deg) IIRC

04:47 <doug16k> PI_4 radian constant sometimes

04:48 <doug16k> I thought it would be neat to fly in space with a magic amazing amount of fuel efficiency and power, so the artificial gravity is just 1g accelerate forever

04:48 <doug16k> half way there do 180 and 1g until you stop

04:48 <doug16k> there's your gravity

04:49 <doug16k> and do all the lorentz transforms to see home clock and your clock and the length contraction

04:50 <doug16k> and put time acceleration and see how utterly infeasible it is to travel to another galaxy

04:51 <doug16k> by the time you get 1/4 there earth has hyperdrive and is annihilating neighbour species

04:54 coelho has quit [Ping timeout: 276 seconds]

04:55 heat has quit [Remote host closed the connection]

04:56 heat has joined #osdev

05:10 <doug16k> it would be fun to make a solar system exporer with a lot of stuff in the solar system, with lorentz transforms and magic forever 1g thrust

05:11 <doug16k> see how many years you need to visit something

05:18 <doug16k> strip mine the asteroid belt and get your money to say single precision inf

05:20 <doug16k> then destroy all the neighbours? not sure what humans are expected to do

05:27 CYKS has quit [Quit: Ping timeout (120 seconds)]

05:27 CYKS has joined #osdev

05:39 chartreuse has quit [Read error: Connection reset by peer]

05:42 mzxtuelkl has joined #osdev

05:55 GeDaMo has joined #osdev

06:33 vdamewood has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

06:36 opal has quit [Remote host closed the connection]

06:51 the_lanetly_052_ has joined #osdev

06:52 the_lanetly_052 has quit [Ping timeout: 256 seconds]

06:53 opal has joined #osdev

07:02 the_lanetly_052_ has quit [Ping timeout: 244 seconds]

07:37 the_lanetly_052_ has joined #osdev

08:24 Affliction has quit [Quit: Read error: Connection reset by beer]

08:24 mzxtuelkl has quit [Read error: Connection reset by peer]

08:24 Affliction has joined #osdev

08:40 heat has quit [Ping timeout: 240 seconds]

08:50 dennis95_ has joined #osdev

09:00 toluene has quit [Ping timeout: 256 seconds]

09:02 toluene has joined #osdev

09:26 dennis95_ has quit [Changing host]

09:26 dennis95_ has joined #osdev

09:28 dennis95_ has quit [Quit: Leaving]

09:32 dennis95 has joined #osdev

09:52 dennis95_ has joined #osdev

09:55 pretty_dumm_guy has joined #osdev

09:55 dennis95_ has quit [Client Quit]

12:04 arch_angel has joined #osdev

12:39 Weiland has joined #osdev

13:27 _whitelogger has joined #osdev

13:32 heat has joined #osdev

13:37 tomaw has quit [Quit: Quitting]

13:41 tomaw has joined #osdev

13:49 Weiland has quit [Quit: WeeChat 3.4.1]

14:05 arch_angel has quit [Ping timeout: 255 seconds]

14:42 <zid> anybody touch any grass?

14:46 <heat> no

14:47 <gog> what's grass

14:47 <mrvn> global resource allocation singleton semantic?

14:50 <GeDaMo> Too hot :P

14:50 SpikeHeron has quit [Quit: WeeChat 3.5]

14:52 <gog> :O

14:53 <GeDaMo> https://www.youtube.com/watch?v=KoiUO9MCpdU

14:53 <bslsk05> 'Too Hot' by Kool & The Gang - Topic (00:03:47)

14:55 <zid> does porto even have grass

14:55 <heat> probably

14:55 <heat> im not from porto

14:55 <zid> It is 27.5C, highest 29.1C at 14:03, currently falling 1.0C/hr

14:55 <zid> I'm just too lazy to type portugal

14:55 <zid> that they have a city called porto is just unfortunate

14:55 <heat> its 34C here

14:56 <GeDaMo> What's the humidity like?

14:56 <zid> 40%

14:56 <zid> highest 71% at 7:19

14:56 <heat> 34C 21% humidity

14:56 <GeDaMo> It's 20°C 62% here

14:59 <zid> GeDaMo profiting from global warming

15:00 <GeDaMo> It's a bit warm for me :P

15:00 <heat> where are you from GeDaMo

15:01 <GeDaMo> Scotland

15:04 <zid> My condolences

15:05 <zid> were you born a scotch or did you just get hooked on irn bru and shortbread later in life

15:05 SpikeHeron has joined #osdev

15:06 <GeDaMo> Shaddup or I'll set the haggis on ye! :P

15:07 <zid> I'll just walk the other way around, I know the secret

15:08 <GeDaMo> There are two kinds of mountain haggis,: lefties and righties :P

15:09 <zid> Which is why the trick is to fling yourself down hill

15:12 <heat> https://i.imgur.com/M8FEd8S.png

15:12 <heat> ez

15:13 <heat> the prompt looks a bit broken, I assume its because of stdout buffering in musl not being the same as in glibc

15:13 nur has quit [Remote host closed the connection]

15:13 <zid> That's a funny username

15:14 <heat> what username?

15:14 <zid> It's your image!

15:14 <zid> username:Socket successfully created..

15:14 <heat> lol

15:15 <heat> my daemons' standard streams go out to tty0 for convenience purposes

15:16 <\Test_User> ".."

15:16 <\Test_User> that can't be right no matter how you look at it

15:16 <zid> ... is too long on a mono font

15:16 <zid> and the ... single glyph is fuck ugly

15:16 <\Test_User> true

15:16 <zid> .. is kinda the perfect *length* even if it isn't gramatically sound

15:17 <\Test_User> one single . would work though because it's a "has been done" and not a "is being done"

15:17 <zid> No it's being wistful

15:17 <zid> not final

15:17 <\Test_User> ah right

15:17 <heat> i need to delete all that code

15:17 <zid> login: Remove wistfulness

15:17 <heat> its an artifact of when my sockets weren't stable

15:17 <mrvn> less code, less bugs

15:17 <heat> also, partly copied from a tutorial

15:18 <heat> I would never do ".."

15:18 <zid> well you already did

15:18 <zid> so all we learned is that you're a liar

15:19 <heat> no u

15:30 <gog> mew

15:33 <heat> new

15:34 <GeDaMo> phew

15:35 <zid> gedamo finally found his shortbread

15:35 <zid> he was panicing

15:35 matt__ has joined #osdev

15:35 matt__ is now known as freakazoid333

15:36 <heat> https://elixir.bootlin.com/musl/latest/source/src/aio/aio.c#L109

15:36 <bslsk05> elixir.bootlin.com: aio.c - src/aio/aio.c - Musl source code (v1.2.3) - Bootlin

15:36 <heat> late stage musl code

15:36 <zid> wow, five star programmers

15:36 <zid> better than me :(

15:37 <heat> you're not a 5 star programmer, you're a 10x engineer

15:37 <zid> Thanks babe

15:38 <heat> np sweetie

15:39 <zid> goto bedroom;

15:40 <mats1> yes daddy

15:42 <heat> affirmative father

15:44 <GeDaMo> ita, pater

15:45 <gog> já pabbi

15:45 <zid> I thought you were saying 痛い！ for a moment there

16:23 freakazoid333 has quit [Read error: Connection reset by peer]

16:26 blockhead has joined #osdev

16:28 <clever> zid: heh, by random chance, the latest HVACRVIDEOS vid, he explains the whole reason the channel started, rather then pay the new guys to follow him and learn on the job, he just films the job, explains everything, and then has the new guys watch it whenever

16:29 scoobydoo has quit [Ping timeout: 240 seconds]

16:35 <GeDaMo> https://www.youtube.com/watch?v=z_Ti4GP0ntE

16:35 <bslsk05> 'Adam Savage's One Day Builds: Refrigerated Cooling Suit!' by Adam Savage’s Tested (01:07:08)

16:36 <clever> GeDaMo: yep, i had linked that yesterday

16:36 <clever> GeDaMo: along with https://www.youtube.com/c/HVACRVIDEOS

16:36 <bslsk05> 'HVACR VIDEOS - Home' - 'Hello my name is Chris and I will be posting videos about the cool stuff I see out in the field of HVACR. A little bit about my self, I've been involved in the trade for over 15 years and I will tell you right now that I do not know it all, this trade is constantly evolving and new technology is coming out every day. I am based out of Southern California and I mainly work in restaurants. If you have any questions or even suggesti

16:41 scoobydoo has joined #osdev

16:50 <zid> am I scrolled up or something

16:50 <zid> bloody time portals

16:52 freakazoid333 has joined #osdev

16:52 SpikeHeron has quit [Quit: WeeChat 3.5]

16:54 SpikeHeron has joined #osdev

17:01 skipwich has quit [Quit: DISCONNECT]

17:02 vdamewood has joined #osdev

17:03 skipwich has joined #osdev

17:05 vinleod has joined #osdev

17:05 jafarlihi has joined #osdev

17:05 vinleod is now known as vdamewood

17:05 vdamewood has quit [Killed (lithium.libera.chat (Nickname regained by services))]

17:05 <jafarlihi> Can someone please tell me what's the right way of getting PID a physical address belongs to in Linux?

17:08 <zid> very unlikely!

17:08 <zid> This being osdev and not a linux support chan

17:09 <GeDaMo> What's PID in this context?

17:09 <heat> you can try and get the struct page but that still won't work

17:10 <jafarlihi> GeDaMo: I want the PID of process it belongs to, can be TID

17:10 <GeDaMo> Oh, Process IDentifier

17:11 <clever> jafarlihi: i believe it is possible, one min

17:11 <mjg_> afair it can be found in /proc/pid/pagemap

17:11 <clever> [root@amd-nixos:~]# hexdump -C --length 32 /proc/self/pagemap

17:11 <clever> 00000000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|

17:11 <mjg_> but are you going to have to walk all of them

17:11 <clever> so /proc/*/maps, tells you what is mapped

17:11 <mjg_> clever: chancesa re it is not exported by default as a hardening measure

17:12 <mjg_> iirc there was a discussion about it after rowhammer showed up

17:12 <mjg_> i think the real question is what jafarlihi needs this for

17:12 <clever> pagemap is then a virtual file, that is basically the leaf nodes of the entire paging table

17:12 <heat> mjg_, he was working on an linux module that's an antivirus kind of thing

17:13 <clever> one problem i have noticed with pagemap, is that even if it can show normal memory, it wont show mmio mappings

17:13 <mjg_> anti backdoor?

17:13 <heat> fwiw getting page->mapping and then walking through every mapping would work I guess

17:13 <mjg_> any time someone claims that i suspect the opposite :-P

17:13 <jafarlihi> I've got 3 interrupts pointing outside kernel address space, want to find what they are

17:13 <heat> huh?

17:13 <jafarlihi> https://github.com/jafarlihi/ksec

17:13 <bslsk05> jafarlihi/ksec - [WIP] x86_64 Linux 5.15.x security suite (0 forks/0 stargazers/GPL-3.0)

17:14 <jafarlihi> run it "cargo run checkInterrups"

17:14 <zid> jafarlihi: make sure you thank the.. three people doing your work for your currently :P

17:14 029AAGSSJ has joined #osdev

17:14 <jafarlihi> clever: How do I decode that?

17:15 <heat> i'm not running anything

17:15 <clever> jafarlihi: you can treat pagemap as an uint64_t[] (assuming your on a 64bit os), and the index into that array is a virtual address page#

17:15 <clever> and the value is the physical page

17:15 <clever> #

17:15 <heat> an unvetted kernel module is an easy way to fuck up my system

17:15 <jafarlihi> heat: If you ran it you'd probably get to see that three of your interrupt vectors point outside kernel as well

17:16 <heat> what interrupts?

17:16 <heat> what's "outside the kernel"

17:16 <jafarlihi> IDT entries

17:16 <heat> but what vectors

17:16 <jafarlihi> Pointing to address not within kernel pages

17:16 <heat> what addresses

17:16 <GeDaMo> I wouldn't expect divide by zero to point to the kernel

17:16 <heat> why do you think that physical addresses have anything to do with this

17:16 <heat> GeDaMo, you should

17:17 <GeDaMo> Huh , I thought that would be a user level thing

17:17 <heat> no

17:17 <heat> how do you think you get signals?

17:17 <heat> those are all injected by the kernel

17:18 <mjg_> jafarlihi: have you tried disassembling it?

17:18 <jafarlihi> clever: Thanks

17:18 <jafarlihi> mjg_: No

17:19 <mjg_> jafarlihi: perhaps this code is constructed by the kernel on boot

17:19 <mjg_> jafarlihi: and only then installed

17:19 <mjg_> assumiong you are not running a backdoored kernel it should not be hard to find how it gets there

17:19 <jafarlihi> Yea, I will eventually

17:20 <jafarlihi> I'm too ADHD to do it now

17:20 <jafarlihi> Can someone lend me 16gb server so I can build AOSP? I've got 8gb only and it dies with that much

17:20 <heat> 0 of my questions got answered

17:20 <heat> lovely

17:20 <mjg_> fwiw it would be too unreliable to have a userspace process hack the kernel and install an interrupt handler from its own address space

17:20 <mjg_> i mean what if the proc dies

17:21 <mjg_> so i don't think any rootkits are even trying to do it :_P

17:22 <mjg_> hell it does not have to die. assuming no mlock, imagine swap

17:22 <jafarlihi> I found stack overflow in Android bluetooth stack

17:22 <heat> did you?

17:23 <heat> first, check that it's an actual vulnerability (it's probably not)

17:23 <heat> then https://bughunters.google.com/

17:23 <bslsk05> bughunters.google.com: Home | Google Bug Hunters

17:23 <jafarlihi> If it works then I'll go around wardriving before I disclose it

17:23 <jafarlihi> just for lulz

17:24 <heat> that's an easy way to not get your bounty

17:24 <jafarlihi> No one will know, I live in third world

17:25 <heat> except... for the fact that you just snitched on you

17:27 <jafarlihi> Why disclose to Google when you could sell it to Zerodium for 10x more pay?

17:28 <heat> you sure about that?

17:28 <heat> also

17:28 <jafarlihi> Most Google paid in 2021 is 157k, Zerodium lists 2.5m for zero click rce

17:28 <heat> do you have RCE?

17:28 <heat> lol

17:29 <jafarlihi> No, but I can try

17:29 <heat> i bet it's not even an exploit

17:29 <heat> "person who struggles to write a linux kernel module finds out millionaire exploit in android" is a new headline for sure

17:30 [itchyjunk] has quit [Remote host closed the connection]

17:31 <jafarlihi> You jelly of my primitive?

17:36 <mjg_> heat: you mean bug/0day. exploit is code which exploits the bug

17:36 <mjg_> heat: s/exploits/takes advantage of/

17:38 heat has quit [Read error: Connection reset by peer]

17:38 heat has joined #osdev

17:57 <jafarlihi> Is it possible to get kernel address range from userspace app in Linux?

18:02 <zid> I t hink I just heard an audible grown

18:02 <zid> groan

18:03 <jafarlihi> take ur meds

18:03 <mrvn> what is a kernel address range?

18:03 <jafarlihi> nvm, i don't need it anymore, design change

18:05 <CompanionCube> are you extra sure you wanna be saying all this when the public logs are *right there* lol

18:06 <geist> plus you're asking people at google about stuff to try to get a bounty against a google thing?

18:06 <geist> that makes me even less likely to even talk to you

18:06 <zid> I wonder if you could get yourself fired for that, colluding with someone to claim bug bounties

18:06 <geist> exactly

18:06 <zid> I assume "yes, very easily"

18:07 <zid> You might get away with a wrist-slap at first if you definitely didn't have anything to do with the code that was exploited, but if it was *your* code then straight to jail

18:07 <zid> https://tenor.com/view/jail-no-excuse-gif-24460693

18:07 <bslsk05> tenor.com: Jail No GIF - Jail No Excuse - Discover & Share GIFs

18:07 <jafarlihi> hehe, sorry, ill see myself out

18:09 <geist> one million years dungeon!

18:10 * gog interrupts

18:10 * zid ticks the timer then resumes

18:10 <geist> that was a Fast Interrupt

18:11 Vercas has quit [Remote host closed the connection]

18:11 Vercas has joined #osdev

18:22 <gog> i need a new experiment

18:26 <zid> Cut ham into pieces of various animals

18:26 <zid> err pictures

18:26 <zid> like bear face ham

18:27 <gog> ok

18:27 <gog> what kind of ham

18:27 <zid> ...pig?

18:43 <gog> i'll just go with proscuttio di parma

18:47 <jafarlihi> Does anyone know if `core_kernel_text` func is supposed to take in physical or virtual address>

18:47 <jafarlihi> ?

18:49 <jafarlihi> Or does it not matter since kernel is identity mapped? (is it?)

18:49 <gog> https://elixir.bootlin.com/linux/v4.2/source/kernel/extable.c#L70

18:49 <bslsk05> elixir.bootlin.com: extable.c - kernel/extable.c - Linux source code (v4.2) - Bootlin

18:52 <jafarlihi> Looks like physical, no?

18:52 <gog> looks like virtual

18:54 <gog> but it can be physical

18:54 <gog> init_kernel_text

19:03 [itchyjunk] has joined #osdev

19:04 <jafarlihi> What's the best way of serializing a packed struct to be sent over Netlink as a string?

19:05 <jafarlihi> base64?

19:07 <gog> it has to be string, you can't send the raw bytes?

19:09 <jafarlihi> As far as I know from sparse netlink doc you can't do that

19:09 <doug16k> base64 causes 20% data expansion in exchange for being all [a-zA-Z0-9/+]

19:10 <doug16k> it could be the best way if your communication channel restrictions make it the best way

19:10 <jafarlihi> Oh wait, there's NLA_BINARY

19:11 <doug16k> sorry 25%. it's 6 bits for every 8 bits of output

19:23 jafarlihi has quit [Ping timeout: 276 seconds]

19:27 [itchyjunk] has quit [Remote host closed the connection]

19:40 <zid> also known as 64/256

19:41 Ali_A has joined #osdev

19:42 <mrvn> "best" and "packed struct" are already contradictory

19:50 GeDaMo has quit [Quit: There is as yet insufficient data for a meaningful answer.]

20:06 <gog> yeh packed structs are bad

20:06 <gog> unless you really need them for some reason+

20:06 <gog> can't imagine why

20:07 <mrvn> you never need them, they just are convenient if you don't care about speed.

20:07 <zid> I use them when I am being filthy and overlaying mmio

20:07 <zid> with structured names

20:07 <mrvn> zid: on ARM? That will be UB because writing bytes to a 32bit MMIO reg is bad.

20:07 <mrvn> hope you use clang.

20:08 kingoffrance has left #osdev [#osdev]

20:08 <clever> gog: the only code ive worked with that used packed structs, was dealing with an MBR table

20:08 <gog> ah yes both valid uses

20:09 <gog> of varying deegress of dubiousness

20:09 <clever> it also has the nasty surprise of containing an un-aligned 32bit int

20:09 <\Test_User> packed structs are useful for decreasing memory usage at the cost of cpu usage, or reducing network usage if for whatever reason you're sending them raw through the internet (second is prob a bad idea but at least a valid usage)

20:09 <clever> and gcc may or may not do byte-wise loads, depending on the -O level

20:09 <mrvn> Where do you even find MMIO structures that aren't aligned?

20:10 <mrvn> clever: no, depending on the allow-unaligned flag.

20:10 <zid> I only do it for mmio because I know the host cpu if I am doing it

20:10 <zid> i.e for accessing a GDT entry

20:10 <mrvn> clever: off for gcc, on for clang by default.

20:10 <clever> mrvn: ~4 of the SDHCI registers on the rpi, are 16bit sub-regs, and allow doing 16bit load/store to a part of a 32bit reg

20:10 <zid> if I know the host cpu, I know if unaligned reads/writes are legal

20:10 <clever> mrvn: however, the axi bus doesnt deal with that well, and will merge/corrupt 16bit transfers

20:10 <mrvn> clever: that's still aligned though

20:11 <clever> yeah, its just smaller then the usual size

20:11 <clever> other then that special case, basically everything is 32bits in size, and 32bit aligned

20:11 <clever> and violating that causes problems

20:11 <mrvn> and the AXI bus will merge/corrupt it when gcc does 4 byte writes.

20:12 <clever> for some peripherals, any 8bit load, that isnt 32bit aligned, is treated as an invalid register

20:12 <clever> so reading from +0 works, but +1, +2, and +3 are all invalid registers

20:13 <mrvn> anyway, fazit: don't do packed.

20:14 <zid> You can't just go around calling registers invalids

20:15 <mrvn> I'm tempted to declare MMIO regs atomic<uint32_t>

20:15 <clever> zid: it behaves the same as accessing registers that dont exist, via 32bit aligned addresses

20:16 <clever> oops, forgot about a different thing

20:16 <clever> attempting to read XHCI registers, when its disabled, causes linux to 100% lock up

20:16 <zid> k thanks?

20:16 <clever> ran the wrong cmd, and it killed the machine, lol

20:16 * zid definitely didn't ask

20:17 <clever> i was trying to get a diff example, and killed it

20:18 * mrvn is watching Godzilla while it's on Amazon Prime.

20:18 <clever> # ramdumper -m -a 0xfe204000 -l 512

20:18 <clever> 0xfe204010 01 00 00 00 20 10 20 30 30 69 70 73 30 69 70 73 |.... . 00ips0ips|

20:18 <clever> 0xfe204020 30 69 70 73 30 69 70 73 30 69 70 73 30 69 70 73 |0ips0ips0ips0ips|

20:19 <clever> any register that doesnt exist, returns a constant of "spi0"

20:19 xenos1984 has quit [Read error: Connection reset by peer]

20:19 <clever> this spi peripheral, only has 6 x 32bit regs within it, and the rest are all just an error/ident code

20:20 <clever> but, if you do a mis-aligned read, to say +1, you will always get 0x69(i) back, even if it was part of a valid reg

20:20 <clever> (if its 8bit)

20:20 Ali_A has quit [Quit: Connection closed]

20:21 Ali_A has joined #osdev

20:25 freakazoid333 has quit [Ping timeout: 264 seconds]

20:37 <mrvn> if you do an byte read do you get back an 32bit value?

20:37 xenos1984 has joined #osdev

20:40 frkzoid has joined #osdev

20:40 <clever> mrvn: i think internally, its a 32bit bus, that is ALWAYS 32bit aligned, and a byte-wise read, is the cpu selecting the right 8bits of that 32bit bus, with a byte-enable signal to tell the remote end which 8bits to use

20:40 <clever> mrvn: and if the address doesnt exactly match, the entire 32bit bus gets loaded with a 32bit constant of "spi0"

20:40 <clever> but the cpu was expecting an answer on bits 8:15 of the bus, so it only gets the 69 in '30 69 70 73'

20:49 <clever> it does indeed sound simpler, if you just drop the lower 2bits from the address, set a rule that the 32bit bus must always be 32bit aligned, and then the bus just has a 30bit addr, 32bit data, 4bit enables, and some misc control flags

20:51 <mrvn> but aparently it passes all 32bit down or the unaligned read wouldn't get the wrong result

20:54 <clever> yep

20:54 <clever> i forget the exact results, but ive seen people probing that with a load-many as well

20:55 <clever> and it did something funky, like repeat a 32bit reg twice, when doing 4 32bit loads

20:55 <clever> if i'm remembering that right, it would imply there was a 64bit bus at one stage, connected to the 32bit bus

20:55 <clever> and that one, expects everything to be 64bit aligned

20:56 <clever> so, to allow reads from +4/+5/+6/+7 to function, it duplicates the 32bit answer, when connecting a 32bit bus to a 64bit bus

20:56 <clever> but, if you then do a 4x32bit load, the cpu has to break it up into 2x64bit

20:56 <clever> and then the 64bit gets lost at the junction between 64bit/32bit busses

20:57 <clever> so it becomes 2x32bit loads, with each value repeated twice

20:58 <clever> all kinds of "implementation specific undefined behavour", where the specs say dont do this, and each implementation is free to do whatever it wants

20:59 bauen1 has quit [Quit: leaving]

21:00 <clever> other testing i did, found that the vector core has a 256 bit bus to the caches, and can move 256 bits on every clock, sustained(for a total of 4096 bytes), but there is an 11 cycle startup cost

21:01 <clever> an engineer has also stated, that the arm core on the bcm2835 (pi0/p1) only has a 32bit bus to the rest of the child

21:01 <clever> chip*

21:02 <clever> while the pi2/pi3, have a 64bit bus

21:02 <clever> but its not clear which clock domain dominates it, the arm, or the axi

21:02 <clever> perhaps the slower of the 2, plus issues waiting for an edge

21:06 bauen1 has joined #osdev

21:08 <doug16k> if it is a 32 bit bus, you can expect somewhere to be thinking in terms of 128-bit blocks, because 4 bytes * 4 cycle burst = 128 bits. if it is thinking in terms of DDR memory interface, it would be 256 bits, 8 bytes * 4 cycles

21:10 <doug16k> 8 because double data rate 32 bit

21:11 <clever> doug16k: the fun part, is that there are multiple buses, all of different sizes

21:12 <clever> the mmio is all in a 32bit bus, while the vector<->cache is 256bit, the arm<->bus is 32bit, the dram is 32bit DDR

21:12 <clever> and several others i cant name off the top of my head

21:16 <doug16k> could be 16 bit wide DDR so it's back to 128-bit burst

21:16 <clever> and yes, bursts help massively, like how the vector core takes 11 cycles to start a vector load, but once started, its only paying 1 clock per 256 bits

21:16 <clever> so doing multiple consecutive loads greatly saes time

21:16 <doug16k> yeah pipelining is why SDRAM is still with us. it's awesome

21:17 <clever> the engineer on the forum said that the axi controller will automatically split and merge transactions

21:17 <clever> so a 128bit bus, may turn into 4 x 32bit transfers on a 32bit bus

21:20 <clever> i did also do some experiments with 4096 byte loads from uncached dram

21:20 <clever> and while increasing the VPU clock speed, did result in it taking more clock cycles (it just stalled more, as it got too fast)

21:20 <clever> doing the math on bytes/sec, it was actually more flat, and near the max capacity of a 400mhz 32bit DDR bus

21:28 dude12312414 has joined #osdev

21:28 <doug16k> DDR can be pipelined out into one endless burst, well, until you have to do a memory refresh cycle

21:29 <doug16k> but you can get close to using every cycle for data

21:29 <clever> doug16k: what about having to open new rows?

21:30 <doug16k> overlap that with the burst from another bank's row

21:30 <clever> so you can open one row, while reading another row, in the same clock?

21:30 <doug16k> yes

21:30 <doug16k> you stagger the commands across the banks, they have independent row buffers

21:30 <clever> i assume it has seperate addr and data bus then?

21:31 <doug16k> only one can be using the data bus

21:31 <doug16k> you stagger the commands so the end up not conflicting

21:31 <doug16k> "you" meaning the memory controller

21:32 <clever> where might i find better docs, on how i could drive ddr2 ram directly?

21:32 <doug16k> to make that work you need to either make the blocks so big you ask for whole bursts, or you have a request queue that can complete out of order

21:32 <clever> assuming i can bit-bang it fast enough

21:33 <clever> > or you have a request queue

21:33 <doug16k> in any case you need to get at least one request ahead to saturate it right

21:33 <clever> one of the rpi engineers has stated, that the bcm2711 ddr4 controller, has multiple slave ports

21:33 <clever> so it can queue up requests from several bus masters in the same clock cycle

21:33 <clever> and then re-organize them internally,

21:34 <doug16k> an FPGA memory controller might present it as 128-bit blocks because it fixes the problem of the memory being more clock speed than your soft core cpu

21:35 <clever> yeah

21:35 <clever> then you dont need to complicate the bus by telling the controller how big your burst will be

21:35 <clever> and can run that bus at a far lower clock

21:35 <doug16k> yeah

21:36 <clever> > Most of the high-bandwidth bus masters have 128-bit data widths to SDRAM

21:36 <clever> source: https://forums.raspberrypi.com/viewtopic.php?p=1994512#p1994512

21:36 <bslsk05> forums.raspberrypi.com: Multicore PI System Bus Design Question - Page 2 - Raspberry Pi Forums

21:37 <clever> combined with what you said, it does feel like the dram controller would have a 128bit slave port

21:37 <doug16k> yeah, my toycore has 128 bit cache lines, one request is whole line, 128 bit

21:37 <clever> even if its 32bit ddr to the outside, it may be 128bit internally

21:37 <mrvn> my toycpu has 16bit bus and no cache

21:37 <clever> and with 400mhz ddr2 32bit, thats 64bits per 400mhz, or 128bits per 200mhz

21:38 <clever> so the internal bus can run as low at 200mhz, and still saturate the ram

21:38 <doug16k> mine is fancy because it is a educational board with piles of peripherals

21:38 <doug16k> has 64MB DDR

21:39 <doug16k> I forget what number ddr

21:39 <doug16k> ddr2 probably

21:41 <doug16k> I use the 9th bit for dirty bit, then when doing writeback burst I use it for the byte enables, and store port is hardwired to write 1 to it, and line fill is hardwired to load 0 into it

21:41 Weiland has joined #osdev

21:41 <doug16k> so I could enable a no-allocate store mode that clears it to 0 without MC request

21:42 <doug16k> on store miss

21:42 <doug16k> then writeback only writes what got written by stores

21:44 <clever> the other thing i dont fully understand, is how you can wire a 32bit ddr bus, into a pair of ddr2 dies

21:44 <clever> 16bits each

21:45 <doug16k> they don't care which bit of the word they are storing

21:46 <clever> but how do you deal with sending commands to both at once?

21:46 <doug16k> you can just put the same control signals into them both, but they use different data

21:46 <clever> and reading a reply from both

21:46 <clever> what about the MR's?

21:46 <doug16k> they both work in total sync

21:46 <doug16k> one gets and provides low bits, other gets and provides upper

21:47 <doug16k> exact same command wires

21:47 <clever> what about non-deterministic things, like say an on-die temp sensor?

21:48 <doug16k> on what die

21:48 <clever> lets say there was a temp sensor on the dram die itself

21:48 <clever> and you had a command to read it

21:48 <doug16k> that is probably on the SPD super low speed thing

21:48 <mrvn> clever: then use one of the bti to chip select

21:48 <clever> what pins are the reply sent over? and how does it not conflict when the 2 dies disagree

21:49 <clever> i need some bus timing diagrams, to fully make sense of this

21:49 <doug16k> SDRAM is given commands, it doesn't reply. the commands determine when a bank is allowed to use the data pins

21:49 <mrvn> you need to select each chip for training too and pick the more conservative timings.

21:50 <doug16k> you completely boss sdram around, it has no say in anything

21:50 <clever> https://media-www.micron.com/-/media/client/global/documents/products/data-sheet/dram/ddr2/2gb_ddr2.pdf?rev=2396cd31ea1641f98927c509eb97f486

21:50 <clever> this looks like it may have answers

21:50 * clever reads

21:50 <geist> right. so you can gang as many as you want

21:51 <clever> ah, that gives me an idea

21:51 <clever> the 1gig pi3, is ganging a pair of 512mb dies together

21:51 <clever> and rpi engineers have stated both that, no 16gig dies exist, but the bcm2711 can address up to 16gig

21:51 <clever> so in theory, you could gang a pair of 8gig dies together, on a bcm2711

21:52 <clever> however, a fork in the command traces, would likely be a nightmare for signal reflections

21:52 <doug16k> commands are SDR

21:52 <doug16k> easy

21:52 <clever> and the data bus splitting in half and going to 2 places, means the training for each half of the bus differs

21:53 <clever> the pi3's 1gig, solved that, by having the forking within the dram package

21:53 <clever> 2 ties in 1 epoxy package, that looks like a single "chip"

21:53 <clever> very short stubs, less reflection delay

21:54 <clever> doug16k: that that include things like read and open row? that would imply each command must request 64bits at once, because the data is moving twice as often as the commands

21:54 <doug16k> the main case is needing one command every 4 cycles

21:54 <doug16k> which is 8 transfers on DDR

21:54 <doug16k> gives an idea how easy the commands are

21:54 <clever> 8 * 2 * bus_width bits?

21:55 frkzoid has quit [Ping timeout: 244 seconds]

21:55 <doug16k> right, 4 cycle burst, then width * 2 bits

21:55 <clever> which is 256 bits in this case

21:56 Ali_A has quit [Quit: Connection closed]

21:57 <doug16k> during that "idle" time, it gets stuff going on other banks, so it would be intermittently more than 1 command per 4 cycles

21:58 <doug16k> so it can just stream right into it back to back

21:58 <doug16k> back to back on data burst I mean

21:59 <clever> my rough understanding, is that you can have one open row for every bank of ram?

21:59 <doug16k> right

21:59 <clever> you must first open a row to do anything

21:59 <doug16k> think of it like a really wide latch

21:59 <doug16k> and you can ask for a burst of a range

21:59 <clever> and then read/write access that opened row, which may be sram?

21:59 toluene has quit [Read error: Connection reset by peer]

21:59 <clever> and refresh is just open+close

21:59 <clever> and closing commits the row back to the array of capacitors

22:00 <doug16k> yeah. "activate" means drain the caps for the row into the row buffer SRAM and hold it there

22:00 <doug16k> precharge means put the data back into the caps

22:00 <clever> and recondition the levels

22:00 <clever> it may also do any on-die ecc during the open

22:01 <clever> some posts on the rpi forums, claim the ddr4 chips, have on-die ecc, where it checks the ecc during the open, and auto-corrects any errors

22:01 <clever> and the host controller is then entirely unaware of the ecc

22:01 <doug16k> so you activate, then after a while that bank's row buffer has correct values, then you can issue requests to burst ranges of it back to back, then if you need a different row, you precharge the row back into the capacitor array, and activate a different row into the row buffer

22:01 toluene has joined #osdev

22:01 <doug16k> but there are multiple copies of that whole thing so you can stagger them and get continuous data flow

22:02 <doug16k> called banks

22:02 <clever> and if you can arrange for data in ram, to be in different banks, you reduce the latency to switch between them

22:02 <clever> because you wont be causing a collision within a bank

22:02 <doug16k> you stagger the commands so only one needs data bus at once

22:02 <doug16k> yeah, you can make stuff bank aware and not have to activate

22:02 <doug16k> usually you don't know the memory that closely

22:03 <clever> i think bank-aware and bursts, is why the h264 encoder on the pi is so nutty

22:03 <doug16k> it could be mapped to address space funny

22:03 <clever> its kinda yuv, but the planes are a total mess

22:03 <clever> so you can access all 3 planes, in a 32x32 region, while taking all of the above into account

22:04 <clever> > The H264 blocks need the frames in a weird column format(*), and also a second 2x2 subsampled version of the image to do a coarse motion search on. The ISP can produce both these images efficiently, and there isn't an easy way to configure the outside world to produce and pass in this pair of images simultaneously.

22:04 <clever> > (*) If you divide your image into 128 column wide strips with both the luma and respective U/V (NV12) interleaved chroma, and then glue these strips together end on end, that's about right. The subsampled image is either planar or a similar column format but 32 pixels wide. Cleverer people than me designed it for optimised SDRAM access patterns.

22:04 frkzoid has joined #osdev

22:04 <clever> https://github.com/raspberrypi/linux/blob/rpi-5.10.y/include/uapi/drm/drm_fourcc.h#L765-L809

22:04 <bslsk05> github.com: linux/drm_fourcc.h at rpi-5.10.y · raspberrypi/linux · GitHub

22:05 <clever> yep, there it is: The pitch between the start of each column is set to optimally switch between SDRAM banks.

22:05 <doug16k> yeah I can't wait for ECC to become ubiquitous. it's ridiculous we got this far with hardly any ECC

22:05 <clever> so, when you switch to another column, your also garanteed to be switching to another bank

22:06 <doug16k> no

22:06 <clever> so as you move left->right, your not going to run into bank level collisions

22:06 <doug16k> column is offset into row buffer

22:06 <doug16k> unless I lost you

22:06 <clever> or i got lost

22:06 <doug16k> think of row buffer like L1 cache

22:06 <clever> re-read the linux link i pasted, and see what your interpretation is?

22:07 <doug16k> you say column offset into it to read/write burst

22:07 <doug16k> oh that's not DRAM columns. sorry my bad

22:07 <clever> i think by column, it means a grouping of 128 pixels

22:07 <clever> so if you step to the right by 128 pixels, you land in a different dram bank

22:08 <doug16k> the low part of the address you give DRAM is the "column"

22:08 <doug16k> upper part is "row"

22:08 <clever> but where does bank then come in?

22:08 <doug16k> RAS/CAS Row address strobe / column address strobe

22:08 <doug16k> now you give upper part when activating row

22:08 <doug16k> lower part in read/write command (column part)

22:09 <clever> ah, column is the addr within the row buffer?

22:09 <doug16k> bank is copies of that whole thing

22:09 <doug16k> whole capacitor array - whole row buffer

22:09 <doug16k> so you can stagger them and pipeline use of data bus even with delays

22:09 <doug16k> when one is delaying, other bank is bursting

22:09 <doug16k> round robin when ideal

22:09 <clever> yeah

22:10 <doug16k> it can become less than perfect sometimes

22:10 <doug16k> pathologically changing row, for example

22:10 <clever> less then perfect, if your accessing data where the stride matches up with the bank size?

22:10 <doug16k> unluckily mapping to same bank too much

22:10 <clever> so every access is in the same bank?

22:10 <clever> yep

22:10 <clever> and i think the SAND format for h264, was timed to not do that

22:11 <clever> sized

22:11 <doug16k> yeah, CRT accesses are so sequential, it's easy mode for MC

22:12 <doug16k> can do that utterly perfect staggering

22:12 tsraoien has joined #osdev

22:12 <clever> except, the h264 core needs Y, U, and V planes, it wants to access 1 macroblock at once (32x32), and it needs a second half-res image of the same frame

22:12 <clever> so now its far more complicated to make it fit dram

22:13 <doug16k> ah

22:13 <clever> and thats why SAND is so much more complex

22:16 <doug16k> yeah column is low bits of address you could say

22:16 <doug16k> when you first look at ram, it seems really weird because it has half the address pins you think it needs

22:16 <doug16k> because half is row, half is column

22:16 <doug16k> might be off by 1 bit on one sometimes

22:17 <doug16k> DDR standards say a specific width though IIRC

22:17 <doug16k> they use so many

22:17 <clever> https://github.com/librerpi/rpi-open-firmware/blob/master/firmware/sdram.c#L198-L201

22:17 <bslsk05> github.com: rpi-open-firmware/sdram.c at master · librerpi/rpi-open-firmware · GitHub

22:18 <clever> the default timings in my ram driver, say 2 row bits and 1 column bit, that seems kinda low

22:18 Ram-Z has quit [Ping timeout: 246 seconds]

22:18 <doug16k> no that can't be physical DDR interface related

22:18 <clever> lines 571-577 then modify that, for some ram sizes

22:18 <clever> 1gig modules, having 3 column bits and 3 row bits

22:18 <doug16k> could be 1 is some enum

22:19 <clever> and yeah, thats still way too low

22:19 <clever> ah, that would make a lot more sense

22:20 <clever> i should dig up datasheets for the ram i'm using, and see what dimensions they actually have

22:20 <clever> i would also assume, the address is broken up into 3 parts, bank, row, and then column?

22:20 <doug16k> right

22:20 <doug16k> sometimes they switch them around funny to prevent patterns

22:21 <clever> so if i knew how many bits for each, i could know the stride between banks, or just divide the capacity by bank-count

22:21 <doug16k> trying to stagger it across the banks

22:21 <doug16k> or whatever

22:21 <clever> the rp2040 also does some weirdness

22:21 <klange> ah, electronic structuring

22:21 <clever> it has 4 banks, of 64kbyte of sram each

22:21 <clever> with a 32bit port

22:21 tsraoien has quit [Ping timeout: 276 seconds]

22:21 <clever> but it then stripes across them

22:22 <klange> (not even a transcribed groan for that?)

22:22 <clever> so basically, if you treat the ram as an uint32_t[x], then each slot, lands in bank x%4

22:23 <clever> klange: my brain is a bit fried from reading the ddr datasheets, and its getting late

22:23 Weiland has quit [Quit: WeeChat 3.4.1]

22:23 <clever> https://media-www.micron.com/-/media/client/global/documents/products/data-sheet/dram/ddr2/2gb_ddr2.pdf?rev=2396cd31ea1641f98927c509eb97f486

22:23 Weiland has joined #osdev

22:23 <clever> doug16k: this has a good diagram on page 12

22:24 <doug16k> yeah, see what I mean by "copies of that whole thing"

22:24 <clever> lets see, it has 8 banks of ram, which is listed as "32,678 x 512 x 16"

22:24 <clever> A[14:0] looks to be the addr bus, which can be either row or column

22:25 <doug16k> yeah, the row column half-address thing

22:25 <clever> while BA[2:0] looks like the bank selector, 3bit int, 0-7, 8 banks, perfect

22:25 <doug16k> exactly

22:25 [itchyjunk] has joined #osdev

22:26 <doug16k> onus on MC to stagger the commands so only one bank is using data lines

22:26 <clever> the 18bit addr (A+BA) goes into the mode registers (likely the MR's ive seen elsewhere)

22:26 <clever> while 15bits goes into the row muxes, and 11bits to the column stuff

22:27 <clever> 15bit row#, explains the 32678 in the memory array, so that is the row count

22:27 <mrvn> normaly you have to latch the row and then can read multiple columns

22:27 <clever> yep

22:28 <mrvn> every 512 words you have to increase the row so you get a blib.

22:28 <mrvn> blip

22:28 <clever> and the 512x16 part, looks to be 512 columns, with each cell being 16 bits

22:28 <doug16k> CAS latency is how many cycles after you issue a read or write it starts to use the data lines

22:29 <doug16k> column read or write

22:29 <clever> and then you have to plan out that many reads in advance

22:29 <clever> so the bus isnt idle

22:29 <doug16k> right

22:29 <doug16k> they use the next 4 cycles

22:29 <doug16k> when they start N cycles later, N=CAS latency

22:30 <doug16k> so you have to make sure you don't go to another bank and say "read" too early or their burst will clash

22:30 <doug16k> but you could go activate or precharge no problem

22:30 <clever> yeah

22:31 Weiland has quit [Quit: WeeChat 3.4.1]

22:31 <clever> i also notice, there is only a 2bit value, from the refresh counter to the bank control logic

22:31 <clever> i suspect its refreshing 2 banks at once?

22:32 <clever> this also came up, with the 8gig model of the pi4, it needed the dram power supply redone, i think because with 8gig of ram, you couldnt/didnt want to refresh more often

22:32 <clever> so they instead refresh twice as much per refresh command

22:33 <clever> and the diagram i linked shows that, the counter simply cant select 1 bank for refreshing

22:34 wand has quit [Ping timeout: 268 seconds]

22:34 <doug16k> IIRC there is a "refresh this bank" and "refresh all banks" command

22:34 <doug16k> depends how fancy you want to be

22:35 <clever> combining what ive seen before, and this diagram with a refresh counter, it sounds like you just say "refresh something" at a regular interval

22:35 <doug16k> fancy might get refresh over with during period it predicts will be idle

22:35 <clever> and the refresh counter then increments its way thru all banks and rows

22:35 <clever> and if the interval is high enough, it gets back to a row within the required timeframe

22:36 <doug16k> yeah you just say "do a refresh" and the SDRAM logic worries about keeping track of where

22:36 <clever> but, that counter only has a 2bit bank# coming out, 0-3, but this die has 8 banks

22:36 <clever> so it would likely refresh 2 rows (banks 0+1) at once

22:36 <doug16k> you have to do all the rows every 64ms. it doesn't care what timing pattern

22:36 <clever> yeah

22:36 <clever> .tREFI = 3113, //Refresh rate: 3113 * (1.0 / 400) = 7.78us

22:36 <clever> a line from the open firmware

22:37 <clever> i think its a count of cycles on the 400mhz clock, between "do a refresh" commands, that will then meet the 64ms requirement

22:37 <doug16k> yeah

22:38 <doug16k> countdown is how you do it when you don't mind too much when there are delays

22:38 <clever> but, thats not being changed with ram size

22:38 <clever> so, the refresh interval changes based on how much ram you have

22:38 <mrvn> I looked into using DIMMs for my toy CPU. But when I run it at 1KHz it's kind of hard to meet the 64ms refresh cycle.

22:38 <doug16k> not necessarily, it would have more banks if row count gets too much

22:39 <doug16k> think more banks across sticks

22:39 <clever> yeah

22:39 <doug16k> multiple sticks refreshing in parallel

22:39 <clever> only x86 and other larger controllers can do that

22:39 <clever> the rpi is limited to a single 32bit ddr port, on all models

22:40 <doug16k> yeah, just making sure you didn't picture a 1TB machine doing nothing but refresh :D

22:40 <clever> and this is also where ram slots on a motherboard having 2 colors comes into play

22:41 <clever> i'm guessing the command/data for each port with the matching color, are wired in parellel?

22:41 <clever> and they must all run at the same freq then

22:41 <clever> and then an extra bit is used, to select which slot answers? or is the data busses instead ganged together, turning a pair of 64bit slots, into an effective 128bit slot?

22:43 <clever> doug16k: hmmm, the only bit on that ddr2 diagram i linked, that starts to confuse me, is the DQ[13:0] area, it looks like its using bits 1/0 of the column to select a 4bit chunk of a 16bit cell, but then why is DQ 14 bits??

22:45 <doug16k> yeah you just send the command to all of them, maybe conditional chip select

22:45 <doug16k> that must be a typo

22:45 <doug16k> see page 11, DQ[15:8]

22:45 <clever> i also just noticed, page 12 is for the "512Meg x 4" chip, that does kind of imply its got a 4bit bus?

22:46 <clever> and page 13 is the same thing all over again, but for "256 Meg x 8"

22:46 <doug16k> x 4 means 4 data lines exactly

22:46 <clever> where DQ is now 8bits wide, the MUX is selecting 1 of 4 8bit chunks from a 32bit cell

22:46 <clever> and the column is one bit shorter

22:46 wand has joined #osdev

22:46 <clever> so half the columns, half the capacity, plus the output interface is wider

22:46 <doug16k> every other cycle is the other low bit

22:47 <doug16k> edge, not cycle

22:47 <clever> yeah

22:47 <clever> this sounds like the kind of ram you would find on an x86 pc

22:47 <clever> where you gang multiple 4bit cells together on a stick

22:47 <clever> to create a 32 or 256bit stick

22:48 <clever> and yeah, that DQ is looking more like a typo, page 14 then makes more sense, now its 64bit cells, with a 16bit bus going out

22:48 <doug16k> yeah, it's 8 or 9 x8 on modern ones because it's a 64/72 bit interface

22:49 <doug16k> you could use twice as many x4

22:49 <clever> that makes me wonder, since the pi expects a 32bit bus, could i just gang together 8 * 4bit ddr2's and make it work?

22:49 <doug16k> you can if you length match the data lines decently

22:50 <doug16k> and make them almost the same impedance

22:50 <clever> and thats why motherboard traces are so wiggly

22:50 <doug16k> yes, they are making them the same length

22:50 <doug16k> and/or impedance

22:51 <doug16k> impedance is mostly the PCB person's problem. length is PCB layout's problem

22:52 <doug16k> think of impedance as the resistance, accounting for the skin effect, where high frequency signal components only travel along the outside of conductors

22:53 <doug16k> the rising and falling edges have extremely high frequency components

22:53 <doug16k> way higher than what the clock implies

22:53 <doug16k> the more straight up and down the rise/fall, the more unreasonable the frequency

22:53 <doug16k> wires become resistors because that part of the signal wants to be on the outside of the wires

22:54 <mrvn> and if the line wiggles you create induction

22:54 <doug16k> but low frequency part goes right through like nothing

22:55 <doug16k> yeah. even a straight trace is a "1 turn" inductor

22:55 <doug16k> it makes a pitiful magnetic field around it

22:56 <mrvn> butbut that's counted in the resistance of the wire

22:56 <doug16k> inductance prevents current flow changes

22:56 <clever> that reminds me, somebody had mis-wired the ethernet an ethernet cable, and it ran perfectly fine, for 11 months out of the year

22:56 <clever> but in december, it had horrible packet loss

22:57 <clever> miswired the pairs*

22:57 <doug16k> it prevents off from turning on, and prevents on from turning off, as if by momentum

22:57 <mats1> amazing

22:57 <clever> can you guess why it only failed in december?

22:57 <doug16k> autocrossover hid the problem partially?

22:58 <doug16k> neat

22:58 <clever> the ethernet cable was run outdoors, along the same hooks as the xmas lights

22:58 <clever> 60hz mains, magnetically coupled right into the ethernet lines!

22:58 <clever> and because he didnt use the pairs correctly, it wasnt canceling itself out on the differential pairs

23:02 <mrvn> *grrr* No speed of light lag on Moonhaven.

23:02 <doug16k> CMOS inputs look like capacitors. the rising edge needs a current pulse to change the gate charge, then the falling edge needs a current pulse in the opposite direction. after the pulse part, it is no current flow at the gate

23:02 <doug16k> so you want to instantaneously increase the current and have it gradually go down to zero, but inductance wants the opposite, a gentle ramp up to a current

23:03 zaquest has quit [Remote host closed the connection]

23:03 <doug16k> so what you wanted to be a straight up voltage change "spins up" a one turn inductor that feels like a flywheel

23:04 <doug16k> and it actually makes the gate voltage go past what you applied, then back, then past, ringing

23:04 <doug16k> after it delayed the rise

23:05 zaquest has joined #osdev

23:05 <clever> bbl

23:11 <doug16k> the trace inductance interacts with the gate capacitance and makes the voltage ring when it changes

23:12 <doug16k> energy is captured in the magnetic field, then later collapses back onto the wire and drives the voltage past the original at the gate input, then the gate input charge is higher than the driver end, and current goes backward through the trace

23:13 <doug16k> but over distance the gradient gets smaller

23:13 <doug16k> more inductance preventing the current change

23:15 <doug16k> ends up looking like it oscillated around the desired signal value

23:15 <doug16k> briefly

23:19 Vercas has quit [Remote host closed the connection]

23:19 Vercas has joined #osdev

23:21 <doug16k> the universe is very springy

23:22 <doug16k> springs everywhere

23:22 <zid> I've seen scope traces of high freq circuits and to me it's fucking amazing they *ever* get it right

23:22 <zid> rather than always getting it right

23:23 <doug16k> yeah, that's why overclocking is so terrifying for me

23:23 <doug16k> I realize how insane default is

23:23 <zid> overclocking is pretty empirically solveable though

23:27 Ram-Z has joined #osdev

23:30 liz has quit [Quit: Lost terminal]

23:38 <mrvn> doug16k: in CMOS electrons flow when you cross from hight to low or low to high. The less power you have to cross the neutral zone the more power it consumes.

23:39 <mrvn> In modern circuit the fan in/out ratio is much higher than traditional CMOS.

23:39 <doug16k> the input impedance is infinite for steady state. the gate is an open circuit, it's not connected to anything. all you are doing is charging it, it's basically a capacitor plate

23:40 <doug16k> but you need to flow current in and out to charge/discharge it

23:40 <doug16k> holding state is 0mA

23:40 <mrvn> not quite

23:40 <doug16k> it'll never be perfect because you'll have a tad of ripple. but close

23:41 <mrvn> and some charge drains

23:41 <doug16k> yeah but so close to nothing

23:41 <doug16k> it's incredibly low leakage

23:42 <zid> Watch out for those sneaky electrons though, they like to pretend that they're on different wires

23:42 <zid> "but daaad I'm a waaave"

23:42 <mrvn> zid: that's only for realy moddern circuits, not traditional CMOS

23:42 <doug16k> that's why we call it "CMOS memory". because holding it in steady state takes almost nothing

23:43 <mrvn> doug16k: there is enough leakage that you have to refresh the memory often

23:43 <doug16k> that nearly zero power use to stay still is why it got named cmos

23:44 <doug16k> dram is not nmos gate

23:44 <mrvn> the bigger problem is getting into that high resistance state.

23:44 <doug16k> it's a capacitor

23:44 <doug16k> cmos is static ram

23:44 <doug16k> it's gates holding themselves on or off

23:44 <doug16k> but since each gate has essentially no current flow to hold it still, it takes almost no power

23:45 <doug16k> it's like holding a balloon full. if you keep the pressure constant, no air flows

23:45 <doug16k> power is flow times potential

23:45 <doug16k> hold flow at 0 then power is 0

23:46 <doug16k> it would be perfect if the voltage rail was perfectly flat

23:46 <doug16k> neglecting tunnelling

23:47 <doug16k> the gate charges up to the perfect charge to perfectly oppose your signal, and current is 0

23:48 <doug16k> it's like putting dc into a full size capacitor. at first there is a pulse, until it charges up to your exact voltage and nothing flows

23:48 <doug16k> in series with it I mean

23:50 <doug16k> "it blocks dc" is what they say. same as the gate on a mosfet, it "blocks" that dc when in steady state

23:51 <doug16k> put a squarewave into a mosfet gate in a simulator. you'll see a pulse of current that falls back to zero if state stays there, then a pulse when state changes, falling to zero again

23:52 <clever> and thats basically what all of the decoupling caps around a cpu are for

23:52 <doug16k> they fix the trace inductance

23:52 <doug16k> the traces want current to ramp up and down diagonally

23:53 <doug16k> caps provide the pulse

23:53 <zid> I've never looked at what makes the various zener/mosfset/blahs different, maybe I should at some point

23:53 <zid> I just know that transistors are npn/pnp and you suck the two sides together

23:53 <zid> slurp

23:54 <doug16k> the big thing with mosfets is, if you want them to have amazingly low on resistance, then you probably need to put a big charge into the gate, but if you can tolerate more on resistance, you can get ones that switch way faster

23:54 <zid> I forget what the terms are now, P and N touching make a tiny neutral zone.. depletion area?

23:55 <zid> depletion region!

23:55 <zid> Man, this is crusty old knowledge from a long time ago

23:55 <doug16k> you have to calculate whether you are losing too much in the switching region with a low resistance mosfet, or if you'd be better off with more on resistance, but switching way faster. you get a pulse of losses during switching

23:55 <zid> We should just make idealized components and save everybody the hassle.

23:55 <doug16k> or drive the gate really hard with low on resistance mosfet

23:55 <gog> you contaminate some hot sand and it goes brrr

23:56 <doug16k> like, using a pair of mosfets to switch the pulse to turn the big mosfet on and off

23:56 <zid> gog: basically, but there's TWO types of sand, like those cool sand-in-a-bottle guys

23:56 <zid> and it mixes a little sometimes if you shake it

23:57 <zid> https://s7.orientaltrading.com/is/image/OrientalTrading/VIEWER_ZOOM/tropical-sand-art-bottles-12-pc-~48_5821a "Mosfet, 1974"

23:58 <zid> That's a PNPNPNPNPNPNPNPNP and an NPNPNPNPNPNPNP

23:59 <clever> doug16k: oh, and this is also why voltage in cpu core is important

23:59 <clever> doug16k: the lower the voltage, the less chage you need to put into the gate, and the smaller that current pulse will be