#osdev on 2021-09-14 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 scoobydoo_ has joined #osdev

00:00 scoobydoo_ has quit [Excess Flood]

00:00 scoobydoo_ has joined #osdev

00:00 scoobydoo_ has quit [Excess Flood]

00:02 scoobydoo_ has joined #osdev

00:02 scoobydoo_ has quit [Excess Flood]

00:02 scoobydoo has quit [Ping timeout: 265 seconds]

00:02 scoobydoo_ has joined #osdev

00:02 scoobydoo_ is now known as scoobydoo

00:02 scoobydoo has quit [Excess Flood]

00:08 aejsmith has quit [Remote host closed the connection]

00:20 YuutaW has quit [Ping timeout: 260 seconds]

00:20 YuutaW has joined #osdev

00:31 elastic_dog has quit [Ping timeout: 260 seconds]

00:36 <geist> wow, someone put together a ridiculously detailed M1 analysis

00:37 <geist> https://twitter.com/handleym99/status/1437537535018684417

00:37 <bslsk05> twitter: <handleym99> Bigger than Jesus! Longer gestating than Chinese Democracy! Rarer than Once Upon a Time in Shaolin! ␤ It's finally available in (very) preliminary form! My first masterpiece -- M1 Explainer. <drive.google.com/file/d/1WrMYCZ… https://t.co/h3RuiXlro2> 1/ @dougallj @andreif7 @trav_downs @silicongang @stuntpants

00:38 elastic_dog has joined #osdev

00:38 <clever> let me check on the comments you left in the PR

00:41 asskoala has quit [Ping timeout: 252 seconds]

00:44 <clever> geist: oh, i just had a bit of a hacky idea, after arch_chain_load() turns irq's off on one core, and calls platform_quiesce, can platform_quiesce still spawn 3 pinned threads, and get the other cores to re-schedule?, and then block until they act via a spinlock maybe?

00:44 <clever> so when arch_chain_load tries to quiesce the entire system, platform_quiesce will re-park the other cores in a known location

00:45 <geist> probably not

00:45 <clever> what is most likely to fail there?

00:45 <geist> well, okay with the current scheduler it'll probably work

00:45 <geist> because it's not possible to deadlock on the 'pinned' cpu0 that has disabled interrupts

00:46 <geist> but in a queue-per-cpu style design, you've already stepped off the edge the moment you essentially hijack cpu 0 by disabling ints

00:46 <geist> that's fine as long as you dont intend to ever touch the scheduler again, on any cores

00:46 <clever> yeah

00:46 <geist> but if you do, then it's possible there's another thread blocked up on cpu0 that is holding a mutex in the heap, for example

00:46 <geist> and then another cpu tries to malloc something, boom

00:47 <geist> the current scheduler is a single queue, so there's no blocking up like that

00:47 <geist> so it'll probably work

00:47 <clever> reading the code, i can see that arch_chain_load will do: 1: arch_disable_ints, 2: target_quiesce (no-op), 3: platform_quiesce

00:48 <clever> platform_quiesce could temporarily turn IRQ's back on, and ask the scheduler to get all 4 cores running code i control, each grabbing a spinlock

00:48 <geist> yeah so if either target or platform goes and does stuff that involves grabbing mutexes or whatnot (heap) or fiddling with the scheduler

00:48 <clever> with spinlocks held, irq's are already off

00:48 <geist> then the fact that it disabled ints is now blown

00:48 <clever> platform_quiesce can then return back to arch_chain_load for hijacking core-0

00:48 <heat> what's a good resource for knowing how a modern CPU actually works under the hood?

00:48 <clever> and i can hijack the other 3 in my own way

00:49 <heat> uops and whatnot

00:49 <geist> i think we came up with a good solution about an hour ago: run LK in UP mode, grab the other cores, park them for eventual handoff

00:49 <geist> that's basically what all LK based bootloaders do

00:49 <heat> although something lower level would actually be cool as well

00:49 <geist> heat: a lot of what i learned was in the mid to late 2000s with an excellent series of articles on arstechnica, later collapsed into a book

00:50 <geist> basically a whole series of cpu architecture articles. more specifically superscalar cpu architecture

00:50 <geist> a lot of the rest of it i've learned here by talking to doug16k and whatnot

00:50 <clever> line numbers are also missing from your comments on the PR

00:50 <geist> oh?

00:50 <clever> normally, a comment is on a range of lines

00:50 <geist> huh. i just pushed a little +_ next to the line and started typing

00:50 <clever> weird

00:51 <geist> they seemed a little strange though, like they were some sort of mini-comment

00:51 <geist> i never saw a ui for 'start a review' or whatnot

00:51 <geist> i still dont fully grok the github review UI

00:51 <geist> and it seems to change on me every time i use it

00:51 <clever> for the first comment, i should check to see if loader_pa is within the vmm_get_kernel_aspace()->arch_aspace first, right?

00:51 <geist> yah

00:51 <clever> and then if its not, add my own aspace

00:51 <geist> a good example of the previous is arm-virt qemu

00:52 <geist> kernel aspace starts at 0x4000.0000+ and lo and behold that's also where physical ram starts

00:52 <geist> lots of socs i know (seems to be most modern ones i've seen) now start DRAM later on, 0x4000.0000 or 0x8000.0000 and just go right past 4GB

00:52 <geist> and then stuff all the peripheral stuff below that

00:52 <clever> thats a good reason for that testcase i mentioned earlier

00:52 <clever> can qemu be configured easily to make ram start anywhere?

00:52 <geist> no

00:53 <geist> it's extremely hard coded

00:53 <clever> ah, but if i pick the rpi qemu mode, it starts at 0

00:53 gog has quit []

00:53 anon16_ has quit [Ping timeout: 252 seconds]

00:53 <clever> so i can just swap between arm-virt and rpi, in qemu

00:53 <geist> sure

00:53 gog has joined #osdev

00:53 <geist> and write some sort of bootloader test case sure

00:53 <clever> so i could write a testcase, where i try to chainload linux, and ensure it works in both cases

00:54 <clever> how does qemu-arm-virt deal with unparking the other cores? PSCI was it?

00:54 <geist> yes

00:54 <geist> basically the way all modern arm64s do it

00:55 <geist> you call a piece of firmware to unpark/park the cores

00:55 <clever> so LK boots in UP mode, and just tells linux to go ask PSCI for the other cores?

00:55 <geist> what does linux have to do with it?

00:55 <clever> when you chainload a SMP capable kernel, and it wants more cores

00:55 <geist> oh if it's a bootloader yes

00:55 <geist> just dont touch the other cores, let linux deal with it

00:56 <heat> x86 also works like that

00:56 <clever> the other option, would be to make LK into a PSCI firmware

00:56 <geist> and PSCI specifically has a defined state the cores are brought up in

00:56 <heat> cores go through the BIOS single-threadedly and just wait for the wakeup in a loop

00:56 <geist> right

00:56 <geist> in the case of PSCI on arm it's the firmware job to do what it does. most of them actually do park the core and really bring them up from cold

00:56 <clever> heat: and my problem, is that the bios is missing, and i'm using LK as my bios

00:56 <geist> saves power that way

00:57 <geist> but that's the nice thing about it, it abstracts how the cores are brought up

00:57 <geist> well yeah. so PSCI runs in EL3 for one thing

00:57 <geist> and a proper PSCI firmware does *nothing* if it's not told. so really LK is completely inappropriate for 'sticking around' like that

00:57 <clever> but i'm on arm32, so EL3 style things would be a bit more tricky

00:58 <geist> there is no PSCI on arm32 i think

00:58 <geist> well okay no that's not true, but i haven't seen it implemented on arm32

00:58 <geist> because arm32 is dead.

00:58 <clever> and i'm the one re-animating the zombie :P

00:58 <geist> well to be more precise: a pure 32bit only core i dont think implements PSCI. but you can make PSCI calls from a 32bit only os (running at say EL1) on a 64bit core with a 64bit EL3

00:59 <clever> that makes sense

00:59 <geist> arm32 as a subordinate EL is still somewhat alive (though newer cpus dont implement it above EL0 or at all)

00:59 <heat> nintendo switches still have arm7s in them

01:00 <geist> the big wrinkle is... cortex-a32 which is a 32bit only armv8 core (with four ELs)

01:00 <clever> i think i wound up fixing both your PR comments at once

01:00 <geist> so actually in that case everything i said is a lie.

01:00 <heat> note 7, not v7

01:00 <clever> i switched aspace to being a pointer, so i can trivially select between the 2 available ones

01:00 <geist> so really it's pre-armv8 that doesn't do PSCI

01:00 <clever> and now i have to malloc it, which solves the 2nd issue

01:00 <clever> if (loader_pa isin vmm_get_kernel_aspace()->arch_aspace) {

01:00 * zid makes a note in geist's fine: Do not trust on star sequences or arm boot processes

01:00 <clever> so i just need to fill in this blank

01:00 <zid> file*, bah

01:00 <geist> heat: that's probably the little hidden portalplayer 'security processor' in the tegra

01:03 <heat> yup

01:03 <heat> also the boot cpu

01:03 <geist> yep

01:03 <geist> in a past life i dealt with a nvidia tegra. it booted the arm7 first, then booted the other cores

01:03 System123 has joined #osdev

01:04 <geist> very much like the broadcom mess that is the early raspberry pi cpus

01:04 <geist> basically an existing design (arm7tdmi + some dsp stuff) with some 'big' arm cores bolted onto the side

01:04 <clever> geist: i'm guessing i use arch_mmu_query to see if a given VA is within an aspace?

01:04 <geist> over the years the arm7 has turned into more and more of a security thing

01:04 <geist> clever: no. you have to test against KERNEL_ASPACE_BASE, etc

01:04 <clever> DEBUG_ASSERT(is_valid_vaddr(aspace, vaddr));

01:04 <clever> oh, maybe that

01:05 <geist> it's hard coded

01:05 <clever> is_valid_vaddr feels like the best option, because if that returns false, the aspace can never map it

01:05 gog has quit [Remote host closed the connection]

01:05 gog has joined #osdev

01:05 <clever> kernel/vm/vmm.c: arch_mmu_init_aspace(&_kernel_aspace.arch_aspace, KERNEL_ASPACE_BASE, KERNEL_ASPACE_SIZE, ARCH_ASPACE_FLAG_KERNEL);

01:06 <klange> I wonder what I can poke on my ThinkPad through ACPI without having to go all the way with AML... would be nice to have a battery level widget...

01:06 <heat> geist, why the arm7 though?

01:06 <klange> Would also be nice if I actually did the thing with panel widgets I said I was going to do and make them shared objects...

01:06 <geist> heat: because nvidia bought an existing design from a company called portalplayer in the mid 2000s

01:07 <geist> and then morphed it into tegra over time

01:07 <klange> Where is my notebook... of the paper variety..

01:07 <geist> https://en.wikipedia.org/wiki/PortalPlayer

01:07 <bslsk05> en.wikipedia.org: PortalPlayer - Wikipedia

01:08 <geist> much the way broadcom had bought another company and then morphed it into their ARM base of socs (they had been doing mips before)

01:08 <geist> and then the VCPU and whatnot came in via that path

01:08 System123 has quit [Ping timeout: 268 seconds]

01:09 <geist> in 2009 or so i was dealing with the second gen tegra, tegra 2

01:09 <geist> it still was basically a portalplayer with a cortex-a8 bolted onto the side

01:09 <heat> still doesn't explain why they didn't change it

01:10 * geist shrugs

01:10 <heat> why do they really like old CPUs booting new CPUs (see intel x86)

01:10 <geist> gotta ask them. 'them not changing it' is extremely common

01:10 pony has joined #osdev

01:10 <geist> well that kinda makes sense i guess. you can run a very old, small, extremely power efficient security processor off just internal SRAM

01:11 <geist> and do whatever crypto/etc you need to then decide to power up the many orders of magnitude larger cores

01:11 <geist> nice for things like sitting there with the soc 'off' and just sipping power keeping the battery running, etc

01:11 <geist> in their case they probably just kept it since they already paid for the arm7 IP

01:11 <geist> sometimes you see various socs stuff in one or more cortex-m class cpus for the same thing

01:12 <clever> that reminds me, ive had issues in the past, where a laptop battery just entirely died, because i left it fully charged and unused for months

01:12 <geist> same. worth bringing them out and powering them up every once in a while

01:12 <clever> ive heard of a product somewhere, that would automatically drain its own battery, if left unused for too long

01:12 <clever> to prevent exactly that

01:12 <clever> it had an MCU keeping track of idle time, and probably a mosfet and resistor, to dump things into heat

01:13 <heat> D:

01:13 <clever> and thats the kind of thing you could add into that security processor, if the battery is non-removable

01:13 <clever> its already running when "off"

01:13 pony has quit [Client Quit]

01:13 <clever> and you could even omit the dummy load resistor, just turn the fat cores on, and spin!

01:14 <geist> dumping things into heat is always fun

01:14 <heat> noooooooooooo

01:14 pony has joined #osdev

01:14 <clever> static inline bool is_valid_vaddr(arch_aspace_t *aspace, vaddr_t vaddr) { return (vaddr >= aspace->base && vaddr <= aspace->base + aspace->size - 1); }

01:14 <clever> geist: i think this function does exactly what we need, but its private to arch/arm/arm/mmu.c, so arch/arm/arm/arch cant see it

01:15 <geist> elevate it to an arch_mmu_* routine or an arch_aspace routine

01:15 <heat> vaddr < aspace->base + aspace->size is way clearer :)

01:15 <geist> seems like it could be generically used

01:15 <geist> heat: problem is wraparound

01:15 <clever> arch_aspace does sound like a good place to move it

01:16 <geist> heat: in the very common case where base + size == 0 it explicitly calculates based on base + size - 1

01:16 freakazoid343 has joined #osdev

01:16 <heat> hmm good point

01:16 <geist> when dealing with inner VM address calculations and whatnot these sort of wraps and whatnot are extremely common, have to be very very careful
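
The wraparound geist describes can be reproduced on a host machine with fixed-width arithmetic. What follows is a minimal sketch, not LK's code: `demo_aspace`, `demo_top_half` and both helper names are made up for illustration, and `uint32_t` stands in for a 32-bit `vaddr_t`. The inclusive form mirrors the `is_valid_vaddr` clever pasted; the exclusive form is heat's suggestion.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for a 32-bit arch_aspace; only base and size matter here. */
struct demo_aspace {
    uint32_t base;
    uint32_t size;
};

/* An address space covering the top 2 GiB, so base + size == 2^32, which wraps to 0. */
static const struct demo_aspace demo_top_half = { 0x80000000u, 0x80000000u };

/* Inclusive upper bound, as in the pasted is_valid_vaddr(). */
static inline bool demo_valid_inclusive(const struct demo_aspace *a, uint32_t vaddr) {
    uint32_t last = a->base + a->size - 1u; /* 0xFFFFFFFF for demo_top_half */
    return vaddr >= a->base && vaddr <= last;
}

/* Exclusive upper bound: reads more naturally but breaks when the end wraps. */
static inline bool demo_valid_exclusive(const struct demo_aspace *a, uint32_t vaddr) {
    uint32_t end = a->base + a->size; /* 0 for demo_top_half */
    return vaddr >= a->base && vaddr < end;
}
```

With `demo_top_half`, the inclusive check accepts 0xC0000000 while the exclusive check rejects every address, because nothing is below an end of 0.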

01:17 <clever> arch_mmu_ actually, compared to the other funcs

01:17 <geist> problem is arch_aspace_t is an opaque type, so you can't really put it in a public header next to the other methods

01:17 <geist> or at least can't do an inline version

01:18 <geist> tis the one really nice thing you can do easily in C and not as easily in C++: have completely opaque types with a bunch of methods defined on it

01:18 <geist> C++ tends to force you to expose the guts of your object *or* use a pimpl to hide the guts

01:18 <zid> seems like a job for a macro

01:19 <clever> *looks*

01:19 <moon-child> geist: isn't that what 'private' is for?

01:19 <zid> no

01:19 <moon-child> or is that not private enough

01:19 <geist> moon-child: sure but you still have to expose the guts in the .h

01:19 <zid> you still get the declaration

01:19 <moon-child> oh abi stuff?

01:19 * moon-child has not read scrollback

01:19 <clever> arch/include/arch/mmu.h:typedef struct arch_aspace arch_aspace_t;

01:19 <clever> ah, thats where its defined

01:19 <zid> so it still compiles slow and still breaks when things change (read: needs recompiling)

01:19 <geist> in C you can build OO style stuff, but just use an opaque struct for your pointer

01:20 <geist> but... then you dont get the advantage of lots of little inline accessors. so it's all a tradeoff

01:20 <zid> by like.. 'default' the idiom in C is basically to make things OO but on the TU level instead of the runtime memory level

01:20 <clever> geist: i think you can also do that with `class Foo;` in c++, you can pass a `Foo*` around, but you can never allocate a Foo or access any of its members

01:20 <geist> exactly same thing. you can *write* C style OO in C++ if you want

01:20 <clever> but member functions are a thing, and then you need to declare it fully

01:21 <geist> but my point is if you do it the C vs C++ way

01:21 <clever> arch/arm/arm/include/arch/aspace.h:struct arch_aspace {

01:21 <clever> ahhh, thats where its hidden

01:21 <geist> i'm not saying its great but it always pains me when you have to stuff so much crap in the .h file for some nominally opaque C++ object in its header because that's how you do it

01:21 gog has quit [Remote host closed the connection]

01:22 gog has joined #osdev

01:22 <geist> clever: right, each arch defines their own version of it

01:22 <clever> the typedef was making it harder to spot, but i found it now
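
The opaque-struct pattern discussed above can be sketched in a single file. This is an illustration only: every `demo_` name is hypothetical, the 0x40000000 size is an assumption, and the real LK split is the `typedef struct arch_aspace arch_aspace_t;` in arch/include/arch/mmu.h plus a per-arch struct definition. Callers of the public part only ever hold a pointer, so the range check has to be an out-of-line function rather than an inline accessor, which is the tradeoff geist mentions.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Public side: what a shared header could expose. The struct stays incomplete. */
typedef struct demo_aspace demo_aspace_t;
demo_aspace_t *demo_kernel_aspace(void);
bool demo_aspace_contains(const demo_aspace_t *aspace, uintptr_t vaddr);

/* Private side: would live in a per-arch source file. */
struct demo_aspace {
    uintptr_t base;
    size_t size;
};

/* Assumed layout for the example: 1 GiB starting at 0x40000000. */
static struct demo_aspace demo_kernel = { 0x40000000u, 0x40000000u };

demo_aspace_t *demo_kernel_aspace(void) {
    return &demo_kernel;
}

bool demo_aspace_contains(const demo_aspace_t *aspace, uintptr_t vaddr) {
    /* Inclusive upper bound so an aspace ending at the top of memory still works. */
    return vaddr >= aspace->base && vaddr <= aspace->base + aspace->size - 1;
}
```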

01:27 <geist> this M1 pdf is really interesting

01:28 <geist> it spends about 40 pages really trying to explain superscalar design but then really gets into it

01:28 <zid> The first chip in the range so it's going over all the basics super deep?

01:28 <clever> link?

01:28 <geist> see above

01:28 <geist> warning it's 300 pages

01:28 <geist> and very detailed

01:29 <clever> ah, up at https://twitter.com/handleym99/status/1437537535018684417

01:33 <clever> geist: what happens if i change the core affinity for the currently running thread, and then hit reschedule or yield?

01:33 <moon-child> Wow. It's 100 pages longer than agner vol 3

01:35 <geist> clever: good question, looks like it might kinda fall through

01:35 <geist> it'll not pick it, but i dont see it send any sort of broadcast for the other cores to pick it up

01:36 <geist> so if the target cpu is idle it wont 'pick it up'

01:36 <clever> ah

01:36 <clever> i do see a wakeup_cpu_for_thread function

01:36 <clever> oh, what if i change the pinned core, and then thread_sleep() ?

01:36 gog has quit [Remote host closed the connection]

01:37 <geist> that'll do it

01:37 <clever> it will temporarily suspend, and the irq will wake it up later

01:37 <clever> and route it to whatever it should be on now

01:37 <clever> so i can use that, to ensure i'm on a given core, without having to spin up a new thread

01:37 <geist> welll... not so sure

01:38 <geist> the affinity stuff is kinda half baked. it's intended to be a thing you set once

01:38 <clever> yeah

01:38 <clever> but i do notice it has both curr_cpu and pinned_cpu

01:38 <clever> i'll review the pinned_cpu code, and do some testing

01:38 <geist> the gist is with the single unified run queue, 'pinning' a thread to a cpu just means any other cpu will skip it

01:39 <geist> so there's some logic in the thread wakeup path to make sure the target cpu wakes up if it's idle and/or reevaluates

01:39 <clever> so the scheduler has to grab a global lock, when deciding which thread to run next?

01:39 <geist> yes
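
A toy model of "any other cpu will skip it" with a single shared run queue might look like the following. It is a sketch under assumptions, not LK's scheduler: the struct, the queue contents and `demo_pick_next` are invented here, and priorities and the global lock are left out entirely.

```c
#include <stddef.h>

#define DEMO_NO_PIN (-1)

/* Hypothetical thread record: only the pinning field matters for this sketch. */
struct demo_thread {
    const char *name;
    int pinned_cpu; /* DEMO_NO_PIN means any cpu may run it */
};

/* One shared queue that every cpu pulls from. */
static const struct demo_thread demo_run_queue[] = {
    { "parker-cpu2", 2 },
    { "parker-cpu1", 1 },
    { "worker", DEMO_NO_PIN },
};

/* Returns the index of the first thread this cpu may run, or -1 if there is none. */
static inline int demo_pick_next(const struct demo_thread *queue, size_t count, int cpu) {
    for (size_t i = 0; i < count; i++) {
        if (queue[i].pinned_cpu == DEMO_NO_PIN || queue[i].pinned_cpu == cpu)
            return (int)i;
    }
    return -1;
}
```

In this model cpu 0 walks past both pinned threads and takes "worker", while cpu 2 takes the thread pinned to it. Nothing here wakes an idle target cpu, which is the gap geist points out.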

01:39 <clever> ive also noticed, if there are no other competing threads, the pre-emption timer is never set

01:39 <geist> that's right

01:40 <clever> but if a new thread is created, pinned to such a core, and resumed

01:40 <clever> then you need to add the timer in

01:40 <geist> right

01:40 <clever> so it has to interrupt the task immediately?

01:40 <geist> probably

01:40 <clever> since it would be too costly to set a timer on the current core

01:40 <geist> there's a bit of a mental disconnect in my brain because i also completely rewrote all of this in zircon

01:41 <clever> "note to self, wake the neighbor at 5am", lol

01:41 <geist> this particular part (affinity) i did a complete gut and rewrite and it's far more complicated to handle all the edge cases

01:41 <clever> ah

01:41 <geist> but lots of it is also because i rewrote the scheduler to be per-cpu queues

01:41 <geist> and then there's this whole 'the thread is in the wrong queue' sort of edge cases

01:42 <clever> when using per-cpu queues, what happens if one of the cores winds up idle ahead of plans, and another queue is filling up?

01:42 <geist> also just the other day the trusty branch of LK added a bunch of logic for affinity

01:42 <geist> i need to pull it back

01:42 <clever> does something re-balance them?

01:42 <geist> i did some code reviews at work for it

01:43 <geist> https://android.googlesource.com/trusty/lk/common/+/01d4cc46a1a8f108bcb118bff9bc73b2ab2bac56 specifically

01:43 <bslsk05> android.googlesource.com: 01d4cc46a1a8f108bcb118bff9bc73b2ab2bac56 - trusty/lk/common - Git at Google

01:43 <geist> i occasionally cherry pick stuff out of that branch

01:44 <geist> clever: yeah per cpu queues is clearly much more efficient lock/etc wise and scales better

01:44 <geist> but suddenly your scheduler isn't 'perfect' as far as keeping all the cores occupied

01:44 <clever> i also need to investigate the thread priority some

01:44 <geist> a single queue that all cpus opportunistically pull from is 'ideal' in the sense that no cpu is wasted (if you're aggressive about waking them)

01:45 <geist> but clearly doesn't scale

01:45 <clever> i moved all of my animation code into threads, that block on wait_queue_block()

01:45 <geist> modern complicated systems have lots of logic to deal with trading off efficiency vs throughput vs overhead with regards to balancing threads between cpus

01:45 <clever> but if one thread is hogging cpu, the pre-emption wont interrupt it much

01:45 <clever> and the animation then slows to a crawl

01:45 <geist> why not?

01:46 <clever> the pre-emption is not going to interrupt a task at 60hz, to run 2 other threads

01:46 <clever> because the priorities are all equal

01:46 <geist> sure it will, it'll just round robin them

01:46 <clever> but how much time does each one get? before it rotates?

01:46 <geist> oh depends on what the quantum is set to

01:47 <geist> probably higher than you want

01:47 <clever> if i mess with the thread priority, then wait_queue_wake_all and INT_RESCHEDULE, will forcibly switch to the animation threads on each vsync irq

01:48 <clever> and those are supposed to be very quick routines, so it will get back to the cpu heavy part

01:48 <geist> https://github.com/littlekernel/lk/blob/master/kernel/thread.c#L494 is literally the line

01:48 <bslsk05> github.com: lk/thread.c at master · littlekernel/lk · GitHub

01:48 <geist> looks like 50ms, since that's 5 ticks of 10ms

01:49 <clever> ah, and that would explain the stuttering, when i have a 16ms vsync interrupt

01:49 <geist> yep

01:49 <clever> so its missing 3 or 4 frames
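
The estimate above is plain arithmetic on the numbers from the chat (5 ticks, 10 ms per tick, 16 ms per vsync). A minimal sketch, with helper names invented for the example, assuming an equal-priority CPU-bound thread keeps the core for its whole quantum:

```c
/* Length of one scheduler quantum in milliseconds. */
static inline unsigned demo_quantum_ms(unsigned quantum_ticks, unsigned tick_ms) {
    return quantum_ticks * tick_ms;
}

/* Whole vsync periods that elapse inside one quantum (0 if vsync_ms is 0). */
static inline unsigned demo_vsyncs_per_quantum(unsigned quantum_ticks,
                                               unsigned tick_ms,
                                               unsigned vsync_ms) {
    if (vsync_ms == 0)
        return 0;
    return demo_quantum_ms(quantum_ticks, tick_ms) / vsync_ms;
}
```

With 5, 10 and 16 this gives a 50 ms quantum spanning three full vsync periods, which lines up with the low end of the "3 or 4 frames" guess.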

01:49 <geist> but yes the priorities are hard so if you have some long running thing make it lower priority

01:49 <clever> even with the irq saying INT_RESCHEDULE, the scheduler says no and lets the quantum run out

01:50 <geist> but actually it's more complicated than that, the other threads *should* be preempting it

01:50 <geist> since the scheduler has a feedback loop

01:50 <clever> the long-running thread, is doing tga decode with irq enabled

01:50 <clever> the 2 animation threads, are blocked on wait_queue_block waiting for vsync

01:51 <geist> oh wait no. yes, again i have a disconnect. the LK scheduler is hard priority

01:51 <clever> and the vsync irq handler will wait_queue_wake_all(&channels[hvs_channel].vsync, false, NO_ERROR); and INT_RESCHEDULE

01:51 <geist> no feedback. if you set it to priority 18 something that is set to 19 will *always* preempt it

01:51 <geist> the only difference is threads marked as 'real time' will not get a preemption timer on them

01:51 <geist> so they're even more uber, they will simply run until something higher priority gets them or they yield

01:51 <clever> and a INT_RESCHEDULE from an interrupt handler, can force that pre-emption, without any care about the remaining quantum?

01:52 <geist> yes

01:52 <clever> thats what i was expecting

01:52 <geist> well, okay actually no. it's more subtle

01:52 <clever> so i can use a slightly higher priority for animations, to keep them smooth, but i might use realtime for thermal throttling, so i dont cook things

01:53 <clever> or just handle that entirely in the irq handler, and dont give the scheduler a chance to mess up

01:53 <geist> it means 'call thread_preempt()' on this which will decrement the quantum by 1 and invoke the scheduler

01:53 <clever> ah right, let me check the arch irq routine

01:54 <geist> so it doesn't always reschedule the current thread. it only reschedules if the current thread runs out of quantum or something higher priority is in the queue

01:54 <clever> thread_preempt(); yep, exactly

01:54 xenos1984 has joined #osdev

01:54 <clever> so each vsync irq, is eating up one quantum, making it run out slightly faster than normal

01:54 <geist> if the current thread is out of quantum it may still pick it if it's still the highest priority thread and there's nothing else in the same queue

01:54 <clever> until it runs dry, and has to reschedule

01:54 <clever> yeah, makes sense
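
The preemption rules geist lists can be written down as a small model. This is a sketch of the description in the chat, not the body of LK's `thread_preempt()`: all `demo_` names are hypothetical, the quantum of 5 ticks comes from the line linked earlier, and refreshing the quantum when nothing else is runnable is an assumption made to keep the model simple.

```c
#include <stdbool.h>

#define DEMO_QUANTUM_TICKS 5
#define DEMO_NO_READY_THREAD (-1)

/* Hypothetical thread state: a hard priority and a sloppy tick counter. */
struct demo_thread {
    int priority;
    int remaining_quantum;
};

/* One preempt request (timer tick or INT_RESCHEDULE). Returns true on a switch. */
static inline bool demo_preempt(struct demo_thread *cur, int best_ready_priority) {
    if (cur->remaining_quantum > 0)
        cur->remaining_quantum--;

    /* Hard priorities: anything higher always wins, quantum or not. */
    if (best_ready_priority > cur->priority)
        return true;

    if (cur->remaining_quantum == 0) {
        /* Out of quantum: round robin with an equal-priority peer if there is one. */
        if (best_ready_priority == cur->priority)
            return true;
        /* Otherwise the same thread is picked again with a fresh quantum. */
        cur->remaining_quantum = DEMO_QUANTUM_TICKS;
    }
    return false;
}

/* Counts preempt requests until the thread is switched out, or 0 if it never is. */
static inline int demo_calls_until_switch(int priority, int best_ready_priority,
                                          int max_calls) {
    struct demo_thread cur = { priority, DEMO_QUANTUM_TICKS };
    for (int call = 1; call <= max_calls; call++) {
        if (demo_preempt(&cur, best_ready_priority))
            return call;
    }
    return 0;
}
```

In this model an equal-priority peer gets the cpu on the fifth request, a higher-priority thread gets it on the first, and a lower-priority one never does.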

01:55 <geist> right the quantum stuff is explicitly sloppy: only uses a counter of ticks and then the accounting of the ticks is sloppy

01:55 <geist> it's intended to not invoke current_time() or use a higher res thing like ms or us

01:55 <clever> and realtime threads can still be interrupted by irq's i assume, they just dont get interrupted by a pre-emption timer, and the scheduler will likely only interrupt it with another realtime?

01:56 <geist> the obvious thing to do there is to make quantum be tracked in actual time or at least some sort of jiffies thing

01:56 <geist> but in this case it's a sloppy notion of 'times we've checked'

01:56 <clever> i can see how avoiding current_time might help with speed

01:56 <clever> at least on the rpi, that involves MMIO leaving the cpu cluster, and going off to the clock peripheral

01:57 <geist> on a lot of cortex-m class stuff it can involve a divide or two

01:57 <geist> so it can take hundreds of cycles

01:57 <clever> yeah, my clock peripheral returns uSec, so it also needs a /1000

01:58 <clever> i wonder...

01:59 <clever> c4001e1c: 80 90 d0 61 bl c400e1bc <__udivdi3>

01:59 <clever> yeah, thats a pretty big function

02:00 <geist> yah few hundred instructions at least

02:00 <clever> i can see why you want to avoid it

02:00 <clever> 0x488 bytes, and an opcode is a minimum of 16 bits

02:00 <geist> looks like the realtime thread stuff is still a bit half baked. it still calls thread_preempt on it

02:00 <geist> which really it shouldn't be fiddling with the quantum on a real time thread

02:01 <geist> but it does keep it from interrupting the cpu that's running one

02:01 <geist> which is really what it's for

02:01 <geist> so if you took a real time thread, pinned it on cpu 2 then it'll basically leave that cpu alone, not send it IPIs, etc. that was the intent at the time

02:01 <clever> yeah

02:02 <clever> but if its pinned on a core that can service hw irq's, it will be getting interrupted by things like uart and vsync

02:02 <geist> so *really* that's all the real time flag does: it marks a cpu that's running a real time thread as off limits to ipis

02:02 <geist> right

02:02 <clever> for the rpi arm, only one core can ever get a hw irq, the rest are IPI only

02:02 <clever> for rpi vpu, each core has its own irq mask set, and vector table

02:02 <clever> so i could balance it however i want

02:03 <geist> been a while since i looked at this stuff. it's odd this sort of disconnect where i've been dealing with multiple derivatives of this and to go back to the ancestor

02:03 <geist> it's like going back and looking at linux 1.0 or something

02:03 <clever> heh

02:03 <clever> i recently updated my linux source, just in case it could fix something (it had basically no effect)

02:03 <geist> but... honestly i still mostly like how the LK stuff is for simple designs. it's good for 'i need some threads and wanna run some shit'

02:03 <clever> then i noticed it printing weird things on boot

02:03 <clever> C:0x010000C0-0x015B43E0->0x01095700-0x01649A20

02:04 <clever> https://github.com/raspberrypi/linux/blob/rpi-5.10.y/arch/arm/boot/compressed/head.S#L495-L503

02:04 <bslsk05> github.com: linux/head.S at rpi-5.10.y · raspberrypi/linux · GitHub

02:04 <clever> its debug info, for when the kernel copies itself

02:04 <clever> linux will take the PC for the decompression stub, round it down to the nearest 128mb (to find the start of ram), and then add 32kb

02:05 <clever> and it unpacks itself to that addr

02:05 <clever> but if the compressed kernel is in the way, it has to first memcpy itself out of the way

02:05 freakazoid343 has quit [Ping timeout: 252 seconds]

02:06 <clever> so ideally, the kernel should be at least 32kb + $uncompressed_size away from the start of ram, and the start of ram should be 128mb aligned

02:07 <clever> this is also where LK is in danger

02:07 <clever> -r-xr-xr-x 1 root root 126K Dec 31 1969 result/rpi2-test/lk.bin

02:08 <clever> my LK build is 126kb, so 94kb of it gets overwritten

02:08 <clever> any other cores LK had spinning, will then promptly malfunction on chainload

02:09 <clever> if they are idle, i expect them to be waiting for an IPI

02:09 <clever> and if linux never pokes the bear, they will remain idle forever

02:10 <clever> so any kind of parking i do, must either be within the first 32kb of the binary, or be a blob i copy to a safe place
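
The address arithmetic clever walks through can be checked with a few lines of C. This is a sketch of his description only, with hypothetical helper names; it assumes the LK image sits at the very start of RAM and that the decompressed kernel is large enough to cover everything past the 32 KiB offset.

```c
#include <stdint.h>

#define DEMO_RAM_ALIGN   (128u * 1024u * 1024u) /* 128 MiB */
#define DEMO_TEXT_OFFSET (32u * 1024u)          /* 32 KiB */

/* Where the decompressor would unpack the kernel, given the stub's PC. */
static inline uint32_t demo_decompress_target(uint32_t stub_pc) {
    uint32_t ram_start = stub_pc & ~(DEMO_RAM_ALIGN - 1u); /* round down to 128 MiB */
    return ram_start + DEMO_TEXT_OFFSET;
}

/* Bytes of an image loaded at the start of RAM that lie at or above the target. */
static inline uint32_t demo_bytes_overwritten(uint32_t image_size) {
    return image_size > DEMO_TEXT_OFFSET ? image_size - DEMO_TEXT_OFFSET : 0u;
}
```

For the 0x010000C0 address in the boot message the target comes out as 0x8000, and a 126 KiB image loses its last 94 KiB, matching the figure in the chat. Only code in the first 32 KiB survives.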

02:10 <clever> PSCI doesnt have to worry, because it can use EL3 features to protect itself

02:14 <clever> geist: PR updated: https://github.com/littlekernel/lk/pull/305

02:15 <bslsk05> github.com: [arch][arm] improve arm chainload by cleverca22 · Pull Request #305 · littlekernel/lk · GitHub

02:15 freakazoid12345 has joined #osdev

02:15 <clever> ive also confirmed it can still boot linux on an rpi

02:21 mahmutov has quit [Ping timeout: 265 seconds]

02:25 isaacwoods has quit [Quit: WeeChat 3.2]

02:31 freakazoid12345 has quit [Ping timeout: 268 seconds]

02:32 sm2n has joined #osdev

02:33 freakazoid343 has joined #osdev

02:36 mahmutov has joined #osdev

02:38 freakazoid343 has quit [Read error: Connection reset by peer]

02:39 sts-q has quit [Ping timeout: 252 seconds]

02:40 pony has quit [Quit: WeeChat 2.8]

02:43 freakazoid343 has joined #osdev

02:52 mahmutov has quit [Ping timeout: 268 seconds]

02:57 vdamewood has quit [Quit: Life beckons]

03:05 srjek has quit [Ping timeout: 260 seconds]

03:16 mahmutov has joined #osdev

03:17 vdamewood has joined #osdev

03:19 freakazoid343 has quit [Ping timeout: 252 seconds]

03:25 smeso has quit [Quit: smeso]

03:30 smeso has joined #osdev

03:36 anon16_ has joined #osdev

03:43 dude12312414 has joined #osdev

03:45 dude12312414 has quit [Client Quit]

03:58 anon16_ has quit [Read error: Connection reset by peer]

03:58 anon16_ has joined #osdev

04:01 mahmutov has quit [Ping timeout: 268 seconds]

04:02 Burgundy has joined #osdev

04:02 freakazoid12345 has joined #osdev

04:07 bradd has quit [Read error: Connection reset by peer]

04:07 bradd has joined #osdev

04:09 mahmutov has joined #osdev

04:10 pony has joined #osdev

04:20 mahmutov has quit [Ping timeout: 260 seconds]

04:41 <gorgonical> i'm reading about MLIR and the cool stuff it does, and I realized I was never taught what a lattice was in mathematics, even though I acquired a degree in computer science with a focus on math

04:41 <gorgonical> wtf america

04:47 fedorafan has quit [Ping timeout: 268 seconds]

05:01 fedorafan has joined #osdev

05:08 heat has quit [Ping timeout: 252 seconds]

05:35 System123 has joined #osdev

05:36 freakazoid12345 has quit [Ping timeout: 265 seconds]

05:40 System123 has quit [Ping timeout: 268 seconds]

05:53 sm2n_ has joined #osdev

05:56 sm2n has quit [Ping timeout: 268 seconds]

06:17 aejsmith has joined #osdev

06:27 fedorafan has left #osdev [Textual IRC Client: www.textualapp.com]

06:51 hanzlu has joined #osdev

06:53 dutch has quit [Ping timeout: 268 seconds]

07:01 System123 has joined #osdev

07:07 dutch has joined #osdev

07:32 tacco has joined #osdev

07:47 <klange> "Hm, there's nothing in my compositor to send the hotspot for the cursor to the vbox driver..." I say to myself... and sure enough, I have harded my wacky choice... of 26,26.

07:51 <klange> hard-coded*

07:54 System12_ has joined #osdev

07:57 <mjg> klange: does your os boot on bare metal?

07:57 System123 has quit [Ping timeout: 265 seconds]

07:58 System12_ has quit [Ping timeout: 268 seconds]

08:03 <klange> mjg: of course https://klange.dev/s/thinkpad_top.jpg

08:04 <mjg> nice

08:04 <mjg> i never had the guts to try even my hello world kernel

08:07 <klange> I have a photo from what appears to be December of 2011, before I had a GUI - actually, before I even seem to have had a proper userspace - of this same ThinkPad running in VGA text mode on my desk in my dorm.

08:08 <klange> It must be December of 2011, because there's a date in the kernel shell prompt of 12/14 and my nameplate from Apple is on the pegboard behind the screen...

08:09 <klange> And a later one from what looks to be April, and possibly with a real userspace shell: https://i.imgur.com/JPRYk.jpg

08:10 vai has quit [Quit: Lost terminal]

08:11 <klange> Old GUI running on a desktop that had a funny idea of display centering: https://i.imgur.com/Pps6H.jpg

08:12 <klange> running on an old netbook: https://i.imgur.com/u9Kz7.jpg ← This little guy is one of the reasons I wasn't doing 64-bit support for so long, those early Atoms were 32-bit.

08:21 <klange> that netbook playing quake: https://i.imgur.com/RR8ahQO.jpg

08:28 System123 has joined #osdev

08:31 <mjg> pretty solid stuff, fortunately does not make me want to go back to writing an os from scratch :)

08:32 <clever> mjg: thats why ive opted for the simpler, yet more complex route, of porting an existing kernel, to an under-documented cpu core!

08:35 <klange> Just booted on my desktop as I hadn't actually tried the new kernel here yet, and it works and brings up SMP and sees all 64GB of memory and Grub happily hands me a nice 1080p framebuffer for one of my four displays

08:36 <klange> But this box has a Realtek 8168-series NIC, so no network support, and also the PS/2 emulation layer is giving me a really slow mouse cursor (I think because this mouse is super-high DPI and it's being lazy about dealing with that)

08:38 <klange> I also booted on a Surface once, got to the desktop and full res, though I know that doesn't have older ACPI tables I'm looking for so no SMP, and since it has no PS/2 emulation it's utterly useless until I get this USB stack into a state of existence

08:39 <klange> But the clock ticked, so that's nice.

08:39 <klange> Desktop: https://cdn.discordapp.com/attachments/488304956453945354/887255858654892042/image0.jpg

08:40 <klange> I think those two Renesas USB controllers are integrated on the PCIe video capture boards I have installed, amusing that they're the same model as the one in the ExpressCard card I use in my laptop so I've got them in my hand-written PCI ID database.

09:00 CryptoDavid has joined #osdev

09:19 GeDaMo has joined #osdev

09:21 mctpyt has quit [Ping timeout: 268 seconds]

09:24 hbag has quit [Quit: The Lounge - https://thelounge.chat]

09:24 gog has joined #osdev

09:35 dormito has quit [Quit: WeeChat 3.1]

09:48 anon16_ has quit [Ping timeout: 252 seconds]

09:55 <klange> I guess that NIC's in the 8169 series so we have a quick reference page... https://wiki.osdev.org/RTL8169

09:55 <bslsk05> wiki.osdev.org: RTL8169 - OSDev Wiki

10:01 pretty_dumm_guy has joined #osdev

10:07 hanzlu has quit [Quit: Konversation terminated!]

10:14 zaquest has quit [Quit: Leaving]

10:16 zaquest has joined #osdev

10:16 vinleod has joined #osdev

10:16 vdamewood is now known as Guest6509

10:16 vinleod is now known as vdamewood

10:16 Guest6509 has quit [Killed (copper.libera.chat (Nickname regained by services))]

10:17 dormito has joined #osdev

10:18 dude12312414 has joined #osdev

10:31 hanzlu has joined #osdev

10:45 anon16_ has joined #osdev

10:59 asskoala has joined #osdev

11:43 hanzlu has quit [Ping timeout: 268 seconds]

11:43 X-Scale` has joined #osdev

11:44 hanzlu has joined #osdev

11:45 X-Scale has quit [Ping timeout: 268 seconds]

11:45 X-Scale` is now known as X-Scale

11:49 drewlander has quit [Quit: ZNC 1.7.2+deb3 - https://znc.in]

11:50 drewlander has joined #osdev

11:58 tacco has quit [Remote host closed the connection]

11:59 tacco has joined #osdev

12:04 asskoala has quit [Ping timeout: 265 seconds]

12:05 pony has quit [Quit: WeeChat 2.8]

12:18 ahalaney has joined #osdev

12:35 System12_ has joined #osdev

12:37 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

12:39 System123 has quit [Ping timeout: 252 seconds]

12:40 System12_ has quit [Ping timeout: 265 seconds]

12:40 isaacwoods has joined #osdev

13:05 ElectronApps has joined #osdev

13:29 shikhin has quit [Quit: Quittin'.]

13:31 shikhin has joined #osdev

13:51 anon16_ has quit [Ping timeout: 252 seconds]

13:52 srjek has joined #osdev

14:01 anon16_ has joined #osdev

14:01 anon16_ has quit [Client Quit]

14:02 anon16_ has joined #osdev

14:02 Izem has joined #osdev

14:14 Izem has quit [Ping timeout: 265 seconds]

14:20 elastic_dog has quit [Ping timeout: 268 seconds]

14:23 Izem has joined #osdev

14:37 elastic_dog has joined #osdev

14:50 dude12312414 has joined #osdev

15:22 <junon> So the deal with NIC's is that there is a typical set of chips that they use, and you need to support each of them to have general support for all of them, right? I'm sure there are outliers that have different, proprietary chips or whatever, but that's kind of the idea right?

15:23 <junon> It's the reason why e.g. linux can just automatically connect to the internet during setup in *most* cases whereas graphics card drivers and the like are much less general.

15:32 ElectronApps has quit [Remote host closed the connection]

15:42 asskoala has joined #osdev

15:49 dude12312414 has quit [Ping timeout: 276 seconds]

15:49 dzwdz has quit [Quit: I'm a quit message virus. Please replace your old line with this line and help me take over the world.]

15:50 dzwdz has joined #osdev

15:53 <zid> linux just has thousands of network drivers

15:53 <zid> there are huge overlaps though, like there are 80 variants of the same e1000 based network card

15:53 <zid> and 800 8139too cards

15:57 freakazoid12345 has joined #osdev

16:04 <junon> And it comes shipped with all of them at once?

16:05 <junon> i.e. in the installation medium?

16:05 <zid> most distros just provide basically every network driver with the default kernel, yes

16:06 <zid> They're tiny and most are grouped into families like that e1000 driver

16:06 <zid> supports hundreds of actual card revisions

16:06 freakazoid12345 has quit [Ping timeout: 268 seconds]

16:08 <zid> https://github.com/torvalds/linux/blob/master/drivers/net/ethernet/intel/e1000/e1000_main.c#L22

16:08 <bslsk05> github.com: linux/e1000_main.c at master · torvalds/linux · GitHub

16:09 <zid> 26 major card revisions

16:13 <junon> Gotcha, interesting. Thanks :)

16:15 Izem has quit [Quit: Going offline, see ya! (www.adiirc.com)]

16:16 System123 has joined #osdev

16:21 System123 has quit [Ping timeout: 268 seconds]

16:30 scaleww has joined #osdev

16:42 System123 has joined #osdev

16:48 freakazoid343 has joined #osdev

16:49 sortie has quit [Ping timeout: 252 seconds]

16:50 sortie has joined #osdev

17:17 scaleww has quit [Remote host closed the connection]

17:21 scoobydoo has joined #osdev

17:24 tacco has quit []

17:36 tacco has joined #osdev

17:41 amine has quit [Quit: Ping timeout (120 seconds)]

17:42 amine has joined #osdev

17:44 sprock has quit [Ping timeout: 268 seconds]

18:16 FreeFull has joined #osdev

18:18 <geist> re: nics though there are far fewer active modern nic chipsets than there used to be 15-20 years ago

18:18 <geist> so it's also gotten easier

18:18 <geist> most of the nic drivers that linux has are for obsolete chips at this point

18:19 <geist> over time these sort of things become commodity and most vendors get out of the business and you're left with a handful

18:22 <mjg> man

18:46 srjek has quit [Ping timeout: 260 seconds]

18:47 asskoala has quit [Ping timeout: 252 seconds]

18:48 freakazoid343 has quit [Ping timeout: 268 seconds]

19:06 hanzlu has quit [Quit: Konversation terminated!]

19:13 <geist> mjg: you said it

19:17 <mxshift> server NICs tend to be "fancy" and constantly broken

19:17 <clever> -00007a50 2f 52 5f 28 4a 57 27 49 56 36 53 61 37 54 62 39 |/R_(JW'IV6Sa7Tb9|

19:17 <mxshift> Intel i350, Chelsio T6, etc

19:18 <clever> +00007a50 00 1c c4 6e 4a 57 27 49 56 36 53 61 37 54 62 39 |...nJW'IV6Sa7Tb9|

19:18 <clever> i dont know how, but i recently had issues with random `00 1c c4 6e` chunks, appearing in the middle of my tcp streams

19:18 <clever> i downloaded a file with plain http and curl, and then did a diff to see how it got corrupted

19:19 <clever> and every single corrupt part, is that 4 byte sequence, 32bit aligned
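
A minimal sketch of the comparison clever describes: walk two copies of the data in aligned 4-byte words and report every word that differs. The function name `corrupted_words` is hypothetical, and the sample bytes are taken from the two hexdump lines pasted in the log; offsets are relative to the start of the buffers, not to the 0x7a50 file offset.

```python
def corrupted_words(good: bytes, bad: bytes, word: int = 4):
    """Return (offset, good_word, bad_word) for each aligned word that differs."""
    hits = []
    for off in range(0, min(len(good), len(bad)), word):
        g, b = good[off:off + word], bad[off:off + word]
        if g != b:
            hits.append((off, g, b))
    return hits


# The two 16-byte hexdump rows from the log, as raw bytes.
good = bytes.fromhex("2f525f284a5727495636536137546239")
bad = bytes.fromhex("001cc46e4a5727495636536137546239")

for off, g, b in corrupted_words(good, bad):
    print(f"+0x{off:02x}: {g.hex(' ')} -> {b.hex(' ')}")
```

Run over the whole file, a list where every hit has the same replacement word at a 32-bit-aligned offset points at a stray write rather than random line noise.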

19:19 <mxshift> does it happen regardless of upstream route?

19:19 <clever> mxshift: the route is just desktop -> gigabit switch -> rpi

19:20 <mxshift> plenty of things can go wrong inside a NIC but failing RAM in routers causes problems like that too

19:20 <mxshift> oh, you're copying locally

19:20 <clever> and it only affects one destination, when its running my open firmware

19:20 <clever> if i run the closed firmware, its fine

19:20 <mxshift> which NIC model is doing this?

19:20 <clever> the usb ethernet chip on an rpi2

19:21 GeDaMo has quit [Quit: Leaving.]

19:21 <mxshift> well, that removes a few potential causes

19:21 <clever> the thing i'm wondering, doesnt tcp/ip have checksums on the packets?

19:21 <mxshift> usb ethernet chip isn't going to have the NCSI packet matching that occasionally screws things up

19:21 <clever> how is the network stack letting this garbage hit userland?

19:22 <mxshift> TCP checksums are a 16-bit ones' complement sum, not even a CRC

19:22 <mxshift> and many devices don't actually validate them

19:22 <clever> i would expect the tcp checksum to be handled in linux

19:22 <clever> not the NIC

19:22 <mxshift> checksum offloading is very common

19:23 <clever> *looks*

19:24 <mxshift> also TCP checksum is very weak: https://dl.acm.org/doi/10.1145/347059.347561

19:24 <bslsk05> dl.acm.org: When the CRC and TCP checksum disagree | Proceedings of the conference on Applications, Technologies, Architectures, and Protocols for Computer Communication

19:24 <clever> Dec 31 20:00:35 nixos kernel: smsc95xx 1-1.1:1.0 eth0: register 'smsc95xx' at usb-3f980000.usb-1.1, smsc95xx USB 2.0 Ethernet, b8:27:eb:77:df:95

19:25 <clever> Sep 11 23:48:28 nixos kernel: smsc95xx 1-1.1:1.0 eth0: Link is Up - 100Mbps/Full - flow control off

19:26 <clever> mxshift: does usb also have any checksums/ecc?

19:27 <mxshift> yes, 16-bit CRC

19:28 <mxshift> `ethtool -k <ifname>` will tell you what offloads are enabled/available

19:28 <clever> the usb phy is currently mis-configured, so i cant even see the usb device

19:28 <clever> all i have to go on is the journal logs from a past boot

19:28 <clever> and the source for the driver

19:29 <clever> over at drivers/net/usb/smsc95xx.c within linux

19:30 <clever> /* Enable or disable Tx & Rx checksum offload engines */

19:30 <clever> mxshift: this comment implies the nic can offload things

19:31 <mxshift> yup

19:31 <mxshift> but keep in mind that the ACM paper I linked earlier shows you can have a valid TCP checksum with invalid data fairly easily
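
One way to see how weak the check is: the TCP checksum is the 16-bit ones' complement Internet checksum (RFC 1071), and because addition is commutative, reordering 16-bit words leaves it unchanged. The sketch below computes it over a bare payload only (real TCP also sums a pseudo-header and the TCP header); `internet_checksum` is a hypothetical helper name and the sample bytes are illustrative.

```python
def internet_checksum(data: bytes) -> int:
    """16-bit ones' complement of the ones' complement sum (RFC 1071)."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with a zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]  # big-endian 16-bit words
    while total >> 16:  # fold carries back into the low 16 bits
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF


original = bytes.fromhex("2f525f284a572749")
# Same 16-bit words with the first two swapped: different data.
swapped = bytes.fromhex("5f282f524a572749")

print(original != swapped)
print(internet_checksum(original) == internet_checksum(swapped))
```

Both comparisons hold, so a receiver that only validates this checksum would accept the reordered payload.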

19:32 <clever> i was consistently getting the same 32bit data, replacing different values

19:32 <clever> https/ssh noticed, and immediately died

19:32 <clever> http didnt care and corrupted the file in transit

19:34 <clever> that seems less like corruption, and more like a stray write or something

19:55 elastic_dog has quit [Ping timeout: 252 seconds]

20:00 elastic_dog has joined #osdev

20:03 sprock has joined #osdev

20:07 dude12312414 has joined #osdev

20:26 asskoala has joined #osdev

20:30 gog has quit []

20:55 dormito has quit [Quit: WeeChat 3.1]

21:25 sprock has quit [Ping timeout: 265 seconds]

21:27 Burgundy has quit [Ping timeout: 265 seconds]

21:36 dormito has joined #osdev

21:39 dude12312414 has quit [Ping timeout: 276 seconds]

21:41 dude12312414 has joined #osdev

21:41 xenos1984 has quit [Read error: Connection reset by peer]

21:41 * Bitweasil whistles.

21:42 <Bitweasil> You know what doesn't work if you screw up your TLB invalidation?

21:42 <Bitweasil> Anything relying on TLB invalidation, like using one page to map multiple regions of memory.

21:51 anon16_ has quit [Ping timeout: 265 seconds]

21:53 srjek has joined #osdev

21:54 h4zel has joined #osdev

21:58 xenos1984 has joined #osdev

22:02 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

22:02 System123 has quit [Ping timeout: 268 seconds]

22:09 elderK has joined #osdev

22:18 <junon> Bitweasil: what's TLB?

22:18 <zid> translation lookaside buffer

22:18 <zid> it's where your cpu keeps its spare frozen pies

22:18 <zid> or cached virtual to physical lookups, one of those two, I forget

22:19 <junon> according to google, apparently I want to translate it into german

22:19 <junon> "Lookaside-Puffer"

22:19 <junon> Thank you google.

22:19 anon16_ has joined #osdev

22:20 ahalaney has quit [Remote host closed the connection]

22:20 <junon> It never occurred to me that virtual->phys memory translation wasn't... I guess constant time?

22:20 <junon> But yeah it makes sense.

22:21 <j`ey> it has to do like 2-4 extra memory lookups

22:21 <j`ey> depending on the tables
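
To make the "2-4 extra lookups" concrete, here is a sketch assuming x86-64 4-level paging with 4 KiB pages: a 48-bit virtual address splits into four 9-bit table indices plus a 12-bit page offset, and each index costs one memory read during the walk. Other architectures and page sizes split the address differently; `split_va` is a hypothetical helper name.

```python
def split_va(va: int):
    """Split a 48-bit virtual address into 4-level table indices and page offset."""
    offset = va & 0xFFF          # low 12 bits: byte within the 4 KiB page
    pt = (va >> 12) & 0x1FF      # 9 bits per level, 512 entries per table
    pd = (va >> 21) & 0x1FF
    pdpt = (va >> 30) & 0x1FF
    pml4 = (va >> 39) & 0x1FF    # top-level index: first lookup of the walk
    return pml4, pdpt, pd, pt, offset


# Each of the four indices selects an entry in one table, so an uncached
# translation needs four dependent memory reads before the real access.
print(split_va(0x80000000))
print(split_va(0xFFE01000))
```

With a 2 MiB or 1 GiB huge page the walk stops one or two levels early, which is where the lower end of the range comes from.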

22:27 System123 has joined #osdev

22:27 <geist> Bitweasil: yah and a fun one is forgetting to tlb invalidate the page table cache

22:27 <geist> which is also exposed on ARM (and optionally on AMD)

22:28 <geist> it's a subtle detail, but if you screw that up and forget to invalidate it you get some *really* weird shit

22:29 <geist> it's one of those things that intel x86 Just Deals With so it's invisible

22:29 <Bitweasil> Oof. Yeah...

22:30 <Bitweasil> junon, when you do a virtual to physical translation, it takes quite a few memory steps to do it - you're chasing at least a few page tables down, and this takes time and DRAM bandwidth.

22:30 <Bitweasil> So the result (virtual 0x80000000 maps to physical 0x00010000, size 4kb) is stored in the TLB.

22:30 <Bitweasil> And it's searched every time you do a memory access.

22:30 <Bitweasil> The TLB is typically quite fast, so if the result is in there, great, you just do the access and go on your way.

22:31 <Bitweasil> But, if you change the mappings, you have to be able to invalidate it.

22:31 <Bitweasil> So, if, for example, I decide to use the page at 0xffe01000 as the mapping interface for CPU 0, I can point that virtual address to any physical page I want.

22:31 <Bitweasil> Buuuuut, if I want to point it to another physical page, I have to say, "Ok, that virtual address is no longer valid, I want you to use the page tables next time."

22:31 <Bitweasil> And some refactoring had screwed up the plumbing to route that through.

22:32 <Bitweasil> So the page was being remapped, but because it was still in the TLB, the page walker never hit the actual page tables.
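
The bug Bitweasil describes can be sketched with a toy model: a dict standing in for the page tables and a second dict standing in for the TLB. This is an illustration only, not how any real kernel or MMU is structured; `ToyMMU` and its method names are hypothetical, and the addresses are borrowed from the example in the discussion.

```python
class ToyMMU:
    """Toy model: a page table dict plus a TLB dict caching its lookups."""

    def __init__(self):
        self.page_table = {}  # virtual page base -> physical page base
        self.tlb = {}         # cached copies of page_table entries

    def map(self, vpage, ppage):
        # Updates the page table only; the TLB is NOT touched.
        self.page_table[vpage] = ppage

    def invalidate(self, vpage):
        # Drop the cached entry so the next access walks the page table.
        self.tlb.pop(vpage, None)

    def translate(self, vaddr):
        vpage, offset = vaddr & ~0xFFF, vaddr & 0xFFF
        if vpage not in self.tlb:  # TLB miss: do the "page walk"
            self.tlb[vpage] = self.page_table[vpage]
        return self.tlb[vpage] | offset


mmu = ToyMMU()
WINDOW = 0xFFE01000  # the per-CPU mapping window

mmu.map(WINDOW, 0x00010000)
first = mmu.translate(WINDOW + 0x10)  # fills the TLB

mmu.map(WINDOW, 0x00020000)           # remap, but forget to invalidate
stale = mmu.translate(WINDOW + 0x10)  # still served from the old TLB entry

mmu.invalidate(WINDOW)
fresh = mmu.translate(WINDOW + 0x10)  # now re-walks the page table

print(hex(first), hex(stale), hex(fresh))
```

The `stale` access lands in the old physical page even though the page table already points at the new one, which is exactly the symptom of a missing invalidation.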

22:32 System12_ has joined #osdev

22:33 System12_ has quit [Remote host closed the connection]

22:33 System12_ has joined #osdev

22:36 System123 has quit [Ping timeout: 265 seconds]

22:38 System12_ has quit [Ping timeout: 260 seconds]

22:39 <junon> Can you invalidate individual mappings? or just the entire page table all at once?

22:40 <junon> If the latter, doesn't that mean any time a new memory page is acquired by a user process, the TLB has to be flushed and thus subsequent memory fetches process-wide will have a TLB miss penalty or something?

22:41 <clever> junon: i think it depends on the cpu, but usually there is a way to invalidate a range of virtual addresses

22:41 <junon> Gotcha, okay

22:42 <clever> but on some systems, the tlb flush is per-core

22:42 <zid> invlpg on amd64

22:42 <clever> junon: so you need to interrupt every core on the system, force them all to flush, and wait for them to ack

22:43 <zid> though invlpg only invalidates the local core's TLB, the kernel still has to IPI the other cores itself (arm64's broadcast TLBI, or AMD's newer INVLPGB, can do that part in hardware)

22:43 <clever> so the more cores you have, the more they are going to be interrupting eachother, and the worse your performance becomes

22:43 <clever> but on hardware with a broadcast invalidate like zid just mentioned, it saves you from having to interrupt the other cores yourself, its automated in hw

22:44 <junon> I see, that contributes to process switching overhead, right?

22:44 <zid> not directly

22:44 <zid> invalidating a tlb entry is for when you've changed it, so the cached version is now invalid

22:44 <clever> junon: the pid is also often in the TLB records, so you dont have to flush on process switch

22:44 <junon> ohh okay

22:45 <clever> so only if you change a mapping, does it need a flush

22:45 <zid> If you've got the kernel mapped in both processes.. those don't need flushing

22:45 <zid> but it's often easier to just flush the whole thing anyway

22:45 <zid> there are some complicated schemes to allow you to do partial flushes, or flush based on PID tagging and stuff blah blah

22:45 <junon> I'm deep-diving on wikipedia, and single-address space is mentioned.

22:45 <junon> It lists advantages but no disadvantages

22:46 <junon> I suppose as memory usage increases and pages become more fragmented, you might run into more failed allocations that are > page size, rather than being able to map two disparate physical pages into a contiguous virtual memory area, right?

22:46 <junon> Or am I misunderstanding something?

22:47 <zid> it doesn't matter if physical memory is fragmented

22:47 <zid> virtual memory keeps it all linear

22:47 <zid> and dram doesn't really care beyond 64byte rows

22:47 <clever> only if you want to save some pagetable size, and use hugepages, does physical fragmentation matter

22:49 <junon> Right but in SAS you can avoid flushes if you use direct mappings, right?

22:49 <clever> SAS?

22:49 <zid> To avoid flushes you'd need to use unique virtual memory addresses, so you might as well just not bother

22:49 <junon> https://en.wikipedia.org/wiki/Single_address_space_operating_system

22:49 <bslsk05> en.wikipedia.org: Single address space operating system - Wikipedia

22:49 <zid> and just use no MMU at all

23:00 <clever> junon: ah, that looks like what you might use if you had no MMU at all, or just an MPU

23:04 <geist> most of what you lose is security

23:04 <geist> since everything can see everything

23:05 <geist> you can mitigate that somewhat with using 'safe' languages

23:05 <clever> an MPU could be used to restrict what you can see, but then you lose the speed of context switching

23:05 <zid> which re-adds the slowdown but probably worse

23:05 <zid> (almost certainly worse)

23:05 <geist> but SAS systems aren't used much anymore

23:06 <geist> except in embedded where you probably don't have a MMU/etc

23:06 <geist> but mid 80s or so there were lots of SAS systems on desktops, even multithreaded and preemptive

23:08 <geist> also note that not all arches require that you dump the entire TLB when context switching

23:08 <geist> so that particular SAS advantage isn't universal

23:08 <clever> PID tagging, right?

23:08 <geist> right

23:08 <geist> *most* modern arches have that feature, including modern x86s
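
A rough sketch of why tagging removes the flush: if each TLB entry is keyed by an address-space ID as well as the virtual page (ARM calls this an ASID, x86 a PCID), two processes can use the same virtual address without their cached translations colliding. `TaggedTLB` and the table contents below are hypothetical and only model the lookup key, not replacement or ID reuse.

```python
class TaggedTLB:
    """Toy TLB whose entries are keyed by (asid, virtual page)."""

    def __init__(self):
        self.entries = {}
        self.misses = 0

    def translate(self, asid, vpage, page_tables):
        key = (asid, vpage)
        if key not in self.entries:  # miss: consult that process's tables
            self.misses += 1
            self.entries[key] = page_tables[asid][vpage]
        return self.entries[key]


# Two address spaces mapping the same virtual page to different frames.
page_tables = {
    1: {0x400000: 0x10000},
    2: {0x400000: 0x20000},
}

tlb = TaggedTLB()
for asid in (1, 2, 1, 2):  # "context switch" back and forth, never flushing
    tlb.translate(asid, 0x400000, page_tables)

print(tlb.misses)
```

Only the first access from each address space misses; later switches reuse the cached entries, whereas an untagged TLB would have to be flushed on every switch to stay correct.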

23:09 <clever> did you get around to checking the 2nd version of my chainload pr?

23:09 <geist> nein

23:12 <geist> but we really should take it to the other channel anyway

23:12 <geist> shouldn't spam a bunch of that stuff here

23:12 <clever> sure

23:20 * Bitweasil mutters something about in-order architectures and avoiding speculation.

23:20 <Bitweasil> I really need to replace the CMOS battery on my old netbook.

23:20 <Bitweasil> It now... no longer sleeps competently, and I'm pretty sure the CMOS battery being dead has something to do with it.

23:26 gioyik has joined #osdev

23:35 h4zel has quit [Ping timeout: 268 seconds]

23:36 pony has joined #osdev

23:38 Retr0id3 has joined #osdev

23:40 Retr0id has quit [Ping timeout: 252 seconds]

23:40 Retr0id3 is now known as Retr0id

23:43 anon16_ has quit [Read error: Connection reset by peer]

23:43 anon16_ has joined #osdev

23:45 tacco has quit []

23:47 anon16_ has quit [Remote host closed the connection]

23:47 anon16_ has joined #osdev