#osdev on 2023-02-24 — irc logs at libera.irclog.whitequark.org

2021-05-23 01:57 klange changed the topic of #osdev to: Operating System Development || Don't ask to ask---just ask! || For 3+ LoC, use a pastebin (for example https://gist.github.com/) || Stats + Old logs: http://osdev-logs.qzx.com New Logs: https://libera.irclog.whitequark.org/osdev || Visit https://wiki.osdev.org and https://forum.osdev.org || Books: https://wiki.osdev.org/Books

00:00 k8yun has quit [Max SendQ exceeded]

00:01 k8yun has joined #osdev

00:11 k8yun has quit [Ping timeout: 260 seconds]

00:14 * kof123 drops an origami unicorn next to mrvn, and walks off

00:19 SpikeHeron has joined #osdev

00:24 nyah has quit [Quit: leaving]

00:28 <moon-child> table lookup rather than a polynomial? No multiplier?

00:38 <gorgonical> just realized that for instructions like adr, objdump lies to you and presents it in a way that it doesn't actually work

00:38 <gorgonical> adr x21, ffffffc000082710 is a lie

00:38 <gorgonical> That's the address it will end up with, but the actual value encoded is 0x26d0

00:45 <heat> hah

00:45 <heat> it should be commented like the rip-rel accesses

00:45 <heat> like mov 0x2314f7(%rip),%rdi # ffffffff814be340 <acpi_gbl_gpe_lock>

00:46 <gorgonical> and also I think my endiannness is backward

00:46 <heat> oopsie

00:46 <gorgonical> why would that be though

00:46 <gorgonical> No I manually flipped all the bytes for the decoder

00:46 <gorgonical> but I'm using aarch64-buildroot-linux-gnu-objdump for it

00:46 <gorgonical> ...

00:54 <gorgonical> am I dumb or something? When I xxd -g 1 I get the correct byte sequence of 95 36 01 10. But when I objdump they are printed in that reverse order: 10013695. Is this because it's printing the leftmost byte as the first one at the address, and the rightmost byte as the last one?

00:54 <gorgonical> The file *is* little-endian

00:54 <gorgonical> Oh crap, reverse what I said about objdump

00:54 <gorgonical> The RIGHTMOST byte is the first one at the address

00:55 <gorgonical> e.g. it says ffffffc000080040: 10013695 adr x21, ...

00:59 wand has joined #osdev

01:00 <gorgonical> Have I just been misreading objdump's hex output for years?

01:03 <moon-child> it does that for jumps too

01:03 <moon-child> presents the absolute offset instead of relative

01:04 <heat> gorgonical, for arm64 yes

01:04 <heat> because they are 32 bit words and not individual bytes

01:05 <gorgonical> heat: I think I know what you mean. In my head it makes more sense to think of it all as a long sequence of bytes though

01:05 <gorgonical> So seeing it written as 4-3-2-1; 8-7-6-5 can be strange to me

01:07 <zid`> gorgonical learns endian exists day today?

01:08 <moon-child> endianness for simd shuffles is really annoying

01:09 Iris_Persephone has joined #osdev

01:09 <Iris_Persephone> hiiiiiiiii

01:09 <moon-child> because the numbers are written as big endian in source, but interpreted as little endian--as it were--by the instruction

01:09 <moon-child> sup

01:09 <gorgonical> wait oh no

01:09 <gorgonical> I am a giant dingus

01:09 <Iris_Persephone> have I ranted about how annoying the windows gfx shell is here yet

01:12 <gorgonical> So xxd -g 1 shows 95 36 01 10. That's how the bytes are laid out in memory. little endian means that the word is 0x10013695. But then this online converter is wrong

01:12 <gorgonical> That's what's confusing me

01:13 <zid`> usually the little online tools have a toggle for le or 'no spaces'

01:13 <zid`> if it's showing as 95360110 then it's just bytes but without spaces

01:13 <zid`> if it's showing 0x95.. it's wrong

01:14 <gorgonical> I think this converter just isn't very good

01:14 <Iris_Persephone> what are you trying to convert

01:14 <zid`> 32bit words

01:14 <gorgonical> I'm just poking at my head.S and making sure I'm doing some variable addressing right

01:16 k8yun has joined #osdev

01:16 Burgundy has left #osdev [#osdev]

01:25 <gorgonical> I'm glad none of you know me in real life. Today's events would be embarrassing otherwise

01:25 <gorgonical> lol

01:25 <gorgonical> The shame of forgetting how endianness works is great

01:31 wand has quit [Remote host closed the connection]

01:35 <heat> i'm starting to think that allowing memory allocation in IRQ context is a good idea

01:36 <dh`> difficult to avoid

01:36 wand has joined #osdev

01:37 <heat> is it? a quick look at freebsd malloc(9) hints that it does not support "fast interrupt handlers"

01:39 <heat> my problem here is that I want to make all my block IO drivers follow a strict io-queue method where you queue requests and on IRQ you complete() them and submit the next one, if it exists

01:40 <heat> currently this may involve memory allocation and deallocation for bounce buffers, sg lists, etc

01:41 <heat> which may just hint that I'm trying to do too much on the top half IRQ and doing some softirq here would be a good idea

01:41 <heat> honestly, I don't know

01:43 <dh`> in general you don't want anything slow on the irq path code

01:43 <dh`> both because interrupt latency is bad in general and also because you can drop inputs if you don't react to the hardware fast enough

01:43 <dh`> my usual thought is capture what you need from the hardware immediately and defer the rest for further processing

01:44 <dh`> but it's been a long time since I wrote a real device driver

01:45 <heat> right, I think the real question here is "what is slow?"

01:45 <heat> malloc could be slow-ish (rare) or it could be blazingly fast (hopefully frequently)

01:47 <dh`> traditionally malloc is slow

01:48 <dh`> anyway for anything besides incoming network packets you probably already have a place to put what you get from the hardware

01:48 <dh`> it's incoming network packets that are a headache in this context

01:48 <heat> you can allocate upfront for rx no?

01:49 <dh`> only if you allocate the maximum input size every time

01:49 <dh`> maybe that's not a problem

01:50 <heat> oh right, yes, I see what you mean. I was thinking about raw NIC rx buffers for DMA

01:50 <heat> yes, that is usually all under softirq or threaded interrupts

01:51 <dh`> I have no idea how rx works for network devices with tcp offload

01:52 <dh`> even when I did write a few real drivers long ago, none of them were anything like that

02:12 matthews has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

02:13 matthews has joined #osdev

02:15 craigo has quit [Ping timeout: 252 seconds]

02:18 <mrvn> heat: if you allocate in the IRQ then you have to disable IRQs in malloc. And for SMP that means you have to lock it globally. Bad idea.

02:21 heat has quit [Remote host closed the connection]

02:21 heat has joined #osdev

02:21 <mrvn> Why can't you allocate all the memory when someone submits a read or write? E.g. the TCP stack would allocate bounce buffers and stuff and submit that to the NIC to add to the discriptors for receiving data.

02:21 k8yun has quit [Ping timeout: 268 seconds]

02:22 <mrvn> The descriptors having buffers to read into activates the IRQ for the NIC.

02:23 Arthuria has joined #osdev

02:25 <mrvn> dh`: a frame is 1500 bytes and a NIC has space for a limited numberd of descriptors so you don't need that much memory to max that out. With tcp offloading you might need 64k per frame which would eat a lot more memory. Still not all that much in servers that have that offloading and frame merging.

02:26 <mrvn> 1024 jumbo frames would be just 64MB for systems with >64GB of ram.

02:26 <mrvn> peanuts

02:28 <mrvn> .oO(and give you about 1s to allocate more in the soft irq when busy)

02:58 <moon-child> don't have to disable irqs

02:58 <moon-child> if you do the rop trick

02:58 k8yun has joined #osdev

03:02 <heat> what rop trick

03:04 <heat> mrvn, did you miss the last 20 years of malloc advancements? no need for a global lock on SMP at all

03:05 <heat> and it would also still be there without irq-safe malloc soooo, no idea what you're on about

03:08 mctpyt has joined #osdev

03:17 <moon-child> heat: isr probes stack to see if malloc is currently running. If not, then it can malloc freely. Otherwise, it overwrites malloc's return address with its own continuation (stashing the original return address somewhere)

03:18 <moon-child> so the isr runs right after malloc finishes

03:19 <moon-child> this can be seen as an implicit mutex, and a very simple scheduler (in particular, you have to deal with the case when another isr runs and wants to malloc). In the limit, it's an actual scheduler and mutex. But there is some interesting space to play with ahead of the limit

03:19 <heat> thanks, i hate it

03:19 mctpyt has quit [Ping timeout: 246 seconds]

03:20 <moon-child> :<

03:20 <moon-child> another avenue which might be interesting is to make the allocator 'lock-free' (relying on instruction-level atomicity). That sound annoying though

03:20 <moon-child> sounds*

03:22 <heat> that's not the right lock-free

03:22 <moon-child> probably the actual right thing to do is, if there's some reason you might need to malloc, shove an event in a queue somewhere and let someone else get to it in due time. But both of the above ideas are a lot cuter

03:22 <moon-child> heat: wym

03:22 <heat> you can have a totally percpu allocator

03:22 <heat> no need for shared state

03:22 <moon-child> it's a different meaning of 'lock-free' from the normal one, but the right one for this situation

03:22 <heat> in fact, I think slub actually does this

03:23 <heat> SLUB's slabs are all percpu AIUI

03:23 <moon-child> yes. per cpu. But 'lockfree' in that it's totally reentrant

03:23 <moon-child> so you can malloc concurrently from regular code and an isr

03:23 k8yun has quit [Quit: Leaving]

03:24 [_] has joined #osdev

03:24 <moon-child> probably this is way easier on x86 than other arches, since you have lots of instruction-level-atomic rmw without synchronisation guarantees

03:24 <heat> dude just cli and sti?

03:24 <moon-child> laaaaame

03:24 <moon-child> :^)

03:24 <heat> lol

03:25 <moon-child> what even is the point of writing your own os if you can't commit awful, terrible crimes?

03:25 <moon-child> normally we just have to live with the crimes of existing os authors

03:25 <moon-child> need some equity, ne?

03:26 <heat> writing poor man's UNIX is already a bad enough crime isn't it

03:26 <moon-child> heat

03:27 [itchyjunk] has quit [Ping timeout: 246 seconds]

03:27 <heat> moon-child

03:34 gxt_ has quit [Remote host closed the connection]

04:01 gxt_ has joined #osdev

04:07 heat has quit [Ping timeout: 246 seconds]

04:16 [_] is now known as [itchyjunk]

04:17 <zid`> heatchilder

04:22 <zid`> warumkinder

04:28 [itchyjunk] has quit [Read error: Connection reset by peer]

04:38 <zid`> someone needs to make a video player that doesn't suck

04:38 <zid`> vlc can't play shit properly on its best days, doesn't hw accelerate by default. mpc-hc is dead, and doesn't downmix to stereo by default and has bad hotkeys etc

06:10 bgs has joined #osdev

06:10 Arthuria has quit [Remote host closed the connection]

06:22 <dh`> disabling interrupts in malloc in no way means you need a global lock

06:36 Turn_Left has joined #osdev

06:38 Left_Turn has quit [Ping timeout: 248 seconds]

06:54 <Terlisimo> zid`: mpv?

06:54 <zid`> Isn't that a venereal disease

06:55 <zid`> >mpv is a free (as in freedom) media player for the command line

06:55 <Amorphia> zid`: it's not PC to say venereal disease anymore :L

06:55 <zid`> why, did people stop having sex

06:56 <Amorphia> yeah sex is not PC anymore

06:56 <zid`> what do they have instead?

06:56 <Amorphia> "intimacy"

06:56 <zid`> that's not what I am talking about though

06:56 <Amorphia> what are you talking about

06:56 <zid`> I'm talking about diseases you get from shagging

06:56 <Amorphia> lmao

06:59 * Amorphia hands zid` a test kit for "intimately sourced infections"

07:01 <zid`> like, TB?

07:01 <zid`> I hear that's rife in prisons

07:02 <Amorphia> not "inmately sourced"

07:02 <zid`> sure it is

07:02 <zid`> I have to cough on you, a lot

07:02 <Amorphia> hahahaha

07:04 slidercrank has joined #osdev

07:05 bgs has quit [Remote host closed the connection]

07:13 wand has quit [Ping timeout: 255 seconds]

07:31 wand has joined #osdev

07:32 arminweigl_ has joined #osdev

07:33 arminweigl has quit [Ping timeout: 255 seconds]

07:33 arminweigl_ is now known as arminweigl

07:56 Left_Turn has joined #osdev

07:59 Turn_Left has quit [Ping timeout: 248 seconds]

08:36 danilogondolfo has joined #osdev

08:57 shinbeth has quit [Remote host closed the connection]

09:15 bauen1 has quit [Ping timeout: 255 seconds]

09:16 gxt__ has joined #osdev

09:16 gxt_ has quit [Ping timeout: 255 seconds]

09:18 gog has joined #osdev

09:36 <Iris_Persephone> did I just walk into a discussion about the clap

09:38 awita has joined #osdev

09:38 GeDaMo has joined #osdev

09:39 <zid`> no, TB

09:39 <Iris_Persephone> the terminology for this changes a lot for some reason, probably the euphemism treadmill

09:40 <Iris_Persephone> is it STI or STD that's in vogue these days?

09:40 <FireFly> sti::cout

09:41 <Iris_Persephone> lmao

09:42 <sham1> Standard Incantation

09:43 <zid`> none of these are euphamisms though so idk why the euphamism treadmill is applicable, I think it's just it used to be VD then became STD for reasons unknown, maybe better public understanding? Then STI later because more accurate?

09:43 <zid`> That's my heresay anyway.

09:44 <gog> STD isn't necessarly inaccurate but it'd be more applicable for chronic conditions like herpes, hepatitis or HIV

09:44 <zid`> yea

09:44 <zid`> it's.. less accurate, but not inaccurate, imo

09:44 <zid`> hence, more accurate

09:44 <gog> yes

09:45 <gog> but even those aren't necessarily sexually transmitted only

09:45 <zid`> with hep and hiv etc not being sexually transmitted a lot of the time, maybe we'll go change again in future

09:45 <gog> yeah

09:45 <gog> lol

09:45 <Iris_Persephone> I mean it's the whole "we need a more scientific term for this because the older one is now used as an attack"

09:45 <zid`> yea that's what the treadmill is

09:45 <Iris_Persephone> like "retard" and shit used to be a technical term

09:45 <zid`> but none of these get used as euphamisms

09:46 <zid`> so as far as I am concerned, this must be a different process

09:46 <Iris_Persephone> I'd say it's a different manifestation of the same process

09:49 <Iris_Persephone> also "VD" turning into "STI" is literally listed on the wiki page about the euphemism treadmill lmao

09:50 <zid`> destigmatisation I'd go for

09:50 <zid`> euphamism treadmill o

09:50 <zid`> no

09:50 sympt5 has joined #osdev

09:51 <Iris_Persephone> like if you want to get technical I think the proper linguistic term is "pejoration"

09:51 sympt has quit [Ping timeout: 246 seconds]

09:51 sympt5 is now known as sympt

09:51 <zid`> we're talking about the inverse process though

09:52 <Iris_Persephone> the inverse is "melioration" but I don't think that applies because that's like reclaiming a word

10:27 slidercrank has quit [Ping timeout: 255 seconds]

10:27 bauen1 has joined #osdev

11:39 craigo has joined #osdev

11:58 netbsduser` has quit [Ping timeout: 255 seconds]

12:05 netbsduser has joined #osdev

12:06 netbsduser has quit [Client Quit]

12:27 terminalpusher has joined #osdev

12:42 Starfoxxes has quit [Ping timeout: 248 seconds]

12:48 <mrvn> moon-child: if you do the rop trick then what you basically do is a soft-irq. Might as well just do that from the start. Note: even with the rop trick you still need a global lock, it's just done in hardware when you do atomic cmpxchg.

12:49 <mrvn> a percpu allocator as heat suggested helps

12:51 <mrvn> Note: the isr probing the stack actually only works with percpu alloc or you have to probe all cores stacks and that's rather racey.

12:52 <mrvn> s/works/works well/

13:13 SpikeHeron has quit [Quit: WeeChat 3.8]

13:21 Starfoxxes has joined #osdev

13:23 marshmallow has quit [Remote host closed the connection]

13:38 gog has quit [Quit: Konversation terminated!]

13:38 gog has joined #osdev

13:55 <lav> mow

13:55 <gog> meo

13:56 <sham1> mov

13:57 <mrvn> moo

13:57 <gog> are there any arcitechtures with the mnemonic "moo"

13:58 <gog> that'd be cool

13:59 <mrvn> "I accidentally took my cats medicin. Don't ask meow."

14:00 <mrvn> gog: it's a Memory-Out-of-Order read. :)

14:00 <mrvn> aka prefetch

14:04 novasharper has joined #osdev

14:04 zxrom has joined #osdev

14:04 <gog> :o

14:04 <zid`> move offsettable object

14:04 <gog> moo

14:05 <lav> oom

14:05 <zid`> I am listening to japanese pop jazz stuff

14:06 <zid`> It's surprisingly good

14:06 <gog> i've got madonna on again

14:06 <zid`> with or without an intrusive r

14:06 <gog> :P

14:07 <gog> MA'DONN

14:07 <zid`> maddonna on sounds like a prescription medication to me cus of the intrusive r

14:13 slidercrank has joined #osdev

14:29 dutch has joined #osdev

14:39 <mrvn> lols @ The Ark. A LSD like compount that's smaller than a water molecule.

14:53 craigo has quit [Ping timeout: 255 seconds]

15:09 awita has quit [Ping timeout: 246 seconds]

15:16 nyah has joined #osdev

15:42 bgs has joined #osdev

15:51 [itchyjunk] has joined #osdev

15:56 pbx has joined #osdev

15:56 <pbx> 2~/wind 21

16:05 pbxvax has joined #osdev

16:07 <pbxvax> after writing some drui##ivers i managed to get this VAX11/750 on the net

16:08 pbxvax has quit [Remote host closed the connection]

16:33 <geist> oh neat

16:38 gog has quit [Remote host closed the connection]

16:38 gog has joined #osdev

16:42 <gog> fuuuuuuuuuuuucking god damn can i please have a stable internet

16:42 <gog> i'm trying to get work done and my db connection times out

16:42 <gog> i have one more feature to test before i'm done with this stupid thing T_

16:42 <gog> T_T

16:42 <gog> ToT

16:42 <Amorphia> gog: F

16:42 <lav> download this https://commons.wikimedia.org/wiki/File:H%C3%A4ststall_Elfviks_g%C3%A5rd_dec_2008.jpg

16:42 <bslsk05> commons.wikimedia.org: File:Häststall Elfviks gård dec 2008.jpg - Wikimedia Commons

16:42 <lav> now you have stable internet

16:43 <gog> hauststallar???

16:43 <gog> hvað fokk er þessu

16:43 <gog> oh i get it

16:43 <gog> stable

16:44 <lav> >_>

16:46 <gog> <_<

16:47 <dzwdz> v_v

16:47 <Amorphia> lav: you can do so much better than this, I'm disappointed in you, please try harder in futuer

16:47 <Amorphia> 2/10

16:48 <lav> don

16:48 <lav> 't be so neighgative

16:51 <gog> -1ULL/10

16:51 <gog> :3

16:51 <lav> 0x333c

16:52 <gog> SEMICOLON THREEEEEE

16:52 * kof123 .oO( malbolge OS )

16:54 <gog> base three arithmetic

16:54 <gog> oh god

16:57 <kof123> i actually meant befunge, the arrows

16:58 <kof123> "a cross between Forth and Lemmings"

16:58 <lav> hmm there should be a turing tarpit lang using cat emoticons

16:59 <GeDaMo> https://en.wikipedia.org/wiki/LOLCODE

17:00 <gog> omg yaaaaaay it worked

17:00 <gog> one minute before 5pm

17:05 Arthuria has joined #osdev

17:06 <geist> pbx: you have an actual 11/750 there?

17:09 gog has quit [Quit: Konversation terminated!]

17:18 sinvet__ has joined #osdev

17:18 sinvet__ is now known as shinbeth

17:19 bauen1 has quit [Ping timeout: 268 seconds]

17:23 Arthuria has quit [Ping timeout: 255 seconds]

17:30 xenos1984 has quit [Ping timeout: 246 seconds]

17:31 wand has quit [Remote host closed the connection]

17:31 xenos1984 has joined #osdev

17:38 Arthuria has joined #osdev

17:40 Arthuria has quit [Killed (NickServ (GHOST command used by Guest684531))]

17:40 Arthuria has joined #osdev

17:40 dutch has quit [Quit: WeeChat 3.7.1]

17:41 wand has joined #osdev

18:07 <mjg> in this episode of 'great work bsd, really appreciate it'

18:07 <mjg> 28049: unmount("FSID:-2024145069:135",MNT_MULTILABEL|MNT_ACLS) = 0 (0x0)

18:07 <mjg> right?

18:08 <mjg> wrong

18:08 <mjg> the actual flags are MNT_BYFSID | MNT_NONBUSY

18:09 <mjg> but these macros happpen to have the same values and as the above set and the magic machinery to stringify flags picks them instead

18:09 <mjg> g g

18:09 <mjg> the first set is part of flags which can be set on a mount point, while the latter concerns unmount requests

18:10 <mjg> you may notice there would be no problem if there was no completely unnecessary clash in the MNT_ namespace

18:10 <mjg> i mean i'm sure there were reasons for it incomprehensible for minds not on lsd

18:15 dutch has joined #osdev

18:23 xenos1984 has quit [Ping timeout: 260 seconds]

18:37 bauen1 has joined #osdev

18:39 xenos1984 has joined #osdev

18:42 danilogondolfo has quit [Remote host closed the connection]

18:50 dminuoso_ has joined #osdev

18:57 bnchs has joined #osdev

18:57 <lav> hi bnchs :3

18:58 <bnchs> hi lav

19:00 xenos1984 has quit [Ping timeout: 248 seconds]

19:04 gog has joined #osdev

19:05 netbsduser has joined #osdev

19:06 <dzwdz> are the linker flags i set in #define LINK_SPEC supposed to show in gcc -showspecs?

19:06 <dzwdz> if so my build is fucked

19:06 <dzwdz> i mean if they aren't then it's fucked anyways

19:09 <lav> hm what if there was a filesystem but it was comprised entirely of lisp s-expressions

19:09 <dzwdz> that's gotta exist already

19:16 xenos1984 has joined #osdev

19:17 awita has joined #osdev

19:57 k8yun has joined #osdev

19:57 Turn_Left has joined #osdev

20:00 Left_Turn has quit [Ping timeout: 252 seconds]

20:01 heat has joined #osdev

20:02 <heat> hello

20:02 <heat> how to write hello world in c language??

20:04 <lav> system.out.println

20:06 <mjg> you write a hello world in rust

20:06 <mjg> compile that

20:06 <mjg> then decompile to c

20:06 <mjg> et voila

20:07 <heat> @lav help pls

20:07 <heat> main.c:1:7: error: expected ‘=’, ‘,’, ‘;’, ‘asm’ or ‘__attribute__’ before ‘.’ token

20:07 <heat> 1 | system.out.println

20:07 <heat> | ^

20:09 <heat> mjg, hello funny freebsd man, please tell me why irq-disabled malloc is a bad idea

20:10 <mjg> ouch

20:10 <mjg> what will you do if it fails

20:11 <heat> if what fails?

20:11 <mjg> malloc

20:11 <heat> nothing different

20:12 <heat> i'm just asking you why you think irq disabled malloc is bad and preemption disabled malloc is good

20:12 <mjg> i don't

20:12 <lav> heat: looks like you're using an open source compiler. You need to use a Microsoft™ compiler if you want to be a real programmer.

20:12 <mjg> where did you get that idea

20:12 <heat> you told me back when I wrote the original slab stuff and profiling stuff

20:12 <mjg> ooh

20:12 <mjg> sorry, i misunerstood

20:13 <mjg> i thought you meant having irq disabled and then calling malloc

20:13 <heat> oh no, i mean yes, but what I really mean is s/disable_preemption/irq_disable/g

20:13 <mjg> as for what to do in malloc fast path intenrally

20:13 <mjg> irqs are expensvie to take a trip around

20:13 <mjg> while preemption trip is cmparatively very cheap, just a branch

20:14 <heat> wdym?

20:14 <mjg> i mean disabling and enabling interrupts takes more time than bumping a local counter, decrementing it later and branching on the result

20:14 <FireFly> heat: e.g. https://www.ioccc.org/1984/anonymous/anonymous.c

20:15 <FireFly> re. how to write hello world

20:15 <mjg> and if you add some hackery you don't even need to do it

20:15 <heat> really?

20:15 <mjg> is this a serious question

20:15 <mjg> wtf mate

20:15 <heat> are cli and sti just speculation barriers or something?

20:15 <heat> I don't know pal

20:15 <mjg> here is a pro tip

20:15 <heat> oh man pro tip

20:15 <mjg> boot up your kernel with rdtscp around these suckers

20:16 <heat> yes, and watch it vmexit

20:16 <heat> :)

20:16 <mjg> oh right

20:16 <mjg> sorry :d

20:16 <mjg> then get a real box and boot patched bsd or linux!

20:16 <mjg> look irq instructions are fucking expensive, i don't remember the exact cost though

20:17 <mjg> you do realize even SOLARIS does not do irq in there

20:17 <mjg> :p

20:18 <heat> linux does

20:18 <mjg> no

20:18 <heat> yes

20:18 <mjg> where

20:18 <heat> slab

20:18 <mjg> irq in the fast path?

20:18 <mjg> and not for some lol setup?

20:18 <heat> you mean irq disabling or irqs enabled?

20:18 <mjg> playing with irq to begin with

20:19 <mjg> are you sure you did not take al ook at something for use in interrupt handlers?

20:20 <heat> omfg

20:20 <heat> linux is a matrioska of functions

20:20 <heat> https://elixir.bootlin.com/linux/latest/source/mm/slab.c#L3255

20:21 <heat> there you have it

20:21 <heat> gave up on slub, but it's the same shit

20:21 <bslsk05> elixir.bootlin.com: slab.c - mm/slab.c - Linux source code (v6.2) - Bootlin

20:21 <heat> every lock, every fast path stuff is irqsave

20:23 <mjg> lemme trace for a sec. maybe that's a sad fallback

20:23 <mjg> if per-cpu caches are depleted

20:24 <heat> there's literally no difference in locking or slabs or that sort in GFP_ATOMIC vs GFP_KERNEL

20:24 <mjg> so i'm in slub as this is what ubuntu is using

20:24 <heat> mjg, it is not, see ____cache_alloc

20:24 <mjg> * lockless fastpaths

20:24 <mjg> * The fast path allocation (slab_alloc_node()) and freeing (do_slab_free())

20:24 <mjg> * cmpxchg_double is possible to use, otherwise slab_lock is taken).

20:24 <mjg> *

20:24 <mjg> * are fully lockless when satisfied from the percpu slab (and when

20:25 <mjg> * They also don't disable preemption or migration or irqs. They rely on

20:25 <mjg> * the transaction id (tid) field to detect being preempted or moved to

20:25 <mjg> * another cpu.

20:25 <mjg> that's slub

20:25 <mjg> slab afair is a legacy shitter?

20:25 <heat> no

20:25 <heat> slab is used in prod for google, and possibly others

20:25 <mjg> can geist confirm? it does look terrible

20:26 <heat> they still have a bunch of internal benchmarks that show that slab is better than slub for them

20:26 <mjg> anyway see slub.c __slab_alloc_node

20:26 <heat> geist has nothing to do with linux so probably not, but I can try and find you the quote from the lkml

20:27 <gog> hi

20:28 <heat> mjg, https://lore.kernel.org/lkml/CAJD7tkaqrz8sGqgbyfQHU_NM3O=a_0bqSHB0gGYRB7Kj+w_05w@mail.gmail.com/

20:28 <bslsk05> lore.kernel.org: Re: Deprecating and removing SLOB - Yosry Ahmed

20:29 <mjg> aight

20:29 <mjg> i guarantee the irq trips *are* slower. it may be there are other properties down below which make a difference, for example how it reacts to changing load

20:29 <mjg> how many elements to fill etc.

20:29 <mjg> and it may be they work better for G

20:30 <mjg> m

20:30 <mjg> you could load a toy kernel module on your laptop

20:30 <mjg> just sayin

20:30 <mjg> :]

20:31 <mjg> maybe some git logging would explain why they roll with irqs over there

20:32 <heat> because they always have?

20:32 <heat> I don't think kmalloc has ever been banned in hardirq context

20:32 <heat> at least not in the last 20 years (2.4?)

20:33 <heat> in any case, even slub slow(er) paths do irqsave

20:34 <heat> it's all irq safe stuff that needs to be called and can be called from hardirq context

20:34 <heat> versus freebsd explicitly saying "no hard irq stuff)" in malloc(9)

20:34 <mjg> i'll hack up simple code, give me few

20:34 <heat> whether that's a lie is beyond me, since freebsd manpages love to lie

20:39 <mjg> you can't use malloc in interrupts

20:41 awita has quit [Ping timeout: 246 seconds]

20:42 <heat> CringeBSD

20:43 <heat> i could probably just replace my preemption disabling with irqs in slab.cpp and see what happens

20:49 <mjg> takes 5 years to boot this motherfucker

20:49 <mjg> https://dpaste.com/F8G8BBX9J

20:49 <bslsk05> dpaste.com <no title>

20:50 <mjg> https://dpaste.com/B8XQDCU37 patch

20:50 <bslsk05> dpaste.com <no title>

20:50 <mjg> basically modulo initiall loller, i'm guessing an interrupt cmaei n, it is all faster to roll with preemption

20:50 <mjg> and i mean liek about twice the throughput

20:51 <mjg> not the most scientific test, but enough to prove the point

20:52 <mjg> just in caes, this is cycle count

20:53 <mjg> so again, interrupt trips are fucking slow man

20:53 <heat> what are critical_enter/exit and intr_disable/enable defined to?

20:53 <mjg> td->td_critnest++;

20:53 <mjg> atomic_interrupt_fence();

20:53 <heat> just a simple percpu add and pushf+pop+cli?

20:53 <heat> yeah

20:54 <mjg> atomic_interrupt_fence();

20:54 <mjg> td->td_critnest--;

20:54 <mjg> atomic_interrupt_fence();

20:54 <mjg> critical_exit_preempt();

20:54 <mjg> if (__predict_false(td->td_owepreempt))

20:54 <mjg> __asm __volatile("pushfq; popq %0" : "=r" (rf));

20:54 <mjg> __asm __volatile("cli" : : : "memory");

20:54 <heat> yes, predictable

20:54 <mjg> __asm __volatile("pushq %0; popfq" : : "r" (rf));

20:54 <mjg> riught

20:54 <mjg> so

20:54 <mjg> as i said, it is faster to not fuck with interrupts

20:55 <mjg> frankly i'm confused how that's even a question. is it because linux is clearly doing it at lesat in slab?

20:55 <heat> it's because linux does it all the time

20:55 <mjg> not in slub fast path, if the comment is to be believed

20:55 <heat> and I don't know if this is some legacy thing they're stuck with for the slab allocators, or something else

20:56 <mjg> i would guess some of it is indeed used from interrupt handlers

20:56 <heat> sure, maybe not the slub fast path, but for sure the other paths

20:56 <mjg> but instead of dedicating a bucket for that purpose they use irqs to syncrho access

20:56 <mjg> shitty tradeoff if you ask me

20:56 <heat> btw what's 20 rdtscp "cycles" going to amount to?

20:56 <mjg> dawg plz

20:57 <heat> feline plz

20:57 <heat> seriously, how much time is that?

20:57 <mjg> i can tell you i already see uma_zalloc/uma_zfree on the profile when pushing packets on freebsd

20:57 <mjg> there is branches which do't need to be there

20:57 <mjg> it would be much worse if it was rolling with interrupts

20:57 <mjg> so that's what

20:58 <mjg> wait maybe i can share one

20:58 <heat> what's your tsc's frequency?

20:58 <mjg> ... no i can't

20:58 <mjg> look we can flame tomorrow

20:58 <mjg> i'm bailingfrom this crap for the day

20:58 <mjg> got a an email backlog :[

21:00 <heat> can you literally just show me the tsc frequency or am I going to have to bench this myself

21:00 <heat> i want to understand how much impact this shit can have

21:01 <heat> i don't want to be distracted with "hey look, flamegraphs!"

21:02 <mjg> have to boot it

21:02 <mjg> again 5 fucking years

21:02 <mjg> regradless of that i defo encourage you to run a similar etst on your machine

21:03 <mjg> you can write a lol module and load it

21:03 gorgonical has quit [Remote host closed the connection]

21:03 <mjg> just flip to the console first in case it panics :p

21:05 <mjg> tell you what though, sometime in next 2 months i suspect i'll patch the allocator to be optimal

21:05 <mjg> once that happens, i'll pessimize it just for you with irq instead of preemption trip

21:06 <mjg> and test

21:06 <mjg> Timecounter "TSC" frequency 2100000221 Hz quality 1000

21:06 <mjg> skylakhw.model: Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz

21:06 <mjg> aka skylake

21:07 DynamiteDan has quit [Excess Flood]

21:08 DynamiteDan has joined #osdev

21:12 <sham1> yawn

21:12 terminalpusher has quit [Remote host closed the connection]

21:13 Left_Turn has joined #osdev

21:13 levitating has quit [Ping timeout: 246 seconds]

21:14 <heat> ok so in theory it amounts to ~9ns

21:14 <heat> for any sort of fast path

21:14 <heat> i wonder, does this verify if you do "cli; cli; cli" etc?

21:15 levitating has joined #osdev

21:15 <heat> as in, do you get a performance penalty by disabling IRQs once they are already disabled

21:15 <Amorphia> smh not using mode-locked Ti:sapphire laser optics for fast computation

21:15 <Amorphia> nanosecond timescales are cringe

21:15 Turn_Left has quit [Ping timeout: 264 seconds]

21:15 slidercrank has quit [Ping timeout: 260 seconds]

21:16 <mjg> heat: that's a too primitive calculation for real impact. as noted, the above is good enough to show there is a difference, but it most likely *underplays* is

21:16 <mrvn> here is an idea: don't enable irqs in the kernel. problem solved.

21:16 <mjg> it

21:16 <mjg> same shit with rolling with atomics on a loop for a bench

21:19 <mrvn> If you have problems with the irq handler allocating memory have you considered giving it a SLAB for it's own (per-cpu)?

21:19 <mjg> that's literally what i recommended above

21:19 <mjg> and no, i don't have the problem

21:19 <mjg> :]

21:22 <mrvn> My IRQ driver gets 4k of memory per irq that it moinors. The driver requests an irq by sending a message, which is where the 4k come from, and the IRQ driver replies with a message when the irq happens waking up the driver that asked for the irq.

21:22 <mrvn> s/moinors/monitors/

21:23 <mrvn> So basically all my IRQs are soft irqs which totaly avoids the problem too and makes the IRQ handler really small and fast.

21:26 bliminse has quit [Quit: leaving]

21:27 awita has joined #osdev

21:29 awita has quit [Remote host closed the connection]

21:50 bgs has quit [Remote host closed the connection]

21:52 k8yun has quit [Quit: Leaving]

22:14 GeDaMo has quit [Quit: That's it, you people have stood in my way long enough! I'm going to clown college!]

22:18 sinvet has quit [Ping timeout: 252 seconds]

22:21 <moon-child> mrvn: not percpu malloc is strawman

22:25 elastic_dog has quit [Ping timeout: 248 seconds]

22:27 elastic_dog has joined #osdev

22:28 dude12312414 has joined #osdev

22:35 dude12312414 has quit [Quit: THE RAM IS TOO DAMN HIGH]

22:52 <heat> mjg, https://github.com/heatd/irq-preemption-test/

22:52 <bslsk05> heatd/irq-preemption-test - Linux kernel module that benchmarks low level preemption counter manipulation vs irqs-off-on (0 forks/0 stargazers)

22:53 <heat> I get 7-10 cycles slower with irqs-off-on vs the preemption counter stuff

22:53 <heat> which amounts to around 5ns

22:53 <heat> Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz, so kabylake

22:54 <heat> if any kind soul wants to try it out on modern Intel or AMD, feel free

22:54 <mrvn> does cli/sti flush the pipeline?

23:06 nikolar has quit [Ping timeout: 248 seconds]

23:20 <mjg> heat: your loop is too microbenchmarky, you need to slap *something* in there

23:21 <mjg> heat: but even then see my previous remark of this being a very primitive approach to begin with

23:21 <mjg> heat: what you can do here is port "fast path" of your allocator and roll with 2 variants

23:22 <heat> that is an interesting idea

23:22 <mjg> or boot your kernel on bare metal

23:22 <mjg> he he... he?

23:22 <heat> why he

23:22 <mjg> he

23:23 <heat> why not she

23:23 <mjg> apologies

23:23 <mjg> sheshe.... she?

23:23 <heat> 1) my kernel definitely boots on bare metal

23:23 <heat> and yeah that's all i have to say lol

23:23 <mjg> if it boots

23:24 <mjg> no need to port squat

23:24 <mjg> just add a loop to your kernel

23:24 <mjg> but you never actually tried, did you

23:25 <heat> tried what

23:25 xenos1984 has quit [Read error: Connection reset by peer]

23:25 <mjg> boot on bare metal

23:25 <heat> ofc I have

23:27 <mjg> well then see above

23:27 <mjg> i'm going to sleep

23:27 <mjg> by the time i wake up i better see an excel spreadsheet with the results

23:28 <mjg> oh wiat, is not your allocator fast path crap to begtin with?

23:28 <mjg> where is that crap

23:29 <heat> https://github.com/heatd/Onyx/blob/master/kernel/kernel/mm/slab.cpp#L602

23:29 <bslsk05> github.com: Onyx/slab.cpp at master · heatd/Onyx · GitHub

23:29 <heat> https://github.com/heatd/Onyx/blob/master/kernel/include/onyx/mm/slab.h#L21

23:29 <bslsk05> github.com: Onyx/slab.h at master · heatd/Onyx · GitHub

23:30 craigo has joined #osdev

23:30 craigo has quit [Read error: Connection reset by peer]

23:30 <mjg> it is half crap

23:31 <heat> wtf

23:31 <mjg> preferably you would pop an obj and if you got null, you *bail* to a noinline slowpath

23:31 <heat> you're mildly annoying

23:31 <mjg> this is a core primitive, it is supposed to be optimized

23:32 <mjg> i also note your sched_enable_preempt is a func call

23:32 <mjg> instead of an inline

23:32 <mjg> 's not good man

23:32 <heat> no

23:33 <heat> wait, yes

23:33 <heat> fuck

23:33 <heat> that's a quick fix

23:33 <heat> i really thought that was inline

23:33 <mjg> same with _disable

23:34 <mjg> also you should unlikely() branch on whether you got preempted

23:34 <mjg> if (cache->flags & KMEM_CACHE_NOPCPU) [[unlikely]]

23:34 <mjg> this should not be here. instead, there should be a dedicated entryp oint for slabs without per-cpu caching

23:34 <heat> hm?

23:34 <heat> nah

23:34 <heat> how would that work?

23:34 <mjg> ?

23:35 <heat> how could I ever have a kmem_cache_alloc if I can't use it for some slabs?

23:35 <mjg> if (c->objsize > PAGE_SIZE)

23:35 <mjg> {

23:35 <mjg> // If these objects are too large, opt out of percpu batch allocation

23:35 <mjg> }

23:35 <mjg> c->flags |= KMEM_CACHE_NOPCPU;

23:36 <mjg> you know for a fact given cache wont be doing per-cpu stuff

23:36 <mjg> so just use kmem_cache_alloc_nopcpu for them

23:36 <mjg> and make it an invariant kmem_cache_alloc is *NOT* used in that case

23:36 <mjg> et voila, branch whacked

23:37 <mjg> all in all openbsd vibes 3/10 theo would be proud

23:37 mctpyt has joined #osdev

23:39 <mjg> https://dpaste.com/AB888DQ32 this is aproximately how it should look like

23:39 <bslsk05> dpaste.com <no title>

23:39 <mjg> with the preempt stuff being inlines

23:40 <mjg> and unlikely() on getting preempted

23:41 <mjg> you got some massive branching going on in your preempt routines

23:41 <mjg> holy shit

23:41 <mjg> even freebsd does not

23:42 <mjg> ya man, with *your* preemption routines you are going to get a disfigured result

23:43 xenos1984 has joined #osdev

23:43 <mjg> i also just remembered preempt_enable/disable on linux may happen to expand to nothing depending on kernel config

23:43 <mjg> the module should probably check for it

23:43 <mjg> :]

23:44 <mjg> or may contain debug

23:45 <mjg> apologies for not mentioning that osoner

23:45 <mjg> you should proably check what it does on your kernel

23:45 <mjg> linux kernel

23:45 <mjg> cheers