#riscv on 2024-01-27 — irc logs at libera.irclog.whitequark.org

2023-08-11 11:05 sorear changed the topic of #riscv to: RISC-V instruction set architecture | https://riscv.org | Logs: https://libera.irclog.whitequark.org/riscv | Matrix: #riscv:catircservices.org

00:06 Andre_Z has quit [Quit: Leaving.]

00:11 <sorear> how much does not having tpidrro* hurt TLS capability restrictions?

00:18 <jrtc27> hm?

00:20 elastic_dog has quit [Ping timeout: 260 seconds]

00:30 <sorear> an object can access TLS belonging to any other object, because it's accessible from ctp

00:30 <sorear> a function cannot assume it has unique mutable access to its own TLS, because you can call a function with another thread's ctp

00:31 <sorear> is cgp use{d,ful} for anything?

00:31 elastic_dog has joined #riscv

00:40 epony has quit [Ping timeout: 264 seconds]

00:43 <jrtc27> the library-based compartmentalisation implementation uses per-library TLS, I believe

00:43 <jrtc27> (and if not, it should)

00:44 <jrtc27> cgp is unused in our current default ABI, but has been used in hesham almatary's compartos (https://arxiv.org/abs/2206.02852)

00:44 <jrtc27> I believe cheriot's ABI also uses it

01:10 <sorear> where should i start looking for how per-library TLS works?

01:41 MaxGanzII_ has quit [Ping timeout: 255 seconds]

01:44 khem has quit [Quit: Connection closed for inactivity]

01:53 <jrtc27> https://github.com/CTSRD-CHERI/cheribsd/tree/main/libexec/rtld-elf, grep for RTLD_SANDBOX

01:55 <jrtc27> https://man.cheribsd.org/cgi-bin/man.cgi/c18n is the manpage that's meant to give an overview of the thing, but won't mention details like that

01:56 <jrtc27> looks like it might just be isolating rtld's tls from the rest of the world's though at the moment, don't obviously see something stopping that (which is a bit of an oversight...)

01:56 <jrtc27> I'll go poke Dapeng on our Slack

01:57 <sorear> how does rtld figure out which tls to use when invoked from an untrusted environment?

01:59 <jrtc27> I'm not so clued up on all the implementation details, but I would guess it would do it with the same per-compartment trampolines it sets up for normal function calls

01:59 * jrtc27 notes the manpage makes no mention of trampolines...

02:00 <jrtc27> see libexec/rtld-elf/aarch64/rtld_c18n_asm.S for various trampoline fragment templates

02:01 <jrtc27> I'd suggest chatting with Dapeng on CHERI-CPU Slack if you want to learn more about the library compartmentalisation design

02:08 <sorear> so I don't think this can be implemented on riscv64 without a system call, since it's relying on an unforgeable thread ID in order to find the correct TLS on compartment switches

02:09 dh` has joined #riscv

02:19 Noisytoot has quit [Quit: ZNC 1.8.2 - https://znc.in]

02:20 Noisytoot has joined #riscv

02:21 <jrtc27> yeah as it stands I don't think you have what you need

02:21 <jrtc27> unless you did something magic with per-thread memory mappings

02:22 <jrtc27> that would work, and doesn't require anything new in the architecture

02:28 Tenkawa has joined #riscv

02:35 Leopold has quit [Remote host closed the connection]

02:36 Leopold has joined #riscv

03:00 <sorear> without arm-style split page table bases that means N*M page directories for N processes on M harts, which is probably a non-starter

03:00 <sorear> how worth reading are the Morello and CHERIoT specs?

03:11 KombuchaKip has quit [Quit: Leaving.]

03:21 Tenkawa has quit [Quit: Was I really ever here?]

03:32 <jrtc27> we already have per-process pages for the temporal safety shadow map

03:32 <jrtc27> and idk, depends what you want

03:58 epony has joined #riscv

04:05 czy` has quit [Remote host closed the connection]

04:05 czy has quit [Remote host closed the connection]

04:45 heat_ has quit [Ping timeout: 260 seconds]

05:05 ball has joined #riscv

05:09 <ball> Is it common for a RISC-V SoC to include an FPU?

05:32 <jrtc27> depends if it's targeting a market where you benefit from having it more than the cost of putting it in

05:38 <ball> That makes sense.

05:44 esv has quit [Ping timeout: 260 seconds]

06:07 BootLayer has joined #riscv

06:12 ball has left #riscv [I'll be back tomorrow, I should think.]

07:16 crossdev has joined #riscv

07:18 Stat_headcrabed has joined #riscv

07:18 Stat_headcrabed has quit [Client Quit]

07:19 Stat_headcrabed has joined #riscv

07:35 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

07:35 TMM_ has joined #riscv

07:54 Stat_headcrabed has quit [Quit: Stat_headcrabed]

08:22 shamoe has quit [Quit: Connection closed for inactivity]

08:39 stolen has joined #riscv

08:46 Stat_headcrabed has joined #riscv

08:48 unlord has quit [Changing host]

08:48 unlord has joined #riscv

08:49 dlan has quit [Remote host closed the connection]

08:52 dlan has joined #riscv

09:28 Stat_headcrabed has quit [Quit: Stat_headcrabed]

09:59 foton has quit [Ping timeout: 255 seconds]

10:03 MaxGanzII_ has joined #riscv

10:21 <sorear> jrtc27: do you mean per-thread pages? most pages in a posix address space are per-process :)

10:21 <jrtc27> you were concerned with processes

10:22 <jrtc27> for threads, the number of harts isn't relevant

10:22 <jrtc27> just the number of threads

10:27 <sorear> if you have more threads than harts, you only need as many "thread ID pages" as there are harts, because you can write the new thread ID into the page when you switch threads on that hart

10:27 <sorear> but you need to use a version of the process address space that maps the correct hart's copy of the thread ID page at the thread ID location

10:28 <sorear> if you have a process with one thread, you could have one version of the process address space at a time, but that would require editing page tables every time that thread is scheduled on a new hart

10:28 <jrtc27> if you want to be smart like that

10:29 <jrtc27> or just don't

10:29 <jrtc27> each thread gets its own real page, and that's that

10:29 <jrtc27> what userspace does with it is its choice

10:29 <jrtc27> it's what, 20k, 24k per thread?

10:30 <jrtc27> that's nothing in a modern OS

10:30 <sorear> years ago there was an extension in the works to make ASIDs optionally equivalent between harts for more efficient use of shared L2 TLBs, i don't know if that's still in the works but it would also have problems with any version of this approach

10:30 <sorear> 20k, 24k whats? minimum stack size?

10:31 <jrtc27> bytes

10:31 <sorear> yes but for what

10:31 <jrtc27> one 4k page for the page itself, 4 or 5 more for the levels of page table

10:32 <jrtc27> or 3 in puny sv39 land

10:32 <jrtc27> and yeah asids get fun

10:32 <sorear> if you give each thread its own real page, you need a satp switch on every thread switch, which historically defeats the purpose of having threads at all

10:34 <jrtc27> well it depends on how you do asids doesn't it

10:34 <jrtc27> satp switch isn't going to add noticeable overhead when you're already in the os if you can keep the asid

10:34 <jrtc27> anyway, must go for a bit

10:35 <jrtc27> it's probably not the right design but it's not entirely stupid

10:36 <jrtc27> but if you need to mess with asid architecture to make it work well then I guess you might as well just add the one register you need to make it all irrelevant

10:36 <sorear> so satp switch to a new page table base with the same asid, then sfence.vma on the thread page in the scheduler? might work although I'm nervous about assuming that a sfence.vma to a single page will be cheap on all relevant impls

10:37 ezulian has joined #riscv

10:38 <jrtc27> yes

10:39 <jrtc27> it's basically just a special modified asid reuse problem

10:39 <jrtc27> and, eh, if software assumes it's fast people will build it fast lest their products be deemed glacial crap

10:39 <jrtc27> :)

10:39 <jrtc27> anyway, really off now

10:53 ezulian has quit [Remote host closed the connection]

10:58 stolen has quit [Quit: Connection closed for inactivity]

11:24 Stat_headcrabed has joined #riscv

11:40 Stat_headcrabed has quit [Quit: Stat_headcrabed]

11:53 davidlt has joined #riscv

11:54 davidlt has quit [Remote host closed the connection]

11:58 davidlt has joined #riscv

12:00 gianluca has quit [Ping timeout: 256 seconds]

12:03 gianluca has joined #riscv

12:10 alexghiti has joined #riscv

12:15 alexghiti has quit [Ping timeout: 256 seconds]

12:16 alexghiti has joined #riscv

12:28 alexghiti has quit [Ping timeout: 264 seconds]

12:49 junaid_ has joined #riscv

13:19 junaid_ has quit [Quit: leaving]

13:22 jmdaemon has quit [Ping timeout: 256 seconds]

13:27 heat has joined #riscv

13:30 crossdev has quit [Ping timeout: 252 seconds]

13:36 Andre_Z has joined #riscv

13:48 shamoe has joined #riscv

13:56 Tenkawa has joined #riscv

13:58 <sorear> interesting that morello gives m-mode free license to create capabilities in registers and write to tag memory

14:18 Stat_headcrabed has joined #riscv

14:25 esv has joined #riscv

14:30 <sorear> E2H strikes, i was going to ask what the difference was between CPTR_EL2.TC and CPTR_EL2.CEN ...

14:30 esv has quit [Ping timeout: 260 seconds]

14:34 crabbedhaloablut has quit []

14:37 crabbedhaloablut has joined #riscv

14:37 esv has joined #riscv

14:49 davidlt has quit [Ping timeout: 260 seconds]

15:10 <sorear> kind of odd that cheriot went with a load barrier instead of a store barrier for revocation, there are fewer of those and they're less latency sensity

15:12 <courmisch> I see anathematic Armv8 register names!

15:17 <sorear> what of it

15:18 <sorear> the Am29000 has a branch target cache which stores the first 16 bytes of code at the *targets* of up to 32 branches; i haven't seen that in any modern design, instruction caches and zero-bubble BTBs/next line predictors work too well, but I wonder if it could be a relevant implementation strategy for cheri-type designs where accessing the instruction cache requires a bounds check. a BTC is _not memory_

15:48 alexghiti has joined #riscv

15:49 Andre_Z has quit [Quit: Leaving.]

15:53 MaxGanzII_ has quit [Remote host closed the connection]

15:54 MaxGanzII_ has joined #riscv

16:04 jacklsw has joined #riscv

16:12 n_crm has quit [Quit: ...]

16:13 n_crm has joined #riscv

16:18 ldevulder has quit [Ping timeout: 264 seconds]

16:18 Stat_headcrabed has quit [Quit: Stat_headcrabed]

16:24 ksbedfordjr has joined #riscv

16:24 <ksbedfordjr> hey guys

16:24 <ksbedfordjr> what is the server or debian

16:25 <ksbedfordjr> i am having a a odd pkg issue

16:26 <ksbedfordjr> willl post in a min

16:27 Stat_headcrabed has joined #riscv

16:27 <ksbedfordjr> it has to do with the start-stop-daemon

16:27 <ksbedfordjr> driving me up a wall

16:27 Stat_headcrabed has quit [Client Quit]

16:28 <ksbedfordjr> running build for error

16:28 davidlt has joined #riscv

16:29 davidlt has quit [Remote host closed the connection]

16:31 davidlt has joined #riscv

16:32 <ksbedfordjr> [🐳|🔨] debsums: changed file /usr/sbin/start-stop-daemon (from dpkg package)

16:33 <ksbedfordjr> but its in /sbin now

16:33 <sorear> what are you looking for exactly? I can't parse "server or debian"

16:34 <ksbedfordjr> try and find out what is causing this error with debsums

16:34 <ksbedfordjr> and the pkg

16:34 <sorear> those might be the same directory, https://wiki.debian.org/UsrMerge

16:36 <Tenkawa> ksbedfordjr: and need much more output (upload to paste.debian.net) .. this isn't much to work with.

16:37 <ksbedfordjr> thats the only error it gives

16:38 <ksbedfordjr> i will dig for more info brb

16:38 <Tenkawa> That might be the only context that means anything to you.. but is that "all" of the emtire process output?

16:38 <Tenkawa> er entire

16:41 <ksbedfordjr> it looks like sid is using /sbin/start-stop-daemon and they have not moved it to /usr/sbin

16:41 <ksbedfordjr> so it might be a pkg issue

16:42 <another|> welcome to unstable

16:43 <ksbedfordjr> someday this will all get fixed

16:43 <ksbedfordjr> grr

16:43 ldevulder has joined #riscv

16:44 <sorear> https://lists.debian.org/debian-devel/2023/10/msg00024.html since october, packages have been allowed to assume that /sbin is a symlink to /usr/sbin and built and tested in such an environment

16:48 <ksbedfordjr> so explain the move fro / to /usr

16:48 <ksbedfordjr> ./ should be all the base apps

16:48 <sorear> i've never heard of debsums before, is this something you ran or something that got automatically

16:48 <sorear> all of the binaries are physically in /usr/bin and /usr/sbin, /bin and /sbin are just symlinks. were you asking why?

16:48 <Tenkawa> sorear: debsums has been around since the dawn of time

16:49 <ksbedfordjr> https://manpages.ubuntu.com/manpages/trusty/man1/debsums.1.html

16:49 <Tenkawa> and usrmerge also was done years ago not just last yeat

16:49 <Tenkawa> er year

16:50 <sorear> https://wiki.debian.org/CheckingDebsums only says to use debsums if you have an old version of dpkg, although it doesn't quite call it obsolete

16:50 <sorear> i second Tenkawa's request for you to use paste.debian.org

16:50 <another|> https://manpages.debian.org/unstable/debsums/debsums.1.en.html to be precise

16:51 <Tenkawa> Yeah.. we need more context on that error

16:51 <sorear> hmm, url doesn't work

16:52 <Tenkawa> sorear: this is a great example of where the wiki needs major updatinf: CheckingDebsums (last modified 2016-12-22 06:53:18)

16:52 <Tenkawa> 2016? you have to be kidding me

16:53 <sorear> paste.debian.net?

16:53 <Tenkawa> debsums is still being used in bookworm

16:53 <Tenkawa> no.. debsums

16:53 <Tenkawa> A lot of us use it a lot

17:00 stolen has joined #riscv

17:09 <ksbedfordjr> I fixed it

17:12 <ksbedfordjr> changed lines from /sbin/start-stop-daemon to /usr/sbin/start-stop-daemon

17:12 <ksbedfordjr> and the builder is workign again

17:12 <ksbedfordjr> si simple fix for now

17:16 crabbedhaloablut has quit []

17:18 crabbedhaloablut has joined #riscv

17:21 <ksbedfordjr> thanks for the pointers

17:30 alexghiti has quit [Ping timeout: 260 seconds]

17:31 <Tenkawa> np.

17:38 <ksbedfordjr> so nxt is to get patches for boards

17:38 <ksbedfordjr> and get the riscv boards like nezha/d1 back working in the builder

17:39 <ksbedfordjr> currenlty now workingon cli based imgs then onto desktops

17:43 crossdev has joined #riscv

17:52 epony has quit [Ping timeout: 264 seconds]

17:54 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

17:54 TMM_ has joined #riscv

17:55 erg_ has joined #riscv

17:56 crossdev has quit [Read error: Connection reset by peer]

17:58 erg__ has joined #riscv

18:01 erg_ has quit [Ping timeout: 260 seconds]

18:05 jacklsw has quit [Quit: Back to the real life]

18:10 epony has joined #riscv

18:24 psydroid has joined #riscv

18:34 esv has quit [Remote host closed the connection]

18:34 esv has joined #riscv

18:39 ksbedfordjr has quit [Quit: Leaving...]

18:46 notgull has joined #riscv

18:48 hightower2 has joined #riscv

18:48 erg__ has quit [Ping timeout: 240 seconds]

18:52 esv has quit [Quit: Leaving]

18:53 esv has joined #riscv

18:53 notgull has quit [Ping timeout: 256 seconds]

18:58 crabbedhaloablut has quit []

19:00 crabbedhaloablut has joined #riscv

19:01 notgull has joined #riscv

19:01 MaxGanzII_ has quit [Remote host closed the connection]

19:02 MaxGanzII_ has joined #riscv

19:04 MaxGanzII_ has quit [Remote host closed the connection]

19:04 MaxGanzII_ has joined #riscv

19:08 sevan has quit [Ping timeout: 256 seconds]

19:14 EchelonX has joined #riscv

19:17 notgull has quit [Ping timeout: 268 seconds]

19:19 stolen has quit [Quit: Connection closed for inactivity]

19:21 esv has quit [Ping timeout: 268 seconds]

19:29 sevan has joined #riscv

19:32 sevan has joined #riscv

19:32 sevan has quit [Changing host]

19:37 khem has joined #riscv

19:38 esv has joined #riscv

19:39 notgull has joined #riscv

19:53 BootLayer has quit [Quit: Leaving]

19:56 <dh`> if you start messing with the mmu to add magic for per-thread pages, the simplest way to do it is with a register to override the page table

20:02 esv has quit [Ping timeout: 276 seconds]

20:03 esv has joined #riscv

20:04 <sorear> override how

20:10 esv has quit [Quit: Leaving]

20:10 esv has joined #riscv

20:23 <dh`> simple example: add a register that provides the page table entry for the last page of virtual memory

20:27 <dh`> or the tlb entry, if the tlb entry has more stuff in it

20:27 <dh`> that won't do for threads in user processes, but you could add a second register that holds the virtual address to override

20:28 <dh`> and require an sfence.vma for changing the override address to avoid weirdnesses

20:28 <dh`> it assumes that one page is enough, but it normally is

20:34 davidlt has quit [Ping timeout: 260 seconds]

20:56 mlw has quit [Ping timeout: 264 seconds]

21:12 <sorear> if I'm going to add a register i might as well add tpidruro and not mess with the translation logic

21:15 erg__ has joined #riscv

21:16 erg__ has quit [Remote host closed the connection]

21:30 <pabs3> Tenkawa: merged-/usr isn't done in Debian. because usrmerge does terrible hacks behind dpkg's back, and dpkg doesn't like that, we have to do some more terrible hacks to move files inside the .deb files to /usr, to ensure that files don't just disappear on upgrades that move files from / to /usr, and we aren't fixing dpkg, so packages outside of Debian will have the same issue without the same workarounds

21:34 <Tenkawa> pabs3: "Where" it's done is irrelevant... Debian is "using" a combined /usr structure

21:34 foton has joined #riscv

21:34 <Tenkawa> That's the only thing I was pointing out

21:35 <pabs3> "done" is the word I was reacting too, perhaps I misread what you meant by it

21:35 <Tenkawa> How/Where/Why... that's for the Debian core to fix... on a running system.. . Debian rootfs /usr and /usr/sbin are not unique

21:36 <Tenkawa> er /sbin

21:36 <pabs3> right

21:36 <Tenkawa> Some people still didn't know that

21:37 <pabs3> huh, its been that way for years

21:37 <jrtc27> sorear: SCTAG exists because Arm were concerned about swap performance, but their firmware turns it off and everyone agrees it shouldn't make it into a product

21:37 <Tenkawa> I know.... "long story..."

21:37 <Tenkawa> lol

21:39 foton has quit [Ping timeout: 256 seconds]

21:39 foton_x has joined #riscv

21:58 <sorear> why shouldn't it make it into a product? double standard where power/clock glitches setting chcr_el2.settag=1 is considered a catastrophic security flaw, but power/clock glitches setting the tag of a c register is fine?

21:59 crabbedhaloablut has quit []

22:01 crabbedhaloablut has joined #riscv

22:05 psydroid has quit [Quit: KVIrc 5.0.0 Aria http://www.kvirc.net/]

22:12 <jrtc27> don't care about glitches, if your hardware isn't digital then that's not our problem to solve

22:12 <jrtc27> it shouldn't exist as an option for software to use, as it provides an architectural way to totally bypass the security

22:13 <jrtc27> the only reason you'd want it to exist if there are things that need it to exist, but there aren't, so it shouldn't

22:16 <sorear> but if it were a write-only capability register that contained an infinity cap, that would be fine

22:17 sm2n has quit [Read error: Connection reset by peer]

22:17 raghavgururajan has quit [Read error: Connection reset by peer]

22:17 catcream_ has quit [Read error: Connection reset by peer]

22:17 jleightcap has quit [Remote host closed the connection]

22:17 sumoon has quit [Remote host closed the connection]

22:17 shreyasminocha has quit [Read error: Connection reset by peer]

22:17 BratishkaErik has quit [Remote host closed the connection]

22:17 pld has quit [Remote host closed the connection]

22:17 yyp has quit [Remote host closed the connection]

22:18 sumoon has joined #riscv

22:18 raghavgururajan has joined #riscv

22:18 catcream_ has joined #riscv

22:18 shreyasminocha has joined #riscv

22:18 pld has joined #riscv

22:18 sm2n has joined #riscv

22:18 yyp has joined #riscv

22:18 BratishkaErik has joined #riscv

22:18 jleightcap has joined #riscv

22:18 <jrtc27> hm? write-only capability register?

22:20 <sorear> sorry, write-only capability system register

22:20 <sorear> if you have an infinity cap in a register or system register then any operation is monotonic...

22:23 <jrtc27> if it's write-only then you by definition cannot access it and therefore should not regard it as part of your transitively reachable set of authorities

22:24 <sorear> make it read-write-WARL then, writes to Null or Infinity only

22:24 <jrtc27> and then you don't need the magic instruction

22:24 <jrtc27> you can just read it and buildcap

22:25 <jrtc27> and then no capability forgery is present

22:25 <jrtc27> and who needs a register, just put it in memory somewhere, and you can even have more flexibility than just null or infinity

22:25 <jrtc27> and then you don't need any special architecture

22:26 <jrtc27> this is the point, it buys you nothing but pollutes the architecture to have such an instruction

22:26 <jrtc27> so long as you have an efficient buildcap thing

22:26 <jrtc27> and if you put it in memory, you can build a little compartment to contain the powerful thing

22:27 <jrtc27> which gives you much more fine-grained control over who can use it and for what, with the right software framework

22:27 <jrtc27> rather than an all-or-nothing if and only if M-mode (and hopefully ASR)

22:29 <sorear> CHERI documents are inconsistent on whether buildcap can create sealed capabilities without O(N object types) steps

22:31 <jrtc27> https://github.com/CTSRD-CHERI/cheribsd/blob/698d1636dd1fe2322e5bc7029e415928c80b76b1/sys/vm/swap_pager.c#L2221-L2223

22:31 <jrtc27> three instructions needed

22:31 <sorear> if the choice is between SCTAG and a hundred ISAv9 CCopyType operations for the object types present in my system...

22:31 <jrtc27> we carefully designed the instructions to not need that

22:32 <jrtc27> clearly O(N) would be untenable

22:32 <jrtc27> we're not stupid...

22:32 <jrtc27> our CHERI has never had SCTAG in it, that was Arm's doing for Morello

22:35 <sorear> "permit a fast branchless rederivation sequence with multiple sealing authorities with a single CBuildCap and a set of CCopyType and CCSeal pairs" i didn't find the definition of "sealing authority" and the only thing I could think of at the time is that it meant one per object type

22:36 <sorear> apparently "pairs" means "one pair"

22:45 mwette has joined #riscv

22:54 <jrtc27> it's however many you hav

22:55 <jrtc27> you start with one almighty one and can subdivide it however you like

22:55 <jrtc27> if you want one per otype you can, but that's not particularly efficient if you're juggling a large number of them

22:55 <jrtc27> sealing authority is just a valid unsealed capability with seal permission

23:08 JanC_ has joined #riscv

23:09 JanC has quit [Killed (lead.libera.chat (Nickname regained by services))]

23:09 JanC_ is now known as JanC

23:28 Zer0day1984 has joined #riscv

23:28 Zer0day1984 has quit [Changing host]

23:28 Zer0day1984 has joined #riscv

23:58 mlw has joined #riscv