#riscv on 2021-06-04 — irc logs at libera.irclog.whitequark.org

2021-05-20 20:58 sorear changed the topic of #riscv to: RISC-V instruction set architecture | https://riscv.org | Logs: https://libera.irclog.whitequark.org/riscv | Backup if libera.chat and freenode fall over: irc.oftc.net

00:04 <xentrac> I think L4's approach to space allocation is inspiring

00:06 jn has quit [Remote host closed the connection]

00:07 jn has joined #riscv

00:12 <dh`> it is, in a certain way, but it's also extremely elaborate

00:13 <sorear> L4 or seL4?

00:13 * sorear wants to rank kernels by per-page VM overhead now

00:13 * jrtc27 hides

00:16 dionysos is now known as zoombie

00:16 <sorear> a sel4 dynamic system needs a Page and an Untyped for every page to support memory reuse, add the PTE for a uniquely mapped page and you have 9 words before the VM server's userspace allocation _starts_

00:17 zoombie is now known as dionysos

00:24 <xentrac> well, I was thinking of seL4 really

00:24 <xentrac> but I'm not entirely clear on the evolution of the mechanism through the illustrious and sordid history of L4

00:26 <sorear> http://sigops.org/s/conferences/sosp/2013/papers/p133-elphinstone.pdf §3.4.2 ?

00:26 <xentrac> the thing I thought was inspiring about it, in particular, was the total elimination of allocation failure from the kernel, because (as I understand it) all the pages are owned by userland code, even the ones the kernel uses

00:26 <xentrac> let's see

00:26 <xentrac> I've read this paper before but forgotten all of it

00:27 vagrantc has quit [Quit: leaving]

00:28 <xentrac> it says the mechanism in question was specifically invented in seL4

00:30 <xentrac> > This led us to a radically new resource-management model, where all spatial allocation is explicit and directed by user-level code, including kernel memory (Elkaduwe et al. 2008).

00:33 <xentrac> (§2.2)

00:33 <xentrac> I should actually try using seL4 to see what it's like

00:37 Sos has quit [Quit: Leaving]

00:40 <xentrac> I have a hypothesis about seL4 that I am uncertain about: I think there is no way for a process accepting a mapping of a memory page from another process to guarantee that the grantor has not retained access to the page. do you know if that is true, sorear?

00:40 <sorear> that's correct

00:41 <sorear> in a bunch of different ways sel4 requires acceptors of capabilities to trust their sources; if you want to set up a channel between two user processes you need a mutually trusted server to create the needed resources

00:43 <xentrac> if you have a mutually trusted server, can they then safely pass a memory region back and forth in a way that guarantees that only one of them has access to it at a time?

00:44 <xentrac> my motivating example here is sending a frame of video from an windowed application to a window server, which retains the frame until it has copied the visible part of it into the hardware framebuffer, then sends it back to the application for recycling

00:45 <xentrac> it's not a very good motivating example, I admit, because it only guards against accidental bugs rather than security violations

00:47 <xentrac> but there are historically lots of cases where fairly similar communication patterns have given rise to TOCTOU vulnerabilities

00:54 <sorear> xentrac: sure, the MTS can unmap from a process (if the process was never given the frame cap) or revoke the frame cap (if it was)

00:55 <sorear> although I think the idea is to use TOCTOU-safe ring buffers for most things and avoid remapping during operation

00:56 <xentrac> "avoid remapping"? well, I do want the page to get unmapped from process A and mapped in process B, and then vice versa; isn't that "remapping"?

01:09 FluffyMask has quit [Quit: WeeChat 2.9]

01:11 <sorear> yes, that's what you have stated you want, but it's not super well optimized for in either sel4 (to map 50 pages into another address space you need to make 50 syscalls) or hardware (TLB flushing)

01:13 <xentrac> hmm, I wonder if there's a way to get around the TLB flushing problem. obviously the 50-syscalls problem is a thing you could fix

01:13 <xentrac> without custom hardware :)

01:16 <xentrac> a typical graphics window is on the order of 4 megabytes, so in that case a small number of huge pages might be a reasonable solution; copying those 4 megabytes into the framebuffer is going to take tens or hundreds of microseconds, which would swamp the cost of even a full TLB flush

01:16 <xentrac> but it's not clear that that's a good solution for things like Unix text pipelines

01:17 <xentrac> the cases where zero-copy communication matters (which is sort of what I'm really after) are the cases where the volume of data is fairly high and the computational intensity (in the HPC sense) is low

01:17 <sorear> back when wayland was new they did tests and found that the breakeven point for memory remapping was consistently around 256kb, but that was with Linux and (now) 10 year old hw

01:18 <xentrac> which way do you suppose it's gone?

01:18 <sorear> hardware side? up. tlbs have gotten bigger and the cost of a TLB miss is a bunch of extra round trips to memory, which have been getting more expensive in cycle terms

01:19 <sorear> linux vs sel4? could go either way

01:19 <xentrac> if the computational intensity is high then you might as well just copy. but I'm maybe unreasonably infatuated by this vision that I can map in a FlatBuffers file and navigate it selectively

01:19 <xentrac> follow pointers

01:19 <sorear> if you're using a marshalling system like flatbuffers it can be made toctou-safe without unmapping

01:20 <sorear> you just need to guarantee that you're accessing each word once

01:20 <xentrac> ?

01:21 <sorear> "if there's a way to get around the TLB flushing" theoretically it's straightforward to make the TLB coherent with an inclusive L2 cache, this would have adverse effects on TLB reach that are probably large but hard for me estimate without trying it

01:22 <sorear> if you never access a word in the buffer more than once at the asm level (in particular you are using volatile/atomic reads), then it doesn't matter whether it's being concurrently modified or not because you will see the old version or the new version

01:23 <jrtc27> would your coherent TLB roll back speculative TLB hits?

01:24 <jrtc27> or would you still want some form of fence that doesn't flush the TLB, just the pipeline

01:24 <sorear> the latter is simpler since the ISA already has sfence.vma (and fence.i)

01:25 <jrtc27> yeah

01:25 <jrtc27> I don't think you want to have it as speculative state for every instruction...

01:25 <jrtc27> :P

01:25 <sorear> the Fun Part here is that each TLB entry is in the worst case pinning 3 cache lines

01:26 <sorear> or much worse if you have sv48+H

01:26 <jrtc27> yeah... though you could do exclusive on the proviso that DMA isn't coherent

01:26 <jrtc27> (because wth are you doing DMA'ing to page tables)

01:29 <sorear> you have a memory address instruction spanning two pages and accessing a third and suddenly you need like 50 associativity to guarantee forward progress

01:30 <jrtc27> 3*4*2 is "only" 24 :)

01:31 <sorear> 3*4^4

01:31 <jrtc27> can make it 32 if it's an unaligned access supported in hardware

01:31 <sorear> 3*4^2 rather

01:31 <jrtc27> why squared?

01:31 <sorear> because each level of the stage 1 PTW requires a separate stage 2 PTW

01:31 adjtm has joined #riscv

01:31 <jrtc27> oh

01:31 <jrtc27> oh right

01:31 <jrtc27> those are guest physical

01:32 <jrtc27> is it not 5*4 then?

01:32 <jrtc27> 4 PTEs and the actual page you want

01:32 <sorear> get the cursed thing working and then realize you have 3/4 of a HTM

01:32 <jrtc27> 5 guest physical addresses

01:32 <sorear> I think so but didn't want to do that much math that quickly

01:32 <jrtc27> pretty sure it's 3*5*4=60 then :D

01:35 <jrtc27> which I believe is what the kids call a "big oof"

01:36 <sorear> not sure if I've seen a fleshed out design for a variable-associativity victim cache (in hw - qemu's tlb doesn't count)

01:39 <sorear> really though you just mark the TLB entries as "potentially stale" when their underlying cache lines are evicted, and ignore that bit until the sfence

01:40 <jrtc27> hence my comment about exclusivity being fine so long as DMA isn't a thing

01:41 <sorear> I don't understand that bit. what is being exclusive of what, and how does DMA come in?

01:42 <jrtc27> as in, what you said

01:42 <jrtc27> you're allowed to evict from the L2

01:42 <jrtc27> without evicting from the TLB

01:42 <jrtc27> exclusive is probably the wrong term, but it's not an inclusive cache

01:42 <jrtc27> and the DMA is just because if it's not in the L2 you probably aren't snooping DMA to make it coherent with the TLBs

01:43 <jrtc27> (seems NINE, Non-Inclusive Non-Exclusive, is the term...)

02:45 riff_IRC has quit [Read error: Connection reset by peer]

02:45 ovh has joined #riscv

02:56 davidlt has joined #riscv

02:58 <xentrac> sorear: true, I suppose it's impossible to distinguish a concurrent modification from a previous incomplete modification if you're only accessing each word in the potentially shared region at most once. but of course it doesn't ensure that what you see is consistent, which is the property of interest in my framebuffer example — but relying on the data you see to be consistent in order to, say, not

02:58 <xentrac> crash or disclose secrets, means you are relying on its sender!

02:59 <xentrac> 60 is horrifyingly bad, like VAX bad

03:00 <sorear> exactly, if you're trusting the sender you can trust the sender to respect synchronization, if you're *not* trusting the sender then a "torn" page is no worse than what the sender might otherwise have sent

03:00 <xentrac> I think this is pointing out some important holes in my thinking, and I really appreciate these insights

03:01 <xentrac> yeah. echoing my previous whining about hypervisor shibboleths, I've been trying to shift from "trust" terminology to "rely on" terminology for a couple of reasons:

03:02 <xentrac> 1. "trust" has lots of warm fuzzies associated with it which are extremely counterproductive in security discussions, since "trusting" other components to function properly is the thing we want to minimize

03:04 <xentrac> 2. the whole template is something like "in order to do W, X relies on Y to do/not do Z" and "rely on" seems to encourage people to at least *mention* W in a way that "trust" does not (although maybe it should; I think it was Alan Karp who said he trusts his relatives to watch his kids but not keep his money, but he trusts his bank to keep his money but not watch his kids)

03:05 <xentrac> what do you think?

03:06 <sorear> i agree with the importance of specifying WXYZ. *bangs drum* security is meaningless without a threat model!

03:06 <xentrac> heh

03:06 <xentrac> amen brother

03:06 <xentrac> or sister

03:06 <xentrac> amen sibling!

03:08 <TwoNotes> VAX was a product of its times

03:09 <xentrac> so was the 68010 and it didn't have a way to cause 60 page faults in one instruction

03:10 <xentrac> another of my sort of motivating examples is that I'd like to be able to run very short processes to control information flow, and doing this efficiently sort of suggests mapping in most of the data the process could want in a FlatBuffers-like form. maybe only into its virtual address space, though, rather than necessarily prefetching it from your SSD

03:11 <xentrac> and you don't necessarily want that stuff mapped read/write

03:12 <TwoNotes> VAX put the Complex into CISC

03:12 <xentrac> Linux takes about 100 μs to fork+exit+wait, which is a pretty discouragingly large amount of overhead for a fundamental security isolation primitive

03:13 <TwoNotes> Trouble was, they let the high-level language people and mathmaticians design the instruction set. I know - I was there

03:14 <xentrac> yeah? what were you working on?

03:14 <TwoNotes> Bliss compilers

03:16 <xentrac> ever write a bliss-86?

03:16 <xentrac> by contrast a context switch between existing processes is down below 10 μs

03:17 <xentrac> I've never written anything in bliss-*, closest I've come is various Forths

03:20 <sorear> the last time i benchmarked context switches on linux, on an ultra low power laptop, i was getting 5 µs for pipes or futexes and 20 µs for TCP sockets

03:20 <xentrac> sounds about right

03:21 <xentrac> the order-of-magnitude performance difference makes it tempting to keep a process running longer than would be ideal for security purposes. Lucet can start and stop a wasm "process" in more like 10 μs, so there's less incentive to reuse possibly-corrupted memory state across, for example, data received from multiple identities

03:22 <xentrac> (Linux with glibc is closer to 600 μs for fork+exit+wait!)

03:22 <xentrac> of course in the Lucet case, you're relying on the Lucet compiler as well as the CPU, and these days the CPU is already bad enough

03:23 <xentrac> (to provide isolation between the "processes")

03:23 <xentrac> TwoNotes: what's your favorite language these days?

03:25 ovh is now known as riff-IRC

03:48 <TwoNotes> I like the odd ones

03:48 <TwoNotes> Erlang, for example

03:49 <TwoNotes> Forth is cool.

03:49 <TwoNotes> Bliss was in a battle against Pascal to be THE system programming language. They chose Bliss at DEC

03:49 <TwoNotes> Then in the end it was C that won out

03:50 <TwoNotes> But programming in RISC-V AS is really fun. Remdinds me a lot of IBM 360 BAL

03:51 <TwoNotes> I spent way too long programming in Java to not want to ever look at it again

03:52 <TwoNotes> ANother one I have dabbled in is COmmon Lisp

03:58 <xentrac> C is kind of like a cross between BLISS and Pascal

03:59 * sorear , knowing pascal and C, tries to extrapolate

04:01 <xentrac> sorear: https://en.wikipedia.org/wiki/BLISS#Source_example

04:08 davidlt has quit [Ping timeout: 245 seconds]

04:20 <TwoNotes> The Bliss compiler had a VERY powerful macro package. Most people could not make use of it because it was quite complicated. I was quite good at it because I maintained the part of the compiler that implemented it.

04:20 <sorear> C superior due to its support for lowercase letters? /s

04:21 <sorear> TwoNotes: isn't that how it always goes

04:22 <TwoNotes> Just about all of thge compilers for the VAX were written in Bliss

04:23 <TwoNotes> https://compilers.iecc.com/comparch/article/87-07-029

04:23 rvalles has quit [Read error: Connection reset by peer]

04:23 rvalles has joined #riscv

04:24 TwoNotes has quit [Quit: Leaving]

04:25 <sorear> ~rust~ bliss compiler written in bliss and making "excessive" use of advanced language features? the more things change...

04:35 <xentrac> heh

04:37 <xentrac> https://compilers.iecc.com/comparch/article/87-08-003 talks a little about what I mean by C being a cross between BLISS and Pascal

05:23 rvalles has quit [Read error: Connection reset by peer]

05:23 rvalles has joined #riscv

05:36 smartin has joined #riscv

05:38 zjason` has joined #riscv

05:40 zjason has quit [Read error: Connection reset by peer]

05:43 davidlt has joined #riscv

06:42 frost has joined #riscv

07:18 cmuellner has quit [Ping timeout: 245 seconds]

07:49 Sos has joined #riscv

07:53 adjtm has quit [Ping timeout: 272 seconds]

08:08 valentin has joined #riscv

08:10 hendursaga has quit [Ping timeout: 252 seconds]

08:15 hendursaga has joined #riscv

08:30 davidlt has quit [Ping timeout: 272 seconds]

08:49 cmuellner has joined #riscv

09:17 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

09:18 TMM_ has joined #riscv

10:07 choozy has joined #riscv

10:10 usama has joined #riscv

10:13 choozy has quit [Remote host closed the connection]

11:11 TwoNotes has joined #riscv

11:40 zjason` is now known as zjason

11:48 choozy has joined #riscv

12:31 choozy has quit [Ping timeout: 272 seconds]

12:38 tgamblin has quit [Remote host closed the connection]

12:40 tgamblin has joined #riscv

12:56 <hendursaga> TwoNotes: Common Lisp you say? Have you heard of the Nyxt browser? You might like it...

12:57 wingsorc has joined #riscv

13:00 Andre_H has joined #riscv

13:32 davidlt has joined #riscv

13:54 <TwoNotes> A browser written in Lisp?

13:59 choozy has joined #riscv

14:13 jotweh has quit [Ping timeout: 268 seconds]

14:24 <leah2> can i see the cpu frequency from freedom-sdk userland?

14:24 <leah2> (on a unmatched)

14:50 <enthusi> with freedom-sdk you mean not as described in the SiFive forum thread?

15:00 <leah2> i mean some official image

15:00 <leah2> cpupower doesnt seem to support it, and the dmesg doesnt show i i think

15:03 jotweh has joined #riscv

15:16 frost has quit [Quit: Connection closed]

15:17 choozy has quit [Ping timeout: 245 seconds]

15:28 Andre_H has quit [Ping timeout: 272 seconds]

15:31 <jimwilson> leah2, cpupower frequency driver not ported yet, you can get an estimate by using "perf stat /bin/ls", for exact value you can read clock config with devmem2 and decode it as I mentioned in the forums

15:38 <leah2> cute trick, thanks

15:50 usama has quit [Quit: Leaving.]

16:01 usama has joined #riscv

16:02 <hendursaga> TwoNotes: Well, the renderer isn't in CL, but the rest is pretty much all CL. Really crazy stuff.

16:07 iorem has quit [Quit: Connection closed]

16:11 Andre_H has joined #riscv

16:14 <TwoNotes> From the appropriate PLL CSR data you can figure out the clock freuency provided you know what the installed XTAL is.

16:14 cwebber has joined #riscv

16:21 TwoNotes has quit [Remote host closed the connection]

16:22 TwoNotes has joined #riscv

16:24 wingsorc has quit [Quit: Leaving]

16:27 FluffyMask has joined #riscv

16:35 vagrantc has joined #riscv

16:36 psydroid has quit [Quit: node-irc says goodbye]

16:36 llamp[m] has quit [Quit: node-irc says goodbye]

16:36 demostanis[m] has quit [Quit: node-irc says goodbye]

16:36 khem has quit [Quit: node-irc says goodbye]

16:36 ahs3[m] has quit [Quit: node-irc says goodbye]

16:37 llamp[m] has joined #riscv

16:40 demostanis[m] has joined #riscv

16:40 psydroid has joined #riscv

16:40 ahs3[m] has joined #riscv

16:40 khem has joined #riscv

16:55 choozy has joined #riscv

17:27 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

17:28 TMM_ has joined #riscv

17:31 usama has quit [Quit: Leaving.]

17:35 riff_IRC has joined #riscv

17:37 riff-IRC has quit [Ping timeout: 252 seconds]

17:49 riff_IRC has quit [Remote host closed the connection]

17:49 riff_IRC has joined #riscv

18:10 usama has joined #riscv

18:14 mahmutov has joined #riscv

18:28 ats has quit [Ping timeout: 272 seconds]

18:31 ats has joined #riscv

18:53 usama has quit [Ping timeout: 264 seconds]

18:58 jeancf has joined #riscv

19:18 jeancf has quit [Ping timeout: 272 seconds]

19:19 <xentrac> have a somber Tiananmen Square Day

19:20 davidlt has quit [Ping timeout: 264 seconds]

19:31 <riff_IRC> ^ lol

20:11 smartin has quit [Quit: smartin]

20:15 mahmutov has quit [Read error: Connection reset by peer]

20:18 mahmutov has joined #riscv

20:19 valentin has quit [Quit: Leaving]

20:27 mhorne has quit [Ping timeout: 245 seconds]

20:27 mhorne has joined #riscv

20:29 choozy has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

20:42 ahs3 has quit [Ping timeout: 252 seconds]

20:48 ahs3|afk has joined #riscv

20:49 SwitchToFreenode has quit [Remote host closed the connection]

20:49 KREYREEN has joined #riscv

21:00 Sos has quit [Quit: Leaving]

21:59 ahs3|afk has quit [Ping timeout: 268 seconds]

22:00 TwoNotes has quit [Quit: Leaving]

22:04 ahs3|afk has joined #riscv

22:28 ahs3|afk has quit [Ping timeout: 245 seconds]

22:31 ats_ has joined #riscv

22:31 ats has quit [Ping timeout: 268 seconds]

22:58 Andre_H has quit [Ping timeout: 265 seconds]

23:32 elastic_dog has quit [Quit: elastic_dog]

23:34 elastic_dog has joined #riscv

23:43 jellydonut has quit [Quit: jellydonut]