#riscv on 2024-01-24 — irc logs at libera.irclog.whitequark.org

2023-08-11 11:05 sorear changed the topic of #riscv to: RISC-V instruction set architecture | https://riscv.org | Logs: https://libera.irclog.whitequark.org/riscv | Matrix: #riscv:catircservices.org

00:04 geertu has quit [Ping timeout: 256 seconds]

00:13 mlw has joined #riscv

00:13 notgull has joined #riscv

00:22 shamoe has quit [Quit: Connection closed for inactivity]

00:23 pecastro has quit [Ping timeout: 264 seconds]

00:31 geertu has joined #riscv

00:38 shamoe has joined #riscv

00:43 khem has joined #riscv

00:49 jacklsw has joined #riscv

01:06 MaxGanzII_ has quit [Ping timeout: 240 seconds]

01:14 balrog has quit [Quit: Bye]

01:16 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

01:16 TMM_ has joined #riscv

01:18 balrog has joined #riscv

01:20 notgull has quit [Ping timeout: 256 seconds]

01:34 HumanG33k has quit [Ping timeout: 276 seconds]

01:42 Tenkawa has quit [Quit: Was I really ever here?]

01:47 HumanG33k has joined #riscv

02:19 mlw has quit [Ping timeout: 256 seconds]

02:56 notgull has joined #riscv

03:14 BootLayer has joined #riscv

03:29 mlw has joined #riscv

03:32 <sorear> RV32 capabilities aligned mod 4 with a length between 256 and 508 can be encoded in two redundant ways (T8=1 EF=1 or EF=0 E=0)?

03:35 mlw has quit [Ping timeout: 252 seconds]

03:36 mlw has joined #riscv

03:37 <jrtc27> if EF=1 then E includes T8?

03:41 <jrtc27> need to think some more tomorrow

03:44 <jrtc27> yeah ok, right

03:44 <jrtc27> T[EW / 2 - 1:0] = TE

03:44 <jrtc27> B[EW / 2 - 1:0] = BE

03:45 <jrtc27> if you try and make TE and BE big enough to make E=0, then the low bits of at least one of T and B are non-zero

03:46 <jrtc27> if you try and make TE and BE 0 to make it match the EF=0 case you're trying to redundantly encode, then E will be > 0 and your LMSB gets shifted up

03:47 <jrtc27> once this is in Sail (I don't know why it isn't to be honest, it's not that hard...) it can easily be model checked to ensure there aren't two non-malformed capabilities that mean the same thing

03:48 <sorear> I'm adding a note to my feedback to treat E=0 EF=0 XLEN=32 as malformed bounds, E=-1 EF=0 already is and I'm just moving the compare by one

03:49 mlw has quit [Ping timeout: 264 seconds]

03:50 <sorear> (I've finally figured out what the encoding and representability is supposed to be, it's written far more complicated than it is)

03:50 <jrtc27> E=0 IE=1 in normal CHERI Concentrate is a normal thing to have

03:50 mlw has joined #riscv

03:50 <jrtc27> not sure if the T8 situation changes that

03:51 <jrtc27> but my instinct is it shouldn't

03:52 <sorear> T8 doubles the "subnormal" range from [0,255) to [0,511) completely covering and AFAICT eliminating the need for E=0 normals which are [256,511)

03:52 * muurkha is in the subnormal range

03:53 <jrtc27> oh hm this is the EF=1 case which means E is always 0

03:53 <jrtc27> maybe you're right then

03:56 heat_ has quit [Remote host closed the connection]

03:56 <sorear> hypothesis: the post-Moore chasing of basis points on cost efficiency and energy efficiency will eventually lead to computer systems where a majority of memory is undervolted to the point of being noticeably unreliable; software will need to adapt to this situation and not store capabilities or pointers in unreliable memory

03:57 heat_ has joined #riscv

03:58 <muurkha> sorear: that's an interesting idea, but wouldn't it be more sensible to use ECC?

03:58 <sorear> operationally: "cacheable but not tagged" is a PMA combination that is meaningful and likely to become and remain common, not just in weird transitional and CXL setups

03:58 <jrtc27> that would be a fundamental paradigm shift

03:58 <muurkha> it's already the case that a majority of memory is noticeably unreliable if you overheat it or rowhammer it

03:58 <muurkha> and sandbox escapes with heat lamps and hair dryers have been demonstrated

03:59 <jrtc27> if you have memory that gets corrupted all bets are off today, you don't need capabilities for that

04:00 <sorear> normal ECC (36/72-bit Hamming code) isn't strong enough to be useful under adversarial conditions and the codes that are strong enough are noticeably expensive

04:00 <muurkha> (in the current context of sandboxes mostly being used against users, this is fortunate for, though I know jrtc27 really doesn't like me talking about this, fundamental human rights)

04:02 <sorear> it's not clear whether the physical access game is a win for the attacker or the defender. biology has been playing it possibly since before there were cells with no clear consensus

04:02 <muurkha> agreed

04:03 <muurkha> hmm, I realize I don't actually know how cheap you can make stronger ECC. but I think it's plausibly easier if you can slap the ECC on top of some kind of larger block transfer, like a 16-byte cache line or 2048-byte page, which is what NAND Flash chips routinely do already

04:03 <sorear> it's mainly a function of block size and latency

04:04 <muurkha> but how much latency does hardware Reed-Solomon decoding necessarily impose on a memory read?

04:04 <muurkha> I mean, plausibly you don't want to do something like Gallager codes which can take a variable amount of time to decode

04:05 <sorear> remember that most reads are of recently written data (compared to a NAND chip that's been sitting unpowered on a shelf for a year, anyway) with no bitflips, so you mostly only need to _optimise_ the zero bit error case

04:06 <sorear> all widely used binary codes are linear, so you can check if the codeword is valid with a limited number of XOR gates
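
[editor's note] A minimal sketch of the "check validity with XOR gates" point above, using a Hamming(7,4) code as a stand-in. The code choice and the names `syndrome` and `encode` are assumptions made for illustration; the log does not say which code a real memory controller would use. For a linear code the syndrome is a fixed XOR of received bits, and a zero syndrome means the word is a valid codeword, so the common no-error case needs no decoding work.

```python
# Hamming(7,4) with bit positions numbered 1..7; parity bits sit at 1, 2 and 4.
# This is an illustrative code, not the one used by any particular DRAM or SoC.

def syndrome(bits):
    """XOR together the positions of all set bits; 0 means a valid codeword."""
    s = 0
    for pos, b in enumerate(bits, start=1):
        if b:
            s ^= pos
    return s

def encode(data):
    """Place 4 data bits at positions 3, 5, 6, 7 and set parity so the syndrome is 0."""
    bits = [0] * 7
    for pos, b in zip((3, 5, 6, 7), data):
        bits[pos - 1] = b
    s = syndrome(bits)
    for p in (1, 2, 4):
        if s & p:
            bits[p - 1] = 1
    return bits

codeword = encode([1, 0, 1, 1])
print(syndrome(codeword))      # valid codeword: syndrome is zero

corrupted = list(codeword)
corrupted[4] ^= 1              # flip the bit at position 5
print(syndrome(corrupted))     # nonzero: gives the position of the flipped bit
```

In hardware each syndrome bit is one XOR tree over a fixed subset of the received bits, which is why the zero-error fast path is cheap.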

04:06 <muurkha> I wonder if you could do the ECC during DRAM refresh

04:08 mlw has quit [Ping timeout: 256 seconds]

04:08 <sorear> LDPC is problematic for a security feature because there's no rigorous theory, there's extensive experimental data but that can't observe events with probability below 2^-64 or so

04:09 <sorear> periodic scrubbing is a normal feature of ECC systems. combining it with LPDDR won't work because your codewords are spread out over multiple chips, and refresh cycles take place inside each chip with the pin drivers off to conserve power, I think modern DDR works the same way

04:10 <sorear> I vaguely recall GDDR storing cache lines in a single chip each, but that's a very different latency/throughput/power tradeoff

04:11 <muurkha> like, https://download.semiconductor.samsung.com/resources/user-manual/x16%20only_8G_C_DDR4_Samsung_Spec_Rev1.5_Apr.17.pdf says the page size ("number of bytes of data delivered from the array to the internal sense amplifiers when an ACTIVE command is registered") on their 512 megabits × 16 chips is 2 kilobytes

04:12 <muurkha> DRAM has to "scrub" through all its pages every few milliseconds to refresh it by delivering it from the array to its internal sense amplifiers

04:12 <muurkha> which re-up the charge on the capacitors

04:13 <muurkha> in order to remain reliable at the rated temperature range (though as the cold boot attack showed, they remain disturbingly reliable for orders of magnitude longer than that at low temperatures)

04:13 <sorear> many chips these days have internal ECC to optimise the retention/BER tradeoff, and you can scrub that, but "ECC" normally refers to end-to-end ECC which needs to be split between chips so that a dead chip doesn't cause data loss

04:13 <muurkha> in the case of this chip, if I'm reading this right, the rated refresh interval is 7.8 milliseconds

04:15 <muurkha> at 0.75 nanoseconds per clock cycle, if you were doing a refresh every clock cycle, you'd refresh the whole chip in 390μs, which is a lot less than 7.8 milliseconds
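
[editor's note] The 390μs figure above can be reproduced with a short back-of-the-envelope calculation. This is only a sketch under the assumptions stated in the log: a 512 Mi x 16 organisation, a 2 KiB page, a 0.75 ns clock, and the hypothetical of refreshing one page per clock cycle, which is not how a real part schedules refresh. The function name `full_sweep` is made up for this note.

```python
# Figures as quoted in the log; none of them are checked against the datasheet here.
WORDS = 512 * 2**20    # 512 Mi addressable words
WORD_BITS = 16         # x16 organisation
PAGE_BYTES = 2048      # 2 KiB delivered to the sense amplifiers per ACTIVE
CLOCK_NS = 0.75        # clock period in nanoseconds

def full_sweep(words=WORDS, word_bits=WORD_BITS,
               page_bytes=PAGE_BYTES, clock_ns=CLOCK_NS):
    """Return (number of pages, microseconds to touch every page once),
    assuming one page is refreshed per clock cycle."""
    total_bits = words * word_bits
    pages = total_bits // (page_bytes * 8)
    return pages, pages * clock_ns / 1000.0

pages, sweep_us = full_sweep()
print(pages, sweep_us)
```

With these inputs the chip has 2^19 pages and a full sweep takes a little under 400 microseconds, which agrees with the rounded 390μs in the message.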

04:15 <muurkha> so I think you could quite plausibly design in an ECC circuit which detects bit errors during refresh and rewrites the corrected data to the page

04:16 <muurkha> and it could operate over an entire two-kilobyte page

04:17 <muurkha> you wouldn't want to do this off-chip because it would require wiring the 16384 outputs of the sense amplifier lines off-chip to the ECC circuitry

04:18 <muurkha> does that make sense? I think I'll stick this in pavnotes2. how should I call you there, sorear?

04:19 <sorear> you are literally describing how DDR5 works, I don't know how it's exposed in the spec but "in-chip ECC, not the same as ECC on the wire" is a builtin feature

04:20 <muurkha> wow, I had no idea, thanks for telling me :)

04:20 <muurkha> I guess I won't write it in pavnotes2 anyway

04:20 <sorear> (sorear) sure

04:20 <muurkha> *in that case

04:21 <muurkha> but in that case wouldn't we not have to worry about making memory noticeably unreliable because of undervolting?

04:21 <muurkha> we can always just use more ECC

04:25 <sorear> most bus protocols these days support critical-word-first - if I read miss word 5 of an 8-word cache line, the data comes back from RAM in order 5,6,7,0,1,2,3,4 or 5,4,7,6,1,0,3,2 and stays in that order all the way to the load/store unit, saving a few cycles
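
[editor's note] The two return orders quoted above can be generated mechanically. This sketch assumes the first is a plain wrap-around from the missed word and the second is the missed word's index XORed with a counter. That reading happens to reproduce both sequences in the message, but the log does not name the schemes, and `wrap_order` and `interleaved_order` are illustrative names rather than terms from any bus specification.

```python
def wrap_order(first, n=8):
    """Start at the missed word and wrap around the line: first, first+1, ... mod n."""
    return [(first + i) % n for i in range(n)]

def interleaved_order(first, n=8):
    """XOR the missed word's index with 0..n-1; n is assumed to be a power of two."""
    return [first ^ i for i in range(n)]

print(wrap_order(5))          # [5, 6, 7, 0, 1, 2, 3, 4]
print(interleaved_order(5))   # [5, 4, 7, 6, 1, 0, 3, 2]
```

Either way the requested word arrives first, so the load can complete before the rest of the line has been transferred.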

04:26 <muurkha> nice

04:27 <muurkha> I should get to work on some actual hardware rather than fantasy inaccessible hardware (I'm a long way from being able to fab DRAM)

04:27 <sorear> if you're doing ECC or MAC over the entire cache line, or longer, you need to either force the entire line to wait in the memory controller, or have some mechanism to tell the core "that word I gave you ten cycles ago wasn't actually valid, please pipeline replay" which is not a feature that exists in AMBA/ACE or TileLink, although IF/HT might have it

04:28 <muurkha> oh, I wasn't suggesting doing it at read time

04:28 <muurkha> I was suggesting doing it at refresh time

04:28 <muurkha> every 7.8 milliseconds or whatever

04:30 heat_ has quit [Remote host closed the connection]

04:30 mlw has joined #riscv

04:30 heat_ has joined #riscv

04:31 <muurkha> it's still possible for a bit to flip undetectedly in the 7.8 milliseconds since the last refresh, but it would almost surely have to be a single bit, no? which the 72-bit Hamming code will have no trouble correcting, once Intel stops using that as price discrimination

04:33 <muurkha> (and even if Intel does, you can still do it in the RAM chip)

05:31 heat_ has quit [Ping timeout: 264 seconds]

05:34 BootLayer has quit [Quit: Leaving]

05:42 alexghiti has joined #riscv

06:07 notgull has quit [Ping timeout: 264 seconds]

06:08 <sorear> jrtc27: i'm going to finish reading the draft, do an edit pass, remove things that are already reported and make one or more issues tomorrow or friday but if you want an early look https://gist.github.com/sorear/f248aef96641a010c5d2eee848e600e9

06:17 <jrtc27> on "mepcc need never hold a sealed capability." specifically: yes it absolutely does; morello screwed this up and didn't allow it, which means cheribsd has some gross workarounds to emulate unsealing sentries in celr_el1

06:18 <muurkha> oops

06:18 <jrtc27> there are various cases where privileged software gets a function pointer from userspace, which is a sentry

06:19 <jrtc27> thread creation, set_context and signal handlers all need to mess with that on morello

06:20 <jrtc27> (specifically called out in 3.9 Sealed Entry Capabilities of ISAv9 because Arm screwed this up / based Morello on an earlier CHERI-MIPS that also made this mistake before we realised and then fixed it)

06:23 <jrtc27> and re zcheri_legacy, no, ddc is handled like any other user-accessible register that affects S-mode in S-mode's trap handler

06:23 <jrtc27> it just saves it and switches context as needed

06:23 <jrtc27> (in purecap)

06:24 <jrtc27> since the S-mode OS is capability-aware

06:24 <jrtc27> (and if it's not then it shouldn't have enabled capability use even for itself in the first place, so there is no CHERI)

06:25 <jrtc27> the other points I either agree with, need more time to think about, or disagree with, but those need a longer response than can be given here and now

07:00 Kyuvi has joined #riscv

07:05 zBeeble42 has joined #riscv

07:06 zBeeble has quit [Ping timeout: 240 seconds]

07:09 Kyuvi has quit [Ping timeout: 250 seconds]

07:36 MaxGanzII_ has joined #riscv

07:37 markh has quit [Remote host closed the connection]

07:52 shamoe has quit [Quit: Connection closed for inactivity]

07:52 ZipCPU has quit [Ping timeout: 260 seconds]

07:53 ZipCPU has joined #riscv

07:59 Stat_headcrabed has joined #riscv

08:00 Stat_headcrabed has quit [Client Quit]

08:01 Stat_headcrabed has joined #riscv

08:17 davidlt has joined #riscv

08:17 heat_ has joined #riscv

08:18 davidlt has quit [Remote host closed the connection]

08:22 davidlt has joined #riscv

08:27 ldevulder has joined #riscv

08:31 jobol has joined #riscv

08:34 Stat_headcrabed has quit [Ping timeout: 246 seconds]

08:45 markh has joined #riscv

08:56 danilogondolfo has joined #riscv

08:57 Stat_headcrabed has joined #riscv

08:58 Stat_headcrabed has quit [Client Quit]

08:59 jacklsw has quit [Ping timeout: 264 seconds]

09:06 Andre_Z has joined #riscv

09:17 davidlt has quit [Ping timeout: 264 seconds]

09:18 pecastro has joined #riscv

09:36 Andre_Z has quit [Quit: Leaving.]

09:38 <sorear> jrtc27: (sentries) i did not realize that sentries were the intended ABI for function pointers, partly since I'm intentionally reviewing this mostly without reference to ISAv9, partly since cjalr accepts both, partly since there's no CADDISEAL but materializing a function pointer is rare enough that it doesn't matter if it takes three instructions. OK.

09:39 crossdev has joined #riscv

09:40 <sorear> jrtc27: (zcheri_legacy) ah, but our caps-naive S-mode OS _didn't_ enable capability use for itself, it's running with menvcfg.CME=0 ... which isn't enough to prevent U-mode access to ddc, only changing UXL does that

09:58 davidlt has joined #riscv

10:25 <sorear> challenge: what's the minimum number of representability checkers you need to add to a simple pipelined Zcheri+MSU, other than the obvious one in the execution stage to handle CINCOFFSET, load/store offsets, etc.? depressingly high

10:39 czy has quit [Remote host closed the connection]

10:39 czy has joined #riscv

10:45 notgull has joined #riscv

11:29 ezulian has quit [Quit: ezulian]

11:29 ezulian has joined #riscv

11:33 psydroid has joined #riscv

11:57 MaxGanzII_ has quit [Ping timeout: 240 seconds]

12:14 alexghiti has quit [Ping timeout: 256 seconds]

12:19 notgull has quit [Ping timeout: 264 seconds]

12:19 anonpreet has joined #riscv

12:21 anonpreet has quit [Remote host closed the connection]

12:22 anonpreet has joined #riscv

12:24 KREYREN__ has quit [Remote host closed the connection]

12:25 KREYREN__ has joined #riscv

12:45 Stat_headcrabed has joined #riscv

12:45 ntwk has joined #riscv

12:47 Stat_headcrabed has quit [Client Quit]

12:48 Stat_headcrabed has joined #riscv

12:49 Stat_headcrabed has quit [Client Quit]

12:49 Stat_headcrabed has joined #riscv

12:52 Stat_headcrabed has quit [Client Quit]

12:53 anonpreet has quit [Remote host closed the connection]

13:00 maxinux has quit [Quit: Brb]

13:14 jmdaemon has quit [Ping timeout: 256 seconds]

13:22 shamoe has joined #riscv

13:27 <sorear> review complete, gist updated, filing issues now

13:50 MaxGanzII_ has joined #riscv

13:50 MaxGanzII_ has quit [Remote host closed the connection]

13:51 MaxGanzII_ has joined #riscv

13:55 MaxGanzII_ has quit [Remote host closed the connection]

14:22 hightower2 has quit [Ping timeout: 276 seconds]

14:22 ntwk has quit [Read error: Connection reset by peer]

14:23 Tenkawa has joined #riscv

14:23 ntwk has joined #riscv

14:58 hightower2 has joined #riscv

15:12 <sorear> (zcheri_legacy) https://github.com/riscv/riscv-cheri/issues/39 we'll see if I successfully communicated any of this

15:13 Nixkernal has joined #riscv

15:43 ntwk has quit [Quit: ntwk]

16:03 frkazoid333 has joined #riscv

16:04 frkzoid has quit [Ping timeout: 260 seconds]

16:10 <jrtc27> sorear: it's quite possible not enough enable bits made it into the current draft

16:10 <jrtc27> I thought I'd been pretty clear about what was needed...

16:27 KREYREN_ has joined #riscv

16:29 KREYREN__ has quit [Ping timeout: 240 seconds]

16:31 alexghiti has joined #riscv

16:45 Tenkawa has quit [Quit: Was I really ever here?]

16:47 maxinux has joined #riscv

16:53 heat_ has quit [Remote host closed the connection]

16:53 heat has joined #riscv

16:59 BootLayer has joined #riscv

17:01 vagrantc has joined #riscv

17:01 heat has quit [Remote host closed the connection]

17:01 heat has joined #riscv

17:17 another| has quit [Remote host closed the connection]

17:19 another has joined #riscv

17:32 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

17:32 TMM_ has joined #riscv

17:43 cronos has quit [Quit: ZNC - https://znc.in]

17:44 cronos has joined #riscv

17:45 Tenkawa has joined #riscv

17:47 ___nick___ has joined #riscv

17:52 cronos has quit [Quit: ZNC - https://znc.in]

17:52 cronos has joined #riscv

17:53 cronos has quit [Client Quit]

18:01 bjdooks has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

18:04 bjdooks has joined #riscv

18:08 cronos has joined #riscv

18:11 ldevulder has quit [Ping timeout: 264 seconds]

18:27 ldevulder has joined #riscv

18:27 jobol has quit [Quit: Leaving]

18:29 cronos has quit [Quit: ZNC - https://znc.in]

18:30 cronos has joined #riscv

18:31 cronos has quit [Client Quit]

18:31 KREYREN_ has quit [Remote host closed the connection]

18:31 cronos has joined #riscv

18:32 KREYREN_ has joined #riscv

19:20 Andre_Z has joined #riscv

19:27 justache has quit [Read error: Connection reset by peer]

19:28 justache has joined #riscv

19:31 duckworld has quit [*.net *.split]

19:31 duckworld has joined #riscv

19:39 BootLayer has quit [Quit: Leaving]

19:42 justache has quit [Read error: Connection reset by peer]

19:49 justache has joined #riscv

19:51 shamoe has quit [Quit: Connection closed for inactivity]

20:02 KREYREN__ has joined #riscv

20:04 KREYREN_ has quit [Ping timeout: 240 seconds]

20:06 vagrantc has quit [Quit: leaving]

20:12 vagrantc has joined #riscv

20:22 davidlt has quit [Ping timeout: 260 seconds]

20:25 another is now known as another|

20:36 ___nick___ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

20:38 ___nick___ has joined #riscv

20:38 ___nick___ has quit [Client Quit]

20:41 ___nick___ has joined #riscv

20:43 crossdev has quit [Remote host closed the connection]

20:59 KREYREN_ has joined #riscv

21:01 KREYREN__ has quit [Ping timeout: 240 seconds]

21:04 ___nick___ has quit [Ping timeout: 260 seconds]

21:07 shamoe has joined #riscv

21:33 EchelonX has joined #riscv

21:36 ntwk has joined #riscv

21:40 KREYREN__ has joined #riscv

21:42 KREYREN_ has quit [Ping timeout: 240 seconds]

21:58 epony has joined #riscv

22:32 psydroid has quit [Quit: KVIrc 5.0.0 Aria http://www.kvirc.net/]

22:33 vagrantc has quit [Quit: leaving]

22:50 ntwk has quit [Read error: Connection reset by peer]

23:02 jmdaemon has joined #riscv

23:13 ntwk has joined #riscv

23:16 notgull has joined #riscv

23:37 Andre_Z has quit [Quit: Leaving.]

23:42 Tenkawa has quit [Quit: Was I really ever here?]