<palmer>
unlord: odd, so maybe we just have some bad codegen. I guess there's an extra factor of 10x badness in there on QEMU, but maybe that doesn't matter if it's just something like scatter-gather that goes slow on HW too.
<palmer>
(also, I updated the description)
<palmer>
Patrick is running the fuzzer in some mode that looks for performance issues like this, so hopefully we'll have some better examples soon...
<unlord>
I guess I could try these binaries with qemu-user-riscv64 and see the output
<palmer>
oh, my numbers were all user-mode
<palmer>
that's how we run SPEC and the compiler test suites, so that's what's really been hammering folks around here
<unlord>
yeah, I am seeing even worse than you posted, roughly 275x slower!
<courmisch>
palmer: full FFmpeg checkasm bench takes half an hour on K230, and I have not bothered to try on QEMU. On individual tests, I see RVV being like 60% slower than C in QEMU.
<courmisch>
while on hardware, I see anywhere from 2x to 8x speedup from C to RVV
<palmer>
ya, that's what unlord was saying too
<courmisch>
FFmpeg has many more test cases and much broader RVV instruction coverage than dav1d as of now
<courmisch>
definitely totally not to brag
<palmer>
;)
<palmer>
if you know of anything that's specifically slow then please just point me at it (or post on the QEMU bug or whatever). Patrick's going to try and get some reduced test cases from toolchain land, but I don't hack on FFmpeg so I don't really know what's up over there
<palmer>
I think getting all this fixed will probably be too big for an intern project, but hopefully there's some low-hanging fruit left we can deal with
<courmisch>
I mean, it's basically "git clone ...; cd ffmpeg ; ./configure; make tests/checkasm/checkasm ; tests/checkasm/checkasm --bench"
<gurki>
you might want to delve into the specific libraries ffmpeg uses and use them directly if you want to do benchmarking
<gurki>
ffmpeg is little more than a (admittedly very sophisticated) wrapper for these
<gurki>
encoding/decoding libraries*
<gurki>
palmer: if you have problems with rvv you might want to run a linpack
<gurki>
thats more of a hpc thing, but is a very good metric whether you get as much performance from simd as expected
<courmisch>
uh
<courmisch>
are you for real?
<courmisch>
did you crunch the numbers on native implementations vs external libraries in FFmpeg?
<gurki>
do you get significantly more performance when using e.g. libx265 within ffmpeg instead of externally when you ignore all the boilerplate video handling parts?
<gurki>
this would surprise me, but im happy to be convinced otherwise by numbers
<gurki>
i did _not_ mean to belittle ffmpeg efforts btw.
<gurki>
sorry if i gave that impression.
<courmisch>
ffmpeg is little more than a wrapper for these [specific libraries]
<courmisch>
^ literally what you wrote
<courmisch>
clearly you have no clue what you are on about, as any cursory look at the code base would invalidate such statement
<gurki>
https://trac.ffmpeg.org/wiki/CompilationGuide/Ubuntu <- i just checked whether things significantly changed. this still reads like they essentially use a lot of external libs for the heavy encoding/decoding lifting
<gurki>
you are correct about "gurki is not familiar with the code itself".
<gurki>
im a mere (happy!) user as far as ffmpeg is concerned.
<courmisch>
and libx265 doesn't have *any* RISC-V optimisation as of yet
<gurki>
so your statement is that there are quite a bunch of rv specific optimizations which have yet to be ported to these libraries, but there are some optimizations for the internal parts?
<gurki>
(no offense, genuinely trying to grasp your point)
<courmisch>
I think you're missing the point
<courmisch>
there are a few things where ffmpeg uses external libraries
<courmisch>
but by and large it does stuff natively. Otherwise gst, mpv and VLC wouldn't care to use FFmpeg
<courmisch>
even h264 and h265 are decoded natively, only encoding is delegated to x26x
<unlord>
courmisch: hey, give it some time. dav1d just got RISC-V 4 hours ago!
<courmisch>
VLC got RVV optimisations 2 years ago
<unlord>
courmisch: don't ask me, I created that dav1d MR in 2022
<unlord>
but yeah, we should probably add the FFmpeg checkasm to that QEMU issue, more tests are definitely better for anyone working on codegen in QEMU
<courmisch>
I don't really see the point in optimising QEMU RVV.
<courmisch>
doesn't this assume that any RISC-V board is a QEMU VM?
<unlord>
only if it has that file
<courmisch>
isn't that file present if OpenSBI is present?
<jrtc27>
any kernel config that enables the sbi console should have it, yes
<courmisch>
which it pretty much always is
<jrtc27>
IIRC there was a period it wasn't due to legacy sbi support being dropped prior to the new dbcn extension being added?
<unlord>
courmisch: this is really a temporary measure because of how slow QEMU is
<palmer>
unlord: we should probably just give the QEMU virt board some m{vendor,arch,impl}id values, that'd be a more reliable way to detect this kind of thing
<courmisch>
unlord: your "temporary measure" is as good as always returning false
<courmisch>
it's effectively checking if the kernel was compiled with support for the SBI console (which should always be the case). It does not distinguish QEMU from real hardware
<courmisch>
palmer: that won't help userspace
<palmer>
you can get them via hwprobe
<courmisch>
okay but even then, if you emulate real hardware, you have to fake the values, so that's pointless
<courmisch>
at least usermode QEMU can be detected by reading uname(&utsname.machine)
<courmisch>
which should return the real ISA, as opposed to riscv
<courmisch>
(obviously won't help system emulation)
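courmisch's uname() check can be sketched as below; note sorear's follow-up that qemu-user (at least under binfmt_misc) fills in the *emulated* architecture, so whether this distinguishes emulation from hardware is exactly what's in dispute. Function names are illustrative.

```c
#include <string.h>
#include <sys/utsname.h>

/* Return the kernel-reported machine string ("x86_64", "riscv64", ...). */
static const char *machine_string(void)
{
    static struct utsname u;
    if (uname(&u) != 0)
        return "";
    return u.machine;
}

/* courmisch's proposed test: does the machine field claim to be RISC-V?
   Per sorear, qemu-user reports the target ISA here, so this may be true
   under emulation as well. */
static int machine_is_riscv(void)
{
    return strncmp(machine_string(), "riscv", 5) == 0;
}
```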
<sorear>
I can't speak for the system call but "uname -a" returns the emulated architecture in a chroot running with binfmt_misc and qemu, many things would fail to compile otherwise
<sorear>
traumatize the longer-serving people here by making them remember HTIF...
<conchuod>
maybe a silly question, but outside of the virt machine do you really care about detecting whether something is qemu or not?
<sorear>
the topic was "detecting fast V" and "let's use qemu as a proxy for V being much slower than scalars", or possibly the opposite
<jrtc27>
HTIF lives on as the interface used for shutting down QEMU
<jrtc27>
by virtue of conforming to syscon-power's interface
<sorear>
I do think that hwprobe should grow at least a few flags of the form "segmented loads are as fast as unit-stride", "vrgather is as fast as arithmetic at LMUL=1", "masked operations with tu mu are as cheap as unmasked" all of which differ widely between current hw implementations and affect sw optimization
<sorear>
I am aware that getting that data in a way that satisfies kernel and firmware stakeholders will be a nigh-insurmountable challenge, but there's a clear userspace need for every library to not reinvent the runtime benchmarking wheel
<palmer>
sorear: ya, I think we're going to need a bunch of vector performance flags. The only hardware I know of is the K230; if there's other stuff we can probably start to look into the differences and see what makes sense to be generic
<palmer>
ya, we can at least get the uABI sorted out and then deal with the probing later ;)
<sorear>
gurki: fp _also_ sucks in qemu, unless integer linpack is a thing you'll get a severely biased view of vector perf
<jrtc27>
2024 may well be the year of V 1.0, SG2380 (in the Milk-V Oasis) claims to be shipping Q3
<gurki>
sorear: thank you for explaining the underlying issue in a way a gurki understands :)
<gurki>
i did not expect that
<sorear>
when I was actively working on the qemu riscv port fp instructions all generated helper calls that used the berkeley-softfloat library. I think there was an effort to use actual float instructions in the JITing in at least some cases, but qemu has always prioritized correctness over speed and there's only so much you can do to optimize implementing a different platform's NaN and flags rules
<geist>
yah the V bits in particular seem to be fairly impossible to JIT natively, due to the variable width stuff
<geist>
looks like basically every V instruction falls to a helper that does a big for loop for every element
<geist>
some internal folks at work that were working with linux on riscv on qemu have found it's much slower to enable V than to run with a machine without it
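The shape geist is describing can be sketched like this: in QEMU, a guest vector instruction leaves JITed code and calls a C helper that loops over the elements one at a time, so every guest element costs at least one host call-and-loop iteration. Names here are illustrative, not QEMU's actual helpers.

```c
#include <stdint.h>

/* Hypothetical sketch of a QEMU-style RVV helper: vadd.vv on 32-bit
   elements, executed as a plain scalar loop over the active length vl. */
static void helper_vadd_vv_w(int32_t *vd, const int32_t *vs1,
                             const int32_t *vs2, uint32_t vl)
{
    for (uint32_t i = 0; i < vl; i++) {
        vd[i] = vs1[i] + vs2[i];  /* one scalar op per guest element */
    }
}
```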
<geist>
(though looks like i just repeated basically what everyone has been saying. i should read scrollback before blabbing :) )
<sorear>
It's not like you couldn't vectorize that for loop. Maybe use tb_flags to distinguish between VL=VLMAX, where you can unroll inline and use unpredicated traditional SIMD, and VL<VLMAX where you really need lengths or predication
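sorear's specialization can be sketched as the dispatch below: when vl == VLMAX the whole register group is live, so a translator could emit one unmasked host SIMD op (the fixed-trip-count loop stands in for it); only vl < VLMAX needs lengths or predication. VLMAX and the function name are hypothetical.

```c
#include <stdint.h>

#define VLMAX 8  /* hypothetical: e.g. VLEN=256, SEW=32, LMUL=1 */

static void vadd_vl_specialized(int32_t *vd, const int32_t *vs1,
                                const int32_t *vs2, uint32_t vl)
{
    if (vl == VLMAX) {
        /* Fast path: full vector, unrollable, no predication needed;
           a JIT could emit a single unmasked host SIMD add here. */
        for (uint32_t i = 0; i < VLMAX; i++)
            vd[i] = vs1[i] + vs2[i];
    } else {
        /* Slow path: partial vector, needs masking or length handling. */
        for (uint32_t i = 0; i < vl; i++)
            vd[i] = vs1[i] + vs2[i];
    }
}
```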
<sorear>
how does risc-v external debug achieve a usable speed? reading all registers naively requires hundreds of roundtrips between the debugger and the debug module. are USB roundtrips reliably sub-ms? do we assume the existence of debug transport hardware that can do the roundtrippy bits at µs hardware speed? does lazy register and memory access work better than it sounds?
<geist>
yah, trouble is you already have to have made it into a helper function at that point
<geist>
so you're already out of JIT land, but you're right, could at least optimize that loop
<geist>
this is where maybe some templaty C++ stuff would help since the loop is basically repeated for every opcode
<jrtc27>
how much memory are you trying to read via external debug?
<jrtc27>
normal guiding principle is steer clear of bare metal debugging on any arch where possible
<jrtc27>
ie embrace a crummy uart printf :)
<sorear>
if you're targeting SVE or AVX512, most single-width LMUL=1 instructions can be turned into a single host instruction
<geist>
well, not really, because it's based on what vlen was previously set to
<sorear>
jrtc27: I'm imagining "enough memory to do the LOC + backtrace + locals most graphical debuggers do on a breakpoint" but I don't have a good idea of the problem space so that might not be the best answer
<geist>
vlen could be like 3 or 7 or something
<jrtc27>
my experience of debugging a soft core with the horrendously slow DMI-over-JTAG is it really isn't that bad
<sorear>
you mean vl, and vsetvli would populate a mask register corresponding to vl
<jrtc27>
admittedly not a graphical debugger, just boring tui gdb
<jrtc27>
but LOC comes from the debug symbols on the host
<jrtc27>
backtrace is two pointer-sized memory reads per frame
<jrtc27>
locals you normally do lazily
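jrtc27's "two pointer-sized memory reads per frame" can be made concrete with a toy model: with the RISC-V frame-pointer convention the return address sits at fp-8 and the caller's fp at fp-16, so each backtrace level costs exactly two debug-module reads. `dbg_read_word()` and the flat `target_mem[]` are hypothetical stand-ins for the debug transport.

```c
#include <stdint.h>

/* Hypothetical target memory and debug-module read primitive. */
static uint64_t target_mem[64];

static uint64_t dbg_read_word(uint64_t addr)
{
    return target_mem[addr / 8];
}

/* Walk the frame-pointer chain: two round trips per frame, stopping at a
   NULL saved fp or after max frames. Returns the number of frames found. */
static int walk_stack(uint64_t fp, uint64_t *pcs, int max)
{
    int n = 0;
    while (fp != 0 && n < max) {
        pcs[n++] = dbg_read_word(fp - 8);   /* saved ra */
        fp       = dbg_read_word(fp - 16);  /* caller's saved fp */
    }
    return n;
}
```

Even at sub-ms USB round trips, a ten-deep backtrace is ~20 reads, which is why it feels fine in practice.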
<geist>
hmm, i suppose yes. you could set the host mask register
<jrtc27>
also I remember the speed being totally fine when doing bare metal debugging of a HiFive Unmatched
<jrtc27>
but that was less extensive
<sorear>
while "it's fine" is relevant information, i'm primarily asking "why is it fine"
<jrtc27>
because it's not *that* much data
<jrtc27>
I would imagine
<jrtc27>
you can do 10s or 100s of KiB/s IIRC for some slow 100 MHz FPGA
<jrtc27>
admittedly that's for writing to memory, not the slower back-and-forth for registers, but still