#riscv on 2021-08-09 — irc logs at libera.irclog.whitequark.org

2021-08-01 01:31 sorear changed the topic of #riscv to: RISC-V instruction set architecture | https://riscv.org | Logs: https://libera.irclog.whitequark.org/riscv

00:00 hendursaga has quit [Remote host closed the connection]

00:00 hendursaga has joined #riscv

01:25 jimwilson has joined #riscv

01:37 vagrantc has quit [Quit: leaving]

04:58 radu242407 has quit [*.net *.split]

04:58 mcfrdy has quit [*.net *.split]

04:58 dobson has quit [*.net *.split]

04:59 mcfrdy has joined #riscv

04:59 radu242407 has joined #riscv

04:59 dobson has joined #riscv

04:59 Raito_Bezarius has joined #riscv

04:59 linkliu59 has joined #riscv

04:59 Raito_Bezarius has quit [Max SendQ exceeded]

04:59 Raito_Bezarius has joined #riscv

05:01 zapb_ has joined #riscv

05:01 edf0_ has joined #riscv

05:01 Bigcheese_ has joined #riscv

05:01 nosliot has joined #riscv

05:01 gordonDrogon has joined #riscv

05:01 jc has joined #riscv

05:02 stefanct has joined #riscv

05:02 riff-IRC has joined #riscv

05:02 hl has joined #riscv

05:02 sirn has joined #riscv

05:02 sjs has joined #riscv

05:04 scruffyfurn_ has joined #riscv

05:05 cp- has joined #riscv

05:05 awordnot has joined #riscv

05:05 Gravis has joined #riscv

05:05 leah2 has joined #riscv

05:05 pho has joined #riscv

05:05 Maylay has joined #riscv

05:05 kbingham_ has joined #riscv

05:05 awordnot has quit [Signing in (awordnot)]

05:05 awordnot has joined #riscv

05:07 kgz has joined #riscv

05:11 klys has joined #riscv

05:17 pierce has joined #riscv

05:21 BOKALDO has joined #riscv

05:32 freakazoid12345 has quit [Read error: Connection reset by peer]

05:34 charlesap[m] has joined #riscv

06:01 kaji has joined #riscv

06:04 khem has joined #riscv

06:05 winterflaw has joined #riscv

06:17 CarlosEDP has joined #riscv

06:22 EmanuelLoos[m] has joined #riscv

06:34 winterflaw has quit [Ping timeout: 244 seconds]

06:37 GenTooMan has quit [Ping timeout: 240 seconds]

06:42 GenTooMan has joined #riscv

07:21 geertu has quit [Quit: leaving]

07:22 geertu has joined #riscv

07:33 leah2 has quit [Quit: trotz alledem!]

07:33 leah2 has joined #riscv

07:38 valentin has joined #riscv

07:45 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

07:45 TMM_ has joined #riscv

07:56 winterflaw has joined #riscv

08:06 hendursa1 has joined #riscv

08:08 hendursaga has quit [Ping timeout: 244 seconds]

08:46 zjason has joined #riscv

08:59 theruran has quit [Quit: Connection closed for inactivity]

10:36 Esmil has joined #riscv

10:44 wolfshappen has joined #riscv

10:52 dlan has quit [Ping timeout: 245 seconds]

10:54 dlan has joined #riscv

11:01 drewfustini has quit []

11:01 wolfshappen has quit [Quit: later]

11:01 drewfustini has joined #riscv

11:02 wolfshappen has joined #riscv

11:24 jedix has quit [Ping timeout: 258 seconds]

11:25 jedix has joined #riscv

11:38 jwillikers has joined #riscv

12:02 dogukan has joined #riscv

12:12 dogukan has quit [Quit: Konversation terminated!]

12:13 dogukan has joined #riscv

12:14 dogukan has quit [Client Quit]

12:14 dogukan has joined #riscv

12:14 dogukan has quit [Client Quit]

13:26 rjek has quit []

13:26 rjek has joined #riscv

13:39 mthall has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

13:40 mthall has joined #riscv

14:13 GenTooMan has quit [Ping timeout: 256 seconds]

14:15 GenTooMan has joined #riscv

14:17 GenTooMan has quit [Excess Flood]

14:17 GenTooMan has joined #riscv

14:23 hendursa1 has quit [Quit: hendursa1]

14:23 hendursaga has joined #riscv

14:25 GenTooMan has quit [Ping timeout: 272 seconds]

14:28 GenTooMan has joined #riscv

14:40 GenTooMan has quit [Ping timeout: 256 seconds]

14:45 GenTooMan has joined #riscv

14:53 Andre_H has joined #riscv

14:59 compscipunk has joined #riscv

15:13 freakazoid333 has joined #riscv

15:21 adomas has quit []

15:33 iorem has joined #riscv

15:57 iorem has quit [Quit: Connection closed]

16:32 nvmd has joined #riscv

16:42 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

16:42 TMM_ has joined #riscv

17:08 <solrize> sorear around?

17:08 psydroid has joined #riscv

17:09 <sorear> hi

17:09 <solrize> hey can i go way off topic here for a little while, or move somewhere else? i want to talk about some AVR8 code

17:10 <solrize> i'm looking at an AVR flashlight controller which fills up the code space on the smaller AVR parts, and it seems to me that the avr-gcc output is not all that dense, so i'm wondering about the idea of using a bytecode interpreter or similar

17:11 <sorear> sure i guess, no one else seems to want the floor right now, although i wonder what makes this the most attractive channel for you

17:11 <solrize> i remember you wrote a post about code density on different cpus so i looked for you here

17:12 <solrize> figuring you might have thoughts on the topic

17:12 <solrize> sec

17:13 <sorear> ah. I don't recall ever making "a post" on that subject although it's come up on IRC a few times

17:16 <sorear> I also have only the most passing knowledge of AVR instruction encoding

17:17 <solrize> hmm maybe i'm confusing you with someone... it wasn't about avr or hardware cpus as much as encodings in general. it compared forth with smalltalk

17:19 <solrize> or the question of machine code (C compiler output that does a fair amount of 16 bit ops) vs interpreted code on 8 bit cpus in general

17:21 <sorear> the one that comes up somewhat regularly is vincent weaver's work (which I rather disagree with on the grounds that his choice of mostly compression-related benchmarks is not representative of benchmarks I would pick), but that doesn't address either forth or smalltalk

17:21 <solrize> hmm ok i'll see if i can find vincent weaver's work and also will look for the post i'm thinking of

17:25 <sorear> it's not this is it? https://dercuano.github.io/notes/tiny-interpreters-for-microcontrollers.html

17:26 <sorear> this is actually the first time i've heard of anyone using *smalltalk* specifically as a base for deeply embedded systems; forth is much more well-trodden ground

17:26 <solrize> the smalltalk comparison was only about code density

17:27 <solrize> this flashlight thing might have been an ok forth application though

17:29 <solrize> is this you? https://dercuano.github.io/notes/tiny-interpreters-for-microcontrollers.html

17:29 <sorear> no

17:30 <sorear> (my github handle is my irc handle)

17:30 <sorear> i just spent a few minutes trying to find a post based on your description above

17:31 <solrize> ah ok sorry

17:31 <solrize> the person who wrote that is another regular here

17:31 <solrize> no wonder i confused you

17:32 <solrize> http://web.eece.maine.edu/~vweaver/papers/iccd09/ll_document.pdf about to look at this

17:33 <sorear> (who's a regular here?)

17:35 <solrize> dercuano i don't remember what nick he uses

17:36 <sorear> xentrac?

17:39 <solrize> yes, thanks

17:39 <solrize> sorry to have confused the two of you

17:40 <sorear> it's rare for this to happen *to* me, usually i'm the one that can't tell other people apart

17:42 <solrize> heh

17:42 <solrize> here is the other weaver/mckee paper http://web.eece.maine.edu/~vweaver/papers/iccd09/iccd09_density.pdf

17:42 <jrtc27> tbf sorear and xentrac both show as green to me, albeit slightly different shades :D

17:43 <solrize> but yeah i looked at the later one and it is a little bit suspect

17:52 <meowray> what's the summary of psABI Task Group meeting - 2021/08/09?

17:55 <solrize> the earlier weaver/mckee paper is not very informative, i just looked at it

18:16 <jrtc27> meowray: minutes are at https://github.com/riscv-admin/psabi/blob/master/MINUTES/meeting-20210809.adoc

18:18 mahmutov has joined #riscv

18:26 <meowray> "Yes, issue is embedded, people care a lot about code size there so can’t change the implementation until binutils has relaxation support." citation needed for the embedded claim

18:28 <jrtc27> yes, well, I've given up fighting over 10/12 bytes there

18:28 <wingsorc> I care about code size you can cite me :)

18:30 <jimwilson> I have pointed at uses of undefined weak in newlib many times. Particularly in crt0.S.

18:33 <jimwilson> The main issue is with naive users that just build a toolchain, build a benchmark, and then decide that RISC-V is broken because code is larger than ARM, without any attempt to understand what is actually going on. This is a problem for the entire RISC-V community. psABI changes that increase code size are reckless, and I won't agree to them.

18:34 <wingsorc> to be honest people roll their own crt0.S

18:35 <jimwilson> but a naive user looking at RISC-V for the first time for a quick evaluation isn't going to do that

18:37 <wingsorc> true. Actually we had people coming in complaining that RISC-V code was 10% larger than ARM

18:37 <wingsorc> I don't remember the exact configuration that was used though...

18:41 <meowray> the people who roll their own crt0.o very likely need -mcmodel=medany -fno-pic ..

18:41 <meowray> s/likely/unlikely/

18:56 haritz has joined #riscv

18:56 haritz has quit [Changing host]

18:56 haritz has joined #riscv

19:01 BOKALDO has quit [Quit: Leaving]

19:06 zjason has quit [Read error: Connection reset by peer]

19:06 zjason has joined #riscv

19:13 <solrize> is risc-v code larger than arm in real life?

19:14 <solrize> is it a matter of adding a feature to binutils (relaxation = shrinking down variable length operations when possible?)

19:14 <solrize> brb

19:14 GenTooMan has quit [Ping timeout: 258 seconds]

19:25 <jrtc27> the answer is likely "which Arm, which RISC-V and what software"...

19:33 GenTooMan has joined #riscv

19:35 <jimwilson> for embedded code, yes, risc-v is larger than arm in real life, the B extension helps a little, the zce* extensions will help more

19:39 <jimwilson> the C extension was designed using SPEC which is a good unix benchmark, but useless for embedded, this is why we have compressed float/double load/store, because SPEC needs them, but not compressed char/short load/store, because SPEC doesn't need them, even though many embedded systems have no float, and have a lot of char/short data to reduce data size, so this hurts embedded code size, but zce* will fix this

19:39 GenTooMan has quit [Ping timeout: 248 seconds]

19:40 <jrtc27> Zce ranges from "this is an obvious omission" to "what on earth no that's not what RISC-V should look like" IMO..

19:40 <jrtc27> hopefully the latter ones are not needed to be competitive for code size, because I really don't like them...

19:41 <jrtc27> how much has GCC been optimised for code size, too? I know Craig and people keep finding new code size wins in LLVM

19:41 <jrtc27> some of it could just be a lack of having time (money...) poured into it

19:46 GenTooMan has joined #riscv

19:54 <jimwilson> gcc is well optimized for dhrystone and coremark code size and performance

19:55 <jimwilson> we get slightly better results for SPEC CPU2006 with gcc than llvm, but we have more people working on llvm than gcc now, so I expect that to eventually change

19:56 <jrtc27> I know a lack of linker relaxation support does hurt LLD, we see that with our tiny set of embedded benchmarks

19:59 <jimwilson> there were some jump threading patches in llvm recently that helped narrow the gap to gcc

20:02 <jrtc27> oh I remember that one, caught my eye as it mentioned coremark explicitly

20:16 Andre_H has quit [Ping timeout: 248 seconds]

20:34 <solrize> i hadn't heard about zce before

20:34 <solrize> it's different from C extension

20:36 <solrize> hmm

20:44 <jimwilson> https://github.com/riscv/riscv-code-size-reduction

20:52 <solrize> thanks

20:53 dermato has quit [Ping timeout: 258 seconds]

20:55 <solrize> i'm glad this stuff is being addressed, like 1 and 2 byte operations

20:55 dermato has joined #riscv

20:55 <solrize> i still want to see bignum benchmarks to check the claim that int overflow detection doesn't matter

20:56 <jrtc27> what do you mean? why would trapping be helpful?

20:56 <jrtc27> (or flags)

20:57 <jrtc27> surely you'd need exactly the same amount of code to proactively detect overflow and allocate more space as to reactively detect it?

20:57 <solrize> well on most cpus if you want a multi precision add, you use a carry flag, and there is an add with carry instruction

20:57 <solrize> and if you divide by 0 there is a hardware trap

20:58 <solrize> and ideally since int overflow is usually a bug, a hw trap would help there too

20:58 <solrize> so you have to emit extra instructions to test all that stuff

21:00 <jrtc27> if I wanted to make add-with-carry efficient I'd probably have c.slti[u] exist and then do c.slti[u]; c.addi

21:00 <jrtc27> and then macro-op fuse that
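
A minimal sketch of the multi-precision add being discussed, assuming a machine with no carry flag: the carry out of each limb is recovered with an unsigned compare, which a RISC-V compiler would typically lower to `sltu`. The helper name `add_limbs` and the little-endian limb layout are assumptions made for illustration, not anything stated in the channel.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: r = a + b over n 64-bit limbs, least significant
 * limb first. Returns the carry out of the top limb (0 or 1). */
uint64_t add_limbs(uint64_t *r, const uint64_t *a, const uint64_t *b, size_t n)
{
    uint64_t carry = 0;

    assert(r != NULL && a != NULL && b != NULL);
    for (size_t i = 0; i < n; i++) {
        uint64_t s  = a[i] + b[i];   /* unsigned add wraps modulo 2^64 */
        uint64_t c1 = s < a[i];      /* 1 if a[i] + b[i] wrapped */
        uint64_t t  = s + carry;     /* fold in the incoming carry */
        uint64_t c2 = t < s;         /* 1 if adding the carry wrapped */

        r[i]  = t;
        carry = c1 | c2;             /* both cannot be set for the same limb */
    }
    return carry;
}
```

Each limb costs two adds and two compares here, where an ISA with add-with-carry needs one instruction; that gap is what the fused compare-and-add idea above is aimed at.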

21:00 <jrtc27> uh, no, you do not want to trap on int overflow

21:00 <jrtc27> mips tried that, it was unused

21:01 vagrantc has joined #riscv

21:01 <jrtc27> everything just used the non-trapping instruction

21:02 <solrize> were the trapping ones slower or anything like that?

21:02 <solrize> and mips, that was before people cared about this stuff

21:02 <jrtc27> it was mips, everything was slow

21:02 <jrtc27> but, you just broke too much code

21:03 <solrize> if code depended on non-trapping it was already broken--signed int overflow in C is UB

21:03 <jrtc27> r6 removed the trapping version

21:03 <jrtc27> sure

21:03 <jrtc27> lots of things are UB

21:03 <jrtc27> shitty code still exists

21:03 <jrtc27> and people like to assume two's complement

21:03 <solrize> thus the desirability of traps, to flag the shitty code instead of running it and letting it corrupt stuff

21:04 <solrize> if they want 2s complement they can use unsigned or -fwrapv

21:04 <solrize> which disables some optimizations
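
A small sketch of the two options mentioned here, written so that neither depends on `-fwrapv`: one helper wraps by doing the arithmetic in unsigned, the other reports overflow explicitly. The names `wrapping_add` and `checked_add` are hypothetical; `__builtin_add_overflow` is a GCC/Clang builtin, not ISO C.

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>
#include <stddef.h>

/* Wrapping add: unsigned arithmetic is defined to wrap. Converting an
 * out-of-range value back to int is implementation-defined in ISO C;
 * GCC and Clang document it as two's-complement wraparound. */
int wrapping_add(int a, int b)
{
    return (int)((unsigned)a + (unsigned)b);
}

/* Checked add: returns true if a + b fits in an int, false on overflow.
 * *out receives the (possibly wrapped) result either way. */
bool checked_add(int a, int b, int *out)
{
    assert(out != NULL);
    return !__builtin_add_overflow(a, b, out);
}
```

On an ISA without overflow flags or trapping adds, the checked form compiles to a few extra compare instructions per operation, which is the code-size cost being debated.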

21:04 <jrtc27> I like your optimism that this forces people to fix their code rather than makes people just ignore mips

21:05 <solrize> they ignore mips for many other reasons why not one more

21:08 <solrize> anyway it's a significant sticking point, if people want C to always allow wrapping then they should take it up with the C standard committee. unintentional overflow may not happen much on 64 bit machines but it was a real issue with 32 bit because it often escaped detection. with 16 bit it happened so much that it usually got caught

21:10 <jrtc27> -fsanitize=undefined

21:10 <solrize> hmm ok if that reliably catches overflow, but i mean if it inserts a bunch of extra code and slows down the program then people won't use it

21:11 <solrize> i tried -ftrapv and there wasn't much difference on x86

21:11 <jrtc27> well it does a whole bunch of things, integer overflow detection being just one of them

21:11 <solrize> nice

21:12 <solrize> i will start using it

21:12 <solrize> i've also wanted to try kcc

21:12 <solrize> or switching from C to ada lol

21:12 <jrtc27> (https://clang.llvm.org/docs/UndefinedBehaviorSanitizer.html FWIW)

21:13 <jrtc27> ubsan is pretty cheap, it's things like msan where it gets slow

21:13 <jrtc27> the headline figure for msan is ~3 times slower, and ~2 times slower for asan

21:14 <solrize> nice thanks right now i primarily use gcc

21:14 <solrize> but i think gcc also has sanitize undefined

21:14 <jrtc27> gcc has support for some of them, don't know exactly what though

21:14 <solrize> yeah

21:14 <jrtc27> yeah it vendors parts of llvm in its tree

21:14 <solrize> wow interesting i didn't know that

21:15 <jrtc27> (the run-time parts of the sanitizers, in libsanitizer)

21:16 <solrize> thanks

21:54 valentin has quit [Quit: Leaving]

22:09 peeps[zen] has quit [Read error: Connection reset by peer]

22:12 peepsalot has joined #riscv

22:18 nvmd has quit [Quit: Later, nerds.]

22:33 <meowray> -fsanitize-trap=undefined is needed to make ubsan cheap

22:34 <dh`> "with 16 bit it happened so much that it usually got caught"

22:35 <dh`> so you'd think, but virtually every DOS game has some 16-bit overflow in it

22:35 <dh`> I remember in the original railroad tycoon there was a whole succession of 16-bit overflows you'd hit as you expanded your railroad

22:37 <sorear> zce is surprisingly reasonable imo... i'd like to see the detailed benchmark results (later), hopefully this wasn't just tested on one decompression algorithm

22:37 <jrtc27> which parts of zce?

22:37 <jrtc27> most of it is fine

22:38 <jrtc27> a couple of the instructions are way too specialist, and a couple are just "no" (e.g. tbljal, no, don't do that, please)

22:38 <jrtc27> push/pop, meh, I hate it but people do that on microcontroller ISAs

22:38 <sorear> tbljal is close to word for word something I worked out months ago while trying to come up with a non-terrible version of the andes code density instruction

22:39 <jrtc27> non-terrible != good...

22:41 <jimwilson> zce benchmark info https://docs.google.com/spreadsheets/d/1UYll7HGR_QLGTsHcjGoNL4EodM5BNO41hXdxVAFaxFs/edit#gid=1281210325

22:42 <jrtc27> if you want to do tbljal, make it a less architecturally crippled version and just add a load-and-branch instruction...

22:42 <jrtc27> what I don't like is that it's using a new CSR

22:43 <jrtc27> as the implicit base

22:44 <sorear> hmm, if it used gp it'd be compatible with fdpic shared libs... or you could make it truncate pc

22:44 <jimwilson> push/pop and tbljal are the ones that give the most benefit, but you don't have to implement them on unix parts where performance matters more than code size

22:45 <jrtc27> did they consider a generalised load-and-branch?

22:45 <jrtc27> because that has wider applicability

22:45 <jrtc27> and yeah you could have a compressed form that used say gp as the base

22:46 <sorear> I-types don't grow on trees, especially if you insist on encoding imm[0:1] despite the fact it will always be zero

22:47 <jrtc27> you could make it a J-type at least and shave off bit 0

22:47 <sorear> J = 8 times the space of I
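
The factor of 8 can be checked from the base 32-bit formats: a J-type instruction occupies a whole 7-bit major opcode, while an I-type instruction occupies one of the eight `funct3` slots within a major opcode. A rough sketch of that arithmetic follows; the constant and function names are made up for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Field widths in the base 32-bit RISC-V formats. */
enum {
    INSN_BITS   = 32,
    OPCODE_BITS = 7,
    FUNCT3_BITS = 3,
    /* I-type: opcode and funct3 are fixed; rd(5) + rs1(5) + imm(12) are free. */
    I_TYPE_FREE_BITS = INSN_BITS - OPCODE_BITS - FUNCT3_BITS,
    /* J-type: only the opcode is fixed; rd(5) + imm(20) are free. */
    J_TYPE_FREE_BITS = INSN_BITS - OPCODE_BITS
};

/* Number of distinct encodings consumed by one instruction with the
 * given number of free operand bits. */
uint32_t encodings(unsigned free_bits)
{
    assert(free_bits < 32);
    return (uint32_t)1 << free_bits;
}

/* How many I-type instructions fit in the space of one J-type instruction. */
uint32_t j_to_i_ratio(void)
{
    return encodings(J_TYPE_FREE_BITS) / encodings(I_TYPE_FREE_BITS);
}
```

So trading an I-type slot for a J-type one to gain immediate bits spends 2^25 encodings instead of 2^22.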

22:47 <jrtc27> oh right

22:47 <jrtc27> hmm

22:48 <jimwilson> I don't recall discussion of load-and-branch, but I haven't followed all of the discussions

22:49 wingsorc__ has joined #riscv

22:49 <jrtc27> yeah I haven't either for various reasons

22:49 <jrtc27> still have concerns about the mismatch between the code corpus in use and the intended application space for the more interesting instructions...

22:50 wingsorc has quit [Read error: Connection reset by peer]

23:06 Xark has joined #riscv

23:13 devcpu has joined #riscv

23:14 theruran has joined #riscv

23:44 mahmutov has quit [Ping timeout: 272 seconds]

23:53 winterflaw has quit [Ping timeout: 244 seconds]

23:57 ntwk has joined #riscv