#riscv on 2023-05-11 — irc logs at libera.irclog.whitequark.org

2021-08-01 01:31 sorear changed the topic of #riscv to: RISC-V instruction set architecture | https://riscv.org | Logs: https://libera.irclog.whitequark.org/riscv

00:16 MaxGanzII_ has joined #riscv

00:51 Tenkawa has quit [Quit: Was I really ever here?]

01:13 meta-coder has joined #riscv

01:31 motherfsck has quit [Ping timeout: 265 seconds]

01:35 jacklsw has joined #riscv

01:35 <dlan> drewfustini: ffffaf8000000000 is the first address of direct mapping, so it's should be the start address of ddr/phys address? check page table for the detail if possible?

01:56 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

01:56 TMM_ has joined #riscv

02:49 MaxGanzII_ has quit [Ping timeout: 240 seconds]

02:51 terminalpusher has quit [Ping timeout: 245 seconds]

02:55 prabhakarlad has quit [Quit: Client closed]

03:17 PobodysNerfect has joined #riscv

03:22 PobodysNerfect has quit [Ping timeout: 256 seconds]

03:59 BootLayer has joined #riscv

04:01 heat has quit [Ping timeout: 240 seconds]

04:16 vagrantc has joined #riscv

04:24 jay321 has quit [Remote host closed the connection]

04:25 jay321 has joined #riscv

04:25 meta-coder has quit [Ping timeout: 264 seconds]

04:28 meta-coder has joined #riscv

04:30 jay321 has quit [Ping timeout: 246 seconds]

04:38 meta-coder has quit [Quit: leaving]

04:38 meta-coder has joined #riscv

05:47 junaid_ has joined #riscv

05:49 PobodysNerfect has joined #riscv

05:51 vagrantc has quit [Quit: leaving]

05:55 PobodysNerfect has quit [Ping timeout: 264 seconds]

06:00 junaid_ has quit [Remote host closed the connection]

06:15 bauruine has joined #riscv

06:40 MaxGanzII_ has joined #riscv

06:40 junaid_ has joined #riscv

06:42 crabbedhaloablut has quit [Ping timeout: 256 seconds]

06:43 crabbedhaloablut has joined #riscv

06:54 PobodysNerfect has joined #riscv

07:39 ldevulder has joined #riscv

07:44 Andre_Z has joined #riscv

07:50 m5zs7k has quit [Ping timeout: 240 seconds]

07:52 MaxGanzII_ has quit [Ping timeout: 240 seconds]

07:55 m5zs7k has joined #riscv

08:06 aburgess has quit [Ping timeout: 268 seconds]

08:25 zjason` is now known as zjason

08:32 PobodysNerfect_ has joined #riscv

08:36 PobodysNerfect has quit [Ping timeout: 268 seconds]

08:41 meta-coder has quit [Ping timeout: 240 seconds]

08:51 meta-coder has joined #riscv

08:57 aburgess has joined #riscv

09:04 MaxGanzII_ has joined #riscv

09:07 MaxGanzII_ has quit [Remote host closed the connection]

09:07 aburgess_ has joined #riscv

09:08 aburgess has quit [Ping timeout: 240 seconds]

09:12 aburgess_ has quit [Ping timeout: 240 seconds]

09:19 wingsorc__ has quit [Ping timeout: 256 seconds]

09:19 meta-coder has quit [Ping timeout: 240 seconds]

09:22 meta-coder has joined #riscv

09:30 shoragan has quit [Remote host closed the connection]

09:31 shoragan has joined #riscv

09:42 jacklsw has quit [Ping timeout: 264 seconds]

10:00 prabhakarlad has joined #riscv

10:07 pharonix71 has quit [Ping timeout: 240 seconds]

10:09 pharonix71 has joined #riscv

10:28 BootLayer has quit [Quit: Leaving]

10:33 MaxGanzII_ has joined #riscv

10:58 wingsorc has joined #riscv

11:08 meta-coder has quit [Quit: leaving]

11:30 aburgess has joined #riscv

11:51 junaid_ has quit [Ping timeout: 240 seconds]

11:53 jmdaemon has quit [Ping timeout: 246 seconds]

11:55 Tenkawa has joined #riscv

11:56 BootLayer has joined #riscv

11:56 meta-coder has joined #riscv

12:18 junaid_ has joined #riscv

12:31 heat has joined #riscv

12:37 jay321 has joined #riscv

12:41 jay321 has quit [Ping timeout: 240 seconds]

12:57 jay321 has joined #riscv

12:59 mahk has quit [Ping timeout: 256 seconds]

13:03 jay321 has quit [Ping timeout: 246 seconds]

13:13 mahk has joined #riscv

13:23 meta-coder has quit [Quit: leaving]

13:48 pedja has joined #riscv

13:57 meta-coder has joined #riscv

14:01 Trifton_ has quit [Ping timeout: 265 seconds]

14:20 meta-coder has quit [Ping timeout: 240 seconds]

14:21 mahk has quit [Changing host]

14:21 mahk has joined #riscv

14:34 Andre_Z has quit [Ping timeout: 265 seconds]

14:45 elastic_dog is now known as Guest4205

14:45 elastic_dog has joined #riscv

14:45 Guest4205 has quit [Ping timeout: 256 seconds]

14:52 meta-coder has joined #riscv

14:59 jacklsw has joined #riscv

15:00 jack_lsw has joined #riscv

15:09 jacklsw has quit [Quit: Back to the real world]

15:10 jack_lsw has quit [Quit: Back to the real life]

15:33 motherfsck has joined #riscv

15:46 jay321 has joined #riscv

15:47 jay321 has quit [Read error: Connection reset by peer]

15:48 Andre_Z has joined #riscv

15:49 meta-coder has quit [Ping timeout: 240 seconds]

15:51 jay321 has joined #riscv

15:53 bjoto has quit [Ping timeout: 268 seconds]

15:55 bjoto has joined #riscv

15:56 junaid_ has quit [Ping timeout: 240 seconds]

16:04 TMM_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

16:04 TMM_ has joined #riscv

16:04 aburgess has quit [Ping timeout: 246 seconds]

16:09 prabhakarlad has quit [Quit: Client closed]

16:21 meta-coder has joined #riscv

16:44 jmdaemon has joined #riscv

16:51 meta-coder has quit [Quit: leaving]

16:54 meta-coder has joined #riscv

16:55 jmdaemon has quit [Ping timeout: 265 seconds]

16:56 vagrantc has joined #riscv

17:04 jmdaemon has joined #riscv

17:08 MaxGanzII_ has quit [Ping timeout: 240 seconds]

17:08 Stat_headcrabed has joined #riscv

17:13 Leopold has quit [Ping timeout: 265 seconds]

17:13 Leopold has joined #riscv

17:15 motherfsck has quit [Quit: quit]

17:23 ldevulder_ has joined #riscv

17:25 ldevulder has quit [Ping timeout: 265 seconds]

17:26 <Tenkawa> Got my Star64 in hand now... time to start taking a look

17:35 <palmer> drewfustini, dlan, conchuod: can you guys post on LKML if you're getting a concrete failure here? There was a post on sw-dev too, looks like there's some undocumented memory layout issues we've hit

17:36 <drewfustini> I was moving from 6.2 to 6.4-rc1 when I noticed it. It stops the boot. I since tried 6.3 and it works okay.

17:36 <drewfustini> I need to bisect to see where the problem started between 6.3..6.4-rc1

17:37 <drewfustini> *I have since tried 6.3 [..]

17:37 <conchuod> 6.3.0 is fine drew?

17:38 <drewfustini> Yes, it works okay.

17:39 meta-coder has quit [Ping timeout: 256 seconds]

17:40 MaxGanzII_ has joined #riscv

17:42 <drewfustini> One caveat is that this SoC has errata where TVAL is not sign extended correctly. I had been observing this as an oops on invalid virtual address for badaddr. The "fix" was to sign extend bad addr in do_page_fault (a todo is to correctly use alternatives in the future). This has been working okay in 6.0, 6.1, 6.2, 6.3. One difference in 6.4-rc1 is that the name of the page fault function changed.

17:43 <drewfustini> Current fix prior to 6.4-rc1

17:43 <drewfustini> https://www.irccloud.com/pastebin/xnwlRS8m/

17:43 meta-coder has joined #riscv

17:43 <drewfustini> System works fine with SMP. running benchmarks okay. runs ubuntu 23.04 rootfs okay.

17:44 <drewfustini> It's a bit of an internal hack for now but everything working okay

17:45 <drewfustini> In 6.4-rc1, because of generic entry series, do_page_fault is now handle_page_fault

17:45 <drewfustini> https://www.irccloud.com/pastebin/j5KydTwZ/

17:46 <dh`> how do people ship hardware with such basic blunders? did they not boot linux even once on any test of their cpu ever?

17:46 <drewfustini> It is a pretty trivial change so I am not leaning towards this sign-extend hack being the problem

17:46 <drewfustini> It is not shipping :)

17:46 <drewfustini> It's an internal project.

17:46 <drewfustini> I mention it because I noticed the problem with 6.4-rc1

17:46 <dh`> ah, so you are the lucky guy to run that test

17:46 <dh`> :-)

17:46 <drewfustini> yeah :)

17:47 <drewfustini> I still need to bisect but I wanted to mention the caveat that I do have this "hacky" patch on top.

17:48 <jrtc27> the U74 has that erratum

17:49 <jrtc27> but for some unspecified subset of the time

17:50 <jrtc27> ah no it is specified

17:50 <jrtc27> for instruction faults

17:50 <drewfustini> Yes, the sifive errata has similar fix. I tried for awhile to get a similar alternative to work like they did but had no luck. Most likely I just don't understand C macro stuff well enough

17:50 <jrtc27> which you don't notice because normally you don't take such faults

17:50 <jrtc27> but how this isn't tested by architectural tests...

17:50 <jrtc27> (I know how, because riscv has crap testing)

17:50 <drewfustini> Anyways, I punted until later as this simple hack is good enough for internal use for now

17:51 <jrtc27> well, the trouble is, that's not a good fix

17:51 <jrtc27> because then it means using a zero-extended address doesn't fault like it should

17:51 <jrtc27> and potentially you end up in an infinite trap loop if it originated in the kernel

17:51 <drewfustini> I think the SiFive one is CIP 453: ./arch/riscv/errata/sifive/errata_cip_453.S

17:52 <jrtc27> (kernel accesses zero-extended address, kernel page fault handler says looks good to me, try again, fault reoccurs, repeat)

17:52 <drewfustini> I tried to do something similar for the TVAL sign extend issue that I have but I couldn't get it to work. I'm just carrying this arch/riscv/mm/fault.c hack for now.

17:52 <jrtc27> you can only get away with it for instruction fetch because mepc gives you the sign of PC

17:53 <jrtc27> (well, sepc in the case of unix)

17:53 <jrtc27> and yes, that's the right number

17:53 <jrtc27> (source is https://sifive.cdn.prismic.io/sifive/167a1a56-03f4-4615-a79e-b2a86153148f_FU740_errata_20210205.pdf)

17:54 <jrtc27> (sidenote: it drives me nuts that sifive still ask for personal data to download that file...)

17:54 <jrtc27> (but it's just there in the javascript to scrape the url of and direct link to...)

17:56 <palmer> drewfustini: if it's an internal erratum then we can't do much about it upstream. Do you know if it's the same one as the sw-dev post?

17:56 <palmer> jrtc27: if you just Google for the PDF title then the first result skips the SiFive pages and goes straight to the CDN

17:56 <jrtc27> we have a link on the freebsd wiki

17:57 <drewfustini> The issue I saw was different. It gets into the linux boot. Mounts the rootfs from eMMC. And then I got the fatal "Oops - store (or AMO) access fault [#1]"

17:57 <jrtc27> but it's pretty hostile to devs...

17:57 <drewfustini> https://www.irccloud.com/pastebin/AKK2sOZV/

17:57 <palmer> jrtc27: they sent someone a cease and desist for it, but they backed off after the Google bit

17:57 <palmer> drewfustini: does it reproduce on something public? if so, can you post it on LKML?

17:57 <drewfustini> I am doing some other things on the system right now, but later I want to bisect to understand where the problem starts between 6.3 and 6.4-rc1.

17:58 <jrtc27> morons

17:58 <jrtc27> how to piss off the people supporting your hardware in one swift move

17:58 <palmer> drewfustini: thanks. I'm worried there's something lurking here. It could be a proper Linux bug, as we've gota lot in 6.4

17:58 <drewfustini> Good question... I will try it on some of the SBCs I have

17:59 <palmer> drewfustini: sweet, thanks. I've got most of them bouncing around somewhere, but I only use QEMU... ;)

17:59 <Tenkawa> I'm curious when i can start to try to integrate 6.4-rc into my testing

18:00 <drewfustini> I'm also interested to find out where the regression for my internal system started. There is a service processor that does all the heavy lifting (clocks, resets, etc), so I've always been able to run upstream Linux with just that one patch to sign extend TVAL.

18:00 <Tenkawa> Many areas still look much thinner than the modified 6.2

18:01 <Tenkawa> This Star64 builder is.... odd

18:01 <drewfustini> Without that TVAL sign extend patch, upstream Linux up to and including 6.3 still works, but it copy_process will sometimes fail when badaddr gets the top bits cleared by the hardware bug and results in an invalid virtual address.

18:02 <palmer> Tenkawa: what do you mean by thinner?

18:03 <Tenkawa> palmer: I compared the dts I have for the work Esmil had been doing on 6.2 and its much larger than the current one in 6.4-rc for the VF2

18:03 <Tenkawa> the nodes look very... sparse

18:03 <Tenkawa> on 6.4-rc

18:05 Stat_headcrabed has quit [Quit: Stat_headcrabed]

18:06 <palmer> Tenkawa: OK, that makes sense. I'm not really following the various SOC downstreams, though, so I'm probably the least likely to know ;)

18:06 <Tenkawa> Yeah I am working on te VF2 and the Star64

18:06 <Tenkawa> Just got the Star64 today

18:07 <Tenkawa> the build wants yocto for this though ... thats going to have to go....

18:08 <palmer> looks like they're both jh7110? there's some patches in the queue for that, IIUC there's no errata (the jh7100 is blocked on the DMA stuff)

18:09 <Tenkawa> Yeah.. its a nice 7110 (all built on the board too)

18:10 <drewfustini> jrtc27: thanks for the insights. re-reading what you wrote, I see that you mean this simple "fix" to sign extend tval could cause other, potentially worse effects.

18:10 paddymahoney has quit [Ping timeout: 265 seconds]

18:23 MaxGanzII_ has quit [Ping timeout: 240 seconds]

18:26 MaxGanzII_ has joined #riscv

18:39 ___nick___ has joined #riscv

18:49 billchenchina- has quit [Ping timeout: 264 seconds]

18:50 <conchuod> Tenkawa: Much of the jh7110 stuff is hung up on the clock drivers being applied. There's a bunch of stuff sitting around waiting for that.

18:51 <conchuod> 6.4-rc1 is usable, as long as your definition of that is "boot an initramfs & access w/ uart"

18:53 wingsorc has quit [Quit: Leaving]

18:54 aredridel has quit [Read error: Connection reset by peer]

18:54 aredridel has joined #riscv

19:02 aerkiaga has joined #riscv

19:03 BootLayer has quit [Quit: Leaving]

19:17 billchenchina- has joined #riscv

19:20 ___nick___ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

19:22 ___nick___ has joined #riscv

19:25 ___nick___ has quit [Client Quit]

19:27 ___nick___ has joined #riscv

19:41 aburgess has joined #riscv

19:43 <palmer> and that's all I do!

19:56 jmdaemon has quit [Ping timeout: 240 seconds]

20:00 billchenchina- has quit [Ping timeout: 240 seconds]

20:03 billchenchina- has joined #riscv

20:04 ___nick___ has quit [Ping timeout: 256 seconds]

20:08 Andre_Z has quit [Quit: Leaving.]

20:20 aerkiaga has quit [Remote host closed the connection]

20:22 bauruine has quit [Remote host closed the connection]

20:26 wingsorc__ has joined #riscv

20:50 ldevulder_ has quit [Quit: Leaving]

20:58 crabbedhaloablut has quit [Read error: Connection reset by peer]

21:01 aerkiaga has joined #riscv

21:02 crabbedhaloablut has joined #riscv

21:02 PobodysNerfect_ has quit [Quit: Gone to sleep. ZZZzzz…]

21:04 billchenchina- has quit [Ping timeout: 246 seconds]

21:07 loki_val has joined #riscv

21:07 crabbedhaloablut has quit [Ping timeout: 264 seconds]

21:09 billchenchina has joined #riscv

21:20 PobodysNerfect has joined #riscv

21:24 PobodysNerfect has quit [Ping timeout: 240 seconds]

21:32 duckworld has quit [*.net *.split]

21:32 duckworld has joined #riscv

21:46 duckworld has quit [*.net *.split]

21:46 duckworld has joined #riscv

21:46 duckworld has quit [Max SendQ exceeded]

21:46 duckworld has joined #riscv

21:48 sh1r4s3 has quit [Ping timeout: 256 seconds]

21:54 duckworld has quit [*.net *.split]

21:54 duckworld has joined #riscv

21:58 MaxGanzII_ has quit [Ping timeout: 240 seconds]

22:05 aerkiaga has quit [Remote host closed the connection]

22:08 pbsds has quit [Ping timeout: 240 seconds]

22:09 pbsds has joined #riscv

22:48 elastic_dog is now known as Guest1681

22:48 Guest1681 has quit [Killed (erbium.libera.chat (Nickname regained by services))]

22:48 elastic_dog has joined #riscv

23:01 ahs3 has joined #riscv

23:05 Tenkawa has quit [Quit: Was I really ever here?]

23:14 ashtin has joined #riscv

23:40 KombuchaKip has joined #riscv

23:50 jmdaemon has joined #riscv

23:51 Tenkawa has joined #riscv

23:58 vagrantc has quit [Quit: leaving]