dgilmore changed the topic of #fedora-riscv to: Fedora on RISC-V https://fedoraproject.org/wiki/Architectures/RISC-V || Logs: https://libera.irclog.whitequark.org/fedora-riscv || Alt Arch discussions are welcome in #fedora-alt-arches
JasenChao has joined #fedora-riscv
JasenChao has quit [Quit: Client closed]
JasenChao has joined #fedora-riscv
davidlt has joined #fedora-riscv
davidlt has quit [Remote host closed the connection]
davidlt has joined #fedora-riscv
<TelegramRelayBot> jasenchao joined the group via invite link.
JasenChao has quit [Quit: Client closed]
<davidlt> rwmjones, could we get NVR (rawhide and f40) for libunwind: https://src.fedoraproject.org/rpms/libunwind/commits/rawhide
<davidlt> This is just ExclusiveArch change
<davidlt> rwmjones, guile22, switch to use --disable-rpath instead of sed, http://fedora.riscv.rocks:3000/rpms/guile22/commit/6ba6be5a04d440c2cacba81ffc1517e77586ce70
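The guile22 change swaps a post-hoc sed for configure's own switch. A sketch of what such a spec change can look like (illustrative fragment only, not the actual guile22.spec):

```spec
# Illustrative fragment, not the actual guile22.spec.
# Before: rpath entries were scrubbed after %%configure with a sed hack.
# After: configure is asked not to embed rpaths in the first place.
%build
%configure --disable-rpath
%make_build
```

Letting configure handle it is less fragile than patching libtool output after the fact.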
<davidlt> rwmjones, cmake, update timing out tests for riscv64 to incl. one more (Qt6Autogen.MocIncludeSymlin): http://fedora.riscv.rocks:3000/rpms/cmake/commit/0510331e78061e4f1ca99bfb269d80740f411377
<davidlt> Do not disable test in general, this was just needed to double check the list
<davidlt> rwmjones, libssh, fix provides for riscv64 (same as on other arches): http://fedora.riscv.rocks:3000/rpms/libssh/commit/a2378456fe5c98edacd80694810d1db45cb623ba
<davidlt> rwmjones, kernel-srpm-macros, add riscv64 to kernel_arches macro: http://fedora.riscv.rocks:3000/rpms/kernel-srpm-macros/commit/208ecde39e98d412abf9ad17fcdf2a1c96df1180
<davidlt> Do git grep to double check if we need to do anything else in this package. Most likely not, but double check.
<davidlt> I am not sure if this is needed or not, as it exists for backward compatibility.

<davidlt> Double check with the maintainer if we need it.
<davidlt> Maybe some folks still expect it.
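The double-check davidlt describes amounts to grepping the package checkout for any remaining hard-coded arch lists that might also need riscv64. A self-contained sketch (a stand-in macros file is created here so the commands run anywhere; against the real package you would grep the cloned dist-git repo instead):

```shell
# Stand-in for the kernel-srpm-macros checkout; file name and contents
# are assumptions for illustration, not the package's actual macro file.
mkdir -p kernel-srpm-macros
cat > kernel-srpm-macros/macros.kernel-srpm <<'EOF'
%kernel_arches x86_64 aarch64 ppc64le s390x riscv64
EOF

# Look for any arch list that mentions the other 64-bit arches but
# might still be missing riscv64.
grep -rnE 'x86_64|aarch64|ppc64le|s390x' kernel-srpm-macros/
```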
<rwmjones> morning, will do shortly
<rwmjones> davidlt: https://koji.fedoraproject.org/koji/taskinfo?taskID=114550920 libunwind-1.8.0-3.fc41
<rwmjones> davidlt: https://koji.fedoraproject.org/koji/taskinfo?taskID=114551094 kernel-srpm-macros-1.0-23.fc41
<rwmjones> I just updated kernel-srpm-macros without a PR as there is nothing to review
<rwmjones> there were lots of other changes in kernel-srpm-macros but none appeared related to riscv64
<rwmjones> %ghc_arches is not used anywhere, but better to have it IMHO
<davidlt> rwmjones, just reminder to make fc40 NVRs too
<rwmjones> f40, not f41?
<davidlt> Well we are building f40, not f41 yet
<rwmjones> that could be more complicated, let me see ...
<rwmjones> so there's also something to tell you about f40 ... the fork of c10s from f40 has completed, and I'm told they won't be updating / synchronizing c10s packages from f40 at all in future
<rwmjones> that means i'll need to pull all our f41 changes into c10s (merging not open yet)
<rwmjones> it's not really a problem but it's not what I expected
<davidlt> rwmjones, well, I expected this
<davidlt> as soon as they did the final sync (after mass rebuild) it's a separate thing
<davidlt> at least until RHEL X.0 happens, and then it's open to changes as long as that doesn't go against RHEL policy
<davidlt> Thanks
<rwmjones> later today I'm going to go through all the existing PRs and merge all the valgrind_arches ones if maintainers haven't done so already
<rwmjones> these are not controversial changes IMHO
<rwmjones> davidlt: I don't understand the cmake change (riscv64/main commit 0510331e78061e4), it seems to both disable the tests and filter out a failing test?
<rwmjones> I will try compiling it with just the filter bit
<davidlt> rwmjones, check my message, it explained it
<davidlt> update timing out tests for riscv64 to incl. one more (Qt6Autogen.MocIncludeSymlin)
<davidlt> Do not disable test in general, this was just needed to double check the list
<rwmjones> oh I see, alright let me try it
<davidlt> it's just about Qt6Autogen.MocIncludeSymlink part
<davidlt> so tiny change
<rwmjones> got it
<rwmjones> is it really "Qt6Autogen.MocIncludeSymlin" or "Qt6Autogen.MocIncludeSymlink" ?
<davidlt> check commit
<davidlt> ah
<davidlt> checking
<davidlt> rwmjones,
<davidlt> Qt6Autogen.MocIncludeSymlin
<rwmjones> I wonder if it does prefix matching and so it just worked anyway
<davidlt> The following tests FAILED:
<davidlt> 661 - Qt6Autogen.MocIncludeSymlink (Timeout)
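rwmjones's guess is right: ctest's `-R`/`-E` options treat the pattern as an unanchored regular expression, so the truncated name matches the full test name as a substring (the `.` even matches the literal dot). A small sketch using grep -E to mimic that matching behavior:

```shell
# ctest -E treats its argument as an unanchored regex, so the
# truncated "Qt6Autogen.MocIncludeSymlin" still excludes the real
# test "Qt6Autogen.MocIncludeSymlink". grep -E mimics the match.
pattern='Qt6Autogen.MocIncludeSymlin'
full_name='Qt6Autogen.MocIncludeSymlink'
if printf '%s\n' "$full_name" | grep -Eq "$pattern"; then
  echo "excluded: $full_name"
fi
# -> excluded: Qt6Autogen.MocIncludeSymlink
```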
<rwmjones> I'm actually doing a local build to see if I can reproduce the problem, but the build takes forever :-(
<rwmjones> you may have noticed I'm going through our PR backlog and pushing things ...
<rwmjones> I'll add comments to the relevant PRs
<davidlt> rwmjones, I am cooking lunch right now ;)
<rwmjones> no problem :-)
<davidlt> I always said, this is 24/7 :) Even in the kitchen.
kalev has joined #fedora-riscv
zsun has joined #fedora-riscv
<davidlt> rwmjones, I tried building mold, two tests failed: http://fedora.riscv.rocks/koji/taskinfo?taskID=1635825
<rwmjones> will look at mold a bit later, I'm currently going through existing PRs
<davidlt> it's part of 2.39 from what I can see
<davidlt> I see LLVM 18 is making its way towards F40 too
zsun has quit [Quit: Leaving.]
<rwmjones> davidlt: re mold, yes it's plausible ...
<rwmjones> the gcc patch is old enough that surely we have that in gcc 14 already
<rwmjones> the glibc patch is more recent though
<davidlt> but it's in a release
<rwmjones> in Fedora already?
<davidlt> I checked commit, but I don't know if all this is related to mold anyway
<davidlt> yeah, it's part of glibc 2.39. It landed weeks ago
<rwmjones> hard to say, I'm just doing a local build of mold from rawhide here to see
<rwmjones> maybe there'll be more details in the log file
<davidlt> ah, 12 days ago: [PATCH] RISC-V: Fix the static-PIE non-relocated object check
<davidlt> Reported-by: Andreas Schwab <schwab@suse.de>
<davidlt> Closes: BZ #31317
<davidlt> Fixes: e0590f41fe ("RISC-V: Enable static-pie.")
<davidlt> Signed-off-by: Palmer Dabbelt <palmer@rivosinc.com>
<rwmjones> that's in mold?
<davidlt> Yes
<rwmjones> ok let me see if that can be backported then
<davidlt> I don't think it landed yet
<rwmjones> so if a fix is needed in glibc, we could either temporarily skip the failing mold tests with a link to the glibc fix in mold.spec
<rwmjones> or just wait a bit
<rwmjones> it's not really clear to me if mold is actually needed anywhere, it seems to be used only for optimization in projects like ceph, and optionally there
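The first option rwmjones mentions could look roughly like this in mold.spec; the test names below are placeholders, since the log doesn't name the two failing tests:

```spec
# Illustrative mold.spec fragment; test names are placeholders.
# Temporarily skip the riscv64 failures until the glibc static-PIE
# fix ("RISC-V: Fix the static-PIE non-relocated object check",
# BZ #31317) reaches the buildroot.
%ifarch riscv64
%global skipped_tests 'placeholder-test-one|placeholder-test-two'
%endif
```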
<rwmjones> I got through quite a lot of the PRs today, the easy ones I merged and did f40 & f41 builds
<rwmjones> still waiting on cmake build to finish
<davidlt> cmake build is kinda slow
<rwmjones> it's doing tests, but also competing with another build on my vf2
<rwmjones> https://kojipkgs.fedoraproject.org//work/tasks/215/114560215/build.log <- I think this is what happens if you completely ignore warnings in your C++ project
davidlt has quit [Ping timeout: 268 seconds]
davidlt has joined #fedora-riscv
davidlt has quit [Remote host closed the connection]
davidlt has joined #fedora-riscv
cyberpear has joined #fedora-riscv
<davidlt> rwmjones, a reminder, join Matrix :)
<conchuod> davidlt: I ordered one of these k230 boards, is there an eta on this x60 thing you mentioned?
<davidlt> conchuod, it was never announced
<davidlt> I mean the date and the price
<davidlt> they are working on the wiki and youtube videos, shouldn't be too long
<davidlt> fun thing, they added more content and this time it's called K1X. Sometimes they call it K1.
<davidlt> It seems that the two clusters might be a bit different.
<davidlt> "AI" stuff is only on cluster 0, and it also has an extra 512KB "TCM" next to the L2 cache
<davidlt> AI stuff is "X60TM extends 16 AI instructions, including matrix multiplication and sliding window calculation."
<davidlt> I would be surprised to see it on only one cluster.
<davidlt> Anyways, that's something Linux will not care about anyway.
<davidlt> It has section "Easy to buy", but it's still empty.
<davidlt> I don't see anything on Aliexpress either.
<davidlt> I have no idea why it needs Mini PCIe slot.
<davidlt> The SPI chip is 4MB.
<davidlt> There is 2Kbit EEPROM too.
<davidlt> 16GB eMMC
<davidlt> I am more worried that 8GB of RAM is mentioned more often than 16GB.
<sorear> fixed TCM/L2 split? ew
<davidlt> and in some cases it's 4GB.
<davidlt> I don't know what TCM is.
<davidlt> But it's 512K L2 + 512K TCM on cluster 0.
<sorear> tightly-coupled memory, SRAM in the package with fixed-cycle access latency
<davidlt> cluster 1 has 512K L2.
<sorear> and fixed addresses
<sorear> linux won't ever use it but if you're doing bare metal you can put data in the TCM with a linker script
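sorear's bare-metal point can be sketched as a GNU ld linker-script fragment; the TCM base address and region sizes here are made-up placeholders, not the K1's actual memory map:

```
/* Hypothetical memory map; ORIGIN/LENGTH values are placeholders. */
MEMORY
{
  RAM (rwx) : ORIGIN = 0x80000000, LENGTH = 64M
  TCM (rw)  : ORIGIN = 0x08000000, LENGTH = 512K
}
SECTIONS
{
  /* Anything placed in the .tcm_data input section lands in the
     fixed-latency SRAM instead of cached DRAM. */
  .tcm_data : { *(.tcm_data) } > TCM
}
```

In C, a buffer can then be pinned there with `__attribute__((section(".tcm_data")))`.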
<davidlt> Yeah, sounds cool after reading a bit on Google search results
<davidlt> ARM TCM (Tightly-Coupled Memory) handling in Linux
<sorear> u54/u74 have a single pool of memory which can be configurably split into an L2 cache and a L2 TCM (sifive calls it a "loosely coupled memory" to distinguish it from L1 TCM but that's not standard terminology)
<davidlt> Quote: Notice that this is not a MMU table: you actually move the physical location of the TCM around. At the place you put it, it will mask any underlying RAM from the CPU so it is usually wise not to overlap any physical RAM with the TCM.
<davidlt> Quote: To avoid confusion the current Linux implementation will map the TCM 1 to 1 from physical to virtual memory in the location specified by the kernel. Currently Linux will map ITCM to 0xfffe0000 and on, and DTCM to 0xfffe8000 and on, supporting a maximum of 32KiB of ITCM and 32KiB of DTCM.
<sorear> there's no guarantee spacemit will let you configure the TCM physical address at runtime, sifive doesn't
<davidlt> I never knew you could do this, so this is new and cool to me :)
<sorear> andes has a TCM with a fixed *virtual* address that overlays the default mapping address for position-dependent executables, this is blatantly in violation of the privileged architecture but pointing it out doesn't fix anything
<sorear> or rather Renesas does in the RZ/Five; I don't know how much of that is directly caused by Andes TCM limitations
<davidlt> oh
<davidlt> Linux commit: csky: Tightly-Coupled Memory or Sram support
<conchuod> sorear: We use that TCM on polarfire to run the firmware out of.
<conchuod> I think the sifive term is "lim"
<conchuod> because why reuse the standard term
<davidlt> because you cannot market it otherwise :)
<davidlt> I get the impression that 512K of TCM is a lot, from reading about the ARM stuff
<sorear> it's more common for a TCM to be connected at the level of the L1 caches
<sorear> with size and access latency to match
<davidlt> wouldn't that have a bigger impact on core layout?
<davidlt> Especially 512K, that's probably a large area compared to those X60 cores (1.3x perf of A55)
<sorear> yes, which is why they're normally smaller than that
<davidlt> Looking at various material about it, I got the impression this is designed for AI/ML data processing.
<conchuod> davidlt: "it was never announced" but you often seem to know more than you should!
<davidlt> conchuod, I know nothing
<davidlt> Technically it's all public, and other folks found bits online too
<conchuod> hows your sg2042?
<davidlt> Short answer? Annoying.
<davidlt> I keep corrupting NVMe even with my lower Koji load settings.
<davidlt> Like it's fine building GCC and LLVM, but once I go above maxjobs=1 it's a risk.
<davidlt> I am not sure why. I will be doing a new LLVM (18) and GCC 14 builds, maybe it will break this time.
<conchuod> davidlt: Did you see the starfive pci issue?
<davidlt> I might talk with SOPHGO about it, but not sure. Still waiting for a contact, I guess.
<davidlt> conchuod, I did, but I don't think we truly know what the issue is.
<davidlt> I would like to see a proper errata doc with details.
<conchuod> The lads at work were saying to me they shoulda hid that shit in opensbi and never mentioned it on lkm
<conchuod> lkml*
<davidlt> It seems all the boards are broken in one way or another :)
<davidlt> We can start guessing what's broken on SpacemiT K1 :)
<conchuod> It's a custom CPU, so all bets are off.
<davidlt> Or JH8100 :)
<conchuod> I have high hopes for 8100 actually.
<davidlt> Well, they said it was properly tested, but that's kinda it.
<sorear> if the issue is actually "writes issued by devices are reordered" that doesn't sound like an issue opensbi can fix
<conchuod> Aye, but the kernel isn't fixing it either I don't think.
<conchuod> But you do that in your vendor opensbi, noone asks questions and your driver gets merged..
<sorear> _can_ the kernel do anything about it except on a driver by driver basis?
<rwmjones> sorear: so is TCM accessed through the cache hierarchy or is it fast enough to serve directly to the core?
<sorear> rwmjones: on u54 the L1 TCM goes directly to the core, the L2/"LIM" physically goes through the L1 caches but has an uncacheable PMA
<sorear> I don't have whatever davidlt is looking at
<rwmjones> I see
<davidlt> there is no extra details
<davidlt> it's literally written 512K L2 CACHE + 512K TCM
<davidlt> cluster 0
<davidlt> I haven't seen the SoC datasheet or anything like it (yet)
<sorear> love to see people internalize "not talking about safety and correctness problems makes them go away" right as STS-51-L passes out of living memory of the people now entering management positions
<conchuod> sorear: I'm just surprised they didn't try to hide it, I'm not saying that hiding it is what I want them to do.
<davidlt> Just be happy we don't get to see what's broken in Intel and AMD CPUs :)
<conchuod> davidlt: Do you know if the jh7110 a "real" device for them with non sbc customers or are they kinda just tiding themselves over til something more powerful?
<davidlt> conchuod, I don't know, but my guess is that it has no real customer that would cover the development, etc.
<davidlt> I would say the same is with JH8100.
<davidlt> Don't get me wrong, I bet these things still will be sold/used in China.
<conchuod> I wonder how any of these companies actually fund the chips they make, but the answer probably is that they don't.
<davidlt> Government helps with funding, plus push to use local.
<davidlt> ByteDance is now funding StarFive too.
<davidlt> (I think)
<conchuod> Which I guess makes sense, better starfive than give alibaba t-head money
<davidlt> 150 million USD is a good start :)
<davidlt> I think Baidu and Alibaba are competitors in China
<davidlt> There are several large companies (massive ones) in China that could have enough money to cook something custom
<conchuod> yah
<conchuod> I suppose they probably also don't do as much validation as more established places either.
<davidlt> Well it costs tons of money and time
<davidlt> Instead you could move fast, like SpaceX :)
<davidlt> When I worked with Huawei they could spin SoCs very fast.
<davidlt> Like I've never seen such a fast-moving silicon company. It's like every shuttle in the fab had to have a new improved design.
<davidlt> The only sad thing is that you are left with tons of non-production hardware.
cyberpear has quit [Quit: Connection closed for inactivity]
<conchuod> We are omega slow, which I guess makes sense given how much variance we have to try to validate.
<davidlt> Yeah, but you have established market/customers. You don't want to have mistakes.
<conchuod> I mean, it's just a completely different world to the "you must iterate every year" phone SoC companies etc
<davidlt> I recall ARM having multiple teams (3?) for Cortex-A era stuff to deliver a new design every year
<sorear> there's a joke about netburst and merced nearly killing intel because intel was no good at handling pipeline mispredictions
<rwmjones> davidlt: did you see this one? https://github.com/rhboot/shim/pull/420
<rwmjones> it's from canonical and looks fairly sensible to me
<davidlt> rwmjones, yes, but it was blocked by pjones
<rwmjones> blocked as in he actively blocked it, or he just needs to review it & didn't?
<davidlt> rwmjones, IIRC he wanted binutils changes, which all landed in binutils 2.42
<davidlt> rwmjones, quote from him (looking at emails):
<davidlt> This is one of those places where RISC-V seems to want to make every
<davidlt> single mistake ARM made with AArch64, and I have to push back. There's
<davidlt> no way we want to have more arches that are build with "ld -O binary".
<davidlt> Binutils needs to support our binary targets.
<davidlt> All what is needed landed in binutils 2.42.
<davidlt> IIRC shim also depends on gnu-efi, and that needs a rebase
<davidlt> Unless someone starts looking into updating/rebasing/reviewing/etc. riscv64 stuff under "rhboot" I am not planning on shipping GRUB2 (x86_64/aarch64-like bootflow).
<davidlt> systemd-boot for the win, especially as it no longer requires gnu-efi :)
<rwmjones> sure
<davidlt> We could have had the same bootflow a long time ago
<davidlt> Incl. shim
<rwmjones> does systemd-boot use shim?
<davidlt> shim is not strictly needed, so we just don't use it
<davidlt> I disabled it in Pungi compose, or in some places (templates?), but that's annoying
<rwmjones> from RH point of view, some kind of secure boot for RHEL is essential
<davidlt> Yeah, but in that case rebase gnu-efi, rebase GRUB2, and get shim support in.
<davidlt> Canonical sent initial patches 2-3 years ago
<davidlt> I am not willing to look at it on my own, just way too many out-of-tree patches (gnu-efi is the easiest here).
<davidlt> GRUB2 is nonsense. It's like 350 or more patches. I am not looking into that.
<rwmjones> agreed
<rwmjones> I just asked peter jones what's going on and if he'll merge the shim PR
<davidlt> brianredbeard refreshed a PR two weeks ago: https://github.com/rhboot/shim/pull/641
<rwmjones> yeah we're discussing this on a private (grr) thread
<rwmjones> let's see what peter says
<davidlt> rwmjones, I think you still need gnu-efi update
<davidlt> there is one more problem
<davidlt> IIRC shim is hardcoded to boot grub next, thus whatever next stage is it must be renamed to grubefi or whatever
<davidlt> I guess systemd-boot switch does that, but I don't recall.
davidlt has quit [Ping timeout: 264 seconds]