#linux-rockchip on 2025-03-20 — irc logs at libera.irclog.whitequark.org

2023-07-10 07:36 mmind00 changed the topic of #linux-rockchip to: Rockchip development discussion | public log at https://libera.irclog.whitequark.org/linux-rockchip

01:22 raster has quit [Quit: Gettin' stinky!]

01:34 Spirit532 has quit [Killed (NickServ (GHOST command used by Spirit5324))]

01:34 Spirit532 has joined #linux-rockchip

02:18 Daanct12 has joined #linux-rockchip

03:07 hexdump0815 has quit [Ping timeout: 248 seconds]

03:09 hexdump0815 has joined #linux-rockchip

03:10 darfo has quit [Ping timeout: 245 seconds]

03:12 darfo has joined #linux-rockchip

03:53 tlwoerner has quit [Ping timeout: 268 seconds]

03:54 tlwoerner has joined #linux-rockchip

03:57 dsimic has quit [Ping timeout: 265 seconds]

03:58 dsimic has joined #linux-rockchip

04:39 Daanct12 has quit [Quit: WeeChat 4.5.2]

04:48 Daanct12 has joined #linux-rockchip

05:00 System_Error has quit [Remote host closed the connection]

05:06 System_Error has joined #linux-rockchip

05:10 Daanct12 has quit [Ping timeout: 252 seconds]

05:10 Daaanct12 has joined #linux-rockchip

06:55 naoki has joined #linux-rockchip

06:58 <naoki> hmm, I got kernel panic around rk_iommu_of_xlate

06:58 <naoki> recent linux-next

07:02 <Daaanct12> naoki: apply https://lore.kernel.org/linux-iommu/cover.1741886382.git.robin.murphy@arm.com/

07:04 <naoki> Daaanct12: thanks!

07:17 franoosh has joined #linux-rockchip

07:25 naoki has quit [Quit: naoki]

07:25 naoki has joined #linux-rockchip

07:53 ldevulder has joined #linux-rockchip

07:57 franoosh has quit [Remote host closed the connection]

07:59 warpme has joined #linux-rockchip

08:05 <naoki> sre: Are you working on the Radxa ROCK 5B+?

08:07 <Daaanct12> /buffer 53

08:07 <Daaanct12> whoops

08:19 mripard has joined #linux-rockchip

08:22 franoosh has joined #linux-rockchip

09:34 psydroid2 has joined #linux-rockchip

09:43 krei-se has quit [Quit: ZNC 1.9.1 - https://znc.in]

09:54 krei-se has joined #linux-rockchip

09:58 Daaanct12 has quit [Ping timeout: 252 seconds]

10:00 Daanct12 has joined #linux-rockchip

10:28 naoki has quit [Quit: naoki]

10:42 warpme has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

10:56 <montjoie> doing cutdown on arm64_defconfig lead to succesfull boot, so something cutted is the issue

10:57 <qschulz> montjoie: do you have a recent BL31?

10:58 <qschulz> blob from Rockchip, or upstream TF-A?

10:58 <qschulz> ah no, you said 6.6 booted fine, so probably not related to that then

10:59 <qschulz> and a cutdown config helped, why didn't I read more before writing :)

11:06 <montjoie> qschulz: I need to upgrade uboot, it is an old 2023.10-rc4-00039-g252592214f

11:08 <qschulz> montjoie: U-Boot itself doesn't necessarily matter, but BL31 that is in there, yes

11:08 <qschulz> we had random RCU stalls and reboots on an old BL31 from Rockchip

11:09 <qschulz> we use v1.47 now, seems to work fine

11:09 <qschulz> you can also just pick upstream TF-A I assume

11:09 <montjoie> BL31: v2.3():v2.3-589-g3389cfdda:derrick.huang

11:10 * qschulz shrugs

11:10 <qschulz> I am not sure they store the Rockchip version anywhere in the binary blob

11:10 <qschulz> they do for the DDR init blob, not sure for BL31

11:11 <qschulz> you can simply run `strings` on the v1.47 and check if the number after v2.3- is higher than 589

11:11 <montjoie> let see what will do the revert-cutdown, I re-enabled sound/video

11:16 warpme has joined #linux-rockchip

11:23 ldevulder has quit [Remote host closed the connection]

11:23 ldevulder has joined #linux-rockchip

11:26 <qschulz> sre: I have the following message on next-20250319:

11:26 <qschulz> rockchip-pm-domain fd8d8000.power-management:power-controller: Failed to create device link (0x180) with supplier spi2.0 for /power-management@fd8d8000/power-controller/power-domain@12

11:27 <qschulz> it seems like there are some commits attempting to fix similar issues with other drivers, c.f. 74ffe43bad3af3e2a786ca017c205555ba87ebad

11:28 <qschulz> I assume spi2.0 is the internal name for the pmic (CS0 on SPI2)

11:29 <qschulz> on RK3588 Tiger, forgot to mention it

12:40 Daanct12 has quit [Ping timeout: 244 seconds]

12:46 Daanct12 has joined #linux-rockchip

13:01 <montjoie> ok it is kernel size related

13:01 <montjoie> removed some useless pltform, or just wireless give the same result

13:16 <qschulz> montjoie: check your load addresses in U-Boot

13:16 <qschulz> maybe do a checksum after loading it from the storage as well

13:19 warpme has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

13:21 <montjoie> qschulz: checked, no possible overwrite

13:21 <qschulz> montjoie: also doesn't overwrite the FDT?

13:22 <montjoie> yes

13:22 <qschulz> not sure if/how we handled memory holes on RK3588 in that very old U-Boot

13:22 <qschulz> maybe something related to that

13:22 chewitt has joined #linux-rockchip

13:23 <montjoie> kernel 0x0400000 ramdisk 0xa200000 dtb 0xa100000

13:23 <qschulz> not adding the reserved nodes and telling the kernel everything is fine

13:23 <montjoie> and why it fail only at userspace and not while booting ? strange

13:25 <qschulz> the kernel typically uses the top GB in DRAM or something like that

13:25 <qschulz> and we know there aren't holes in there aside from TF-A (first 2MiB)

13:25 <qschulz> or OP-TEE maybe

13:25 warpme has joined #linux-rockchip

13:25 <qschulz> but userspace would try to use anything else, at any location

13:25 <qschulz> random thoughts, nothing to prove that though

13:26 <montjoie> I try to change load address then upgrade uboot just in case

13:26 <qschulz> as to why it is size related, not sure it actually is, could simply be that random addresses allocated by the kernel for userspace are not reproducible (as they should with ASAN or whatever), then it's just "luck"

13:27 <qschulz> maybe if you use the stripped down kernel that doesn't seem to be crashing and load it with IO/allocations in userspace, it may still crash eventually?

13:29 <qschulz> 0xa100000 - 0x0400000 is 157MiB, so not impossible to reach with a full kernel with most drivers built in

13:30 <montjoie> let see if it boots with 0x0300000

13:30 <qschulz> just check the size of the binary you're loading :D

13:31 <qschulz> (could be also relocated, especially if compressed, but I know too little there)

13:34 <montjoie> checked it was 32619008 (1f1ba00 hex)

13:34 <montjoie> so far from 0xa100000

13:35 <qschulz> maybe try a recent U-Boot with a recent BL31 (upstream TF-A even :) ) to see if you can reproduce?

13:35 <qschulz> we have changed some addresses though, so probably not an easy conclusion if it works on recent U-Boot

13:35 <qschulz> (but maybe we don't care if it works on a recent U-Boot :) )

13:39 <montjoie> yeah uboot upgrade is planned, I upgrade all uboot in lava lab

13:39 <montjoie> no more 201x-vendor-shit

13:40 <qschulz> 2023.10 isn't a vendor (at least not from Rockchip :) ) U-Boot

13:40 <montjoie> for this one, yes it is alreacy quite recent

13:40 <qschulz> Rockchip is stuck on 2017.09 AFAIR

13:42 <montjoie> no change with 0x0300000

13:49 raster has joined #linux-rockchip

13:49 darfo_ has joined #linux-rockchip

13:50 darfo has quit [Ping timeout: 252 seconds]

13:58 <Daanct12> can you boot downstream kernel with mainline uboot?

13:58 <Daanct12> at least on the rk3588

13:58 darfo_ has quit [Ping timeout: 252 seconds]

14:01 <qschulz> we do

14:01 <qschulz> what would be the issue?

14:11 darfo has joined #linux-rockchip

14:18 <sre> naoki: yes, I have one of those now and plan to send a series to add support in the next few days after doing more testing to verify everything works as expected and avoid any regressions

14:19 <sre> qschulz: That message is due to the GPU supply regulator. The device link framework sees a circular dependency between the PMIC (which requires SPI, which requires power domains) and the power domain controller.

14:20 warpme has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

14:21 <sre> This message is totally correct of course (there is a circular dependency). That's why the regulator is only resolved once the domain is actually being enabled.

14:22 <qschulz> sre: scary message is scary though

14:22 <sre> The message itself is harmless, though. The device link framework basically gives up trying to optimize the probe order and relies on -EPROBE_DEFER once it happens.

14:23 <sre> Ideally I would like to get rid of it, but it's not trivial.

14:23 <qschulz> if I understood mmind00 properly (in private), we have this because the PMIC checks for the devices using its regulators but find that the PM domain is already probed, and thus give up with that error message?

14:25 <sre> No. Nowadays we have the device link framework, which tries to reorder the driver probe order based on links described in DT to avoid -EPROBE_DEFER as much as possible.

14:26 <sre> Which obviously runs into problems if there is a cycle - i.e. device A needs device B and device B needs device A.

14:26 <sre> To probe the PMIC driver we need SPI and for SPI we need power domains. But the power domain references a regulator from the PMIC.

14:26 <sre> So we get a cyclic dependency

14:28 <mmind00> sre: yep ... but I did track the message down to https://web.git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/drivers/base/core.c#n776

14:28 <mmind00> sre: essentially device-link saying, this is a state-tracking device-link, but the thing I'm waiting to probe, has already probed, so nothing to do ;-)

14:29 <mmind00> sre: which is of course coming from the same cycling thing of course :)

14:29 <mmind00> sre: but as you said, the message is harmless, though scary

14:45 Daanct12 has quit [Quit: WeeChat 4.5.2]

14:48 warpme has joined #linux-rockchip

15:39 warpme has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

15:51 ldevulder has quit [Remote host closed the connection]

15:51 ldevulder has joined #linux-rockchip

15:52 darfo has quit [Ping timeout: 252 seconds]

16:00 darfo has joined #linux-rockchip

16:39 warpme has joined #linux-rockchip

16:54 warpme has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

17:23 ungeskriptet has quit [Remote host closed the connection]

17:48 chewitt has quit [Quit: Zzz..]

17:49 ungeskriptet has joined #linux-rockchip

18:02 raster has quit [Quit: Gettin' stinky!]

18:18 <CounterPillow> harmless though scary sounds like a reason to either reword the message or change its loglevel, but that's for the driver core people to decide. I don't think anything else needs changing or is worth the time investment to change here, the thing it's doing right now is arguably already the best solution to the situation it's encountering.

18:42 ldevulder has quit [Ping timeout: 252 seconds]

19:04 <montjoie> normal to not have ethernet on rock5b ?

19:04 <linkmauve> It works fine on mine, which kernel are you using?

19:05 <montjoie> 6.13.7

19:06 <linkmauve> With the standard arm64 defconfig?

19:06 <linkmauve> Do you have errors in dmesg?

19:07 <montjoie> It worked on 6.10.14

19:07 <linkmauve> If this seems like a regression to you maybe bisect?

19:07 <linkmauve> Or try mainline to check whether it’s already been fixed.

19:08 <sre> I'm not aware of any regression in 6.13.

19:08 <sre> Apart from checking for errors in dmesg I suggest checking lspci as the Rock 5B uses a PCIe based network card.

19:09 <montjoie> it seems pcie is probed at 19s instead of 2s

19:10 <montjoie> sorry it is not that, but pcie do not came

19:14 <montjoie> platform a41000000.pcie: deferred probe pending: platform: supplier regulator-vcc3v3-pcie2x1l2 not ready

19:14 <montjoie> this message is not in 6.10.14 where gmac is ok

19:19 <sre> check if you have CONFIG_REGULATOR_FIXED_VOLTAGE in your kernel config.

19:28 <montjoie> it have, retryed 6.10.14, no eth, perhaps a timing issue, I add some sleep to wait for ethernet

19:32 <montjoie> no change

19:41 ldevulder has joined #linux-rockchip

19:42 <montjoie> perhaps the problem is removing all CONFIG_DRM..., this is the only removal

19:48 <montjoie> lol this is it

19:54 montjoie has quit [Ping timeout: 248 seconds]

19:56 montjoie has joined #linux-rockchip

20:10 ldevulder has quit [Ping timeout: 248 seconds]

20:13 <montjoie> really fun, not having DRM break ethernet

20:27 ldevulder has joined #linux-rockchip

20:27 <montjoie> ouch, mainline uboot of rock5b, do not have ethernet....

20:36 <montjoie> what is this shitty platform

20:37 <CounterPillow> ? works for me

20:38 <CounterPillow> you need to start pcie first

20:38 <CounterPillow> only then does it show up to u-boot

20:42 <CounterPillow> `pci start && pci enum && dhcp && wget 0x0a000000 http://192.168.0.115:8000/rock5.itb && bootm 0x0a000000` is how I automatically boot kernels over HTTP on my ROCK5B, with mainline u-boot. So I am 100% certain that mainline uboot has ethernet on rock5b

20:47 <montjoie> according to config it have

20:47 <montjoie> ok pci enum did the trick thanks

20:48 <montjoie> healcheck passed, now retry full v6.13.7

20:53 <montjoie> still fail

21:27 stikonas has joined #linux-rockchip

21:43 stikonas has quit [Remote host closed the connection]

21:48 psydroid2 has quit [Quit: KVIrc 5.2.6 Quasar http://www.kvirc.net/]

22:09 franoosh has quit [Remote host closed the connection]

23:17 ldevulder has quit [Quit: Leaving]

23:31 naoki has joined #linux-rockchip

23:40 <naoki> sre: then, my 5B+ patch should be discarded and 5T patch should be rebased to your 5B/5B+ patch...

23:42 <naoki> I think I'd be better off giving up on my 5B+/5T and 5C patches and remaking the patch that fixes 5A/5B...

23:43 System_Error has quit [Remote host closed the connection]

23:47 System_Error has joined #linux-rockchip