#u-boot on 2022-03-16 — irc logs at libera.irclog.whitequark.org

2022-02-14 22:05 Tartarus changed the topic of #u-boot to: SOURCE MOVED TO https://source.denx.de/u-boot/u-boot.git / U-Boot v2022.01, v2022.04-rc2 are OUT / Merge Window is CLOSED / Release v2022.04 is scheduled for 4 April 2022 / http://www.denx.de/wiki/U-Boot / Channel archives at https://libera.irclog.whitequark.org/u-boot

00:27 akaWolf has quit [Ping timeout: 272 seconds]

00:32 akaWolf has joined #u-boot

01:10 apritzel has quit [Ping timeout: 250 seconds]

01:11 torez has quit [Quit: torez]

01:22 camus has quit [Quit: camus]

01:28 matthias_bgg has quit [Ping timeout: 256 seconds]

01:32 qschulz has quit [Remote host closed the connection]

01:35 qschulz has joined #u-boot

01:41 matthias_bgg has joined #u-boot

01:42 camus has joined #u-boot

01:52 vagrantc has quit [Quit: leaving]

02:03 mmu_man has quit [Ping timeout: 252 seconds]

02:24 thopiekar has quit [Ping timeout: 240 seconds]

02:25 thopiekar has joined #u-boot

02:56 Gravis has quit [Quit: Murdered]

02:56 Gravis has joined #u-boot

03:13 <sjg1> Tartarus: sorry, spam filter, found it

03:15 <sjg1> rfs613: Patman tries to detect the current project (see detect_project()) or you can use -p to set it

03:16 <sjg1> rfs613: You can have different settings for each. Quite a few people use it for both Linux and U-Boot

03:37 jclsn8 has joined #u-boot

03:38 jclsn has quit [Ping timeout: 240 seconds]

06:09 sbach has quit [Read error: Connection reset by peer]

06:11 sbach has joined #u-boot

06:13 mps has quit [Ping timeout: 272 seconds]

06:57 jclsn8 is now known as jclsn

07:03 camus has quit [Ping timeout: 240 seconds]

07:07 MiNuS_89 has joined #u-boot

07:07 guillaume_g has joined #u-boot

07:19 tre has joined #u-boot

07:25 xypron has quit [Quit: xypron]

07:25 xypron has joined #u-boot

07:28 mps has joined #u-boot

07:29 mckoan|away is now known as mckoan

07:37 frieder has joined #u-boot

07:40 apritzel has joined #u-boot

07:46 monstr has joined #u-boot

07:47 tre has quit [Ping timeout: 252 seconds]

07:59 tre has joined #u-boot

08:00 apritzel has quit [Ping timeout: 240 seconds]

08:02 matthias_bgg has quit [Ping timeout: 240 seconds]

08:07 tnovotny has joined #u-boot

08:37 zibolo has joined #u-boot

08:44 matthias_bgg has joined #u-boot

09:16 camus has joined #u-boot

10:39 apritzel has joined #u-boot

10:42 MiNuS_89 has quit [Ping timeout: 252 seconds]

10:57 lucaceresoli has joined #u-boot

11:16 <milkylainen> can u-boot create gpt tables on raw nand?

11:17 <milkylainen> was gpt ever designed for anything beside regular blockdevices?

11:20 mmu_man has joined #u-boot

11:26 darkapex has quit [Ping timeout: 256 seconds]

11:27 prabhakarlad has joined #u-boot

11:30 darkapex has joined #u-boot

11:42 sughosh has joined #u-boot

11:46 sughosh has quit [Remote host closed the connection]

11:46 <marex> milkylainen: the later

11:46 <marex> milkylainen: use mtdparts of raw nand

11:51 <milkylainen> marex: Yes. But everyone keeps telling me I shouldn't be using gpt for raw nand, but no-one gives me a real technical reason. :) I've heard some, but nothing compelling enough to tell me it's a bad idea. Well... Obviously it's a bad idea for multilevel cells without any FTL of any kind.

11:52 <milkylainen> GPT has backup tables. SLC nand is rather resilient for something not written all the time.

11:52 <milkylainen> mtdparts are logical. Are there any on-media partition formats?

11:53 <marex> milkylainen: you need FTL, so you could stack up "raw nand -> ubi -> ubiblock -> gpt"

11:53 <marex> otherwise you can place the GPT into a bad block or develop a bitflip in the GPT

11:53 <milkylainen> So could your bootloader residing on raw nand?

11:53 <marex> ok, so assuming you detect the bit flip (how?), you use backup table ... backup table location develops a bad block

11:53 <marex> then when ?

11:54 <marex> you need to relocate the backup table somewhere, how ?

11:54 <marex> milkylainen: if your bootloader resides in raw nand, you are in a lot of problems ... it is doable and it sucks

11:55 <marex> milkylainen: the usual approach is to have multiple copies of SPL in the first few erase blocks and that SPL should ideally support UBI and load everything from UBI (U-Boot, Linux, rootfs on ubifs)

11:55 <milkylainen> marex: Yes it sucks. Most would go for an emmc or something with a builtin FTL.

11:55 <marex> but that SPL itself then needs to be maintained ... so when you boot Linux, you need to have a copy of the SPL somewhere in the ubifs and every once in a while, verify that neither copy of SPL got corrupted or developed a bitflip and if it did, rewrite it

11:56 <marex> ubi also needs maintainering , there was some daemon for that in mtd-utils

11:56 <marex> derRichard: hello :-)

11:56 <milkylainen> I thought ubi did scrubbing in a kernel thread?

11:57 <derRichard> milkylainen: no, it does not

11:58 <derRichard> it does scrubbing only if it detects bitflips

11:58 <milkylainen> ok. So only relocation on failures?

11:58 <milkylainen> ah

11:58 <derRichard> but it does not scan the nand for you

11:59 <marex> what was it, ubihealthd ?

11:59 <marex> that was the daemon ?

11:59 <derRichard> yes. ubihealthd is a small daemon that will randomly ask ubi to scrub a peb

12:00 <derRichard> if you let it run over time it will sooner or later scan all the nand for you

12:00 <marex> or you can just dd the whole ubi into /dev/null from cron if there is some known idle time

12:00 <derRichard> the daemon itself is minimal, if you need a more advanced scrubbing plan, use ubi's ioctl interface

12:00 <derRichard> marex: will will not cause ubi to re-read ec and vid headers

12:00 <derRichard> nor internal volumes

12:01 <marex> hah

12:02 <milkylainen> do emmcs do hardware scrubbing in the background or what is the typical emmc behavior? I guess it's hidden, but emmcs don't seem overly complex from the controller perspective?

12:04 <derRichard> milkylainen: some do. but it is all hidden

12:04 <marex> there is a way to enforce that however

12:04 <marex> what was that called ... hold on

12:05 <marex> bkops ?

12:07 <derRichard> yeah, you can trigger background operations on some devices

12:08 <derRichard> and TBH, managed nand (emmc and such) won the war. raw nand is fading out

12:10 <milkylainen> I fully agree. This isn't a question of technology merit but rather a theoretical one. Can an ondisk partition format like GPT live on a raw nand device, SLC or similar.

12:10 <marex> milkylainen: not without FTL, see above

12:12 <milkylainen> Yeah. I must be stupid or something, still don't see why. I don't really see the difference of writing an ondisk format between a SLC nand or a magnetic disk media. Eraseblocks sizes aside, it's just blocks with failure statistics?

12:12 <milkylainen> I know it's a bad idea. :)

12:15 <milkylainen> And yes. I know GPT typically lives on block devices, not raw nand.

12:22 <marex> milkylainen: bitflips are the problem

12:22 <milkylainen> mmm.

12:22 <marex> milkylainen: they develop at random, blocks fail at random and need to be relocated

12:23 <milkylainen> marex: You mean detected on read disturbs?

12:24 <marex> I mean, the bitflips just happen, they just develop in the NAND over time

12:24 <marex> derRichard: can you correct me if I'm wrong ?

12:24 <milkylainen> Yes I know. So do all blocks of all types. Magnetic etc.

12:25 ladis has joined #u-boot

12:25 <marex> milkylainen: not at the rate they do in NAND, not nearly so much

12:26 <milkylainen> But that's read disturb you're talking about. Bad blocks do happen in nand, SLC types are pretty resilient though. I mean, what would be the big difference between an emmc not doing scrubbing if it relocates after the error is detected (depending on how severe the bitflip error is)?

12:27 <milkylainen> Huh. Nintendo Switch seems like GPT on raw NAND?

12:27 <milkylainen> Funny thing. I wouldn't recommend that. But seems like a curious choice.

12:32 <milkylainen> marex: fwiw, thanks for bothering to explain the obvious stuff. :)

12:33 <milkylainen> Just a curiousity thing this... "Why can't I have a ondisk partition format that survives a raw nand? Couldn't GPT?"

12:34 <derRichard> marex: yes, bitflips develop over time

12:35 <marex> milkylainen: leave the NAND in storage for a bit and bitflips just develop because the caps in the NAND array discharge below threshold

12:36 <milkylainen> marex: But that's read disturb (on a previously discharged cel)? And you can't correct for any number of errors with or without ubi?

12:36 <marex> milkylainen: read disturb is you read a cell and another cell develops a bitflip

12:37 <ladis> milkylainen: and that's why you want to run ubihealthd even on read only volumes

12:38 <milkylainen> ladis: Absolutely.

12:38 <milkylainen> Also. The Nintendo Switch seems to be prone to GPT errors on that raw nand. :)

12:39 <milkylainen> So conclusion. There is no technical aspect to stop you from using GPT on a raw nand? It's just a VERY BAD idea?

12:40 <marex> are you sure the tegra does not have some managed nand goo in it ?

12:40 <derRichard> milkylainen: anything on raw nand is a bad idea if you don't manage it

12:41 <marex> why it is a bad idea to put data in raw nand, see above

12:41 <derRichard> same applies for dtb or kernels on raw nand

12:42 <milkylainen> I fully agree. There are very severe corruption issues with storing data on raw nand. But as I said, not merit. Technical. Can you technically use GPT on a raw nand device?

12:42 <milkylainen> I was thinking at first that GPT can't cope with the "strange" block sizes of nand etc.

12:42 <derRichard> you can use it on raw nand if you have a copy and a mechanism to recover

12:43 <derRichard> just like the boot block on modern socs wenn you boot from nand. there you have also many copies...

12:44 <milkylainen> marex: Hmm. Dunno, but looks like people are having issues with corrupt gpt. I guess if it was managed in a decent fashion there would be less corruption?

12:46 <marex> maybe, fix it and send them a patch

12:56 <milkylainen> :)

13:04 cyrozap-ZNC has quit [Quit: Client quit]

13:04 cyrozap has joined #u-boot

13:21 <Tartarus> sjg1: kaki is full

13:56 MiNuS_89 has joined #u-boot

14:07 <marex> MiNuS_89: hi, that ar93xx stuff you do, is that something on top of mainline u-boot ?

14:07 <marex> (its good to see it even works after all this time)

14:08 <marex> hthiery: re mx8mq clock driver, no ... I am so effing overloaded ...

14:08 <marex> hthiery: I just hope NXP can check the tables and then we should merge it for v2022.07

14:10 <hthiery> marex: ah ok ... I hope I can get the uart working in my case

14:10 <marex> hthiery: btw /wrt that MALLOC_F stuff ... I ran into it with mx8mp

14:10 <marex> hthiery: that's why it was my first suggestion

14:11 <marex> commenting out part of the clock tables suddenly made it work

14:11 <marex> hthiery: which UART ?

14:11 <marex> hthiery: are you also working on that ... phone ... or some other mx8mq board ?

14:11 <marex> the uh .... ah ... librem5 phone

14:13 <hthiery> marex: hmmm ... my initail value was 0x2000 .. and now I have to increase it to 0x10000

14:14 <marex> hthiery: because there is like a ton of little driver structures for each single clock which just adds up

14:14 <marex> hthiery: it sucks, big time

14:14 <hthiery> marex: I use the kontron-pitx-imx8m board also with an imx8mq

14:14 <marex> hthiery: ah, you're working with frieder then ?

14:15 <marex> hthiery: we likely need some way to flag only the clock which should be probed early on and ignore the rest , maybe with u-boot,dm-spl in DT somehow

14:15 <marex> and dm-pre-reloc

14:16 <MiNuS_89> marex: I sent a mail to the general mailinglist today to see if I can have few stuff to dig at to try to solve my board. And yes I do that on top of the mainline u-boot

14:17 <MiNuS_89> ArcherC7V5 # version

14:17 <MiNuS_89> U-Boot 2022.04-rc4 (Mar 16 2022 - 14:43:01 +0100)

14:18 <hthiery> marex: in the broadest sense yes ...different locations

14:19 <marex> hthiery: ah cool

14:20 torez has joined #u-boot

14:20 <marex> MiNuS_89: do you have JTAG access to that board ?

14:20 <marex> MiNuS_89: as for the spew ... could it be you have both earlycon/earlyprintk enabled and when regular console kicks in, those two both write into the UART IP ?

14:21 <MiNuS_89> marex: yes I already failed a couple of builds trying to play around with registers ;-)

14:27 <marex> MiNuS_89: well what happens if you disable PCIe support in the kernel, does the system finish booting ?

14:27 <marex> maybe PCIe access screws something up

14:28 <MiNuS_89> This kernel boot with the device original U-Boot (1.1.4)

14:31 <marex> could be the kernel depends on some odd clock settings done by the original bootloader

14:31 <marex> disable the PCIe support, see if it boots with new u-boot then, if so, then your problem is isolated to PCIe

14:31 <marex> then check the clock IP and PCIe IP for any writes from the original u-boot, there might be something in start.S or arch/mips

14:31 <marex> likely some undocumented or poorly documented register

14:33 MiNuS_89 has quit [Ping timeout: 250 seconds]

14:34 MiNuS_89 has joined #u-boot

14:36 <MiNuS_89> marex: BTW I have access to the flash via a CH341A (direct Flash access as there is no JTAG on this board). I also have serial access on the board (a port is available)

14:37 <marex> yikes

14:37 <marex> pity they removed the jtag, it used to be present in those routers

14:37 <MiNuS_89> yes until v4

14:38 <MiNuS_89> but it's fine with a direct flash access also. You don't need to remove the chip each time. Hot flash is doable so it's fairly simple

14:39 <marex> MiNuS_89: with jtag you can start the u-boot without even rewriting the flash, which takes a bit of time to complete

14:39 <MiNuS_89> true

14:52 lucaceresoli_ has joined #u-boot

14:53 lucaceresoli has quit [Quit: Leaving]

14:53 lucaceresoli_ has quit [Client Quit]

14:54 lucaceresoli has joined #u-boot

14:54 tre has quit [Remote host closed the connection]

16:00 vagrantc has joined #u-boot

16:10 mmu_man has quit [Ping timeout: 240 seconds]

16:16 _whitelogger has joined #u-boot

16:21 mmu_man has joined #u-boot

16:43 tnovotny has quit [Quit: Leaving]

17:00 matthias_bgg has quit [Ping timeout: 256 seconds]

17:18 mmu_man has quit [Ping timeout: 256 seconds]

17:19 mckoan is now known as mckoan|away

18:17 frieder has quit [Remote host closed the connection]

18:19 redbrain has quit [Read error: Connection reset by peer]

18:25 redbrain has joined #u-boot

18:38 <rfs613> sjg1: thanks for the reply regarding patman. Am currently trying it in a repo that is neither linux nor u-boot. Got it working, though I had to add a phony alias, and also --no-check as there is no checkpatch.pl in this project.

18:46 kabel has joined #u-boot

18:47 <kabel> so I am pushing to a PR on github, but the system no longer starts CI tests on that PR. Anybody knowns the reason why?

19:09 <marex> kabel: travis is no longer used

19:09 <marex> I suppose that's what you are trying to trigger, right ?

19:12 <kabel> marex: no longer used as of 2 hours ago?

19:12 <kabel> bnecause 2 hours ago it worked. I am talking about azure builds

19:12 <marex> Tartarus: ^

19:13 <Tartarus> So, I guess the CI doc needs another update then

19:14 sobkas has joined #u-boot

19:14 <Tartarus> Checking, https://dev.azure.com/u-boot/u-boot/_build/results?buildId=3842&view=results is public

19:14 <Tartarus> And yes, it hasn't started since we're free tier

19:14 <Tartarus> So, we're at the back of the queue and should get run again someday

19:17 <marex> Tartarus: m$ loves linux but not u-boot :(

19:34 <tg-bridge-bot-ub> <Clamor_S> Was fastboot_raw_partition_ descriptor tested? It doesn't work for me.

19:42 <kabel> :(

19:43 <kabel> but .... my patches ...

19:44 <Forty-Bot> rfs613: generally I use `--ignore-bad-tags --ignore-errors --no-maintainers` (aka -tim) when invoking patman

19:44 <Forty-Bot> I think you can stick that in ~/.patman, but they are named differently and I didn't get it working

19:48 Jacmet has joined #u-boot

19:58 <Tartarus> kabel: yeah, it'll run, it just takes a while sometimes

20:02 mmu_man has joined #u-boot

20:03 <rfs613> Forty-Bot: thanks, that does seem to work, it still warns about alias not found, but carries on. Though it seems like a rather big hammer.

20:03 <Forty-Bot> well, I've found tags are 100% useless

20:03 <Forty-Bot> same with maintainers, since get_maintainers will pick up people who changed 1 line in the code

20:04 <Forty-Bot> the ignore errors is because sometimes there are checkpatch "errors" which are the correct way to do things

20:04 <rfs613> i'm running it without checkpatch and without get_maintainers, so I expected some turbulence ;-)

20:05 <marex> Forty-Bot: I usually look up the last few people who worked on that code and CC those + subsystem maintainer

20:06 <Forty-Bot> that's what I do as well

20:06 <rfs613> another small nit is Series-prefix, seems to always get expanded with a space following it.

20:06 <rfs613> i want to prefix with another [tag] so the end result is [tag][ PATCH]

20:07 <Forty-Bot> did you try format.subjectprefix

20:07 <rfs613> i did not ;-)

20:08 <rfs613> oh gosh, it's right there in the README, how did I miss that! :P

20:08 <marex> git send-email --subject-prefix="tag][PATCH"

20:10 <rfs613> marex: yeah that is how I have been doing it (before patman).

20:21 ladis has quit [Quit: Leaving]

20:24 lucaceresoli has quit [Quit: Leaving]

20:26 MiNuS_89 has quit [Ping timeout: 240 seconds]

20:49 torez has quit [Remote host closed the connection]

20:58 monstr has quit [Remote host closed the connection]

21:19 <Tartarus> kabel: OK, so looking at the CI doc again, the other option we document is that individuals can setup their own freebie azure tier and run the pipeline. That won't be limited by what we're otherwise doing (other users, pushes to master/next/testing my own branches)

21:23 <kabel> Tartarus: thanks, I will try to setup my own tier

22:37 MiNuS_89 has joined #u-boot

22:40 <MiNuS_89> marex: After comparing the boot log of the same kernel with the original boot loader and the new one I found that difference. The 6 lines below are missing with the new bootloader.

22:40 <MiNuS_89> [ 0.000000] On node 0 totalpages: 32768

22:40 <MiNuS_89> [ 0.000000] Normal zone: 288 pages used for memmap

22:40 <MiNuS_89> [ 0.000000] Normal zone: 0 pages reserved

22:40 <MiNuS_89> [ 0.000000] Normal zone: 32768 pages, LIFO batch:7

22:40 <MiNuS_89> [ 0.000000] pcpu-alloc: s0 r0 d32768 u32768 alloc=1*32768

22:40 <MiNuS_89> [ 0.000000] pcpu-alloc: [0] 0

22:44 <MiNuS_89> I can't stay connected tonight but will look at the irc logs if you or someone else reply. I may miss something on memory init that generate the crash i get later.

23:20 sakman has quit [Remote host closed the connection]

23:21 sakman has joined #u-boot

23:51 <marex> try without the PCIe enabled first

23:53 <MiNuS_89> will generate a new kernel tomorrow

23:54 MiNuS_89 has quit [Remote host closed the connection]